Job Description :
Senior Data Engineer (Java, Python, Hadoop)
Sunnyvale, CA
3-6 Months Contract


Responsibilities
Design & Build data pipelines for handling real-time data streams and integrating with machine learning models.
Understand and analyze large, complex data sets to identify anomalies and data quality issues.
Design and Build Data quality and anomaly detection framework for IoT time series data.
Build data pipeline with Apache Spark in Java and Python and Google Cloud Platform Data Proc.
Work with cross functional stakeholders to assist with their data needs via optimal & reusable data pipelines.


Qualifications
We are looking for a candidate with 7+ years of experience in a Data Engineer role, with Bachelor or Master’s degree in computer science or related field.
Highly Proficient in Java programming language with Python programming skills as a huge plus.
3+ years of hands on experience in building and optimizing big data pipelines using Apache Spark in Java and Python programming languages.
2+ years of hands on experience of building & integrating data pipelines with machine learning models.
1+ years of experience with Google Cloud Platform esp. Google Pub-Sub, Big Query, Data Proc, Data Flow, Cloud Storage.
Highly desirable to have experience on data quality analysis and detection.
Experience with big data tools: Kafka, Cassandra, Hadoop, Storm and Spark