Job Description :
Role: Big Data Engineer

Location: Long Beach, CA

Duration: Contract



Mandatory Skills: Hive, Apache Spark and Python



Job description:



Java background a plus
HBASE (MO SQL) – Nice to have
Position Summary (Big Data Engineer)
We are looking for a Big Data Engineer who will work on collecting, storing, processing, and analyzing large sets of data. The primary focus will be on choosing optimal solutions to use for these purposes, then maintaining, implementing, and monitoring them. He/she will also be responsible to follow architecture and best practices used across the enterprise. Experience working in Healthcare industry is preferred.

Responsibilities:
- Define ideal Architecture, Evaluating tools and Frameworks, Standards & Best Practices for implementing scalable business solutions
- Implement Batch and Real-time data ingestion/extraction processes through ETL, Streaming, API, etc., between diverse source and target systems with structured and unstructured datasets
- Design and build data solutions with an emphasis on performance, scalability, and high-reliability
- Code, test, and document new or modified data systems to create robust and scalable applications for data analytics
· Build data model for analytics and application layers
- Working closely with multiple teams and Business partners, for collecting requirement and providing optimal solution

Required Knowledge and Skills:
· Proven experience on Hadoop cluster components and services (like HDFS, YARN, ZOOKEEPER, AMBARI/CLOUDERA MANAGER, SENTRY/RANGER, KERBEROS, etc
- Ability to participate in troubleshooting technical issues while engaged with infrastructure and vendor support teams
- Experience in building stream-processing systems, using solutions such as Kafka, Storm or Spark-Streaming
- Proven experience on Big Data tools such as, Spark, Hive, Impala, Polybase, Phoenix, Presto, Kylin, etc.
- Experience with integration of data from multiple data sources (using ETL tool such, Talend, etc
- Experience building solutions with NoSQL databases, such as HBase, Memsql
· Strong experience on Database technologies, Data Warehouse, Data Validation & Certification, Data Quality, Metadata Management and Data Governance
- Experience with programming language such as, Java/Scala/Python, etc.
· Experience implementing Web application and Web Services APIs (REST/SOAP)