Job Description :
BIG DATA ENGINEER

LOCATION: RESTON, VA



JOB DESCRIPTION / BACKGROUND:



The Lead Big Data Engineer independently designs and builds Big Data solutions leveraging Cloudera Big Data technology stack. With minimal supervision, performs Development activities, technical documentation, system performance support, and internal customer support. Takes complete technical ownership of a given project and provides guidance and support to other team members. Works with Solutions Architects, Big Data Administrators and other Big Data and BI team members.



TASKS:



The incumbent's accountabilities include, but are not limited to, the

following:



* 70% System Design and Implementation: Designs and Builds Big Data solutions to meet business requirements. Assumes complete ownership ofDelivery from Data Engineering stand point in given project(s)

* 15% Leadership: Assists other Developers in resolving complex issues.Performs code reviews and ensures that standards are followed consistently across projects

* 15% Procedural: Creates and maintains Coding standards, Design Documents and, Production Run books. Recommends and implements new technologies to benefit the business



REQUIRED QUALIFICATIONS:



* Advanced Level experience (7+ years ) with Java , Python/Scala programming languages

* Advanced level Experience (3+ years ) building Real Time streaming systems, using Flume, Kafka and Apache Spark streaming

* Experience tuning Hadoop/Spark parameters to for optimal performance

* Advanced level experience with at least one NoSQL stores (Hbase,Cassandra, MongoDB etc

* Experience with Big Data querying tools including Impala

* Advanced experience with SQL and at least one major RDBMS (Oracle,DB2 etc

* Advanced experience with Shell scripting

* Rigor in high code quality, automated testing, and other engineering best practices, ability to write reusable code components



PREFERRED QUALIFICATIONS:



* BS/MS in Mathematics, Engineering, or Computer Science

* Working knowledge of U.S. Healthcare Industry

* Experience working with large data environments - petabytes or hundreds/thousands of terabytes

* Cloudera Search experience will be a plus