Role : Hadoop Admin / Developer

Location : RTP, NC

Duration : 9+ Months

Interview : Phone then Face to Face

Position is 75% Hadoop Administration and 25% Hadoop Development.
Requires experience with Big Data, Hadoop, Spark, and RDBMS. Must be local
to NC or surrounding states and available to interview onsite with a 2 day

We have a long-term renewable position for a Hadoop
Developer/Hadoop Administrator with a client of ours in RTP, NC.
Strong experience with Big Data and Spark is required. We are
seeking an individual able to design and develop scalable systems that
combine high performance computing techniques, big data techniques and
distributed computing using in-house or cloud services.
The individual will need to have strong UNIX skills, strong RDBMS
skills and strong big data skills (esp. Spark).
Machine Learning will also be used in select projects.
These skills will be put to use in a collaborative environment
where you will work with architects, domain experts, business analysts, and

The best candidate will have professional level skills and
experience (5+ years) with:
- Hadoop, HDFS, Spark, Hive
- *RDBMS as a developer
- *Python, scripting languages (Java, C/C++ a plus)
- Parsing text files (regex)
Experience with fault tolerant design approaches
Can download, configure, compile, install, test and use open source and
big data software
Service oriented and fully object oriented development skills (can use
OO frameworks and has performed refactoring of existing designs)
Demonstrated understanding of schema-on-read
Ability to consult and/or be hands-on with Hadoop administration
Able to create ad hoc web reports, CSV or other common report
Can effectively translate business requirements into application
Able to communicate effectively in person and through written or
presentation modes
Understanding of, or experience working with, genomic research data is a
plus but not required
The ability to set up and use cloud resources is a plus but not
required

Job Responsibilities:

Participate in or lead the design of large scale analytical systems
Participate in or carry out the development of large scale systems
Spend up to 25% of their time doing administration and maintenance
activities during critical data gathering seasons
Establish effective working patterns with our UNIX Administration
Spend as much time as possible learning the data types of our
science domain