Fulltime Bigdata Developer in San Francisco, CA

San Francisco, CA San Francisco CA 94188

Date : May-29-19

San Francisco, CA

May-29-19

Work Authorization

US Citizen
GC
H1B
OPT EAD, GC EAD, TN EAD

Preferred Employment

Corp-Corp
W2-Permanent
W2-Contract
1099-Contract
Contract to Hire

Job Details

Experience

Architect, Senior, Midlevel

Rate/Salary ($)

Market

Duration

Sp. Area

BigData, NoSQL

Sp. Skills

Hadoop

Permanent Direct Hire

Direct Client Requirement

Required Skills :

Hadoop, Big Data, SQL, SAN, Agile, Business Intelligence, Real time

Preferred Skills :

Domain :

Work Authorization

US Citizen
GC
OPT EAD, GC EAD, TN EAD
H1B

Preferred Employment

Corp-Corp
W2-Permanent
W2-Contract
1099-Contract
Contract to Hire

Job Details

Experience

Architect, Senior, Midlevel

Rate/Salary ($)

Market

Duration

Sp. Area

BigData, NoSQL

Sp. Skills

Hadoop

Permanent Direct Hire

Direct Client Requirement

Required Skills :

Hadoop, Big Data, SQL, SAN, Agile, Business Intelligence, Real time

Preferred Skills :

Domain :

Nityo Infotech Corporation
Pittsburgh, PA
Post Resume to
View Contact Details &
Apply for Job

Job Description :

Role: Bigdata Developer
Location: San Francisco, CA/ McLean VA/ Tysons, VA

Fulltime

Job Details:
Must have skills*
Pyspark, Kafka, Nifi, Sqoop, AWS, Python, Hive, Hadoop, SQL, Pig, Python, Spark, SQL, Pyspark, Kafka, Nifi, Sqoop, AWS
Secondary Skills
NoSQL, Statistical Models, Financial Models, Machine Learning

Key responsibilities*
Responsible for delivery in the areas of: big data engineering with Hadoop, Python and Spark (PySpark & Nifi) and a high level understanding of machine learning
Develop scalable and reliable data solutions to move data across systems from multiple sources in real time (Nifi, Kafka) as well as batch modes (Sqoop)
Construct data staging layers and fast real-time systems to feed BI applications and machine learning algorithms
Utilize expertise in technologies and tools, such as Python, Hadoop, Spark, AWS, as well as other cutting-edge tools and applications for Big Data
Demonstrated ability to quickly learn new tools and paradigms to deploy cutting edge solutions.
Develop both deployment architecture and scripts for automated system deployment in AWS
Create large scale deployments using newly researched methodologies.
Work in Agile environment
Strong SQL skills to process large sets of data
Data Engineering and Research:
Develop methods to Cleanse, manipulate and analyze large datasets (Structured and Unstructured data – XMLs, JSONs, PDFs) using Hadoop platform.
Lead and Design to implement common data ingestion processes and analytical tools to proactively identify, quantify and monitor risks.
Design and Lead Extract, transform and summarize information from large data sets to inform management decisions using Hadoop platform.
Develop techniques from statistics and machine learning to build data quality controls for predictive models on numeric, categorical, textual, geographic, and other features.
Develop and maintain data ingestion pattern using Python, Spark, HIVE scripts to filter/map/aggregate data.
Lead and work with the Credit Risk Analytical and Model team members to ensure successful implementation of models in valuation tools used by Single Family business model managers and other model domain users.
Analysis and Modeling:
Develop and Perform R&D and exploratory analysis using statistical techniques and machine learning clustering methods to understand data.
Develop data profiling, deduping logic, matching logic for analysis.
Present ideas and recommendations on Hadoop and other technologies best use to management.

Keywords
Python, Spark, Pyspark, Kafka, Nifi, Sqoop, AWS

NEERAJ KUSHWAHA
Desk No Ext-2260