Job Description :
Hadoop Developer
Plano, TX

Need Passport No.

Competency: Digital : BigData and Hadoop Ecosystems, Digital : Python

Role:

Handled importing of data from various data sources, performed data control checks using Spark and loaded data into
HDFS.

Used Spark for interactive queries, processing of streaming data and integration with popular NoSQL database for huge
volume of data.

Creating end to end Spark-Solr applications using Scala to perform various data cleansing, validation, transformation and
summarization activities according to the requirement.

Developed Scala scripts using both Data frames/SQL/Data sets and RDD/MapReduce in Spark for Data Aggregation,
queries and writing data back into OLTP system through Sqoop.

Experienced in handling large datasets using Partitions, Spark in Memory capabilities, Broadcasts in Spark, Effective &
efficient Joins, Transformations and other during ingestion process itself.
             

Similar Jobs you may be interested in ..