Lead Data Engineer 22-91789

Santa Clara, CA Santa Clara CA 95054

Date : Sep-24-22

Santa Clara, CA

Sep-24-22

Work Authorization

US Citizen
GC
H1B
EAD (OPT/CPT/GC/H4)

Preferred Employment

Corp-Corp
W2-Permanent
W2-Contract
1099-Contract
Contract to Hire

Job Details

Experience

Architect

Rate/Salary ($)

Market

Duration

Full Time

Sp. Area

Data Warehousing/ETL

Sp. Skills

Data Engineer

Consulting / Contract

Required Skills :

hadoop, kafka, pyspark, spark, aws, python

Preferred Skills :

Domain :

IT/Software

Work Authorization

US Citizen
GC
EAD (OPT/CPT/GC/H4)
H1B

Preferred Employment

Corp-Corp
W2-Permanent
W2-Contract
1099-Contract
Contract to Hire

Job Details

Experience

Architect

Rate/Salary ($)

Market

Duration

Full Time

Sp. Area

Data Warehousing/ETL

Sp. Skills

Data Engineer

Consulting / Contract

Required Skills :

hadoop, kafka, pyspark, spark, aws, python

Preferred Skills :

Domain : IT/Software

tanishasystems
Boston, MA
Post Resume to
View Contact Details &
Apply for Job

Job Description :

Location : (Remote role)
Duration : Full-Time
INCEDO Client
Data Engineer (Hadoop Kafka, AWS) - Lead Data Engineer
Submit me candidates with next 2 days time slot
Will be responsible to design, develop, integrate, and maintain Enterprise level Big Data Systems with both batch and streaming datasets. Should have a very strong understanding of MPP databases, Shared Nothing, Shared disk, and other Modern Big Data tech stack like Spark, Hadoop, AWS Redshift, Confluent Kafka, AWS Kinesis, and other Streaming technologies. This individual is expected to design Big Data systems based on Industry best practices and architectural guidance of Big Data systems, with understanding of integration with other data sources and tools like Markit, Business objects, Informatica, MS SQL, PL/SQL, etc. Enhance/Maintain/support existing applications. Collaborate with other developers in designing.
Must Skill:

Partner with business leadership to identify problems, and opportunities for technology innovation with a focus on Big Data implementations.
Help establish a clear, consistent technology vision through collaboration, influence, and enablement.
Research, recommend, design and develop Big Data systems and with a sound understanding of the Big Data application architecture and Integration.
Identify and assess the organizational impact of enterprise architecture and standards, including change in skills, processes and structures with an emphasis on the DataWarehouse.
Should have developed ETL pipeline using Python & Spark or PySpark

Extensive experience in SQL query tuning
In educational qualification candidate must have someone with Computer Science degree / diplomas
In experience Focus on 4+ years of total experience with 1+ years of experience in Databricks and PySpark
Creating robust and extensible data pipelines for production systems
Creating secure, performant, and well-modeled data stores
Must be fluent in any one of the scripting languages such as Python/Java
Experience working in an onsite client technical consulting environment preferred.
Source code version control management using tools like Git/GitHub
Experiences working within Agile Frameworks, such as Scrum or Kanban
Excellent communication skills to be able to interact directly with non-technical client stakeholders and act in a business-to-technical translation role.