Job Description:
Location: Remote
Duration: Full-Time
Data Engineer (Hadoop, Kafka, AWS) - Lead Data Engineer
Please submit candidates along with their available time slots for the next 2 days.
The Lead Data Engineer will be responsible for designing, developing, integrating, and maintaining enterprise-level Big Data systems with both batch and streaming datasets. The candidate should have a very strong understanding of MPP databases, shared-nothing and shared-disk architectures, and the modern Big Data tech stack, including Spark, Hadoop, AWS Redshift, Confluent Kafka, AWS Kinesis, and other streaming technologies. This individual is expected to design Big Data systems based on industry best practices and architectural guidance, with an understanding of integration with other data sources and tools such as Markit, Business Objects, Informatica, MS SQL, PL/SQL, etc. The role also includes enhancing, maintaining, and supporting existing applications, and collaborating with other developers on design.
Must-Have Skills:
  • Partner with business leadership to identify problems and opportunities for technology innovation, with a focus on Big Data implementations.
  • Help establish a clear, consistent technology vision through collaboration, influence, and enablement.
  • Research, recommend, design, and develop Big Data systems, with a sound understanding of Big Data application architecture and integration.
  • Identify and assess the organizational impact of enterprise architecture and standards, including changes in skills, processes, and structures, with an emphasis on the data warehouse.
  • Should have developed ETL pipelines using Python and Spark, or PySpark
  • Extensive experience in SQL query tuning
  • Education: candidate must hold a Computer Science degree or diploma
  • Experience: 4+ years of total experience, including 1+ years with Databricks and PySpark
  • Creating robust and extensible data pipelines for production systems
  • Creating secure, performant, and well-modeled data stores
  • Must be fluent in at least one programming language, such as Python or Java
  • Experience working in an onsite client technical consulting environment preferred.
  • Source code version control using tools such as Git/GitHub
  • Experience working within Agile frameworks, such as Scrum or Kanban
  • Excellent communication skills to be able to interact directly with non-technical client stakeholders and act in a business-to-technical translation role.

Preferred Skills:
  • Knowledge and use of DataOps practices preferred
  • Experience using NoSQL databases (e.g., AWS DynamoDB, MongoDB, Neo4j)
  • Knowledge of orchestration tools such as Airflow
