Job Description :

Role : Lead Data Engineer
Location : Scottsdale AZ (100% onsite)
Hire Type : FTE and CTH

Must have skill set: Spark, S3, Glue, AWS Redshift , python and stream set exp

6-8 years of IT experience focusing on enterprise data architecture and management.
Experience in Conceptual/Logical/Physical Data Modelling & expertise in Relational and Dimensional Data Modelling
Experience with Databricks & on Prem , Structured Streaming, Delta Lake concepts, and Delta Live Tables required
Experience with Spark scala
Data Lake concepts such as time travel and schema evolution and optimization
Structured Streaming and Delta Live Tables with Databricks a bonus
Experience leading and architecting enterprise-wide initiatives specifically system integration, data migration, transformation, data warehouse build, data mart build, and data lakes implementation / support
Advanced level understanding of streaming data pipelines and how they differ from batch systems
Formalize concepts of how to handle late data, defining windows, and data freshness
Advanced understanding of ETL and ELT and ETL/ELT tools such as Data Migration Service etc
Understanding of concepts and implementation strategies for different incremental data loads such as tumbling window, sliding window, high watermark, etc.
Familiarity and/or expertise with Great Expectations or other data quality/data validation frameworks a bonus
Familiarity with concepts such as late data, defining windows, and how window definitions impact data freshness
Advanced level SQL experience (Joins, Aggregation, Windowing functions, Common Table Expressions, RDBMS schema design performance optimization)
Indexing and partitioning strategy experience
Debug, troubleshoot, design and implement solutions to complex technical issues
Experience with large-scale, high-performance enterprise big data application deployment and solution
Architecture experience in AWS environment a bonus
Familiarity working with Lambda specifically with how to push and pull data, how to use AWS tools to view data for processing massive data at scale a bonus
Experience with Gitlabs and CloudWatch and ability to write and maintain gitlabs for supporting CI/CD pipelines
Experience working with AWS Lambdas for configuration and optimization and experience with S3
Familiarity with Schema Registry, message formats such as Avro, ORC, etc.
Ability to thrive in a team-based environment
Experience briefing the benefits and constraints of technology solutions to technology partners, stakeholders, team members, and senior level of management

             

Similar Jobs you may be interested in ..