Job Description :
Role : Data processing with Python or Scala-based Spark

Location : Carlsbad ,CA

Job Description :
Strong Experience in developing Python or Scala-based Spark Data processing pipeline for large data set of data integration in AWS Cloud environment
Strong Experience with AWS Glue Catalog, ETL and Athena for Data Analysis
Experience with Databricks-based Spark data ingestion and data stream is a Plus
Experience in deploying data services in AWS using lambda functions through Spark and EMR
Experience in building high-performance data queries from relational and as well non-relational (NoSQL) data sources: RedShift, MongoDB, Aurora,
Experience in tuning performance of data services
Experience with Kafka and Kinesis
Experience with continuous delivery and associated tooling (Ansible, Jenkins, Terraform
Experience with micro service or event driven architectures
Experience with Docker, Linux and shell scripting
Excellent Communication and Interpersonal skills
             

Similar Jobs you may be interested in ..