Working Knowledge of Python, JSON, Scala, Pyspark UI designing knowledge in Jquery, html, css, bootstrap, angular, python, flask, django, postgresql, nginx, gunicorn Build data ingestion and data processing pipelines to move the data from S3 into Amazon Redshift, Pandas. Knowledge of creation of EMR cluster if required Knowledge to use to python programs to integrate with Spark and utilize AWS Cloud tools such as Data Pipeline etc. Knowledge of JSON programs to utilize EMR cluster to use the metadata to write pipelines Working knowledge on converting orc to parquet |