Job Description :
Experience with MapR distribution 5.x is mandatory
Experience with Linux (RHEL or others) administration is mandatory
MapR Certified Cluster Administrator - Mandatory
Expertise in building Data Hub or Data Lake using Hadoop and AWS technologies and taking this to production (PoC level experience is not acceptable)
Experience with Impala and other SQL engines on Hadoop in production (PoC level is not acceptable)
Experience of integrating Impala with Active Directory and Enterprise Security
Experience of working with AWS Redshift and deep knowledge of its architecture
Expertise in using Hive, Spark
Strong experience with Python and Linux shell scripting (basic command level knowledge at PoC level is not acceptable)
Experience with ETL & Data Ingestion tools like Informatica, Kafka, Sqoop etc. (both configuration in an enterprise secure environment and data load)
Exposure to AWS Services like S3, EC2, Redshift, EMR, Lambda, Data Pipeline
Performance tuning
Excellent communication & presentation skills
Ability to work with offshore team as needed
Handling different file formats in Hadoop Ecosystem
Data management and DWH background and experience preferred
Experience of working in Pharma domain using any Hadoop Distribution (R&D, Sales & Marketing, Supply Chain)

Client : xxx