Job Description :
Architect, design and automate large scale data ingestion containing Imaging, Digital, EMR/EHR and Omics from different sources around the globe.
? Design and lead the implementation of data ingestion pipelines from multiple internal and external sources, landing zone, data curation, metadata tagging and data loads in public Cloud’s e.g. AWS and GCP
? Highlights short-term trade-offs vs long-term commitments and where those are worth implementing or not. Ensures that the Solutions are scalable (technology), efficient (process), effective (cost), and supportable
? Have a broad knowledge and leverage the technical capabilities of the internal teams and external technology providers and vendors.


Qualifications:
? Bachelor’s degree in Biology, Computer Science, Mathematics, Electrical Engineering, Information Systems or related field; Master in Mathematics, Science, or Computer Science preferred
? Overall 10+ years experience in data management, solutions design and/or architecture, of which at least 5-10 years as a Solution Architect with hand-on experience.
? Strong Experience in designing and implementing high available and highly scalable big data systems; 3+ years of experience in leading implementation of high scalable data management systems required
? Experience in designing, implementing Clinical Decision Support Systems, Digital Health/Sensor Applications, Patient Engagement Platform is preferred
? Experience in database, application development, RESTful APIs, Agile development, Full-stack web development, creating and maintaining CI/CD pipelines
? Hand-on experience with common JavaScript libraries, data visualization and data integration
? Hand-on experience in Architecting and designing Data Lake in AWS, with potential to grow across different cloud providers i.e. GCP and Azure
? Hands on experience widata management on tools like AWS Redshift and Teradata
? Deep hands-on experience with ingestion and processing of large Genomic, Digital, EMR, EHR, Omics, Imaging data sets coming from different data sources
? Experience with the Hadoop ecosystem (Map Reduce, Spark, Oozie, Impala, HBase, etc and big data ecosystems (Kafka, Cassandra, etc with experience in atleast one of the SQL language.
? Hands-on experience in designing and building data solutions in AWS, e.g. S3, EC2, Aurora, Glue, Lambda.

Mandatory Skills

Python, R Programming

Client : Panacea Direct Inc