Job Description
This role focuses on designing, optimizing, and implementing data models as the organization transitions its enterprise data to the Databricks Lakehouse platform. The ideal candidate will have strong data modeling expertise, healthcare domain knowledge, and experience with modern cloud-based data platforms.
Key Responsibilities
- Collaborate with business analysts, data architects, and engineering teams to translate business requirements into logical and physical data models.
- Design scalable, flexible, and performance-optimized data models to support analytics, reporting, and machine learning workloads on Databricks.
- Map and migrate legacy data structures (from EHR, EMR, billing, and operational systems) to the Databricks Lakehouse environment.
- Ensure data model designs comply with HIPAA and other healthcare data privacy regulations (good to have).
- Define and maintain data dictionaries, entity-relationship diagrams, and metadata documentation.
- Work with ETL/ELT developers to ensure models are implemented effectively and aligned with data ingestion and transformation pipelines.
- Partner with data governance teams to enforce data quality, standardization, and stewardship best practices.
- Optimize models for structured, semi-structured, and unstructured healthcare data (e.g., HL7, FHIR, claims data, imaging metadata).
Qualifications
We are an equal opportunity employer. All aspects of employment, including the decision to hire, promote, discipline, or discharge, will be based on merit, competence, performance, and business needs. We do not discriminate on the basis of race, color, religion, marital status, age, national origin, ancestry, physical or mental disability, medical condition, pregnancy, genetic information, gender, sexual orientation, gender identity or expression, citizenship/immigration status, veteran status, or any other status protected under federal, state, or local law.