We are seeking a Data Solutions Architect with 15+ years of experience in designing enterprise-scale data platforms. This role focuses on building Azure + Databricks Lakehouse solutions for clinical trial and life sciences data, enabling advanced analytics, machine learning workflows, and regulatory compliance.
Responsibilities
Architect Lakehouse solutions leveraging Azure Data Lake (ADLS Gen2), Databricks, and Delta Lake.
Design data models (star, snowflake, data vault), ingestion pipelines, and CDC strategies with schema evolution and performance tuning.
Implement data governance, security, and compliance aligned with GxP, HIPAA, and 21 CFR Part 11.
Enable data science and ML workflows using MLflow, Feature Store, and curated datasets.
Collaborate with clinical operations and biometrics teams to deliver business-aligned solutions.
Experience
15+ years in data architecture/engineering; 5+ years with Azure; 3+ years with Databricks.