Since 2005, MDCalc has been an essential part of the clinician’s workflow to help achieve better patient outcomes. Actively used by more than 65% of physicians worldwide, MDCalc is the most broadly used medical reference – at the point-of-care – for clinical decision tools and content, and one of only four references used by >50% of US HCPs. These evidence-based tools and content are used by millions of medical professionals globally and support 50+ specialties and cover 200+ patient conditions.
We are seeking a Lead Data Scientist to play a central role in shaping how MDCalc uses data to drive product innovation, inform business decisions, and improve clinical outcomes. This is a high-impact opportunity to define data strategy, build scalable solutions, and deliver insights that guide the future of our platform and help millions of clinicians care for hundreds of millions of patients.
As a Lead Data Scientist, you will apply rigorous statistical methods, predictive analytics, and applied machine learning techniques to solve meaningful product and business problems. You will work closely with Product, Engineering, and Clinical teams to design models, evaluate impact, and build scalable data solutions that directly influence MDCalc’s platform and mission. While this role focuses on applied statistical modeling and predictive analytics exposure too and progress use of generative AI and/or large language model development is considered a plus.
This role is ideal for someone who is deeply curious, technically strong, and motivated by applying data science to real-world healthcare problems with tangible impact.
The responsibilities of this individual include the following, but are not limited to:
Define and lead analytical projects that influence product direction, business strategy, and clinical impact
Develop predictive models and behavioral analyses that help identify which clinicians are using MDCalc, how they engage with clinical tools, and how usage patterns evolve across specialties, care settings, and patient conditions
Analyze large and complex healthcare datasets, including clinician engagement, product usage patterns, and provider behavior, to uncover patterns, opportunities, and actionable insights
Design and implement robust data pipelines, workflows, and dashboards that scale with MDCalc’s growth
Partner with product managers, engineers, and clinical experts to translate data into product requirements and measurable outcomes
Establish experimentation frameworks and lead A/B testing to optimize product performance
Set standards for data integrity, reproducibility, and quality, ensuring confidence in insights across the organization
Act as a thought partner for leadership, communicating findings and recommendations clearly to drive strategic decisions
Bachelor’s, Master’s, or PhD in Data Science, Statistics, Computer Science, Applied Mathematics, or a related field
10+ years of experience in applied data science, statistical modeling, or predictive analytics, ideally working with healthcare, clinical, or provider behavior datasets
Strong foundation in statistical methods, predictive modeling, and machine learning concepts
Proficiency in Python or R and associated data science libraries such as pandas, scikit-learn, statsmodels, NumPy, or similar
Strong SQL skills and experience working with large datasets
Experience developing, evaluating, and improving machine learning models in real-world applications
Experience designing and analyzing experiments and interpreting results with statistical rigor
Ability to communicate complex technical concepts clearly and effectively
Strong ownership mindset and ability to operate independently in a fast-moving environment
Familiarity with data visualization and BI tools such as Tableau or Looker, with the ability to tell compelling data stories to diverse audiences
Expertise in Python (pandas, NumPy, scikit-learn) and SQL, with experience building production-grade predictive or statistical models
Deep knowledge of machine learning techniques such as regression, classification, clustering, recommendation systems, and causal inference
Proven track record of leading data initiatives from concept to implementation and influencing product roadmaps
Experience designing experiments, running A/B tests, and measuring product success through data
Experience working with healthcare, clinical, or life sciences datasets such as provider behavior, claims data, utilization data, or clinical decision support platforms is strongly preferred
Ability to make a true difference in medicine: MDCalc is the most broadly used medical reference used by 65% of physicians worldwide.
Medical, Dental, & Vision coverage, with option to extend to your dependents
Company-sponsored short-term insurance
Fully-paid 8 week parental leave, after 6 months of employment
Company-sponsored 401k, after 3 months of employment
Unlimited vacation for salaried roles - we trust you to take the time you need
Tri-annual company offsites to connect, reflect, and plan together
Work from home monthly stipend
Hybrid work environment with a great team office in Greenwich Village, NYC
A culture of fun and motivated team members who believe in a greater mission here at MDCalc