Arbitalhealth

Senior Data Engineer

Arbitalhealth San Francisco OR Remote 1 day ago
Arbital Health is a rapidly growing healthcare technology and actuarial leader that centralizes, measures, and adjudicates value-based care contracts at scale. We enable payers and providers to design, measure, and execute value-based agreements with greater transparency, efficiency, and financial predictability.
We invest in hiring high-potential, humble individuals who thrive in fast-paced environments and can rapidly grow their responsibilities as we continue to accelerate our growth.

We were co-founded by Brian Overstreet and Travis May (founder & former CEO of LiveRamp and Datavant, the two biggest data companies of the last 20 years), and are backed by Transformation Capital, Valtruis, and other leading investors. In our first two years, Arbital Health has established itself as a trusted partner for over 40 payers, providers, and other stakeholders looking to navigate the complexities of risk-based contracting.

The role:
In 2024, we successfully launched our production platform, establishing scalable data pipelines to ingest, enrich, and summarize vital healthcare financial data. Building on this foundation, we recently launched Merlin AI, the first value-based care (VBC) AI assistant that allows users to interact with complex VBC contracts and performance data in natural language and visualize it on demand. Built by the industry’s leading VBC actuaries and engineers, Merlin AI makes complex contract performance data conversational, transparent, and instantly actionable. The continued success of this feature hinges on the quality, volume, and freshness of the data we organize and provide. As a key data engineer, you will be crucial in curating this data as it evolves and scaling our pipelines as we grow, ensuring our customers gain effortless, high-value insights from a wide range of questions.

What You'll Do:

  • Leverage your advanced skills in Python and SQL, and your experience with platforms like Databricks and AWS, to build highly scalable data pipelines and warehouses
  • Design and maintain context engineering pipelines (embedding generation, indexing, vector storage) for conversational AI, collaborating with the Senior AI Engineers and Data Scientists
  • Develop and deploy scalable AI/ML data pipelines, specializing in data preparation and serving for LLMs and RAG systems
  • Stay ahead of emerging tech and integrate it into business solutions
  • Automate data workflows for ingestion, cleansing, quality assurance, enrichment, and aggregation
  • Ensure data accuracy, integrity, privacy, security, and compliance through automated quality control procedures
  • Collaborate with an actuarial science team
  • Contribute to the design, implementation, and overall development of our products
  • Drive innovation and deliver valuable features for our customers

What You'll Bring:

  • Experience with data-intensive, full-stack development projects
  • Experience with data/AI architecture and MLOps
  • Proven, hands-on experience designing and implementing data-intensive solutions using LLMs, vector databases, embeddings, and context engineering techniques (RAG, summarization)
  • Able to work entrepreneurially – self-motivated, ambitious, and fast-paced
  • Able to ship extremely high caliber code and build exceptional products
  • High level of attention to detail
  • Ability to perform under minimal supervision with accountability for specific objectives and work in a rapidly changing, ambiguous start-up environment
  • Passionate about improving and innovating
  • Startup experience is highly preferred

Our team works hybrid from the San Francisco Bay Area. We will prioritize candidates who are able to work from our office one to two days per week, and we will also consider highly qualified remote candidates who are able to travel for in-person collaboration in San Francisco at least one full week per month.

Tools We Use:

  • Core Tools: Python, R, SQL, Next.js, React, TypeScript, Tailwind CSS
  • Infrastructure: AWS/GCP, Databricks, Heroku, Airflow, Sigma
  • Version Control: GitHub
  • Team Planning: Jira, Confluence, Figma

Why Join Us?
We are assembling a team of creative, talented visionaries seeking to build a new technology that will change healthcare. You will be able to learn, build, and scale our team and technology in a collaborative, creative culture that values every team member.

We Offer:

  • Generous equity grants of ISO stock options
  • An exceptional benefits package with high employer-paid contributions for health, dental, and vision insurance
  • 4% 401(k) match
  • Flexible PTO, a weeklong winter shutdown, and 10 holidays each year
  • Occasional travel required for quarterly team offsites
  • The opportunity to build a critical software platform that accelerates the American healthcare system's transition to value-based care
