What is the salary for this Software Engineer (Synthetic Data) role?

Salary information is not publicly listed for this position. Apply directly to discuss compensation with V7labs.Com.

Where is this Software Engineer (Synthetic Data) position located?

This is an on-site position at V7labs.Com located in Remote.

How do I apply for this Software Engineer (Synthetic Data) job at V7labs.Com?

Click the 'Apply' button on this page to be redirected to V7labs.Com's application portal. Make sure to have your resume ready and tailor your application to highlight relevant experience.

V7labs.Com is actively hiring for Engineering roles. Visit the company page to see all open positions and learn more about working at V7labs.Com.

Software Engineer (Synthetic Data)

V7labs.Com Remote Today

engineering

V7

At V7, we’re building AI platforms that help humans do their best work, at incredible scale and speed. Our mission is to turn human knowledge into trustworthy AI, making complex tasks faster, smarter, and more accurate.

We’re growing fast, backed by leading investors and AI pioneers (including the minds behind Transformers and Gemini).

The team you'll be joining and impact you'll have

You'll join a high-impact research and engineering team developing large-scale synthetic data pipelines that power frontier AI models
Bridge the gap between cutting-edge research and production systems, translating experimental insights into scalable infrastructure
Work directly on training data that shapes the next generation of multimodal AI capabilities
Collaborate with researchers and ML practitioners who care deeply about both scientific rigor and real-world impact

What you'll be doing from day one:

Design and maintain modular synthetic data generation pipelines for multimodal training tasks, using proxy metrics and statistical experiments to validate output quality
Own end-to-end LLM experiments—from prompt engineering to reproducible evaluation—accelerating iteration with modern AI tooling like Cursor and GitHub Copilot
Build rigorous evaluation frameworks that assess pipeline performance through sound experimental design and clear statistical reasoning
Translate research breakthroughs into production-ready systems, working across the stack with researchers and engineers to scale what works

Who you are

You have 3+ years of software engineering experience (Python or JavaScript preferred) and an MS+ in Computer Science, Engineering, Mathematics, or related field with strong foundations in statistics and experimental design
You're fluent in LLM systems—from context engineering to output optimization—and stay current with research on training datasets and evaluation benchmarks (ChartQA, ChartGalaxy, synthetic data techniques)
You're comfortable with data infrastructure: Git, DVC, shell environments, and pipeline orchestration are second nature to you
You thrive in ambiguity with a bias toward action, viewing iteration and feedback as catalysts for discovery rather than setbacks, while maintaining scientific rigor in both code and evaluation

V7 champions equality and inclusion because diverse teams build better products. Don't check every box? Apply anyway — we value what makes you unique and will support you through the process, just let our Talent team know how they can help.