About Onyx Capital Group
Onyx Capital Group is the world’s #1 liquidity provider for energy derivatives, trading over $3.2 trillion in notional value annually. Through Onyx Commodities, the Flux Terminal, and our trading division Axis, we dominate the benchmarks that matter — Dated Brent, S2F contracts, and beyond. We are not a bank. We are not a hedge fund. We are a technology-driven market-making powerhouse at the intersection of global energy markets, data, and engineering — and we are growing fast.
Engineering at Onyx
We don’t hire passengers — we hire pilots. Every engineer here takes ownership, pushes back when the design isn’t right, and holds the bar high without being asked. We practise TDD, design for observability from day one, and treat software craftsmanship as a genuine competitive advantage. The team is small, senior, and embedded directly in the trading floor. There are no layers of abstraction between you and the business — your work has immediate, measurable impact from day one.
The Opportunity
We are establishing our Sri Lanka engineering hub and looking for two Senior Platform Infrastructure Engineers to anchor the team’s cloud and reliability function. Working closely with our UK/Dubai platform team, you will own the cloud infrastructure that underpins our live trading systems — responding to incidents, driving automation, and building the reliability and tooling that the business depends on around the clock. These roles provide critical coverage during Singapore business hours and the early Dubai / UK hours.
Requirements
- 5–10+ years in a cloud infrastructure or platform engineering role
- Microsoft Azure required — compute, networking, storage, and managed services; comfortable designing, deploying, and operating cloud environments at scale
- Infrastructure as Code: proven Terraform experience including state management, modules, and multi-environment deployments — treating infrastructure as versioned, reviewed software
- Kubernetes in production — deploying and managing containerised workloads, troubleshooting pod and cluster issues, and understanding networking and storage within Kubernetes
- Python required — scripting for automation, operational tooling, monitoring, and process improvement
- Source control and CI/CD: GitHub, Azure DevOps, or GitLab, including branching, PR workflows, and pipeline configuration
- Incident response experience — able to diagnose, triage, and communicate issues clearly under pressure
- Highly self-directed — able to prioritise independently, manage your own workload, and escalate appropriately without needing day-to-day oversight
- Customer-obsessed approach to platform reliability — treats internal engineering teams and traders as users, takes their pain points seriously, and drives improvements without being asked
- Experience providing remote infrastructure support to distributed engineering teams — comfortable being the point of contact for platform issues across time zones and communicating status and resolution clearly
- Strong written and spoken English — essential for asynchronous communication with the UK team across time zones, including incident reports, runbooks, and technical documentation
- Experience working with globally distributed teams is strongly welcomed; comfortable collaborating across time zones and cultures in a fast-moving environment
Nice to Have
Monitoring and observability tooling — Prometheus, Grafana, Azure Monitor
- Security and compliance in cloud environments
- Cloud cost management and optimisation
- AWS experience alongside Azure
- Prior experience in financial services or fintech, or a genuine interest in financial technology and capital markets — not required, but candidates who are curious about the domain tend to thrive here
- C# or other object-oriented programming experience — C# particularly advantageous given our existing platform stack
- Experience writing and maintaining runbooks, incident post-mortems, and knowledge base documentation for distributed teams
What We’re Looking For
You treat infrastructure as a craft — versioned, tested, observable, and built to last. You are comfortable being on the front line of incidents and staying calm when systems misbehave. You communicate clearly with engineers thousands of miles away and take pride in the reliability of systems you can’t always see. You automate what others accept as manual, and you never ship something you wouldn’t be happy to own at 2am.