Join TeraWatt’s innovative technology team as a DevOps Software Engineer and help drive the evolution of our platform to support one of the leading fleet EV charging solutions in North America. In this role, you will contribute to the development and reliability of TeraWatt’s charging network management system and the capabilities that enable seamless charging and a high-quality site experience for customers.
Working closely with the Director of Software, the Product team, and other technical partners, you will play a key role in expanding our platform and delivering meaningful value in a rapidly evolving industry. This role will help scale our cloud infrastructure and support organizational growth by implementing best practices that improve the reliability, performance, and maintainability of our charging infrastructure and software systems.
If you are a DevOps or Cloud Infrastructure Software Engineer who is motivated to help build scalable, impactful technology with a collaborative and innovative team, we encourage you to apply. Your work at TeraWatt will contribute directly to advancing the sustainable transportation infrastructure of the future.
Lead and architect the evolution of our cloud infrastructure using Terraform, building resilient and scalable systems to support business growth.
Maintain helm charts and deployment patterns that enable teams to manage the lifecycle of their services while adhering to established deployment standards.
Build tooling to enable engineering teams to own the application deployment process through CI/CD pipelines using GitHub Actions.
Promote security best practices across all layers of the stack, including software access, managed workloads, and services running in pre-production and production environments.
Strengthen cloud and network security using industry-standard tools to detect vulnerabilities and anomalies, and help prevent suspicious or malicious activity.
Advance observability practices using frameworks such as OpenTelemetry (OTel) and tools like Grafana Cloud for monitoring and alerting across services and infrastructure.
Develop tooling that supports both local and remote container-based cloud development workflows.
Create and automate simulated production scenarios used for testing during development and validating production releases.
Implement automation and alerting to maintain security and compliance standards, including SOC 2 controls.
Design and manage infrastructure that supports machine learning model training and deployment, ensuring scalable compute resources for ML workloads.
Partner with the Data team to manage core data infrastructure, including our Databricks data lake and Kafka event streams (Aiven/AWS), while advising on scalable data architecture and infrastructure improvements.
Participate in a 24/7 on-call rotation to support the reliability of production systems.
8+ years of experience building and operating high availability production software systems, preferably in DevOps or platform engineering teams.
Experience building and maintaining scalable cloud-based infrastructure, including services running in managed Kubernetes (EKS).
Experience building or maintaining CI/CD pipelines (e.g., GitHub Actions) to support reliable software delivery.
Experience leading or contributing to SRE or DevOps initiatives supporting production cloud platforms.
Experience with observability frameworks and tools (e.g., OpenTelemetry, Grafana, or similar platforms).