Core Responsibilities (High Level)
- Design, provision, and manage cloud infrastructure using Infrastructure as Code
- Operate and support Kubernetes clusters in production environments
- Build and maintain GitOps-based deployment and configuration workflows
- Support cloud database platforms and platform services in production
- Contribute to environment setup, upgrades, migrations, and reliability initiatives
- Develop automation and tooling to improve operational efficiency
- Troubleshoot infrastructure and platform issues across environments
- Collaborate with Cloud Operations and Engineering teams to ensure stable, scalable systems
Required Skills & Experience
- 4+ years of hands-on experience in Cloud / Platform / SRE / DevOps engineering
- Strong hands-on experience with Terraform (Infrastructure as Code)
- Excellent knowledge of Kubernetes (operations, troubleshooting, upgrades, networking, storage)
- Strong experience with MySQL in production environments
- Strong hands-on expertise with GitOps workflows (e.g., ArgoCD or Flux)
- Proficiency in at least one scripting language (Bash, Python, etc.)
- Solid understanding of:
  - CI/CD pipelines
  - Cloud networking and security fundamentals
  - Monitoring and logging systems
  - High availability and reliability concepts
Preferred / Plus Skills
- Experience with MariaDB
- Experience operating Kafka in production environments
- Experience with multi-region or large-scale cloud platforms
- Exposure to zero-downtime upgrades and platform migrations
Qualifications
B.E./M.E. (CS/EE), MCA, graduate degree, or equivalent higher-level degree
Additional Information
Soft Skills
- Strong problem-solving and debugging ability
- Comfortable working in production environments
- Good communication and collaboration skills
- Ownership mindset and attention to detail
- Ability to work independently in a remote, distributed team
Work Requirements
- Must be able to support US Eastern time zone working hours