This role is for one of the Weekday's clients
Location: Hyderabad
Work Type: 5 Days WFO (On-Site)
Work Timings: 2 PM – 11 PM
Type: Full-Time
Experience: 5+ Years (Senior Level)
Domain: Digital Adoption Platform (DAP)
We are looking for a highly skilled, hands-on Senior / Lead DevOps Engineer to own and manage the complete DevOps ecosystem end-to-end for a rapidly scaling B2B SaaS platform. The core product is a Digital Adoption Platform (DAP) delivered via browser extensions that overlay and guide users across enterprise applications. With millions of active users, the platform demands highly scalable, secure, and resilient infrastructure.
This role requires an engineer who can architect, build, optimize, and operate complex cloud environments, scale real-time data pipelines, and support multi-tenant SaaS as well as on-prem enterprise deployments. You will play a critical role in ensuring platform reliability, performance, and security at scale.
If you thrive in ownership-driven environments and enjoy designing infrastructure that supports large-scale SaaS systems, this role is for you.
Requirements
Key Responsibilities
Cloud Infrastructure & Architecture (AWS)
- Architect, implement, and maintain scalable cloud infrastructure using AWS services including:
VPC, Route53, CloudFront, S3, EKS, EC2, ALB/ELB, Lambda, RDS, Elasticache, API Gateway, Kinesis Streams, ClickHouse - Design and maintain a multi-tenant SaaS architecture supporting millions of active users
- Optimize and scale high-throughput event pipelines for user activity tracking and analytics processing
DevOps & Automation
- Own and enhance the complete CI/CD ecosystem using GitHub Actions
- Build automated deployment pipelines for browser extensions, microservices, backend services, and frontend applications
- Implement Infrastructure as Code (IaC) using Terraform or CloudFormation
- Build and maintain Docker images and containerized services
- Manage Kubernetes workloads using Helm charts
Monitoring, Security & Reliability
- Implement end-to-end observability: metrics, logs, traces, and alerting
- Enforce cloud and application security best practices across infrastructure and deployments
- Ensure high availability through auto-scaling, disaster recovery, and backup strategies
- Lead security hardening initiatives including CVE remediation, container security, and dependency management
On-Premise Deployments
- Design and deliver on-prem enterprise deployments that mirror the cloud SaaS architecture
- Collaborate with enterprise customers to customize deployment models as required
- Build automation tools and scripts for installation, upgrades, monitoring, and maintenance
Collaboration & Technical Leadership
- Partner with Engineering, QA, and Product teams to enable reliable and frequent releases
- Mentor junior DevOps engineers and promote cloud and DevOps best practices across teams
- Participate in architecture reviews and influence long-term technical decisions
Qualifications
Education
- Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field
Required Experience & Skills
- 5+ years of DevOps experience, with at least 3+ years in a senior or lead role
- Strong expertise in AWS cloud architecture (mandatory)
- Hands-on experience with:
- AWS VPC, EC2, RDS, Elasticache
- Kubernetes (EKS), Docker, Helm
- S3, CloudFront, Route53
- API Gateway, Lambda, Kinesis Streams
- ClickHouse or similar columnar databases
- Strong CI/CD experience using GitHub Actions
- Infrastructure as Code using Terraform
- Proficiency in scripting and automation using Python, Shell, and Node.js
- Solid understanding of distributed systems, caching, load balancing, and event-driven architectures
Scalability & Performance
- Proven experience scaling distributed systems for high-traffic, large user bases
- Hands-on experience designing high-throughput analytics and real-time data pipelines
On-Prem Deployment Experience
- Demonstrated experience replicating SaaS architectures for on-prem environments
- Ability to automate both containerized and non-containerized deployments
Other Requirements
- Strong debugging, troubleshooting, and root-cause analysis skills
- Ownership mindset with the ability to independently deliver end-to-end solutions
- Excellent communication and cross-functional collaboration skills
Nice-to-Have Skills
- Experience with Digital Adoption Platforms or browser-based SaaS products
- Familiarity with observability tools such as Grafana, Prometheus, Datadog
- Exposure to SOC 2, ISO, or similar compliance environments
Sponsored
Explore Engineering
Skills in this job
People also search for
Similar Jobs
More jobs at Weekday AI
Apply for this position
Sign In to ApplyAbout Weekday AI
At Weekday (backed by YC; also Product Hunt #1 product of the day), we are building the next frontier in hiring. We have built the largest database of white collar talent in India and have built outreach tools on top of it to generate highest response ...