Role Mission:
As a Lead CRM DevOps Engineer at StarHub, you will own the end-to-end DevOps strategy, platform reliability, and release governance for mission-critical CRM and Order Management (CRM/OM) platforms. You will lead the design and evolution of CI/CD pipelines, Kubernetes platform architecture, and cloud infrastructure, ensuring high availability, scalability, and secure delivery of CRM/OM microservices on AWS EKS.
This role goes beyond execution requiring you to drive DevOps best practices, establish SRE capabilities, and ensure end-to-end stability of customer journeys (order, provisioning, billing). You will act as a technical leader, guiding engineers and working across teams to enable faster, safer, and more reliable platform delivery.
Responsibilities:
- Drive DevOps strategy and operating model for CRM/OM platforms, acting as the technical escalation point for complex production issues.
- Architect and govern end ‑ to ‑ end CI/CD pipelines using Jenkins (Pipeline ‑ as ‑ Code) and GitOps (Argo CD / Flux), enabling safe, repeatable releases.
- Lead trunk ‑ based development adoption, automated testing, quality gates, and release safety mechanisms across microservices.
- Design and operate multi ‑ cluster Kubernetes platforms (EKS/OCP), including networking, RBAC, scaling, and resilience.
- Govern AWS cloud infrastructure and Infrastructure ‑ as ‑ Code using Terraform and/or CloudFormation, ensuring security, scalability, and cost efficiency.
- Apply SRE practices by defining SLOs/SLIs, improving reliability and performance, and leading incident management and RCA.
- Ensure end ‑ to ‑ end reliability of CRM and API integrations across systems.
- Establish observability and operational excellence using Splunk, Prometheus, Grafana, CloudWatch, and on ‑ call tooling.
- Contribute to automation and code improvements, participate in architecture reviews, mentor engineers, and drive documentation and knowledge sharing.
Qualifications:
- Bachelor's degree in Computer Science, Software Engineering, or a related field.
- 6+ years of hands-on experience across DevOps, SRE, and platform or software engineering, with at least 2–3 years in a senior or technical leadership role.
- Strong proficiency in Java and Spring Boot, with hands-on experience supporting microservices-based and distributed architectures.
- Deep understanding of scalability, reliability engineering, and production system performance.
- Advanced hands-on experience with:
- Kubernetes (EKS preferred), Helm, and container orchestration
- CI/CD platforms (Jenkins, GitLab CI) and GitOps practices
- AWS cloud services and architecture
- Infrastructure as Code (Terraform and/or CloudFormation)
- Strong SQL skills with hands-on PostgreSQL administration and performance tuning experience.
- Proficiency in Python and shell scripting for automation and platform tooling.
- Experience implementing observability and monitoring using tools such as Splunk, Prometheus, Grafana, and CloudWatch.
- Proven experience working in Agile/Scrum environments, with strong stakeholder management, communication, and technical leadership skills.
- Ability to operate effectively in a fast-paced, production-critical environment.