Lead SRE and AI/ML Platform Engineer with 12 years of experience owning platform strategy, reliability, and
cloud cost governance across multi-region AWS, GCP, and Azure estates. Built and operated Kubernetes clusters
at 5,000+ node scale at Walmart and leads AI/ML infrastructureโGPU clusters, LLM hosting, RAG systemsโat
Grab. Consistently delivered measurable cost reduction: $10K/day savings at Walmart through open-source
autoscaler innovation; 30% cloud cost reduction at Grab via VPA right-sizing, spot instance adoption, and
intelligent autoscaling. Deep practitioner in GitOps (ArgoCD), observability (Prometheus, Grafana, ELK,
OpenTelemetry), and DevSecOps. Proven record of setting technical direction, leading SRE teams, and shipping
platform changes that multiply across dozens of product teams.
| ๐ Nationality | ๐ฎ๐ณ India |
| ๐ก Residency | ๐ฎ๐ณ India |
| ๐ Location | ๐ฎ๐ณ India |
|
|
rok.co/@sandeep_devops |
|
|
sdfljasfjkhsdfajsf โญ๏ธ Upgrade to Premium to contact |
| Skilled in | kubernetes eks aks gke helm openshift ai cluster api gitops argocd flagger jenkins ci cd prometheus grafana elk loki opentelemetry datadog dynatrace terraform ansible kyverno opa cilium istio ai ml infrastructure gpu clusters cuda llm hosting rag pipelines openfga aws azure gcp openstack python bash yaml sre slo sli rca bcdr velero |
| Fluent in | englishhindiapunjabi |
| Preferred annual pay (min) | $80,000/year |
| Preferred hourly pay (min) | $40/hour |
| Last seen | 2 days ago |
| Signed up | 2 years ago |
| Badges |
๐ Early adopter |
2024 - Now: lead sre @ grab
2022 - 2024: SSE Platform @ walmart
2021 - 2022: SDE2 โ DevOps Lead @ Maveric Systems Pvt Ltd
2018 - 2021: Senior Associate Consultant @ infosys
2015 - 2018: Senior Associate Technology โ DevOps @ Nagarro Software Pvt Ltd
2005 - 2009: Engineering @ kurukshetra university