Professional Journey

Driving innovation through platform engineering, DevOps excellence, and AI-powered solutions - from startups to enterprise scale

Salesforce, Inc.

Member of Technical Staff

Hyderabad, India | Oct 2024 - Present

Member of Technical Staff - (Hybrid/Remote)

Strengthening platform resilience and scale - owning software development, CI/CD pipeline optimization, infrastructure automation, proactive monitoring & alerting, security hardening, performance tuning, incident diagnosis, and cross-team collaboration.

Key Highlights

  • Achieved 99.99% service availability for 800+ Kubernetes clusters across hybrid cloud (AWS/Alibaba Cloud/GCP) and on-prem environments through systematic OS patching, kernel upgrades, and IPVS optimization.
  • Cut deployment cycles by 30% by architecting streamlined CI/CD pipelines with Terraform, Spinnaker, and ArgoCD, enabling faster iteration velocity for engineering teams.
  • Reduced mean time to detect (MTTD) by 40% through comprehensive Grafana monitoring dashboards and Splunk integration, shortening incident response times.

Key Achievements & Contributions

  • Oversaw critical infrastructure tasks, including regular OS patching, kernel upgrades, and IPVS module rollouts to optimize load balancing for on-prem systems.
  • Designed and developed an advanced AI agentic framework (Warden AI Ops) enabling multi-tenancy, allowing tenants to build customized AI agents directly on top of this platform.
  • Leveraged this framework to significantly enhance automated incident detection & remediation capabilities, achieving a 30% reduction in incident resolution times, empowering tenants to innovate rapidly.
  • Created agents including a weekly SQR report generator; a K8s troubleshooting agent that uses K8sGPT to diagnose and auto-heal Kubernetes issues; and a PagerDuty alert agent that describes each alert and suggests remediation.
  • Created Kubernetes Operators for automated self-healing workflows, reducing manual intervention by approximately 40%.
  • Actively engaged in cross-team collaboration, contributing to architecture design, security assessments, quarterly reliability initiatives, and performance tuning.
  • Building the Customer Engagement Framework (CEF), which remediates PagerDuty alerts by intercepting the underlying metrics from Argus queries and raising them back to our tenants, reducing ops toil from unnecessary alerts by 30%.

Impact & Results

  • 99.99% service availability across 800+ K8s clusters
  • 30% reduction in deployment cycles
  • 40% faster issue detection (MTTD)
  • 30% reduction in incident resolution times
  • 40% reduction in manual intervention
  • 30% reduction in unnecessary alerts
  • Enabled multi-tenant AI operations platform for rapid innovation
  • Established self-healing infrastructure patterns
  • Improved cross-team collaboration and architectural standards

Technologies & Tools

Kubernetes, Terraform, Spinnaker, ArgoCD, Grafana, Splunk, K8sGPT, AI Ops, RHEL, AWS, GCP, Alibaba Cloud


Airtel International

Site Reliability Engineer (E2)

Gurgaon, India | May 2022 - Sep 2024

Site Reliability Engineer (E2) - (Hybrid/Remote)

Kept the ship sailing! Responsible for software development, releases, deployments, monitoring, CI/CD automation, security, performance diagnosis, and support of the IT infrastructure environment.

Key Highlights

  • Earned the Einstein Award within months of joining by delivering exceptional performance and driving critical platform improvements across the organization.
  • Grew and led a team of 3 engineers from formation to high performance, implementing mentorship programs that improved project delivery speed and code quality.
  • Enabled telecom operations across 14 countries by successfully deploying NPO, KYC 2.0, E-Sim, and CLM platforms, directly impacting millions of subscribers.

Technologies & Tools

Kubernetes, Rancher, Flux, Jenkins, Grafana, Kibana, MEAN Stack, Java, Python, AI/ML Ops, SLO Monitoring


Amway India

DevOps → DevSecOps Engineer

Gurgaon, India | Jun 2020 - Apr 2022

DevOps Engineer (Remote) → DevSecOps Engineer (Hybrid)

Responsible for development and deployments, and engaged in multiple initiatives to give users a frictionless shopping experience. Worked as Technical SEO Manager and Technical DevSecOps Engineer across various functional teams, and as Product Owner & Engineer to mitigate production support incidents.

Key Highlights

  • Eliminated 500 recurring production incidents (100 monthly and 400 annual) by architecting ACUTE - a centralized platform serving 5+ critical business domains including reconciliation, sales, tax, and support.
  • Achieved 100% Lighthouse and DeepCrawl SEO scores (up from a stagnant 67%) as Technical SEO Manager, directly improving organic traffic and search visibility for Amway India's e-commerce platform.
  • Earned rapid promotion to DevSecOps Engineer within 11 months and received 3 performance awards (Flame & Ignite) for consistently exceeding expectations.

Technologies & Tools

AWS, Jenkins, Node.js, InfluxDB, Grafana, Docker, Lambda, Kinesis, IAM, Headless Chrome, SEO, CI/CD