Shubham SinghScaling Platforms
Engineering Intelligence
Building Relations

What I Bring to the Table

I architect resilient infrastructure at enterprise scale while maintaining the velocity and innovation mindset of a startup founder. My approach: eliminate toil through intelligent automation, build platforms that empower teams rather than constrain them, and leverage smart solutions to anticipate failures before they impact users. I don't just maintain systems - I transform them into competitive advantages.

800+
K8s Clusters Managed
99.99%
Uptime Achieved
70%
Cost Reduction

Core Strengths

Where I Create Leverage

From AI-driven operations to cost-optimized multi-cloud platforms, here are the themes I reliably deliver on.

AI Ops & SRE
  • Operate 800+ Kubernetes clusters across AWS, GCP & Alibaba
  • Design AI agents (Warden, K8sGPT) for self-healing & auto-remediation
  • Deliver 99.99% uptime via chaos testing, runbooks, and proactive detection
Platform Engineering & DevSecOps
  • Build GitOps-driven supply chains with Terraform, Spinnaker, ArgoCD
  • Create multi-tenant developer platforms that remove toil and accelerate releases
  • Embed security (OPA, Vault, GuardDuty) into CI/CD and infra automation
Cloud Architecture & FinOps
  • Lead cost reductions up to 70% via workload rightsizing & smart caching
  • Architect hybrid infrastructure with AWS CDK, Crossplane, Aurora, Redis
  • Establish performance baselines and governance for enterprise platforms
Observability & Incident Intelligence
  • Build full-fidelity telemetry stacks with Prometheus, Grafana, Splunk, Loki
  • Translate SLOs into actionable alerts, runbooks, and on-call automation
  • Use AI summarization to speed triage and keep humans focused on impact
Backend Systems & Software Architecture
  • Design event-driven services (Kafka, gRPC, WebSockets) that handle 10K+ RPS with 99.9% uptime
  • Lead monolith-to-microservices migrations with clear domain boundaries and DX-first APIs
  • Build zero-downtime CI/CD pipelines, feature flags, and rollout strategies for mission-critical apps

What Colleagues Say

Real recommendations from engineers, mentees, and teammates

Technical Leadership

"Shubham was highly regarded by the team and his expertise and knowledge, alongside his long practical experience, helped drive the project in the right direction."

Adrian Anghel

Senior Software Engineer, Octonius Inc.

Teaching & Communication

"He is a great teacher and I found it very easy to learn from him and retain all the information. He is extremely passionate and hardworking about the work he is doing."

Charishma Thota

Solutions Architect at BigPanda

Mentorship & Growth

"As a leader, he cares about the growth of others. His kindness and patience help me grow from an intern who knows a little about Node.js to someone who can write full-stack code within 3 months."

Jessie Jia

Founder | Ex Meta