BlackLine
Jun 2024 - Present | Site Reliability Engineer | Bengaluru, KA
- Operating high-availability production infrastructure with 99.9%+ uptime SLAs and on-call incident management.
- Led GCP cost optimization across right-sizing, resource auditing, and database upgrades, saving $250K+ in a single quarter.
- Migrated legacy release pipelines to GitHub Actions, reducing manual release intervention by 40%.
- Built an auto-healing system wiring New Relic telemetry → PagerDuty → GitHub Actions → Ansible runbooks for instant remediation of known failures.
- Led zero-downtime migration of Apache NiFi clusters from Chef to Ansible, then orchestrated automated multi-region deployments.
GCP, Kubernetes, Ansible, GitHub Actions, New Relic, PagerDuty