What will you do
- Lead and evolve our AWS and EKS infrastructure to ensure reliability, scalability, and performance.
- Improve and automate our CI/CD pipelines, and help mentor engineers on infrastructure and DevOps best practices.
- Strengthen our observability stack (Prometheus, Grafana, ELK) and drive automation across our systems.
- Continuously analyze and optimize system performance, identifying bottlenecks and building long-term improvements.
- Work closely with developers and product teams to streamline deployments and reduce friction.
- Own incident response and contribute to our on-call rotation with a focus on prevention and resilience.