Senior Site Reliability Engineer

Senior Site Reliability EngineerCentral London (Hybrid)Up to 100k + Car Allowance and Bonus TRIA are working with a leading hospitality client to hire a Senior SRE, where they are investing heavily in the performance, stability, and reliability of its digital platforms. This is a hands-on leadership role - you won''t just guide others, you''ll be the go-to expert when systems are under pressure. You''ll lead incident response, own root cause analysis, and solve performance issues like memory leaks, outages, and flaky services.You will take ownership of the site reliability and drive that as a discipline.Your focus will include:Leading incident management, post-mortems, and blameless RCAsBuilding scalable, resilient microservices with the dev teamsUplifting observabilityImproving alerting, monitoring, and system-level metricsDriving better SLOs, SLIs, and overall uptimeWhat you''ll bring:Experience in high-traffic digital or eCommerce platforms5+ years in SRE/DevOps roles; strong background in incident responseObservability, automation, and infrastructure as code expertiseLeadership skills - mentoring others or leading from the front The stack includes Kubernetes, Terraform, AWS, Python, and modern CI/CD tools, and it''s evolving.If you understand what a good SRE practice looks like, and want to leave systems in a better place than you found them, please apply to be considered and learn more!
Other jobs of interest...





Perform a fresh search...
-
Create your ideal job search criteria by
completing our quick and simple form and
receive daily job alerts tailored to you!