Rightmove

Engineering Manager (Site Reliability)

Posted 11 Days Ago
In-Office
London, Greater London, England
Senior level
The Engineering Manager for Site Reliability will lead teams ensuring operational excellence, incident management, and monitoring across platform services to support product development and enhance reliability.
The summary above was generated by AI

Our vision is to give everyone the belief they can make their move. We aim to make moving simpler, by giving everyone the best place to turn to and return to for access to the tools, expertise, trust, and belief to make it happen.

We’re home to the UK’s largest choice of properties and are the go-to destination for millions of people planning their next move, reading the latest industry news, or just browsing what’s on the market. 


The Role: Engineering Manager (Site Reliability)

Location: London / Hybrid (2 days per week in office)

Reporting to: Head of Technology Operations


The Role

The Platform and Reliability Engineering Teams are responsible for the services that underpin the Rightmove website and enable all of our product development teams to ship functionality rapidly and safely. We strive to deliver annual availability of at least 99.99% (less than 5 mins downtime a month).  
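As a rough illustration of what a 99.99% target implies, the sketch below converts an availability percentage into a downtime budget. The function name and the choice of a 365-day year split into 12 equal months are assumptions made for this example, not Rightmove's own method of measurement.

```python
def downtime_budget_minutes(availability_pct: float, period_days: float) -> float:
    """Return the minutes of downtime allowed in a period at a given availability."""
    if not 0 <= availability_pct <= 100:
        raise ValueError("availability_pct must be between 0 and 100")
    period_minutes = period_days * 24 * 60
    return period_minutes * (1 - availability_pct / 100)


# Assumption: a 365-day year, averaged over 12 equal months.
yearly = downtime_budget_minutes(99.99, 365)
monthly = yearly / 12
print(f"{yearly:.1f} min/year, {monthly:.2f} min/month")
```

Under these assumptions the annual budget is a little under 53 minutes, which is consistent with the "less than 5 mins a month" figure quoted above.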

The Site Reliability Engineering Manager’s role is to ensure operational excellence, drive observability and reliability at scale, and own the incident management processes and tools. 

This position blends people leadership, full stack reliability engineering, service management and influencing without authority. The successful candidate brings strong technical experience in reliability engineering, monitoring, alerting and observability for product‑led technology companies, combined with strong customer empathy and communication skills.

Monitoring, Alerting, and Observability 

  • Product teams have high confidence in reliability; incident detection and resolution are smooth thanks to proactive monitoring, well‑maintained alerts and logs, and high levels of observability coverage.
  • Clear reliability expectations between platform, security, product & business. Prioritisation based on reliability risk and real data.  

Incident Management 

  • Consistency and standardisation of incident management, resulting in fast incident detection and resolution
  • Maintaining a culture of accountability, transparency, collaboration & learning
  • Good data quality, insights & decision‑making with strong feedback loops to all relevant stakeholders 

Reliability Engineering 

  • Clear reliability patterns and standards drive strong reliability and fewer cascading failures, e.g. probes, graceful termination/degradation, timeouts, retries, backoff, jitter, circuit breakers and bulkheads.
  • Shared understanding of how our system fails and where its weak points are, with prioritised improvement plans in place.
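For readers less familiar with the patterns named above, here is a minimal sketch of three of them working together: retries, exponential backoff and jitter. The function `retry_with_backoff`, its parameters and the `flaky` stand-in are hypothetical names chosen for this example; they do not describe Rightmove's actual standards or tooling.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def retry_with_backoff(
    operation: Callable[[], T],
    max_attempts: int = 5,
    base_delay: float = 0.1,
    max_delay: float = 5.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call operation, retrying on any exception with capped exponential
    backoff and full jitter. The last failure is re-raised."""
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        try:
            return operation()
        except Exception:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the error to the caller
            # Backoff: the delay ceiling doubles each attempt, up to max_delay.
            ceiling = min(max_delay, base_delay * 2 ** attempt)
            # Full jitter: sleep a random time in [0, ceiling] so that many
            # clients retrying at once do not synchronise.
            sleep(random.uniform(0, ceiling))


# Usage: a hypothetical dependency that fails twice, then succeeds.
calls = {"n": 0}


def flaky() -> str:
    calls["n"] += 1
    if calls["n"] < 3:
        raise ConnectionError("transient failure")
    return "ok"


delays: list = []  # record delays instead of sleeping
result = retry_with_backoff(flaky, sleep=delays.append)
```

Passing `sleep` as a parameter is a design choice for testability. In practice a wrapper like this would usually retry only errors known to be transient and would sit alongside timeouts and a circuit breaker.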

Delivery and Execution 

  • Own and manage reliability roadmap and metrics, initiatives/projects, and OKR delivery in line with expectations.
  • Align reliability strategy and delivery plans with business goals, partnering with technical product manager, DX, CF, DBA, security, product and data teams. 

People Leadership & Team Development 

  • Support the team with objectives and growth plans to improve skills, confidence, and impact aligned with business objectives
  • Guide engineers in designing scalable, secure, resilient platform services
  • Create an inclusive, psychologically safe environment; tailor leadership to individual strengths and motivations.

What you’ll bring  

  • Proven experience in site reliability engineering management, overseeing observability, monitoring, reliability, and service delivery in production environments.
  • Understanding of reliability in distributed software microservices and cloud-based environments.
  • Experience implementing and running modern SRE tooling, incident management workflows and SRE service management frameworks, e.g. SLOs/SLIs.
  • Familiarity with platform engineering concepts including developer platforms, reusable platform components, reducing friction for product teams.
  • Experience improving operational processes and developing documented procedures (monitoring, DR, incident response, upgrade processes).
  • Leadership, team management, collaboration and communication skills, aligned with expectations for managing technical/engineering teams.  


About Rightmove

Our vision is to give everyone the belief that they can make their move. We aim to make moving simpler, by giving everyone the best place to turn to and return to for access to the tools, expertise, trust and belief to make it happen.

We're home to the UK's largest choice of properties, and are the go-to destination for millions of people planning their next move, reading the latest industry news, or just browsing what's on the market.

Despite this growth, we’ve remained a friendly, supportive place to work, with employee #1 still working here!  We’ve done this by placing the Rightmove Hows at the heart of everything we do. These are the essential values that reflect our culture, and include:

  • We create value…by delivering results and building trust with partners and consumers.
  • We think bigger…by acting with curiosity and setting bold aspirations.
  • We care deeply…by being real, having fun, and valuing diversity.
  • We move together…by being one team - internally collaborative, externally competitive.
  • We make a difference…by focusing on delivering measurable impact.

We believe in careers that open doors and help our team develop by providing an open and inclusive work environment, offering ongoing training opportunities, and supporting charity fundraising events. And with 88% of Rightmovers saying we’re a great place to work, we’re clearly doing something right! 

If all of this has caught your eye, you may well be a Rightmover in the making...

People are the foundation of Rightmove - we’ll help you build a career on it.

What we offer

  • Cash plan for dental, optical and physio treatments.
  • Private Medical Insurance, Pension and Life Insurance, Employee Assistance Plan.
  • 27 days holiday plus two (paid) volunteering days a year to give back, and holiday buy schemes.
  • Hybrid working pattern with 2 days in the office.
  • Contributory stakeholder pension.
  • Life assurance at 4x your basic salary to a spouse, family member or other nominated person in your life.
  • Competitive compensation package.
  • Paid leave for maternity, paternity, adoption & fertility.
  • Travel Loans, Bike to Work scheme, Rental Deposit Loan.
  • Charitable contributions through Payroll Giving and donation matching.
  • Access to deals and discounts on things like travel, electronics, fashion, gym memberships, cinema tickets and more.
As an Equal Opportunity Employer, Rightmove will never discriminate based on age, disability, sex, race, religion or belief, gender reassignment, marriage / civil partnership, pregnancy/maternity or sexual orientation. 

At Rightmove, we believe that a diverse and inclusive workforce leads to better innovation, productivity, and overall success. We are committed to creating a welcoming and inclusive environment for all employees, regardless of their background or identity, and to developing and promoting a diverse culture that reflects the communities we serve.
By applying, you confirm that you are aged 18 or over and that you’ve read and understood our Privacy Policy, which explains how we handle and protect your personal information during the recruitment process.

Top Skills

Cloud-Based Environments
Incident Management
Microservices
Monitoring
Observability
Platform Engineering
Site Reliability Engineering
SLIs
SLOs
SRE Tooling

Rightmove London, England Office

33 Soho Square, London, United Kingdom, W1D 3QU
