Blackstone

BXTI, Site Reliability Engineer - Data, Cloud & Developer Experience

Posted Yesterday
In-Office
London, Greater London, England
Expert/Leader
Lead adoption of SRE practices firm-wide, implement and maintain observability tooling, automate operations, manage incident response and on-call rotations, provide performance insights and instrumentation, drive reliability standards, and collaborate on postmortems and continuous improvement.
The summary above was generated by AI

Blackstone is the world’s largest alternative asset manager. We seek to create positive economic impact and long-term value for our investors, the companies we invest in, and the communities in which we work. We do this by using extraordinary people and flexible capital to help companies solve problems. Our $1.1 trillion in assets under management include investment vehicles focused on private equity, real estate, public debt and equity, infrastructure, life sciences, growth equity, opportunistic, non-investment grade credit, real assets and secondary funds, all on a global basis. Further information is available at www.blackstone.com. Follow @blackstone on LinkedIn, X, and Instagram.
 

Blackstone’s Site Reliability Engineering team is responsible for improving the reliability of systems and services to meet the needs of the business. This is achieved through collaboration with the development and engineering teams to leverage SRE practices and principles. You’ll have the opportunity to identify and solve new problems as they arise, deploy and maintain observability systems and pipelines, mature the operations and support of services and platforms, and pursue emerging opportunities for efficiency and business value.

This position involves the selection, implementation, and maintenance of key observability tooling. It requires ongoing evaluation of the firm’s needs in observability, monitoring, alerting, resilience, and recovery. We work alongside service owners on design, implementation, and management of services for continuous improvement. We achieve the requisite reliability of services using clear definitions and measurable targets. We plan for and practice recovery from disaster scenarios and respond in real time to incidents. We guide the postmortem process in order to mitigate risks, prevent future disruptions, and improve the on-call experience. We aim to eliminate manual work, improve operational efficiency, and ensure high-quality outputs in all that we do.

Key Responsibilities:

  • Provide technical leadership in the understanding and adoption of SRE methodologies across the firm
  • Incorporate observability standards into code and deployment pipelines
  • Evolve the SRE standards that are adopted across all teams
  • Partner with colleagues in various roles and reporting lines to improve service reliability and operational efficiency
  • Assist developers and engineers directly and through AI assistants
  • Implement instrumentation and provide comprehensive performance insights to service owners
  • Ensure that monitoring and alerting reflect the reliability of services for users and enable effective on-call operations
  • Implement strategic observability tools and work to control overhead in maintenance and cost
  • Participate in on-call rotations and respond to system incidents to ensure service availability and minimize operational impact
  • Use automation to manage, maintain, and scale SRE systems with minimal human intervention
  • Foster a blameless culture while assisting in postmortem discussions and reporting

Qualifications:

  • Ability to write automation scripts, as well as read and troubleshoot code (Python, C#, TypeScript, etc.)
  • Effective use of coding assistants and chat models (Anthropic, OpenAI)
  • Proficiency with public cloud providers (strong AWS experience required, Azure experience preferred)
  • Experience with configuration as code, infrastructure management, and CI/CD tooling (Terraform, Puppet, GitLab CI)
  • Hands-on experience with Docker and container schedulers, including AWS ECS and EKS
  • Excellent troubleshooting skills for Linux, Windows, and networking
  • Experience with observability tools (Grafana, Prometheus, Splunk, etc.)
  • Comfortable under pressure with incident management and collaborating during postmortems
  • Excellent communication and organizational skills
  • Curiosity and drive to improve systems and processes through a sense of shared ownership


The duties and responsibilities described here are not exhaustive and additional assignments, duties, or responsibilities may be required of this position.  Assignments, duties, and responsibilities may be changed at any time, with or without notice, by Blackstone in its sole discretion.


Blackstone is committed to providing equal employment opportunities to all employees and applicants for employment without regard to race, color, creed, religion, sex, pregnancy, national origin, ancestry, citizenship status, age, marital or partnership status, sexual orientation, gender identity or expression, disability, genetic predisposition, veteran or military status, status as a victim of domestic violence, a sex offense or stalking, or any other class or status in accordance with applicable federal, state and local laws. This policy applies to all terms and conditions of employment, including but not limited to hiring, placement, promotion, termination, transfer, leave of absence, compensation, and training.  All Blackstone employees, including but not limited to recruiting personnel and hiring managers, are required to abide by this policy.

If you need a reasonable accommodation to complete your application, please contact Human Resources at 212-583-5000 (US), +44 (0)20 7451 4000 (EMEA) or +852 3656 8600 (APAC).

Depending on the position, you may be required to obtain certain securities licenses if you are in a client facing role and/or if you are engaged in the following:

  • Attending client meetings where you are discussing Blackstone products and/or client questions;

  • Marketing Blackstone funds to new or existing clients;

  • Supervising or training securities licensed employees;

  • Structuring or creating Blackstone funds/products; and

  • Advising on marketing plans prepared by a sales team or developing and/or contributing information for marketing materials.

  

Note: The above list is not an exhaustive list of activities requiring securities licenses, and there may be roles that require review on a case-by-case basis. Please speak with your Blackstone Recruiting contact with any questions.

Top Skills

Python, C#, TypeScript, Anthropic, OpenAI, AWS, Azure, Terraform, Puppet, GitLab CI, Docker, AWS ECS, AWS EKS, Grafana, Prometheus, Splunk, Linux, Windows, Networking
