xAI Jobs

Site Reliability Engineer (SRE)

xAI

Site Reliability Engineer (SRE)

Reposted 12 Days Ago

Be an Early Applicant

In-Office

London, Greater London, England, GBR

Expert/Leader

In-Office

London, Greater London, England, GBR

Expert/Leader

Responsible for backend services at xAI, focusing on scalability and reliability, requiring expertise in Kubernetes and monitoring technologies.

The summary above was generated by AI

ABOUT xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

ABOUT THE ROLE:

You will work on the team that is responsible for the backend services that power our products such as grok.com and the API. We focus on writing and maintaining highly scalable and reliable services that can efficiently process tens of thousands of queries per second. The services are hosted on a number of Kubernetes clusters (on-prem & cloud).

BASIC QUALIFICATIONS:

Expert knowledge of Kubernetes.
Expert knowledge of continuous deployment systems such as Buildkite and ArgoCD.
Expert knowledge of monitoring technologies such as Prometheus, Grafana, and PagerDuty.
Expert knowledge of infrastructure as code technologies such as Pulumi or Terraform.
Familiarity with a systems programming language like Rust, C++ or Go
Experience with traffic management and HTTP proxies such as nginx and envoy.

xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.

20 Air Street, London, London, United Kingdom

Similar Jobs

Navan

Senior Site Reliability Engineer

4 Days Ago

Easy Apply

Hybrid

London, Greater London, England, GBR

Easy Apply

Senior level

Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation

Seeking a Senior Site Reliability Engineer to design and develop automation and infrastructure services that ensure reliable, scalable systems for business travelers, while collaborating with development and security teams.

Top Skills: AWSCi/CdCloudFormationDatadogGoJavaJenkinsKibanaMavenNewrelicNode.jsPythonSignalfxTerraform

iManage

Senior Site Reliability Engineer

22 Days Ago

Hybrid

London, Greater London, England, GBR

Senior level

Artificial Intelligence • Cloud • Information Technology • Legal Tech • Productivity • Software

The Senior Site Reliability Engineer will automate processes, collaborate across teams, and enhance service resilience in a cloud-native environment, focusing on system scalability and best practices.

Top Skills: AksAzureBashChefDockerEfkElkGoGrafanaJavaKubernetesPowershellPrometheusPythonRubyTerraform

NatWest Group

Senior Site Reliability Engineer

2 Days Ago

In-Office

Senior level

Fintech • Payments • Financial Services

The Senior Site Reliability Engineer ensures reliability and performance of production platforms, leads SRE practices, incident management, and automation using AWS and Kubernetes.

Top Skills: Argo CdAWSGitopsGrafanaKarpenterKubernetesLokiPrometheusTempoTerraform

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

xAI

Site Reliability Engineer (SRE)

xAI London, England Office

Similar Jobs

Senior Site Reliability Engineer

Senior Site Reliability Engineer

Senior Site Reliability Engineer

What you need to know about the London Tech Scene