C3 AI Logo

C3 AI

Site Reliability Engineer

Posted 2 Days Ago
Be an Early Applicant
London, Greater London, England
Mid level
London, Greater London, England
Mid level
The Site Reliability Engineer will maximize system uptime, establish monitoring and alerting systems, solve complex service issues, and automate processes to improve deployment cycles. They will work with cross-functional teams to enhance platform support and create new designs and standards.
The summary above was generated by AI

C3.ai, Inc. (NYSE:AI) is a leading Enterprise AI software provider for accelerating digital transformation. The proven C3 AI Platform provides comprehensive services to build enterprise-scale AI applications more efficiently and cost-effectively than alternative approaches. The C3 AI Platform supports the value chain in any industry with prebuilt, configurable, high-value AI applications for reliability, fraud detection, sensor network health, supply network optimization, energy management, anti-money laundering, and customer engagement. Learn more at: C3 AI

We are looking for a Site Reliability Engineer to join our team in London.

Responsibilities:

  • Maximize system uptime and availability, ensuring functional and performance SLAs.
  • Establish end-to-end monitoring and alerting on all critical aspects.
  • Solve complex problems for critical services and build automation to prevent problem recurrence.
  • Influence and create new designs, architectures, standards, and methods for supporting the platform.
  • Initiate and lead scripting and automation to streamline system updates and upgrades.
  • Set up critical infrastructure, tools, and framework to streamline the deployment cycle.
  • Work cross-functionally with Services and Engineering teams.

Qualifications:

  • Demonstrated experience in deploying, managing, and operating scalable and fault-tolerant Linux/Kubernetes/JVM-based infrastructure in AWS, GCP, and other public clouds.
  • Expertise in Linux Operating Systems, Networking, and Database concepts.
  • Experience with Cassandra (or another NoSQL alternative).
  • Expertise in cloud providers, such as Amazon Web Services, Azure, and GCP.
  • Experience with configuration management systems such as Ansible or Puppet.
  • Experience in Ruby or Python; to automate and monitor systems.
  • Excellent problem-solving, critical thinking, and communication skills.
  • Experience supporting as a DevOps or sys admin for commercial SaaS solutions.
  • BS or MS in Computer Science, related field, or equivalent professional experience.

C3 AI provides excellent benefits and a competitive compensation package.

C3 AI is proud to be an Equal Opportunity and Affirmative Action Employer. We do not discriminate on the basis of any legally protected characteristics, including disabled and veteran status. 

Top Skills

AWS
Cassandra
GCP
Jvm
Kubernetes
Linux
Python
Ruby

Similar Jobs

10 Hours Ago
London, Greater London, England, GBR
1,300 Employees
Mid level
1,300 Employees
Mid level
HR Tech • Software • Travel
As a Site Reliability Engineer (SRE) at TravelPerk, you will design and maintain cloud infrastructure, monitor system performance and reliability, improve automation processes, and collaborate with development teams to enhance application scalability while participating in on-call rotations to resolve production issues.
Be an Early Applicant
2 Days Ago
Bournemouth, Dorset, England, GBR
Hybrid
289,097 Employees
Senior level
289,097 Employees
Senior level
Financial Services
As a Principal Site Reliability Engineer at JPMorgan Chase, you'll define non-functional requirements and availability targets for services, ensuring their integration during design and testing phases. You'll mentor other engineers and implement observability designs for complex systems, contributing significantly to site reliability and the firm's technology strategy.
Be an Early Applicant
2 Days Ago
London, Greater London, England, GBR
Hybrid
289,097 Employees
Senior level
289,097 Employees
Senior level
Financial Services
As a Site Reliability Engineer III, you will enhance the reliability and performance of mission-critical applications through continuous integration and delivery practices, technical observability, and collaborative problem-solving in a cloud environment. You'll mentor others, develop infrastructure as code, and promote best practices in site reliability engineering.

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account