Cloudflare

Systems Engineer, Metrics and Alerting

Reposted 5 Hours Ago

Hybrid

London, Greater London, England, GBR

Junior

Design, deliver, and operate software for observability; solve scaling issues in Metrics & Alerting; participate in on-call rotation and mentorship.

Available Locations: London or Lisbon

About the Department
Production Engineering is responsible for the world's most reliable, observable, performant, and safe network ecosystem. Our customers rely on our products and systems to safely modify, troubleshoot, and release products without external impact.
Our external customers rely on us to provide seamless and predictable incident, traffic, and policy management, resulting in the fastest and safest network services in the world.
We are accountable for the overall performance of internal- and external-facing services, guiding our product teams to optimal configurations and maximum efficiency. From the moment a packet enters the Cloudflare ecosystem, we know exactly what its expected purpose and behaviour are, and we are capable of detecting and exposing anomalous behaviour.
The Cloudflare network makes it possible to solve challenges at a scale and efficiency that would be impossible for almost any other organization.

About the Team
This role is for the internal Observability Team, which is responsible for the observability platform and stack that make our engineering teams productive. This includes (but is not limited to) areas like metrics, alerting, error tracking, logging, and tracing.
In this role, you can expect to:

Design, deliver, and operate software and a platform that advance Cloudflare's observability capabilities
Solve scaling bottlenecks in critical services in our Metrics & Alerting pipeline
Work on highly distributed and scalable systems
Participate in the constant cycle of knowledge sharing and mentoring
Participate in the global on-call rotation for the services your team owns
Research and introduce cutting-edge technologies
Contribute to open-source

We are a small, well-funded, growing team focused on building an extraordinary company. This is a software engineering/systems engineering role and a superb opportunity to join a high-performing team that supports Cloudflare's mission to help build a better Internet.
You may be a good fit for our team if you have:

A Software Engineering background and proficiency in high-level programming languages (e.g., Go)
Proficiency in data structures and databases such as TSDBs, columnar stores, or related systems
Proficiency in distributed Linux environments
Proficiency in designing high-scale distributed systems
Proficiency in Prometheus, Alertmanager, Thanos
Experience working in a fast, high-growth environment
Experience working in a 24/7/365 service environment
Excellent written and verbal communication skills
Familiarity with internetworking, networking protocols across Layers 2-7 of the OSI model, and BGP
Strong bias for action

Bonus points if you have:

Experience with high-bandwidth transit Internetworking and routing
Passion for code simplicity and performance

Top Skills

Alertmanager

Linux

Prometheus

Thanos

6th Floor, The Riverside Building, County Hall, Belvedere Rd, London, United Kingdom, SE1 7PB
