Waabi Logo

Waabi

Senior / Staff Software Engineer (Observability / SRE)

Posted 5 Days Ago
Remote or Hybrid
3 Locations
Senior level
Remote or Hybrid
3 Locations
Senior level
Design and develop Waabi's observability stack, optimize performance, build automation tooling, and support application requirements while leading projects and mentoring teams.
The summary above was generated by AI
Waabi, founded by AI visionary Raquel Urtasun, is the leader in Physical AI. With a world-class team, we're unlocking the next era of autonomous transportation with technology that's powering commercial autonomous trucks and robotaxis. Waabi is backed by and partners with world leaders in AI, automotive, logistics, and deep tech.

With offices in Toronto, San Francisco, Dallas, and Pittsburgh, Waabi is growing quickly and looking for diverse, innovative and collaborative candidates who want to impact the world in a positive way. To learn more visit: www.waabi.ai

We are constantly expanding our compute footprint in the cloud, and need to expand our observability and monitoring capabilities alongside. We currently use the built in AWS monitoring tools, but this doesn’t work with our on-premise stuff and aren’t user friendly. There are a number of options out there we could deploy, but all of them require some attention and work. Even if we go a vendored route, we still need at least one person to own this area. 

You Will..
- Design and lead the architecture and development of Waabi’s monitoring and observability stack, used to monitor the health and performance of cloud and on-prem environments.
- Develop and extend workloads and benchmarks (compute, storage, network, ML/AI) and integrate stress, chaos, and regression tests to validate hardware and platform choices.
- Analyze and optimize end-to-end performance across hardware, firmware, Linux kernel, runtimes, and distributed services using advanced profiling tools (perf, eBPF, flamegraphs, tracing frameworks).
- Build automation and observability tooling (Go/Python/Java, Kubernetes/Docker) for CI/CD-based performance regression detection, telemetry, alerting, and anomaly detection.
- Work with client teams to support their applications’ observability requirements.
- Influence system architecture and tooling decisions that improve how Waabi builds, monitors, and scales its infrastructure.
- Drive execution and quality, writing design docs, setting milestones, mentoring ICs, and communicating insights and results to stakeholders and leadership.

Qualifications:
- 5+ years software engineering or systems/performance engineering experience (BS in CS/EE or related), with demonstrated end-to-end ownership of complex projects.
- Proficient in at least one of: Python, Rust, C/C++; strong CS fundamentals and system design skills.
- Hands-on with Linux internals (CPU scheduling, memory, I/O, networking) and perf tooling (perf, eBPF, flamegraphs, tracing frameworks).
- Experience with Kubernetes, microservices, and distributed systems; comfort building production services and pipelines.
- Proven track record of clear communication, writing design docs, and leading cross-functional efforts.

Bonus:
- Experience deploying and managing observability platforms (OpenTelemetry, Grafana OSS).
- Performance tuning for databases/streaming/batch/ML platforms; GPU/xPU or Arm performance exposure.
- Experience tuning stream processing, batch or ML platforms (e.g. Argo Workflows, PyTorch).
- Familiarity with microservices debugging and distributed tracing (OpenTelemetry, Prometheus).

The US yearly salary range for this role is: $148,000 - $249,000 USD in addition to competitive perks & benefits. Waabi (US) Inc.’s yearly salary ranges are determined based on several factors in accordance with the Company’s compensation practices. The salary base range is reflective of the minimum and maximum target for new hire salaries for the position across all US locations.  Note: The Company provides additional compensation for employees in this role, including equity incentive awards and an annual performance bonus.

Perks/Benefits:
- Competitive compensation and equity awards.
- Health and Wellness benefits encompassing Medical, Dental and Vision coverage (for full-time employees only).
- Unlimited Vacation.
- Flexible hours and Work from Home support.
- Daily drinks, snacks and catered meals (when in office).
- Regularly scheduled team building activities and social events both on-site, off-site & virtually.
- As we grow, this list continues to evolve! 

Waabi is a technology start-up building technologies to transform the way the world moves. Join our talented team to be a part of the future and to make an impact!

Waabi is an equal opportunity employer. We celebrate diversity and are committed to creating a supportive, inclusive, and accessible workplace for all our employees. We seek applicants of all backgrounds and identities, across race, color, ethnicity, national origin or ancestry, age, citizenship, religion, sex, sexual orientation, gender identity or expression, military or veteran status, marital status, pregnancy or parental status, caregiver status, disability, or any other characteristic protected by law. We make workplace accommodations for qualified individuals with disabilities as required by applicable law. If reasonable accommodation is needed to participate in the job application or interview process please let our recruiting team know.

Top Skills

AWS
C/C++
Docker
Go
Grafana
Java
Kubernetes
Opentelemetry
Python
Rust

Similar Jobs

2 Hours Ago
Remote or Hybrid
United States
Mid level
Mid level
Fintech • Information Technology • Insurance • Financial Services • Big Data Analytics
Provide production support for Salesforce CRM and workflow platforms (Bizflow/Pega), manage incidents end-to-end, perform root cause analysis, monitor jobs and integrations, support releases and operational readiness, coordinate across teams, and drive continuous improvements to reduce incidents and improve platform stability.
Top Skills: Salesforce,Bizflow,Pega,Salesforce Service Cloud,Splunk,Apis,Deployment Pipelines
4 Hours Ago
Remote
United States
Senior level
Senior level
Big Data • Information Technology • Software • Analytics • Energy
Drive enterprise sales in power and renewables by building C-suite relationships, leading consultative pursuits, aligning Enverus SaaS solutions to customer strategy, managing pipeline (3.5x quota), creating business cases/ROI, closing deals, and maintaining sales data and reports.
Top Skills: AccessExcelMicrosoft WordOutlookPowerPointSaaS
8 Hours Ago
Remote or Hybrid
Austin, TX, USA
Senior level
Senior level
Cloud • Software
Lead product strategy for AI/ML features on ThousandEyes, translate customer needs into model-driven capabilities, collaborate with engineering and research, design AI-first UX, make product trade-offs, and operationalize experiments at scale.
Top Skills: Agentic AiAIMachine LearningNetwork TelemetryThousandeyesTransformers

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account