OneSignal Logo

OneSignal

Staff Engineer, Platform/ SRE

Posted 14 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in United Kingdom
Senior level
Remote
Hiring Remotely in United Kingdom
Senior level
As a Staff Platform Engineer at OneSignal, you will focus on optimizing performance, automating operations, and ensuring the reliability of services at scale. Your tasks will include operating infrastructure, improving internal services, collaborating with teams, and implementing CI/CD best practices. You'll work with technologies like Kubernetes and Terraform, manage database operations, and contribute to incident response efforts.
The summary above was generated by AI

OneSignal is a leading omnichannel customer engagement solution, powering personalized customer journeys across mobile and web push notifications, in-app messaging, SMS, and email. On a mission to democratize engagement, we enable over a million businesses to keep their users - including readers, fans, players and shoppers - engaged and up to date by delivering billions of messages daily. 

1 in 5 new apps launches using OneSignal! We support companies in 140 countries, including Zynga, USA Today, Bitcoin.com, Eventbrite, Tribune, and many more - from startups and small businesses just getting off the ground to established companies communicating with millions of customers.

We’re Series C, venture-backed by SignalFire, Rakuten Ventures, Y Combinator, HubSpot, and BAM Elevate. We offer remote work as the default option in the United States in California, New York, New Jersey, Oregon, Pennsylvania, Texas, Utah and Washington. As well as in the UK and Singapore - with plans to expand the locations we support in the future. Some roles are hybrid roles and will be listed as such. We have offices in San Mateo, CA and London, UK, and offer flex seating options for employees to work together in-person where we don't have offices. Hiring in Singapore is done in partnership with a local PEO.

OneSignal has a lot of the great tech startup qualities you'd expect, but we don't stop there. Our massive scale and small team, emphasis on healthy life balance and kindness in all our interactions, and focus on ownership and personal growth make OneSignal a uniquely great place to work. 

About The Team:

We have grown rapidly to where we are today serving billions of HTTP requests daily.. We achieved this scale by writing scale-sensitive components in languages like Rust and Go. This potent combination of high performance with efficient resource utilization has given us an incredible competitive edge.

We are hiring a Platform Engineer to help us continue to scale by operating and engineering the future of our infrastructure. We are maintaining 99.95% uptime today, and we are investing to ensure we maintain that as the business continues to grow and as the product evolves.

Your primary task will be software engineering with a focus on infrastructure, operations, and automation. You'll be building systems to run our product, improving internal services, and advising product teams on architecture as it relates to the operability of the service.

The systems you'll be responsible for include all of the services which power our product. This ranges from off-the-shelf services like PostgreSQL, Scylla,  Redis, Kafka, Kubernetes, etc. to our in-house services such as the Rails web app, various Rust backend services, and our high-performance API layer written in Go.

You'll be working with Kubernetes to automate our data center operations and writing operational services to automate database operations. One of the key challenges in this role is to not only understand systems to the point of being able to manually operate by hand but also to understand in sufficient detail to write software systems to automate such operations.

Our blog contains more information about the OneSignal Engineering career ladder, remote-first culture, and our diverse team.

What You'll Do:

  • Optimize and Elevate Performance: Identify bottlenecks in our systems and unleash your creativity to introduce cutting-edge optimizations. You'll have the chance to improve the performance of our databases and evaluate innovative storage technologies that will elevate our infrastructure to new heights.
  • Forge Infrastructure as Code: Take the lead in setting up robust infrastructure and configuration as code using Kubernetes and Terraform. You'll be at the forefront of shaping our foundational architecture, ensuring it’s both resilient and scalable.
  • Drive Observability and Monitoring: Establish and maintain a state-of-the-art observability and monitoring stack. Your insights will enable us to stay ahead of potential issues, ensuring our services remain reliable and performant.
  • Craft the Golden Path for CI/CD: Define and implement best practices for continuous integration and deployment. Your work will streamline the deployment process for our engineering teams, allowing them to roll out new features swiftly and safely.
  • Collaborate Across Teams: Work closely with various engineering teams to architect highly scalable and observable services. Your collaboration will be essential in creating a cohesive and efficient development environment.
  • Be a Key Player in Incident Response: Join the on-call rotation and be a crucial part of maintaining our systems' health. Your expertise will be vital in troubleshooting and resolving issues, ensuring our services always meet the highest standards.

What You'll Bring:

  • At least 8 years of platform experience
  • Experience operating reliable production systems at scale
  • Knowledge of Linux systems internals
  • Desire and ability to automate tasks
  • Experience in managing PostgreSQL for high-scale throughput systems or similar experience with other relatable SQL datastores’.
  • Operational experience deploying and managing Kubernetes
  • Experience working with Cloud Providers (AWS/GCP/Azure)

We value a variety of experiences, so these are not required. It would be an added bonus if you have experience in any of the following:

  • Recently writing Go and/or Rust
  • Working with Layers 1-3 of the OSI networking model
  • Redis, Kafka, etcd, ZooKeeper, nginx, haproxy
Qualities we look for:
  • Friendliness & Empathy
  • Accountability & Collaboration
  • Proactiveness & Urgency
  • Growth Mindset & Love of Learning

In keeping with our beliefs and goals, no employee or applicant will face discrimination/harassment based on: race, color, ancestry, national origin, religion, age, gender, marital domestic partner status, sexual orientation, gender identity, disability status, or veteran status. Above and beyond discrimination/harassment based on 'protected categories,' we also strive to prevent other, subtler forms of inappropriate behavior (e.g., stereotyping) from ever gaining a foothold in our office. Whether blatant or hidden, barriers to success have no place in our workplace.

Applicants with disabilities may be entitled to reasonable accommodation under the terms of the Americans with Disabilities Act and certain state or local laws. A reasonable accommodation is a change in the way things are normally done which will ensure an equal employment opportunity without imposing undue hardship on OneSignal. Please inform us if you need assistance completing any forms or to otherwise participate in the application and/or interview process.

OneSignal collects and processes personal data submitted by job applicants in accordance with our Privacy Policy - including GDPR and CCPA compliance. Please see our privacy notice for job applicants.

Top Skills

Go
Rust

Similar Jobs

14 Days Ago
Remote
8 Locations
Mid level
Mid level
Cloud • Software
As an SRE & Gitops Engineer, you'll automate software operations, enhance infrastructure as code practices, maintain core services at Canonical, and collaborate with development teams to improve products. Responsibilities include troubleshooting, capacity planning, and using observability tools for monitoring and alerting.
Top Skills: Python
14 Days Ago
Remote
8 Locations
Senior level
Senior level
Cloud • Software
As a Senior Site Reliability/GitOps Engineer at Canonical, you'll drive automation and infrastructure as code practices. You'll automate software operations across private and public clouds, support core services, troubleshoot issues, and collaborate with global teams. You'll also provide feedback to improve Canonical products and work alongside talented individuals in a remote-first environment.
Top Skills: KubernetesLinuxPython
8 Minutes Ago
Easy Apply
Remote
28 Locations
Easy Apply
Junior
Junior
Artificial Intelligence • Machine Learning • Natural Language Processing • Conversational AI
The QA Engineer will lead quality assurance efforts for automated tests, working collaboratively with other QA engineers and developers. Responsibilities include maintaining the test infrastructure, creating and automating test plans, improving test coverage, and conducting manual checks when necessary. The role demands teamwork, adherence to quality standards, and the ability to implement effective testing methodologies.
Top Skills: C#

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account