Locus Robotics Jobs

Site Reliability Engineer (UK)

Locus Robotics

Site Reliability Engineer (UK)

Posted Yesterday

Be an Early Applicant

In-Office

London, Greater London, England, GBR

Senior level

In-Office

London, Greater London, England, GBR

Senior level

As a Site Reliability Engineer, you will manage remote devices, ensuring stability and security of the LocusONE platform, develop monitoring solutions, and execute incident responses.

The summary above was generated by AI

Locus Robotics is a global leader in warehouse automation, delivering unmatched flexibility, unlimited throughput, and actionable intelligence to optimize operations. Powered by LocusONE, an AI-driven platform, our advanced autonomous mobile robots seamlessly integrate into existing warehouse environments to enhance efficiency, reduce costs, and scale operations with ease. Trusted by over 150 industry-leading retail, healthcare, 3PL, and industrial brands across more than 350 sites worldwide, Locus enables warehouse operators to achieve rapid ROI, minimize labor costs, and continuously improve productivity. Our industry-first Robots-as-a-Service (RaaS) model ensures ongoing innovation, scalability, and cost-effectiveness without the burden of significant capital investments, with proven capabilities across diverse workflows, from picking and replenishment to sorting and pack-out. Locus Robotics empowers businesses to meet peak demands and adapt to ever-changing operational needs.

Locus Robotics is seeking a Site Reliability Engineer (SRE) with a specialized focus on Remote Device Management. As a core member of our reliability team, you will ensure the stability, security, and scalability of the LocusONE platform supporting our growing fleet of Autonomous Mobile Robots (AMRs), peripherals, and reporting devices. You will bridge the gap between software development and field operations, using Linux expertise and Mobile Device Management (MDM) tools to manage thousands of edge devices globally.

This is a remote role.

Responsibilities

Fleet Management at Scale: Design, implement, and maintain robust and secure device management strategies for remote devices using Unified Endpoint Management (UEM), MDM solutions, and orchestration tools.
Reliability & Monitoring: Develop and manage observability pipelines to track device health, connectivity, and performance metrics across diverse warehouse environments.
OTA & Lifecycle Management: Own the end-to-end lifecycle of device software, including secure Over-the-Air (OTA) firmware updates, rollback strategies, and OS hardening.
Incident Response: Participate in on-call rotations to troubleshoot complex system failures, performing root cause analysis (RCA) to drive long-term reliability improvements.
Self-Healing Infrastructure: Develop automated remediation scripts that detect and fix common edge issues such as hung scanning processes or display driver freezes without manual intervention.
Zero-Touch Scalability: Architect and maintain remote provisioning and management workflows for a global fleet of Linux, iPads, and Android devices using secure remote management strategies.
Secure Remote Access: Implement and manage secure remote access protocols such as SSH, VPNs, and private APNs to enable out-of-band troubleshooting and real-time device control without physical site visits.
SLO/SLI Frameworks: Define and enforce Service Level Objectives (SLOs) and Service Level Indicators (SLIs) for device availability, connectivity, and peripheral performance.
Error Budget Management: Use error budgets to balance the pace of innovation with fleet reliability, ensuring data-driven decisions for feature releases versus stability fixes.
Security Governance: Align fleet operations with industry standards such as the NIST Cybersecurity Framework (CSF), ISO/IEC 27001, and CIS Controls.
Vulnerability Management: Drive continuous monitoring and automated patching schedules to mitigate risks and ensure regulatory compliance across all managed device platforms.

Qualifications

Master’s degree in Computer Science, Software Engineering, Systems Engineering, Robotics, or equivalent experience.
7+ years of experience: Proven track record in SRE, DevOps, or Systems Engineering with a focus on IoT, remote devices, or distributed edge hardware.
Deep proficiency in Linux/Unix systems (Debian/Ubuntu preferred), including kernel tuning, shell scripting (Python, Bash), and networking protocols (TCP/IP, MQTT, CoAP, HTTPS/REST, DNS).
Knowledge of security best practices for IoT and remote devices, including secure boot, encryption at rest/in transit, and certificate management.
Expert proficiency in Python, Rust, or Go-based configuration management (Ansible/Terraform) for fleet-wide deployments.
Strong understanding of SRE principles, including SLIs/SLOs, error budgets, and automation over manual "toil."
Experience with enterprise MDM or Unified Endpoint Management (UEM) platforms (such as Jamf Pro, Microsoft Intune, FleetDM, Mosyle, Esper, 42Gears SureMDM, SOTI MobiControl, VMware Workspace ONE, or Headwind MDM).
Experience with open-source device management solutions is a plus (such as FleetDM, Mender.io, Balena, Micromdm, Memfault, or RAUC).
Experience with building Linux images and containers (with tools such as Yocto, PTXdist, ubuntu-image, Packer, Debian live-build, debootstrap).
Experience with Linux packaging formats (such as deb, snap, flatpak, nixpkg).
Hands-on experience troubleshooting hardware interfaces, specifically USB/Bluetooth barcode scanners and industrial touchscreen displays.
Experience configuring and locking down browsers or native apps into dedicated kiosk environments on both Linux and mobile OSs.
Hands-on experience with cloud infrastructure (AWS or Azure) and containerization technologies like Docker and Kubernetes.
Experience with CI/CD pipelines tailored for edge device deployment.
Experience with ROS (Robot Operating System) or managing hardware-in-the-loop systems is a plus.
Background in warehouse automation, logistics, or industrial IoT.

Locus Robotics is an Equal Opportunity Employer.

The expected base salary range for this role is £100k – £140k annually, based on external market data, plus bonus and equity. Actual offers will depend on factors such as the candidate’s experience, education, training, key or critical skills, geographic location, and current market and business conditions

Application Fraud Detection Notice: To help maintain a fair and secure hiring process, Locus Robotics may use AI-assisted and other automated tools to detect suspected fraud, misrepresentation, or misuse of the application process. Hiring decisions are not made solely by automated means unless otherwise disclosed where required by law

Similar Jobs

NatWest Group

Senior Site Reliability Engineer

An Hour Ago

In-Office

Senior level

Fintech • Payments • Financial Services

The Senior Site Reliability Engineer ensures reliability and performance of production platforms, leads SRE practices, incident management, and automation using AWS and Kubernetes.

Top Skills: Argo CdAWSGitopsGrafanaKarpenterKubernetesLokiPrometheusTempoTerraform

Experian

Senior Site Reliability Engineer

22 Days Ago

Hybrid

Senior level

Big Data • Marketing Tech • Analytics

The Senior Site Reliability Engineer will enhance system reliability, manage AWS infrastructure, automate processes, and respond to incidents while collaborating with teams to improve overall system performance.

Top Skills: AWSBashCloudFormationCloudwatchDynatraceGrafanaPrometheusPythonSplunkTerraform

duvo.ai

Site Reliability Engineer

24 Days Ago

In-Office or Remote

London, Greater London, England, GBR

Mid level

Artificial Intelligence • Software • Automation

The Site Reliability Engineer will manage platform reliability and infrastructure, ensuring security and observability while automating deployments. Responsibilities include incident response, monitoring, and improving reliability practices.

Top Skills: DockerGCPGrafanaLokiOpentelemetryPostgresPrometheusPythonRedisTerraformTypescript

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.