Docker, Inc

Principal Software Engineer, Docker Agents (London)

Posted 5 Days Ago
In-Office
London, Greater London, England
Expert/Leader
The Principal Software Engineer will lead the development of containerized AI agents, guide architectural decisions, and enhance the cagent open-source project while mentoring team members.
The summary above was generated by AI

At Docker, we make app development easier so developers can focus on what matters. Our remote-first team spans the globe, united by a passion for innovation and great developer experiences. With over 20 million monthly users and 20 billion image pulls, Docker is the #1 tool for building, sharing, and running apps—trusted by startups and Fortune 100s alike. We’re growing fast and just getting started. Come join us for a whale of a ride!

We are looking for a Principal Software Engineer (Docker Agents) to join Docker’s AI engineering team to build the future of containerized AI agents. Docker containers are the perfect vehicle to host and run AI agents—providing isolation, portability, and reproducibility. You’ll be working on cagent, our open-source project (https://github.com/docker/cagent), and expanding on it to enable developers to build, deploy, and scale intelligent agents using Docker’s container technology.

This is a greenfield opportunity to shape how developers leverage containers for AI agents at massive scale. You’ll define the technical vision, lead architecture decisions, and partner with engineers and leaders across Docker to bring containerized agent capabilities into Docker’s developer experience.

Responsibilities
  • Technical Leadership & Architecture: Define and drive the long-term technical strategy for Docker’s containerized agent platform, including core primitives, APIs, and extensibility patterns

  • Build Containerized Agent Systems: Design and implement systems that leverage Docker containers as the ideal runtime for AI agents, ensuring isolation, scalability, and portability

  • Expand cagent: Maintain and evolve the open-source cagent project, adding new capabilities for containerized agent deployment, orchestration, and lifecycle management

  • Agent Runtime Development: Build robust infrastructure for packaging, deploying, and managing agents in containers across local and cloud environments

  • Evaluation & Testing: Define evaluation frameworks to measure agent quality, reliability, and production readiness, as well as the deployment effectiveness of containerized runtimes

  • Reliability & Operability: Establish standards for observability, performance, and operational excellence; lead critical production decision-making and incident learnings as needed

  • Rapid Prototyping: Iterate quickly on new agent capabilities and deployment patterns, moving from concept to production efficiently

  • Open Source Community: Engage with the cagent community, review contributions, and help grow the ecosystem

  • Cross-functional Collaboration: Lead cross-functional technical discussions and influence architectural decisions across Docker’s AI initiatives (including sister teams and platform efforts)

  • Mentorship & Enablement: Mentor senior engineers, raise the bar through design reviews, and accelerate team execution through clear technical direction and coaching

Qualifications
  • 10+ years of software engineering experience, including 3+ years in technical leadership roles (Staff/Principal level or equivalent scope)

  • Go Expertise: Strong proficiency in Go, Docker’s primary language for backend systems (this is absolutely required)

  • AI/ML Knowledge: Practical experience with large language models (LLMs) and agent development patterns

  • System Architecture: Proven ability to design scalable, distributed systems in production environments

  • Container Technology: Deep understanding of Docker, containerization best practices, and container orchestration

  • Cloud/Platform Depth: Experience building and operating platform services with strong foundations in observability, CI/CD, and security principles

  • Operational Excellence: Experience operating and evolving high-availability production systems with a focus on reliability and performance

  • Influence & Communication: Exceptional communication skills and ability to influence across technical and business domains

  • AI Frameworks: Experience with CrewAI, AGNO, ADK, LangChain/LangGraph or similar AI orchestration frameworks (preferred)

  • Python Proficiency: Experience with Python for AI prototyping and tooling (preferred)

  • Experience with Kubernetes or container orchestration platforms (preferred)

  • Open source contributions and community engagement (preferred)

  • Experience with agent evaluation, reliability, and observability techniques (preferred)

What to Expect

First 30 days
  • Integrate into our AI engineering team building containerized agent infrastructure

  • Deep dive into cagent’s architecture, project roadmap, and the developer problems we’re solving

  • Identify the highest-leverage architectural and execution risks/opportunities; align with stakeholders on priorities

  • Contribute initial improvements to cagent and the containerized agent runtime foundations

First 90 days
  • Lead significant platform features or architectural improvements to cagent and our containerized agent ecosystem

  • Establish (or materially improve) technical standards for evaluation, reliability, and operability of agent systems

  • Drive alignment across internal teams on APIs, integration points, and a cohesive developer experience

  • Mentor engineers through design reviews and help accelerate onboarding and execution

One-Year Outlook
  • Drive major architectural decisions for our containerized agent platform that will impact millions of Docker users

  • Shape the long-term technical vision and execution plan for Docker’s agent ecosystem (open-source and product surfaces)

  • Establish repeatable engineering practices for quality, performance, and operational excellence in agent systems

  • Lead initiatives to expand containerized agent capabilities for enterprise use cases and broader platform integrations

  • Grow the team’s technical capabilities through mentorship, strategy, and pragmatic delivery


Docker does not offer visa sponsorship for this role.

We use Covey as part of our hiring and/or promotional process for jobs in NYC, and certain features may qualify it as an AEDT. As part of the evaluation process, we provide Covey with job requirements and candidate-submitted applications. We began using Covey Scout for Inbound on April 13, 2024.

Please see the independent bias audit report covering our use of Covey here.

Perks

  • Freedom & flexibility; fit your work around your life

  • Designated quarterly Whaleness Days plus end of year Whaleness break

  • Home office setup; we want you comfortable while you work

  • 16 weeks of paid Parental leave

  • Technology stipend equivalent to $100 net/month

  • PTO plan that encourages you to take time to do the things you enjoy

  • Training stipend for conferences, courses and classes

  • Equity; we are a growing start-up and want all employees to have a share in the success of the company

  • Docker Swag

  • Medical benefits, retirement and holidays vary by country

  • Remote-first culture, with offices in Seattle and Paris

Docker embraces diversity and equal opportunity. We are committed to building a team that represents a variety of backgrounds, perspectives, and skills. The more inclusive we are, the better our company will be.

Top Skills

AI Frameworks
Cloud Services
Docker
Go
Kubernetes
Python


