Recursion Logo

Recursion

Senior AI/HPC Storage Engineer

Posted 16 Hours Ago
Be an Early Applicant
London, Greater London, England
Senior level
London, Greater London, England
Senior level
The Senior AI/HPC Storage Engineer will design, implement, and manage advanced AI/HPC data systems to support drug discovery at Recursion. Responsibilities include optimizing storage infrastructure, automating processes, conducting performance analysis, and leading customer collaboration to enhance data integrity and operational performance.
The summary above was generated by AI

Your work will change lives. Including your own. 


The Impact You'll Make

Recursion is a pioneering TechBio company that leverages AI and machine learning to decode biology and accelerate drug discovery, with data as a key differentiator and value driver. We are seeking a Senior AI/HPC Storage Engineer to join our innovative team. In this role, you will be instrumental in designing, implementing, and managing advanced AI/HPC data systems that propel our groundbreaking drug discovery research.

You will leverage your expertise in infrastructure solutions for Science to ensure the performance, scalability, and reliability of our storage systems. Your work will involve creating and maintaining robust infrastructure, automating processes, and optimizing storage systems to handle massive amounts of data and complex computational workloads, while ensuring high data integrity. In this role:

  • You will be responsible for designing, implementing, testing, maintaining, and optimizing our data storage infrastructure and services, utilizing an Infrastructure as Code approach across both on-premises and public cloud environments.
  • Your leadership and technical expertise will be key in driving innovation across all storage tiers within our AI/HPC infrastructure, ensuring we deliver a scalable and effective data platform to support our mission. 
  • By developing scripts and workflows, you will automate and verify storage infrastructure provisioning and dynamic reconfiguration, enhancing support for our AI/HPC storage environments.
  • Your meticulous attention to detail will be crucial for performance analysis, benchmarking, troubleshooting and fine-tuning of our data storage systems and services, while efficiently managing user tickets.
  • Your role also includes researching, deploying, and optimizing accessibility, performance, security, and data lifecycle management policies.
  • Regular assessments of our storage platforms' health and operational performance against established metrics will be part of your responsibilities, with a focus on meeting and exceeding operational service level objectives.
  • Finally, as a lead in technical communication and customer collaboration, your efforts will ensure high levels of customer satisfaction. This role presents a unique opportunity to make a meaningful impact within our organization and the broader scientific community.

Location:

This position is based at our headquarters in Salt Lake City, Utah, or in our offices in Toronto, Canada, or London, United Kingdom. We may also consider a hybrid working arrangement. We ask that hybrid employees commit to regular on-site visits for routine work and departmental events. 

The Team You'll Join 

As a Senior AI/HPC Storage Engineer, you will be a part of our dedicated HPC Engineering and Operations team, reporting directly to the Director. This dynamic team includes 3 experienced Engineers, and with the addition of this role, you'll be part of an empowered, cross-functional unit.

Our HPC team works in a fast-paced, collaborative environment, handling a broad spectrum of Scientific Infrastructure projects. These range from developing advanced, scalable infrastructure to deploying and managing AI/HPC resources and automating operational processes. The team also plays a crucial role in the curation of our vast data platform, which caters to a diverse set of professionals, including biologists, data scientists, and automation engineers.

We're home to BioHive, the industry's most powerful supercomputer and our HPC team is constantly pushing the boundaries in the field of supercomputing in the TechBio industry. As part of this team, you will collaborate on projects that streamline and optimize our machine learning workflows and scientific computing tasks, driving efficient and transformative solutions. This is a unique opportunity to join a team that thrives on innovation, collaboration, and inclusivity in a role that is pivotal to our mission.


The Experience You'll Need

  • A minimum of 7 years of experience in managing data storage infrastructure, preferably within global BioPharma organizations.
  • In-depth knowledge of distributed/parallel file systems (IBM Storage Scale GPFS), multi-tier file (NAS), hybrid object storage (MinIO), and storage access and data transfer networking protocols.
  • Experience with RDMA-capable high-speed networking.
  • Extensive experience designing, deploying, testing, supporting, and troubleshooting complex Linux-based computing and data storage environments.
  • Python programming and Bash scripting experience. In-depth hands-on experience in provisioning, configuring, and managing infrastructure through modern CI/CD techniques, GitOps, Infrastructure as Code (IaC) and cloud automation principles. 
  • Solid experience with software-defined infrastructure and cloud computing platforms, including Kubernetes, GCP, AWS, and others.
  • Practical knowledge of resource management and job scheduling using Slurm and Kubernetes. Knowledge of container technologies like Apptainer and Docker.
  • Strong verbal and written communication skills for effective documentation and collaboration.
  • Prior experience mentoring, guiding, and cross-training team members.

How You'll be Supported

At Recursion, we're working to solve some of the most meaningful challenges in human health. During onboarding, you'll be introduced to the Recursion Mindset and Recursion OS through a blend of in-person and online resources designed to help you quickly embrace our culture. You'll be paired with an onboarding "Trail Guide" to support you in your first months and connect with colleagues, both in person and remotely, who will guide you through the traditional and Recursion-specific aspects of your role. Additionally, you'll join onboarding events, like Decoding Recursion, to deepen your understanding and integration into the team.

#LI-CP1

At Recursion, we believe that every employee should be compensated fairly. Based on the skill and level of experience required for this role, the estimated current annual base range for this role is:

  • Developing: £92,000
  • Skilled: £101,000
  • Expert: £112,000

To learn more about our level within levels, click here.

You will also be eligible for bonuses and equity compensation + our comprehensive benefits package for United States based candidates. The range displayed on each job posting reflects target ranges for US new hire salaries and is determined by job, level, and market factors.

During the interview selection process, you will connect with a Talent Acquisition Partner who will be your advocate and ally to ensure you receive the appropriate compensation that meets your needs for your skills, experience, and relevant education/training, while also reviewing our very competitive total rewards package.

The Values That We Hope You Share:

  • We Care: We care about our drug candidates, our Recursionauts, their families, each other, our communities, the patients we aim to serve and their loved ones. We also care about our work.
  • We Learn: Learning from the diverse perspectives of our fellow Recursionauts, and from failure, is an essential part of how we make progress.
  • We Deliver: We are unapologetic that our expectations for delivery are extraordinarily high. There is urgency to our existence: we sprint at maximum engagement, making time and space to recover. 
  • Act Boldly with Integrity: No company changes the world or reinvents an industry without being bold. It must be balanced; not by timidity, but by doing the right thing even when no one is looking.
  • We are One Recursion: We operate with a 'company first, team second' mentality. Our success comes from working as one interdisciplinary team.

Recursion spends time and energy connecting every aspect of work to these values. They aren't static, but regularly discussed and questioned because we make decisions rooted in those values in our day-to-day work. You can read more about our values and how we live them every day here.

More About Recursion 

Recursion is a clinical stage TechBio company leading the space by decoding biology to industrialize drug discovery. Enabling its mission is the Recursion OS, a platform built across diverse technologies that continuously expands one of the world's largest proprietary biological and chemical datasets. Recursion leverages sophisticated machine-learning algorithms to distill from its dataset a collection of trillions of searchable relationships across biology and chemistry unconstrained by human bias. By commanding massive experimental scale - up to millions of wet lab experiments weekly - and massive computational scale - owning and operating one of the most powerful supercomputers in the world, Recursion is uniting technology, biology and chemistry to advance the future of medicine.

Recursion is headquartered in Salt Lake City, where it is a founding member of BioHive, the Utah life sciences industry collective. Recursion also has offices in London, Toronto, Montreal and the San Francisco Bay Area. Learn more at www.Recursion.com, or connect on X (formerly Twitter) and LinkedIn.

Recursion is an Equal Opportunity Employer that values diversity and inclusion. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, veteran status, or any other characteristic protected under applicable federal, state, local, or provincial human rights legislation. Recursion welcomes and encourages applications from people with disabilities. Accommodations are available on request for candidates taking part in all aspects of the selection process.
Recruitment & Staffing Agencies: Recursion Pharmaceuticals and its affiliate companies do not accept resumes from any source other than candidates. The submission of resumes by recruitment or staffing agencies to Recursion or its employees is strictly prohibited unless contacted directly by Recursion's internal Talent Acquisition team. Any resume submitted by an agency in the absence of a signed agreement will automatically become the property of Recursion, and Recursion will not owe any referral or other fees. Our team will communicate directly with candidates who are not represented by an agent or intermediary unless otherwise agreed to prior to interviewing for the job.

Top Skills

Bash
Python

Similar Jobs

Be an Early Applicant
3 Hours Ago
London, Greater London, England, GBR
Hybrid
13,000 Employees
Senior level
13,000 Employees
Senior level
Artificial Intelligence • Healthtech • Professional Services • Analytics • Consulting
The role involves driving growth in the Clinical Technology business by managing client accounts and delivering bespoke technology solutions for Clinical functions in Pharma and Biotech. Responsibilities include business development, client relationship management, project delivery, and mentoring team members, while demonstrating deep industry knowledge.
Be an Early Applicant
5 Hours Ago
Birmingham, West Midlands, England, GBR
Hybrid
90,000 Employees
Entry level
90,000 Employees
Entry level
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
As a Cadbury Process Engineer, you will support continuous improvement processes by providing training on CI tools, facilitating data collection, and conducting root-cause analysis to enhance departmental performance. You'll help establish new work standards and train colleagues in a collaborative environment that values employee safety and wellbeing.
Be an Early Applicant
5 Hours Ago
Birmingham, West Midlands, England, GBR
Hybrid
90,000 Employees
Junior
90,000 Employees
Junior
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
The Logistics Project Engineer will support the development of warehousing and logistics solutions, manage project elements according to safety and efficiency standards, and collaborate with stakeholders to optimize processes. Responsibilities include conducting data analysis, creating layouts, and contributing to the engineering strategy.

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account