Faptic Technology Logo

Faptic Technology

Site Reliability Engineer

Job Posted 3 Days Ago Posted 3 Days Ago
Be an Early Applicant
United Kingdom
Senior level
United Kingdom
Senior level
Manage Azure cloud services and software development, ensuring system reliability and performance through monitoring, automation, and incident response. Drive improvements in service delivery and ensure compliance with security policies.
The summary above was generated by AI

Description

Faptic Technology is a leading provider of IT consulting and managed services, specializing in Azure cloud solutions, software development, and site reliability engineering (SRE). We partner with enterprises to optimize their IT operations, ensuring scalability, reliability, and innovation in the cloud. 

As part of our growth in managed cloud services, we are looking for an Engagement Manager to oversee Azure cloud services and software development services. You will be responsible for managing service delivery, coordinating technical teams, and ensuring a seamless experience for customers—both directly and via strategic partners. 


Who We Are 

At Faptic, we thrive on solving complex challenges. By blending design, engineering, and analytics, we help organizations harness the power of their data to create innovative digital experiences and shape their future. 

We are looking for an Azure Site Reliability Engineer (SRE) with proven expertise in Azure cloud platforms and cloud native software development to join our dynamic team. 
 

Your Role at Faptic 

We are seeking a highly skilled Azure Site Reliability Engineer (SRE) to join our team on a part-time, B2B basis. The ideal candidate will have expertise in managing cloud infrastructure, monitoring services, and ensuring system reliability using Microsoft Azure. This role requires a deep understanding of Terraform, Azure DevOps (CI/CD), and Azure monitoring services to optimize performance and maintain system availability. A strong focus on service operations, incident response, proactive monitoring, disaster recovery, and service reporting is essential. 

Key Responsibilities: 

  • Apply Site Reliability Engineering (SRE) principles to ensure system reliability and performance. 
  • Implement log management and diagnostics using Azure Monitor, Application Insights, and Log Analytics. 
  • Configure and monitor firewall services. 
  • Ensure the reliable operation and monitoring of Azure-based services. 
  • Utilize Terraform to automate infrastructure provisioning and management. 
  • Develop, configure, and manage CI/CD pipelines using Azure DevOps. 
  • Set up and manage alerts to proactively detect and mitigate system issues. 
  • Conduct incident response, root cause analysis, and implement long-term fixes to enhance system stability. 
  • Collaborate with teams to improve observability and system health metrics. 
  • Establish and maintain incident management processes to minimize downtime. 
  • Track and report service uptime, performance, and quality metrics. 
  • Ensure compliance with security and governance policies within Azure environments. 
  • Optimize cost management strategies across Azure resources. 
  • Prepare, test, and execute disaster recovery plans to ensure business continuity. 
     
Requirements
  • Microsoft certification at an intermediate level or above (e.g., Microsoft Certified: Azure Administrator Associate, Azure DevOps Engineer Expert, or Azure Solutions Architect Expert). 
  • Hands-on experience with Terraform for infrastructure automation. 
  • Strong expertise in Azure DevOps for CI/CD pipeline management. 
  • Proficiency in Azure Monitor, Log Analytics, and Application Insights. 
  • Experience setting up alerts and monitoring solutions for proactive issue resolution. 
  • Strong background in incident response and troubleshooting in cloud environments. 
  • Exposure to Palo Alto services and systems. 
  • Solid understanding of cloud security best practices and networking in Azure. 
  • Familiarity with PowerShell, Bash, or Python for automation and scripting tasks. 
  • Experience working in an on-call rotation for incident response. 
  • Ability to generate and analyze service reports on uptime, performance, and reliability. 
  • Experience in disaster recovery planning, testing, and execution to ensure high availability. 
  • Preferred Qualifications: 
  • Experience with configuration management tools (Ansible, Chef, or Puppet). 
  • Knowledge of alternative cloud platforms. 
  • Experience with Azure Functions, Logic Apps, and Event Grid. 

Top Skills

Application Insights
Azure
Azure Devops
Azure Monitor
Bash
Log Analytics
Powershell
Python
Terraform

Faptic Technology London, England Office

London, United Kingdom

Similar Jobs

8 Days Ago
Hybrid
Bournemouth, Dorset, England, GBR
Senior level
Senior level
Financial Services
The Sr Lead Site Reliability Engineer will optimize performance of applications in cloud environments, mentor engineers, and enhance system reliability.
Top Skills: BlazemeterDatadogDynatraceGrafanaJmeterPrometheusSplunk
8 Days Ago
London, Greater London, England, GBR
Mid level
Mid level
Information Technology • Software • Financial Services • Big Data Analytics
Site Reliability Engineers at Citadel ensure application reliability and performance by automating tasks, managing incidents, and improving engineering solutions.
Top Skills: Ci/CdCSSJavaScriptPythonReactSQL
6 Days Ago
Hybrid
Bournemouth, Dorset, England, GBR
Mid level
Mid level
Financial Services
As a Site Reliability Engineer III, you'll optimize applications and infrastructure, collaborate with software engineers, implement reliability practices, and support project goals.
Top Skills: .NetDatadogDockerDynatraceEcsGitlabGrafanaJavaJenkinsKubernetesPrometheusPythonSplunkSpring BootTerraform

What you need to know about the London Tech Scene

London isn't just a hub for established businesses; it's also a nursery for innovation. Boasting one of the most recognized fintech ecosystems in Europe, attracting billions in investments each year, London's success has made it a go-to destination for startups looking to make their mark. Top U.K. companies like Hoptin, Moneybox and Marshmallow have already made the city their base — yet fintech is just the beginning. From healthtech to renewable energy to cybersecurity and beyond, the city's startups are breaking new ground across a range of industries.
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account