SiGMA World

Site Reliability Engineer (SRE)

Reposted 12 Hours Ago
In-Office
Yerevan
Senior level
The Site Reliability Engineer ensures system reliability and performance, develops automation for deployments, manages cloud infrastructure, and introduces AI tooling for enhanced operations.
The summary above was generated by AI

Job Title: Site Reliability Engineer (SRE)

Department: Tech
Location: Cyprus / Serbia / Armenia / India
Employment Type: Full-time


About SiGMA Group

Founded in 2014 and headquartered in Malta, SiGMA Group now employs over 250 professionals across six global offices, including Malta, Cyprus, Serbia, São Paulo, Manila, and India. SiGMA is a global leader in gaming, emerging tech, affiliate marketing, events, and media, best known for its marquee summits like SiGMA Malta and iGaming Academy. The company’s culture champions inclusivity, sustainability, collaboration, and philanthropic impact under its SiGMA Foundation initiative.


About the role 

The Site Reliability Engineer (SRE) is responsible for ensuring the reliability, performance, and scalability of SiGMA Group's event platforms, digital products, and iGaming‑related services. This role blends software engineering, systems engineering, and operational excellence to build resilient systems that support high‑traffic global events.


The SRE will also play a key role in introducing AI‑powered tooling, automation, and monitoring capabilities, as well as supporting AI‑enabled products and services across the organisation.

Key Responsibilities  

Reliability Engineering & System Performance

  • Ensures the availability, performance, and resilience of production systems supporting live events and iGaming platforms.
  • Builds and maintains monitoring, alerting, and observability systems to detect issues before they impact users.
  • Conducts capacity planning, load testing, and performance tuning to support traffic spikes during major events.
  • Leads incident response, root‑cause analysis, and post‑mortem processes.

Automation & Operational Excellence

  • Develops automation for deployments, scaling, configuration management, and routine operational tasks.
  • Implements Infrastructure‑as‑Code (IaC) to standardise and automate environment provisioning.
  • Improves CI/CD pipelines to ensure fast, reliable, and repeatable releases.
  • Reduces manual toil through scripting, tooling, and process optimisation.

AI Tooling & Intelligent Operations

  • Introduces AI‑powered tools to enhance reliability and operational efficiency, including:
    • intelligent anomaly detection
    • AI‑assisted incident prediction
    • automated diagnostics and remediation
    • AI‑enhanced monitoring and log analysis
  • Supports the deployment and scaling of AI‑enabled products and services across event and iGaming platforms.
  • Collaborates with AI and data teams to ensure infrastructure supports model training, inference, and real‑time AI workloads.

Infrastructure & Cloud Engineering

  • Manages cloud infrastructure (compute, storage, networking) with a focus on scalability and cost efficiency.
  • Implements best practices for security, resilience, and compliance across cloud environments.
  • Supports containerisation and orchestration technologies (e.g., Docker, Kubernetes).
  • Ensures infrastructure is optimised for both event‑driven and continuous‑use workloads.

Events & iGaming Platform Support

  • Ensures systems are event‑ready, with robust failover, redundancy, and real‑time monitoring.
  • Supports event operations teams with technical readiness, live‑event monitoring, and rapid issue resolution.
  • Builds systems capable of handling unpredictable traffic patterns common in iGaming and live events.

Security, Compliance & Risk Management

  • Implements secure‑by‑design principles across infrastructure and operations.
  • Ensures compliance with data‑privacy regulations and responsible‑gaming requirements where applicable.
  • Identifies and mitigates operational risks, vulnerabilities, and single points of failure.

Collaboration & Cross‑Functional Support

  • Works closely with engineering, product, data, and platform teams to ensure reliability is embedded throughout the development lifecycle.
  • Provides guidance on best practices for performance, scalability, and operational readiness.
  • Communicates system health, risks, and improvements to stakeholders.

Qualifications 

Key Skills & Competencies

  • Strong proficiency in cloud platforms, Linux systems, and distributed architectures
  • Experience with monitoring tools (e.g., Prometheus, Grafana, Datadog, New Relic)
  • Strong scripting and automation skills (Python, Bash, Go, or similar)
  • Familiarity with AI‑assisted operations and emerging intelligent‑monitoring tools
  • Experience with CI/CD, containerisation, and orchestration
  • Strong problem‑solving and analytical skills
  • Ability to thrive in fast‑paced, event‑driven environments
  • Excellent communication and collaboration skills

Preferred Experience

  • Educated to degree level in a numerate or technical discipline; Master's preferred
  • 5–7+ years of technical experience in SRE, DevOps, platform engineering, or systems engineering
  • 1–2+ years of management or mentorship experience, such as leading incident response, guiding junior engineers, or owning reliability initiatives
  • Experience supporting high‑availability, high‑traffic systems in production
  • Background working with event‑driven architectures or iGaming platforms
  • Proven track record of implementing automation and reliability improvements

Why SiGMA Group?

  • Grow with us - Be part of SiGMA’s global expansion and make your mark.
  • Free iGaming Academy access - Learn the ins and outs of the industry with access to courses.
  • Travel perks - Visit our international offices and attend industry events worldwide.
  • Performance rewards - High performers are recognized and fast-tracked with annual reviews and bi-yearly performance check-ins.
  • Interest-free car loan after probation (T&Cs apply)

Top Skills

AI
Bash
Datadog
Docker
Go
Grafana
Kubernetes
New Relic
Prometheus
Python
