Principal Site Reliability Engineer Ι

IT & Infrastructure

Athens, Greece

We are Kaizen Gaming

Kaizen Gaming, the team powering Betano, is one of the biggest GameTech companies in the world, operating in 19 markets. We always aim to leverage cutting-edge technology, providing the best experience to our millions of customers who trust us for their entertainment.

We are a diverse team of more than 2.700 Kaizeners, from 40+ nationalities spreading across 3 continents. 

Our #oneteam is proud to be among the Best Workplaces in Europe and certified Great Place to Work across our offices. Here, there’ll be no average day for you. Ready to Press Play on Potential?

Let's Start With The Role

We are looking for a Principal Site Reliability Engineer I who will be a subject matter expert, mastering specific infrastructure domains to drive technical excellence across the organization. In this capacity, you will act as the primary escalation point for resolving high-complexity problems while proactively defining backlog items to maintain and modernize your areas of responsibility.

As a Principal Site Reliability Engineer,  you will:

Work closely with our SRE teams, providing technical mentorship to help them optimally refine tasks and improve implementations. Furthermore, you will collaborate with the Principal Engineer 2 to effectively disseminate architectural best practices and engineering standards, ensuring the team remains aligned with the organization's strategic vision.

What You'll Bring 

  • 3-5 years of experience building and maintaining scalable production environments in a Senior SRE, Tech Lead, or Architect capacity.
  • Expert-level knowledge in at least 50% of the following tools and domains, with a senior-level understanding of the rest:
    • Observability/Monitoring/Logging: Prometheus, Grafana, Graylog, Zabbix, Instana
    • Brokers: RabbitMQ, Kafka
    • Database Infra: Redis, Mongo, CockroachDB
    • Networking & Traffic Management: Cloudflare, HAProxy, Varnish, Nginx
    • Platform Infrastructure:Kubernetes (k8s), OpenShift, ESXi
    • GitOps: ArgoCD
    • CI/CD: GitLab, RedHat Tower (AWX/Ansible)
    • Cloud & IaC:Ansible, Terraform, Azure
  • Strong scripting skills in languages such as Bash, PowerShell, Python, or Go.
  • Solid programming skills in either Java or .NET.
  • Demonstrated ability to utilize AI tools (e.g., AI code assistants, AIOps platforms) to increase productivity, quality, and reliability.
  • Ability to resolve complex technical challenges optimally, taking into account time, capacity, and budget constraints.
  • Proven ability to work effectively as part of a distributed, international team.
  • An easy-going, flexible personality with a genuine eagerness to learn new technologies and work outside of your comfort zone.
  • A "people-first" and continuous improvement mentality, always looking for ways to make things better for our teams and our customers.
  • A strong understanding of Scrum/Agile methodologies and principles.