Site Reliability Engineer (SRE) – Sofia, Bulgaria

Sofia, Sofia

Our client is seeking an experienced Site Reliability Engineer (SRE) to join their technology team in Sofia. The role focuses on ensuring the reliability, scalability, and performance of modern cloud-based and on-premises systems, with a strong emphasis on AWS, automation, and infrastructure as code (IaC).

This position blends software engineering and systems engineering to drive resilience, efficiency, and high availability across production platforms.


Key Responsibilities

  • Design, build, and maintain reliable, scalable, and performant systems across AWS and on-premises environments (cloud-first approach).

  • Implement monitoring, alerting, and observability tools to ensure visibility into system health and performance.

  • Automate deployments, configuration management, and operational tasks to increase efficiency and reduce manual effort.

  • Participate in incident response and postmortems, reducing MTTR and strengthening reliability practices.

  • Collaborate with developers to embed reliability and scalability into the software development lifecycle.

  • Oversee capacity planning, performance tuning, and AWS cost optimization.

  • Ensure compliance with security, regulatory, and audit requirements.


Requirements

  • 5+ years in Site Reliability Engineering, DevOps, or related roles.

  • Strong Linux systems administration background.

  • Proficiency in at least one scripting/programming language (Python, Go, Bash, etc.).

  • Deep expertise with AWS services (EC2, ECS/EKS, RDS, S3, IAM, networking).

  • Proven experience with Terraform and configuration management tools (Puppet, Chef, Ansible).

  • Strong knowledge of CI/CD pipelines (Jenkins, GitLab, or similar).

  • Hands-on experience with monitoring and observability tools (Prometheus, Grafana, ELK, Datadog, etc.).

  • Solid understanding of networking, load balancing, and DNS.

  • Excellent troubleshooting and problem-solving skills, especially during high-pressure incidents.


Preferred Skills

  • Experience with Kubernetes or container orchestration systems.

  • Familiarity with SLOs, SLIs, and error budgeting.

  • Previous work in financial systems or other mission-critical environments.


Working Hours

  • Full-time, 40 hours/week

  • Monday to Friday

  • Hybrid model: 3 days in-office, 2 days remote


Benefits

  • Competitive salary plus uncapped quarterly performance bonus

  • Hybrid work model (3 days office, 2 days remote)

  • Additional health insurance

  • Food vouchers and fresh fruit in the office

  • Sports card, fitness center, and game room on-site

  • Company-sponsored sports and team events

  • Budget for professional development (courses, certifications, conferences)

  • Exclusive employee discounts and perks


How to Apply

Send your CV in English. All applications will be treated with strict confidentiality. Only shortlisted candidates will be contacted.

InterContinental Recruiting Ltd.
Recruitment License No. 2087/22.07.2016


InterContinental Recruiting

Please contact us with any questions:

Email: sofia@icrecruiting.eu 
Phone: +359 2 811 1366
Recruitment license from the National Agency of Employment, No. 2087/22.07.2016