
Senior Site Reliability Engineer
Join us, and be part of a team that celebrates your unique work style. With flexible work options, a supportive team, and rewards that reflect your value, you can focus on what matters most – driving your growth, while fuelling ours. Our commitment to respect, transparency, and simplicity means you can trust us to always choose to do the right thing. As a trusted partner for purpose-built AI and intelligent automation, we solve highly complex problems for our enterprise customers and put their information to work to transform the way they do business. Over 10,000 customers trust us, including many Fortune 500 companies. You will help further develop a portfolio that already includes clients such as DHL, Johnson & Johnson, FDA, DMV, PwC, KeyBank, Spotify, and H&R BLOCK.
Responsibilities:
- Co-own critical production service designs to ensure high reliability is achievable and measurable
- Drive reliability and observability improvements in the services within the engineering verticals
- Use monitoring and telemetry data to help teams make informed decisions on where reliability challenges may exist, and help design and build solutions to address them
- Build and improve internal tools and automation software to make maintaining production services easier and safer
- Lead reliability-focused practices such as Failure Analysis, Load and Capacity Planning, Service Reviews, Architecture Designs, Incident Postmortems, and others
- Develop Infrastructure as Code
- Build SRE dashboards from SLIs to measure SLO adherence
- Define (from design to implementation details) necessary auto-healing and fault-tolerant systems
- Act as the point of contact for production application issues, working closely with engineering leadership
Requirements:
- 4+ years of experience in an Infrastructure, SRE, DevOps, or CloudOps role
- Proven experience in diagnosing and resolving indexing issues in relational databases (e.g., PostgreSQL, MySQL, SQL Server, Oracle)
- Strong understanding of query optimisation, execution plans, and database internals
- Proficiency with database performance monitoring tools (e.g., pg_stat_statements, Percona Toolkit, SQL Profiler)
- Experience with Terraform, Ansible, or a similar infrastructure-as-code tool
- Experience with Azure
- Experience with cloud-performant microservices and event-driven architectures
- Experience with Kubernetes administration is an added advantage
- Understanding of information security concepts and terminology
- Distributed monitoring experience: logging, metrics, tracing, etc.
- Strong knowledge of software development methodologies and a passion for creating high-standard tool sets for infrastructure-as-code
- Ability to analyse problems quickly and find suitable solutions based on available resources
- A proactive and open-minded individual with a clear client focus and a structured approach
Here are some of our local benefits:
- Work from home, remotely, or hybrid
- Partial compensation for glasses and lenses
- Private health insurance
- Volunteering Time Off (2 days/year)
- SZÉP Card for recreational activities
- 3 extra days/month for 'sick leave' without a doctor's visit
- Flexible working hours
Job offer details:
- Company: HAYS Hungary Kft.
- Location: Budapest
- Employment type: Full-time
- Added: 10. 8. 2025