Senior Site Reliability Engineer

Posted 4 days 2 hours ago by IT Human Resources

623,00 € Daily

Contract

Not Specified

Other

Noord-Holland, Amsterdam, Netherlands

Job Description

Location: Amsterdam (Hybrid)
Contract Type: 6-Month Contract
Start Date: ASAP
Hours: Full-Time, 40h per week

About the Role

We're looking for a highly skilled Senior Site Reliability Engineer I to join a forward-thinking engineering organization in Amsterdam. As part of this team, you'll focus on building reliable, scalable, and automated systems that keep services performant, resilient, and secure.

This role is perfect for someone who thrives at the intersection of software engineering and systems operations, with a deep passion for automation, observability, and continuous improvement. You'll have end-to-end ownership of key infrastructure components, ensuring reliability and performance while minimizing manual toil.

Key Responsibilities Operational Excellence & System Ownership

Own and maintain key operational capabilities: Configuration Management, Secret Management, Certificate Management, and Runtime Configuration.
Ensure end-to-end system reliability by monitoring application health, setting metrics, and responding proactively to alerts and incidents.
Troubleshoot complex distributed systems and provide responsive support to internal stakeholders.
Lead incident response efforts and contribute to postmortem reviews to drive long-term improvements.
Maintain high standards of data security, integrity, and quality across all services.

Automation & Continuous Improvement

Automate recurring operations to reduce manual intervention and improve consistency.
Continuously enhance CI/CD processes and infrastructure tooling.
Identify and eliminate operational bottlenecks by reducing technical debt and implementing scalable solutions.
Partner with development teams to improve observability, monitoring, and alerting frameworks.

Reliability & Performance

Enhance system performance, scalability, and latency through infrastructure optimization and proactive capacity planning.
Ensure infrastructure and services remain current and cost-effective by leveraging new technologies.
Contribute to system architecture discussions and provide technical leadership on reliability topics.

Communication & Collaboration

Act as a trusted partner to internal clients, product stakeholders, and peers, ensuring alignment and clarity.
Communicate effectively within a multi-functional, globally distributed team.
Mentor junior engineers and promote best practices in system design, automation, and incident management.

Key Skills & Qualifications

5-8 years of experience as a Site Reliability Engineer, Systems Engineer, or Software Engineer in a high-availability environment.
Proven troubleshooting skills for complex distributed systems and production issues.
Strong programming skills in Go and Python.
Hands-on experience with Kubernetes, AWS, and bare-metal infrastructure.
Expertise in Certificate Management and Public Key Infrastructure (PKI).
Experience with configuration and secret management tools such as Vault and Puppet.
Familiarity with PostgreSQL or similar relational databases.
Deep understanding of observability practices (metrics, logs, tracing) and incident management.
Excellent analytical, critical-thinking, and communication skills.

Why Join Us?

Work in a cutting-edge engineering environment where reliability and automation are at the core of everything.
Opportunity to influence large-scale system architecture and reliability strategies.
Collaborate with some of the brightest minds in SRE and infrastructure engineering.
Flexible hybrid working setup based in vibrant Amsterdam.
Competitive daily or monthly compensation aligned with experience.
A chance to contribute to meaningful projects that directly impact system performance and customer satisfaction.

How to Apply

If you're passionate about reliability engineering, automation, and large-scale system design-and you're ready to take ownership of complex technical challenges-we'd love to hear from you. Apply today and join a team that's redefining what reliability means in modern infrastructure.