Test Environment Manager
Posted 3 hours 36 minutes ago by Intuition IT Solutions Ltd
£75,000 Annual
Permanent
Not Specified
I.T. & Communications Jobs
London, United Kingdom
Job Description
- Automate environment life cycle: Develop Infrastructure as Code (IaC) to automate the provisioning, teardown, and configuration of test environments, integrating them with the CI/CD pipeline.
- Establish service level objectives (SLOs): Define and measure key service indicators (SLIs) for test environments, such as availability and provisioning time, to ensure they meet the needs of development and testing teams.
- Monitor environment health and performance: Use observability tools like Prometheus and Grafana to track the health of test environments, identify bottlenecks, and resolve issues proactively, not reactively.
- Manage incident response: Lead the incident management process for test environment issues, conducting blameless post-mortems to understand the root causes and implement lasting fixes.
- Minimize toil: Automate manual, repetitive tasks associated with test environments to free up engineering time for more strategic work.
- Strategic and cultural responsibilities
Technical skills
- Expertise in tooling: Proficiency with monitoring and logging tools (eg, Prometheus, Splunk, Grafana), CI/CD platforms (eg, Jenkins, GitLab CI), and configuration management tools (eg, Ansible, Terraform).
- Cloud infrastructure knowledge: Deep understanding of cloud platforms like AWS, including experience with containerization technologies (Docker, Kubernetes) and serverless computing.
- Scripting and programming: Strong Scripting skills in languages such as Python or Bash to automate environment management tasks.
- Systems and networking knowledge: Solid understanding of Linux systems, networking concepts, and database management.
Soft skills
- Leadership and influence: The ability to champion SRE practices and influence technical and business stakeholders across different teams.
- Problem-solving: Strong analytical and debugging skills for investigating and resolving complex environment issues under pressure.
- Communication: Excellent communication and collaboration skills to bridge the gap between development, QA, and operations teams.
- Adaptability: A proactive and adaptable mindset to keep pace with evolving technology and development methodologies.