Chaos and Kraken Expert for a multinational Institution in Strasbourg

Posted 1 day 10 hours ago by infeurope S.A.

Contract
Not Specified
University and College Jobs
Bas-Rhin, France
Job Description

We are looking for a Chaos and Kraken Expert to work for a client project in Strasbourg.

Location: 80% on-site work in Strasbourg and 20% off-site work

Start date: immediately

End Date: 12 months

Preliminary Requirements: Candidate must be citizen of member states of the European Union (European Union nationality), and should be able to get their criminal record.

Role and Tasks description:

The Chaos and Kraken Expert will be in charge to manage, define, execute, and support Chaos and Kraken scenario on distributed systems.

He/she will be Responsible for creating best practices for using these tools to automate crash scenario testing and communicating them to the teams.

He/she will have to be able to draw up procedures and white paper recommendations to use. These procedures will then be used by the testing teams to carry out resilience and service level tests.

He/she will have to demonstrate a high degree of autonomy, be open-minded and know how to transfer knowledge to the testing teams.

The candidate will need to have experience of managing Chaos and Kraken tools in an OpenShift microservice environment and a virtualised platform.

Main skills required:

  • Chaos Experiment Design: Planning and designing experiments that simulate system failures, such as service outages, network latency, packet loss, etc.
  • Experiment Execution: Implementing and executing these experiments in test or controlled production environments.
  • Monitoring and Analysis: Monitoring system behaviour during experiments and analysing results to identify weak points.
  • Documentation and Reporting: Documenting findings and providing detailed reports with recommendations to improve system resilience.
  • Automation: Developing scripts and tools to automate the execution of chaos experiments.
  • Collaboration: Working with testing, development, operations, and security teams to implement improvements based on experiment results.

Key skills:

  • University degree in Computer Science: Master or equivalent;
  • Knowledge of Distributed Systems: Understanding how distributed systems work and potential failures that can occur.
  • Programming and Scripting: Skills in languages such as Python, Go, and other Scripting languages.
  • Chaos Engineering Tools: Familiarity with tools like Chaos Monkey, Gremlin, Litmus, and others.
  • Data Analysis: Ability to analyse experiment results and draw useful conclusions.
  • Communication: Ability to communicate findings and recommendations to technical and non-technical teams.
  • Problem Solving: Ability to identify and solve complex problems in distributed systems.
  • Openshift/Kubernetes (corporate microservices platform used)
  • VMWare/Linux/Windows/Shell
  • DevOps - Github/Ansible/Helm/ArgoCD/Jenkins/
  • Very good English speaking & writing skills;
  • Experience and willingness of working in an international/multicultural environment.

infeurope is a Luxembourg-based IT service provider, designing, developing and managing multilingual information and documentary systems in many application areas and business sectors. For more than 40 years we have delivered IT systems and solutions.