Leave us your email address and we'll send you all the new jobs according to your preferences.
Data Network Engineer - SRE, Telemetry, Observability, Monitoring & Performance
Posted 5 days 17 hours ago by La Fosse Associates
Seeking a Network Engineer with experience of Telemetry, Observability, Monitoring & Peformance, ideally within a high availability Network Infrastructure Site Reliability Engineering environment. The network strategy is highly focused towards Next-Gen, Software Defined Networking and in this role you you will work at the intersection of software engineering, Networks SRE & platform operations & engineering, with the ulitmate aim of developing actionable insights from telemetry data and enhancing the value of observability tooling.
Previous experience might include:
- Collaborate cross-functionally to ensure observability is embedded into the SDLC & CI/CD pipelines.
- Designing & implementing telemetry pipelines for metrics, logs, traces, and events.
- Developing observability standards, NMS tooling, dashboards, alerting frameworks, and SLOs.
- Integrating & optimising observability tools such as OpenTelemetry, Prometheus, Grafana, Splunk & Elastic.
This role will require:
- Having previously worked within Network/Platform Observability, Networks SRE, or Platform Engineering roles within complex, distributed environments.
- Strong expertise with telemetry tools such as OpenTelemetry, Prometheus, Grafana, Splunk, Elastic, Loki, Jaeger, or similar.
- Proficiency in at least one programming language (e.g., Python, Go, Java) and infrastructure-as-codetools (e.g., Terraform, Helm).
- Deep understanding of cloud-native architectures (Kubernetes, microservices, service meshes).
Highly desired:
- Industry experience such as the following Media/Streaming, High Frequency Trading e.g. Investment Banking, Online Gaming, Hyperscalers, High Availability, Low Latency Network Infrastructure