Infrastructure Reliability Engineering, Senior Manager
Posted 4 hours 54 minutes ago by London Metal Exchange Limited
£80,000 - £100,000 Annual
Permanent
Full Time
Academic Jobs
London, United Kingdom
Job Description
Overall Purpose of Role:
The role is accountable for the Infrastructure Reliability Engineering (IRE) function, embedding reliability engineering as a core discipline across the technology lifecycle, from design through live operation. It provides senior leadership across Infrastructure Reliability Engineering and is accountable for the resilience, availability, and operational readiness of the LME Group technology estate. The role leads the design and delivery of complex infrastructure transformation, platform modernisation, and re-architecture initiatives, ensuring secure, compliant, and highly reliable services that support trading-critical operations and regulatory obligations.
Responsibilities:
Establish, mature, and continuously evolve the Infrastructure Reliability Engineering function, defining the IRE operating model, engagement patterns, and service boundaries across infrastructure, architecture, operations, security, and application teams.
Set, maintain, and enforce consistent reliability engineering standards, patterns, and tooling across the infrastructure estate, balancing resilience, regulatory assurance, and operational efficiency.
Act as senior Infrastructure Reliability Engineering SME across major programmes end-to-end (discovery, dependency mapping, design, planning, build, cutover, fallback), with direct accountability for service stability and risk reduction for trading-critical platforms.
Act as the accountable owner for Infrastructure Operational Readiness, ensuring platforms and services do not transition into live operation without meeting mandated readiness, observability, recoverability, and supportability criteria.
Define and embed a consistent reliability measurement framework across infrastructure platforms, including service level indicators, objectives, and leading indicators of operational risk, enabling data-driven prioritisation and informed investment decisions.
Build, lead, and develop a high-performing Infrastructure Reliability Engineering team, defining clear role expectations, capability standards, and development pathways.
Foster a culture of engineering excellence, shared ownership, and continuous improvement, ensuring operational knowledge and resilience capability are institutionalised and not dependent on individuals.
Act as a senior authority on infrastructure resilience and operational risk, influencing strategic decisions, architectural direction, and investment priorities to ensure reliability is designed in, not retrofitted.
Own measurable infrastructure reliability outcomes, including availability, resilience, recovery performance, and operational risk reduction, with regular executive-level reporting against agreed targets.
Define and drive the LME Infrastructure Reliability posture, including fault tolerance, redundancy, capacity planning, disaster recovery, and failover strategies across on-premises and hybrid environments.
Ensure infrastructure platforms meet security and compliance requirements (e.g. CIS, ISO 27001, NIST), covering identity and access management, encryption, auditability, and regulatory evidence.
Academic and Professional Qualifications Required:
Demonstrable track record of continuous professional development in infrastructure, solutions engineering, or technology transformation.
Required Knowledge and Level of Experience:
10+ years of experience leading large-scale Infrastructure or Reliability Engineering functions, with demonstrable accountability for the availability, resilience, and operational performance of mission-critical systems.
Proven experience establishing, scaling, or materially maturing an Infrastructure Reliability, Platform Reliability, or equivalent function within a complex, regulated, or high-availability environment.
Significant experience operating in regulated or high-assurance environments (e.g. financial services, exchanges, clearing, or equivalent).
Experience influencing senior leadership and steering complex transformation initiatives across multiple technology domains.
Significant experience leading or assuring large-scale, enterprise Linux estates (e.g. RHEL-based), including responsibility for reliability, resilience, and operational risk in regulated or high-availability environments.
Skill Set and Core Competencies Required for Role:
Deep expertise in infrastructure reliability engineering, resilience patterns, and operational risk management.
Excellent stakeholder engagement and senior communication skills.
Ability to lead multidisciplinary technical teams through complex change.
Technical skills: Infrastructure Reliability Engineering.
Curious and improvement-oriented, always seeking to challenge the status quo and drive innovation with data-driven insights.
In recruiting for our teams, we welcome the unique contributions that you can bring in terms of education, ethnicity, race, sex, gender identity, expression and reassignment, nation of origin, age, languages spoken, colour, religion, disability, sexual orientation and beliefs. In doing so, we want every LME employee to feel our commitment to showing respect for all and encouraging open collaboration and communication.
Hong Kong Exchanges and Clearing Limited (HKEX) is a publicly traded company (HKEX Stock Code: 388) and one of the world's leading global exchange groups, offering a range of equity, derivative, commodity, fixed income and other financial markets, products and services, including the London Metal Exchange. As a superconnector and gateway between East and West, HKEX facilitates the two-way flow of capital, ideas and dialogue between China and the rest of the world, through its pioneering Connect schemes, increasingly diversified product ecosystem and its deep, liquid and international markets. HKEX is a purpose-led organisation which, across its business and through the work of HKEX Foundation, seeks to connect, promote and progress its markets and the communities it supports for the prosperity of all.