Data Engineer

Posted 4 days 1 hour ago by Alexander Mann Solutions - Public Sector Resourcing

Contract

Not Specified

Other

London, United Kingdom

Job Description

On behalf of The Ministry of Justice, we are looking for a Data Engineer (Inside IR35) for a 3 month contract based hybrid in London.

The Ministry of Justice (MoJ) priorities include improving public safety and reducing reoffending by reforming prisons, probation and youth justice, and building a justice system which makes access to justice swifter and more certain for all citizens whatever their background. Project professionals in the MoJ help to improve the government's ability to protect the public and reduce reoffending, and to provide a more effective, transparent and responsive criminal justice system for victims and the public.

SC Clearance is an essential requirement for this role, as a minimum you must be willing & eligible to undergo checks. Please note, due to the exceptional requirements of this position (short-term nature of this role and speed at which we require a postholder in situ) preference may be given to candidates who meet all of the essential criteria and hold active security clearance.

Role:
. We're looking for a Data Engineer to support development of the MoJ's data catalogue, Find MoJ Data. Find MoJ data is a searchable catalogue service to help users find and understand data, supporting the department's data strategy. Find MoJ Data unites many different MoJ sources in one place, describing datasets with metadata but without including actual data. This is because data owners are responsible for controlling access to potentially sensitive data.
. The successful candidate will join the Data Modelling and Engineering Team (DMET) but will be Embedded within our Data Catalogue team to support this work. A summary of the type of work the teams are involved in can be found in our handbook.

Essential:
. Demonstrable passion for data engineering
. Python programming
. Automated testing (eg, with PyTest)
. Version control using Git
. Working with diverse data sources
. Data manipulation, cleansing, wrangling, anonymisation and transformation
. Working with and optimising SQL for querying and structuring large datasets
. Communicating technical ideas and translating user needs into solutions

Desirable:
. Kubernetes and container-based workflows
. CI/CD tools (eg, GitHub Actions)
. Cloud and infrastructure-as-code (eg, AWS Athena, Glue, S3, IAM and Terraform)
. Orchestration tools (eg, Airflow)
. Big data tools (eg, Spark)
. Data transformation tools (eg, dbt)
. Data Lake table formats (eg, Iceberg)

Please be aware that this role can only be worked within the UK and not Overseas.