Location
East Midlands (England), East of England, London (region), North East England, North West England, Scotland, South East England, South West England, Wales, West Midlands (England), Yorkshire and the Humber
About the job
*We offer a hybrid working model, allowing for a balance between remote work and time spent in your local office. Office locations can be found ON THIS MAP
The Role
We’re recruiting for a Lead Site Reliability Engineer here at Justice Digital, to lead our site reliability engineering team in HMPPS Digital.
Within the team, you will be helping to build and maintain platforms that underpin the digital services we are delivering. You will work closely with development teams, cloud platforms teams, live service teams and security teams to help maintain and develop services. We use modern best practices like DevOps and agile, use cloud native architectures and prefer modern open-source tools.
This role aligns against the Lead DevOps Engineer role from the Government Digital and Data Framework.
About Us:
At Justice Digital, we're dedicated to leveraging technology to drive impactful change across the justice system. As a Lead Site Reliability Engineer, you'll play a pivotal role in enhancing access to justice and improving outcomes for users through innovative digital solutions.
Responsibilities: You’ll be working on our acclaimed open-source public services, with user needs at the heart of everything, helping us transform Government for the future. As part of a multi-disciplinary team, you’ll help define how we work and make sure that our systems are built to be changed rapidly, while leading site reliability engineering specialists across teams.
Collaboration: You’ll collaborate closely with software developers, product managers, designers, delivery managers, technical architects and content specialists who share our vision of leveraging technology to transform government services.
Our Tech Stack
Technologies: We use a diverse range of technologies, and we’re seeking individuals who specialise in one or more and are eager to learn new languages and frameworks. Our tech stack includes:
• Cloud infrastructure: AWS
• Infrastructure as code: Terraform, AWS CloudFormation
• Containerisation: Docker
• CI/CD deployments: GitHub Actions, Concourse, CircleCI
• Application code: Python, Ruby, JavaScript
Learning and Support: Once part of Justice Digital, we'll support you in mastering our tech stack, regardless of your current experience. Explore our GitHub for insights into our technologies and the services we develop and maintain.
Our Community: Join over 150 experienced software and site reliability engineers who form our vibrant engineering community across the MoJ. You’ll have opportunities to mentor junior colleagues and participate in informal support networks with peers. We encourage active engagement in shaping our engineering culture and community.
Career Development: We take pride in our supportive and effective line management. Your skills are highly valued, and we’re committed to helping you expand them within the civil service. You'll have opportunities to move between teams or departments, explore new technologies, and take on increased responsibilities aligned with your career goals.
Explore Further: Dive deeper into our work and culture by visiting our Developer Blog and Justice Digital Blog.
Key Responsibilities:
As a Lead Site Reliability Engineer, you will:
Provide strong leadership to set the future site reliability engineering strategy for a fast-paced, demanding environment
Take ownership of improving the site reliability engineering capability across the large number of diverse development and engineering teams
Work with the Head of Profession, the wider engineering leadership team and development operations community to ensure we build maintainable and sustainable digital products across Digital & Technology
Work closely with the Service Owner to ensure provision of a high-quality, cost-effective service.
Stay up to date with, and lead the creation of standards around, development operations practices and techniques to best enable our teams to consistently deliver at pace
Mentor the site reliability engineers through the design and implementation of solutions, ensuring alignment with the organisation's standards and identifying opportunities for collaboration where appropriate
Collaborate with technical architects and software developers to build and maintain a strong site reliability culture
Advocate user-centric, agile approaches which focus on rapid, effective delivery of high-quality digital services
Assist in transforming technical requirements into automated processes including managing tools and testing environments, central code control, maintaining development standards and writing software that automates systems
Support the site reliability team in delivering automated software components that form part of a tool chain and transform technical requirements into automated processes
Work collaboratively and supportively with other local professions leads to identify and resolve technical, operational and business issues preventing delivery.
Support sharing of methods and technologies across teams, government, and the industry by helping to organise events
Help publicise our achievements and learning, and celebrate our successes through blog posts, social media and/or speaking at events and conferences
Build and maintain a diverse, inclusive culture within the local web operations community, growing awareness, inclusivity, and balance
Participate in support out of hours on a rotational basis as required (for which you’ll be paid an allowance)
Coordinate and manage site reliability engineering recruitment, shaping our in-house team, making it more diverse and inclusive
If this feels like an exciting challenge, something you are enthusiastic about, and you want to join our team, please read on and apply!
Person Specification
Essential
Technical Leadership and Collaboration
Provides day-to-day technical leadership, setting standards for build, deployment and operational practices, and working across platform, security and delivery teams.
Programming and Build (Software Engineering)
Designs, codes, tests, reviews, and documents software of medium to high complexity, applying sound engineering principles to balance innovation with operational stability.
Service Support and Reliability
Leads incident resolution in live environments, ensuring root causes are addressed, fixes are repeatable, and support documentation is robust.
Improves observability, monitoring, and service recovery based on real-world support experience, enhancing reliability across services.
Systems Design and Integration
Shapes and reviews system designs for architectural alignment, leading integration efforts to ensure interoperability and smooth deployment across shared environments.
Continuous Improvement and Delivery Practice
Drives platform consistency through automation and repeatability, identifying and implementing improvements in pipelines, monitoring, and infrastructure-as-code practices.
Balances delivery speed with long-term maintainability and security, ensuring sustainable engineering practices across teams.
Platform and Organisational Context
Guides teams in delivering secure, compliant, and efficient solutions by applying deep knowledge of MoJ cloud platforms and shaping best practices aligned with broader engineering strategy.
Willingness to be assessed against the requirements for SC clearance
We welcome the unique contribution diverse applicants bring and do not discriminate based on culture, ethnicity, race, nationality or national origin, age, sex, gender identity or expression, religion or belief, disability status, sexual orientation, educational or social background or any other factor.
Our values are Purpose, Humanity, Openness and Together. Find out more here about how we celebrate diversity and an inclusive culture in our workplace.
The Civil Service is committed to attracting, retaining and investing in talent wherever it is found. To learn more please see the Civil Service People Plan and the Civil Service D&I Strategy.