DevOps Manager - London

City of London, London
23 Jul 2017
22 Aug 2017
Contract Type
Full Time
My client is a British multinational retailing company headquartered in London, with over 2,500 outlets in the UK and around the world.

My client's talented IT team works tirelessly to provide a seamless internal and external customer experience, with the purpose of delivering genuine customer value. Working in close partnership with the brands, the team strives to be a catalyst for business transformation, showcasing industry-leading technology solutions.

The Software Engineering community is responsible for creating world-class software solutions to provide our customers with the best possible retail experience.

My client's engineers work within a clear framework of accountability, ensuring substantial personal responsibility and promoting autonomy.
Our platform strategy delivers cloud-based infrastructure across all our digital touchpoints, supporting a platform written in-house.

The role of the Operations Engineering Manager is to manage the team of Site Reliability and DevOps Engineers, maintaining the highest standards of operation, uptime and performance for the Arcadia Digital Platforms while embedding the practices of automation, speed and performance into our software engineering teams and delivery processes.


* Site Reliability Engineering - Flawless customer experience
  o Own and operate the cloud infrastructure for our in-house built e-commerce platform, exploiting real-time telemetry to prevent operational issues that lead to poor customer experiences.
  o Create feedback loops to continuously eradicate errors from the platform.
  o Own and operate platform alerting, 3rd line incident response and post mortems.
  o Carry out capacity planning and performance improvements.
  o Design and test for failures to ensure the platform is resilient.
  o Ensure that sensitive customer data and payment data are safe and secure as they transit the platform.
* Cloud Infrastructure Support - Operational Excellence
  o Support and maintain our wider digital services cloud infrastructure across in-house and 3rd party applications such as Customer Care, Order Management, CDN and CMS.
  o Drive an automation-first culture, seeking to optimise and accelerate at every opportunity.
  o Ensure the highest levels of uptime, performance and security are continuously maintained.

* Demonstrable experience of leading DevOps and Site Reliability teams to deliver strong technical and commercial results.
* Experience of driving change within an organisation, pushing through resistance and succeeding in adopting new ways of working.
* Successful experience of implementing continuous integration/delivery.
* Attention to detail to ensure code management, code workflow, security and performance analysis standards are adhered to.
* Excellent experience of building a practice that adopts data-driven continuous improvement: taking metrics, analysing data and building technical pipelines of improvement tasks.


This is a leadership role that requires strong relationships with many areas of the engineering community. Solid technical grounding and experience in the following is therefore required:
* Cloud compute - AWS Elastic Beanstalk, EC2, ElastiCache Redis, DynamoDB
* Code and Containers - GitHub, Docker Hub
* Logging - New Relic, ELK stack, CloudWatch
* Networks - Akamai CDN and WAF, CloudFront, Route 53
* Automation and configuration - Jenkins, Terraform, Ansible
* Some exposure to a broad and diverse range of web technologies such as Node.js and React

Spring Technology is acting as an Employment Business in relation to this vacancy.

Spring Technology is an Equal Opportunities employer; we welcome applicants from all backgrounds.