Site Reliability Engineer (Linux, Windows, Cloud)
Our Software House client has an exciting role for a Site Reliability Engineer (Windows, Linux, Cloud) to be based in Ne wcastle -upon-Tyne where you will enjoy a salary of £40,000 - £50,000 plus benefits Non-Negotiable for this role: ● Strong understanding of both Windows and Linux Operating Systems ● Experience with cloud operations and site reliability ● Understanding of emerging technologies and practices for operating modern distributed services within the cloud ● Experience with common monitoring systems such as Nagios, Icinga , New Relic. ● Strong understanding of Git ● Experience in using Puppet or other similar tool like Chef, Ansible etc. ● Skilled in one or more scripting languages ( Bash, Python, Powershell ) ● Experience with SQL and/or NoSQL data store technologies. ● Familiarity with agile development practices, continuous integration and test automation ● Desire to continually learn, improve and challenge our current methods of operating their platform Team Setup: There are 5 people in the BA team The role will be reporting to ISO The BA team is part of the Products & Solutions which has 20 people in the team. Benefits: Pension Private Healthcare, cash plan package with BUPA. Life assurance Company device ( laptop). Free snacks Free breakfast Work socials etc. Our client is a leading supplier of software t hey power some of the biggest brands globally, working in regulated markets and processing tens of millions of transactions per month. You'll be based in their headquarters in Newcastle upon Tyne, the y also have offices in London and Sofia, Bulgaria. We are looking to recruit a Site Reliability Engineer to join thei r Fabric Team. You'll work in a highly collaborative way to drive efforts to build, support and improve thei r infrastructure and tools used by thei r development teams to run the services that make up thei r platform. They expect that you demonstrate and apply exemplary engineering practices to increase agility, improve quality and help reduce downtime in all thei r solutions. They embrace DevOps culture and you’re also expected to drive this change and ensure that they embed Agile and DevOps principles in everything they do. KEY RESPONSIBILITIES: ● Automate, automate, automate ● Deliver solid infrastructure as code and desired configuration state solutions by using automation tools such as Terraform and Puppet ● Design and implement solutions that boost the stability, scalability, performance and security of Fabric products ● Support services once they are live by measuring and monitoring availability, latency and overall system health ● Work towards integrating the delivery of the infrastructure into the CI/CD pipeline, including helping to implement automated testing. ● Mentoring / supporting engineers regarding tools, concepts and best practices ● Evangelise DevOps culture of continuous improvement ● Conduct knowledge sharing sessions with people within and outside the team and evolve Fabric products documentation ● Contribute to healthy team culture and engagement in the team’s current priorities. ● Escalate any issues and propose solutions for mitigation DESIRABLE SKILLS: ● Experience in Terraform or similar ● Experience in Azure ● Experience in Automated Infrastructure Testing ( Beaker, Test Kitchen) ● Understanding of compiled or interpreted programming languages such as C#, Go, Ruby. ● Knowledge of/experience with containerisation technologies such as Docker, Kubernetes, Nomad etc.