Work Description
About the Company
Avaron AB is a growing consultancy focused on technology, finance, and business support.
We match your expertise with the market's most interesting assignments, offering a platform where your professional development is central.
About the Assignment
You will take a leading role in establishing and evolving a Site Reliability Engineering (SRE) practice as part of a modernization journey towards cloud-native solutions on Microsoft Azure.
Working closely with development and platform stakeholders, you’ll help build the strategy, tooling, ways of working and culture needed to deliver a highly reliable digital experience within the pet insurance domain.
Job Description
- Lead the design, implementation and continuous improvement of SRE practices for Azure workloads.
- Partner with development teams to embed SRE principles throughout the development lifecycle.
- Own and improve the operational health, reliability and performance of services running in Azure.
- Define and implement monitoring, incident management routines and post-incident reviews.
- Automate operational tasks using software/scripting (e.g., C#, Python, Node.js or other suitable languages).
- Mentor colleagues on reliability, observability and cloud operations best practices.
- Participate in and lead incident response efforts.
- Establish and maintain a desired-state operational model aligned with stakeholders and the platform team.
Requirements
- At least 5 years of experience in the IT field.
- Proven experience practicing Site Reliability Engineering in a reliability/ops team, in close collaboration with development teams.
- Deep knowledge of Microsoft Azure and operating/troubleshooting cloud workloads such as Azure Functions, Azure Container Apps, Azure App Service, SQL Azure and Cosmos DB.
- Solid understanding of databases (SQL and NoSQL), including backup and restore operations.
- Software development proficiency enabling automation of repetitive operational work (C#, Python, Node.js or similar).
- Hands-on experience with Azure DevOps, including Git branching and code review practices.
- Good understanding of networking, especially Azure virtual networks.
- Practical knowledge of incident and problem management.
Nice to have
- Azure certification(s).
- Understanding of distributed systems and microservices.
- Experience with Kubernetes platforms such as AKS and/or OpenShift.
- Familiarity with ITSM tools such as ServiceNow.
Application
Selections are made on an ongoing basis, so we recommend that you apply as soon as possible.