Job Description:

About the role:

Ensono Digital is continuing its growth and building a cloud-native managed service offering for our clients. We are looking for energetic and skilled remote Site Reliability Engineers to join us on this exciting new journey.

As a Site Reliability Engineer, you and your team will be responsible for between four and ten of Ensono Digital's cloud-native managed services clients. Ensono Digital has invested time to create templated cloud-native solutions to provide value to our clients. They have loved what we’ve done so far and want us to operate these applications in production on their behalf.

In response to this demand, Ensono Digital is applying Site Reliability Engineering principles to disrupt the traditional Managed Services approach and deliver something that empowers our customers and turns technology into an efficiency, growth and innovation multiplier.

The successful candidate will be reporting into the Head of SRE and will start supporting our clients immediately. New projects are in the pipeline, so you will also be working with our pre-sales and delivery teams to ensure operations are considered long before handover.

We are just starting on our journey to Site Reliability Engineering, so we are eager to continue to learn from industry leaders and your experiences in delivering Site Reliability Engineering to build a sustainable workplace that delivers a service which will delight our customers.

What you'll be doing:

As a Site Reliability Engineer, your overarching responsibility is to ensure we meet our clients’ Service Level Objectives, and we respond to incidents in a timely and professional manner.

Your responsibilities will include:

  • Monitoring our client’s services using modern tools and SRE practices.
  • Responding to incidents originating from 2nd line support within the times set out in the SLA (being on-call).
  • Performing and assisting in root cause analysis and blameless post-mortems to enable incidents to be understood and avoided in the future.
  • Improving the testing and release procedure.
  • Planning for and making changes to capacity to balance the demand vs. cost saving equation better.
  • Undertaking improvements to the infrastructure and product.
  • Making changes to client’s services based upon operational or business needs.
  • Advising and supporting the further development of Ensono Digital’s Intellectual Property to ensure future projects benefit from what we learn.

What you’ll bring to Ensono Digital:

  • A comprehensive understanding of Site Reliability Engineering
  • Experience working with a cloud service provider (ideally Azure or AWS)
  • Strong examples of implementing automation/solutions by code (preferably Python, C#, Java, or Go)
  • Commercial experience working with compute technologies (such as Kubernetes, or Serverless)
  • Designed, implemented, and/or supported solutions in a production environment
  • Strong interpersonal and communication skills to work in a fast-paced and rapidly changing dynamic environment

Any additional expertise of the following will be very beneficial:

  • Experience with CI/CD pipeline tools (such as Azure DevOps, GitHub Actions, Gitlab CI)
  • Experience with monitoring, logging tools (such as Azure Monitor, CloudWatch or Prometheus)
  • Experience with ITSM tools (such as ServiceNow, OpsGenie, or PagerDuty)
  • Working with an Infrastructure as Code tool (Terraform, ARM, CloudFormation or Deployment Manager)
  • Excellent troubleshooting skills that span systems, networks (TCP/IP), and code
  • Expert knowledge of Linux internals and tuning

What we can offer you:

We will give you a place to strive and grow, where you will have the opportunity to work on interesting, yet challenging projects. Applying your thinking to build a better world founded on intelligent technologies.

This will be an extremely varied position that’s all about problem-solving and finding the right technical solutions for your client. You will be given autonomy over your work so that you will have the opportunity to shape the projects you’re working on.

We are a people-first business, which means people are at the heart of everything we do here. We offer our consultants a safe environment where knowledge sharing, and open communication is encouraged. Whether at one of the internal monthly events, such as Lunch & Learns, Tech Time, and internal competency meet-ups, or at one of our community groups, such as football, gaming, yoga, or wellbeing; we have strived to build a business where everyone feels welcomed, included, and valued.

Our benefits include:

  • 27 days annual leave (plus bank holidays)
  • 1/2 day on your birthday
  • Sabbatical options at 5&10 years
  • Discretionary bonus
  • An annual training budget, up to 5 days study leave and 10+ training vendors
  • Generous company pension
  • Private healthcare for you and your family
  • Payroll giving
  • Enhanced parental and dependent leave
  • Equity appreciation program and incentive plan
  • Life and income protection
  • Additional perks such as discounted gym memberships, cycle scheme, EAP and more!

Company Benefits

  • 27 days annual leave plus bank holidays
  • Discretionary bonus based on individual and company performance
  • Private medical insurance
  • Life assurance
  • 1:1 financial planning
  • Corporate gym membership
  • £2k unrestricted training budget
  • Company Pension
  • Season ticket loan
  • Flexible working
  • Cycle to work scheme

Interview Process

  • Screening call with Talent Partner
  • 2 Stage Technical Interview
  • Final Cultural and Values interview

Other Jobs in DevOps & SysAdmins