Site Reliability Engineer at Santander Auto Software

  • Anywhere (100% Remote) Only
  • Santander Auto Software
Job Description:

Santander Auto Software is an exciting new greenfield programme in its early stages, where we are building a brand-new, state-of-the-art software platform and product suite that will be rolled out in 13 countries.

The business is 100% tech-focused and made up of skilled cross-border software builders, with a leadership team of accomplished individuals from the likes of Amazon, Microsoft, and other major global tech players.

What are we looking for?

We are seeking a Site Reliability Engineer (SRE) to build, deploy, operate, sustain, and grow our software systems. The team will drive the stability and sustainability of our systems and discover innovative ways to scale and operate them reliably as we grow.

In this role, you will work with Engineers to create proactive engineering mechanisms that will enable your team to manage the health of our products. You will deploy and monitor the systems and automation to ensure that our tools and products are operating optimally. You will utilize trends and metrics to identify opportunities for improvements within existing frameworks, tools and processes to continuously improve systems.

The ideal candidate will have a strong technical background, be detail-driven, and have excellent problem-solving abilities. You will be comfortable designing, building, deploying, and operating software systems.

Responsibilities and what you will be doing:

  • Participation in reliability and software engineering
  • Enable, train and support engineers to implement operational tasks autonomously
  • Continuous operations of our mobility platform and incident resolution
  • Monitoring of the platform and development of predictive alert systems to prevent incidents
  • Development of fault-tolerant mechanisms ensuring the availability of services and applications
  • Participate in post-mortems and communicate stats on service availability
  • Availability to be periodically on call

Experience required:

  • Advanced knowledge of public cloud providers, preferably AWS
  • Solid knowledge of observability and monitoring (preferably with Datadog) and of self-healing services
  • Good experience in CI/CD, version control, cloud computing and container orchestration
  • Ability to troubleshoot problems using high- and low-level debugging tools and techniques
  • Experience working with Java
  • Working knowledge of agile development methods
  • Ability to take ownership of technical problems and drive solutions
  • Strong communication skills
  • Someone who thrives on not being constrained by a job description and will instead own a problem and see it through to a solution

Company Benefits

  • Private Pension
  • Healthcare
  • Equipment budget
  • Bonus Scheme
  • Flexi Benefits

Interview Process

2 stages: a 30-minute and a 60-minute Teams interview
