Principal Site Reliability Engineer at Featurespace

  • UK Only
  • Featurespace
Job Description:

At Featurespace, we strive to be the world’s best software company at protecting our clients and their customers from fraud attacks. We do that with personality, heart, and professionalism, cultivating an innovative, fun, and positive team atmosphere where everybody can contribute to solving our clients’ problems in new, innovative ways. We are always seeking to be the best at what we do and make our customers smile.

The Opportunity:

We are looking for a passionate and enthusiastic Principal Site Reliability Engineer to take a leadership role on our cloud operations team to build and operate a new SaaS platform for our industry-leading, fraud and financial crime fighting technology, ARIC Risk Hub.

We currently have roles available for SREs at different levels of seniority, but at all levels we are looking for individuals who are driven to deliver a feature-rich and robust product through teamwork and technical skill.

The role is based in Cambridge, but will benefit from Featurespace’s flexible working policy.

As a Principal SRE at Featurespace, you will help us achieve our goals and deliver success on behalf of our customers by:

  • Developing a stable and performant cloud-native SaaS platform for ARIC, on Kubernetes
  • Operating the new SaaS platform and additional cloud infrastructure
  • Continually improving our software, services and infrastructure to ensure we continue to be on the forefront of innovation in our industry In the role you will work alongside other SREs of varying levels of seniority, but also with software developers, support engineers, data scientists, implementation consultants, vendors and customers. The ideal candidate for this role will combine technical and communication skills with enthusiasm and creativity.

Day to Day:

  • Designing, developing, deploying, maintaining, monitoring and upgrading production deployments of ARIC Risk Hub SaaS
  • Building software and systems to manage platform infrastructure and applications
  • Continually be evaluating and improving our technology and processes to increase quality, decrease costs and improve time-to-market
  • Periodically be testing the service with predictable and unpredictable failures
  • Providing 2nd-line operational support to our SaaS customers
  • Gathering data and generating reports on the service performance
  • Developing and documenting internal processes
  • Working with engineering/data science to drive and develop new and improved ARIC Risk Hub capabilities
  • Sharing knowledge through mentorship
  • Leading (and if appropriate, managing) projects and people

About you:

We are primarily looking to appoint a talented candidate with a can-do attitude and desire to learn new skills and technologies. The ideal candidate will have extensive breadth and/or depth in the following list, but all are not essential.

  • Bachelor’s, Master’s or higher qualification in computer science or related discipline
  • Experience with cloud computing (preferably AWS)
  • System administration (shell-scripting, command-line tools, networking, monitoring, etc.)
  • Experience developing in high level programming languages (preferably Python)
  • Experience of the Systems Development Life Cycle (source control, testing, code review, etc.)
  • Knowledge of containerisation & Kubernetes (as an administrator or application developer)
  • Experience operating or supporting production systems
  • Experience with any of: Terraform, SaltStack, MongoDB, Elasticsearch, Kafka, Prometheus, Grafana or HashiCorp Vault
  • Excellent interpersonal and communication (written and verbal) skills
  • Experience leading projects and/or people

Personal Qualities:

The work is often challenging and fast paced. We are looking for someone who has the following qualities:

  • Strong desire to be a member of interdisciplinary teams of intelligent, like-minded people to solve complex problems
  • Strong desire to work as a part of interdisciplinary teams of intelligent, like-minded people to solve complex problems
  • High level of attention to detail
  • Desire to be proactive, do things the right way and following best practices
  • Passion to learn new skills and technologies and keep up to date with industry developments
  • Naturally curious, innovative and enthusiastic with a capability to think orthogonally
  • Excellent time management skills
  • Desire to take on a high level of autonomy and responsibility
  • And most importantly, a small-company attitude: willingness to adapt to a variable role, wearing many different hats from day to day, and a great can-do attitude

Company Benefits

  • A 4% matched pension scheme
  • Growth share equity scheme
  • Quarterly discretionary bonus scheme
  • 25 days annual leave + UK Bank Holidays
  • Training, development, and mentoring schemes
  • Discounted gym membership and free daily exercise classes
  • Career growth and training opportunities
  • Private healthcare scheme with Vitality
  • Death in Service scheme
  • Regular social events
  • Electric Vehicle Scheme
  • Free weekly takeaway lunches at our Cambridge, London & Atlanta offices
  • Cycle to Work scheme
  • **Fridges packed full of edible treats and drinks for lunches and snacks

Interview Process

  • 1st stage - 30 min Zoom with the hiring manager
  • 2nd stage - 2 hour technical zoom with hiring manager and a member of the team
  • 3rd stage - 30 final call with Director

Other Jobs in DevOps & SysAdmins