Full-time · Remote

Senior Site Reliability Engineer

This is a pivotal SRE role at a high-growth startup that is defining the incident management category. It combines deep software engineering with infrastructure architecture, offering the chance to build the reliability foundation for a platform used by tech giants. It is a perfect fit for someone who wants to move away from 'maintenance' and into 'creation'.

R
Hiring company

Rootly

San Francisco, California, United States · Posted 16 April 2026

The role

Overview

This is a pivotal SRE role at a high-growth startup that is defining the incident management category. It combines deep software engineering with infrastructure architecture, offering the chance to build the reliability foundation for a platform used by tech giants. It is a perfect fit for someone who wants to move away from 'maintenance' and into 'creation'.

The hiring side

About Rootly

Rootly is a small business working in software development / incident management, based in San Francisco, California, United States.

Industry

Software Development / Incident Management

Size

Small

Location

San Francisco, California, United States

Go deeper

See Rootly the way an insider would

Unlock company research, key people to know, recent moves, and how this role fits into their wider picture.

Research Rootly

What they need

Requirements & Skills

Key Responsibilities

  • Embed with product squads to enhance service reliability and observability
  • Design and maintain robust CI/CD pipelines and monitoring systems
  • Develop automation tools to improve engineering velocity and reduce manual work
  • Lead capacity planning and infrastructure scaling efforts
  • Advocate for reliability and performance standards across the entire engineering organization

Essential

  • Minimum of 5 years in SRE, Platform, or Infrastructure Engineering roles
  • At least 5 years of professional software development experience in production environments
  • Deep technical expertise in cloud-native infrastructure and distributed systems
  • Proven track record of managing observability, performance tuning, and scaling strategies
  • Experience defining and managing Service Level Objectives (SLOs) and error budgets
  • Ability to write high-quality code to automate manual processes and eliminate toil

Preferred

  • Experience in a high-growth startup environment
  • Previous leadership or foundational role in an SRE team
  • Strong background in developer experience (DevEx) improvements

Key Skills

Cloud InfrastructureDistributed SystemsCI/CD Pipeline ArchitectureObservability & MonitoringAutomation & ToolingCapacity PlanningIncident Response ManagementPerformance Engineering

Networking

People to Know

Sign up to discover hiring managers, team leads, and key people at Rootly.

Perks

Benefits & perks

  • Radical transparency (monthly financial reviews)
  • Opportunity to work with high-profile global customers
  • Early-stage equity and leadership influence
  • Fast-paced, mission-driven work environment
  • Weekly public recognition of work via changelogs

Next step

Apply now

Found via www.opentoworkremote.com

We use cookies to improve your experience

We use essential cookies for functionality and analytics cookies to understand how you use Career Steer and improve our services. You can manage your preferences or learn more in our Privacy Policy.