Full-time · Remote

Senior Site Reliability Engineer

This is a pivotal SRE role at a high-growth startup that is defining the incident management category. It combines deep software engineering with infrastructure architecture, offering the chance to build the reliability foundation for a platform used by tech giants. It is a perfect fit for someone who wants to move away from 'maintenance' and into 'creation'.

Hiring company

Rootly

San Francisco, California, United States · Posted 16 April 2026

The role

Overview

The hiring side

About Rootly

Rootly is a small business working in software development / incident management, based in San Francisco, California, United States.

Industry

Software Development / Incident Management

Size

Small

Location

San Francisco, California, United States

Go deeper

See Rootly the way an insider would

Unlock company research, key people to know, recent moves, and how this role fits into their wider picture.

Research Rootly

What they need

Requirements & Skills

Key Responsibilities

Embed with product squads to enhance service reliability and observability
Design and maintain robust CI/CD pipelines and monitoring systems
Develop automation tools to improve engineering velocity and reduce manual work
Lead capacity planning and infrastructure scaling efforts
Advocate for reliability and performance standards across the entire engineering organization

Essential

Minimum of 5 years in SRE, Platform, or Infrastructure Engineering roles
At least 5 years of professional software development experience in production environments
Deep technical expertise in cloud-native infrastructure and distributed systems
Proven track record of managing observability, performance tuning, and scaling strategies
Experience defining and managing Service Level Objectives (SLOs) and error budgets
Ability to write high-quality code to automate manual processes and eliminate toil

Preferred

Experience in a high-growth startup environment
Previous leadership or foundational role in an SRE team
Strong background in developer experience (DevEx) improvements

Key Skills

Cloud InfrastructureDistributed SystemsCI/CD Pipeline ArchitectureObservability & MonitoringAutomation & ToolingCapacity PlanningIncident Response ManagementPerformance Engineering

Networking

People to Know

Hiring influence

James Mitchell

Head of Talent Acquisition

Over 12 years leading hiring strategy across EMEA. Oversees all senior-level recruitment and partners closely with department heads on headcount planning.

Hiring influence

Sarah Reynolds

Engineering Team Lead

Manages a team of 8 engineers and is closely involved in technical interviews. Previously scaled engineering teams at two high-growth startups.

Hiring influence

Alex Kim

Senior Product Manager

Drives product roadmap and cross-functional collaboration. Regularly involved in hiring for product and design roles.

Perks

Benefits & perks

Radical transparency (monthly financial reviews)
Opportunity to work with high-profile global customers
Early-stage equity and leadership influence
Fast-paced, mission-driven work environment
Weekly public recognition of work via changelogs

Next step

Apply now

Apply on www.opentoworkremote.com

Found via www.opentoworkremote.com

Career Steer

Senior Site Reliability Engineer

Overview

About Rootly

Requirements & Skills

Key Responsibilities

Essential

Preferred

Key Skills

People to Know

James Mitchell

Sarah Reynolds

Alex Kim

Benefits & perks

Apply now

We use cookies to improve your experience