Senior Site Reliability Engineer
Remote - Referral Board · Remote
📍 Remote💰 $53,300via greenhousePosted 2026-06-26
Apply on company site ↗
CareerRiver pulls this listing straight from the employer's hiring system — no recruiter middleman, no reposts. Applying takes you directly to Remote - Referral Board.
About Remote
Remote is solving modern organizations’ biggest challenge – navigating global employment compliantly with ease. We make it possible for businesses of all sizes to recruit, pay, and manage international teams. With our core values at heart and future focused work culture, our team works tirelessly on ambitious problems, asynchronously, around the world. You can find Remoters working from 6 different continents (Antarctica left to go!) and all of our positions are fully remote.
With Innovation as one of the core values, we have built Automation and AI capabilities into the requirements for every role.
We encourage every member of the Remote team to bring their talents, experiences and culture to the table to help us build the best-in-class HR platform.
If you are energetic, curious, motivated and ambitious, be part of our world. Apply now and define the future of work! This position
As a Senior SRE at Remote, you'll work with a high degree of autonomy on complex reliability and platform problems, owning the plan and execution of features and projects within our SRE/Platform domain. You'll contribute to the platform's architecture and reliability strategy, translating ambiguous requirements into robust, maintainable solutions and raise the technical bar of the engineers around you while collaborating closely with product and security teams in an async-first, fully remote environment.
You'll work AI-natively day to day and build reusable AI workflows that make the whole team faster and more reliable, not just yourself.
What you’ll bring
Solid professional experience in SRE, DevOps, or Platform Engineering.
Solid hands-on Kubernetes: operating and scaling production clusters and container tooling (Docker) and its ecosystem.
Experience building and managing cloud infrastructure on AWS (or similar).
Strong infrastructure-as-code practice with Terraform.
Experience with reliability frameworks: SLOs, SLIs, error budgets, alerting strategies.
Solid observability background: OpenTelemetry, Grafana/Prometheus or similar.
Proficiency with CI/CD (GitLab CI, GitHub Actions, or similar) and deployment automation.
Comfortable with Golang, Bash/scripting; broader programming a plus.
Practical, embedded use of AI in infra/ops/dev work, agentic workflows with concrete, observable results, not just familiarity with the tools.
Clear and thoughtful communication, especially in an async-first, global setting
Proactive, curious, and comfortable taking ownership of challenges
Collaborative and respectful across cultures, time zones, and backgrounds
Nice to have
Experience with 1 back-end programming language (Elixir, Nodejs, Python, etc)
Experience running and configuring Linux systems in a non-cloud environment
Security knowledge and capabilities from a defensive and offensive standpoint
What you’ll do
Lead solution discovery and delivery for reliability and infrastructure problems with real ambiguity, complexity, or scope. Autonomously, coordinating with other contributors where needed.
Contribute to the platform's architecture, tooling, and roadmap. Influence team priorities and advocate for technical initiatives.
Help define and operate reliability practices for our platform: SLOs/SLIs, error budgets, alerting, observability. Take responsibility for the team's operational stance, using support/incident metrics to shape technical strategy.
Resolve cross-team requests, identify systemic issues, and turn recurring ones into reusable fixes and runbooks rather than one-off answers.
Work AI-natively and operationalise it for the team: use agentic workflows by default; build reusable prompts, skills, and tooling embedded in the codebase so others ship faster, safely; design agent-ready systems (clean interfaces, good observability) that make AI-assisted changes easy to review. Establish shared standards and domain-level guardrails (secure-by-default patterns, CI protections, AI-assisted review practices).
Mentor and give timely, actionable feedback to less-senior engineers; participate in hiring, onboarding, and RFC discussions.
Collaborate with Security on platform hardening and threat mitigation; contribute to capacity and cost-efficiency of the infrastructure.
Participate in incident response and on-call rotations to rapidly resolve issues and maintain system reliability.
Practicals
You'll report to: SRE Team Lead
Team: Engineering
Location: For this hire, due to diversity and timezones requirements, we’re prioritising Europe
Start date: As soon as possible
Application process
Interview with recruiter
Interview with HM
(async) Infrastructure exercise (you're not expected to spend more than 2 - 4 hours)
Interview with the team (without any manager in the call so you can really get to know the people and ask all you want to ask)
Bar Raiser Interview
Executive Interview
Offer + Background check ( Veremark & Remote )
#LI-DNP
Remote's Total Rewards philosophy is to ensure fair, unbiased compensation and fair equity pay along with competitive benefits in all locations in which we operate. We do not agree to or encourage cheap-labor practices and therefore we ensure to pay above in-location rates. We hope to inspire other companies to support global talent-hiring and bring local wealth to developing countries.
At first glance our salary bands seem quite wide - here is some context. At Remote we have international operations and a globally distributed workforce. We use geo ranges to consider geographic pay differentials as part of our global compensation strategy to remain competitive in various markets while we hiring globally.
Our salary ranges are determined by role, level and location, and our job titles may span more than one career level. The actual base pay for the successful candidate in this role is dependent upon many factors such as location, transferable or job-related skills, work experience,
More Remote jobs
Remote jobs · Browse all locations