Site Reliability Engineer


  • Job Type: Full-Time
  • Function: IT
  • Post Date: 06/08/2021
  • Company Address: 5913 Blackwelder St, Culver City, CA, 90232, US

About Replicated

Replicated is the modern way to ship on-prem software. We give SaaS vendors a container-based platform for easily deploying their cloud-native applications inside customers' environments. Because security and control still matter.

Job Description

Replicated is looking for a Site Reliability Engineer to help us improve the automation and resiliency of our software infrastructure. We build tools to help other software developers deliver private on-premise versions of their software to enterprise customers. These tools include automated cluster installers, container image registries, automated troubleshooting collectors, vendor dashboards, and microservices to connect them all. We run our Kubernetes infrastructure in the public cloud and have invested heavily in automation. We are looking to expand that investment with additional cloud-native tools.
The culture at Replicated is entrepreneurial, fast-paced, and engineering-oriented. We value cutting-edge technology and aim to get the most out of emerging open source standards. Our engineering team works every day to solve complex problems and develop world-class software. You will work with the core engineering team to continuously improve the observability and reliability of the services we deploy. This role is perfect for someone who has developed some technical depth in a few areas and is eager to go a lot deeper (particularly with Kubernetes).
About Replicated:
We help software companies serve the needs of enterprise customers. Our mission is to shift the landscape of enterprise software, helping organizations of all sizes modernize their application architecture and management. Our tools enable vendors to efficiently ship and manage on-premise, self-hosted versions of their software. These products have had great momentum in the market, attracting customers like HashiCorp, CircleCI, Gradle, and many others. Half of the Fortune 100 deploy applications using our platform.

What you’ll do:
  • Manage the Replicated software infrastructure and the Kubernetes clusters it runs on
  • Improve the observability and resiliency of our systems
  • Write code and help support new services, products, and features
  • Help Replicated customers solve challenging problems with their applications and the clusters they’re deploying to
  • Occasional on-call support

You should have:
  • 1+ years of experience as a developer, site reliability engineer, or DevOps engineer
  • Strong command of at least one programming language
  • Experience with Linux systems administration
  • Excellent technical communication skills
  • Passion for software engineering and automation

Other things we’re looking for:
  • Experience with Kubernetes, Docker, and containers
  • Experience with Terraform, Ansible, CloudFormation, SaltStack, Puppet, Chef, or other infrastructure automation tools
  • Experience with CNCF tools
  • Contributions to open source projects or involvement with OSS communities
  • Development experience in Go, TypeScript, Bash, Node.js, JavaScript, or Python

Replicated is committed to cultivating an efficient, respectful workplace. We provide strong health and dental benefits and mandatory vacation, and we are eager to advance the skills and careers of our employees. We encourage applicants of all backgrounds, and we work to make sure that all team members have an equal opportunity to succeed.

Disclaimer: Local Candidates Only
This company does NOT accept candidates from outside recruiting firms. Agency contacts are not welcome.