Senior Site Reliability Engineer - Observability

Squarespace

New York City, NY, US
  • Job Type: Full-Time
  • Function: IT
  • Post Date: 01/06/2021
  • Website: squarespace.com
  • Company Address: 225 Varick St 12th Floor, New York, NY, 10014

About Squarespace

Squarespace empowers millions of dreamers, makers, and doers by providing them with the tools they need to bring their creative ideas to life.

Job Description

The Site Reliability Engineering (SRE) teams are looking for experienced and passionate software engineers and systems engineers who have backgrounds in systems, programming and networking to help in ensuring that customers around the world can access Squarespace sites, that they load quickly, and that all of the features work. We work together with the product teams to maintain the reliability of our features that are backed by a fleet of microservices, with the infrastructure teams to grow our current data centers and build new ones, with the data teams to solve challenging “big data” issues, and the security teams to keep a close eye on the latest vulnerabilities and protect our product, infrastructure and networks.

The Site Reliability Engineering (SRE) teams are also responsible for designing the systems that allow us to operate on our data centers and evolve the massive Squarespace software engine that serves millions of users worldwide. We implement software for internal DNS management, IP address allocation, and Linux image and package repositories. We build tools for the provisioning of compute capacity on top of VMware, Kubernetes clusters or public clouds. We work together with Server Engineering teams to scale up our storage, database and search infrastructure, which ingests close to 2 TBs of data per day. Our internal products include source code management tools, clusters of build agents for frontend and server-side code, automated testing infrastructure, Docker image generation tools, and continuous deployment pipelines, all used by 150+ software engineers in the wider organization. We engineer ELK clusters that ingest 5 TB of logs daily, Prometheus deployments that process 14 million data points every minute, distributed tracing services that record 10K spans every second, and alerting frameworks that monitors thousands of servers and hundreds of software services. You will report to the team lead in NYC.

RESPONSIBILITIES

    Architect reliable ELK logging clusters, Prometheus deployments and open-telemetry compliant distributed tracing solutions.
    Write high-quality infrastructure-as-code that automates the provisioning, deployment, scaling, and monitoring of Squarespace’s infrastructure and ensure it's redundant and fast around the globe.
    Share our globally distributed on-call rotation and be an escalation point to assess complex problems in real-time across any of our high-traffic software systems.
    Work with a team of experienced engineers to test your ideas and understand the system, and mentor junior team members.

QUALIFICATIONS

    Programming experience in Java, C++, Python, Go, or Ruby. Deep experience managing large-scale software and distributed systems in *nix environments.
    An understanding of and experience with, web application development.
    A solid foundation in computer science, with competencies in data structures, algorithms, and software design practices.
    Understand database design, caching, scalability, and network fundamentals.
    5+ years of experience with Docker, Kubernetes, Prometheus, or other CNCF software is a big plus.
    BS, or MS degree in Computer Science or related technical field or equivalent industry experience.

About Squarespace

Squarespace makes beautiful products to help people with creative ideas succeed. By blending elegant design and sophisticated engineering, we empower millions of people — from individuals and local artists to entrepreneurs shaping the world’s most iconic businesses — to share their stories with the world. Squarespace’s team of more than 1,000 is headquartered in downtown New York City, with offices in Dublin and Portland. For more information, visit www.squarespace.com/about.

Benefits & Perks

    Health insurance with 100% premium covered for you and your dependent children
    Flexible vacation & paid time off
    Up to 20 weeks of paid family leave
    Equity plan for all employees
    Retirement benefits with employer match
    Fertility and adoption benefits
    Free lunch and snacks at all offices
    Education reimbursement
    Dog-friendly workplace in New York office
    Commuter benefit in the form of reduced tax (Ireland) and pretax (US)

Related Jobs

Senior Software Engineer - Android

Squarespace - New York City, NY, US

Business Systems Analyst

Squarespace - New York City, NY, US

Senior Product Insights Analyst

Squarespace - New York City, NY, US

Senior Marketing Manager, Australia

Squarespace - Remote

Senior Site Reliability Engineer - Observability

Squarespace - New York City, NY, US
Disclaimer: Local Candidates Only
This company does NOT accept candidates from outside recruiting firms. Agency contacts are not welcome.