Job Description
Overview
We are seeking a senior technical engineer as a foundational member of a cross-functional team that is tasked with maintaining the quality, availability, and reliability of the Clumio Data Platform. The ideal candidate will be self-motivated, an excellent communicator, have experience with Linux, Amazon Web Services, Kubernetes, Docker, Terraform, VMWare vSphere, IP Networking, ITIL Service Management, and strict Change Control procedures. This candidate should be very comfortable working in a fast-paced SDLC that includes the infrastructure-as-code model. The role will work closely with software engineering, quality assurance, customer success and product management teams to ensure the timely delivery and high availability of platform features and services. Through Event, Incident and Problem Management, this role will personally drive continuous improvement throughout the organization.
In this role you will
Work alongside Engineering and QA teams to ensure a stable and scalable infrastructure
Work with developers to drive their requirements for resources, capacity, configuration, deployment and monitoring
Work with senior management to develop project plans and drive tasks accordingly
Maintain escalation policies, incident communication, and follow-the-sun support between multiple geographically disperse virtual NOCs
Handle 24x7x365 Incident Management and drive Problem Management continuous improvements to enforce and maintain SLAs
Routinely survey and evaluate available technology options to improve processes, tooling, and monitoring
Collaborate with QA to help provide comprehensive coverage for software releases by building and maintaining suitable test environments
Collaborate with Sales to build and maintain demonstration/Proof of Concept environments
Interface with Customer Success to provide scope and detail for incident reports and maintenance activities
Acquire and maintain a thorough working knowledge of the products and services that are live and under development
Requirements
B.S. in Computer Science or equivalent experience.
Familiarity with ITIL practices
Experience documenting policies and detailed procedures for an ISMS (ISO/IEC 27001)
Familiarity with ISO27001 Annex A controls as they apply to cloud hosted SaaS/PaaS/IaaS operations
Demonstrated experience managing projects and delivering results in an ITIL framework
Extensive experience with cloud systems architecture, monitoring frameworks, network architecture
Experience using source control tools such as Git
Significant scripting (Python, Bash) experience
Experience with continuous integration platforms such as Jenkins
Experience with configuration management and orchestration tools such as Terraform or SaltStack
Occasional travel may be required