Senior DevOps Engineer

C3

Redwood City, CA, US
  • Job Type: Full-Time
  • Function: IT
  • Post Date: 05/05/2021
  • Website: c3.ai
  • Company Address: 1300 Seaport Boulevard Suite 500, Redwood City, CA, 94063

About C3

C3.ai is a leading enterprise AI software provider for accelerating digital transformation. C3.ai delivers a comprehensive and proven set of capabilities for rapidly developing, deploying, and operating large scale AI, predictive analytics, and IoT applications for any enterprise value chain in any industry. The C3 AI Suite and C3.ai applications are proven and tested at petabyte scale, solving previously unsolvable challenges. At the core of the C3 AI Suite is a revolutionary and powerful model-driven AI architecture that dramatically enhances the productivity of data scientists and application developers while future-proofing applications against underlying IT evolution.

Job Description

C3.ai is a leading enterprise AI software provider for accelerating digital transformation. The comprehensive and proven C3 AI Suite uses a model-driven abstraction layer to enable organizations to develop, deploy, and operate enterprise scale AI applications 40x to 100x faster than alternative approaches. www.c3.ai 

C3.ai is hiring Cloud Computing DevOps Engineers at our beautiful campus in Redwood City, California. In this role, you will work alongside a tight-knit and talented engineering team to build and deploy AI applications for our customers. You will use your expertise to solve complex challenges and support the core of the C3.ai Platform. 

Meaningful work. Top technology. An award-winning culture and talented team. Join us at C3.ai! 

Your Responsibilities: 

  • Develop and test the cloud infrastructure to scale a rapidly growing C3.ai ecosystem
  • Design, deploy, and manage a massive scale, highly available, fault tolerant, multi-tenant SAAS product 
  • Develop and maintain Continuous Integration (CI)/Continuous Delivery (CD) pipelines on kubernetes 
  • Tier 1 point of escalation from Support for any service availability challenges reported by multitenant customers 
  • Improve automated cloud configuration, deployments, monitoring, management and incident response to support enterprise grade multi-tenant systems
  • Work cross-functionally with various teams to improve C3.ai infrastructure through automation
  • Build internal tools to demonstrate performance and operational efficiency
  • Work with other teams to resolve issues related to application configuration, deployment, or debugging
  • Provide documentation and training of duties to Operations, new staff and related groups
  • Provide system administration, configuration, and troubleshooting of the Linux environment

 Requirements: 

  • Bachelor’s degree in Computer Science, Electrical Engineering, or related field
  •  5+ years' experience in high-availability large-scale Kubernetes cluster deployments, operation, monitoring and maintenance  
  • Strong experience with log monitoring and management with tools including but not limited to, Splunk or Elastic 
  • Strong experience in automating Continuous Integration (CI) and Continuous Deployment (CD) and release management using tools such a Jenkins and Docker Registry 
  • Strong experience with metric monitoring and alerting tools such as Prometheus and Grafana 
  • Strong experience with Python, Bash, Jscript and automation tools (Chef, Puppet, Ansible,  etc.) 
  • 3+ years of experience with using and developing technologies like Cassandra, Spark, Relational Databases, Postgres, RedShift and Docker 
  • Experience with incident response automation tools such as PagerDuty 
  • Experience with Application Performance Monitoring principles and related tools such as NewRelic, Dynatrace, or AppDynamics 
  • Experience with Amazon EKS/Azure AKS or Openshift/Rancher is a plus 
  • Experience with securing cloud environments and monitoring for security breaches 
  • Experience with monitoring and reporting on cloud spend 
  • Experience reporting on Service Level Agreement (SLA) performance metrics such as service up time 
  • Proficiency in Linux administration, configuration, and automation tools
  • Working knowledge of Cloud platforms (AWS, Azure, Google and Cloud Platform)
  • Knowledge of performance benchmarking and diagnostic tools
  • Rigor in high code quality, automated testing, and other engineering best practices

Preferred 

  • Master’s degree in Computer Science, Electrical Engineering, or related field
  • Experience with Scala and Spark 
  • Working experience in deploying Spark on Kubernetes  

C3.ai provides a competitive compensation package and excellent benefits including:

  • Competitive salary, generous stock options, 401K, medical, dental, and vision benefits. At the office, we offer a fully stocked kitchen with catered breakfast and lunch, table tennis and pool table, free membership at our on-site gym, Friday evening social hours with food, drink and music and a fun team of great people.

C3.ai is proud to be an Equal Opportunity and Affirmative Action Employer. We do not discriminate on the basis of any legally protected characteristics, including disabled and veteran status.

Related Jobs

Forward Deployed Engineer (Federal)

C3 - Tysons Corner, VA, US

Site Reliability Engineer - Federal

C3 - Redwood City, CA, US

Site Reliability Engineer

C3 - Redwood City, CA, US

Software Engineer, Full-Stack

C3 - Redwood City, CA, US

Software Engineer - Spark

C3 - Redwood City, CA, US
Disclaimer: Local Candidates Only
This company does NOT accept candidates from outside recruiting firms. Agency contacts are not welcome.