Data Scientist

Catalog

Boston, MA, US
  • Job Type: Full-Time
  • Function: Data Science
  • Post Date: 06/16/2021
  • Website: catalogdna.com
  • Company Address: 127 Western Ave, Boston, MA, 02134

About Catalog

Catalog is bringing cutting-edge synthetic biology technologies to the world of information storage and computation.

Job Description

At Catalog, we are building the world's first massive data storage and computing platform using DNA as the storage medium. As an information storage medium, DNA is unique in its stability over millennia (think fossils), information density (gigabits in a cellular volume), and replicability (billions of copies, made cheaply and efficiently). These properties make DNA an attractive medium for high-density storage of latency-tolerant data and for massively parallel computation, among many other groundbreaking applications. To this end, we are developing a novel exabyte-scale DNA computing platform, and we are looking for motivated engineers to join us.

In this role, you will use statistical analysis and modeling to help our team interpret the performance data gathered from our write and read pipelines.

What you want:

  • The chance to work with the world’s fastest DNA data storage writer

  • An opportunity to define the analysis pipeline for DNA data storage and computing

  • The challenge of modeling large heterogeneous data about software, hardware, and chemistry

  • A culture of excellence embedded in a collegial and supportive environment

  • Responsibility, leadership, and trust

What you will do:

  • Work with a cross-disciplinary team of scientists and engineers to analyze data gathered from our core pipelines

  • Develop statistical models to predict performance, troubleshoot experiments, and answer “what if” questions

  • Develop simulators to support strategic proof-of-concept demonstrations in DNA storage and computing

  • Define and implement an automated data analysis pipeline to complement our core pipelines

  • Apply machine learning to address analysis bottlenecks

  • Communicate results to a cross-disciplinary audience

What we need:

  • Experience in developing statistical models to explain hardware, chemical, or biological data

  • Experience in developing cloud-based analysis pipelines in Python or a similar language

  • Experience with analysis of next-generation sequencing (NGS), imaging, or sensor data

  • Facility with at least one cloud-based machine learning framework

  • Excellent oral and written communication skills

  • An MS or PhD in applied statistics or a related field

  • Experience in a non-academic, non-research environment

At Catalog, we value perseverance, leadership, teamwork, science, and communication. You will have the opportunity to shape the future of an entirely new way of manipulating information, and to formulate and solve challenging computing problems arising in this uncharted territory. You will work and learn with a team of talented scientists, engineers, and entrepreneurs, with the chance to wear multiple hats and grow with us.

We hope that you will consider joining us for a chat; we promise it will be intriguing.

Disclaimer: Local Candidates Only
This company does NOT accept candidates from outside recruiting firms. Agency contacts are not welcome.