Job Description :

Data Scientist

Remote

Project Description:

Next-generation Artificial Intelligence for Genomics will use more complex datatypes and be applied to new crop contexts. We need a Data Scientist with demonstrated expertise in designing, training, and evaluating transformers such as BERT and its derivatives.

Responsibilities:

Refactor our AI codebase to enable new features and training/inference contexts. Redesign neural network architecture to incorporate new sequencing datatypes. Collaborate with genomics experts to define and pre-process training and validation datasets appropriate for the tasks of interest. Execute training on multiple sequencing datasets. Collaborate with Machine Learning Engineers to deploy trained models at scale.

Skills/Experience:

Required: Proficiency with Python, pyTorch, Docker, Kubernetes, Jupyter. Expertise in Deep Learning, Transformers, Natural Language Processing, Large Language Models.

Preferred: Experience with genomics data, molecular genetics. Distributed computing tools like Ray, Dask, Spark.

             

Similar Jobs you may be interested in ..