Data Scientist, Computational Biology

Cambridge, MA

Full Time

Platform & Target Discovery and Validation

Mid Level

Data Scientist, Computational Biology

Repertoire Immune Medicines is a biotechnology company working to unlock and direct the remarkable power of the human immune system to treat cancer and autoimmune disease. The company was founded on the belief that understanding the repertoire of T cell receptor (TCR)-antigen immune synapses that maintain health and drive disease represents one of the greatest opportunities for innovation in medical science. Repertoire scientists created and developed the DECODE^TM platform, which allows in-depth characterization of TCR-antigen pairs, and the ability to deploy this information in the form of novel targeted immune medicines to fundamentally reprogram the immune system to kill tumors or induce immune homeostasis.

From its sites in Cambridge, Massachusetts and Zurich, Switzerland, Repertoire’s team is advancing a pipeline of DECODE-enabled immune medicines. For cancer, we are developing a pipeline of TCR bispecifics molecules for treatment of multiple cancer types. In addition, we are developing a pipeline of mRNA tolerizing vaccines for treatment of autoimmune diseases.

Repertoire was founded by Flagship Pioneering and is supported by a strong investor base. In addition, the company recently entered a strategic partnership with Bristol Myers Squibb to develop tolerizing vaccines for up to three autoimmune diseases.

Role Overview

Repertoire Immune Medicines is seeking a talented Data Scientist to enable the discovery of new insights from our extensive and growing DECODE immune synapse database. The successful candidate will work at the interface of data analytics, data mining, statistics, bioinformatics, and machine learning with broad impact across early discovery, candidate development, and biomarker discovery. In addition, this role is responsible for analyzing large, multi-dimensional datasets and developing methods to identify and visualize signal in noise.

The selected candidate will be a part of the Computational Analytics Team, working alongside Computational Engineering, and interfacing directly with experimentalists in Platform Discovery, Immunology, and Protein Sciences to scope, build, and implement computational solutions. The ability to work in a fast-paced, highly collaborative environment will be critical to success, as well as the ability to communicate effectively across various teams. The Data Scientist will take ownership of challenging projects and approach problems systematically to achieve robust solutions. This person is a team player who contributes positively to group and company culture.

Key Responsibilities

Use statistical techniques to find relationships in complex biological data.
Develop, evaluate, and implement robust analytical methods/models/workflows/apps as needed for in-house discovery and development.
Use analytical methods to identify patterns, signals, and features in highly multiplexed experimental assay data.
Assist in the conception, development, optimization, and assessment of machine-learning models.
Maintain familiarity with scientific literature to assist in the development and benchmarking of new methods.
Build and deploy visualizations and user interfaces to be used by wet lab scientists.
Support various teams for the processing and interpretation of next-generation sequencing (NGS) data and ensure timely delivery of results.
Maintain high-quality documentation of work and discoveries, creating written reports, electronic lab notebooks, technical presentations for internal or external audiences, internal database records, code comments, and software documentation.
Communicate key data insights to various audiences within R&D, as well as continuous project status updates, setbacks, and modification of strategy.
Manage and execute multiple projects in across matrixed teams, working with leadership to meet short timelines while maintaining scientific rigor.
Seek out external resource and expertise when required.

Required Qualifications

Master’s degree in Data Science, Bioinformatics, Computational Biology, Machine Learning, Statistics, Mathematics, Physics, and 3+ years of professional experience. PhD preferred.
Extensive experience working with multi-dimensional datasets.
Extensive experience with Python analysis modules including pandas, numpy, scipy.
Experience performing principal component analysis, multi-variate regressions, ANOVA, Bayesian statistics and/or other statistical methods in a biological field to identify relevant parameters and/or outcomes.

Preferred Qualifications

Experience processing and/or building pipelines for next-generation sequencing data including gene expression, whole exome, TCR, single-cell.
Familiarity with machine-learning model development, optimization, and assessment.
Familiarity with the development of deep generative models (e.g., autoregressive models, VAEs, CNNs, GANs, etc.).
Demonstrated expertise in core coding environments including Python, R, SQL, bash scripts.
Experience working in cloud computing environments.
Experience with AI frameworks like TensorFlow, Keras, PyTorch, or sklearn.

Experience with python Streamlit or R Shiny apps
Experience with data visualization packages like matplotlib, seaborn, plotly, altair, ggplot2.

Repertoire is committed towards social responsibility and developing an inclusive culture. Much as the power of the immune system lies in the diversity of T and B cells, our work requires the creativity and ingenuity of a diverse workforce. We believe in actively pursuing equity in all facets of the work experience at Repertoire. We will continue to educate ourselves about the inequities and barriers present in our society and take action as a company where we can make a difference.

Repertoire is proud to be an Equal Opportunity Employer.

Recruitment & Staffing Agencies: Repertoire Immune Medicines (“Repertoire”) does not accept unsolicited resumes from any source other than candidates. The submission of unsolicited resumes by recruitment or staffing agencies to Repertoire or its employees is strictly prohibited unless contacted directly by Repertoire’s internal Human Resources team. Any resume submitted by an agency in the absence of a signed agreement will automatically become the property of Repertoire, and Repertoire will not owe any referral or other fees with respect thereto.

Apply for this position

Required*

Apply with Indeed

First Name*

Last Name*

Email Address*

Phone*

Address*

Resume*

We've received your resume. Click here to update it.

Attach resume or Paste resume

Attach resume as .pdf, .doc, .docx, .odt, .txt, or .rtf (limit 5MB) or Paste resume

Paste your resume here or Attach resume file

Cover Letter

What's your highest level of education completed?

LinkedIn Profile URL:

Earliest start date?

Are you authorized to work in the United States?*

Will you now or in the future require sponsorship for employment visa status? (e.g., H-1B visa status?)*

If you were referred by a current Repertoire employee, please provide the employee's name here:

We are only evaluating candidates who currently reside in reasonable commuting distance to our office in Cambridge, MA. Does your current location meet this requirement?* YesNo

The following questions are entirely optional.

To comply with government Equal Employment Opportunity and/or Affirmative Action reporting regulations, we are requesting (but NOT requiring) that you enter this personal data. This information will not be used in connection with any employment decisions, and will be used solely as permitted by state and federal law. Your voluntary cooperation would be appreciated. Learn more.

Gender

Race/Ethnicity

Human Check*

Submit Application

Thanks for visiting our Career Page.

Please review our open positions and apply to the positions that match your qualifications.

About Repertoire Immune Medicines

Data Scientist, Computational Biology

Apply for this position