Bioinformatics Engineer, Pipelines

Mithrl Inc

San Francisco, CA

JOB DETAILS
SKILLS
Amazon Web Services (AWS), Analysis Skills, Artificial Intelligence (AI), Bioinformatics, Biotech and Pharmaceutical, Cloud Computing, Computational Engineering, Data Cleaning, Data Processing, Docker, Genomics, Health Plan, Medical Products, Metadata, Metadata Identification, Modality, Painting (Facilities and Maintenance), Patents, Preferred Provider Organization (PPO), Problem Solving Skills, Python Programming/Scripting Language, Quality Control, R Programming Language, Research & Development (R&D), Sales Pipeline, Science Software, Spatial Data, Startup, Wideband Gapfiller Satellites (WGS)
LOCATION
San Francisco, CA
POSTED
30+ days ago

ABOUT MITHRLWe imagine a world where new medicines reach patients in months, not years, and where scientific breakthroughs happen at the speed of thought.Mithrl is building the world's first commercially available AI Co-Scientist. It is a discovery engine that transforms messy biological data into real insights in minutes. Scientists ask questions in natural language, and Mithrl responds with analysis, novel targets, hypotheses, and patent-ready reports.Our traction speaks for itself:12X year-over-year revenue growthTrusted by leading biotechs and big pharma across three continentsDriving real breakthroughs from target discovery to patient outcomes.ABOUT THE ROLEWe are looking for a Lead Bioinformatics Pipeline Engineer to build and scale Mithrl's multi modal scientific processing pipelines. You will own the workflows that transform raw biological data into clean, reproducible outputs that power Mithrl's AI Co-Scientist. These workflows include microarray, imaging, spatial transcriptomics, genomics, epigenomics, flow cytometry, and more.This role sits at the center of our technical stack. You will architect Nextflow and nf-core style pipelines, implement modality-specific validation and QC layers, and collaborate with the Tabular Data Team and Knowledge Curation Team to ensure downstream data harmonization, variable ID mapping, and schema alignment. Your work ensures that scientists can ask questions and receive accurate data-backed answers instantly.If you enjoy building robust scientific workflows and want to work on high impact problems, you will thrive here.WHAT YOU WILL DODesign and maintain production grade bioinformatics pipelines for a wide range of data modalities, including microarray, cell painting, WGS and WES, spatial transcriptomics, flow cytometry, ATAC-seq, and methyl-seqBuild workflows using Nextflow, nf-core modules, or similar engines with a focus on reproducibility, validation, and scalabilityImplement quality control, validation, and provenance tracking for all supported modalitiesCollaborate with the Tabular Data Team to ensure pipeline outputs map cleanly into Mithrl's internal schemas, including variable ID coercions, metadata normalization, and feature name harmonizationWork with the Knowledge Curation Team to align outputs with reference genomes, annotations, and biological ontologiesProduce structured output artifacts so users can download processed data and supporting metadata directly through the platformWHAT YOU BRINGRequired Qualifications6 to 8 years of experience in bioinformatics workflow engineering or computational biologyStrong experience with Nextflow, nf-core, WDL, CWL, Snakemake, or similar workflow systemsProficiency in Python or R for data processing, QC, and pipeline logicHands-on experience building pipelines for multiple biological data types, including genomics, single cell, imaging, flow cytometry, spatial data, or epigenomicsAbility to design pipelines that are reproducible and containerized using Docker or SingularityStrong understanding of secondary and tertiary data layers and how they integrate with downstream analysis systemsExperience integrating pipeline outputs with data stores, schemas, or ML-ready formatsNice to HaveExperience executing pipelines in cloud environments such as AWS Batch, ECS, Tower, or Nextflow CloudExperience with imaging workflows such as CellProfiler, DeepCell, or SquidpyFamiliarity with genomic reference databases, annotation formats, and biological ontologiesPrevious work in a tech bio startup, biotech R&D group, or scientific software companyWHAT YOU WILL LOVE AT MITHRLYou will build the core pipelines that transform raw biological data into insights used by the AI Co-ScientistTeam: Join a tight-knit, talent-dense team of engineers, scientists, and buildersCulture: We value consistency, clarity, and hard work. We solve hard problems through focused daily executionSpeed: We ship fast (2x/week) and improve continuously based on real user feedbackLocation: Beautiful SF office with a high-energy, in-person cultureBenefits: Comprehensive PPO health coverage through Anthem (medical, dental, and vision) + 401(k) with top-tier plansWe encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if youre interested in this work. We think AI systems like the ones were building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.

About the Company

M

Mithrl Inc