New bio data + AI businesses

Concept

Unlocking the new bio data sources with full-stack diagnostics and model building

Longer Description

The next generation of life science giants will not be hardware vendors, but full-stack data and insight companies. Historically, bio hardware manufacturers (selling sequencers and reagents) have struggled to capture value compared to the diagnostic companies that utilize that hardware to generate patient insights. This is evidenced by the disparity in value accrual between hardware pure-plays (e.g., PacBio, Oxford Nanopore) and diagnostic/insight platforms (e.g., Natera, Tempus, BillionToOne, Caris, Adaptive).

Most bio-AI companies train on the same public or weakly differentiated databases (protein structures, DNA sequences, transcriptomes), leading to converging model performance. To break this ceiling, frontier ML companies require a change in data inputs. We’re specifically interested in the unlocking of proteomics and metabolomics. These modalities are functionally more relevant to bodily function in health and disease but are harder to collect at scale.

However, with new, scaled data types coming online in the next 10 years, we believe it’s time to make new businesses for proteomics and metabolomics data than those we’ve had for genomics. Specifically unlocking new data should be productized and used to make net new insights.

Other thoughts

This company will be uniquely difficult to build as there will be different phases. Excellent hardware, commercial, and ML talent is needed at every phase of the company development. Normally the three of these talent pools aren’t under the same company so organization structure and culture will be crucial.
Price compression of proteomics and metabolomics sequencing might be difficult without selling optimized hardware.

Comparable Companies

Tempus
Caris
Adaptive
Natera
Illumina
PacBio
Olink

Concept

Unlocking the new bio data sources with full-stack diagnostics and model building

Longer Description

Other thoughts

This company will be uniquely difficult to build as there will be different phases. Excellent hardware, commercial, and ML talent is needed at every phase of the company development. Normally the three of these talent pools aren’t under the same company so organization structure and culture will be crucial.
Price compression of proteomics and metabolomics sequencing might be difficult without selling optimized hardware.

Comparable Companies

Tempus
Caris
Adaptive
Natera
Illumina
PacBio
Olink

New bio data + AI businesses

Concept

Longer Description

Other thoughts

Comparable Companies

Related Reading

Related Theses

Marketplaces Requiring Private Intelligence

Ozempic for Sleep

Cancer Vaccines

New bio data + AI businesses

Concept

Longer Description

Other thoughts

Comparable Companies

Related Reading

Related Theses

Marketplaces Requiring Private Intelligence

Ozempic for Sleep

Cancer Vaccines