Life in the Singularity

Life in the Singularity

Using Deep Learning and Python to Cure Diseases

Matt McDonagh's avatar
Matt McDonagh
Jun 30, 2024
∙ Paid

This is a follow-up piece, if you missed Part I dig in here for the baseline:

Predicting Medicines Into Existence

June 25, 2024
Predicting Medicines Into Existence

Read full story

Using Deep Learning to Invent Medicine

As you will recall from Part I, I am solo attempting one of the hottest Kaggle.com competitions right now. The game is predicting protein bonding affinity… will these molecules form or not?

Drug companies are crowdsourcing different methods (machine learning models mostly) for digesting SMILES data, creating representations of the data, using those to build a prediction machine (a model) and leveraging that digital machine to find the best candidate compounds to develop in real life.

Using math to save lives.

Since the dataset is massive and unbalanced I need to:

  1. work with a much smaller subset of the data to build my v1 pipeline

  2. ensure my tiny sample is representative of the population across 295,000,000 rows

  3. ensure my v1 pipeline can actually scale sufficiently to “see” enough of the training data to develop generalizability

If I build a super powerful machine at predicting …

User's avatar

Continue reading this post for free, courtesy of Matt McDonagh.

Or purchase a paid subscription.
© 2026 Matt McDonagh · Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture