276: AlphaGenome: 1-Mb multimodal deep model predicts regulatory variant effects including splicing and TAL1 mechanisms episode artwork

EPISODE · Jan 30, 2026 · 19 MIN

276: AlphaGenome: 1-Mb multimodal deep model predicts regulatory variant effects including splicing and TAL1 mechanisms

from Base by Base · host Gustavo Barra

Avsec et al., Nature, doi:10.1038/s41586-025-10014-0 - AlphaGenome, a 1 Mb DNA deep‑learning model, predicts base‑pair‑resolution genome tracks (RNA‑seq, splicing, chromatin) and scores variant effects, achieving state‑of‑the‑art performance across modalities. Key terms: AlphaGenome, splicing, eQTL, chromatin-accessibility, 1Mb-sequence. Study Highlights:AlphaGenome is a unified sequence‑to‑function deep learning model trained on human and mouse genomes that consumes 1 Mb of DNA and predicts 5,930 human genome tracks across 11 modalities using a U‑Net‑inspired encoder, transformer tower and decoder. The model was pretrained with fold splits and distilled into a single student model for efficient variant scoring, enabling base‑pair resolution outputs and splice junction prediction alongside splice site usage and RNA‑seq coverage. Quantitatively, AlphaGenome outperformed or matched external models on 22 of 24 genome track tasks and on 25 of 26 variant effect benchmarks, improving eQTL sign prediction and QTL effect correlations. The multimodal outputs enable mechanistic interpretation of variants, for example recapitulating oncogenic TAL1 enhancer mutations and identifying splice‑disrupting variants. Conclusion:AlphaGenome provides a unified 1‑Mb multimodal, base‑resolution sequence model that substantially improves genome track and regulatory variant effect prediction and enables mechanistic, cross‑modality interpretation. QC:This episode was checked against the original article PDF and publication metadata for the episode release published on 2026-01-30. QC Scope:- article metadata and core scientific claims from the narration- excludes analogies, intro/outro, and music- transcript coverage: Audited the transcript portions presenting AlphaGenome architecture, multimodal outputs, benchmarking, splicing, TAL1 mechanism, 3D genome, in silico mutagenesis, and limitations.- transcript topics: AlphaGenome architecture (1 Mb input, U‑Net backbone, transformers, sequence parallelism); Multimodal genome tracks (5,930 human tracks, 1,128 mouse tracks across 11 modalities); Variant effect benchmarking (26 tasks; 25/26 ahead); Splicing variant predictions (splice sites, splice junctions, usage); TAL1 oncogene mechanism (neo-enhancer, MYB motif); 3D genome contact maps predictions QC Summary:- factual score: 10/10- metadata score: 10/10- supported core claims: 8- claims flagged for review: 0- metadata checks passed: 4- metadata issues found: 0 Metadata Audited:- article_doi- article_title- article_journal- license Factual Items Audited:- AlphaGenome takes input of 1 Mb DNA and predicts thousands of genome tracks across 11 modalities at base-pair resolution- 5,930 human genome tracks and 1,128 mouse genome tracks predicted- AlphaGenome matched or outperformed strongest external models on 25 of 26 variant-effect benchmarks- Splicing variant predictions include splice sites, splice junctions, and splice site usage- TAL1 oncogene mechanism recapitulated via a neo-enhancer and MYB motif- 3D genome folding predicted via contact maps with improved performance over specialized tools QC result: Pass.

Avsec et al., Nature, doi:10.1038/s41586-025-10014-0 - AlphaGenome, a 1 Mb DNA deep‑learning model, predicts base‑pair‑resolution genome tracks (RNA‑seq, splicing, chromatin) and scores variant effects, achieving state‑of‑the‑art performance across modalities. Key terms: AlphaGenome, splicing, eQTL, chromatin-accessibility, 1Mb-sequence. Study Highlights:AlphaGenome is a unified sequence‑to‑function deep learning model trained on human and mouse genomes that consumes 1 Mb of DNA and predicts 5,930 human genome tracks across 11 modalities using a U‑Net‑inspired encoder, transformer tower and decoder. The model was pretrained with fold splits and distilled into a single student model for efficient variant scoring, enabling base‑pair resolution outputs and splice junction prediction alongside splice site usage and RNA‑seq coverage. Quantitatively, AlphaGenome outperformed or matched external models on 22 of 24 genome track tasks and on 25 of 26 variant effect benchmarks, improving eQTL sign prediction and QTL effect correlations. The multimodal outputs enable mechanistic interpretation of variants, for example recapitulating oncogenic TAL1 enhancer mutations and identifying splice‑disrupting variants. Conclusion:AlphaGenome provides a unified 1‑Mb multimodal, base‑resolution sequence model that substantially improves genome track and regulatory variant effect prediction and enables mechanistic, cross‑modality interpretation. QC:This episode was checked against the original article PDF and publication metadata for the episode release published on 2026-01-30. QC Scope:- article metadata and core scientific claims from the narration- excludes analogies, intro/outro, and music- transcript coverage: Audited the transcript portions presenting AlphaGenome architecture, multimodal outputs, benchmarking, splicing, TAL1 mechanism, 3D genome, in silico mutagenesis, and limitations.- transcript topics: AlphaGenome architecture (1 Mb input, U‑Net backbone, transformers, sequence parallelism); Multimodal genome tracks (5,930 human tracks, 1,128 mouse tracks across 11 modalities); Variant effect benchmarking (26 tasks; 25/26 ahead); Splicing variant predictions (splice sites, splice junctions, usage); TAL1 oncogene mechanism (neo-enhancer, MYB motif); 3D genome contact maps predictions QC Summary:- factual score: 10/10- metadata score: 10/10- supported core claims: 8- claims flagged for review: 0- metadata checks passed: 4- metadata issues found: 0 Metadata Audited:- article_doi- article_title- article_journal- license Factual Items Audited:- AlphaGenome takes input of 1 Mb DNA and predicts thousands of genome tracks across 11 modalities at base-pair resolution- 5,930 human genome tracks and 1,128 mouse genome tracks predicted- AlphaGenome matched or outperformed strongest external models on 25 of 26 variant-effect benchmarks- Splicing variant predictions include splice sites, splice junctions, and splice site usage- TAL1 oncogene mechanism recapitulated via a neo-enhancer and MYB motif- 3D genome folding predicted via contact maps with improved performance over specialized tools QC result: Pass.

NOW PLAYING

276: AlphaGenome: 1-Mb multimodal deep model predicts regulatory variant effects including splicing and TAL1 mechanisms

0:00 19:40

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

MG Show MG Show The MG Show, hosted by Jeffrey Pedersen and Shannon Townsend, is a leading alternative media platform dedicated to uncovering the truth behind today’s most pressing political issues. Launched in 2019, the show has grown exponentially, offering unfiltered insights, comprehensive research, and real-time analysis. With a commitment to independent journalism and factual integrity, the MG Show empowers its audience with knowledge and encourages active participation in the political discourse. That Hoarder: Overcome Compulsive Hoarding That Hoarder Hoarding disorder is stigmatised and people who hoard feel vast amounts of shame. This podcast began life as an audio diary, an anonymous outlet for somebody with this weird condition. That Hoarder speaks about her experiences living with compulsive hoarding, she interviews therapists, academics, researchers, children of hoarders, professional organisers and influencers, and she shares insight and tips for others with the problem. Listened to by people who hoard as well as those who love them and those who work with them, Overcome Compulsive Hoarding with That Hoarder aims to shatter the stigma, share the truth and speak openly and honestly to improve lives. Flottengeflüster ALD Automotive Österreich | LeasePlan Beim Flottengeflüster powered by ALD Automotive | LeasePlan präsentieren Jörg Janik und Peter Gutenbrunner alle zwei Wochen spannende Informationen rund um das Thema nachhaltige Mobilität. Beide beschäftigen sich schon lange mit der Thematik und bringen umfangreiches Fachwissen mit. Sollten sie aber doch einmal nicht weiter wissen, werden unsere Expert*innen hinzugezogen, die ihnen gerne mit Rat und Tat zur Seite stehen. The Small Business Startup School – Business Notes | Financial Literacy | Retail Psychology – For Professionals & Entrepreneurs The Small Business Startup School Inc. Starting or buying a small business? While personal circumstances may vary, business patterns remain timeless. On The Small Business Startup School, we explore strategies, insights, and practical solutions to help entrepreneurs confidently navigate their journey.Hosted by Ola Williams—a retail entrepreneur, fintech founder, and financial coach with over two decades of experience—this podcast marries financial awareness and retail psychology with optimism to deliver actionable takeaways.Join us to learn, grow, and connect as we uncover the keys to business success.Let’s continue to learn together and be encouraged to keep on connecting!

Frequently Asked Questions

How long is this episode of Base by Base?

This episode is 19 minutes long.

When was this Base by Base episode published?

This episode was published on January 30, 2026.

What is this episode about?

Avsec et al., Nature, doi:10.1038/s41586-025-10014-0 - AlphaGenome, a 1 Mb DNA deep‑learning model, predicts base‑pair‑resolution genome tracks (RNA‑seq, splicing, chromatin) and scores variant effects, achieving state‑of‑the‑art performance across...

Can I download this Base by Base episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!