3: Data-driven heuristics for splice-altering variants episode artwork

EPISODE · Apr 16, 2025 · 22 MIN

3: Data-driven heuristics for splice-altering variants

from Base by Base · host Gustavo Barra

Sullivan P et al., The American Journal of Human Genetics - A concise walkthrough of data-driven heuristics and a splicing checklist derived from large-scale exon, branchpoint, and experimentally validated variant analyses to improve interpretation of splice-altering variants. Key terms: splicing, splice-altering variants, heuristics, SpliceVarDB, pseudoexon. Study Highlights:The authors analyzed ~202,000 canonical protein-coding exons and 19,034 experimentally validated branchpoints to define sequence, spacing, and motif-strength requirements for U2 splicing, finding 95.9% of exons met these criteria. Using 11,860 experimentally validated single-nucleotide variants from SpliceVarDB, they derived data-driven heuristics, subgroups, and a spliceogenicity metric to predict whether variants alter splicing. Heuristics were implemented as donor (DD) and acceptor (DA) flowcharts plus an in silico checklist, applied to pseudoexon formation, and used to quantify typical splicing outcomes. The study notes limitations including sparse examples for splicing regulatory elements and context- or tissue-specific effects not fully captured. Conclusion:Empirically grounded heuristics and a practical checklist bridge biological splicing rules and in silico prediction, improving evaluation of putative splice-altering variants while acknowledging limits from incomplete SRE data and context dependence; ongoing contribution to SpliceVarDB and integration into tools should refine performance. QC:This episode was checked against the original article PDF and publication metadata for the episode release published on 2025-04-16. QC Scope:- article metadata and core scientific claims from the narration- excludes analogies, intro/outro, and music QC Summary:- factual score: 10/10- metadata score: 10/10- supported core claims: 7- claims flagged for review: 0- metadata checks passed: 4- metadata issues found: 0 Metadata Audited:- article_doi- article_title- article_journal- license Factual Items Audited:- Data-driven splicing heuristics framework replaces black-box AI for SAV interpretation and uses a transparent checklist grounded in splicing biology.- Minimal splicing requirements were defined from extensive analysis, with 95.9% of exons meeting these criteria.- SpliceVarDB provided data for over 11k experimentally validated SAVs, used to derive heuristics with a minimum threshold of 10 variants per heuristic.- Disruption of donor and acceptor sites is organized into DD and DA flowcharts to classify spliceogenicity, with subgroups.- Splicing outcomes are quantitatively distributed: exon skipping ~51.3%, exon truncation ~36.3%, intron retention ~17.4%, exon extension ~16.2%.- Deep intronic variants can create pseudoexons; the checklist flags 82% of confirmed pseudoexons. QC result: Pass.

Sullivan P et al., The American Journal of Human Genetics - A concise walkthrough of data-driven heuristics and a splicing checklist derived from large-scale exon, branchpoint, and experimentally validated variant analyses to improve interpretation of splice-altering variants. Key terms: splicing, splice-altering variants, heuristics, SpliceVarDB, pseudoexon. Study Highlights:The authors analyzed ~202,000 canonical protein-coding exons and 19,034 experimentally validated branchpoints to define sequence, spacing, and motif-strength requirements for U2 splicing, finding 95.9% of exons met these criteria. Using 11,860 experimentally validated single-nucleotide variants from SpliceVarDB, they derived data-driven heuristics, subgroups, and a spliceogenicity metric to predict whether variants alter splicing. Heuristics were implemented as donor (DD) and acceptor (DA) flowcharts plus an in silico checklist, applied to pseudoexon formation, and used to quantify typical splicing outcomes. The study notes limitations including sparse examples for splicing regulatory elements and context- or tissue-specific effects not fully captured. Conclusion:Empirically grounded heuristics and a practical checklist bridge biological splicing rules and in silico prediction, improving evaluation of putative splice-altering variants while acknowledging limits from incomplete SRE data and context dependence; ongoing contribution to SpliceVarDB and integration into tools should refine performance. QC:This episode was checked against the original article PDF and publication metadata for the episode release published on 2025-04-16. QC Scope:- article metadata and core scientific claims from the narration- excludes analogies, intro/outro, and music QC Summary:- factual score: 10/10- metadata score: 10/10- supported core claims: 7- claims flagged for review: 0- metadata checks passed: 4- metadata issues found: 0 Metadata Audited:- article_doi- article_title- article_journal- license Factual Items Audited:- Data-driven splicing heuristics framework replaces black-box AI for SAV interpretation and uses a transparent checklist grounded in splicing biology.- Minimal splicing requirements were defined from extensive analysis, with 95.9% of exons meeting these criteria.- SpliceVarDB provided data for over 11k experimentally validated SAVs, used to derive heuristics with a minimum threshold of 10 variants per heuristic.- Disruption of donor and acceptor sites is organized into DD and DA flowcharts to classify spliceogenicity, with subgroups.- Splicing outcomes are quantitatively distributed: exon skipping ~51.3%, exon truncation ~36.3%, intron retention ~17.4%, exon extension ~16.2%.- Deep intronic variants can create pseudoexons; the checklist flags 82% of confirmed pseudoexons. QC result: Pass.

NOW PLAYING

3: Data-driven heuristics for splice-altering variants

0:00 22:42

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

MG Show MG Show The MG Show, hosted by Jeffrey Pedersen and Shannon Townsend, is a leading alternative media platform dedicated to uncovering the truth behind today’s most pressing political issues. Launched in 2019, the show has grown exponentially, offering unfiltered insights, comprehensive research, and real-time analysis. With a commitment to independent journalism and factual integrity, the MG Show empowers its audience with knowledge and encourages active participation in the political discourse. That Hoarder: Overcome Compulsive Hoarding That Hoarder Hoarding disorder is stigmatised and people who hoard feel vast amounts of shame. This podcast began life as an audio diary, an anonymous outlet for somebody with this weird condition. That Hoarder speaks about her experiences living with compulsive hoarding, she interviews therapists, academics, researchers, children of hoarders, professional organisers and influencers, and she shares insight and tips for others with the problem. Listened to by people who hoard as well as those who love them and those who work with them, Overcome Compulsive Hoarding with That Hoarder aims to shatter the stigma, share the truth and speak openly and honestly to improve lives. Flottengeflüster ALD Automotive Österreich | LeasePlan Beim Flottengeflüster powered by ALD Automotive | LeasePlan präsentieren Jörg Janik und Peter Gutenbrunner alle zwei Wochen spannende Informationen rund um das Thema nachhaltige Mobilität. Beide beschäftigen sich schon lange mit der Thematik und bringen umfangreiches Fachwissen mit. Sollten sie aber doch einmal nicht weiter wissen, werden unsere Expert*innen hinzugezogen, die ihnen gerne mit Rat und Tat zur Seite stehen. The Small Business Startup School – Business Notes | Financial Literacy | Retail Psychology – For Professionals & Entrepreneurs The Small Business Startup School Inc. Starting or buying a small business? While personal circumstances may vary, business patterns remain timeless. On The Small Business Startup School, we explore strategies, insights, and practical solutions to help entrepreneurs confidently navigate their journey.Hosted by Ola Williams—a retail entrepreneur, fintech founder, and financial coach with over two decades of experience—this podcast marries financial awareness and retail psychology with optimism to deliver actionable takeaways.Join us to learn, grow, and connect as we uncover the keys to business success.Let’s continue to learn together and be encouraged to keep on connecting!

Frequently Asked Questions

How long is this episode of Base by Base?

This episode is 22 minutes long.

When was this Base by Base episode published?

This episode was published on April 16, 2025.

What is this episode about?

Sullivan P et al., The American Journal of Human Genetics - A concise walkthrough of data-driven heuristics and a splicing checklist derived from large-scale exon, branchpoint, and experimentally validated variant analyses to improve interpretation...

Can I download this Base by Base episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!