The FAIR² Chronicles: Data stories for an AI world podcast artwork

PODCAST · science

The FAIR² Chronicles: Data stories for an AI world

The FAIR² Chronicles: Data Stories for an AI World” is an AI-generated podcast bringing published FAIR² datasets to life. Each episode spotlights real-world datasets structured under FAIR² principles—FAIR + AI-readiness, Responsible AI, and Context—showcasing their impact on scientific discovery, innovation, and global challenges. From climate research to biomedical breakthroughs, AI narrates the data’s journey, revealing how structured, machine-actionable datasets are driving the future of open science. senscience.substack.com

  1. 14

    From Trash to Trajectories: Linking Waste, Emissions, and Development in a Single Global System

    Waste is often reported as a management problem. Emissions are reported as a climate problem. This episode is about what happens when those two are finally treated as the same system.We dive into the Frontiers Planet Prize National Champion-Awarded Global Waste Sector Dataset (1990–2050), a harmonized resource spanning historical data through mid-century projections that explicitly links municipal solid waste generation to greenhouse gas emissions, while tying both back to the socioeconomic forces that drive them. Developed by Hoy, Woon, Chin, Fan, Yoo, and an international consortium of researchers, the dataset is framed not as a single study outcome, but as durable research infrastructure designed for reuse, comparison, and modeling.At its core, the dataset connects population growth and PPP-adjusted economic development to physical waste generation, then traces how that waste translates into carbon dioxide, methane, and nitrous oxide emissions through different treatment pathways. Historical data from major public sources—including the World Bank, OECD, Eurostat, and UNFCCC national reports—is rigorously harmonized before being extended into the future using Shared Socioeconomic Pathways (SSPs).Methodologically, the project is notable for how seriously it treats system complexity. Historical waste generation is reconstructed using fixed-effects panel regression to control for country-specific characteristics, while future emissions are modeled using country-level machine learning ensembles that capture nonlinear relationships—particularly critical for methane, whose climate impact is handled using GWP-STAR rather than conventional metrics.The result is a dataset that allows researchers to do more than track growth. It supports cross-country benchmarking, long-term decoupling analysis, and exploration of how waste management choices shape near- and long-term climate outcomes. By keeping waste generation and emissions structurally linked, the dataset avoids the common pitfall of treating climate impacts as detached from material flows.The authors are also explicit about the limits: national-scale resolution only, scenario-dependent futures, no explicit uncertainty intervals, and uneven country coverage driven by historical data availability. These constraints are documented as part of the dataset’s context, reinforcing responsible reuse rather than obscuring uncertainty.Delivered through a FAIR²-aligned data portal with persistent identifiers, rich metadata, and machine-actionable structure, this resource is designed to move directly into lifecycle assessment, climate modeling, and AI-driven analysis.If you’re interested in understanding waste not just as an output of consumption, but as a measurable driver of emissions across decades—and in how economic development, infrastructure, and climate impacts intersect at national scale—this episode offers a clear, integrated starting point.Hoy, Z.X., Woon, K.S., Chin, W.C., Fan, Y.V., & Yoo, S.J. (2025). Global Waste Sector Dataset (1990–2050): Scenario-Based Projections of Generation, Emissions, and Socioeconomic Drivers. Front. Environ. Sci., section Environmental Economics and Management.Data article: https://doi.org/10.3389/fenvs.2025.1717992.FAIR² Data portal: https://doi.org/10.71728/senscience.k2f7-p5v9. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit senscience.substack.com

  2. 13

    Seeing Atolls as a System: Inside the First Fully Integrated Indo-Pacific Atoll Dataset

    If you work in ecology, conservation, island biogeography, or environmental data—and you’ve ever struggled to connect biodiversity, climate, oceanography, and human history across small islands—this episode is for you.We take a deep dive into a landmark effort to harmonize data for all 310 Indo-Pacific atolls with permanent emergent land, transforming decades of scattered literature, field surveys, satellite products, and historical records into a single, integrated, machine-readable resource. Led by Frontiers Planet Prize national champion Sebastian Stiebel and an international team spanning academia and conservation organizations, this project represents a foundational shift in how atolls can be studied—not as isolated case studies, but as a connected system.The dataset synthesizes over 4,200 species records from 677 sources, standardized across 90 environmental, biological, and contextual variables. It combines terrestrial biodiversity inventories, seabird population estimates, climate and oceanographic drivers, reef and land habitat classifications, human population data, and a uniquely detailed layer on historical military land use—capturing legacy impacts that often shape present-day ecological outcomes but are rarely included in large-scale models.What makes this resource especially powerful is how it’s delivered. Rather than a static download, the data is available through an interactive, FAIR²-certified portal, designed to be immediately usable by both researchers and machines. By prioritizing AI readiness and responsible reuse, the project removes long-standing barriers between ecological data and predictive modeling.We also discuss the realities and limitations of working with historical sources—uneven sampling, taxonomic gaps, and the impossibility of retroactively standardizing past fieldwork—and why acknowledging those constraints is essential for responsible analysis. Even so, this dataset establishes a long-needed baseline for comparative research, conservation planning, and data-driven forecasting across one of the world’s most fragile and important ecosystems.If you’re interested in moving from descriptive ecology to predictive conservation—and in understanding how climate, biodiversity, and human history intersect across remote island systems—this episode is for you.Steibl S, Burnett MW, Holmes ND, Wegmann AS and Russell JC (2025). Atoll biodiversity and environments: an AI-ready, interactive data portal for Indo-Pacific atolls. Front. Environ. Sci. , section Environmental Informatics and Remote Sensing.Data article: https://doi.org/10.3389/fenvs.2025.1723851FAIR² Data portal: https://doi.org/10.71728/senscience.4f2j-8h1k This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit senscience.substack.com

  3. 12

    A Multi-Site View of Brain Injury: The TOP-NT Harmonized Rat MRI Dataset

    If you work in neuroimaging, neurotrauma, or computational neuroscience—and especially if you’ve ever wished for a truly comparable, cross-lab view of traumatic brain injury—this episode is for you.We dig into a remarkable new FAIR² dataset from the TOP-NT consortium: the first harmonized, multi-site diffusion MRI resource for preclinical TBI. It brings together 343 high-resolution scans from 184 rats across four research centers, all acquired under a unified protocol and processed through a rigorous, standardized pipeline.What makes this dataset so valuable? It marries tightly controlled acquisition with advanced harmonization methods like neuroCombat and multi-site template registration—removing scanner biases while preserving the biological injury signal. The result is a clean, comparable view of how structural brain changes unfold at 3 and 30 days after controlled cortical impact. Researchers can now reliably detect diffusion abnormalities, quantify tissue atrophy, and visualize injury progression across institutions.We also note the limitations—like the single injury model, two timepoints, and the challenge of fully removing site effects—and emphasize how the dataset, now viewable through the interactive FAIR² Data Portal and archived in the ODC-TBI repository, can still advance harmonization research, benchmark AI models, and strengthen reproducible TBI science moving forward.If you’re looking for a benchmark dataset for AI model training, injury signature discovery, cross-site reproducibility, or simply a clearer map of TBI evolution, this episode has you covered.Kislik G, Fox R, Korotcov A, Zhou J, Febo M, Moghadas B, Bibic A, Zou Y, Wan J, Koehler RC., Adebayo T, Burns MP., McCabe JT., Wang KK.W., Huie J.R, Ferguson AR., Paydar A, Wanner IB., Harris NG. and The TOP-NT Investigators (2025) Multi-site, in vivo MRI dataset of brain diffusivity measures before and after harmonization, and atrophy measures following controlled cortical impact in male and female adult rats. Front. Neurol. 16:1719618. doi: 10.3389/fneur.2025.1719618 This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit senscience.substack.com

  4. 11

    Staying Ahead of SARS-CoV-2: Inside the Ultimate Spike Protein Mutation Dataset

    If you work anywhere near virology, structural biology, or computational genomics—and especially if you’ve ever wished for a true map of the SARS-CoV-2 spike protein’s mutational landscape—this episode and the new FAIR² Data Article from the Stay Ahead Project is for you.We’re diving into the Stay Ahead Project’s latest data release: a thoroughly curated, structure-informed resource focused on the spike protein’s receptor binding domain (RBD). Created by a team led by Erik Schultes (LACDR/GoFair Foundation) with Max Van de Boom and Thomas Hankemeyer, this dataset systematically catalogs every possible single-point mutation in the RBD—over 3,700 in total—plus real-world Omicron variants.What sets this resource apart? It combines state-of-the-art protein structure prediction (using both AlphaFold2 and ESMFold), deep mutational scanning data for ACE2 binding and surface expression, and biophysical sequence features. We discuss the technical details, the challenges of integrating computational and experimental data, and how this dataset can inform predictive modeling of variant behavior.We also talk openly about the limitations—like the focus on the RBD and the challenges of modeling higher-order mutational effects—and the ways this resource, accessible via the FAIR² Data Portal, can support the research community moving forward.If you’re looking for new tools for variant surveillance, functional annotation, or just want a deeper understanding of spike protein evolution, this episode is for you.van den Boom M, Schultes E and Hankemeier T (2025) Structure-based prediction of SARS-CoV-2 variant properties using machine learning on mutational neighborhoods. Front. Bioinform. 5:1634111. doi: 10.3389/fbinf.2025.1634111 This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit senscience.substack.com

  5. 10

    Publish or Perish. But Who’s Reading?

    Science is accelerating. We’re publishing more papers and generating more data than ever before. But much of that knowledge—however valuable—goes unread, unexamined, and ultimately unused.In this AI-narrated episode, we explore a growing tension at the heart of modern research: a system built to produce knowledge is now struggling to process it. Scientists are overwhelmed, attention is scarce, and crucial datasets remain invisible—not because they don’t matter, but because they’re not designed to be reused.We ask the provocative question: Who are we really publishing for?And we introduce FAIR²—a new, open specification for making scientific data and methods machine-actionable, not just human-readable. Paired with intelligent tools like Senscience’s AI data steward, FAIR² represents a foundational shift in how science is structured, shared, and ultimately amplified.A new kind of collaborator is emerging—and it might change everything. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit senscience.substack.com

  6. 9

    Making Data Count: How FAIR² Portals Turn Open Data into Real Scientific Impact

    Have you ever been excited to use an open dataset, only to find yourself lost in a maze of cryptic files, missing context, and dead-end links? You’re not alone. In this episode, we unpack why so much “open data” still isn’t truly reusable—and why that matters for science, machine learning, and policy alike.Join us as we explore how FAIR² Data Portals are raising the bar, transforming messy spreadsheets and inaccessible metadata into actionable, trustworthy resources. You’ll hear real-world stories of research frustration, learn what it means for data to be FAIR² (Findable, Accessible, Interoperable, Reusable—plus AI-Ready and Responsible), and discover practical features like interactive data explorers, contextualized methods, AI-powered assistants, and machine learning-ready formats.Whether you’re a scientist, a data steward, an engineer, or just curious about the future of research, this episode shows how a new approach to data infrastructure is multiplying scientific impact—and how you can be part of the movement.Listen in, share your feedback, and help shape the next era of scientific data! This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit senscience.substack.com

  7. 8

    The Myth of Objective Data

    We often hear about making AI responsible—but what if the real problem isn’t the model? What if it starts much earlier, with the data we feed into it?In this episode, we unpack the hidden assumptions baked into the way data is measured, labeled, and used. We explore why objectivity in data is often a myth—and why designing for subjectivity is essential for building trustworthy AI. From mental health dashboards and inconsistent clinical definitions to the neuroscience of neuron behavior and brain simulations, we examine how context—not just content—shapes what AI learns.We introduce the FAIR² framework: a practical approach to making data not only Findable, Accessible, Interoperable, and Reusable—but also AI-ready, Responsible, and Context-Rich. This isn’t just about fixing bias. It’s about understanding how data is made—and building integrity into the foundation of every AI system.If you care about scientific rigor, equity in AI, or reproducible research, this episode will shift how you think about data. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit senscience.substack.com

  8. 7

    What Will We Carve in Stone?

    In a world shaped by constant change, memory is fragile — and preservation is never automatic.Inspired by a visit to the Qianlong Stone Steles in Beijing, this episode reflects on how civilizations have fought against impermanence by choosing what must endure.Today, as our knowledge increasingly rests on unstable digital foundations, we face the same challenge:Preservation is not a given; it is an act of will and judgment.What, in our time, do we believe is worth enduring — and what will we choose to carve in stone? This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit senscience.substack.com

  9. 6

    From FAIR to FAIR²: Raising the Bar for Scientific Data Practice

    What does it really take to make data usable—not just technically FAIR, but truly findable, accessible, interoperable, and reusable in the ways science needs today?In this episode of The FAIR² Chronicles, we explore the frustrating reality of pseudo-FAIR data and the critical gap between aspiration and implementation. We introduce FAIR² as a powerful framework that embeds context, clarity, and computability into the data lifecycle—from rich metadata and semantic alignment to transparent workflows and reproducibility tools.Whether you’re a researcher, funder, policymaker, or data steward, this conversation will help you rethink what it means to share data that’s not just open, but trustworthy, understandable, and ready to drive discovery. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit senscience.substack.com

  10. 5

    If You Can’t Reproduce It, Is It Even Science?

    In this episode, we dig into the often-overlooked cornerstone of scientific progress: reproducibility. It’s not flashy, but it’s fundamental. Without it, science loses its grounding—and so does our ability to trust and build upon results.We explore how FAIR² Data Management is reshaping the way researchers handle, document, and share their data. From clear variable descriptions and transparent processing pipelines to the use of ontologies, version control, and data articles, FAIR² goes far beyond just making data open—it makes it usable, understandable, and truly reusable.We also introduce tools like the AI Data Steward, designed to support researchers in creating context-rich, FAIR, and AI-ready datasets without adding extra burden.Whether you’re a seasoned data scientist or just beginning to think about data stewardship, this episode offers a practical and philosophical look at why reproducibility isn’t just a technical detail—it’s the very foundation of trustworthy science. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit senscience.substack.com

  11. 4

    What If Sharing Your Data Was Easier Than You Think?

    Every day, researchers generate valuable datasets. But many of them never get shared—let alone reused. Why? Because data sharing still feels hard: confusing formats, missing documentation, and no clear roadmap. For many, it’s not part of their training. It’s just another overwhelming task.In this episode, we unpack the very real barriers to sharing research data—and introduce a way forward. FAIR² is an open specification designed to make research data not only open, but AI-ready, responsibly shared, and documented with context. And FAIR² Data Management—a service offered by Frontiers and powered by Senscience—helps researchers apply this framework without needing to become data experts.We dive into how the platform’s AI Data Steward supports everything from structuring spreadsheets to drafting FAIR² Data Articles. You’ll learn why this matters not just for open science, but for recognition, reuse, and real-world impact.If you’ve ever thought, “I’d love to share my data, but I don’t know where to start,”—this episode is for you.Links mentioned:🔗 Learn more about FAIR²: fair2.ai📬 Subscribe to The FAIR² Pulse: senscience.substack.com✅ Apply to join the FAIR² pilot: frontiersin.org/about/fair-data-management This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit senscience.substack.com

  12. 3

    FAIR² Data Articles: Unlocking Research Data Value

    In this episode, we explore a growing opportunity for researchers: turning their datasets into recognized, reusable, and citable scientific contributions.Too often, datasets sit unused—buried in supplemental materials or left unpublished entirely. But what if those datasets could become a first-class research output in their own right?Enter the FAIR² Data Article, a new article type from Frontiers that focuses not on findings, but on the dataset itself—how it was collected, structured, and how others can reuse it. We discuss how these articles come with structured, AI-ready data packages, interactive portals, and rich metadata that make reuse easy and powerful.We also dive into one of the most exciting aspects of the FAIR² platform: the built-in AI Data Steward, a smart assistant that helps researchers clean, structure, document, and publish their datasets—without getting bogged down by technical overhead.If you’ve got tabular data—whether from a past study or an unpublished project—this episode will show you how FAIR² can help you share it responsibly, extend its impact, and get the credit you deserve.🔗 Apply to join the pilot: frontiersin.org/about/fair-data-management🌐 Learn more at: fair2.ai📰 Subscribe to FAIR² Pulse: senscience.substack.com This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit senscience.substack.com

  13. 2

    An introduction to FAIR² and Senscience

    This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit senscience.substack.com

  14. 1

    Ocean Insights: Three Decades of Marine Monitoring in the Basque Country

    This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit senscience.substack.com

Type above to search every episode's transcript for a word or phrase. Matches are scoped to this podcast.

Searching…

We're indexing this podcast's transcripts for the first time — this can take a minute or two. We'll show results as soon as they're ready.

No matches for "" in this podcast's transcripts.

Showing of matches

No topics indexed yet for this podcast.

Loading reviews...

ABOUT THIS SHOW

The FAIR² Chronicles: Data Stories for an AI World” is an AI-generated podcast bringing published FAIR² datasets to life. Each episode spotlights real-world datasets structured under FAIR² principles—FAIR + AI-readiness, Responsible AI, and Context—showcasing their impact on scientific discovery, innovation, and global challenges. From climate research to biomedical breakthroughs, AI narrates the data’s journey, revealing how structured, machine-actionable datasets are driving the future of open science. senscience.substack.com

HOSTED BY

Senscience

CATEGORIES

Frequently Asked Questions

How many episodes does The FAIR² Chronicles: Data stories for an AI world have?

The FAIR² Chronicles: Data stories for an AI world currently has 14 episodes available on PodParley. New episodes are automatically indexed when they're published to the podcast feed.

What is The FAIR² Chronicles: Data stories for an AI world about?

The FAIR² Chronicles: Data Stories for an AI World” is an AI-generated podcast bringing published FAIR² datasets to life. Each episode spotlights real-world datasets structured under FAIR² principles—FAIR + AI-readiness, Responsible AI, and Context—showcasing their impact on scientific discovery,...

How often does The FAIR² Chronicles: Data stories for an AI world release new episodes?

The FAIR² Chronicles: Data stories for an AI world has 14 episodes. Check the episode list to see recent publication dates and frequency.

Where can I listen to The FAIR² Chronicles: Data stories for an AI world?

You can listen to The FAIR² Chronicles: Data stories for an AI world on PodParley by clicking any episode. We provide an embedded audio player for direct listening, and you can also subscribe via your preferred podcast app using the RSS feed.

Who hosts The FAIR² Chronicles: Data stories for an AI world?

The FAIR² Chronicles: Data stories for an AI world is created and hosted by Senscience.
URL copied to clipboard!