All Episodes

AI on Air — 79 episodes

1. Shadow AI
2. Meta AI's V-JEPA 2: World Models for Understanding and Planning
3. NovelSeek Autonomous Scientific Research Framework
4. Qwen2.5-Math RLVR: Learning from Errors
5. AlphaEvolve: A Gemini-Powered Coding Agent
6. OpenAI Codex: Parallel Coding in ChatGPT
7. Agentic AI Design Patterns
8. Machine Learning for High-Risk Pregnancy Prediction
9. AI Mobile Edge Offloading for QoE and Energy Efficiency
10. Blockchain Chatbot CVD Screening
11. Deep Learning for Mammographic Breast Density Prediction
12. RLHF for Large Language Model Fine-Tuning
13. UB-Mesh: Advancing LLM Training Infrastructure
14. FASTCURL: Reinforcement Learning for Enhanced AI Reasoning
15. National AI for Cardiovascular Care: Nature Medicine Analysis
16. Vision-Language Reward Models: Advancements and Benchmarking
17. Advancing Vision-Language Reward Models
18. Mix-LN: A Hybrid Normalization Technique
19. Direct Q-Function Optimization for LLMs
20. RAG Attacks on LLMs
21. SmolAgents: AI Agents in Few Lines of Code
22. ByteDance's 1.58-bit FLUX AI Model
23. HuatuoGPT-o1: Advanced Medical Reasoning
24. FDA Authorizes AI Sepsis Detection Tool
25. Safe and Efficient Agentic AI
26. Apple's AI Strategy
27. Mix-LN: Hybrid Normalization for Transformers
28. LOTUS 1.0.0: Open-Source Query Engine
29. OpenAI o3: A Measured Advancement in AI Reasoning
30. LLM Alignment Faking: A New Threat
31. TOMG-Bench: A New AI Benchmark for Molecule Generation
32. Multi-Agent AI Frameworks
33. Alibaba vs. OpenAI: The AI Race Heats Up
34. Gemini 2.0: AI Research Assistant Capabilities
35. Maya: An Open-Source Multilingual AI Model
36. EXAONE 3.5: Enhanced Bilingual AI
37. Hugging Face TGI v3.0: Faster Text Generation
38. Density: A New Metric for Evaluating LLMs
39. Snowflake's Arctic Embed 2.0
40. ALAMA: Adaptive Language Model with Auxiliary Memory
41. Alibaba's AI Challenge to OpenAI
42. Building Effective AI Agents
43. Evaluating and Improving LLMs: Four Novel Approaches
44. AI Scientists: Revolutionizing Scientific Research
45. TamGen: AI for Antibiotic Discovery
46. SEALONG: Extending LLM Context Windows
47. AI Unveils Hidden Climate Extremes
48. Microsoft GraphRAG: Revolutionizing Data Analysis
49. Meet OpenCoder: A Completely Open-Source Code LLM Built on the Transparent Data Process Pipeline and Reproducible Dataset
50. New Scaling Laws for Optimizing Model and Dataset Proportions in Behavior Cloning and World Modeling Tasks
51. NVIDIA Launches LLaMA-Mesh, a Unified 3D Mesh Generation Method Using LLMs
52. BLIP3-KALE: An Open-Source Dataset of 218 Million Image-Text Pairs Transforming Image Captioning with Knowledge-Augmented Dense Descriptions
53. A Robust AI Solution for Managing Memory Constraints and Improving Classification Accuracy in Transformer-Based NLP Models
54. This AI Paper by Inria Introduces the Tree of Problems: A Simple Yet Effective Framework for Complex Reasoning in Language Models
55. Is Your LLM Agent Enterprise-Ready? Salesforce AI Research Introduces CRMArena
56. Databricks Mosaic Research Examines Long-Context Retrieval-Augmented Generation
57. RT-Affordance: A Hierarchical Method that Uses Affordances as an Intermediate Representation for Policies
58. Researchers at Peking University Introduce A New AI Benchmark for Evaluating Numerical Understanding and Processing in LLM
59. FrontierMath: The Benchmark that Highlights AI’s Limits in Mathematics
60. Databricks Mosaic Research Examines Long-Context Retrieval-Augmented Generation: How Leading AI Models Handle Expansive Information for Improved Response Accuracy
61. UniMTS: A Unified Pre-Training Procedure for Motion Time Series that Generalizes Across Diverse Device Latent Factors and Activities
62. Meet Hawkish 8B: A New Financial Domain Model that can Pass CFA Level 1 and Outperform Meta Llama-3.1-8B-Instruct in Math & Finance Benchmarks
63. MiniCTX: Advancing Context-Dependent Theorem Proving in Large Language Models
64. How TrigFlow’s Innovative Framework Narrowed the Gap with Leading Diffusion Models Using Just Two Sampling Steps
65. MathGAP: An Evaluation Benchmark for LLMs’ Mathematical Reasoning Using Controlled Proof Depth, Width, and Complexity for Out-of-Distribution Tasks
66. Can LLMs Follow Instructions Reliably? A Look at Uncertainty Estimation Challenges
67. Zhipu AI Releases GLM-4-Voice: A New Open-Source End-to-End Speech Large Language Model
68. Meta AI Researchers Introduce Token-Level Detective Reward Model (TLDR) to Provide Fine-Grained Annotations for Large Vision Language Models
69. Google Researchers Introduce UNBOUNDED: An Interactive Generative Infinite Game based on Generative AI Models
70. This AI Paper Explores If Human Visual Perception can Help Computer Vision Models Outperform in Generalized Tasks
71. Microsoft To Launch 'AI Agents' to Help You Handle Routine Tasks
72. IBM unveils new open source AI ‘Granite 3.0’ models for business
73. Refined Local Learning Coefficients (rLLCs): A Novel Machine Learning Approach to Understanding the Development of Attention Heads in Transformers
74. This AI Research from Cohere for AI Compares Merging vs Data Mixing as a Recipe for Building High-Performant Aligned LLMs
75. Are Brains and AI Converging?—an excerpt from ‘ChatGPT and the Future of AI: The Deep Language Revolution’
76. CREAM: A New Self-Rewarding Method that Allows the Model to Learn more Selectively and Emphasize on Reliable Preference Data
77. Embed-then-Regress: A Versatile Machine Learning Approach for Bayesian Optimization Using String-Based In-Context Regression
78. Graph-Constrained Reasoning (GCR): A Novel AI Framework that Bridges Structured Knowledge in Knowledge Graphs with Unstructured Reasoning in LLMs
79. MMed-RAG: A Versatile Multimodal Retrieval-Augmented Generation System Transforming Factual Accuracy in Medical Vision-Language Models Across Multiple Domains