All Episodes
Daily Paper Cast (Test) — 25 episodes
Scaling Agents via Continual Pre-training
WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research
WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research
Scaling Agents via Continual Pre-training
OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling
UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning
One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation
Survey on Evaluation of LLM-based Agents
VisionZip: Longer is Better but Not Necessary in Vision Language Models
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion
CLEAR: Character Unlearning in Textual and Visual Modalities
AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation
BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments
SelfCodeAlign: Self-Alignment for Code Generation
AAAR-1.0: Assessing AI's Potential to Assist Research
Learning Video Representations without Natural Videos
BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders
ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting
CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution
Knowing When to Ask - Bridging Large Language Models and Data