1

Scaling Agents via Continual Pre-training

Mar 13, 2026

4:29

2

WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research

Mar 13, 2026

3:39

3

WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research

Sep 22, 2025

6:00

4

Scaling Agents via Continual Pre-training

Sep 22, 2025

5:03

5

OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling

Sep 22, 2025

3:30

6

UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

Sep 22, 2025

3:50

7

One-Step Residual Shifting Diffusion for Image Super-Resolution via Distillation

Mar 23, 2025

4:04

8

Survey on Evaluation of LLM-based Agents

Mar 23, 2025

4:30

9

VisionZip: Longer is Better but Not Necessary in Vision Language Models

Dec 8, 2024

3:06

10

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Dec 8, 2024

3:28

11

CLEAR: Character Unlearning in Textual and Visual Modalities

Nov 3, 2024

3:12

12

AutoKaggle: A Multi-Agent Framework for Autonomous Data Science Competitions

Nov 3, 2024

4:07

13

CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation

Nov 3, 2024

3:26

14

BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments

Nov 3, 2024

17:57

15

SelfCodeAlign: Self-Alignment for Code Generation

Nov 3, 2024

18:51

16

AAAR-1.0: Assessing AI's Potential to Assist Research

Nov 3, 2024

22:23

17

Learning Video Representations without Natural Videos

Nov 3, 2024

22:48

18

BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays

Nov 3, 2024

21:42

19

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Nov 3, 2024

4:18

20

What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

Nov 3, 2024

3:48

21

What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective

Nov 3, 2024

3:14

22

Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders

Nov 3, 2024

3:16

23

ROCKET-1: Master Open-World Interaction with Visual-Temporal Context Prompting

Oct 30, 2024

22:06

24

CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution

Oct 28, 2024

24:29

25

Knowing When to Ask - Bridging Large Language Models and Data

Oct 28, 2024

20:58

All Episodes