Papers
-
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control
Stanford University
-
The Geometry of Noise: Why Diffusion Models Don't Need Noise Conditioning
-
SARAH: Spatially Aware Real-time Agentic Humans
-
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention
-
Improved Object-Centric Diffusion Learning with Registers and Contrastive Alignment
-
El Agente Gráfico: Structured Execution Graphs for Scientific Agents
-
Unified Latents (UL): How to train your latents
-
Learning to Learn from Language Feedback with Social Meta-Learning
-
Flow Map Language Models: One-step Language Modeling via Continuous Denoising
-
OpenSage: Self-programming Agent Generation Engine
-
Multi-agent cooperation through in-context co-player inference
-
Factored Latent Action World Models
-
Tuning-free Visual Effect Transfer across Videos
-
EVMbench: Evaluating AI Agents on Smart Contract Security
-
EgoScale: Scaling Dexterous Manipulation with Diverse Egocentric Human Data
-
Statistical approximation is not general intelligence
New York University, Sapienza University of Rome, University of Milan-Bicocca
-
GLM-5: from Vibe Coding to Agentic Engineering
-
Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching
-
jina-embeddings-v5-text: Task-Targeted Embedding Distillation
-
World Action Models are Zero-shot Policies
-
On Surprising Effectiveness of Masking Updates in Adaptive Optimizers
-
Image Generation with a Sphere Encoder
-
GUI-GENESIS: Automated Synthesis of Efficient Environments with Verifiable Rewards for GUI Agent Post-Training
-
OmniVideo-R1: Reinforcing Audio-visual Reasoning with Query Intention and Modality Attention
Tencent / Hunan University, National University of Singapore, The Chinese University of Hong Kong, Tsinghua University, Xi'an Jiaotong University
-
BitDance: Scaling Autoregressive Generative Models with Binary Tokens
-
Experiential Reinforcement Learning
-
Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety
-
Speculative Decoding with a Speculative Vocabulary
-
GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
-
Hippocampus: An Efficient and Scalable Memory Module for Agentic AI
-
Joint Time Series Chain: Detecting Unusual Evolving Trend across Time Series
-
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation
-
Unifying Ranking and Generation in Query Auto-Completion via Retrieval-Augmented Generation and Multi-Objective Alignment
-
Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot Learning
NVIDIA / Georgia Institute of Technology, Massachusetts Institute of Technology, Robotics and AI Institute, Swiss Federal Institute of Technology in Zurich, University of California, University of Southern California, University of Texas, University of Toronto
-
WizardLM: Empowering large pre-trained language models to follow complex instructions
-
Florence: A New Foundation Model for Computer Vision
-
Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution
-
GISA: A Benchmark for General Information-Seeking Assistant
-
SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks
Amazon / Boston University, Carnegie Mellon University, Columbia University, Dartmouth College, Duke University, Michigan State University, Princeton University, Stanford University, The Ohio State University, University of California, University of Oxford, University of Southern California, University of Texas
-
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning
-
Think like a Scientist: Physics-guided LLM Agent for Equation Discovery
University of California
-
Intelligent AI Delegation
-
AdaptEvolve: Improving Efficiency of Evolutionary AI Agents through Adaptive Model Selection
-
HAIC: Humanoid Agile Object Interaction Control via Dynamics-Aware World Model
-
LUVE: Latent-Cascaded Ultra-High-Resolution Video Generation with Dual Frequency Experts
-
Abstractive Red-Teaming of Language Model Character
-
Kelix Technical Report
-
DSO: Direct Steering Optimization for Bias Mitigation
-
LLM-in-Sandbox Elicits General Agentic Intelligence
