Papers
-
Efficient Autoregressive Video Diffusion with Dummy HeadMicrosoft / ETH Zurich, Johns Hopkins University, Tsinghua University, University of Science and Technology of China
-
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
-
Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision
-
Astra: General Interactive World Model with Autoregressive Denoising
-
Towards Pixel-Level VLM Perception via Simple Points Prediction
-
Differentiable Semantic ID for Generative Recommendation
-
Dep-Search: Learning Dependency-Aware Reasoning Traces with Persistent Memory
-
Agentic Very Long Video Understanding
-
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience
-
AnyView: Synthesizing Any Novel View in Dynamic Scenes
-
Latent Diffusion for Internet of Things Attack Data Generation in Intrusion Detection SystemsUniversidad Rey Juan Carlos
-
DSGym: A Holistic Framework for Evaluating and Training Data Science Agents
-
CamPilot: Improving Camera Control in Video Diffusion Model with Efficient Camera Reward Feedback
-
Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMsMicrosoft / Hong Kong University of Science and Technology, Massachusetts Institute of Technology, Shanghai Artificial Intelligence Laboratory, Shanghai Jiao Tong University, Tsinghua University
-
DeepASMR: LLM-Based Zero-Shot ASMR Speech Generation for Anyone of Any Voice
-
RayRoPE: Projective Ray Positional Encoding for Multi-view Attention
-
OmniView: An All-Seeing Diffusion Model for 3D and 4D View Synthesis
-
Unified Text-Image Generation with Weakness-Targeted Post-Training
-
Zebra-Llama: Towards Extremely Efficient Hybrid Models
-
From Chains to Graphs: Self-Structured Reasoning for General-Domain LLMs
-
Learning Latent Action World Models In The WildMeta Platforms / National Institute for Research in Digital Science and Technology, New York University
-
SOFAI-LM: A Cognitive Architecture for Building Efficient and Reliable Reasoning Systems with LLMs
-
Small Models, Big Impact: Tool-Augmented AI Agents for Wireless Network PlanningKing Abdullah University of Science and Technology (KAUST)
-
Understanding and Steering the Cognitive Behaviors of Reasoning Models at Test-Time
-
RISER: Orchestrating Latent Reasoning Skills for Adaptive Activation Steering
-
S2DiT: Sandwich Diffusion Transformer for Mobile Streaming Video Generation
-
Recurrent Confidence Chain: Temporal-Aware Uncertainty Quantification in Large Language ModelsUniversity of Florida
-
Power Aware Dynamic Reallocation For Inference
-
ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents
-
MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents
-
Agentic Reasoning for Large Language Models
-
Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces
-
Deep GraphRAG: A Balanced Approach to Hierarchical Retrieval and Adaptive Integration
-
FlashLabs Chroma 1.0: A Real-Time End-to-End Spoken Dialogue Model with Personalized Voice Cloning
-
VINO: A Unified Visual Generator with Interleaved OmniModal Context
-
KernelEvolve: Scaling Agentic Kernel Coding for Heterogeneous AI Accelerators at Meta
-
OCTOBENCH: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding
-
TranslateGemma Technical Report
-
AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World ContextsShanghai Jiao Tong University, The Hong Kong Polytechnic University
-
Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering
-
The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
-
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning
-
Reasoning Models Generate Societies of Thought
-
Hardware Acceleration for Neural Networks: A Comprehensive SurveyArizona State University
-
RPC-Bench: A Fine-grained Benchmark for Research Paper Comprehension
-
The Hierarchy of Agentic Capabilities: Evaluating Frontier Models on Realistic RL Environments
-
Silence the Judge: Reinforcement Learning with Self-Verifier via Latent Geometric Clustering
-
TerraFormer: Automated Infrastructure-as-Code with LLMs Fine-Tuned via Policy-Guided Verifier Feedback
-
Apollo: Unified Audio-Video Joint Generation
-
Controlled LLM Training on Spectral Sphere
