Papers

When is Your LLM Steerable?

Published on: 2026-06-10 5 authors
Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling

Published on: 2026-06-10 17 authors
RLCSD: Reinforcement Learning with Contrastive On-Policy Self-Distillation

Published on: 2026-06-10 8 authors
From AGI to ASI

Published on: 2026-06-10 14 authors
FACTR 2: Learning External Force Sensing for Commodity Robot Arms Improves Policy Learning

Published on: 2026-06-10 8 authors
i1: A Simple and Fully Open Recipe for Strong Text-to-Image Models

Published on: 2026-06-09 7 authors
Test-Time Gradient Guidance of Flow Policies in Reinforcement Learning

Published on: 2026-06-09 7 authors
Recalling Too Well: Sycophancy Evaluation and Mitigation in Memory-Augmented Models

Published on: 2026-06-09 4 authors
Self-Harness: Harnesses That Improve Themselves

Published on: 2026-06-08 8 authors
Introducing the Third Generation of Apple's Foundation Models

Apple, Google, NVIDIA

Published on: 2026-06-08
Sparrow: Sparse Rollout for Stable and Efficient Long-context RL of Large Language Models

Published on: 2026-06-07 10 authors
How AI Agents Reshape Knowledge Work: Autonomy, Efficiency, and Scope

Perplexity

Published on: 2026-06-05 Venue: Kate Zyskowski 1 author
How AI Agents Reshape Knowledge Work: Autonomy, Efficiency, and Scope

Published on: 2026-06-05 Venue: arXiv preprint 4 authors
Latent Reasoning with Normalizing Flows

Published on: 2026-06-04 8 authors
Slim attention: cut your context memory in half without loss – K-cache is all you need for MHA

Openmachine

Published on: 2025-06-03 1 author
MAI-Thinking-1: Building a Hill-Climbing Machine

Microsoft

Published on: 2026-06-02 Venue: Technical report (Microsoft AI) 1 author
AFUN: Towards an Affordance Foundation Model for Functionality Understanding

Published on: 2026-06-01 5 authors
MPMWorlds: Material-Point-Method Simulations for Inferring and Extrapolating Physical Dynamics

Published on: 2026-06-01 2 authors
Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses

Published on: 2026-06-01 8 authors
Physical Atari: A Robust and Accessible Platform for Real-time Reinforcement Learning on Robots

Published on: 2026-05-29 5 authors
StressDream: Steering Video World Models for Robust Policy Evaluation and Improvement

Published on: 2026-05-29 9 authors
RealityTest: How People Probe AI Identity and Whether Models Disclose It

Published on: 2026-05-29 5 authors
Representation Forcing for Bottleneck-Free Unified Multimodal Models

Published on: 2026-05-29 13 authors
mRNAutilus: Multi-Objective-Guided Discrete Generation of mRNA with Optimized Therapeutic Properties

Published on: 2026-05-29 11 authors
Stable-Layers: Fine-Tuning Image Layer Decomposition Models with VLM-Scored Reinforcement Learning

Published on: 2026-05-28 5 authors
The Little Book of Generative AI Foundations: An Intuitive Mathematical Primer

Published on: 2026-05-28 1 author
Scaling Laws for Agent Harnesses via Effective Feedback Compute

Published on: 2026-05-28 5 authors
Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents

Published on: 2026-05-28 17 authors
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Published on: 2026-05-28 40 authors
AutoScientists: Self-Organizing Agent Teams for Long-Running Scientific Experimentation

Published on: 2026-05-27 3 authors
Self-Improving Language Models with Bidirectional Evolutionary Search

Published on: 2026-05-27 7 authors
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players

Published on: 2026-05-27 10 authors
Elias in the Lighthouse, Again? Diagnosing Low Diversity in LLM Stories

Published on: 2026-05-26 2 authors
Laguna M.1/XS.2 Technical Report

Published on: 2026-05-26 96 authors
Learn from your own latents and not from tokens: A sample-complexity theory

Published on: 2026-05-26 3 authors
MobileMoE: Scaling On-Device Mixture of Experts

Published on: 2026-05-26 8 authors
Gemini Embedding 2: A Native Multimodal Embedding Model from Gemini

Published on: 2026-05-26 89 authors
The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence

Published on: 2026-05-26 207 authors
When Does LeJEPA Learn a World Model?

Published on: 2026-05-25 3 authors
Unified Neural Scaling Laws

Published on: 2026-05-25 4 authors
Language Models Need Sleep

Published on: 2026-05-25 4 authors
LLaVA-OneVision-2: Towards Next-Generation Perceptual Intelligence

Published on: 2026-05-25 30 authors
Training-Free Looped Transformers

Published on: 2026-05-22 5 authors
Polar: Agentic RL on Any Harness at Scale

Published on: 2026-05-22 12 authors
GenRecon: Bridging Generative Priors for Multi-View 3D Scene Reconstruction

Published on: 2026-05-22 5 authors
SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Published on: 2026-05-22 15 authors
Epicure: Navigating the Emergent Geometry of Food Ingredient Embeddings

Published on: 2026-05-21 2 authors
Forecasting Scientific Progress with Artificial Intelligence

Published on: 2026-05-21 10 authors
Vector Policy Optimization: Training for Diversity Improves Test-Time Search

Published on: 2026-05-21 9 authors
Advancing Mathematics Research with AI-Driven Formal Proof Search

Published on: 2026-05-21 20 authors
