Papers

Filter by company

Slim attention: cut your context memory in half without loss – K-cache is all you need for MHA

Openmachine

Published on: 2025-06-03 1 author
Just-in-Time: Training-Free Spatial Acceleration for Diffusion Transformers

Published on: 2026-03-11 3 authors
DiT4DiT: Jointly Modeling Video Dynamics and Actions for Generalizable Robot Control

Published on: 2026-03-11 7 authors
MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production

AMD / Peking University

1 author
Two Teachers Better Than One: Hardware-Physics Co-Guided Distributed Scientific Machine Learning

Published on: 2026-03-10 7 authors
SCALAR: Learning and Composing Skills through LLM Guided Symbolic Planning and Deep RL Grounding

Published on: 2026-03-10 7 authors
WS-Net: Weak-Signal Representation Learning and Gated Abundance Reconstruction for Hyperspectral Unmixing via State-Space and Weak Signal Attention Fusion

Published on: 2026-03-10 5 authors
The Epistemic Support-Point Filter: Jaynesian Maximum Entropy Meets Popperian Falsification

Published on: 2026-03-10 1 author
Time, Identity and Consciousness in Language Model Agents

Published on: 2026-03-10 2 authors
FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation

Published on: 2026-03-10 6 authors
EPOCH: An Agentic Protocol for Multi-Round System Optimization

Published on: 2026-03-10 3 authors
From Days to Minutes: An Autonomous AI Agent Achieves Reliable Clinical Triage in Remote Patient Monitoring

Published on: 2026-03-10 11 authors
Sim2Act: Robust Simulation-to-Decision Learning via Adversarial Calibration and Group-Relative Perturbation

Published on: 2026-03-10 8 authors
Spectral-Structured Diffusion for Single-Image Rain Removal

Published on: 2026-03-10 2 authors
Streaming Autoregressive Video Generation via Diagonal Distillation

Published on: 2026-03-10 6 authors
Reviving ConvNeXt for Efficient Convolutional Diffusion Models

Swiss Federal Institute of Technology in Zurich, University of Pisa

Published on: 2026-03-10 8 authors
Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards

University of Chinese Academy of Sciences

Published on: 2026-03-10 9 authors
TiPToP: A Modular Open-Vocabulary Planning System for Robotic Manipulation

MIT Computer Science and Artificial Intelligence Laboratory, University of Pennsylvania

Published on: 2026-03-10 10 authors
ReCoSplat: Autoregressive Feed-Forward Gaussian Splatting Using Render-and-Compare

NVIDIA / Hong Kong University of Science and Technology, Shanghai Jiao Tong University, Swiss Federal Institute of Technology in Zurich, University of California, Merced

Published on: 2026-03-10 6 authors
Towards a Neural Debugger for Python

Johannes Kepler University Linz

Published on: 2026-03-10 4 authors
ZeroWBC: Learning Natural Visuomotor Humanoid Control Directly from Human Egocentric Video

Northwestern Polytechnical University, Shanghai Jiao Tong University, Tsinghua University, University of Science and Technology of China

Published on: 2026-03-10 8 authors
On the Width Scaling of Neural Optimizers Under Matrix Operator Norms I: Row/Column Normalization and Hyperparameter Transfer

Northwestern University, University of British Columbia, University of Chicago

Published on: 2026-03-10 3 authors
Reinforced Generation of Combinatorial Structures: Ramsey Numbers

Google, Google DeepMind / University of California, Berkeley

Published on: 2026-03-10 3 authors
Thinking to Recall: How Reasoning Unlocks Parametric Knowledge in LLMs

Google Research / Technion – Israel Institute of Technology, Tel Aviv University

Published on: 2026-03-10 6 authors
MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data

Adobe, NVIDIA / Brown University, University of Illinois Urbana-Champaign, University of Maryland, University of Southern California, Washington University in St. Louis

Published on: 2026-03-10 11 authors
OpenClaw-RL: Train Any Agent Simply by Talking

Princeton University

Published on: 2026-03-10 5 authors
InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Fudan University, Nanjing University, Shanghai AI Laboratory, Shanghai Jiao Tong University, South China University of Technology, The Chinese University of Hong Kong Multimedia Laboratory, Tsinghua University, University of Science and Technology of China, Xiamen University

Published on: 2026-03-10 29 authors
Reward Prediction with Factorized World States

East China Normal University, Hong Kong University of Science and Technology

Published on: 2026-03-10 7 authors
Hybrid Quantum-Classical Encoding for Accurate Residue-Level pKa Prediction

Published on: 2026-03-09 2 authors
Hybrid Quantum Neural Network for Multivariate Clinical Time Series Forecasting

Published on: 2026-03-09 4 authors
TALON: Test-time Adaptive Learning for On-the-Fly Category Discovery

Published on: 2026-03-09 8 authors
Tiny Autoregressive Recursive Models

Published on: 2026-03-09 3 authors
High-Fidelity Pruning for Large Language Models

Published on: 2026-03-09 3 authors
From Reactive to Map-Based AI: Tuned Local LLMs for Semantic Zone Inference in Object-Goal Navigation

Published on: 2026-03-09 2 authors
EAGLE-Pangu: Accelerator-Safe Tree Speculative Decoding on Ascend NPUs

Published on: 2026-03-09 3 authors
DSH-Bench: A Difficulty- and Scenario-Aware Benchmark with Hierarchical Subject Taxonomy for Subject-Driven Text-to-Image Generation

Published on: 2026-03-09 14 authors
Toward Robust LLM-Based Judges: Taxonomic Bias Evaluation and Debiasing Optimization

Published on: 2026-03-09 8 authors
DC-W2S: Dual-Consensus Weak-to-Strong Training for Reliable Process Reward Modeling in Biological Reasoning

Published on: 2026-03-09 9 authors
TrianguLang: Geometry-Aware Semantic Consensus for Pose-Free 3D Localization

Published on: 2026-03-09 4 authors
Adaptive MLP Pruning for Large Vision Transformers

Published on: 2026-03-09 1 author
Invisible Safety Threat: Malicious Finetuning for LLM via Steganography

Published on: 2026-03-09 4 authors
Tau-BNO: Brain Neural Operator for Tau Transport Model

Published on: 2026-03-09 9 authors
SAMoE-VLA: A Scene Adaptive Mixture-of-Experts Vision-Language-Action Model for Autonomous Driving

Published on: 2026-03-09 7 authors
UIS-Digger: Towards Comprehensive Research Agent Systems for Real-world Unindexed Information Seeking

Published on: 2026-03-09 7 authors
Model-based Offline RL via Robust Value-Aware Model Learning with Implicitly Differentiable Adaptive Weighting

Published on: 2026-03-09 6 authors
SaiVLA-0: Cerebrum--Pons--Cerebellum Tripartite Architecture for Compute-Aware Vision-Language-Action

Published on: 2026-03-09 4 authors
Ramsa: A Large Sociolinguistically Rich Emirati Arabic Speech Corpus for ASR and TTS

Published on: 2026-03-09 1 author
Foley-Flow: Coordinated Video-to-Audio Generation with Masked Audio-Visual Alignment and Dynamic Conditional Flows

Published on: 2026-03-09 2 authors
EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery

Published on: 2026-03-09 12 authors
TRIAGE: Type-Routed Interventions via Aleatoric-Epistemic Gated Estimation in Robotic Manipulation and Adaptive Perception -- Don't Treat All Uncertainty the Same

Published on: 2026-03-09 7 authors

1 2 3 4 5 6 Next

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: