Papers
-
Slim attention: cut your context memory in half without loss – K-cache is all you need for MHA
-
Single-Pass Discrete Diffusion Predicts High-Affinity Peptide Binders at >1,000 Sequences per Second across 150 Receptor Targets
-
\texttt{BayesBreak}: Generalized Hierarchical Bayesian Segmentation with Irregular Designs, Multi-Sample Hierarchies, and Grouped/Latent-Group Designs
-
E2EGS: Event-to-Edge Gaussian Splatting for Pose-Free 3D Reconstruction
-
MVHOI: Bridge Multi-view Condition to Complex Human-Object Interaction Video Reenactment via 3D Foundation Model
-
AgentTrace: Causal Graph Tracing for Root Cause Analysis in Deployed Multi-Agent Systems
-
Applications of Intuitionistic Temporal Logic to Temporal Answer Set Programming
-
Scaling Autoregressive Models for Lattice Thermodynamics
-
Design Space of Self--Consistent Electrostatic Machine Learning Interatomic Potentials
-
AURORA-KITTI: Any-Weather Depth Completion and Denoising in the Wild
-
Fractal Autoregressive Depth Estimation with Continuous Token Diffusion
-
Beyond Local Code Optimization: Multi-Agent Reasoning for Software System Optimization
-
Chain-of-Trajectories: Unlocking the Intrinsic Generative Optimality of Diffusion Models via Graph-Theoretic Planning
-
AdapterTune: Zero-Initialized Low-Rank Adapters for Frozen Vision Transformers
-
Transition Flow Matching
-
Visual Confused Deputy: Exploiting and Defending Perception Failures in Computer-Using Agents
-
Cross-RAG: Zero-Shot Retrieval-Augmented Time Series Forecasting via Cross-Attention
-
Towards Next-Generation LLM Training: From the Data-Centric Perspective
-
Training-Free Generation of Protein Sequences from Small Family Alignments via Stochastic Attention
-
Multimodal Deep Learning for Early Prediction of Patient Deterioration in the ICU: Integrating Time-Series EHR Data with Clinical Notes
-
Beyond Creed: A Non-Identity Safety Condition A Strong Empirical Alternative to Identity Framing in Low-Data LoRA Fine-Tuning
-
GameUIAgent: An LLM-Powered Framework for Automated Game UI Design with Structured Intermediate Representation
-
Enhancing Hands in 3D Whole-Body Pose Estimation with Conditional Hands Modulator
-
Automated Diabetic Screening via Anterior Segment Ocular Imaging: A Deep Learning and Explainable AI Approach
-
DeFRiS: Silo-Cooperative IoT Applications Scheduling via Decentralized Federated Reinforcement Learning
-
GNNVerifier: Graph-based Verifier for LLM Task Planning
-
Loosely-Structured Software: Engineering Context, Structure, and Evolution Entropy in Runtime-Rewired Multi-Agent Systems
-
Criterion-referenceability determines LLM-as-a-judge validity across physics assessment formats
-
A Skill-augmented Agentic Framework and Benchmark for Multi-Video Understanding
-
Gauge-Equivariant Intrinsic Neural Operators for Geometry-Consistent Learning of Elliptic PDE Maps
-
Efficient Event Camera Volume System
-
TrajMamba: An Ego-Motion-Guided Mamba Model for Pedestrian Trajectory Prediction from an Egocentric Perspective
-
PHAC: Promptable Human Amodal Completion
-
CAMD: Coverage-Aware Multimodal Decoding for Efficient Reasoning of Multimodal Large Language Models
-
Face-Guided Sentiment Boundary Enhancement for Weakly-Supervised Temporal Sentiment Localization
-
Learning Constituent Headedness
-
Towards Privacy-Preserving Machine Translation at the Inference Stage: A New Task and Benchmark
-
BrainBench: Exposing the Commonsense Reasoning Gap in Large Language Models
-
Online Learning for Supervisory Switching Control
-
LiDAR-EVS: Enhance Extrapolated View Synthesis for 3D Gaussian Splatting with Pseudo-LiDAR Supervision
-
Topology-Preserving Data Augmentation for Ring-Type Polygon Annotations
-
SSR: A Training-Free Approach for Streaming 3D Reconstruction
-
Investigating the Impact of Speech Enhancement on Audio Deepfake Detection in Noisy Environments
-
Understanding the geometry of deep learning with decision boundary volume
-
POLCA: Stochastic Generative Optimization with LLM
-
AnyPhoto: Multi-Person Identity Preserving Image Generation with ID Adaptive Modulation on Location Canvas
-
OpenHospital: A Thing-in-itself Arena for Evolving and Benchmarking LLM-based Collective Intelligence
-
Zero-Shot Reconstruction of Animatable 3D Avatars with Cloth Dynamics from a Single Image
-
HO-SFL: Hybrid-Order Split Federated Learning with Backprop-Free Clients and Dimension-Free Aggregation
-
$p^2$RAG: Privacy-Preserving RAG Service Supporting Arbitrary Top-$k$ Retrieval
