Papers
- Better Bounds for the Distributed Experts Problem
- Progressive Split Mamba: Effective State Space Modelling for Image Restoration
- Point Cloud as a Foreign Language for Multi-modal Large Language Model
- Differentiable Stochastic Traffic Dynamics: Physics-Informed Generative Modelling in Transportation
- Dissecting Chronos: Sparse Autoencoders Reveal Causal Feature Hierarchies in Time Series Foundation Models
- DuplexCascade: Full-Duplex Speech-to-Speech Dialogue with VAD-Free Cascaded ASR-LLM-TTS Pipeline and Micro-Turn Optimization
- Latent-DARM: Bridging Discrete Diffusion And Autoregressive Models For Reasoning
- DEO: Training-Free Direct Embedding Optimization for Negation-Aware Retrieval
- The Costs of Reproducibility in Music Separation Research: a Replication of Band-Split RNN
- Explainable Innovation Engine: Dual-Tree Agent-RAG with Methods-as-Nodes and Verifiable Write-Back
- $P^2$GNN: Two Prototype Sets to boost GNN Performance
- The Reasoning Trap -- Logical Reasoning as a Mechanistic Pathway to Situational Awareness
- The Radio-Frequency Transformer for Signal Separation
- Evaluate-as-Action: Self-Evaluated Process Rewards for Retrieval-Augmented Agents
- Emotion is Not Just a Label: Latent Emotional Factors in LLM Processing
- Strategically Robust Multi-Agent Reinforcement Learning with Linear Function Approximation
- Abundant Intelligence and Deficient Demand: A Macro-Financial Stress Test of Rapid AI Adoption
- Geometry-Aware Metric Learning for Cross-Lingual Few-Shot Sign Language Recognition on Static Hand Keypoints
- PrivPRISM: Automatically Detecting Discrepancies Between Google Play Data Safety Declarations and Developer Privacy Policies
- Why LLMs Fail: A Failure Analysis and Partial Success Measurement for Automated Security Patch Generation
- SPAR-K: Scheduled Periodic Alternating Early Exit for Spoken Language Models
- TubeMLLM: A Foundation Model for Topology Knowledge Exploration in Vessel-like Anatomy
- Embodied Human Simulation for Quantitative Design and Analysis of Interactive Robotics
- Distributed Convolutional Neural Networks for Object Recognition
- Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control
- LooComp: Leverage Leave-One-Out Strategy to Encoder-only Transformer for Efficient Query-aware Context Compression
- UniField: A Unified Field-Aware MRI Enhancement Framework
- Marginals Before Conditionals
- Cognitively Layered Data Synthesis for Domain Adaptation of LLMs to Space Situational Awareness
- How Contrastive Decoding Enhances Large Audio Language Models?
- Summarize Before You Speak with ARACH: A Training-Free Inference-Time Plug-In for Enhancing LLMs via Global Attention Reallocation
- HelixTrack: Event-Based Tracking and RPM Estimation of Propeller-like Objects
- BridgeDiff: Bridging Human Observations and Flat-Garment Synthesis for Virtual Try-Off
- RAE-NWM: Navigation World Model in Dense Visual Representation Space
- When Detectors Forget Forensics: Blocking Semantic Shortcuts for Generalizable AI-Generated Image Detection
- Towards Instance Segmentation with Polygon Detection Transformers
- Social-R1: Towards Human-like Social Reasoning in LLMs
- A Generative Sampler for distributions with possible discrete parameter based on Reversibility
- Efficient Reasoning at Fixed Test-Time Cost via Length-Aware Attention Priors and Gain-Aware Training
- Multi-model approach for autonomous driving: A comprehensive study on traffic sign-, vehicle- and lane detection and behavioral cloning
- Transductive Generalization via Optimal Transport and Its Application to Graph Node Classification
- Multimodal Graph Representation Learning with Dynamic Information Pathways
- Implicit Geometry Representations for Vision-and-Language Navigation from Web Videos
- ForgeDreamer: Industrial Text-to-3D Generation with Multi-Expert LoRA and Cross-View Hypergraph
- Logos: An evolvable reasoning engine for rational molecular design
- DendroNN: Dendrocentric Neural Networks for Energy-Efficient Classification of Event-Based Data
- On Regret Bounds of Thompson Sampling for Bayesian Optimization
- Speeding Up the Learning of 3D Gaussians with Much Shorter Gaussian Lists
- From Ideal to Real: Stable Video Object Removal under Imperfect Conditions
- Learning Convex Decomposition via Feature Fields
