Papers
- How to make the most of your masked language model for protein engineering
- Is this Idea Novel? An Automated Benchmark for Judgment of Research Ideas
- Data-Driven Integration Kernels for Interpretable Nonlocal Operator Learning
- Large language models can disambiguate opioid slang on social media
- The Orthogonal Vulnerabilities of Generative AI Watermarks: A Comparative Empirical Benchmark of Spatial and Latent Provenance
- NasoVoce: A Nose-Mounted Low-Audibility Speech Interface for Always-Available Speech Interaction
- PC-Diffuser: Path-Consistent Capsule CBF Safety Filtering for Diffusion-Based Trajectory Planner
- Does Reasoning Make Search More Fair? Comparing Fairness in Reasoning and Non-Reasoning Rerankers
- Fuel Gauge: Estimating Chain-of-Thought Length Ahead of Time in Large Multimodal Models
- Overcoming Visual Clutter in Vision Language Action Models via Concept-Gated Visual Distillation
- Federated Active Learning Under Extreme Non-IID and Global Class Imbalance
- On The Complexity of Best-Arm Identification in Non-Stationary Linear Bandits
- EmoStory: Emotion-Aware Story Generation
- Mitigating Translationese Bias in Multilingual LLM-as-a-Judge via Disentangled Information Bottleneck
- StyleGallery: Training-free and Semantic-aware Personalized Style Transfer from Arbitrary Image References
- Utility Function is All You Need: LLM-based Congestion Control
- HEAL: Hindsight Entropy-Assisted Learning for Reasoning Distillation
- One Token, Two Fates: A Unified Framework via Vision Token Manipulation Against MLLMs Hallucination
- Geometric Autoencoder for Diffusion Models
- Dynamic Knowledge Fusion for Multi-Domain Dialogue State Tracking
- Beyond Interleaving: Causal Attention Reformulations for Generative Recommender Systems
- GeoSense: Internalizing Geometric Necessity Perception for Multimodal Reasoning
- Speech Codec Probing from Semantic and Phonetic Perspectives
- Edge-Assisted Multi-Robot Visual-Inertial SLAM with Efficient Communication
- Few-Shot Adaptation to Non-Stationary Environments via Latent Trend Embedding for Robotics
- Reactive Writers: How Co-Writing with AI Changes How We Engage with Ideas
- Causal Concept Graphs in LLM Latent Space for Stepwise Reasoning
- Optimal Expert-Attention Allocation in Mixture-of-Experts: A Scalable Law for Dynamic Model Design
- Beyond Scalars: Evaluating and Understanding LLM Reasoning via Geometric Progress and Stability
- Variance-Aware Adaptive Weighting for Diffusion Model Training
- Safe Probabilistic Planning for Human-Robot Interaction using Conformal Risk Control
- Graph-GRPO: Training Graph Flow Models with Reinforcement Learning
- Verbalizing LLM's Higher-order Uncertainty via Imprecise Probabilities
- On the Learning Dynamics of Two-layer Linear Networks with Label Noise SGD
- Multi-Person Pose Estimation Evaluation Using Optimal Transportation and Improved Pose Matching
- GLM-OCR Technical Report
- Just-in-Time: Training-Free Spatial Acceleration for Diffusion Transformers
- DiT4DiT: Jointly Modeling Video Dynamics and Actions for Generalizable Robot Control
- MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production
- Quality over Quantity: Demonstration Curation via Influence Functions for Data-Centric Robot Learning
- Adaptive Active Learning for Online Reliability Prediction of Satellite Electronics
- Dynamic Multi-period Experts for Online Time Series Forecasting
- Learning Adaptive LLM Decoding
- Verifying Good Regulator Conditions for Hypergraph Observers: Natural Gradient Learning from Causal Invariance via Established Theorems
- Intelligent Spatial Estimation for Fire Hazards in Engineering Sites: An Enhanced YOLOv8-Powered Proximity Analysis Framework
- A Text-Native Interface for Generative Video Authoring
- Exclusive Self Attention
- GST-VLA: Structured Gaussian Spatial Tokens for 3D Depth-Aware Vision-Language-Action Models
- PPO-Based Hybrid Optimization for RIS-Assisted Semantic Vehicular Edge Computing
- OmniEdit: A Training-free framework for Lip Synchronization and Audio-Visual Editing
