Papers
-
Post-Training with Policy Gradients: Optimality and the Base Model Barrier (University of Toronto)
-
Chart-RL: Generalized Chart Comprehension via Reinforcement Learning with Verifiable Rewards
-
Learning Quadruped Walking from Seconds of Demonstration (University of California)
-
A SISA-based Machine Unlearning Framework for Power Transformer Inter-Turn Short-Circuit Fault Localization (University of Texas)
-
Topology-Aware Reinforcement Learning over Graphs for Resilient Power Distribution Networks (University at Buffalo, University of Texas)
-
SurgCUT3R: Surgical Scene-Aware Continuous Understanding of Temporal 3D Representation (Imperial College London, Nanyang Technological University, University of Liverpool)
-
Conditional Unbalanced Optimal Transport Maps: An Outlier-Robust Framework for Conditional Generative Modeling (Sungkyunkwan University)
-
T2SGrid: Temporal-to-Spatial Gridification for Video Temporal Grounding (Guangdong Laboratory of Artificial Intelligence and Digital Economy, South China University of Technology)
-
Elenchus: Generating Knowledge Bases from Prover-Skeptic Dialogues (University of Amsterdam)
-
A Systematic Investigation of Document Chunking Strategies and Embedding Sensitivity (University of Canberra)
-
NePPO: Near-Potential Policy Optimization for General-Sum Multi-Agent Reinforcement Learning (Johns Hopkins University)
-
Diffusion Controller: Framework, Algorithms and Parameterization (Carnegie Mellon University, Yale University)
-
Optimizing Multi-Modal Models for Image-Based Shape Retrieval: The Role of Pre-Alignment and Hard Contrastive Learning (Delft University of Technology, Fraunhofer Institute for Computer Graphics Research)
-
Masked Unfairness: Hiding Causality within Zero ATE (Dartmouth College, Schmidt Center)
-
Perception-Aware Multimodal Spatial Reasoning from Monocular Images (Agency for Science, Technology and Research, Singapore; Massachusetts Institute of Technology; National University of Singapore)
-
ADAS-TO: A Large-Scale Multimodal Naturalistic Dataset and Empirical Characterization of Human Takeovers during ADAS Engagement (University of South Florida)
-
Foundational World Models Accurately Detect Bimanual Manipulator Failures (Stanford University)
-
MipSLAM: Alias-Free Gaussian Splatting SLAM (Harbin Institute of Technology, National University of Singapore)
-
Adaptive Discovery of Interpretable Audio Attributes with Multimodal LLMs for Low-Resource Classification
-
AdaGen: Learning Adaptive Policy for Image Synthesis
-
Large Language Model-Driven Full-Component Evolution of Adaptive Large Neighborhood Search
-
TrajPred: Trajectory-Conditioned Joint Embedding Prediction for Surgical Instrument-Tissue Interaction Recognition in Vision-Language Models
-
Combinatorial Allocation Bandits with Nonlinear Arm Utility (Keio University, The University of Tokyo)
-
SuperSkillsStack: Agency, Domain Knowledge, Imagination, and Taste in Human-AI Design Education (Singapore University of Technology and Design)
-
Can Safety Emerge from Weak Supervision? A Systematic Analysis of Small Language Models
-
TEA-Time: Transporting Effects Across Time (Duke University, Yale University)
-
AutoChecklist: Composable Pipelines for Checklist Generation and Scoring with LLM-as-a-Judge (University of Chicago)
-
RESCHED: Rethinking Flexible Job Shop Scheduling from a Transformer-based Architecture with Simplified States (Nanyang Technological University, Shandong University, Singapore Management University)
-
OV-DEIM: Real-time DETR-Style Open-Vocabulary Object Detection with GridSynthetic Augmentation (Carleton College, Guangdong Laboratory of Artificial Intelligence and Digital Economy, Institute for Research in Biomedicine, Shenzhen University)
-
Hit-RAG: Learning to Reason with Long Contexts via Preference Alignment (Huazhong University of Science and Technology, Shenzhen University of Advanced Technology, The City University of New York, Tongji University, University of Technology Sydney)
-
Enhancing Web Agents with a Hierarchical Memory Tree (Beijing Institute of Technology)
-
Language-Aware Distillation for Multilingual Instruction-Following Speech LLMs with ASR-Only Supervision (Agency for Science, Technology and Research, Singapore; Institute for Infocomm Research; Nanyang Technological University; National University of Singapore)
-
Permutation-Equivariant 2D State Space Models: Theory and Canonical Architecture for Multivariate Time Series (Korea University)
-
Resource-Adaptive Federated Text Generation with Differential Privacy (Oak Ridge National Laboratory)
-
Two Frames Matter: A Temporal Attack for Text-to-Video Model Jailbreaking (Beihang University, Wenzhou-Kean University)
-
Targeted Bit-Flip Attacks on LLM-Based Agents (Huazhong University of Science and Technology, National University of Singapore, Quan Cheng Laboratory, Tsinghua University)
-
Self-Supervised Multi-Modal World Model with 4D Space-Time Embedding (Allen Institute for AI, Arizona State University, Georgia Institute of Technology, Stanford University, University of Florida, University of Houston, University of Illinois Urbana-Champaign)
-
Fine-Grained 3D Facial Reconstruction for Micro-Expressions
-
Looking Back and Forth: Cross-Image Attention Calibration and Attentive Preference Learning for Multi-Image Hallucination Mitigation (Beijing Institute of Technology, Harbin Institute of Technology, Tsinghua University)
-
Hindsight Credit Assignment for Long-Horizon LLM Agents (City University of Hong Kong, Nanjing University)
-
Animating Petascale Time-varying Data on Commodity Hardware with LLM-assisted Scripting (University of Utah, Vanderbilt University)
-
Bi-directional digital twin prototype anchoring with multi-periodicity learning for few-shot fault diagnosis (Shanghai Jiao Tong University)
-
SODA: Sensitivity-Oriented Dynamic Acceleration for Diffusion Transformer (Harbin Institute of Technology)
-
MedSteer: Counterfactual Endoscopic Synthesis via Training-Free Activation Steering
-
User Review Writing via Interview with Dialogue Systems (The University of Electro-Communications)
-
VirtueBench: Evaluating Trustworthiness under Uncertainty in Long Video Understanding (Peking University)
-
The Talking Robot: Distortion-Robust Acoustic Models for Robot-Robot Communication (Georgia Institute of Technology, Institute of Science Tokyo)
-
Interpretable Maximum Margin Deep Anomaly Detection (Capital Normal University, Yunnan University)
-
Physics-Guided VLM Priors for All-Cloud Removal (Wuhan University)
-
Retinex Meets Language: A Physics-Semantics-Guided Underwater Image Enhancement Network (Ocean University of China)
