Papers
-
ConflictBench: Evaluating Human-AI Conflict via Interactive and Visually Grounded Environments
-
DyLLM: Efficient Diffusion LLM Inference via Saliency-based Token Selection and Partial Attention
Seoul National University
-
Controllable Complex Human Motion Video Generation via Text-to-Skeleton Cascades
Munich Center for Machine Learning, Murdoch University, Technical University of Munich, The University of Western Australia
-
QualiTeacher: Quality-Conditioned Pseudo-Labeling for Real-World Image Restoration
Duke University, École Polytechnique Fédérale de Lausanne, National University of Singapore, Sun Yat-sen University, Tsinghua University
-
GCGNet: Graph-Consistent Generative Network for Time Series Forecasting with Exogenous Variables
East China Normal University
-
Solution to the 10th ABAW Expression Recognition Challenge: A Robust Multimodal Framework with Safe Cross-Attention and Modality Dropout
-
CDRRM: Contrast-Driven Rubric Generation for Reliable and Interpretable Reward Modeling
-
S2S-FDD: Bridging Industrial Time Series and Natural Language for Explainable Zero-shot Fault Diagnosis
Zhejiang University
-
Examining the Role of YouTube Production and Consumption Dynamics on the Formation of Extreme Ideologies
University of Iowa
-
Speed3R: Sparse Feed-forward 3D Reconstruction Models
Baidu AMU, The University of Hong Kong
-
See and Switch: Vision-Based Branching for Interactive Robot-Skill Programming
Czech Institute of Informatics, Robotics, and Cybernetics, Czech Technical University in Prague
-
Stabilized Fine-Tuning with LoRA in Federated Learning: Mitigating the Side Effect of Client Size and Rank via the Scaling Factor
Agency for Science, Technology and Research, Singapore, Beijing University of Posts and Telecommunications
-
ImageEdit-R1: Boosting Multi-Agent Image Editing via Reinforcement Learning
-
Adversarial Domain Adaptation Enables Knowledge Transfer Across Heterogeneous RNA-Seq Datasets
IBISC Laboratory, University Evry, University Paris-Saclay
-
Enhancing Cross-View UAV Geolocalization via LVLM-Driven Relational Modeling
City University of Hong Kong, Zhejiang University of Technology
-
Evaluating Generative Models via One-Dimensional Code Distributions
-
Deterministic Differentiable Structured Pruning for Large Language Models
Ant Group, Tsinghua University
-
In-Context Reinforcement Learning for Tool Use in Large Language Models
National University of Singapore, Salesforce AI Research, University of California, Berkeley, University of California, Santa Cruz
-
Synthetic Defect Image Generation for Power Line Insulator Inspection Using Multimodal Large Language Models
Wayne State University
-
AgentOS: From Application Silos to a Natural Language-Driven Data Ecosystem
Arizona State University, Clemson University, Duke University, University of Kansas
-
PlayWorld: Learning Robot World Models from Autonomous Play
Princeton University
-
AtomVLA: Scalable Post-Training for Robotic Manipulation via Predictive Latent World Models
INFIFORCE Intelligent Technology / Huazhong University of Science and Technology, The University of Hong Kong, Tsinghua University
-
Scale Space Diffusion
University of Maryland
-
Agentic Critical Training
University of Maryland
-
RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback
National University of Singapore, Shanghai AI Lab
-
PostTrainBench: Can LLM Agents Automate LLM Post-Training?
ELLIS Institute Tübingen, Max Planck Institute for Intelligent Systems, Tübingen AI Center, University of Tübingen
-
\$OneMillion-Bench: How Far are Language Agents from Human Experts?
-
How Far Can Unsupervised RLVR Scale LLM Training?
Peking University, Shanghai AI Lab, Shanghai Jiao Tong University, Tsinghua University, University of Illinois Urbana-Champaign, Xi’an Jiaotong University
-
Context-Enriched Natural Language Descriptions of Vessel Trajectories
-
From Garbage to Gold: A Data-Architectural Theory of Predictive Robustness
-
Sparsity and Out-of-Distribution Generalization
-
Feed m Birds with One Scone: Accelerating Multi-task Gradient Balancing via Bi-level Optimization
-
Deterministic Fuzzy Triage for Legal Compliance Classification and Evidence Retrieval
-
Can Large Language Models Keep Up? Benchmarking Online Adaptation to Continual Knowledge Streams
-
AQuA: Toward Strategic Response Generation for Ambiguous Visual Questions
-
Interpretable Aneurysm Classification via 3D Concept Bottleneck Models: Integrating Morphological and Hemodynamic Clinical Features
-
VIVECaption: A Split Approach to Caption Quality Improvement
-
Generalizing Linear Autoencoder Recommenders with Decoupled Expected Quadratic Loss
-
Prompt-Based Caption Generation for Single-Tooth Dental Images Using Vision-Language Models
-
Adaptive Capacity Allocation for Vision Language Action Fine-tuning
-
UnSCAR: Universal, Scalable, Controllable, and Adaptable Image Restoration
-
Safety Under Scaffolding: How Evaluation Conditions Shape Measured Safety
-
Machine Learning for the Internet of Underwater Things: From Fundamentals to Implementation
-
QdaVPR: A novel query-based domain-agnostic model for visual place recognition
-
Context Channel Capacity: An Information-Theoretic Framework for Understanding Catastrophic Forgetting
-
DualSpec: Accelerating Deep Research Agents via Dual-Process Action Speculation
-
Dynamic Vehicle Routing Problem with Prompt Confirmation of Advance Requests
-
AutoControl Arena: Synthesizing Executable Test Environments for Frontier AI Risk Evaluation
-
Disentangled Textual Priors for Diffusion-based Image Super-Resolution
-
OrthoFormer: Instrumental Variable Estimation in Transformer Hidden States via Neural Control Functions
