Papers
-
Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights
Massachusetts Institute of Technology
-
Temporal Straightening for Latent Planning
Brown University, New York University, University of Toronto
-
Copula-ResLogit: A Deep-Copula Framework for Unobserved Confounding Effects
Toronto Metropolitan University
-
Conversational AI-Enhanced Exploration System to Query Large-Scale Digitised Collections of Natural History Museums
University of Technology Sydney
-
MultiwayPAM: Multiway Partitioning Around Medoids for LLM-as-a-Judge Score Analysis
-
Quantum entanglement provides a competitive advantage in adversarial games
CSIRO Clayton
-
Hybrid Self-evolving Structured Memory for GUI Agents
-
Simulation-in-the-Reasoning (SiR): A Conceptual Framework for Empirically Grounded AI in Autonomous Transportation
-
GaLoRA: Parameter-Efficient Graph-Aware LLMs for Node Classification
San Jose State University
-
Regime-aware financial volatility forecasting via in-context learning
Stanford University, University of Toronto
-
From Imitation to Intuition: Intrinsic Reasoning for Open-Instance Video Classification
-
What do near-optimal learning rate schedules look like?
-
How to make the most of your masked language model for protein engineering
-
Is this Idea Novel? An Automated Benchmark for Judgment of Research Ideas
National Institute of Informatics, Technische Universität Dresden
-
Data-Driven Integration Kernels for Interpretable Nonlocal Operator Learning
NVIDIA / Boston University, Lamont-Doherty Earth Observatory, New York University, University of California, University of Lausanne
-
Large language models can disambiguate opioid slang on social media
Stanford University
-
The Orthogonal Vulnerabilities of Generative AI Watermarks: A Comparative Empirical Benchmark of Spatial and Latent Provenance
Millburn High School, Williamsville East High School
-
NasoVoce: A Nose-Mounted Low-Audibility Speech Interface for Always-Available Speech Interaction
-
PC-Diffuser: Path-Consistent Capsule CBF Safety Filtering for Diffusion-Based Trajectory Planner
Texas A&M University
-
Does Reasoning Make Search More Fair? Comparing Fairness in Reasoning and Non-Reasoning Rerankers
Human Language Technology Center of Excellence, Johns Hopkins University
-
Fuel Gauge: Estimating Chain-of-Thought Length Ahead of Time in Large Multimodal Models
University of Texas
-
Overcoming Visual Clutter in Vision Language Action Models via Concept-Gated Visual Distillation
University of Technology Sydney, Western Sydney University
-
Federated Active Learning Under Extreme Non-IID and Global Class Imbalance
Nanjing University of Aeronautics and Astronautics
-
On The Complexity of Best-Arm Identification in Non-Stationary Linear Bandits
University of Washington
-
EmoStory: Emotion-Aware Story Generation
Shenzhen University
-
Mitigating Translationese Bias in Multilingual LLM-as-a-Judge via Disentangled Information Bottleneck
-
StyleGallery: Training-free and Semantic-aware Personalized Style Transfer from Arbitrary Image References
Hunan University, National University of Defense Technology
-
Utility Function is All You Need: LLM-based Congestion Control
Akamai Technologies / Fraunhofer-Institut für Sichere Informationstechnologie, Technische Universität Berlin, Weizenbaum Institute
-
HEAL: Hindsight Entropy-Assisted Learning for Reasoning Distillation
-
One Token, Two Fates: A Unified Framework via Vision Token Manipulation Against MLLMs Hallucination
Nanjing University, Southeast University
-
Geometric Autoencoder for Diffusion Models
Shanghai Innovation Institute, Tsinghua University
-
Dynamic Knowledge Fusion for Multi-Domain Dialogue State Tracking
-
Beyond Interleaving: Causal Attention Reformulations for Generative Recommender Systems
-
GeoSense: Internalizing Geometric Necessity Perception for Multimodal Reasoning
Mohamed bin Zayed University of Artificial Intelligence, Stanford University, University of Chinese Academy of Sciences, University of Science and Technology of China
-
Speech Codec Probing from Semantic and Phonetic Perspectives
University of Southern California
-
Edge-Assisted Multi-Robot Visual-Inertial SLAM with Efficient Communication
The Institute of Electrical and Electronics Engineers
-
Few-Shot Adaptation to Non-Stationary Environments via Latent Trend Embedding for Robotics
Ichinoseki College, Ritsumeikan University, The University of Osaka
-
Reactive Writers: How Co-Writing with AI Changes How We Engage with Ideas
Bauhaus University, Cornell Tech, Princeton University, University of Washington
-
Causal Concept Graphs in LLM Latent Space for Stepwise Reasoning
Daffodil International University, New York University
-
Optimal Expert-Attention Allocation in Mixture-of-Experts: A Scalable Law for Dynamic Model Design
-
Beyond Scalars: Evaluating and Understanding LLM Reasoning via Geometric Progress and Stability
Chinese Academy of Sciences, King Abdullah University of Science and Technology (KAUST), Mohamed bin Zayed University of Artificial Intelligence, Provable Responsible AI and Data Analytics Lab, The Hong Kong Polytechnic University, University of Chinese Academy of Sciences
-
Variance-Aware Adaptive Weighting for Diffusion Model Training
Kennesaw State University
-
Safe Probabilistic Planning for Human-Robot Interaction using Conformal Risk Control
University of Washington
-
Graph-GRPO: Training Graph Flow Models with Reinforcement Learning
Beihang University, Beijing University of Posts and Telecommunications, National University of Singapore, The University of Sheffield
-
Verbalizing LLM's Higher-order Uncertainty via Imprecise Probabilities
CISPA Helmholtz Center for Information Security, Manchester Centre for AI Fundamentals, Nanyang Technological University, The University of Manchester, The University of Tokyo
-
On the Learning Dynamics of Two-layer Linear Networks with Label Noise SGD
Peking University, RIKEN Center for Advanced Intelligence Project, Shanghai Jiao Tong University, The Institute of Statistical Mathematics, The University of Tokyo
-
Multi-Person Pose Estimation Evaluation Using Optimal Transportation and Improved Pose Matching
-
GLM-OCR Technical Report
-
Just-in-Time: Training-Free Spatial Acceleration for Diffusion Transformers
Capital Normal University, University of Electronic Science and Technology of China
-
DiT4DiT: Jointly Modeling Video Dynamics and Actions for Generalizable Robot Control
