Papers
-
Agentic LLM Planning via Step-Wise PDDL Simulation: An Empirical Characterisation
Austrian Institute of Technology
-
Evaluating Austrian A-Level German Essays with Large Language Models for Automated Essay Scoring
Salzburg University of Applied Sciences
-
Aggregative Semantics for Quantitative Bipolar Argumentation Frameworks
Sorbonne University
-
Text-Driven Emotionally Continuous Talking Face Generation
Harbin Institute of Technology
-
Lifelong Embodied Navigation Learning
Chinese Academy of Sciences, Mohamed bin Zayed University of Artificial Intelligence, University of Chinese Academy of Sciences
-
StreamVoiceAnon+: Emotion-Preserving Streaming Speaker Anonymization via Frame-Level Acoustic Distillation
Agency for Science, Technology and Research, Singapore, Institute for Infocomm Research, Nanyang Technological University, The Hong Kong Polytechnic University
-
Lyapunov Probes for Hallucination Detection in Large Foundation Models
Beihang University, Beijing Academy of Blockchain and Edge Computing, National University of Defense Technology, ShanghaiTech University
-
Offline Materials Optimization with CliqueFlowmer
-
Experiences Build Characters: The Linguistic Origins and Functional Impact of LLM Personality
The University of Sheffield, University of Oklahoma
-
DeepSight: Bridging Depth Maps and Language with a Depth-Driven Multimodal Model
Harbin Institute of Technology
-
Enhancing Neural Video Compression of Static Scenes with Positive-Incentive Noise
TeleAI
-
Enhancing Instruction Following of LLMs via Activation Steering with Dynamic Rejection
Yonsei University
-
ButterflyViT: 354$\times$ Expert Compression for Edge Vision Transformers
Indian Institute of Information Technology
-
Latent Diffusion-Based 3D Molecular Recovery from Vibrational Spectra
State Key Laboratory of Precision and Intelligent Chemistry, University of Birmingham, University of Science and Technology of China
-
Making Implicit Premises Explicit in Logical Understanding of Enthymemes
University College London
-
Dynamic Momentum Recalibration in Online Gradient Learning
Northeastern University, Shenyang University of Chemical Technology, University of Louisville
-
FedARKS: Federated Aggregation via Robust and Discriminative Knowledge Selection and Integration for Person Re-identification
Wuhan University of Science and Technology
-
Diffusion Language Models Are Natively Length-Aware
Bocconi University
-
A Hazard-Informed Data Pipeline for Robotics Physical Safety
-
DQE: A Semantic-Aware Evaluation Metric for Time Series Anomaly Detection
Hangzhou Dianzi University, Zhejiang University
-
A Causal Graph Approach to Oppositional Narrative Analysis
University of Deusto
-
Cross-Resolution Distribution Matching for Diffusion Distillation
-
Partial Policy Gradients for RL in LLMs
-
Place-it-R1: Unlocking Environment-aware Reasoning Potential of MLLM for Video Object Insertion
-
Spatial Colour Mixing Illusions as a Perception Stress Test for Vision-Language Models
Politehnica Bucuresti
-
Predictive Coding Graphs are a Superset of Feedforward Neural Networks
Utrecht University
-
Longitudinal NSCLC Treatment Progression via Multimodal Generative Models
Umeå University, Università Campus Bio-Medico di Roma, University of Basel, University of Genoa
-
Property-driven Protein Inverse Folding With Multi-Objective Preference Alignment
HEC Montréal, Mila–Quebec AI Institute, Peking University, Université de Montréal
-
VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models
University of Edinburgh
-
Ensemble Graph Neural Networks for Probabilistic Sea Surface Temperature Forecasting via Input Perturbations
University of Illinois Urbana-Champaign
-
Efficient Vector Search in the Wild: One Model for Multi-K Queries
-
Do Compact SSL Backbones Matter for Audio Deepfake Detection? A Controlled Study with RAPTOR
Idiap Research Institute, Mohamed bin Zayed University of Artificial Intelligence, Tallinn University of Technology
-
Reflective Flow Sampling Enhancement
Hong Kong Baptist University, Hong Kong University of Science and Technology, The University of Tokyo
-
FreeOcc: Training-free Panoptic Occupancy Prediction via Foundation Models
-
A Semi-Supervised Framework for Breast Ultrasound Segmentation with Training-Free Pseudo-Label Generation and Label Refinement
Sendai College, Southeast University, Sun Yat-sen University Cancer Center, Tohoku University, Tohoku University Graduate School of Medicine, University of Chemistry and Technology Prague, Zhejiang University
-
JOPP-3D: Joint Open Vocabulary Semantic Segmentation on Point Clouds and Panoramas
Ricoh / German Research Center for Artificial Intelligence, Rheinland-Pfälzische Technische Universität Kaiserslautern
-
Robotic Foundation Models for Industrial Control: A Comprehensive Survey and Readiness Assessment Framework
Universität Wuppertal
-
XMACNet: An Explainable Lightweight Attention based CNN with Multi Modal Fusion for Chili Disease Classification
Angel College of Engineering and Technology, Bannari Amman Institute of Technology, Vel Tech Rangarajan Dr.Sagunthala R&D Institute of Science and Technology, Vellore Institute of Technology
-
Optimizing 3D Diffusion Models for Medical Imaging via Multi-Scale Reward Learning
University of Sussex, University of Toronto
-
Making Training-Free Diffusion Segmentors Scale with the Generative Power
Alibaba / Chinese Academy of Sciences, Sun Yat-sen University, University of Chinese Academy of Sciences
-
Contrastive-to-Self-Supervised: A Two-Stage Framework for Script Similarity Learning
National Research Institute for Agriculture, Food and the Environment, University of Haute, University Paris-Saclay
-
Towards Motion Turing Test: Evaluating Human-Likeness in Humanoid Robots
-
CRIMSON: A Clinically-Grounded LLM-Based Metric for Generative Radiology Report Evaluation
Harvard Medical School
-
SpaCRD: Multimodal Deep Fusion of Histology and Spatial Transcriptomics for Cancer Region Detection
Wuhan University, Yunnan University, Zhongnan University of Economics and Law
-
Random Quadratic Form on a Sphere: Synchronization by Common Noise
Freie Universität Berlin, University of Amsterdam
-
Whisper-CD: Accurate Long-Form Speech Recognition using Multi-Negative Contrastive Decoding
Sungkyunkwan University
-
MAPO: Mixed Advantage Policy Optimization for Long-Horizon Multi-Turn Dialogue
-
Wisdom of the AI Crowd (AI-CROWD) for Ground Truth Approximation in Content Analysis: A Research Protocol & Validation Using Eleven Large Language Models
Carlos III University of Madrid, University of Alcalá
-
LIT-RAGBench: Benchmarking Generator Capabilities of Large Language Models in Retrieval-Augmented Generation
-
Latent Autoencoder Ensemble Kalman Filter for Data assimilation
National University of Singapore, Southeast University
