Papers
-
Residual SODAP: Residual Self-Organizing Domain-Adaptive Prompting with Structural Knowledge Preservation for Continual Learning
-
Adaptive Vision-Language Model Routing for Computer Use Agents
-
NanoVDR: Distilling a 2B Vision-Language Retriever into a 70M Text-Only Encoder for Visual Document Retrieval
-
Rethinking Multiple-Choice Questions for RLVR: Unlocking Potential via Distractor Design
-
From AI Weather Prediction to Infrastructure Resilience: A Correction-Downscaling Framework for Tropical Cyclone Impacts
-
coDrawAgents: A Multi-Agent Dialogue Framework for Compositional Image Generation
-
Hierarchical Dual-Change Collaborative Learning for UAV Scene Change Captioning
-
Mask2Flow-TSE: Two-Stage Target Speaker Extraction with Masking and Flow Matching
-
DAST: A Dual-Stream Voice Anonymization Attacker with Staged Training
-
Multimodal Protein Language Models for Enzyme Kinetic Parameters: From Substrate Recognition to Conformational Adaptation
-
Hierarchical Reference Sets for Robust Unsupervised Detection of Scattered and Clustered Outliers
-
Vision-Language Based Expert Reporting for Painting Authentication and Defect Detection
-
Team LEYA in 10th ABAW Competition: Multimodal Ambivalence/Hesitancy Recognition Approach
-
On Linear Separability of the MNIST Handwritten Digits Dataset
-
Draft-and-Target Sampling for Video Generation Policy
-
Wear Classification of Abrasive Flap Wheels using a Hierarchical Deep Learning Approach
-
I Know What I Don't Know: Latent Posterior Factor Models for Multi-Evidence Probabilistic Reasoning
-
Composing Driving Worlds through Disentangled Control for Adversarial Scenario Generation
-
Surrogates for Physics-based and Data-driven Modelling of Parametric Systems: Review and New Perspectives
-
CLARIN-PT-LDB: An Open LLM Leaderboard for Portuguese to assess Language, Culture and Civility
-
TRACE: Structure-Aware Character Encoding for Robust and Generalizable Document Watermarking
-
Test-time RL alignment exposes task familiarity artifacts in LLM benchmarks
-
Explainable AI Using Inherently Interpretable Components for Wearable-based Health Monitoring
-
Enhanced Drug-drug Interaction Prediction Using Adaptive Knowledge Integration
-
A protocol for evaluating robustness to H&E staining variation in computational pathology models
-
Forecasting Epileptic Seizures from Contactless Camera via Cross-Species Transfer Learning
-
Finite Difference Flow Optimization for RL Post-Training of Text-to-Image Models
-
Human-Centered Evaluation of an LLM-Based Process Modeling Copilot: A Mixed-Methods Study with Domain Experts
-
A theory of learning data statistics in diffusion models, from easy to hard
-
Spectral-Geometric Neural Fields for Pose-Free LiDAR View Synthesis
-
Bayesian Uncertainty-Aware MRI Reconstruction
-
DirPA: Addressing Prior Shift in Imbalanced Few-shot Crop-type Classification
-
Learning from Child-Directed Speech in Two-Language Scenarios: A French-English Case Study
-
Improving Channel Estimation via Multimodal Diffusion Models with Flow Matching
-
FedBPrompt: Federated Domain Generalization Person Re-Identification via Body Distribution Aware Visual Prompts
-
Stake the Points: Structure-Faithful Instance Unlearning
-
Surprised by Attention: Predictable Query Dynamics for Time Series Anomaly Detection
-
VIRD: View-Invariant Representation through Dual-Axis Transformation for Cross-View Pose Estimation
-
HMS-BERT: Hybrid Multi-Task Self-Training for Multilingual and Multi-Label Cyberbullying Detection
-
Filtered Spectral Projection for Quantum Principal Component Analysis
-
ODRL Policy Comparison Through Normalisation
-
Rethinking VLMs for Image Forgery Detection and Localization
-
DS$^2$-Instruct: Domain-Specific Data Synthesis for Large Language Models Instruction Tuning
-
Efficient and Interpretable Multi-Agent LLM Routing via Ant Colony Optimization
-
MotionAnymesh: Physics-Grounded Articulation for Simulation-Ready Digital Twins
-
SGMatch: Semantic-Guided Non-Rigid Shape Matching with Flow Regularization
-
Thinking in Streaming Video
-
Reinforcing the Weakest Links: Modernizing SIENA with Targeted Deep Learning Integration
-
Delta1 with LLM: symbolic and neural integration for credible and explainable reasoning
-
NormCode Canvas: Making LLM Agentic Workflows Development Sustainable via Case-Based Reasoning
