Papers
- Evaluation of LLMs in retrieving food and nutritional context for RAG systems
- OOD-MMSafe: Advancing MLLM Safety from Harmful Intent to Hidden Consequences
- From Phase Prediction to Phase Design: A ReAct Agent Framework for High-Entropy Alloy Discovery
- MUGEN: Evaluating and Improving Multi-audio Understanding of Large Audio-Language Models
- Does the Question Really Matter? Training-Free Data Selection for Vision-Language SFT
- AutoAgent: Evolving Cognition and Elastic Memory Orchestration for Adaptive Agents
- GSStream: 3D Gaussian Splatting based Volumetric Scene Streaming System
- FrameDiT: Diffusion Transformer with Frame-Level Matrix Attention for Efficient Video Generation
- RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation
- ES-dLLM: Efficient Inference for Diffusion Large Language Models by Early-Skipping
- A Multi-Prototype-Guided Federated Knowledge Distillation Approach in AI-RAN Enabled Multi-Access Edge Computing System
- EXPLORE-Bench: Egocentric Scene Prediction with Long-Horizon Reasoning
- FetalAgents: A Multi-Agent System for Fetal Ultrasound Image and Video Analysis
- $M^2$-Occ: Resilient 3D Semantic Occupancy Prediction for Autonomous Driving with Incomplete Camera Inputs
- Let's Reward Step-by-Step: Step-Aware Contrastive Alignment for Vision-Language Navigation in Continuous Environments
- ENIGMA-360: An Ego-Exo Dataset for Human Behavior Understanding in Industrial Scenarios
- Upper Generalization Bounds for Neural Oscillators
- LAP: A Language-Aware Planning Model For Procedure Planning In Instructional Videos
- Beyond Fine-Tuning: Robust Food Entity Linking under Ontology Drift with FoodOntoRAG
- LogoDiffuser: Training-Free Multilingual Logo Generation and Stylization via Letter-Aware Attention Control
- PanoAffordanceNet: Towards Holistic Affordance Grounding in 360° Indoor Environments
- Ego: Embedding-Guided Personalization of Vision-Language Models
- VCR: Variance-Driven Channel Recalibration for Robust Low-Light Enhancement
- Removing the Trigger, Not the Backdoor: Alternative Triggers and Latent Backdoors
- Global universality via discrete-time signatures
- World2Mind: Cognition Toolkit for Allocentric Spatial Reasoning in Foundation Models
- First Estimation of Model Parameters for Neutrino-Induced Nucleon Knockout Using Simulation-Based Inference
- EPIC-EuroParl-UdS: Information-Theoretic Perspectives on Translation and Interpreting
- Quantifying the Necessity of Chain of Thought through Opaque Serial Depth
- What is Missing? Explaining Neurons Activated by Absent Concepts
- A Hybrid Quantum-Classical Framework for Financial Volatility Forecasting Based on Quantum Circuit Born Machines
- Exploiting Label-Aware Channel Scoring for Adaptive Channel Pruning in Split Learning
- Information Theoretic Bayesian Optimization over the Probability Simplex
- Test-time Ego-Exo-centric Adaptation for Action Anticipation via Multi-Label Prototype Growing and Dual-Clue Consistency
- MITRA: An AI Assistant for Knowledge Retrieval in Physics Collaborations
- A Survey of Weight Space Learning: Understanding, Representation, and Generation
- Good Reasoning Makes Good Demonstrations: Implicit Reasoning Quality Supervision via In-Context Reinforcement Learning
- RA-SSU: Towards Fine-Grained Audio-Visual Learning with Region-Aware Sound Source Understanding
- Multi-Stream Perturbation Attack: Breaking Safety Alignment of Thinking LLMs Through Concurrent Task Interference
- Correction of Transformer-Based Models with Smoothing Pseudo-Projector
- ConfCtrl: Enabling Precise Camera Control in Video Diffusion via Confidence-Aware Interpolation
- One-Eval: An Agentic System for Automated and Traceable LLM Evaluation
- BrainSTR: Spatio-Temporal Contrastive Learning for Interpretable Dynamic Brain Network Modeling
- VLM-Loc: Localization in Point Cloud Maps via Vision-Language Models
- MA-EgoQA: Question Answering over Egocentric Videos from Multiple Embodied Agents
- Execution Is the New Attack Surface: Survivability-Aware Agentic Crypto Trading with OpenClaw-Style Local Executors
- Chow-Liu Ordering for Long-Context Reasoning in Chain-of-Agents
- CycleULM: A unified label-free deep learning framework for ultrasound localisation microscopy
- Equivariant Asynchronous Diffusion: An Adaptive Denoising Schedule for Accelerated Molecular Conformation Generation
- A Unified Hierarchical Multi-Task Multi-Fidelity Framework for Data-Efficient Surrogate Modeling in Manufacturing
