Papers
-
Locally Linear Continual Learning for Time Series based on VC-Theoretical Generalization Bounds
-
TheraAgent: Multi-Agent Framework with Self-Evolving Memory and Evidence-Calibrated Reasoning for PET Theranostics
-
Toward Scalable Co-located Practical Learning: Assisting with Computer Vision and Multimodal Analytics
-
Every Error has Its Magnitude: Asymmetric Mistake Severity Training for Multiclass Multiple Instance Learning
-
Preconditioned Test-Time Adaptation for Out-of-Distribution Debiasing in Narrative Generation
-
$τ$-Voice: Benchmarking Full-Duplex Voice Agents on Real-World Domains
-
When Should Humans Step In? Optimal Human Dispatching in AI-Assisted Decisions
-
Quantum-Enhanced Vision Transformer for Flood Detection using Remote Sensing Imagery
-
QuarkMedBench: A Real-World Scenario Driven Benchmark for Evaluating Large Language Models
-
Steering Generative Models for Accessibility: EasyRead Image Generation
-
Repetition Without Exclusivity: Scale Sensitivity of Referential Mechanisms in Child-Scale Language Models
-
SAATT Nav: a Socially Aware Autonomous Transparent Transportation Navigation Framework for Wheelchairs
-
Routing Channel-Patch Dependencies in Time Series Forecasting with Graph Spectral Decomposition
-
REFINE-DP: Diffusion Policy Fine-tuning for Humanoid Loco-manipulation via Reinforcement Learning
-
RSEdit: Text-Guided Image Editing for Remote Sensing
-
REAEDP: Entropy-Calibrated Differentially Private Data Release with Formal Guarantees and Attack-Based Evaluation
-
InterventionLens: A Multi-Agent Framework for Detecting ASD Intervention Strategies in Parent-Child Shared Reading
-
Sparse-Dense Mixture of Experts Adapter for Multi-Modal Tracking
-
Facial beauty prediction fusing transfer learning and broad learning system
-
Can We Trust LLMs on Memristors? Diving into Reasoning Ability under Non-Ideality
-
Data-driven Progressive Discovery of Physical Laws
-
Bodhi VLM: Privacy-Alignment Modeling for Hierarchical Visual Representations in Vision Backbones and VLM Encoders via Bottom-Up and Top-Down Feature Search
-
R3-REC: Reasoning-Driven Recommendation via Retrieval-Augmented LLMs over Multi-Granular Interest Signals
-
Implicit Maximum Likelihood Estimation for Real-time Generative Model Predictive Control
-
UniVid: Pyramid Diffusion Model for High Quality Video Generation
-
Sky2Ground: A Benchmark for Site Modeling under Varying Altitude
-
Ego-1K -- A Large-Scale Multiview Video Dataset for Egocentric Vision
-
Few Batches or Little Memory, But Not Both: Simultaneous Space and Adaptivity Constraints in Stochastic Bandits
-
Research Paradigm of Materials Science Tetrahedra with Artificial Intelligence
-
Multi-Object Advertisement Creative Generation
-
Sub-Band Spectral Matching with Localized Score Aggregation for Robust Anomalous Sound Detection
-
Spectral Edge Dynamics of Training Trajectories: Signal--Noise Geometry Across Scales
-
Manifold-Orthogonal Dual-spectrum Extrapolation for Parameterized Physics-Informed Neural Networks
-
MeTok: An Efficient Meteorological Tokenization with Hyper-Aligned Group Learning for Precipitation Nowcasting
-
Exploration-assisted Bottleneck Transition Toward Robust and Data-efficient Deformable Object Manipulation
-
QTrack: Query-Driven Reasoning for Multi-modal MOT
-
Multimodal Emotion Regression with Multi-Objective Optimization and VAD-Aware Audio Modeling for the 10th ABAW EMI Track
-
Level Up: Defining and Exploiting Transitional Problems for Curriculum Learning
-
Knowledge Distillation for Large Language Models
-
Causal Tracing of Audio-Text Fusion in Large Audio Language Models
-
PhysAlign: Physics-Coherent Image-to-Video Generation through Feature and 3D Representation Alignment
-
Brain Tumor Classification from 3D MRI Using Persistent Homology and Betti Features: A Topological Data Analysis Approach on BraTS2020
-
LiveWeb-IE: A Benchmark For Online Web Information Extraction
-
Retrieval-Feedback-Driven Distillation and Preference Alignment for Efficient LLM-based Query Expansion
-
Generate Then Correct: Single Shot Global Correction for Aspect Sentiment Quad Prediction
-
AD-Copilot: A Vision-Language Assistant for Industrial Anomaly Detection via Visual In-context Comparison
-
Your Vision-Language-Action Model Already Has Attention Heads For Path Deviation Detection
-
RetimeGS: Continuous-Time Reconstruction of 4D Gaussian Splatting
-
Projection-Free Evolution Strategies for Continuous Prompt Search
-
Advancing Cancer Prognosis with Hierarchical Fusion of Genomic, Proteomic and Pathology Imaging Data from a Systems Biology Perspective
