Papers
- CogBlender: Towards Continuous Cognitive Intervention in Text-to-Image Generation
- Exploring Modality-Aware Fusion and Decoupled Temporal Propagation for Multi-Modal Object Tracking
- Proxy-Guided Measurement Calibration
- TASER: Task-Aware Spectral Energy Refine for Backdoor Suppression in UAV Swarms Decentralized Federated Learning
- DenoiseSplat: Feed-Forward Gaussian Splatting for Noisy 3D Scene Reconstruction
- See, Plan, Rewind: Progress-Aware Vision-Language-Action Models for Robust Robotic Manipulation
- Diagnosing and Repairing Citation Failures in Generative Engine Optimization
- TA-Mem: Tool-Augmented Autonomous Memory Retrieval for LLM in Long-Term Conversational QA
- Rescaling Confidence: What Scale Design Reveals About LLM Metacognition
- A Gaussian Comparison Theorem for Training Dynamics in Machine Learning
- IntroSVG: Learning from Rendering Feedback for Text-to-SVG Generation via an Introspective Generator-Critic Framework
- Curveball Steering: The Right Direction To Steer Isn't Always Linear
- CLoE: Expert Consistency Learning for Missing Modality Segmentation
- NLiPsCalib: An Efficient Calibration Framework for High-Fidelity 3D Reconstruction of Curved Visuotactile Sensors
- SpaceSense-Bench: A Large-Scale Multi-Modal Benchmark for Spacecraft Perception and Pose Estimation
- Reading the Mood Behind Words: Integrating Prosody-Derived Emotional Context into Socially Responsive VR Agents
- OddGridBench: Exposing the Lack of Fine-Grained Visual Discrepancy Sensitivity in Multimodal Large Language Models
- Reward-Zero: Language Embedding Driven Implicit Reward Mechanisms for Reinforcement Learning
- TimberAgent: Gram-Guided Retrieval for Executable Music Effect Control
- Beyond Scaling: Assessing Strategic Reasoning and Rapid Decision-Making Capability of LLMs in Zero-sum Environments
- Predictive Spectral Calibration for Source-Free Test-Time Regression
- TaSR-RAG: Taxonomy-guided Structured Reasoning for Retrieval-Augmented Generation
- Robust Regularized Policy Iteration under Transition Uncertainty
- Robust Provably Secure Image Steganography via Latent Iterative Optimization
- TA-GGAD: Testing-time Adaptive Graph Model for Generalist Graph Anomaly Detection
- Interactive 3D visualization of surface roughness predictions in additive manufacturing: A data-driven framework
- Democratising Clinical AI through Dataset Condensation for Classical Clinical Models
- Evidential Perfusion Physics-Informed Neural Networks with Residual Uncertainty Quantification
- M3GCLR: Multi-View Mini-Max Infinite Skeleton-Data Game Contrastive Learning For Skeleton-Based Action Recognition
- From Representation to Clusters: A Contrastive Learning Approach for Attributed Hypergraph Clustering
- Flow Field Reconstruction via Voronoi-Enhanced Physics-Informed Neural Networks with End-to-End Sensor Placement Optimization
- Quantifying and extending the coverage of spatial categorization data sets
- MIL-PF: Multiple Instance Learning on Precomputed Features for Mammography Classification
- SinGeo: Unlock Single Model's Potential for Robust Cross-View Geo-Localization
- SPAARS: Safer RL Policy Alignment through Abstract Exploration and Refined Exploitation of Action Space
- EventVGGT: Exploring Cross-Modal Distillation for Consistent Event-based Depth Estimation
- Training-Free Coverless Multi-Image Steganography with Access Control
- Physics-Informed Neural Engine Sound Modeling with Differentiable Pulse-Train Synthesis
- ICDAR 2025 Competition on End-to-End Document Image Machine Translation Towards Complex Layouts
- LLM as a Meta-Judge: Synthetic Data for NLP Evaluation Metric Validation
- Stochastic Port-Hamiltonian Neural Networks: Universal Approximation with Passivity Guarantees
- YOLO-NAS-Bench: A Surrogate Benchmark with Self-Evolving Predictors for YOLO Architecture Search
- RiO-DETR: DETR for Real-time Oriented Object Detection
- Reconstructing Movement from Sparse Samples: Enhanced Spatio-Temporal Matching Strategies for Low-Frequency Data
- Large Spikes in Stochastic Gradient Descent: A Large-Deviations View
- PromptDLA: A Domain-aware Prompt Document Layout Analysis Framework with Descriptive Knowledge as a Cue
- From Flow to One Step: Real-Time Multi-Modal Trajectory Policies via Implicit Maximum Likelihood Estimation-based Distribution Distillation
- Investigating Gender Stereotypes in Large Language Models via Social Determinants of Health
- CIGPose: Causal Intervention Graph Neural Network for Whole-Body Pose Estimation
- MetaDAT: Generalizable Trajectory Prediction via Meta Pre-training and Data-Adaptive Test-Time Updating
