Papers
-
Transductive Generalization via Optimal Transport and Its Application to Graph Node Classification
-
Multimodal Graph Representation Learning with Dynamic Information Pathways
-
Implicit Geometry Representations for Vision-and-Language Navigation from Web Videos
-
ForgeDreamer: Industrial Text-to-3D Generation with Multi-Expert LoRA and Cross-View Hypergraph
-
Logos: An evolvable reasoning engine for rational molecular design
-
DendroNN: Dendrocentric Neural Networks for Energy-Efficient Classification of Event-Based Data
-
On Regret Bounds of Thompson Sampling for Bayesian Optimization
-
Speeding Up the Learning of 3D Gaussians with Much Shorter Gaussian Lists
-
From Ideal to Real: Stable Video Object Removal under Imperfect Conditions
-
Learning Convex Decomposition via Feature Fields
-
CogBlender: Towards Continuous Cognitive Intervention in Text-to-Image Generation
-
Exploring Modality-Aware Fusion and Decoupled Temporal Propagation for Multi-Modal Object Tracking
-
Proxy-Guided Measurement Calibration
-
TASER: Task-Aware Spectral Energy Refine for Backdoor Suppression in UAV Swarms Decentralized Federated Learning
-
DenoiseSplat: Feed-Forward Gaussian Splatting for Noisy 3D Scene Reconstruction
-
See, Plan, Rewind: Progress-Aware Vision-Language-Action Models for Robust Robotic Manipulation
-
Diagnosing and Repairing Citation Failures in Generative Engine Optimization
-
TA-Mem: Tool-Augmented Autonomous Memory Retrieval for LLM in Long-Term Conversational QA
-
Rescaling Confidence: What Scale Design Reveals About LLM Metacognition
-
A Gaussian Comparison Theorem for Training Dynamics in Machine Learning
-
IntroSVG: Learning from Rendering Feedback for Text-to-SVG Generation via an Introspective Generator-Critic Framework
-
Curveball Steering: The Right Direction To Steer Isn't Always Linear
-
CLoE: Expert Consistency Learning for Missing Modality Segmentation
-
NLiPsCalib: An Efficient Calibration Framework for High-Fidelity 3D Reconstruction of Curved Visuotactile Sensors
-
SpaceSense-Bench: A Large-Scale Multi-Modal Benchmark for Spacecraft Perception and Pose Estimation
-
Reading the Mood Behind Words: Integrating Prosody-Derived Emotional Context into Socially Responsive VR Agents
-
OddGridBench: Exposing the Lack of Fine-Grained Visual Discrepancy Sensitivity in Multimodal Large Language Models
-
Reward-Zero: Language Embedding Driven Implicit Reward Mechanisms for Reinforcement Learning
-
TimberAgent: Gram-Guided Retrieval for Executable Music Effect Control
-
Beyond Scaling: Assessing Strategic Reasoning and Rapid Decision-Making Capability of LLMs in Zero-sum Environments
-
Predictive Spectral Calibration for Source-Free Test-Time Regression
-
TaSR-RAG: Taxonomy-guided Structured Reasoning for Retrieval-Augmented Generation
-
Robust Regularized Policy Iteration under Transition Uncertainty
-
Robust Provably Secure Image Steganography via Latent Iterative Optimization
-
TA-GGAD: Testing-time Adaptive Graph Model for Generalist Graph Anomaly Detection
-
Interactive 3D visualization of surface roughness predictions in additive manufacturing: A data-driven framework
-
Democratising Clinical AI through Dataset Condensation for Classical Clinical Models
-
Evidential Perfusion Physics-Informed Neural Networks with Residual Uncertainty Quantification
-
M3GCLR: Multi-View Mini-Max Infinite Skeleton-Data Game Contrastive Learning For Skeleton-Based Action Recognition
-
From Representation to Clusters: A Contrastive Learning Approach for Attributed Hypergraph Clustering
-
Flow Field Reconstruction via Voronoi-Enhanced Physics-Informed Neural Networks with End-to-End Sensor Placement Optimization
-
Quantifying and extending the coverage of spatial categorization data sets
-
MIL-PF: Multiple Instance Learning on Precomputed Features for Mammography Classification
-
SinGeo: Unlock Single Model's Potential for Robust Cross-View Geo-Localization
-
SPAARS: Safer RL Policy Alignment through Abstract Exploration and Refined Exploitation of Action Space
-
EventVGGT: Exploring Cross-Modal Distillation for Consistent Event-based Depth Estimation
-
Training-Free Coverless Multi-Image Steganography with Access Control
-
Physics-Informed Neural Engine Sound Modeling with Differentiable Pulse-Train Synthesis
-
ICDAR 2025 Competition on End-to-End Document Image Machine Translation Towards Complex Layouts
-
LLM as a Meta-Judge: Synthetic Data for NLP Evaluation Metric Validation
