Papers
-
Improving Continual Learning for Gaussian Splatting based Environments Reconstruction on Commercial Off-the-Shelf Edge Devices
-
Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QAHamad Bin Khalifa University, Qatar Computing Research Institute
-
Spherical-GOF: Geometry-Aware Panoramic Gaussian Opacity Fields for 3D Scene ReconstructionNational Engineering Research Center of Robot Visual Perception and Control Technology, Hunan University, School of Artificial Intelligence and Robotics, Hunan University
-
Echo2ECG: Enhancing ECG Representations with Cardiac Morphology from Multi-View EchosImperial College London, Munich Center for Machine Learning, Technical University of Munich, TUM University Hospital, Department of Cardiology, TUM University Hospital, Department of Computing Imperial
-
Oracle-Guided Soft Shielding for Safe Move Prediction in ChessChips JU, Laboratoire d'Intégration de Systèmes et des Technologies, Laboratoire National de Metrologie et d'Essais, University Paris-Saclay
-
Beyond Hungarian: Match-Free Supervision for End-to-End Object Detection
-
Breaking the Bias Barrier in Concave Multi-Objective Reinforcement LearningPurdue University
-
OccTrack360: 4D Panoptic Occupancy Tracking from Surround-View Fisheye CamerasHunan University, Zhejiang University
-
BuildMamba: A Visual State-Space Based Model for Multi-Task Building Segmentation and Height Estimation from Satellite Images
-
Towards Effective and Efficient Graph Alignment without SupervisionBeijing Jiaotong University, Peking University
-
SecAgent: Efficient Mobile GUI Agent with Semantic Context
-
SWIFT: Sliding Window Reconstruction for Few-Shot Training-Free Generated Video AttributionHefei University of Technology, University of Science and Technology of China
-
PCFEx: Point Cloud Feature Extraction for Graph Neural NetworksKeio University
-
The Neural Compass: Probabilistic Relative Feature Fields for Robotic SearchUniversity of Freiburg
-
Interactive World Simulator for Robot Policy Training and Evaluation
-
mmGAT: Pose Estimation by Graph Attention with Mutual Features from mmWave Radar Point CloudKeio University
-
Generative Adversarial Regression (GAR): Learning Conditional Risk ScenariosTelfer School of Management, University of Ottawa
-
Impact of Connectivity on Laplacian Representations in Reinforcement LearningPolitecnico di Milano, Universita degli Studi di Milano, University College London
-
BioGait-VLM: A Tri-Modal Vision-Language-Biomechanics Framework for Interpretable Clinical Gait AssessmentDrexel University, University of California, Berkeley, Washington University
-
OSS-CRS: Liberating AIxCC Cyber Reasoning Systems for Real-World Open-Source Security
-
MetaWorld-X: Hierarchical World Modeling via VLM-Orchestrated Experts for Humanoid Loco-ManipulationBeijing University of Technology, Fudan University, University of Alberta, University of Hamburg
-
Trust via Reputation of Conviction
-
Drift-to-Action Controllers: Budgeted Interventions with Online Risk CertificatesMohammed First University, Sultan Moulay Slimane University
-
Online Sparse Synthetic Aperture Radar ImagingRensselaer Polytechnic Institute
-
DualFlexKAN: Dual-stage Kolmogorov-Arnold Networks with Independent Function ControlUniversity of Granada, University of Malaga
-
Towards Batch-to-Streaming Deep Reinforcement Learning for Continuous ControlUniversity of Padova
-
CARE-Edit: Condition-Aware Routing of Experts for Contextual Image EditingHong Kong University of Science and Technology
-
PRISM: Streaming Human Motion Generation with Per-Joint Latent DecompositionZhejiang Lab, Zhejiang University
-
Boosting MLLM Spatial Reasoning with Geometrically Referenced 3D Scene RepresentationsZillow Group
-
Don't Look Back in Anger: MAGIC Net for Streaming Continual Learning with Temporal DependencePolitecnico di Milano
-
Weakly Supervised Teacher-Student Framework with Progressive Pseudo-mask Refinement for Gland SegmentationThe Ohio State University
-
FOMO-3D: Using Vision Foundation Models for Long-Tailed 3D Object Detection
-
Micro-Diffusion Compression - Binary Tree Tweedie Denoising for Online Probability Estimation
-
StreamReady: Learning What to Answer and When in Long Streaming VideosMicrosoft Research, University of Central Florida
-
Integral Formulas for Vector Spherical Tensor Products
-
UNBOX: Unveiling Black-box visual models with Natural-languageHelmholtz Munich, Technical University of Munich, University of Catania
-
OmniGuide: Universal Guidance Fields for Enhancing Generalist Robot PoliciesUniversity of Pennsylvania
-
Retrieval-Augmented Gaussian Avatars: Improving Expression GeneralizationNVIDIA / Bar-Ilan University, OriginAI, Technion – Israel Institute of Technology, The Hebrew University of Jerusalem
-
Grow, Don't Overwrite: Fine-tuning Without Forgetting
-
CAST: Modeling Visual State Transitions for Consistent Video Retrieval
-
Divide and Predict: An Architecture for Input Space Partitioning and Enhanced AccuracyUniversity of Virginia
-
Group Entropies and Mirror Duality: A Class of Flexible Mirror Descent Updates for Machine LearningUniversidad Complutense de Madrid, Warsaw University of Technology
-
CoCo: Code as CoT for Text-to-Image Preview and Rare Concept GenerationStepFun / Chinese Academy of Sciences, Nanyang Technological University, South China University of Technology, The Chinese University of Hong Kong
-
Cluster-Aware Attention-Based Deep Reinforcement Learning for Pickup and Delivery ProblemsDalian University of Technology
-
OfficeQA Pro: An Enterprise Benchmark for End-to-End Grounded Reasoning
-
Context-free Self-Conditioned GAN for Trajectory ForecastingOrebro University
-
CODA: Difficulty-Aware Compute Allocation for Adaptive ReasoningFudan University
-
ImprovedGS+: A High-Performance C++/CUDA Re-Implementation Strategy for 3D Gaussian SplattingUniversidad de Murcia
-
Characterization and upgrade of a quantum graph neural network for charged particle trackingIstituto Nazionale di Fisica Nucleare, University of Ferrara
-
Quantization of Ricci Curvature in Information GeometryUniversity at Albany
