Papers
-
JOPP-3D: Joint Open Vocabulary Semantic Segmentation on Point Clouds and Panoramas
-
Robotic Foundation Models for Industrial Control: A Comprehensive Survey and Readiness Assessment Framework
-
XMACNet: An Explainable Lightweight Attention based CNN with Multi Modal Fusion for Chili Disease Classification
-
Optimizing 3D Diffusion Models for Medical Imaging via Multi-Scale Reward Learning
-
Making Training-Free Diffusion Segmentors Scale with the Generative Power
-
Contrastive-to-Self-Supervised: A Two-Stage Framework for Script Similarity Learning
-
Towards Motion Turing Test: Evaluating Human-Likeness in Humanoid Robots
-
CRIMSON: A Clinically-Grounded LLM-Based Metric for Generative Radiology Report Evaluation
-
SpaCRD: Multimodal Deep Fusion of Histology and Spatial Transcriptomics for Cancer Region Detection
-
Random Quadratic Form on a Sphere: Synchronization by Common Noise
-
Whisper-CD: Accurate Long-Form Speech Recognition using Multi-Negative Contrastive Decoding
-
MAPO: Mixed Advantage Policy Optimization for Long-Horizon Multi-Turn Dialogue
-
Wisdom of the AI Crowd (AI-CROWD) for Ground Truth Approximation in Content Analysis: A Research Protocol & Validation Using Eleven Large Language Models
-
LIT-RAGBench: Benchmarking Generator Capabilities of Large Language Models in Retrieval-Augmented Generation
-
Latent Autoencoder Ensemble Kalman Filter for Data assimilation
-
FlashPrefill: Instantaneous Pattern Discovery and Thresholding for Ultra-Fast Long-Context Prefilling
-
Adaptive Language-Aware Image Reflection Removal Network
-
Point-Supervised Skeleton-Based Human Action Segmentation
-
VG3S: Visual Geometry Grounded Gaussian Splatting for Semantic Occupancy Prediction
-
EarthBridge: A Solution for 4th Multi-modal Aerial View Image Challenge Translation Track
-
Topological descriptors of foot clearance gait dynamics improve differential diagnosis of Parkinsonism
-
Cut to the Chase: Training-free Multimodal Summarization via Chain-of-Events
-
EntON: Eigenentropy-Optimized Neighborhood Densification in 3D Gaussian Splatting
-
Conversational Demand Response: Bidirectional Aggregator-Prosumer Coordination through Agentic AI
-
Word-Anchored Temporal Forgery Localization
-
SPOT: Span-level Pause-of-Thought for Efficient and Interpretable Latent Reasoning in Large Language Models
-
FedSCS-XGB -- Federated Server-centric surrogate XGBoost for continual health monitoring
-
Low-latency Event-based Object Detection with Spatially-Sparse Linear Attention
-
TaPD: Temporal-adaptive Progressive Distillation for Observation-Adaptive Trajectory Forecasting in Autonomous Driving
-
DC-Merge: Improving Model Merging with Directional Consistency
-
Gradient Flow Polarizes Softmax Outputs towards Low-Entropy Solutions
-
Hierarchical Collaborative Fusion for 3D Instance-aware Referring Expression Segmentation
-
SPPCSO: Adaptive Penalized Estimation Method for High-Dimensional Correlated Data
-
Synthetic Monitoring Environments for Reinforcement Learning
-
Implementation of Quantum Implicit Neural Representation in Deterministic and Probabilistic Autoencoders for Image Reconstruction/Generation Tasks
-
NOVA: Next-step Open-Vocabulary Autoregression for 3D Multi-Object Tracking in Autonomous Driving
-
GazeMoE: Perception of Gaze Target with Mixture-of-Experts
-
Robust support vector model based on bounded asymmetric elastic net loss for binary classification
-
Learning to Solve Orienteering Problem with Time Windows and Variable Profits
-
Mind the Gap: Pitfalls of LLM Alignment with Asian Public Opinion
-
ODD-SEC: Onboard Drone Detection with a Spinning Event Camera
-
HiPP-Prune: Hierarchical Preference-Conditioned Structured Pruning for Vision-Language Models
-
Agentic retrieval-augmented reasoning reshapes collective reliability under model variability in radiology question answering
-
Looking Through Glass Box
-
Stem: Rethinking Causal Information Flow in Sparse Attention
-
Spectral and Trajectory Regularization for Diffusion Transformer Super-Resolution
-
Artificial Intelligence for Climate Adaptation: Reinforcement Learning for Climate Change-Resilient Transport
-
Can we Trust Unreliable Voxels? Exploring 3D Semantic Occupancy Prediction under Label Noise
-
Attribute Distribution Modeling and Semantic-Visual Alignment for Generative Zero-shot Learning
-
Learning Unbiased Cluster Descriptors for Interpretable Imbalanced Concept Drift Detection
