Papers
-
When Drafts Evolve: Speculative Decoding Meets Online Learning
-
Human-AI Collaborative Autonomous Experimentation With Proxy Modeling for Comparative Observation
-
Prompt-Driven Lightweight Foundation Model for Instance Segmentation-Based Fault Detection in Freight Trains
-
VLM4Rec: Multimodal Semantic Representation for Recommendation with Large Vision-Language Models
-
Batched Kernelized Bandits: Refinements and Extensions
-
Towards unified brain-to-text decoding across speech production and perception
-
The Economics of AI Supply Chain Regulation
-
Spend Less, Reason Better: Budget-Aware Value Tree Search for LLM Agents
-
Adaptive Diffusion Posterior Sampling for Data and Model Fusion of Complex Nonlinear Dynamical Systems
-
Weakly Time-Coupled Approximation of Markov Decision Processes
-
Using a Human-AI Teaming Approach to Create and Curate Scientific Datasets with the SCILIRE System
-
RoboStereo: Dual-Tower 4D Embodied World Models for Unified Policy Optimization
-
Self-Supervised Speech Models Encode Phonetic Context via Position-dependent Orthogonal Subspaces
-
UE5-Forest: A Photorealistic Synthetic Stereo Dataset for UAV Forestry Depth Estimation
-
LightMoE: Reducing Mixture-of-Experts Redundancy through Expert Replacing
-
98$\times$ Faster LLM Routing Without a Dedicated GPU: Flash Attention, Prompt Compression, and Near-Streaming for the vLLM Semantic Router
-
LR-SGS: Robust LiDAR-Reflectance-Guided Salient Gaussian Splatting for Self-Driving Scene Reconstruction
-
From Sparse to Dense: Multi-View GRPO for Flow Models via Augmented Condition Space
-
Sobolev--Ricci Curvature
-
VGGT-World: Transforming VGGT into an Autoregressive Geometry World Model
-
VFM-Recon: Unlocking Cross-Domain Scene-Level Neural Reconstruction with Scale-Aligned Foundation Priors
-
Continual Learning in Large Language Models: Methods, Challenges, and Opportunities
-
AVION: Aerial Vision-Language Instruction from Offline Teacher to Prompt-Tuned Network
-
CHIMERA-Bench: A Benchmark Dataset for Epitope-Specific Antibody Design
-
Learning Geometric and Photometric Features from Panoramic LiDAR Scans for Outdoor Place Categorization
-
From Text to Forecasts: Bridging Modality Gap with Temporal Evolution Semantic Space
-
Spatial Transcriptomics as Images for Large-Scale Pretraining
-
RetroReasoner: A Reasoning LLM for Strategic Retrosynthesis Prediction
-
Marker-Based 3D Reconstruction of Aggregates with a Comparative Analysis of 2D and 3D Morphologies
-
Vision Verification Enhanced Fusion of VLMs for Efficient Visual Reasoning
-
Spatially Grounded Long-Horizon Task Planning in the Wild
-
Disentangled Latent Dynamics Manifold Fusion for Solving Parameterized PDEs
-
MetaKE: Meta-learning Aligned Knowledge Editing via Bi-level Optimization
-
Bin~Wan,G2HFNet: GeoGran-Aware Hierarchical Feature Fusion Network for Salient Object Detection in Optical Remote Sensing Images
-
Colluding LoRA: A Composite Attack on LLM Safety Alignment
-
Experimental evidence of progressive ChatGPT models self-convergence
-
Federated Hierarchical Clustering with Automatic Selection of Optimal Cluster Numbers
-
RSONet: Region-guided Selective Optimization Network for RGB-T Salient Object Detection
-
STRAP-ViT: Segregated Tokens with Randomized -- Transformations for Defense against Adversarial Patches in ViTs
-
CM-Bench: A Comprehensive Cross-Modal Feature Matching Benchmark Bridging Visible and Infrared Images
-
HSEmotion Team at ABAW-10 Competition: Facial Expression Recognition, Valence-Arousal Estimation, Action Unit Detection and Fine-Grained Violence Classification
-
RXNRECer Enables Fine-grained Enzymatic Function Annotation through Active Learning and Protein Language Models
-
HaltNav: Reactive Visual Halting over Lightweight Topological Priors for Robust Vision-Language Navigation
-
EvolveCoder: Evolving Test Cases via Adversarial Verification for Code Reinforcement Learning
-
Seeing Eye to Eye: Enabling Cognitive Alignment Through Shared First-Person Perspective in Human-AI Collaboration
-
FGTR: Fine-Grained Multi-Table Retrieval via Hierarchical LLM Reasoning
-
VCBench: A Streaming Counting Benchmark for Spatial-Temporal State Maintenance in Long Videos
-
Cost-Efficient Multimodal LLM Inference via Cross-Tier GPU Heterogeneity
-
HFP-SAM: Hierarchical Frequency Prompted SAM for Efficient Marine Animal Segmentation
-
AI Planning Framework for LLM-Based Web Agents
