Papers
-
Universe Routing: Why Self-Evolving Agents Need Epistemic Control
-
OpenReservoirComputing: GPU-Accelerated Reservoir Computing in JAX
-
VorTEX: Various overlap ratio for Target speech EXtraction
-
Knowledge Activation: AI Skills as the Institutional Knowledge Primitive for Agentic Software Development
-
Fold-CP: A Context Parallelism Framework for Biomolecular Modeling
-
HiMemVLN: Enhancing Reliability of Open-Source Zero-Shot Vision-and-Language Navigation with Hierarchical Memory System
-
Ego to World: Collaborative Spatial Reasoning in Embodied Systems via Reinforcement Learning
-
M2IR: Proactive All-in-One Image Restoration via Mamba-style Modulation and Mixture-of-Experts
-
SimCert: Probabilistic Certification for Behavioral Similarity in Deep Neural Network Compression
-
RAZOR: Ratio-Aware Layer Editing for Targeted Unlearning in Vision Transformers and Diffusion Models
-
RadarXFormer: Robust Object Detection via Cross-Dimension Fusion of 4D Radar Spectra and Images for Autonomous Driving
-
Planning as Goal Recognition: Deriving Heuristics from Intention Models - Extended Version
-
Two Birds, One Projection: Harmonizing Safety and Utility in LVLMs via Inference-time Feature Projection
-
SemanticFace: Semantic Facial Action Estimation via Semantic Distillation in Interpretable Space
-
Dataset Distillation Efficiently Encodes Low-Dimensional Representations from Gradient-Based Learning of Non-Linear Tasks
-
Neural Networks as Local-to-Global Computations
-
Halfway to 3D: Ensembling 2.5D and 3D Models for Robust COVID-19 CT Diagnosis
-
Ablate and Rescue: A Causal Analysis of Residual Stream Hyper-Connections
-
DamageArbiter: A CLIP-Enhanced Multimodal Arbitration Framework for Hurricane Damage Assessment from Street-View Imagery
-
The Impact of Ideological Discourses in RAG: A Case Study with COVID-19 Treatments
-
Real-Time Driver Safety Scoring Through Inverse Crash Probability Modeling
-
ContiGuard: A Framework for Continual Toxicity Detection Against Evolving Evasive Perturbations
-
Integrating Weather Foundation Model and Satellite to Enable Fine-Grained Solar Irradiance Forecasting
-
Lost in Aggregation: On a Fundamental Expressivity Limit of Message-Passing Graph Neural Networks
-
Personalized Federated Learning with Residual Fisher Information for Medical Image Segmentation
-
From Artefact to Insight: Efficient Low-Rank Adaptation of BrushNet for Scanning Probe Microscopy Image Restoration
-
AutoMoT: A Unified Vision-Language-Action Model with Asynchronous Mixture-of-Transformers for End-to-End Autonomous Driving
-
PCodeTrans: Translate Decompiled Pseudocode to Compilable and Executable Equivalent
-
From Horizontal to Rotated: Cross-View Object Geo-Localization with Orientation Awareness
-
Architecture-Agnostic Feature Synergy for Universal Defense Against Heterogeneous Generative Threats
-
Video Detector: A Dual-Phase Vision-Based System for Real-Time Traffic Intersection Control and Intelligent Transportation Analysis
-
A Score Filter Enhanced Data Assimilation Framework for Data-Driven Dynamical Systems
-
Shopping Companion: A Memory-Augmented LLM Agent for Real-World E-Commerce Tasks
-
Sample-Efficient Hypergradient Estimation for Decentralized Bi-Level Reinforcement Learning
-
A Self-Evolving Defect Detection Framework for Industrial Photovoltaic Systems
-
IgPose: A Generative Data-Augmented Pipeline for Robust Immunoglobulin-Antigen Binding Prediction
-
Developing an English-Efik Corpus and Machine Translation System for Digitization Inclusion
-
Tackling Over-smoothing on Hypergraphs: A Ricci Flow-guided Neural Diffusion Approach
-
A Hybrid AI and Rule-Based Decision Support System for Disease Diagnosis and Management Using Labs
-
Seismic full-waveform inversion based on a physics-driven generative adversarial network
-
RealVLG-R1: A Large-Scale Real-World Visual-Language Grounding Benchmark for Robotic Perception and Manipulation
-
LLMind: Bio-inspired Training-free Adaptive Visual Representations for Vision-Language Models
-
Customizing ChatGPT for Second Language Speaking Practice: Genuine Support or Just a Marketing Gimmick?
-
SpiralDiff: Spiral Diffusion with LoRA for RGB-to-RAW Conversion Across Cameras
-
PASTE: Physics-Aware Scattering Topology Embedding Framework for SAR Object Detection
-
Modeling and Benchmarking Spoken Dialogue Rewards with Modality and Colloquialness
-
Decision-Level Ordinal Modeling for Multimodal Essay Scoring with Large Language Models
-
Balancing Saliency and Coverage: Semantic Prominence-Aware Budgeting for Visual Token Compression in VLMs
-
LLMs as Signal Detectors: Sensitivity, Bias, and the Temperature-Criterion Analogy
-
Informative Perturbation Selection for Uncertainty-Aware Post-hoc Explanations
