Papers
-
Designing Fatigue-Aware VR Interfaces via Biomechanical Models
-
Knowledge is Power: Advancing Few-shot Action Recognition with Multimodal Semantics from MLLMs
-
AgentCollab: A Self-Evaluation-Driven Collaboration Paradigm for Efficient LLM Agents
-
Face2Parts: Exploring Coarse-to-Fine Inter-Regional Facial Dependencies for Generalized Deepfake Detection
-
Rethinking Token Pruning for Historical Screenshots in GUI Visual Agents: Semantic, Spatial, and Temporal Perspectives
-
H-Node Attack and Defense in Large Language Models
-
Retrieval-Augmented Generation Based Nurse Observation Extraction
-
Asymptotic Optimism for Tensor Regression Models with Applications to Neural Network Compression
-
Seeing Like Radiologists: Context- and Gaze-Guided Vision-Language Pretraining for Chest X-rays
-
Bridging Pixels and Words: Mask-Aware Local Semantic Fusion for Multimodal Media Verification
-
Arithmetic OOD Failure Unfolds in Stages in Minimal GPTs
-
Pioneering Perceptual Video Fluency Assessment: A Novel Task with Benchmark Dataset and Baseline
-
Squish and Release: Exposing Hidden Hallucinations by Making Them Surface as Safety Signals
-
I Want to Believe (but the Vocabulary Changed): Measuring the Semantic Structure and Evolution of Conspiracy Theories
-
MuDD: A Multimodal Deception Detection Dataset and GSR-Guided Progressive Distillation for Non-Contact Deception Detection
-
A Regression Framework for Understanding Prompt Component Impact on LLM Performance
-
Adversarial Bandit Optimization with Globally Bounded Perturbations to Linear Losses
-
R-PGA: Robust Physical Adversarial Camouflage Generation via Relightable 3D Gaussian Splatting
-
PAD-Hand: Physics-Aware Diffusion for Hand Motion Recovery
-
MUST: Modality-Specific Representation-Aware Transformer for Diffusion-Enhanced Survival Prediction with Missing Modality
-
Envisioning global urban development with satellite imagery and generative AI
-
Semi-Automated Knowledge Engineering and Process Mapping for Total Airport Management
-
External Benchmarking of Lung Ultrasound Models for Pneumothorax-Related Signs: A Manifest-Based Multi-Source Study
-
When Identities Collapse: A Stress-Test Benchmark for Multi-Subject Personalization
-
Experimental study on surveillance video-based indoor occupancy measurement with occupant-centric control
-
Hybrid Diffusion Model for Breast Ultrasound Image Augmentation
-
ANVIL: Accelerator-Native Video Interpolation via Codec Motion Vector Priors
-
Learnable Instance Attention Filtering for Adaptive Detector Distillation
-
Selective Deficits in LLM Mental Self-Modeling in a Behavior-Based Test of Theory of Mind
-
CD-Buffer: Complementary Dual-Buffer Framework for Test-Time Adaptation in Adverse Weather Object Detection
-
IndoBERT-Relevancy: A Context-Conditioned Relevancy Classifier for Indonesian Text
-
AcTTA: Rethinking Test-Time Adaptation via Dynamic Activation
-
Dynamic Tokenization via Reinforcement Patching: End-to-end Training and Zero-shot Transfer
-
A Human-Inspired Decoupled Architecture for Efficient Audio Representation Learning
-
"Oops! ChatGPT is Temporarily Unavailable!": A Diary Study on Knowledge Workers' Experiences of LLM Withdrawal
-
Are LLM-Enhanced Graph Neural Networks Robust against Poisoning Attacks?
-
LLM Benchmark-User Need Misalignment for Climate Change
-
Accurate Precipitation Forecast by Efficiently Learning from Massive Atmospheric Variables and Unbalanced Distribution
-
SDDF: Specificity-Driven Dynamic Focusing for Open-Vocabulary Camouflaged Object Detection
-
DPD-Cancer: Explainable Graph-based Deep Learning for Small Molecule Anti-Cancer Activity Prediction
-
FINDER: Zero-Shot Field-Integrated Network for Distortion-free EPI Reconstruction in Diffusion MRI
-
SkinGPT-X: A Self-Evolving Collaborative Multi-Agent System for Transparent and Trustworthy Dermatological Diagnosis
-
Beyond Where to Look: Trajectory-Guided Reinforcement Learning for Multimodal RLVR
-
Finding Distributed Object-Centric Properties in Self-Supervised Transformers
-
TaxaAdapter: Vision Taxonomy Models are Key to Fine-grained Image Generation over the Tree of Life
-
SWE-PRBench: Benchmarking AI Code Review Quality Against Pull Request Feedback
-
InstaVSR: Taming Diffusion for Efficient and Temporally Consistent Video Super-Resolution
-
TinyML for Acoustic Anomaly Detection in IoT Sensor Networks
-
PEANUT: Perturbations by Eigenvector Alignment for Attacking Graph Neural Networks Under Topology-Driven Message Passing
-
ATime-Consistent Benchmark for Repository-Level Software Engineering Evaluation
MongoDB - Build AI That Scales
