Papers
-
Dress-ED: Instruction-Guided Editing for Virtual Try-On and Try-Off
-
Understanding LLM Performance Degradation in Multi-Instance Processing: The Roles of Instance Count and Context Length
-
Do Consumers Accept AIs as Moral Compliance Agents?
-
Bridging the Know-Act Gap via Task-Level Autoregressive Reasoning
-
Causal Discovery in Action: Learning Chain-Reaction Mechanisms from Interventions
-
Transfer learning via interpolating structures
-
A Vision Language Model for Generating Procedural Plant Architecture Representations from Simulated Images
-
To Agree or To Be Right? The Grounding-Sycophancy Tradeoff in Medical Vision-Language Models
-
Toward Faithful Segmentation Attribution via Benchmarking and Dual-Evidence Fusion
-
Upper Entropy for 2-Monotone Lower Probabilities
-
PIVM: Diffusion-Based Prior-Integrated Variation Modeling for Anatomically Precise Abdominal CT Synthesis
-
Single-Subject Multi-View MRI Super-Resolution via Implicit Neural Representations
-
LGSE: Lexically Grounded Subword Embedding Initialization for Low-Resource Language Adaptation
-
CAM3R: Camera-Agnostic Model for 3D Reconstruction
-
Graph-Aware Late Chunking for Retrieval-Augmented Generation in Biomedical Literature
-
Learning to Trust: How Humans Mentally Recalibrate AI Confidence Signals
-
Q-Tacit: Image Quality Assessment via Latent Visual Reasoning
-
Multi-Method Validation of Large Language Model Medical Translation Across High- and Low-Resource Languages
-
CAPTCHA Solving for Native GUI Agents: Automated Reasoning-Action Data Generation and Self-Corrective Training
-
Overfitting and Generalizing with (PAC) Bayesian Prediction in Noisy Binary Classification
-
AwesomeLit: Towards Hypothesis Generation with Agent-Supported Literature Research
-
Pretext Matters: An Empirical Study of SSL Methods in Medical Imaging
-
MAGICIAN: Efficient Long-Term Planning with Imagined Gaussians for Active Mapping
-
Cross-Context Verification: Hierarchical Detection of Benchmark Contamination through Session-Isolated Analysis
-
Compressive single-pixel imaging via a wavelength-multiplexed spatially incoherent diffractive optical processor
-
When Documents Disagree: Measuring Institutional Variation in Transplant Guidance with Retrieval-Augmented Language Models
-
DSPA: Dynamic SAE Steering for Data-Efficient Preference Alignment
-
EpiMask: Leveraging Epipolar Distance Based Masks in Cross-Attention for Satellite Image Matching
-
DRTriton: Large-Scale Synthetic Data Reinforcement Learning for Triton Kernel Generation
-
Beyond Correlation: Refutation-Validated Aspect-Based Sentiment Analysis for Explainable Energy Market Returns
-
Unified-MAS: Universally Generating Domain-Specific Nodes for Empowering Automatic Multi-Agent Systems
-
TaigiSpeech: A Low-Resource Real-World Speech Intent Dataset and Preliminary Results with Scalable Data Mining In-the-Wild
-
ALADIN: Attribute-Language Distillation Network for Person Re-Identification
-
Which Concepts to Forget and How to Refuse? Decomposing Concepts for Continual Unlearning in Large Vision-Language Models
-
Off-Policy Evaluation for Ranking Policies under Deterministic Logging Policies
-
GaussianSSC: Triplane-Guided Directional Gaussian Fields for 3D Semantic Completion
-
Learning Trajectory-Aware Multimodal Large Language Models for Video Reasoning Segmentation
-
Effective Strategies for Asynchronous Software Engineering Agents
-
Learning Can Converge Stably to the Wrong Belief under Latent Reliability
-
Multinoulli Extension: A Lossless Continuous Relaxation for Partition-Constrained Subset Selection
-
StreamingEval: A Unified Evaluation Protocol towards Realistic Streaming Video Understanding
-
Agentic Automation of BT-RADS Scoring: End-to-End Multi-Agent System for Standardized Brain Tumor Follow-up Assessment
-
RuntimeSlicer: Towards Generalizable Unified Runtime State Representation for Failure Management
-
A Framework for Closed-Loop Robotic Assembly, Alignment and Self-Recovery of Precision Optical Systems
-
Quotient Geometry, Effective Curvature, and Implicit Bias in Simple Shallow Neural Networks
-
Parameter-efficient Prompt Tuning and Hierarchical Textual Guidance for Few-shot Whole Slide Image Classification
-
Optimizing Feature Extraction for On-device Model Inference with User Behavior Sequences
-
Unregistered Spectral Image Fusion: Unmixing, Adversarial Learning, and Recoverability
-
Back to Point: Exploring Point-Language Models for Zero-Shot 3D Anomaly Detection
-
Unveiling the Mechanism of Continuous Representation Full-Waveform Inversion: A Wave Based Neural Tangent Kernel Framework
