Papers
- No Single Metric Tells the Whole Story: A Multi-Dimensional Evaluation Framework for Uncertainty Attributions
- From Liar Paradox to Incongruent Sets: A Normal Form for Self-Reference
- Cross-Modal Prototype Alignment and Mixing for Training-Free Few-Shot Classification
- UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience
- Representation Learning to Study Temporal Dynamics in Tutorial Scaffolding
- Robust Multilingual Text-to-Pictogram Mapping for Scalable Reading Rehabilitation
- CliPPER: Contextual Video-Language Pretraining on Long-form Intraoperative Surgical Procedures for Event Recognition
- SEGAR: Selective Enhancement for Generative Augmented Reality
- Analysing the Safety Pitfalls of Steering Vectors
- A Sociolinguistic Analysis of Automatic Speech Recognition Bias in Newcastle English
- The role of spatial context and multitask learning in the detection of organic and conventional farming systems based on Sentinel-2 time series
- Can LLMs Beat Classical Hyperparameter Optimization Algorithms? A Study on autoresearch
- Energy-Efficient Hierarchical Federated Anomaly Detection for the Internet of Underwater Things via Selective Cooperative Aggregation
- MedOpenClaw: Auditable Medical Imaging Agents Reasoning over Uncurated Full Studies
- Evaluating Chunking Strategies For Retrieval-Augmented Generation in Oil and Gas Enterprise Documents
- LensWalk: Agentic Video Understanding by Planning How You See in Videos
- The Free-Market Algorithm: Self-Organizing Optimization for Open-Ended Complex Systems
- Scaling Recurrence-aware Foundation Models for Clinical Records via Next-Visit Prediction
- Trust Region Constrained Bayesian Optimization with Penalized Constraint Handling
- POLY-SIM: Polyglot Speaker Identification with Missing Modality Grand Challenge 2026 Evaluation Plan
- Anti-I2V: Safeguarding your photos from malicious image-to-video generation
- Completeness of Unbounded Best-First Minimax and Descent Minimax
- Towards Training-Free Scene Text Editing
- VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models
- Chameleon: Episodic Memory for Long-Horizon Robotic Manipulation
- EndoVGGT: GNN-Enhanced Depth Estimation for Surgical 3D Reconstruction
- Vision-Language Models vs Human: Perceptual Image Quality Assessment
- MARCH: Multi-Agent Reinforced Self-Check for LLM Hallucination
- Retrieval Improvements Do Not Guarantee Better Answers: A Study of RAG for AI Policy QA
- When Consistency Becomes Bias: Interviewer Effects in Semi-Structured Clinical Interviews
- Demystifying When Pruning Works via Representation Hierarchies
- Latent-WAM: Latent World Action Modeling for End-to-End Autonomous Driving
- The Stochastic Gap: A Markovian Framework for Pre-Deployment Reliability and Oversight-Cost Auditing in Agentic Artificial Intelligence
- TAG: Target-Agnostic Guidance for Stable Object-Centric Inference in Vision-Language-Action Models
- Comparing Developer and LLM Biases in Code Evaluation
- DreamerAD: Efficient Reinforcement Learning via Latent World Model for Autonomous Driving
- Polynomial Speedup in Diffusion Models with the Multilevel Euler-Maruyama Method
- From Weights to Concepts: Data-Free Interpretability of CLIP via Singular Vector Decomposition
- Spectral methods: crucial for machine learning, natural for quantum computers?
- When Is Collective Intelligence a Lottery? Multi-Agent Scaling Laws for Memetic Drift in LLMs
- ReDiPrune: Relevance-Diversity Pre-Projection Token Pruning for Efficient Multimodal LLMs
- KitchenTwin: Semantically and Geometrically Grounded 3D Kitchen Digital Twins
- UniICL: Systematizing Unified Multimodal In-context Learning through a Capability-Oriented Taxonomy
- BCMDA: Bidirectional Correlation Maps Domain Adaptation for Mixed Domain Semi-Supervised Medical Image Segmentation
- Reconstructing Spiking Neural Networks Using a Single Neuron with Autapses
- Amplified Patch-Level Differential Privacy for Free via Random Cropping
- LLaVA-LE: Large Language-and-Vision Assistant for Lunar Exploration
- Conformal Selective Prediction with General Risk Control
- Amortized Inference for Correlated Discrete Choice Models via Equivariant Neural Networks
- Training LLMs for Multi-Step Tool Orchestration with Constrained Data Synthesis and Graduated Rewards
