Papers
- CRPS-Optimal Binning for Conformal Regression
- SegMaFormer: A Hybrid State-Space and Transformer Model for Efficient Segmentation
- A plug-and-play approach with fast uncertainty quantification for weak lensing mass mapping
- On the Challenges and Opportunities of Learned Sparse Retrieval for Code
- 6D Robotic OCT Scanning of Curved Tissue Surfaces
- Retrieving Climate Change Disinformation by Narrative
- ROM: Real-time Overthinking Mitigation via Streaming Detection and Intervention
- AdditiveLLM2: A Multi-modal Large Language Model for Additive Manufacturing
- Do Papers Match Code? A Benchmark and Framework for Paper-Code Consistency Detection in Bioinformatics Software
- Tuning Real-World Image Restoration at Inference: A Test-Time Scaling Paradigm for Flow Matching Models
- On the Interplay of Priors and Overparametrization in Bayesian Neural Network Posteriors
- Future-Interactions-Aware Trajectory Prediction via Braid Theory
- GTSR: Subsurface Scattering Awared 3D Gaussians for Translucent Surface Reconstruction
- RAFL: Generalizable Sim-to-Real of Soft Robots with Residual Acceleration Field Learning
- DTVI: Dual-Stage Textual and Visual Intervention for Safe Text-to-Image Generation
- Uncertainty-guided Compositional Alignment with Part-to-Whole Semantic Representativeness in Hyperbolic Vision-Language Models
- MAGPI: Multifidelity-Augmented Gaussian Process Inputs for Surrogate Modeling from Scarce Data
- AnimalCLAP: Taxonomy-Aware Language-Audio Pretraining for Species Recognition and Trait Inference
- FontCrafter: High-Fidelity Element-Driven Artistic Font Creation with Visual In-Context Generation
- Dual-Space Knowledge Distillation with Key-Query Matching for Large Language Models with Vocabulary Mismatch
- SpatialBoost: Enhancing Visual Representation through Language-Guided Reasoning
- On the Failure of Topic-Matched Contrast Baselines in Multi-Directional Refusal Abliteration
- Adapting Point Cloud Analysis via Multimodal Bayesian Distribution Learning
- PreferRec: Learning and Transferring Pareto Preferences for Multi-objective Re-ranking
- MIHT: A Hoeffding Tree for Time Series Classification using Multiple Instance Learning
- Autoregressive vs. Masked Diffusion Language Models: A Controlled Comparison
- A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP
- P-Flow: Prompting Visual Effects Generation
- Principled Steering via Null-space Projection for Jailbreak Defense in Vision-Language Models
- GSEM: Graph-based Self-Evolving Memory for Experience Augmented Clinical Reasoning
- SpecTM: Spectral Targeted Masking for Trustworthy Foundation Models
- FreeArtGS: Articulated Gaussian Splatting Under Free-moving Scenario
- Multiperspectivity as a Resource for Narrative Similarity Prediction
- On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation
- StreamingClaw Technical Report
- Mamba-VMR: Multimodal Query Augmentation via Generated Videos for Precise Temporal Grounding
- Biophysics-Enhanced Neural Representations for Patient-Specific Respiratory Motion Modeling
- DA-VAE: Plug-in Latent Compression for Diffusion via Detail Alignment
- Computationally lightweight classifiers with frequentist bounds on predictions
- The Semantic Ladder: A Framework for Progressive Formalization of Natural Language Content for Knowledge Graphs and AI Systems
- OpenEarth-Agent: From Tool Calling to Tool Creation for Open-Environment Earth Observation
- More Isn't Always Better: Balancing Decision Accuracy and Conformity Pressures in Multi-AI Advice
- Beyond Matching to Tiles: Bridging Unaligned Aerial and Satellite Views for Vision-Only UAV Navigation
- dynActivation: A Trainable Activation Family for Adaptive Nonlinearity
- RAMPAGE: RAndomized Mid-Point for debiAsed Gradient Extrapolation
- Multimodal Survival Analysis with Locally Deployable Large Language Models
- Data Curation for Machine Learning Interatomic Potentials by Determinantal Point Processes
- Mixture of Demonstrations for Textual Graph Understanding and Question Answering
- Causal Evidence that Language Models use Confidence to Drive Behavior
- ACPO: Counteracting Likelihood Displacement in Vision-Language Alignment with Asymmetric Constraints
