Papers
-
Calibeating Made Simple
-
MARCUS: An agentic, multimodal vision-language model for cardiac diagnosis and management
-
Learning When to Act: Interval-Aware Reinforcement Learning with Predictive Temporal Structure
-
Revisiting Quantum Code Generation: Where Should Domain Knowledge Live?
-
Enhancing Document-Level Machine Translation via Filtered Synthetic Corpora and Two-Stage LLM Adaptation
-
Seeing is Improving: Visual Feedback for Iterative Text Layout Refinement
-
A Backbone Benchmarking Study on Self-supervised Learning as an Auxiliary Task with Texture-based Local Descriptors for Face Analysis
-
PAM: A Pose-Appearance-Motion Engine for Sim-to-Real HOI Video Generation
-
CayleyPy-4: AI-Holography. Towards analogs of holographic string dualities for AI tasks
-
Mixture of Mini Experts: Overcoming the Linear Layer Bottleneck in Multiple Instance Learning
-
Chimera: Latency- and Performance-Aware Multi-agent Serving for Heterogeneous LLMs
-
Omni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models
-
SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection
-
Evaluating the Reliability and Fidelity of Automated Judgment Systems of Large Language Models
-
Gumbel Distillation for Parallel Text Generation
-
Noise Titration: Exact Distributional Benchmarking for Probabilistic Time Series Forecasting
-
Adapting Self-Supervised Speech Representations for Cross-lingual Dysarthria Detection in Parkinson's Disease
-
Dyadic: A Scalable Platform for Human-Human and Human-AI Conversation Research
-
SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation
-
Benchmarking Deep Learning Models for Aerial LiDAR Point Cloud Semantic Segmentation under Real Acquisition Conditions: A Case Study in Navarre
-
Riverine Land Cover Mapping through Semantic Segmentation of Multispectral Point Clouds
-
One Model, Two Markets: Bid-Aware Generative Recommendation
-
ShapDBM: Exploring Decision Boundary Maps in Shapley Space
-
MemDLM: Memory-Enhanced DLM Training
-
Drop-In Perceptual Optimization for 3D Gaussian Splatting
-
From Static Templates to Dynamic Runtime Graphs: A Survey of Workflow Optimization for LLM Agents
-
Confidence-Based Decoding is Provably Efficient for Diffusion Language Models
-
EgoGroups: A Benchmark For Detecting Social Groups of People in the Wild
-
Characterizing High-Capacity Janus Aminobenzene-Graphene Anode for Sodium-Ion Batteries with Machine Learning
-
Greater accessibility can amplify discrimination in generative AI
-
Efficient Universal Perception Encoder
-
TiCo: Time-Controllable Training for Spoken Dialogue Models
-
GenOpticalFlow: A Generative Approach to Unsupervised Optical Flow Learning
-
DUO-VSR: Dual-Stream Distillation for One-Step Video Super-Resolution
-
Decoupling Exploration and Policy Optimization: Uncertainty Guided Tree Search for Hard Exploration
-
Repurposing Geometric Foundation Models for Multi-view Diffusion
-
Scaling DoRA: High-Rank Adaptation via Factored Norms and Fused Kernels
-
The Dual Mechanisms of Spatial Reasoning in Vision-Language Models
-
3D-Layout-R1: Structured Reasoning for Language-Instructed Spatial Editing
-
DualCoT-VLA: Visual-Linguistic Chain of Thought via Parallel Reasoning for Vision-Language-Action Models
-
ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model
-
UniMotion: A Unified Framework for Motion-Text-Vision Understanding and Generation
-
End-to-End Training for Unified Tokenization and Latent Denoising
-
VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding
-
WorldCache: Content-Aware Caching for Accelerated Video World Models
-
Latent Style-based Quantum Wasserstein GAN for Drug Design
-
Probabilistic modeling over permutations using quantum computers
-
Computational Arbitrage in AI Model Markets
-
Spatially-Aware Evaluation Framework for Aerial LiDAR Point Cloud Semantic Segmentation: Distance-Based Metrics on Challenging Regions
-
OsteoFlow: Lyapunov-Guided Flow Distillation for Predicting Bone Remodeling after Mandibular Reconstruction
