Papers
-
Exposing Long-Tail Safety Failures in Large Language Models through Efficient Diverse Response Sampling
-
M$^2$RNN: Non-Linear RNNs with Matrix-Valued States for Scalable Language Modeling
-
BROTHER: Behavioral Recognition Optimized Through Heterogeneous Ensemble Regularization for Ambivalence and Hesitancy
-
AerialVLA: A Vision-Language-Action Model for UAV Navigation via Minimalist End-to-End Control
-
Representation Alignment for Just Image Transformers is not Easier than You Think
-
HomeGuard: VLM-based Embodied Safeguard for Identifying Contextual Risk in Household Task
-
From Specification to Architecture: A Theory Compiler for Knowledge-Guided Machine Learning
-
OxyGen: Unified KV Cache Management for Vision-Language-Action Models under Multi-Task Parallelism
-
Contests with Spillovers: Incentivizing Content Creation with GenAI
-
The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics
-
LoCAtion: Long-time Collaborative Attention Framework for High Dynamic Range Video Reconstruction
-
SPARQ: Spiking Early-Exit Neural Networks for Energy-Efficient Edge AI
-
StAR: Segment Anything Reasoner
-
Label Noise Cleaning for Supervised Classification via Bernoulli Random Sampling
-
From $\boldsymbol{\logπ}$ to $\boldsymbolπ$: Taming Divergence in Soft Clipping via Bilateral Decoupled Decay of Probability Gradient Weight
-
WestWorld: A Knowledge-Encoded Scalable Trajectory World Model for Diverse Robotic Systems
-
Extending Minimal Pairs with Ordinal Surprisal Curves and Entropy Across Applied Domains
-
OCRA: Object-Centric Learning with 3D and Tactile Priors for Human-to-Robot Action Transfer
-
ES-Merging: Biological MLLM Merging via Embedding Space Signals
-
Graph-Based Deep Learning for Intelligent Detection of Energy Losses, Theft, and Operational Inefficiencies in Oil & Gas Production Networks
-
Towards One-for-All Anomaly Detection for Tabular Data
-
PGcGAN: Pathological Gait-Conditioned GAN for Human Gait Synthesis
-
BiT-MCTS: A Theme-based Bidirectional MCTS Approach to Chinese Fiction Generation
-
G-ZAP: A Generalizable Zero-Shot Framework for Arbitrary-Scale Pansharpening
-
Histo-MExNet: A Unified Framework for Real-World, Cross-Magnification, and Trustworthy Breast Cancer Histopathology
-
Questionnaire Responses Do not Capture the Safety of AI Agents
-
Deep EM with Hierarchical Latent Label Modelling for Multi-Site Prostate Lesion Segmentation
-
Data Darwinism Part II: DataEvolve -- AI can Autonomously Evolve Pretraining Data Curation
-
MBD: A Model-Based Debiasing Framework Across User, Content, and Model Dimensions
-
GenState-AI: State-Aware Dataset for Text-to-Video Retrieval on AI-Generated Videos
-
Creative Convergence or Imitation? Genre-Specific Homogeneity in LLM-Generated Chinese Literature
-
End-to-End Spatial-Temporal Transformer for Real-time 4D HOI Reconstruction
-
DASH: Dynamic Audio-Driven Semantic Chunking for Efficient Omnimodal Token Compression
-
AR-Flow VAE: A Structured Autoregressive Flow Prior Variational Autoencoder for Unsupervised Blind Source Separation
-
Solution for 10th Competition on Ambivalence/Hesitancy (AH) Video Recognition Challenge using Divergence-Based Multimodal Fusion
-
Echoes Across Centuries: Phonetic Signatures of Persian Poets
-
Zoom to Essence: Trainless GUI Grounding by Inferring upon Interface Elements
-
Life cycle assessment for all organic chemicals
-
How to find expressible and trainable parameterized quantum circuits?
-
On the Degrees of Freedom of Gridded Control Points in Learning-Based Medical Image Registration
-
Uni-MDTrack: Learning Decoupled Memory and Dynamic States for Parameter-Efficient Visual Tracking in All Modality
-
PARSA-Bench: A Comprehensive Persian Audio-Language Model Benchmark
-
Distilling Reasoning Without Knowledge: A Framework for Reliable LLMs
-
Inclusive AI for Group Interactions: Predicting Gaze-Direction Behaviors in People with Intellectual and Developmental Disabilities
-
STAG-CN: Spatio-Temporal Apiary Graph Convolutional Network for Disease Onset Prediction in Beehive Sensor Networks
-
An Industrial-Scale Insurance LLM Achieving Verifiable Domain Mastery and Hallucination Control without Competence Trade-offs
-
AgentProcessBench: Diagnosing Step-Level Process Quality in Tool-Using Agents
-
LongVidSearch: An Agentic Benchmark for Multi-hop Evidence Retrieval Planning in Long Videos
-
Physics-Informed Policy Optimization via Analytic Dynamics Regularization
-
AI Can Learn Scientific Taste
MongoDB - Build AI That Scales
