Papers
- OmniDiT: Extending Diffusion Transformer to Omni-VTON Framework
- Heavy-Tailed and Long-Range Dependent Noise in Stochastic Approximation: A Finite-Time Analysis
- PolicySim: An LLM-Based Agent Social Simulation Sandbox for Proactive Policy Optimization
- Ensembles-based Feature Guided Analysis
- GravCal: Single-Image Calibration of IMU Gravity Priors with Per-Sample Confidence
- Model Selection and Parameter Estimation of Multi-dimensional Gaussian Mixture Model
- CS-MUNet: A Channel-Spatial Dual-Stream Mamba Network for Multi-Organ Segmentation
- Semantic Audio-Visual Navigation in Continuous Environments
- The Residual Stream Is All You Need: On the Redundancy of the KV Cache in Transformer Inference
- Toward High-Fidelity Visual Reconstruction: From EEG-Based Conditioned Generation to Joint-Modal Guided Rebuilding
- Structured Prompting for Arabic Essay Proficiency: A Trait-Centric Evaluation Approach
- Scale-Dependent Radial Geometry and Metric Mismatch in Wasserstein Propagation for Reverse Diffusion
- Geometric Mixture-of-Experts with Curvature-Guided Adaptive Routing for Graph Representation Learning
- Making Video Models Adhere to User Intent with Minor Adjustments
- DynFlowDrive: Flow-Based Dynamic World Modeling for Autonomous Driving
- ATHENA: Adaptive Test-Time Steering for Improving Count Fidelity in Diffusion Models
- GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems
- Vision-Language Attribute Disentanglement and Reinforcement for Lifelong Person Re-Identification
- Unbiased Dynamic Multimodal Fusion
- 3D Gaussian Splatting with Self-Constrained Priors for High Fidelity Surface Reconstruction
- Ontology-Based Knowledge Modeling and Uncertainty-Aware Outdoor Air Quality Assessment Using Weighted Interval Type-2 Fuzzy Logic
- TSegAgent: Zero-Shot Tooth Segmentation via Geometry-Aware Vision-Language Agents
- A Subgoal-driven Framework for Improving Long-Horizon LLM Agents
- Diminishing Returns in Expanding Generative Models and Godel-Tarski-Lob Limits
- DataProphet: Demystifying Supervision Data Generalization in Multimodal LLMs
- A Unified Phase-native Computational Principle Governs Hippocampal Spike Timing and Neural Coding
- Demographic-Aware Self-Supervised Anomaly Detection Pretraining for Equitable Rare Cardiac Diagnosis
- Regret Analysis of Sleeping Competing Bandits
- Minimax and Adaptive Covariance Matrix Estimation under Differential Privacy
- WorldAgents: Can Foundation Image Models be Agents for 3D World Models?
- Bounded Coupled AI Learning Dynamics in Tri-Hierarchical Drone Swarms
- AIGQ: An End-to-End Hybrid Generative Architecture for E-commerce Query Recommendation
- EvoTaxo: Building and Evolving Taxonomy from Social Media Streams
- TAB-AUDIT: Detecting AI-Fabricated Scientific Tables via Multi-View Likelihood Mismatch
- Learning from Similarity/Dissimilarity and Pairwise Comparison
- LoopRPT: Reinforcement Pre-Training for Looped Language Models
- Stepwise: Neuro-Symbolic Proof Search for Automated Systems Verification
- BALM: A Model-Agnostic Framework for Balanced Multimodal Learning under Imbalanced Missing Rates
- FedRG: Unleashing the Representation Geometry for Federated Learning with Noisy Clients
- PerformRecast: Expression and Head Pose Disentanglement for Portrait Video Editing
- Procedural Refinement by LLM-driven Algorithmic Debugging for ARC-AGI-2
- PoC: Performance-oriented Context Compression for Large Language Models via Performance Prediction
- A two-step sequential approach for hyperparameter selection in finite context models
- MOSS-TTSD: Text to Spoken Dialogue Generation
- FedPDPO: Federated Personalized Direct Preference Optimization for Large Language Model Alignment
- Hybrid Autoencoder-Isolation Forest approach for time series anomaly detection in C70XP cyclotron operation data at ARRONAX
- Dual Path Attribution: Efficient Attribution for SwiGLU-Transformers through Layer-Wise Target Propagation
- Rethinking Ground Truth: A Case Study on Human Label Variation in MLLM Benchmarking
- PhysNeXt: Next-Generation Dual-Branch Structured Attention Fusion Network for Remote Photoplethysmography Measurement
- ReLi3D: Relightable Multi-view 3D Reconstruction with Disentangled Illumination
