Papers
-
When Visuals Aren't the Problem: Evaluating Vision-Language Models on Misleading Data Visualizations
-
Feature Incremental Clustering with Generalization Bounds
-
Spatio-Temporal Attention Enhanced Multi-Agent DRL for UAV-Assisted Wireless Networks with Limited Communications
-
In-network Attack Detection with Federated Deep Learning in IoT Networks: Real Implementation and Analysis
-
Cerebra: A Multidisciplinary AI Board for Multimodal Dementia Characterization and Risk Assessment
-
Riemannian Geometry Speaks Louder Than Words: From Graph Foundation Model to Next-Generation Graph Intelligence
-
SynLeaF: A Dual-Stage Multimodal Fusion Framework for Synthetic Lethality Prediction Across Pan- and Single-Cancer Contexts
-
mSFT: Addressing Dataset Mixtures Overfitting Heterogeneously in Multi-task SFT
-
INTRYGUE: Induction-Aware Entropy Gating for Reliable RAG Uncertainty Estimation
-
DiT-Flow: Speech Enhancement Robust to Multiple Distortions based on Flow Matching in Latent Space and Diffusion Transformers
-
Rule-State Inference (RSI): A Bayesian Framework for Compliance Monitoring in Rule-Governed Domains
-
SARe: Structure-Aware Large-Scale 3D Fragment Reassembly
-
Towards Multimodal Time Series Anomaly Detection with Semantic Alignment and Condensed Interaction
-
AgenticRec: End-to-End Tool-Integrated Policy Optimization for Ranking-Oriented Recommender Agents
-
AdaEdit: Adaptive Temporal and Channel Modulation for Flow-Based Image Editing
-
Rateless DeepJSCC for Broadcast Channels: a Rate-Distortion-Complexity Tradeoff
-
FAAR: Format-Aware Adaptive Rounding for NVFP4
-
4DGS360: 360° Gaussian Reconstruction of Dynamic Objects from a Single Video
-
Efficient Zero-Shot AI-Generated Image Detection
-
Proximal Policy Optimization in Path Space: A Schrödinger Bridge Perspective
-
PGR-Net: Prior-Guided ROI Reasoning Network for Brain Tumor MRI Segmentation
-
Dual-level Adaptation for Multi-Object Tracking: Building Test-Time Calibration from Experience and Intuition
-
EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises
-
Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confidence in LLM Benchmarks
-
No Dense Tensors Needed: Fully Sparse Object Detection on Event-Camera Voxel Grids
-
Engineering Distributed Governance for Regional Prosperity: A Socio-Technical Framework for Mitigating Under-Vibrancy via Human Data Engines
-
FedCVU: Federated Learning for Cross-View Video Understanding
-
MISApp: Multi-Hop Intent-Aware Session Graph Learning for Next App Prediction
-
Towards Secure Retrieval-Augmented Generation: A Comprehensive Review of Threats, Defenses and Benchmarks
-
TrustFed: Enabling Trustworthy Medical AI under Data Privacy Constraints
-
A Comparative Analysis of LLM Memorization at Statistical and Internal Levels: Cross-Model Commonalities and Model-Specific Signatures
-
OmniFM: Toward Modality-Robust and Task-Agnostic Federated Learning for Heterogeneous Medical Imaging
-
Cross-Scenario Deraining Adaptation with Unpaired Data: Superpixel Structural Priors and Multi-Stage Pseudo-Rain Synthesis
-
TAMTRL: Teacher-Aligned Reward Reshaping for Multi-Turn Reinforcement Learning in Long-Context Compression
-
HumanOmni-Speaker: Identifying Who said What and When
-
Optimal Memory Encoding Through Fluctuation-Response Structure
-
Rethinking Multimodal Fusion for Time Series: Auxiliary Modalities Need Constrained Fusion
-
PRM-as-a-Judge: A Dense Evaluation Paradigm for Fine-Grained Robotic Auditing
-
Optimizing Multi-Agent Weather Captioning via Text Gradient Descent: A Training-Free Approach with Consensus-Aware Gradient Fusion
-
SPINONet: Scalable Spiking Physics-informed Neural Operator for Computational Mechanics Applications
-
Thinking Deeper, Not Longer: Depth-Recurrent Transformers for Compositional Generalization
-
CoNBONet: Conformalized Neuroscience-inspired Bayesian Operator Network for Reliability Analysis
-
LipsAM: Lipschitz-Continuous Amplitude Modifier for Audio Signal Processing and its Application to Plug-and-Play Dereverberation
-
MIRAGE: The Illusion of Visual Understanding
-
AI Token Futures Market: Commoditization of Compute and Derivatives Contract Design
-
Reasoning Provenance for Autonomous AI Agents: Structured Behavioral Analytics Beyond State Checkpoints and Execution Traces
-
Deterministic Hallucination Detection in Medical VQA via Confidence-Evidence Bayesian Gain
-
RefracGS: Novel View Synthesis Through Refractive Water Surfaces with 3D Gaussian Ray Tracing
-
MIND: Multi-agent inference for negotiation dialogue in travel planning
-
Structured Visual Narratives Undermine Safety Alignment in Multimodal Large Language Models
MongoDB - Build AI That Scales
