Papers
-
Statistical Learning for Latent Embedding Alignment with Application to Brain Encoding and Decoding
-
Confidence Freeze: Early Success Induces a Metastable Decoupling of Metacognition and Behaviour
-
LPNSR: Prior-Enhanced Diffusion Image Super-Resolution via LR-Guided Noise Prediction
-
SpatialFly: Geometry-Guided Representation Alignment for UAV Vision-and-Language Navigation in Urban Environments
-
When Minor Edits Matter: LLM-Driven Prompt Attack for Medical VLM Robustness in Ultrasound
-
A Two-stage Transformer Framework for Temporal Localization of Distracted Driver Behaviors
-
COMPASS-Hedge: Learning Safely Without Knowing the World
-
Harmful Visual Content Manipulation Matters in Misinformation Detection Under Multimedia Scenarios
-
SGAD-SLAM: Splatting Gaussians at Adjusted Depth for Better Radiance Fields in RGBD SLAM
-
Semi-Supervised Learning with Balanced Deep Representation Distributions
-
Single-Eye View: Monocular Real-time Perception Package for Autonomous Driving
-
Gradient Descent with Projection Finds Over-Parameterized Neural Networks for Learning Low-Degree Polynomials with Nearly Minimax Optimal Rate
-
2Xplat: Two Experts Are Better Than One Generalist
-
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning
-
NoOVD: Novel Category Discovery and Embedding for Open-Vocabulary Object Detection
-
CTFS : Collaborative Teacher Framework for Forward-Looking Sonar Image Semantic Segmentation with Extremely Limited Labels
-
SqueezeComposer: Temporal Speed-up is A Simple Trick for Long-form Music Composing
-
CoVFT: Context-aware Visual Fine-tuning for Multimodal Large Language Models
-
Assessing the Ability of Neural TTS Systems to Model Consonant-Induced F0 Perturbation
-
Hierarchical Text-Guided Brain Tumor Segmentation via Sub-Region-Aware Prompts
-
ViCLSR: A Supervised Contrastive Learning Framework with Natural Language Inference for Natural Language Understanding Tasks
-
Taming Sampling Perturbations with Variance Expansion Loss for Latent Diffusion Models
-
DGRNet: Disagreement-Guided Refinement for Uncertainty-Aware Brain Tumor Segmentation
-
Stochastic approximation in non-markovian environments revisited
-
ReasonScaffold: A Scaffolded Reasoning-based Annotation Protocol for Human-AI Co-Annotation
-
Representation-Level Adversarial Regularization for Clinically Aligned Multitask Thyroid Ultrasound Assessment
-
Mixture of Chapters: Scaling Learnt Memory in Transformers
-
Learning to Optimize Joint Source and RIS-assisted Channel Encoding for Multi-User Semantic Communication Systems
-
Learning Progressive Adaptation for Multi-Modal Tracking
-
CounterScene: Counterfactual Causal Reasoning in Generative World Models for Safety-Critical Closed-Loop Evaluation
-
ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Language Models
-
DMMRL: Disentangled Multi-Modal Representation Learning via Variational Autoencoders for Molecular Property Prediction
-
Frequency Switching Mechanism for Parameter-E!cient Multi-Task Learning
-
CVT-Bench: Counterfactual Viewpoint Transformations Reveal Unstable Spatial Representations in Multimodal LLMs
-
LiFR-Seg: Anytime High-Frame-Rate Segmentation via Event-Guided Propagation
-
Session Risk Memory (SRM): Temporal Authorization for Deterministic Pre-Execution Safety Gates
-
ReDiffuse: Rotation Equivariant Diffusion Model for Multi-focus Image Fusion
-
Anatomical Prior-Driven Framework for Autonomous Robotic Cardiac Ultrasound Standard View Acquisition
-
One Pool Is Not Enough: Multi-Cluster Memory for Practical Test-Time Adaptation
-
MS-CustomNet: Controllable Multi-Subject Customization with Hierarchical Relational Semantics
-
Incentivizing Generative Zero-Shot Learning via Outcome-Reward Reinforcement Learning with Visual Cues
-
Ontology-driven personalized information retrieval for XML documents
-
ORACLE: Optimizing Reasoning Abilities of Large Language Models via Constraint-Led Synthetic Data Elicitation
-
Time-adaptive functional Gaussian Process regression
-
NeSy-Edge: Neuro-Symbolic Trustworthy Self-Healing in the Computing Continuum
-
WIST: Web-Grounded Iterative Self-Play Tree for Domain-Targeted Reasoning Improvement
-
Emergent Formal Verification: How an Autonomous AI Ecosystem Independently Discovered SMT-Based Safety Across Six Domains
-
TRACE: A Multi-Agent System for Autonomous Physical Reasoning for Seismology
-
Learning from Label Proportions with Dual-proportion Constraints
-
Can LLMs Fool Graph Learning? Exploring Universal Adversarial Attacks on Text-Attributed Graphs
MongoDB - Build AI That Scales
