Papers
-
A Comparative Investigation of Thermodynamic Structure-Informed Neural Networks
-
When Sensing Varies with Contexts: Context-as-Transform for Tactile Few-Shot Class-Incremental Learning
-
AnyDoc: Enhancing Document Generation via Large-Scale HTML/CSS Data Synthesis and Height-Aware Reinforcement Optimization
-
The Language of Touch: Translating Vibrations into Text with Dual-Branch Learning
-
MCLMR: A Model-Agnostic Causal Learning Framework for Multi-Behavior Recommendation
-
AirSplat: Alignment and Rating for Robust Feed-Forward 3D Gaussian Splatting
-
Denoise and Align: Towards Source-Free UDA for Robust Panoramic Semantic Segmentation
-
Robust Principal Component Completion
-
RubricEval: A Rubric-Level Meta-Evaluation Benchmark for LLM Judges in Instruction Following
-
EgoXtreme: A Dataset for Robust Object Pose Estimation in Egocentric Views under Extreme Conditions
-
Reinforcement learning for quantum processes with memory
-
SAVe: Self-Supervised Audio-visual Deepfake Detection Exploiting Visual Artifacts and Audio-visual Misalignment
-
IncreRTL: Traceability-Guided Incremental RTL Generation under Requirement Evolution
-
FD$^2$: A Dedicated Framework for Fine-Grained Dataset Distillation
-
ReCUBE: Evaluating Repository-Level Context Utilization in Code Generation
-
Learning to Rank Caption Chains for Video-Text Alignment
-
Factors Influencing the Quality of AI-Generated Code: A Synthesis of Empirical Evidence
-
Goodness-of-pronunciation without phoneme time alignment
-
UniAI-GraphRAG: Synergizing Ontology-Guided Extraction, Multi-Dimensional Clustering, and Dual-Channel Fusion for Robust Multi-Hop Reasoning
-
Photon: Speedup Volume Understanding with Efficient Multimodal Large Language Models
-
Vision Hopfield Memory Networks
-
Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills
-
A Semantically Disentangled Unified Model for Multi-category 3D Anomaly Detection
-
SportSkills: Physical Skill Learning from Sports Instructional Videos
-
PIDP-Attack: Combining Prompt Injection with Database Poisoning Attacks on Retrieval-Augmented Generation Systems
-
Towards Foundation Models for 3D Scene Understanding: Instance-Aware Self-Supervised Learning for Point Clouds
-
Empowering Epidemic Response: The Role of Reinforcement Learning in Infectious Disease Control
-
ET-SAM: Efficient Point Prompt Prediction in SAM for Unified Scene Text Detection and Layout Analysis
-
To Write or to Automate Linguistic Prompts, That Is the Question
-
Knowledge-Guided Adversarial Training for Infrared Object Detection via Thermal Radiation Modeling
-
AG-EgoPose: Leveraging Action-Guided Motion and Kinematic Joint Encoding for Egocentric 3D Pose Estimation
-
Prompt Attack Detection with LLM-as-a-Judge and Mixture-of-Models
-
Bilingual Text-to-Motion Generation: A New Benchmark and Baselines
-
VolDiT: Controllable Volumetric Medical Image Synthesis with Diffusion Transformers
-
Cross-Preference Learning for Sentence-Level and Context-Aware Machine Translation
-
Train at Moving Edge: Online-Verified Prompt Selection for Efficient RL Training of Large Reasoning Model
-
Knowledge-Guided Retrieval-Augmented Generation for Zero-Shot Psychiatric Data: Privacy Preserving Synthetic Data Generation
-
Probing the Lack of Stable Internal Beliefs in LLMs
-
AnyID: Ultra-Fidelity Universal Identity-Preserving Video Generation from Any Visual References
-
A Catalog of Basque Dialectal Resources: Online Collections and Standard-to-Dialectal Adaptations
-
CardioDiT: Latent Diffusion Transformers for 4D Cardiac MRI Synthesis
-
A Decade-Scale Benchmark Evaluating LLMs' Clinical Practice Guidelines Detection and Adherence in Multi-turn Conversations
-
The Competence Shadow: Theory and Bounds of AI Assistance in Safety Engineering
-
TacSIm: A Dataset and Benchmark for Football Tactical Style Imitation
-
SafeMath: Inference-time Safety improves Math Accuracy
-
CIV-DG: Conditional Instrumental Variables for Domain Generalization in Medical Imaging
-
Probabilistic Concept Graph Reasoning for Multimodal Misinformation Detection
-
A CDF-First Framework for Free-Form Density Estimation
-
Free-Lunch Long Video Generation via Layer-Adaptive O.O.D Correction
-
A Wireless World Model for AI-Native 6G Networks
MongoDB - Build AI That Scales
