Papers
-
SldprtNet: A Large-Scale Multimodal Dataset for CAD Generation in Language-Driven 3D Design
-
Beyond Final Answers: CRYSTAL Benchmark for Transparent Multimodal Reasoning Evaluation
-
Evaluating VLMs' Spatial Reasoning Over Robot Motion: A Step Towards Robot Planning with Motion Preferences
-
BenDFM: A taxonomy and synthetic CAD dataset for manufacturability assessment in sheet metal bending
-
Panoramic Multimodal Semantic Occupancy Prediction for Quadruped Robots
-
BoSS: A Best-of-Strategies Selector as an Oracle for Deep Active Learning
-
ZO-SAM: Zero-Order Sharpness-Aware Minimization for Efficient Sparse Training
-
NOIR: Neural Operator mapping for Implicit Representations
-
Geometry-Guided Camera Motion Understanding in VideoLLMs
-
FDeID-Toolbox: Face De-Identification Toolbox
-
Scalable Machines with Intrinsic Higher Mental-State Dynamics
-
Developing the PsyCogMetrics AI Lab to Evaluate Large Language Models and Advance Cognitive Science -- A Three-Cycle Action Design Science Study
-
Steve-Evolving: Open-World Embodied Self-Evolution via Fine-Grained Diagnosis and Dual-Track Knowledge Distillation
-
When Right Meets Wrong: Bilateral Context Conditioning with Reward-Confidence Correction for GRPO
-
ESG-Bench: Benchmarking Long-Context ESG Reports for Hallucination Mitigation
-
DiT-IC: Aligned Diffusion Transformer for Efficient Image Compression
-
Towards Faithful Multimodal Concept Bottleneck Models
-
Reconciling In-Context and In-Weight Learning via Dual Representation Space Encoding
-
Developing and evaluating a chatbot to support maternal health care
-
Semantic Invariance in Agentic AI
-
Purifying Generative LLMs from Backdoors without Prior Knowledge or Clean Reference
-
Perceive What Matters: Relevance-Driven Scheduling for Multimodal Streaming Perception
-
Clustering Astronomical Orbital Synthetic Data Using Advanced Feature Extraction and Dimensionality Reduction Techniques
-
MXNorm: Reusing MXFP block scales for efficient tensor normalisation
-
Diffusion-Based Feature Denoising and Using NNMF for Robust Brain Tumor Classification
-
Towards Spatio-Temporal World Scene Graph Generation from Monocular Videos
-
Learnability and Privacy Vulnerability are Entangled in a Few Critical Weights
-
LLM Constitutional Multi-Agent Governance
-
From Experiments to Expertise: Scientific Knowledge Consolidation for AI-Driven Computational Research
-
Neuron-Aware Data Selection In Instruction Tuning For Large Language Models
-
Theoretical Foundations of Latent Posterior Factors: Formal Guarantees for Multi-Evidence Reasoning
-
Open World MRI Reconstruction with Bias-Calibrated Adaptation
-
Leveraging Large Vision Model for Multi-UAV Co-perception in Low-Altitude Wireless Networks
-
Out of Sight, Out of Mind? Evaluating State Evolution in Video World Models
-
Visual-ERM: Reward Modeling for Visual Equivalence
-
Resolving Interference (RI): Disentangling Models for Improved Model Merging
-
PhysMoDPO: Physically-Plausible Humanoid Motion with Preference Optimization
-
Equivalence of approximation by networks of single- and multi-spike neurons
-
Deep Invertible Autoencoders for Dimensionality Reduction of Dynamical Systems
-
Synthetic Melanoma Image Generation and Evaluation Using Generative Adversarial Networks
-
ActionPlan: Future-Aware Streaming Motion Synthesis via Frame-Level Action Planning
-
Standard Acquisition Is Sufficient for Asynchronous Bayesian Optimization
-
LibraGen: Playing a Balance Game in Subject-Driven Video Generation
-
MIRAGE: Model-agnostic Industrial Realistic Anomaly Generation and Evaluation for Visual Anomaly Detection
-
Executable Archaeology: Reanimating the Logic Theorist from its IPL-V Source
-
VoXtream2: Full-stream TTS with dynamic speaking rate control
-
A Systematic Benchmark of GAN Architectures for MRI-to-CT Synthesis
-
Eleven Primitives and Three Gates: The Universal Structure of Computational Imaging
-
Hide and Seek: Investigating Redundancy in Earth Observation Imagery
-
SAIF: A Stability-Aware Inference Framework for Medical Image Segmentation with Segment Anything Model
