Papers
-
BiTro: Bidirectional Transfer Learning Enhances Bulk and Spatial Transcriptomics Prediction in Cancer Pathological Images
-
Photonic Quantum-Enhanced Knowledge Distillation
-
The impact of machine learning forecasting on strategic decision-making for Bike Sharing Systems
-
ExPosST: Explicit Positioning with Adaptive Masking for LLM-Based Simultaneous Machine Translation
-
Machine learning for sustainable geoenergy: uncertainty, physics and decision-ready inference
-
PerlAD: Towards Enhanced Closed-loop End-to-end Autonomous Driving with Pseudo-simulation-based Reinforcement Learning
-
TopoVST: Toward Topology-fidelitous Vessel Skeleton Tracking
-
Fine-tuning RoBERTa for CVE-to-CWE Classification: A 125M Parameter Model Competitive with LLMs
-
ILV: Iterative Latent Volumes for Fast and Accurate Sparse-View CT Reconstruction
-
EditHF-1M: A Million-Scale Rich Human Preference Feedback for Image Editing
-
Spectrogram features for audio and speech analysis
-
Bayesian Inference for Missing Physics
-
F2HDR: Two-Stage HDR Video Reconstruction via Flow Adapter and Physical Motion Modeling
-
UNICORN: Ultrasound Nakagami Imaging via Score Matching and Adaptation for Assessing Hepatic Steatosis
-
Directional Routing in Transformers
-
Workflow-Aware Structured Layer Decomposition for Illustration Production
-
Masked BRep Autoencoder via Hierarchical Graph Transformer
-
Video-CoE: Reinforcing Video Event Prediction via Chain of Events
-
Relevance Feedback in Text-to-Image Diffusion: A Training-Free And Model-Agnostic Interactive Framework
-
LLM as Graph Kernel: Rethinking Message Passing on Text-Rich Graphs
-
FAR-Drive: Frame-AutoRegressive Video Generation in Closed-Loop Autonomous Driving
-
Intelligent Control of Differential Drive Robots Subject to Unmodeled Dynamics with EKF-based State Estimation
-
RS-WorldModel: a Unified Model for Remote Sensing Understanding and Future Sense Forecasting
-
KGS-GCN: Enhancing Sparse Skeleton Sensing via Kinematics-Driven Gaussian Splatting and Probabilistic Topology for Action Recognition
-
Ultra-Early Prediction of Tipping Points: Integrating Dynamical Measures with Reservoir Computing
-
Spiking Layer-Adaptive Magnitude-based Pruning
-
FairMed-XGB: A Bayesian-Optimised Multi-Metric Framework with Explainability for Demographic Equity in Critical Healthcare Data
-
Bridging Scene Generation and Planning: Driving with World Model via Unifying Vision and Motion Representation
-
GT-PCQA: Geometry-Texture Decoupled Point Cloud Quality Assessment with MLLM
-
Pansharpening for Thin-Cloud Contaminated Remote Sensing Images: A Unified Framework and Benchmark Dataset
-
Omni IIE Bench: Benchmarking the Practical Capabilities of Image Editing Models
-
Learning Question-Aware Keyframe Selection with Synthetic Supervision for Video Question Answering
-
SFedHIFI: Fire Rate-Based Heterogeneous Information Fusion for Spiking Federated Learning
-
CyCLeGen: Cycle-Consistent Layout Prediction and Image Generation in Vision Foundation Models
-
Lightweight User-Personalization Method for Closed Split Computing
-
GeoNVS: Geometry Grounded Video Diffusion for Novel View Synthesis
-
This Is Taking Too Long -- Investigating Time as a Proxy for Energy Consumption of LLMs
-
Rethinking LLM Watermark Detection in Black-Box Settings: A Non-Intrusive Third-Party Framework
-
Voronoi-based Second-order Descriptor with Whitened Metric in LiDAR Place Recognition
-
Why Agents Compromise Safety Under Pressure
-
Anchoring Emotions in Text: Robust Multimodal Fusion for Mimicry Intensity Estimation
-
Beyond Benchmark Islands: Toward Representative Trustworthiness Evaluation for Agentic AI
-
MMSpec: Benchmarking Speculative Decoding for Vision-Language Models
-
Exposing Cross-Modal Consistency for Fake News Detection in Short-Form Videos
-
OrgForge: A Multi-Agent Simulation Framework for Verifiable Synthetic Corporate Corpora
-
Thermal Image Refinement with Depth Estimation using Recurrent Networks for Monocular ORB-SLAM3
-
How Log-Barrier Helps Exploration in Policy Optimization
-
MONET: Modeling and Optimization of neural NEtwork Training from Edge to Data Centers
-
Edit2Interp: Adapting Image Foundation Models from Spatial Editing to Video Frame Interpolation with Few-Shot Learning
-
Pretraining and Benchmarking Modern Encoders for Latvian
