Papers
-
SFedHIFI: Fire Rate-Based Heterogeneous Information Fusion for Spiking Federated Learning
-
CyCLeGen: Cycle-Consistent Layout Prediction and Image Generation in Vision Foundation Models
-
Lightweight User-Personalization Method for Closed Split Computing
-
GeoNVS: Geometry Grounded Video Diffusion for Novel View Synthesis
-
This Is Taking Too Long -- Investigating Time as a Proxy for Energy Consumption of LLMs
-
Rethinking LLM Watermark Detection in Black-Box Settings: A Non-Intrusive Third-Party Framework
-
Voronoi-based Second-order Descriptor with Whitened Metric in LiDAR Place Recognition
-
Why Agents Compromise Safety Under Pressure
-
Anchoring Emotions in Text: Robust Multimodal Fusion for Mimicry Intensity Estimation
-
Beyond Benchmark Islands: Toward Representative Trustworthiness Evaluation for Agentic AI
-
MMSpec: Benchmarking Speculative Decoding for Vision-Language Models
-
Exposing Cross-Modal Consistency for Fake News Detection in Short-Form Videos
-
OrgForge: A Multi-Agent Simulation Framework for Verifiable Synthetic Corporate Corpora
-
Thermal Image Refinement with Depth Estimation using Recurrent Networks for Monocular ORB-SLAM3
-
How Log-Barrier Helps Exploration in Policy Optimization
-
MONET: Modeling and Optimization of neural NEtwork Training from Edge to Data Centers
-
Edit2Interp: Adapting Image Foundation Models from Spatial Editing to Video Frame Interpolation with Few-Shot Learning
-
Pretraining and Benchmarking Modern Encoders for Latvian
-
Empowering Chemical Structures with Biological Insights for Scalable Phenotypic Virtual Screening
-
Clue Matters: Leveraging Latent Visual Clues to Empower Video Reasoning
-
TrajFlow: Nation-wide Pseudo GPS Trajectory Generation with Flow Matching Models
-
Molecular Identifier Visual Prompt and Verifiable Reinforcement Learning for Chemical Reaction Diagram Parsing
-
Riemannian Motion Generation: A Unified Framework for Human Motion Representation and Generation via Riemannian Flow Matching
-
Consequentialist Objectives and Catastrophe
-
Reference-Free Omnidirectional Stereo Matching via Multi-View Consistency Maximization
-
MER-Bench: A Comprehensive Benchmark for Multimodal Meme Reappraisal
-
Describing Agentic AI Systems with C4: Lessons from Industry Projects
-
One CT Unified Model Training Framework to Rule All Scanning Protocols
-
Training-free Detection of Generated Videos via Spatial-Temporal Likelihoods
-
VTC-Bench: Evaluating Agentic Multimodal Models via Compositional Visual Tool Chaining
-
Rethinking Machine Unlearning: Models Designed to Forget via Key Deletion
-
Interpretable Predictability-Based AI Text Detection: A Replication Study
-
A convolutional autoencoder and neural ODE framework for surrogate modeling of transient counterflow flames
-
GUI-CEval: A Hierarchical and Comprehensive Chinese Benchmark for Mobile GUI Agents
-
Prompt Readiness Levels (PRL): a maturity scale and scoring framework for production grade prompt assets
-
AnoleVLA: Lightweight Vision-Language-Action Model with Deep State Space Models for Mobile Manipulation
-
CrossADR: enhancing adverse drug reactions prediction for combination pharmacotherapy with cross-layer feature integration and cross-level associative learning
-
SRL-MAD: Structured Residual Latents for One-Class Morphing Attack Detection
-
Thinking in Latents: Adaptive Anchor Refinement for Implicit Reasoning in LLMs
-
Interference-Aware K-Step Reachable Communication in Multi-Agent Reinforcement Learning
-
Spatio-temporal probabilistic forecast using MMAF-guided learning
-
Analyzing Error Sources in Global Feature Effect Estimation
-
Muon Converges under Heavy-Tailed Noise: Nonconvex Hölder-Smooth Empirical Risk Minimization
-
Writer-R1: Enhancing Generative Writing in LLMs via Memory-augmented Replay Policy Optimization
-
The Good, the Better, and the Best: Improving the Discriminability of Face Embeddings through Attribute-aware Learning
-
Generative Semantic HARQ: Latent-Space Text Retransmission and Combining
-
Interpretable Classification of Time Series Using Euler Characteristic Surfaces
-
Open Biomedical Knowledge Graphs at Scale: Construction, Federation, and AI Agent Access with Samyama Graph Database
-
ReactMotion: Generating Reactive Listener Motions from Speaker Utterance
-
Affordable Precision Agriculture: A Deployment-Oriented Review of Low-Cost, Low-Power Edge AI and TinyML for Resource-Constrained Farming Systems
