Papers
-
Generalization in Online Reinforcement Learning for Mobile Agents
-
Data Agent: Learning to Select Data via End-to-End Dynamic Optimization
-
RPG-SAM: Reliability-Weighted Prototypes and Geometric Adaptive Threshold Selection for Training-Free One-Shot Polyp Segmentation
-
Cost-Driven Representation Learning for Linear Quadratic Gaussian Control: Part II
-
Machine Learning for Stress Testing: Uncertainty Decomposition in Causal Panel Prediction
-
DogWeave: High-Fidelity 3D Canine Reconstruction from a Single Image via Normal Fusion and Conditional Inpainting
-
Med-Evo: Test-time Self-evolution for Medical Multimodal Large Language Models
-
HLER: Human-in-the-Loop Economic Research via Multi-Agent Pipelines for Empirical Discovery
-
Few Tokens, Big Leverage: Preserving Safety Alignment by Constraining Safety Tokens during Fine-tuning
-
Discrete Tokenization Unlocks Transformers for Calibrated Tabular Forecasting
-
Dial: A Knowledge-Grounded Dialect-Specific NL2SQL System
-
Backdoor4Good: Benchmarking Beneficial Uses of Backdoors in LLMs
-
SLNet: A Super-Lightweight Geometry-Adaptive Network for 3D Point Cloud Recognition
-
Image Generation Models: A Technical History
-
"Better Ask for Forgiveness than Permission": Practices and Policies of AI Disclosure in Freelance Work
-
Where Do LLM-based Systems Break? A System-Level Security Framework for Risk Assessment and Treatment
-
The Dual-Stream Transformer: Channelized Architecture for Interpretable Language Modeling
-
Do Machines Fail Like Humans? A Human-Centred Out-of-Distribution Spectrum for Mapping Error Alignment
-
SIGMAE: A Spectral-Index-Guided Foundation Model for Multispectral Remote Sensing
-
Selective Transfer Learning of Cross-Modality Distillation for Monocular 3D Object Detection
-
Classifying Novel 3D-Printed Objects without Retraining: Towards Post-Production Automation in Additive Manufacturing
-
Trusting What You Cannot See: Auditable Fine-Tuning and Inference for Proprietary AI
-
Probabilistic Inference and Learning with Stein's Method
-
FedEU: Evidential Uncertainty-Driven Federated Fine-Tuning of Vision Foundation Models for Remote Sensing Image Segmentation
-
Towards Lightweight Adaptation of Speech Enhancement Models in Real-World Environments
-
Contact-Guided 3D Genome Structure Generation of E. coli via Diffusion Transformers
-
Give Them an Inch and They Will Take a Mile:Understanding and Measuring Caller Identity Confusion in MCP-Based AI Systems
-
Cross-Modal Taxonomic Generalization in (Vision-) Language Models
-
Skip to the Good Part: Representation Structure & Inference-Time Layer Skipping in Diffusion vs. Autoregressive LLMs
-
EVLF: Early Vision-Language Fusion for Generative Dataset Distillation
-
Interpretable-by-Design Transformers via Architectural Stream Independence
-
Multi-Modal Decouple and Recouple Network for Robust 3D Object Detection
-
A Joint Neural Baseline for Concept, Assertion, and Relation Extraction from Clinical Text
-
RobustSCI: Beyond Reconstruction to Restoration for Snapshot Compressive Imaging under Real-World Degradations
-
Pushing Bistatic Wireless Sensing toward High Accuracy at the Sub-Wavelength Scale
-
RayD3D: Distilling Depth Knowledge Along the Ray for Robust Multi-View 3D Object Detection
-
DocCogito: Aligning Layout Cognition and Step-Level Grounded Reasoning for Document Understanding
-
From Thinker to Society: Security in Hierarchical Autonomy Evolution of AI Agents
-
AMR-CCR: Anchored Modular Retrieval for Continual Chinese Character Recognition
-
Enhanced Random Subspace Local Projections for High-Dimensional Time Series Analysis
-
SeDa: A Unified System for Dataset Discovery and Multi-Entity Augmented Semantic Exploration
-
High-Fidelity Medical Shape Generation via Skeletal Latent Diffusion
-
A Unified Framework for Knowledge Transfer in Bidirectional Model Scaling
-
Online Continual Learning for Anomaly Detection in IoT under Data Distribution Shifts
-
Bolbosh: Script-Aware Flow Matching for Kashmiri Text-to-Speech
-
A Unified View of Drifting and Score-Based Models
-
EvolveReason: Self-Evolving Reasoning Paradigm for Explainable Deepfake Facial Image Identification
-
InterReal: A Unified Physics-Based Imitation Framework for Learning Human-Object Interaction Skills
-
Reinforcement learning-based dynamic cleaning scheduling framework for solar energy system
-
SketchGraphNet: A Memory-Efficient Hybrid Graph Transformer for Large-Scale Sketch Corpora Recognition
