Papers
- The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration
- Agent-Sentry: Bounding LLM Agents via Execution Provenance
- Chain-of-Authorization: Internalizing Authorization into Large Language Models via Reasoning Trajectories
- Designing to Forget: Deep Semi-parametric Models for Unlearning
- Dynamical Systems Theory Behind a Hierarchical Reasoning Model
- ForeSea: AI Forensic Search with Multi-modal Queries for Video Surveillance
- Template-Based Feature Aggregation Network for Industrial Anomaly Detection
- Grounding Sim-to-Real Generalization in Dexterous Manipulation: An Empirical Study with Vision-Language-Action Models
- Continuous Optimization for Satisfiability Modulo Theories on Linear Real Arithmetic
- Confidence Calibration under Ambiguous Ground Truth
- TreeTeaming: Autonomous Red-Teaming of Vision-Language Models via Hierarchical Strategy Exploration
- Group Editing: Edit Multiple Images in One Go
- A Heterogeneous Long-Micro Scale Cascading Architecture for General Aviation Health Management
- Conditionally Identifiable Latent Representation for Multivariate Time Series with Structural Dynamics
- VLGOR: Visual-Language Knowledge Guided Offline Reinforcement Learning for Generalizable Agents
- SLARM: Streaming and Language-Aligned Reconstruction Model for Dynamic Scenes
- Off-Policy Evaluation and Learning for Survival Outcomes under Censoring
- Separating Diagnosis from Control: Auditable Policy Adaptation in Agent-Based Simulations with LLM-Based Diagnostics
- Dual-Teacher Distillation with Subnetwork Rectification for Black-Box Domain Adaptation
- EchoKV: Efficient KV Cache Compression via Similarity-Based Reconstruction
- ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling
- From the AI Act to a European AI Agency: Completing the Union's Regulatory Architecture
- Multilingual KokoroChat: A Multi-LLM Ensemble Translation Method for Creating a Multilingual Counseling Dialogue Dataset
- When AVSR Meets Video Conferencing: Dataset, Degradation, and the Hidden Mechanism Behind Performance Collapse
- EVA: Efficient Reinforcement Learning for End-to-End Video Agent
- The EU AI Act and the Rights-based Approach to Technological Governance
- Quality Over Clicks: Intrinsic Quality-Driven Iterative Reinforcement Learning for Cold-Start E-Commerce Query Suggestion
- ProGRank: Probe-Gradient Reranking to Defend Dense-Retriever RAG from Corpus Poisoning
- Ran Score: a LLM-based Evaluation Score for Radiology Report Generation
- FixationFormer: Direct Utilization of Expert Gaze Trajectories for Chest X-Ray Classification
- Optimizing Small Language Models for NL2SQL via Chain-of-Thought Fine-Tuning
- PersonalQ: Select, Quantize, and Serve Personalized Diffusion Models for Efficient Inference
- Caption Generation for Dongba Paintings via Prompt Learning and Semantic Fusion
- Weak-PDE-Net: Discovering Open-Form PDEs via Differentiable Symbolic Networks and Weak Formulation
- Cluster-Wise Spatio-Temporal Masking for Efficient Video-Language Pretraining
- Privacy-Preserving EHR Data Transformation via Geometric Operators: A Human-AI Co-Design Technical Report
- Safe Reinforcement Learning with Preference-based Constraint Inference
- AscendOptimizer: Episodic Agent for Ascend NPU Operator Optimization
- Stepwise Variational Inference with Vine Copulas
- Asymptotic Learning Curves for Diffusion Models with Random Features Score and Manifold Data
- A PAC-Bayesian approach to generalization for quantum models
- Few-Shot Generative Model Adaption via Identity Injection and Preservation
- Set-Valued Prediction for Large Language Models with Feasibility-Aware Coverage Guarantees
- Beyond Theoretical Bounds: Empirical Privacy Loss Calibration for Text Rewriting Under Local Differential Privacy
- FCL-COD: Weakly Supervised Camouflaged Object Detection with Frequency-aware and Contrastive Learning
- WorldMesh: Generating Navigable Multi-Room 3D Scenes via Mesh-Conditioned Image Diffusion
- Where Experts Disagree, Models Fail: Detecting Implicit Legal Citations in French Court Decisions
- Causal Reconstruction of Sentiment Signals from Sparse News Data
- DariMis: Harm-Aware Modeling for Dari Misinformation Detection on YouTube
- JFTA-Bench: Evaluate LLM's Ability of Tracking and Analyzing Malfunctions Using Fault Trees
