Papers
-
Reforming the Mechanism: Editing Reasoning Patterns in LLMs with Circuit Reshaping
-
Small Target Detection Based on Mask-Enhanced Attention Fusion of Visible and Infrared Remote Sensing Images
-
MindfulAgents: Personalizing Mindfulness Meditation via an Expert-Aligned Multi-Agent System
-
HIERAMP: Coarse-to-Fine Autoregressive Amplification for Generative Dataset Distillation
-
Extracting and analyzing 3D histomorphometric features related to perineural and lymphovascular invasion in prostate cancer
-
Swimba: Switch Mamba Model Scales State Space Models
-
Physics-Consistent Neural Networks for Learning Deformation and Director Fields in Microstructured Media with Loss-Based Validation Criteria
-
Deep Research, Shallow Evaluation: A Case Study in Meta-Evaluation for Long-Form QA Benchmarks
-
Joint MDPs and Reinforcement Learning in Coupled-Dynamics Environments
-
How Private Are DNA Embeddings? Inverting Foundation Model Representations of Genomic Sequences
-
Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion
-
Solving an Open Problem in Theoretical Physics using AI-Assisted Discovery
-
Diagnosing FP4 inference: a layer-wise and block-wise sensitivity analysis of NVFP4 and MXFP4
-
KARL: Knowledge Agents via Reinforcement Learning
-
SPyCer: Semi-Supervised Physics-Guided Contextual Attention for Near-Surface Air Temperature Estimation from Satellite Imagery
-
Learning Optimal Distributionally Robust Individualized Treatment Rules Integrating Multi-Source Data
-
AI+HW 2035: Shaping the Next Decade
-
Learning Optimal Individualized Decision Rules with Conditional Demographic Parity
-
The Geometric Inductive Bias of Grokking: Bypassing Phase Transitions via Architectural Topology
-
Not All Trust is the Same: Effects of Decision Workflow and Explanations in Human-AI Decision Making
-
Digital Twin Driven Textile Classification and Foreign Object Recognition in Automated Sorting Systems
-
CBR-to-SQL: Rethinking Retrieval-based Text-to-SQL using Case-based Reasoning in the Healthcare Domain
-
Boosting ASR Robustness via Test-Time Reinforcement Learning with Audio-Text Semantic Rewards
-
On the Generalization Capacities of MLLMs for Spatial Intelligence
-
SlideSparse: Fast and Flexible (2N-2):2N Structured Sparsity
-
Recursive Inference Machines for Neural Reasoning
-
Reclaiming Lost Text Layers for Source-Free Cross-Domain Few-Shot Learning
-
GCAgent: Enhancing Group Chat Communication through Dialogue Agents System
-
ICHOR: A Robust Representation Learning Approach for ASL CBF Maps with Self-Supervised Masked Autoencoders
-
CATNet: Collaborative Alignment and Transformation Network for Cooperative Perception
-
Wiki-R1: Incentivizing Multimodal Reasoning for Knowledge-based VQA via Data and Sampling Curriculum
-
VietJobs: A Vietnamese Job Advertisement Dataset
-
A Behaviour-Aware Federated Forecasting Framework for Distributed Stand-Alone Wind Turbines
-
Beyond Word Error Rate: Auditing the Diversity Tax in Speech Recognition through Dataset Cartography
-
Visual-Informed Speech Enhancement Using Attention-Based Beamforming
-
Oral to Web: Digitizing 'Zero Resource' Languages of Bangladesh
-
SarcasmMiner: A Dual-Track Post-Training Framework for Robust Audio-Visual Sarcasm Reasoning
-
Whispering to a Blackbox: Bootstrapping Frozen OCR with Visual Prompts
-
Layer by layer, module by module: Choose both for optimal OOD probing of ViT
-
Bayesian Supervised Causal Clustering
-
X-RAY: Mapping LLM Reasoning Capability via Formalized and Calibrated Probes
-
Knowledge Divergence and the Value of Debate for Scalable Oversight
-
STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon Web Tasks
-
WebChain: A Large-Scale Human-Annotated Dataset of Real-World Web Interaction Traces
-
Latent Policy Steering through One-Step Flow Policies
-
WavSLM: Single-Stream Speech Language Modeling via WavLM Distillation
-
UniSTOK: Uniform Inductive Spatio-Temporal Kriging
-
Fusion4CA: Boosting 3D Object Detection via Comprehensive Image Exploitation
-
Med-V1: Small Language Models for Zero-shot and Scalable Biomedical Evidence Attribution
-
Latent-Mark: An Audio Watermark Robust to Neural Resynthesis
