Papers
-
Talking Together: Synthesizing Co-Located 3D Conversations from Audio
-
Momentum SVGD-EM for Accelerated Maximum Marginal Likelihood Estimation
Ecole Normale Supérieure Paris-Saclay, Imperial College London
-
A New Lower Bound for the Random Offerer Mechanism in Bilateral Trade using AI-Guided Evolutionary Search
-
ER-Pose: Rethinking Keypoint-Driven Representation Learning for Real-Time Human Pose Estimation
Nanjing University of Posts and Telecommunications
-
Structural Causal Bottleneck Models
German Research Center for Artificial Intelligence, Technische Universität Berlin, University of Potsdam
-
Benchmarking Language Modeling for Lossless Compression of Full-Fidelity Audio
University of California San Diego
-
Split Federated Learning Architectures for High-Accuracy and Low-Delay Model Training
-
A Multi-Objective Optimization Approach for Sustainable AI-Driven Entrepreneurship in Resilient Economies
Southern Illinois University Carbondale, Yarmouk University
-
HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising
Anhui Province Key Laboratory of Digital Security, Tencent Hunyuan, The Chinese University of Hong Kong, Tongji University, University of Science and Technology of China
-
Evaluating Financial Intelligence in Large Language Models: Benchmarking SuperInvesting AI with LLM Engines
The Future University
-
Impermanent: A Live Benchmark for Temporal Generalization in Time Series Forecasting
TimeCopilot, Amazon Web Services / ELLIS Institute Tübingen, Mila–Quebec AI Institute, Université de Montréal
-
FVG-PT: Adaptive Foreground View-Guided Prompt Tuning for Vision-Language Models
ShanghaiTech University, University of Technology Sydney
-
Multi-level meta-reinforcement learning with skill-based curriculum
Johns Hopkins University
-
Granulon: Awakening Pixel-Level Visual Encoders with Adaptive Multi-Granularity Semantics for MLLM
Nanyang Technological University, National University of Singapore
-
Large Language Model-Assisted Superconducting Qubit Experiments
Aalto University, Rensselaer Polytechnic Institute, University of Chicago
-
The Temporal Markov Transition Field
-
Test-Driven AI Agent Definition (TDAD): Compiling Tool-Using Agents from Behavioral Specifications
FiverrLabs
-
Where, What, Why: Toward Explainable 3D-GS Watermarking
Nanyang Technological University, Southeast University, Waseda University
-
VisionCreator-R1: A Reflection-Enhanced Native Visual-Generation Agentic Model
Hong Kong University of Science and Technology, Tencent Hunyuan
-
Scale-Plan: Scalable Language-Enabled Task Planning for Heterogeneous Multi-Robot Teams
Honda Research Institute, University of California, Riverside
-
Training Language Models via Neural Cellular Automata
Improbable AI Lab, Massachusetts Institute of Technology
-
Beyond Relevance: On the Relationship Between Retrieval and RAG Information Coverage
Johns Hopkins University, National Institute of Standards and Technology, University of New Hampshire
-
Fish Audio S2 Technical Report
-
SoftJAX & SoftTorch: Empowering Automatic Differentiation Libraries with Informative Gradients
Masaryk University, Max Planck Institute for Biogeochemistry, Max Planck Institute for Intelligent Systems, University of Tübingen
-
Are Expressive Encoders Necessary for Discrete Graph Generation?
Michigan State University, University of Texas at Arlington
-
Computer Vision-Based Vehicle Allotment System using Perspective Mapping
National Institute of Technology Rourkela
-
MASEval: Extending Multi-Agent Evaluation from Models to Systems
Parameter Lab / Korea Advanced Institute of Science and Technology, Mohamed bin Zayed University of Artificial Intelligence, NAVER AI Lab, TU Darmstadt, University of Oxford, University of Tübingen
-
A Lightweight Multi-Cancer Tumor Localization Framework for Deployable Digital Pathology
Indiana University, University of Pittsburgh, UPMC Hillman Cancer Center
-
HECTOR: Hybrid Editable Compositional Object References for Video Generation
-
SBOMs into Agentic AIBOMs: Schema Extensions, Agentic Orchestration, and Reproducibility Evaluation
Alan Turing Institute, University of Oxford
-
LDP: An Identity-Aware Protocol for Multi-Agent LLM Systems
Indian School of Business
-
Unpacking Interpretability: Human-Centered Criteria for Optimal Combinatorial Solutions
Technical University of Darmstadt, University of Vienna
-
Expressivity-Efficiency Tradeoffs for Hybrid Sequence Models
University of Wisconsin-Madison
-
APPLV: Adaptive Planner Parameter Learning from Vision-Language-Action Model
George Mason University, Rutgers University, University of South Florida
-
Why Channel-Centric Models are not Enough to Predict End-to-End Performance in Private 5G: A Measurement Campaign and Case Study
KTH Royal Institute of Technology
-
One Language, Two Scripts: Probing Script-Invariance in LLM Concept Representations
Columbia University
-
Quantifying the Accuracy and Cost Impact of Design Decisions in Budget-Constrained Agentic LLM Search
Louisiana State University
-
MultiGraSCCo: A Multilingual Anonymization Benchmark with Annotations of Personal Identifiers
Berlin Institute for the Foundations of Learning and Data, Charité – Universitätsmedizin Berlin, German Research Center for Artificial Intelligence, Humboldt-Universität zu Berlin, Technische Universität Berlin, University of Potsdam
-
From Word2Vec to Transformers: Text-Derived Composition Embeddings for Filtering Combinatorial Electrocatalysts
Ruhr-Universität Bochum
-
Comparative Analysis of Patch Attack on VLM-Based Autonomous Driving Architectures
Clemson University
-
Towards Visual Query Segmentation in the Wild
University of North Texas
-
ConFu: Contemplate the Future for Better Speculative Sampling
Qualcomm AI Research, University of California
-
A New Modeling to Feature Selection Based on the Fuzzy Rough Set Theory in Normal and Optimistic States on Hybrid Information Systems
Islamic Azad University
-
NetDiffuser: Deceiving DNN-Based Network Attack Detection Systems with Diffusion-Generated Adversarial Traffic
New Mexico State University, The U.S. Army Combat Capabilities Development Command, University of Hartford
-
Multi-Kernel Gated Decoder Adapters for Robust Multi-Task Thyroid Ultrasound under Cross-Center Shift
BC Cancer Research Institute, University of British Columbia
-
Cross-Domain Uncertainty Quantification for Selective Prediction: A Comprehensive Bound Ablation with Transfer-Informed Betting
-
SciTaRC: Benchmarking QA on Scientific Tabular Data that Requires Language Reasoning and Complex Computation
Johns Hopkins University
-
FedLECC: Cluster- and Loss-Guided Client Selection for Federated Learning under Non-IID Data
Sapienza University of Rome
-
Quantifying Memorization and Privacy Risks in Genomic Language Models
Case Western Reserve University, Rutgers University, University of Texas
-
Uncovering a Winning Lottery Ticket with Continuously Relaxed Bernoulli Gates
Bar-Ilan University
