Papers
-
Multimodal Emotion Regression with Multi-Objective Optimization and VAD-Aware Audio Modeling for the 10th ABAW EMI Track
-
Level Up: Defining and Exploiting Transitional Problems for Curriculum Learning
-
Knowledge Distillation for Large Language Models
-
Causal Tracing of Audio-Text Fusion in Large Audio Language Models
-
PhysAlign: Physics-Coherent Image-to-Video Generation through Feature and 3D Representation Alignment
-
Brain Tumor Classification from 3D MRI Using Persistent Homology and Betti Features: A Topological Data Analysis Approach on BraTS2020
-
LiveWeb-IE: A Benchmark For Online Web Information Extraction
-
Retrieval-Feedback-Driven Distillation and Preference Alignment for Efficient LLM-based Query Expansion
-
Generate Then Correct: Single Shot Global Correction for Aspect Sentiment Quad Prediction
-
AD-Copilot: A Vision-Language Assistant for Industrial Anomaly Detection via Visual In-context Comparison
-
Your Vision-Language-Action Model Already Has Attention Heads For Path Deviation Detection
-
RetimeGS: Continuous-Time Reconstruction of 4D Gaussian Splatting
-
Projection-Free Evolution Strategies for Continuous Prompt Search
-
Advancing Cancer Prognosis with Hierarchical Fusion of Genomic, Proteomic and Pathology Imaging Data from a Systems Biology Perspective
-
Hierarchy of extreme-event predictability in turbulence revealed by machine learning
-
Greedy Information Projection for LLM Data Selection
-
DeceptGuard :A Constitutional Oversight Framework For Detecting Deception in LLM Agents
-
IGU-LoRA: Adaptive Rank Allocation via Integrated Gradients and Uncertainty-Aware Scoring
-
GhanaNLP Parallel Corpora: Comprehensive Multilingual Resources for Low-Resource Ghanaian Languages
-
Computation and Communication Efficient Federated Unlearning via On-server Gradient Conflict Mitigation and Expression
-
PMIScore: An Unsupervised Approach to Quantify Dialogue Engagement
-
Node Role-Guided LLMs for Dynamic Graph Clustering
-
Beyond Medical Diagnostics: How Medical Multimodal Large Language Models Think in Space
-
ALTIS: Automated Loss Triage and Impact Scoring from Sentinel-1 SAR for Property-Level Flood Damage Assessment
-
Prototypical Exemplar Condensation for Memory-efficient Online Continual Learning
-
An Interpretable and Stable Framework for Sparse Principal Component Analysis
-
Collapse or Preserve: Data-Dependent Temporal Aggregation for Spiking Neural Network Acceleration
-
Artificial intelligence-driven improvement of hospital logistics management resilience: a practical exploration based on H Hospital
-
PA-Net: Precipitation-Adaptive Mixture-of-Experts for Long-Tail Rainfall Nowcasting
-
Evaluating Semantic Fragility in Text-to-Audio Generation Systems Under Controlled Prompt Perturbations
-
Effective Sparsity: A Unified Framework via Normalized Entropy and the Effective Number of Nonzeros
-
ArrayTac: A tactile display for simultaneous rendering of shape, stiffness and friction
-
Early Rug Pull Warning for BSC Meme Tokens via Multi-Granularity Wash-Trading Pattern Profiling
-
Efficient Semi-Automated Material Microstructure Analysis Using Deep Learning: A Case Study in Additive Manufacturing
-
Intelligent Materials Modelling: Large Language Models Versus Partial Least Squares Regression for Predicting Polysulfone Membrane Mechanical Performance
-
IdentityGuard: Context-Aware Restriction and Provenance for Personalized Synthesis
-
MICRO: A Lightweight Middleware for Optimizing Cross-store Cross-model Graph-Relation Joins [Technical Report]
-
Fine-tuning is Not Enough: A Parallel Framework for Collaborative Imitation and Reinforcement Learning in End-to-end Autonomous Driving
-
MOGeo: Beyond One-to-One Cross-View Object Geo-localization
-
Is Seeing Believing? Evaluating Human Sensitivity to Synthetic Video
-
Sirens' Whisper: Inaudible Near-Ultrasonic Jailbreaks of Speech-Driven LLMs
-
Exploring the Dimensions of a Variational Neuron
-
Fronto-parietal and fronto-temporal EEG coherence as predictive neuromarkers of transcutaneous auricular vagus nerve stimulation response in treatment-resistant schizophrenia: A machine learning study
-
APEX-Searcher: Augmenting LLMs' Search Capabilities through Agentic Planning and Execution
-
Power Term Polynomial Algebra for Boolean Logic
-
VFM-Loc: Zero-Shot Cross-View Geo-Localization via Aligning Discriminative Visual Hierarchies
-
OrigamiBench: An Interactive Environment to Synthesize Flat-Foldable Origamis
-
Learning through Creation: A Hash-Free Framework for On-the-Fly Category Discovery
-
Geo-ID: Test-Time Geometric Consensus for Cross-View Consistent Intrinsics
-
Script-to-Slide Grounding: Grounding Script Sentences to Slide Objects for Automatic Instructional Video Generation
