Papers

Filter by company

SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

OpenAI

Published on: 2025-02-17 4 authors
GraNNite: Enabling High-Performance Execution of Graph Neural Networks on Resource-Constrained Neural Processing Units

Intel / Purdue University

Published on: 2025-02-13 1 author
Reviving The Classics: Active Reward Modeling in Large Language Model Alignment

ByteDance / Massachusetts Institute of Technology, University of Cambridge

Published on: 2025-02-04 1 author
s1: Simple test-time scaling

Contextual AI / Allen Institute for Artificial Intelligence, Stanford University, University of Washington

Published on: 2025-01-31 10 authors
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming

Anthropic / Safeguards Research Team

Published on: 2025-01-31 1 author
Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling

DeepSeek

Published on: 2025-01-29 1 author
EmbeddingGemma: Powerful and Lightweight Text Representations

Google

Published on: 2025-01-24 1 author
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making

Amazon / Massachusetts Institute of Technology, Northwestern University, Stanford University

Published on: 2025-01-19 1 author
ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario

Z.ai / Tsinghua University

Published on: 2025-01-17 1 author
MiniMax-01: Scaling Foundation Models with Lightning Attention

MiniMax

Published on: 2025-01-14 1 author
PoAct: Policy and Action Dual-Control Agent for Generalized Applications

Z.ai / Central South University, Tsinghua University

Published on: 2025-01-13 1 author
Agent Laboratory: Using LLM Agents as Research Assistants

AMD / Johns Hopkins University, Swiss Federal Institute of Technology in Zurich

Published on: 2025-01-08 10 authors
Retrieval-Augmented Generation with Graphs (GraphRAG)

Amazon / Michigan State University

Published on: 2025-01-08 1 author
Cosmos World Foundation Model Platform for Physical AI

NVIDIA

Published on: 2025-01-07 1 author
Titans: Learning to Memorize at Test Time

Google

Published on: 2024-12-31 1 author
Generative Video Propagation

Adobe / The Chinese University of Hong Kong

Published on: 2024-12-27 1 author
In Case You Missed It: ARC 'Challenge' Is Not That Challenging

Snowflake

Published on: 2024-12-23 1 author
Qwen2.5 Technical Report

Alibaba

Published on: 2024-12-19 1 author
Prompt Compression with Context-Aware Sentence Encoding for Fast and Improved LLM Inference

Workday / Queen’s University

Published on: 2024-12-18 1 author
Alignment faking in large language models

Anthropic / New York University

Published on: 2024-12-18 1 author
How Often are Fingerprints Repeated in the Population? Expanding on Evidence from AI With the Birthday Paradox

University of Pennsylvania Department of Criminology and Statistics, University of Pennsylvania School of Engineering and Applied Sciences

Published on: 2024-12-17 2 authors
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

DeepSeek

Published on: 2024-12-13 1 author
VDB-GPDF: Online Gaussian Process Distance Field with VDB Structure

Google

Published on: 2024-12-12 1 author
pfl-research: simulation framework for accelerating research in Private Federated Learning

Apple

Published on: 2024-12-10 1 author
Frontier AI systems have surpassed the self-replicating red line

Fudan University

Published on: 2024-12-09 4 authors
InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention

Snap / University of California

Published on: 2024-12-09 1 author
Best-of-N Jailbreaking

Anthropic, Tangentic, Speechmatics / MATS, Stanford University, University College London, University of Oxford

Published on: 2024-12-04 10 authors
Creating realistic 3D shapes using generative AI

Massachusetts Institute of Technology

Published on: 2024-12-04 1 author
Commit0: Library Generation from Scratch

Cohere / Cornell University

Published on: 2024-12-02 1 author
ARTIST: Improving the Generation of Text-rich Images with Disentangled Diffusion Models and Large Language Models

Adobe / Duke University

Published on: 2024-12-02
Controlling Language and Diffusion Models by Transporting Activations

Apple

Published on: 2024-11-22 1 author
The Rise and Potential of Large Language Model Based Agents: A Survey

Massachusetts Institute of Technology

Published on: 2024-11-11 5 authors
Evaluating Cultural and Social Awareness of LLM Web Agents

Salesforce / University of California

Published on: 2024-10-30 1 author
Helix Extractor 1.0 Update: High-Accuracy Information Extraction for Semi-Structured Documents

UiPath

Published on: 2024-10-24 1 author
SF-V: Single Forward Video Generation Model

Snap / Rutgers University

Published on: 2024-10-24 1 author
The Llama 3 Herd of Model

Meta Platforms

Published on: 2024-10-23 1 author
Improving Pinterest Search Relevance Using Large Language Models

Pinterest

Published on: 2024-10-22 1 author
NVLM: Open Frontier-Class Multimodal LLMs

NVIDIA

Published on: 2024-10-22 1 author
HyQE: Ranking Contexts with Hypothetical Query Embeddings

Intuit / Boston University

Published on: 2024-10-20 1 author
RedPajama: an Open Dataset for Training Large Language Models

Together AI, EleutherAI / Stanford University, The Ohio State University

Published on: 2024-10-19 1 author
Understanding Chain-of-Thought in LLMs through Information Theory

ByteDance / University of California

Published on: 2024-10-18 1 author
Survival of the Safest: Towards Secure Prompt Optimization through Interleaved Multi-Objective Evolution

Intuit

Published on: 2024-10-12 1 author
Nemotron-4-340B-Instruct

NVIDIA

Published on: 2024-10-12 1 author
Pixtral 12B

Mistral AI

Published on: 2024-10-10 1 author
Data-Driven Discovery of Conservation Laws from Trajectories via Neural Deflation

Intuit / University of Massachusetts Amherst

Published on: 2024-10-07 1 author
Chronos: Learning the Language of Time Series

Amazon / AWS AI Labs, New York University, Rutgers University, University of California, University of Freiburg

Published on: 2024-10-04 1 author
Qwen2-VL: Enhancing Vision-Language Model’s Perception of the World at Any Resolution

Alibaba

Published on: 2024-10-03 1 author
Duo-LLM: A Framework for Studying Adaptive Computation in Large Language Models

Apple

Published on: 2024-10-01 1 author
HM3: Heterogeneous Multi-Class Model Merging

DataRobot

Published on: 2024-09-27 1 author
arsier: Recipes for Training and Evaluating Large Video Description Models

ByteDance

Published on: 2024-09-24 1 author

Prev 148 149 150 151 152 153 154 155 156 157 158 Next

Go to section

Search

Papers

Help

People also viewed

Create AI Tools

Mini Tool

Vibe code an AI Tool

Choose listing type: