Ai2

Team

non-profit

Verified

https://allenai.org/

allen_ai

allenai

AI & ML interests

Building breatkthrough AI to solve the world's biggest problems.

Recent Activity

techarb published a dataset about 6 hours ago

allenai/us-patents

techarb updated a dataset about 7 hours ago

allenai/us-patents

sanghol published a dataset about 11 hours ago

allenai/Molmo2-MultiImageQA

View all activity

Papers

Olmo 3

SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning

View all Papers

techarb

published a dataset about 6 hours ago

allenai/us-patents

Viewer • Updated about 6 hours ago • 7.9M • 4

techarb

updated a dataset about 7 hours ago

allenai/us-patents

Viewer • Updated about 6 hours ago • 7.9M • 4

sanghol

published a dataset about 11 hours ago

allenai/Molmo2-MultiImageQA

Viewer • Updated 3 days ago • 44.7k • 8 • 1

praeclarumjj3

updated a collection 1 day ago

SAGE

Smart Any-Horizon Agent for Long Video Reasoning • 18 items • Updated 1 day ago • 2

praeclarumjj3

submitted a paper to Daily Papers 1 day ago

SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning

Paper • 2512.13874 • Published 3 days ago • 14

baileyk

updated a dataset 2 days ago

allenai/dolma3_mix-6T-1025

Preview • Updated 2 days ago • 31.4k • 15

faezeb

authored a paper 24 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 24 days ago • 58

sewon

authored a paper 24 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 24 days ago • 58

pradeepd

authored a paper 24 days ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 24 days ago • 58

hamishivi

authored 2 papers 24 days ago

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Paper • 2511.07317 • Published Nov 10 • 13

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published 24 days ago • 58

yanhong-l

authored a paper about 2 months ago

Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs

Paper • 2510.18279 • Published Oct 21 • 4

stellalisy

authored 8 papers 2 months ago

CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge

Paper • 2404.06664 • Published Apr 10, 2024 • 1

CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs

Paper • 2410.02677 • Published Oct 3, 2024 • 1

Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning

Paper • 2502.14860 • Published Feb 20

BLAB: Brutally Long Audio Bench

Paper • 2505.03054 • Published May 5 • 1

Spurious Rewards: Rethinking Training Signals in RLVR

Paper • 2506.10947 • Published Jun 12 • 2

MediQ: Question-Asking LLMs and a Benchmark for Reliable Interactive Clinical Reasoning

Paper • 2406.00922 • Published Jun 3, 2024

PrefPalette: Personalized Preference Modeling with Latent Attributes

Paper • 2507.13541 • Published Jul 17 • 8

Medical Hallucinations in Foundation Models and Their Impact on Healthcare

Paper • 2503.05777 • Published Feb 26