Large Language Models - a darylmooreNC Collection

darylmooreNC 's Collections

Multi-Agent Infrastructure

LLM Training Methodologies

LLM Architectures

Agentic AI Training and Tuning

Reinforcement Learning

Sports Predictive Modeling

Large Language Models

Large Language Models

updated 8 days ago

Universal Deep Research: Bring Your Own Model and Strategy

Paper • 2509.00244 • Published Aug 29 • 13
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2 • 227
Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

Paper • 2510.00515 • Published Oct 1 • 39
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29 • 140
Demystifying Reinforcement Learning in Agentic Reasoning

Paper • 2510.11701 • Published Oct 13 • 31
deepseek-ai/DeepSeek-Math-V2

Text Generation • 685B • Updated 29 days ago • 11.8k • 667
T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground

Paper • 2512.10430 • Published 15 days ago • 112