Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
darylmooreNC 's Collections
Multi-Agent Infrastructure
LLM Training Methodologies
LLM Architectures
Agentic AI Training and Tuning
Reinforcement Learning
Agentic AI
Sports Predictive Modeling
Large Language Models

Large Language Models

updated 8 days ago
Upvote
-

  • Universal Deep Research: Bring Your Own Model and Strategy

    Paper • 2509.00244 • Published Aug 29 • 13

  • The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

    Paper • 2509.02547 • Published Sep 2 • 227

  • Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

    Paper • 2510.00515 • Published Oct 1 • 39

  • DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

    Paper • 2509.25454 • Published Sep 29 • 140

  • Demystifying Reinforcement Learning in Agentic Reasoning

    Paper • 2510.11701 • Published Oct 13 • 31

  • deepseek-ai/DeepSeek-Math-V2

    Text Generation • 685B • Updated 29 days ago • 11.8k • 667

  • T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground

    Paper • 2512.10430 • Published 15 days ago • 112
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs