Models

2,456

Active filters: multimodal

ByteDance/Dolphin-v2

Image-Text-to-Text • 4B • Updated 7 days ago • 1.2k • 85

allenai/Molmo2-8B

Video-Text-to-Text • 9B • Updated 3 days ago • 654 • 57

stepfun-ai/GELab-Zero-4B-preview

Image-Text-to-Text • 4B • Updated about 4 hours ago • 1.45k • 123

allenai/Molmo2-4B

Video-Text-to-Text • 5B • Updated 3 days ago • 439 • 22

Qwen/Qwen3-Omni-30B-A3B-Instruct

Any-to-Any • 35B • Updated Sep 22 • 265k • 772

allenai/Molmo2-O-7B

Video-Text-to-Text • 8B • Updated 3 days ago • 167 • 13

allenai/Molmo2-VideoPoint-4B

Video-Text-to-Text • 5B • Updated 3 days ago • 23 • 12

jinaai/jina-vlm

Image-Text-to-Text • 2B • Updated 14 days ago • 2.03k • 82

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 2.81M • 1.4k

microsoft/Fara-7B

Image-Text-to-Text • 8B • Updated 8 days ago • 157k • 446

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18 • 282k • 458

Qwen/Qwen3-Omni-30B-A3B-Thinking

Any-to-Any • 32B • Updated Sep 22 • 61.8k • 238

allenai/Molmo-7B-D-0924

Image-Text-to-Text • 8B • Updated 4 days ago • 37.4k • 558

unsloth/Qwen2.5-VL-7B-Instruct-GGUF

Image-Text-to-Text • 8B • Updated May 12 • 64.5k • 111

OctoMed/OctoMed-7B

Image-Text-to-Text • 8B • Updated 13 days ago • 1.25k • 17

Cognitive-Lab/NetraEmbed

Visual Document Retrieval • 4B • Updated 9 days ago • 673 • 22

ServiceNow-AI/Apriel-1.6-15b-Thinker-GGUF

14B • Updated 3 days ago • 385 • 3

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Feb 6 • 1.41M • 1.25k

Qwen/Qwen2.5-VL-3B-Instruct

Image-Text-to-Text • 4B • Updated Apr 6 • 5.33M • 577

Qwen/Qwen3-Omni-30B-A3B-Captioner

Any-to-Any • 32B • Updated Sep 22 • 27.8k • 182

thesby/Qwen3-VL-8B-NSFW-Caption-V4.5

Image-to-Text • 9B • Updated Nov 7 • 21.3k • 60

bytedance-research/Vidi-7B

9B • Updated 3 days ago • 778 • 10

VITRA-VLA/VITRA-VLA-3B

Robotics • Updated 19 days ago • 2

hustvl/InfiniteVL

Image-Text-to-Text • 4B • Updated 7 days ago • 51 • 2

Luckybalabala/AutoGLM-Phone-9B-GGUF

Image-Text-to-Text • 9B • Updated 1 day ago • 3.07k • 2

imageomics/bioclip

Zero-Shot Image Classification • Updated Oct 2 • 7.28k • 57

marcosv/InstructIR

Image-to-Image • Updated Jan 31, 2024 • 35

nielsr/imagebind-huge

Updated Apr 28, 2024 • 100 • 22

Lewdiculous/Eris_PrimeV4-Vision-32k-7B-GGUF-IQ-Imatrix

7B • Updated Mar 27, 2024 • 383 • 15

HuggingFaceM4/idefics2-8b

Image-Text-to-Text • 8B • Updated Oct 14, 2024 • 11.7k • 619