Herculis-CUA-GUI-Actioner-4B-GGUF

Herculis-CUA-GUI-Actioner-4B is a Computer Use Agent (CUA) multimodal model designed for GUI understanding, UI localization, and action execution across web, desktop, and mobile environments. It focuses on visual grounding, intent driven actioning, and UI based question answering (VQA), enabling reliable interaction with real world software interfaces. The model is optimized for efficient inference while maintaining strong accuracy on complex UI workflows.

Quick Start with llama.cpp

Link: https://huggingface.co/prithivMLmods/Herculis-CUA-GUI-Actioner-4B-GGUF?library=llama-cpp-python

Herculis-CUA-GUI-Actioner-4B [GGUF]

File Name	Quant Type	File Size	File Link
Herculis-CUA-GUI-Actioner-4B.BF16.gguf	BF16	6.18 GB	Download
Herculis-CUA-GUI-Actioner-4B.F16.gguf	F16	6.18 GB	Download
Herculis-CUA-GUI-Actioner-4B.F32.gguf	F32	12.3 GB	Download
Herculis-CUA-GUI-Actioner-4B.Q8_0.gguf	Q8_0	3.29 GB	Download
Herculis-CUA-GUI-Actioner-4B.mmproj-bf16.gguf	mmproj-bf16	1.34 GB	Download
Herculis-CUA-GUI-Actioner-4B.mmproj-f16.gguf	mmproj-f16	1.34 GB	Download
Herculis-CUA-GUI-Actioner-4B.mmproj-f32.gguf	mmproj-f32	2.67 GB	Download
Herculis-CUA-GUI-Actioner-4B.mmproj-q8_0.gguf	mmproj-q8_0	848 MB	Download