Collection: W4A4_FP4 (NVFP4 models to be used on the Blackwell architecture, 3 items)
This is a quantized version of Microsoft's Phi-4-mini-instruct model, stored in the NVFP4 format with 4-bit weights and 4-bit activations (W4A4).
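To make the format concrete, here is a minimal sketch of NVFP4-style block quantization in plain Python. It is not the code used to produce this checkpoint. It assumes the usual description of NVFP4: FP4 E2M1 values with one shared scale per block of 16 elements. Two simplifications are labeled in the comments: the scale is kept as a Python float instead of FP8 E4M3, and the additional per-tensor scale is omitted. The function names are hypothetical.

```python
# Representable magnitudes of FP4 E2M1 (1 sign, 2 exponent, 1 mantissa bit).
E2M1_GRID = (0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0)
BLOCK_SIZE = 16  # NVFP4 shares one scale across 16 consecutive values


def quantize_block(block):
    """Return (scale, codes) such that block is approximately scale * codes."""
    amax = max(abs(x) for x in block)
    # Map the largest magnitude in the block onto the largest FP4 value (6.0).
    # Simplification: the scale stays a Python float rather than FP8 E4M3,
    # and no per-tensor scale is applied.
    scale = amax / E2M1_GRID[-1] if amax > 0 else 1.0
    codes = []
    for x in block:
        # Nearest grid point; ties go to the smaller magnitude here, which
        # is a simplification of hardware rounding behaviour.
        mag = min(E2M1_GRID, key=lambda g: abs(abs(x) / scale - g))
        codes.append(-mag if x < 0 else mag)
    return scale, codes


def fake_quantize(values):
    """Quantize then dequantize a flat list of floats, block by block."""
    out = []
    for start in range(0, len(values), BLOCK_SIZE):
        block = values[start:start + BLOCK_SIZE]
        scale, codes = quantize_block(block)
        out.extend(scale * c for c in codes)
    return out
```

Calling `fake_quantize` on a list of weights returns the values the 4-bit representation can actually hold, which is a quick way to inspect the rounding error the format introduces.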
Phi-4-mini-instruct is a smaller variant of the Phi-4 model, designed for instruction-following tasks. The quantized version retains much of the original model's capabilities while significantly reducing its size and computational requirements.
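The size reduction can be estimated with simple arithmetic. The sketch below is a rough estimate, not a measurement of this checkpoint. It assumes about 3.8 billion parameters for Phi-4-mini-instruct (a figure not stated in this card), a BF16 baseline, and that every weight is quantized with one 8-bit scale per 16 values. Real checkpoints often keep some layers, such as embeddings or the output head, in higher precision, so the actual saving is usually smaller.

```python
def weight_bytes(num_params, bits_per_weight):
    """Storage needed for the weights alone, in bytes."""
    return num_params * bits_per_weight / 8


NUM_PARAMS = 3.8e9        # assumption: approximate parameter count
BF16_BITS = 16            # assumption: baseline stored in BF16
NVFP4_BITS = 4 + 8 / 16   # 4-bit value plus one 8-bit scale per 16 values

bf16_gb = weight_bytes(NUM_PARAMS, BF16_BITS) / 1e9
nvfp4_gb = weight_bytes(NUM_PARAMS, NVFP4_BITS) / 1e9
ratio = bf16_gb / nvfp4_gb

print(f"BF16 weights:  {bf16_gb:.2f} GB")
print(f"NVFP4 weights: {nvfp4_gb:.2f} GB")
print(f"Reduction:     {ratio:.2f}x")
```

Under these assumptions the weights shrink from roughly 7.6 GB to roughly 2.1 GB. The block scales cost an extra half bit per weight, which is why the ratio is closer to 3.6x than 4x.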
Base model: microsoft/Phi-4-mini-instruct