bubbliiiing
commited on
Commit
·
b7a82dc
1
Parent(s):
0f55221
Update 2.1 for speed up
Browse files
README.md
CHANGED
|
@@ -7,6 +7,10 @@ library_name: videox_fun
|
|
| 7 |
|
| 8 |
[](https://github.com/aigc-apps/VideoX-Fun)
|
| 9 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 10 |
## Model Features
|
| 11 |
- This ControlNet is added on 15 layer blocks and 2 refiner layer blocks.
|
| 12 |
- The model was trained from scratch for 70,000 steps on a dataset of 1 million high-quality images covering both general and human-centric content. Training was performed at 1328 resolution using BFloat16 precision, with a batch size of 64, a learning rate of 2e-5, and a text dropout ratio of 0.10.
|
|
|
|
| 7 |
|
| 8 |
[](https://github.com/aigc-apps/VideoX-Fun)
|
| 9 |
|
| 10 |
+
|
| 11 |
+
## Update
|
| 12 |
+
- Due to a typo in the code, `control_layers` was used instead of `control_noise_refiner` to process refiner latents during training. Although the model converged normally, the model inference speed was slow because `control_layers` forward pass was performed twice. In version 2.1, we made an urgent fix and the speed has returned to normal. [2025.10.16]
|
| 13 |
+
|
| 14 |
## Model Features
|
| 15 |
- This ControlNet is added on 15 layer blocks and 2 refiner layer blocks.
|
| 16 |
- The model was trained from scratch for 70,000 steps on a dataset of 1 million high-quality images covering both general and human-centric content. Training was performed at 1328 resolution using BFloat16 precision, with a batch size of 64, a learning rate of 2e-5, and a text dropout ratio of 0.10.
|
Z-Image-Turbo-Fun-Controlnet-Union-2.1.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:7f611e6d52b133f64b84bef2549fcb84589a766b8255954f96ea34684f52b633
|
| 3 |
+
size 6712485600
|