You can call Using Sapiens to get seg,normal,pose,depth,mask maps.
Sapiens From: facebookresearch/sapiens
Update-2024/11/02
- try add MPS support or no cuda user..
previous update
- 加入模型卸载代码,便于连接其他节点,感谢@lyxkilo 的代码,通过他的代码可以将fp32的模型转为fp16模型(更小,1B seg 2G左右),首次运行会生成一个同名加fp16模型文件,生成后不需要再开启fp16生成按钮;
- Add model uninstallation code for easy connection to other nodes. Thanks to @lyxkilo's code, it is possible to convert the FP32 model to an FP16 model (smaller, around 1B SEG 2G),The first run will generate an fp16 model file with the same name, and there is no need to enable the fp16 generation button after generation.
- seg选择人体部位的方式是数字加“,”,例如 2,1,11,注意逗号是英文符号。
In the ./ComfyUI /custom_node directory, run the following:
git clone https://github.com/smthemex/ComfyUI_Sapiens
pip install -r requirements.txt
If some modules missing, please pip install #ultralytics yolov8
3.1 base :(choice repo_id or ckpt_name)
- 3.1.1 #sapiens
only support torchscript version now,you can choice 1b,0.3b,0.6b,2b,do not changge ckpt's name!!!
只支持torchscript的版本,但是有多种模型可选,最好质量的是1b或者2B,如果模型选择全是none,会自动下载一个1B的seg,下载后不要改模型名字;
seg
pose
depth
normal
├── ComfyUI/models/sapiens/
| ├── seg/sapiens_1b_goliath_best_goliath_mIoU_7994_epoch_151_torchscript.pt2
| ├── pose/sapiens_1b_goliath_best_goliath_AP_639_torchscript.pt2
| ├── normal/sapiens_0.3b_normal_render_people_epoch_66_torchscript.pt2
| ├── depth/sapiens_1b_render_people_epoch_88_torchscript.pt2
- 3.1.2 yolo # if using pose
yolo是pose必须的,官方的太复杂,不如yolo好用;
yolov8m
├── ComfyUI/models/sapiens/
| ├── yolov8m.pt
seg body
- Using some ibaiGorordo's codes from ibaiGorordo
- UsingJaykumaran's codes from Jaykumaran
facebookresearch/sapiens
@article{khirodkar2024sapiens,
title={Sapiens: Foundation for Human Vision Models},
author={Khirodkar, Rawal and Bagautdinov, Timur and Martinez, Julieta and Zhaoen, Su and James, Austin and Selednik, Peter and Anderson, Stuart and Saito, Shunsuke},
journal={arXiv preprint arXiv:2408.12569},
year={2024}
}