Skip to content

You can call Using Sapiens to get seg,normal,pose,depth,mask

License

Notifications You must be signed in to change notification settings

smthemex/ComfyUI_Sapiens

Repository files navigation

ComfyUI_Sapiens

You can call Using Sapiens to get seg,normal,pose,depth,mask maps.

Sapiens From: facebookresearch/sapiens

Update-2024/11/02

  • try add MPS support or no cuda user..

previous update

  • 加入模型卸载代码,便于连接其他节点,感谢@lyxkilo 的代码,通过他的代码可以将fp32的模型转为fp16模型(更小,1B seg 2G左右),首次运行会生成一个同名加fp16模型文件,生成后不需要再开启fp16生成按钮;
  • Add model uninstallation code for easy connection to other nodes. Thanks to @lyxkilo's code, it is possible to convert the FP32 model to an FP16 model (smaller, around 1B SEG 2G),The first run will generate an fp16 model file with the same name, and there is no need to enable the fp16 generation button after generation.
  • seg选择人体部位的方式是数字加“,”,例如 2,1,11,注意逗号是英文符号。

1.Installation

In the ./ComfyUI /custom_node directory, run the following:

git clone https://github.com/smthemex/ComfyUI_Sapiens

2.requirements

pip install -r requirements.txt

If some modules missing, please pip install #ultralytics yolov8

3.checkpoints

3.1 base :(choice repo_id or ckpt_name)

  • 3.1.1 #sapiens
    only support torchscript version now,you can choice 1b,0.3b,0.6b,2b,do not changge ckpt's name!!!
    只支持torchscript的版本,但是有多种模型可选,最好质量的是1b或者2B,如果模型选择全是none,会自动下载一个1B的seg,下载后不要改模型名字;
    seg
    pose
    depth
    normal
├── ComfyUI/models/sapiens/
|     ├── seg/sapiens_1b_goliath_best_goliath_mIoU_7994_epoch_151_torchscript.pt2
|     ├── pose/sapiens_1b_goliath_best_goliath_AP_639_torchscript.pt2
|     ├── normal/sapiens_0.3b_normal_render_people_epoch_66_torchscript.pt2
|     ├── depth/sapiens_1b_render_people_epoch_88_torchscript.pt2
  • 3.1.2 yolo # if using pose yolo是pose必须的,官方的太复杂,不如yolo好用;
    yolov8m
├── ComfyUI/models/sapiens/
|     ├── yolov8m.pt

4 Example

seg body

  • 可以选人体部位27种类型 (Latest version)

Citation

  • Using some ibaiGorordo's codes from ibaiGorordo
  • UsingJaykumaran's codes from Jaykumaran
    facebookresearch/sapiens
@article{khirodkar2024sapiens,
  title={Sapiens: Foundation for Human Vision Models},
  author={Khirodkar, Rawal and Bagautdinov, Timur and Martinez, Julieta and Zhaoen, Su and James, Austin and Selednik, Peter and Anderson, Stuart and Saito, Shunsuke},
  journal={arXiv preprint arXiv:2408.12569},
  year={2024}
}

About

You can call Using Sapiens to get seg,normal,pose,depth,mask

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages