Skip to content

Commit

Permalink
update
Browse files Browse the repository at this point in the history
  • Loading branch information
haoheliu committed Feb 2, 2023
1 parent c8244f6 commit 406d4eb
Showing 1 changed file with 13 additions and 6 deletions.
19 changes: 13 additions & 6 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -19,13 +19,13 @@ wget https://zenodo.org/record/7600541/files/audioldm-s-full?download=1 -O ckpt/
python3 scripts/text2sound.py
```

# Hardware requirement


# TODO

- [ ] Add text-guided style transfer into the repo.
- [ ] Add AudioCaps finetuned AudioLDM-S model.
- [ ] Update the checkpoint with more training steps.
- [ ] Add text-guided style transfer
- [ ] Add audio super-resolution
- [ ] Add audio inpainting
- [ ] Add AudioCaps finetuned AudioLDM-S model

## Cite this work

Expand All @@ -43,6 +43,13 @@ If you found this tool useful, please consider citing
Part of the code is borrowed from the following repos. We would like to thank the authors of these repos for their contribution.

> https://github.com/LAION-AI/CLAP
> https://github.com/CompVis/stable-diffusion
> https://github.com/v-iashin/SpecVQGAN
> https://github.com/toshas/torch-fidelity
> https://github.com/toshas/torch-fidelity
# Hardware requirement
- GPU with 8GB of dedicated VRAM
- A system with a 64-bit operating system (Windows 7, 8.1 or 10, Ubuntu 16.04 or later, or macOS 10.13 or later) 16GB or more of system RAM

0 comments on commit 406d4eb

Please sign in to comment.