!!! warning
We assume no responsibility for any illegal use of the codebase. Please refer to the local laws regarding DMCA (Digital Millennium Copyright Act) and other relevant laws in your area. <br/>
This codebase and all models are released under the CC-BY-NC-SA-4.0 license.
Professional Windows users may consider using WSL2 or Docker to run the codebase.
# Create a python 3.10 virtual environment, you can also use virtualenv
conda create -n fish-speech python=3.10
conda activate fish-speech
# Install pytorch
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
# Install fish-speech
pip3 install -e .
# (Enable acceleration) Install triton-windows
pip install https://github.com/AnyaCoder/fish-speech/releases/download/v0.1.0/triton_windows-0.1.0-py3-none-any.whl
Non-professional Windows users can consider the following basic methods to run the project without a Linux environment (with model compilation capabilities, i.e., torch.compile):
install_env.bat to install the environment.LLVM-17.0.6-win64.exe, double-click to install, select an appropriate installation location, and most importantly, check the Add Path to Current User option to add the environment variable.Modify button and find the Desktop development with C++ option to select and download.start.bat to open the training inference WebUI management interface. If needed, you can modify the API_FLAGS as prompted below.!!! info "Optional"
Want to start the inference WebUI?
Edit the `API_FLAGS.txt` file in the project root directory and modify the first three lines as follows:
```
--infer
# --api
# --listen ...
...
```
!!! info "Optional"
Want to start the API server?
Edit the `API_FLAGS.txt` file in the project root directory and modify the first three lines as follows:
```
# --infer
--api
--listen ...
...
```
!!! info "Optional"
Double-click `run_cmd.bat` to enter the conda/python command line environment of this project.
# Create a python 3.10 virtual environment, you can also use virtualenv
conda create -n fish-speech python=3.10
conda activate fish-speech
# Install pytorch
pip3 install torch torchvision torchaudio
# Install fish-speech
pip3 install -e .[stable]
# (Ubuntu / Debian User) Install sox
apt install libsox-dev
lora fine-tuning support.gradient checkpointing, causual sampling, and flash-attn support.text2semantic model, supporting phoneme-free mode.