mirror of
https://github.com/remsky/Kokoro-FastAPI.git
synced 2025-08-05 16:48:53 +00:00
Update README.md
This commit is contained in:
parent
cfae7db7fc
commit
97f82c0685
1 changed files with 11 additions and 5 deletions
16
README.md
16
README.md
|
@ -12,15 +12,20 @@
|
|||
|
||||
[](https://huggingface.co/hexgrad/Kokoro-82M/commit/9901c2b79161b6e898b7ea857ae5298f47b8b0d6)
|
||||
|
||||
|
||||
Dockerized FastAPI wrapper for [Kokoro-82M](https://huggingface.co/hexgrad/Kokoro-82M) text-to-speech model
|
||||
- Multi-language support (English, Japanese, Korean, Chinese, Vietnamese)
|
||||
- OpenAI-compatible Speech endpoint, NVIDIA GPU accelerated or CPU inference with PyTorch
|
||||
- ONNX support coming soon, see v0.1.5 and earlier for legacy ONNX support in the interim
|
||||
- Debug endpoints for monitoring system stats, integrated web UI on localhost:8880/web
|
||||
- Phoneme-based audio generation, phoneme generation
|
||||
- (new) Per-word timestamped caption generation
|
||||
- (new) Voice mixing with weighted combinations
|
||||
- Per-word timestamped caption generation
|
||||
- Voice mixing with weighted combinations
|
||||
|
||||
### Integration Guides
|
||||
[](https://github.com/remsky/Kokoro-FastAPI/wiki/Integrations-SillyTavern)
|
||||
[](https://github.com/remsky/Kokoro-FastAPI/wiki/Integrations-OpenWebUi) [](https://github.com/remsky/Kokoro-FastAPI/wiki/Setup-Kubernetes)
|
||||
|
||||
|
||||
|
||||
|
||||
## Get Started
|
||||
|
@ -36,8 +41,8 @@ Refer to the core/config.py file for a full list of variables which can be manag
|
|||
# the `latest` tag can be used, but should not be considered stable as it may include `nightly` branch builds
|
||||
# it may have some bonus features however, and feedback/testing is welcome
|
||||
|
||||
docker run -p 8880:8880 ghcr.io/remsky/kokoro-fastapi-cpu:v0.2.1 # CPU, or:
|
||||
docker run --gpus all -p 8880:8880 ghcr.io/remsky/kokoro-fastapi-gpu:v0.2.1 #NVIDIA GPU
|
||||
docker run -p 8880:8880 ghcr.io/remsky/kokoro-fastapi-cpu:v0.2.2 # CPU, or:
|
||||
docker run --gpus all -p 8880:8880 ghcr.io/remsky/kokoro-fastapi-gpu:v0.2.2 #NVIDIA GPU
|
||||
```
|
||||
|
||||
|
||||
|
@ -121,6 +126,7 @@ with client.audio.speech.with_streaming_response.create(
|
|||
|
||||
</details>
|
||||
|
||||
|
||||
## Features
|
||||
<details>
|
||||
<summary>OpenAI-Compatible Speech Endpoint</summary>
|
||||
|
|
Loading…
Add table
Reference in a new issue