diff --git a/README.md b/README.md index 993ecc9..9d6ce76 100644 --- a/README.md +++ b/README.md @@ -7,16 +7,15 @@ []() [](https://huggingface.co/spaces/Remsky/Kokoro-TTS-Zero) -[](https://huggingface.co/hexgrad/Kokoro-82M/commit/9901c2b79161b6e898b7ea857ae5298f47b8b0d6) -[]() -[]() +[](https://huggingface.co/hexgrad/Kokoro-82M/commit/9901c2b79161b6e898b7ea857ae5298f47b8b0d6) +[]() +[]() Dockerized FastAPI wrapper for [Kokoro-82M](https://huggingface.co/hexgrad/Kokoro-82M) text-to-speech model - Multi-language support (English, Japanese, Korean, Chinese, Vietnamese) - OpenAI-compatible Speech endpoint, NVIDIA GPU accelerated or CPU inference with PyTorch - ONNX support coming soon, see v0.1.5 and earlier for legacy ONNX support in the interim -- Debug endpoints for monitoring threads, storage, and session pools -- Integrated web UI on localhost:8880/web +- Debug endpoints for monitoring system stats, integrated web UI on localhost:8880/web - Phoneme-based audio generation, phoneme generation - (new) Per-word timestamped caption generation - (new) Voice mixing with weighted combinations @@ -113,8 +112,8 @@ with client.audio.speech.with_streaming_response.create( - Web Interface: http://localhost:8880/web