Kokoro-FastAPI

mirror of https://github.com/remsky/Kokoro-FastAPI.git synced 2025-08-05 16:48:53 +00:00

History

rvuyyuru2 44c62467ae Adds support for creating weighted voice combinations Implements a new method to parse weighted voice formulas and generate combined audio outputs based on specified weights. This enhancement allows for more diverse audio generation by letting users specify multiple voices with respective weights, improving flexibility in voice management. Updates voice processing logic in relevant API routes to handle weighted formulas seamlessly. Fixes #123 (if applicable, replace with the actual issue reference)		2025-01-25 20:54:21 +05:30
..
assorted_checks	Ruff format + fix	2025-01-09 18:41:44 -07:00
phoneme_examples	Refactor ONNX GPU backend and phoneme generation: improve token handling, add chunk processing for audio generation, and initial introduce stitch options for audio chunks.	2025-01-22 17:43:38 -07:00
streaming_refactor	-unified streaming implementation	2025-01-25 05:25:13 -07:00
voice_samples	Added mp3 samples	2024-12-31 03:48:26 -07:00
__init__.py	- CPU ONNX + PyTorch CUDA, functional	2025-01-03 17:54:17 -07:00
audio_analysis.png	First streaming attempt	2025-01-04 17:54:54 -07:00
openai_streaming_audio.py	Adds support for creating weighted voice combinations	2025-01-25 20:54:21 +05:30
requirements.txt	Initial swap to UV dependency management	2025-01-11 20:00:34 -07:00
simul_file_test.py	Refactor model loading and configuration: update, adjust model loading device,. add async streaming examples and remove unused warmup service.	2025-01-22 02:33:29 -07:00
simul_openai_streaming_audio.py	Refactor model loading and configuration: update, adjust model loading device,. add async streaming examples and remove unused warmup service.	2025-01-22 02:33:29 -07:00
simul_speaker_test.py	Refactor model loading and configuration: update, adjust model loading device,. add async streaming examples and remove unused warmup service.	2025-01-22 02:33:29 -07:00
speech.mp3	Adds support for creating weighted voice combinations	2025-01-25 20:54:21 +05:30
stream_tts_playback.py	refactor: streamline audio normalization process and update tests	2025-01-13 18:56:49 -07:00