Kokoro-FastAPI/examples
rvuyyuru2 44c62467ae Adds support for creating weighted voice combinations
Implements a new method to parse weighted voice formulas and generate combined audio outputs based on specified weights.

This enhancement allows for more diverse audio generation by letting users specify multiple voices with respective weights, improving flexibility in voice management.

Updates voice processing logic in relevant API routes to handle weighted formulas seamlessly.

Fixes #123 (if applicable, replace with the actual issue reference)
2025-01-25 20:54:21 +05:30
..
assorted_checks Ruff format + fix 2025-01-09 18:41:44 -07:00
phoneme_examples Refactor ONNX GPU backend and phoneme generation: improve token handling, add chunk processing for audio generation, and initial introduce stitch options for audio chunks. 2025-01-22 17:43:38 -07:00
streaming_refactor -unified streaming implementation 2025-01-25 05:25:13 -07:00
voice_samples Added mp3 samples 2024-12-31 03:48:26 -07:00
__init__.py - CPU ONNX + PyTorch CUDA, functional 2025-01-03 17:54:17 -07:00
audio_analysis.png First streaming attempt 2025-01-04 17:54:54 -07:00
openai_streaming_audio.py Adds support for creating weighted voice combinations 2025-01-25 20:54:21 +05:30
requirements.txt Initial swap to UV dependency management 2025-01-11 20:00:34 -07:00
simul_file_test.py Refactor model loading and configuration: update, adjust model loading device,. add async streaming examples and remove unused warmup service. 2025-01-22 02:33:29 -07:00
simul_openai_streaming_audio.py Refactor model loading and configuration: update, adjust model loading device,. add async streaming examples and remove unused warmup service. 2025-01-22 02:33:29 -07:00
simul_speaker_test.py Refactor model loading and configuration: update, adjust model loading device,. add async streaming examples and remove unused warmup service. 2025-01-22 02:33:29 -07:00
speech.mp3 Adds support for creating weighted voice combinations 2025-01-25 20:54:21 +05:30
stream_tts_playback.py refactor: streamline audio normalization process and update tests 2025-01-13 18:56:49 -07:00