Kokoro-FastAPI

mirror of https://github.com/remsky/Kokoro-FastAPI.git synced 2025-08-05 16:48:53 +00:00

Author	SHA1	Message	Date
Fireblade	7f15ba8fed	Add a .gitattributes	2025-02-18 17:44:03 -05:00
Fireblade2534	68cb097d9b	Merged from orgin/master	2025-02-11 14:05:14 +00:00
Fireblade	ab1c21130e	Made the api use the normalizer, fixed the wrong version of espeak, added better normilzation, improved the sentence splitting, fixed some formatting	2025-02-10 21:45:52 -05:00
remsky	a91e0fe9df	Ruff check + formatting	2025-02-09 18:32:17 -07:00
Fireblade2534	90f168f63d	fixed some problems in the build system and model download system	2025-02-07 18:30:59 +00:00
remsky	d452a6e114	Bump version to v0.2.0-pre, enhance Docker configurations for GPU support, and refine text processing settings	2025-02-06 01:22:21 -07:00
remsky	d3741d0d99	v1_0 full migration, captions, gpu, cpu, webui updates	2025-02-05 00:46:01 -07:00
remsky	6c234a3b67	Update dependencies, enhance voice management, and add captioned speech support	2025-02-04 19:41:41 -07:00
remsky	4c90a89545	Initial commit of Kokoro V1.0-only integration	2025-02-04 03:37:56 -07:00
remsky	903bf91c81	v1_0 multilanguage initial support -note: all languages currently installed, selected by prefix of first chosen voice in call	2025-02-03 03:33:12 -07:00
remsky	f61f79981d	-Add debug endpoint for system stats -Adjust headers, generate from phonemes, etc	2025-01-30 04:44:04 -07:00
remsky	2e318051f8	Add clear text button and enhance temporary file management - Introduced a "Clear Text" button in the web interface for user convenience. - Updated temporary file management settings in the configuration. - Added new debug endpoints for system and storage information. - Improved logging levels for better debugging insights.	2025-01-29 18:29:02 -07:00
remsky	946e322242	Implement temporary file management on openai endpoint, whole file downloads	2025-01-29 04:09:38 -07:00
remsky	20658f9759	Performance: Adjust session timeout and GPU memory limit; minim voice pre-caching and improve singleton instance management	2025-01-24 05:01:38 -07:00
remsky	ee1f7cde18	Add async audio processing and semantic chunking support; flattened static audio trimming	2025-01-24 04:06:47 -07:00
remsky	8eb3525382	Refactor configuration and enhance web interface: update GPU settings, add speed control, and improve input handling for audio generation	2025-01-23 04:54:55 -07:00
remsky	ba577d348e	Enhance web player information, adjust text chunk size, update audio wave settings, and implement OpenAI model mappings	2025-01-23 04:11:31 -07:00
remsky	8e8f120a3e	Update configuration to disable local voice saving, enhance voice validation logic, and remove deprecated test file	2025-01-23 02:00:46 -07:00
remsky	df4cc5b4b2	-Adjust testing framework for new model -Add web player support: include static file serving and HTML interface for TTS	2025-01-22 21:11:47 -07:00
remsky	d50214d3be	Enable ONNX GPU support in Docker configurations and refactor model file handling	2025-01-22 05:00:38 -07:00
remsky	4a24be1605	Refactor model loading and configuration: update, adjust model loading device,. add async streaming examples and remove unused warmup service.	2025-01-22 02:33:29 -07:00
remsky	21bf810f97	Enhance model inference: update documentation, add model download scripts for PyTorch and ONNX, and refactor configuration handling	2025-01-21 21:44:21 -07:00
remsky	ab28a62e86	Refactor inference architecture: remove legacy TTS model, add ONNX and PyTorch backends, and introduce model configuration schemas	2025-01-20 22:42:29 -07:00
Fireblade2534	eb556ec7d3	Fixed python tests so they run properly and cleaned up some unneeded files	2025-01-17 14:55:25 +00:00
remsky	8bc8661930	fix: update model directory paths and improve logging in TTS services	2025-01-14 06:37:03 -07:00
remsky	22752900e5	Ruff checks, ci fix	2025-01-13 20:15:46 -07:00
remsky	3d0ca2a8c2	Update Dockerfiles for baked in models, adjustments to cpu/gpu environment splits	2025-01-12 05:23:02 -07:00
remsky	e8c1284032	Ruff format + fix	2025-01-09 18:41:44 -07:00
remsky	720c1fb97d	-update soundfile version -alignment with streaming standards -audio processing config settings -more comprehensive model warmup -minor model improvements -enhancing testing, benchmarking -cool ascii logo	2025-01-06 03:32:41 -07:00
remsky	93aa205da9	Enhance ONNX optimization settings and add validation script for TTS audio files	2025-01-04 02:14:46 -07:00
remsky	7df2a68fb4	- CPU ONNX + PyTorch CUDA, functional - Incorporated text processing module as service, towards modularization and optimizations - Added text processing router for phonemization - Enhanced benchmark statistics with real-time speed metrics	2025-01-03 17:54:17 -07:00
remsky	8ce8334345	- Complete TTS endpoint replacement with OpenAI compatible -Removed output directory, and update configuration settings - Added benchmarking for entire novel	2024-12-31 01:52:16 -07:00
remsky	ce0ef3534a	Add initial implementation of Kokoro TTS API with Docker GPU support - Set up FastAPI application with TTS service - Define API endpoints for TTS generation and voice listing - Implement Pydantic models for request and response schemas - Add Dockerfile and docker-compose.yml for containerization - Include example usage and benchmark results in README	2024-12-30 04:17:50 -07:00

33 commits