Kokoro-FastAPI

mirror of https://github.com/remsky/Kokoro-FastAPI.git synced 2025-08-05 16:48:53 +00:00

Author	SHA1	Message	Date
remsky	df4cc5b4b2	-Adjust testing framework for new model -Add web player support: include static file serving and HTML interface for TTS	2025-01-22 21:11:47 -07:00
remsky	d50214d3be	Enable ONNX GPU support in Docker configurations and refactor model file handling	2025-01-22 05:00:38 -07:00
remsky	4a24be1605	Refactor model loading and configuration: update, adjust model loading device, add async streaming examples and remove unused warmup service.	2025-01-22 02:33:29 -07:00
remsky	21bf810f97	Enhance model inference: update documentation, add model download scripts for PyTorch and ONNX, and refactor configuration handling	2025-01-21 21:44:21 -07:00
remsky	064313450e	fix: test of cicd	2025-01-13 20:18:02 -07:00
remsky	22752900e5	Ruff checks, ci fix	2025-01-13 20:15:46 -07:00
remsky	e8c1284032	Ruff format + fix	2025-01-09 18:41:44 -07:00
remsky	4b521f9bf0	- Added GenerateFromPhonemesRequest model to text_schemas.py - Refactored TTS model initialization methods in tts_gpu.py and tts_cpu.py - Added custom logger configuration in main.py - Deprecated text_processing router -> development route	2025-01-09 07:20:14 -07:00
remsky	720c1fb97d	-update soundfile version -alignment with streaming standards -audio processing config settings -more comprehensive model warmup -minor model improvements -enhancing testing, benchmarking -cool ascii logo	2025-01-06 03:32:41 -07:00
remsky	4c6cd83f85	Swapped generator to preprocessing	2025-01-04 22:23:59 -07:00
remsky	0e9f77fc79	WIP: open ai compatible streaming	2025-01-04 17:55:36 -07:00
remsky	f1eb1d9590	First streaming attempt	2025-01-04 17:54:54 -07:00
remsky	7df2a68fb4	- CPU ONNX + PyTorch CUDA, functional - Incorporated text processing module as service, towards modularization and optimizations - Added text processing router for phonemization - Enhanced benchmark statistics with real-time speed metrics	2025-01-03 17:54:17 -07:00
remsky	9496a3a63f	WIP: CPU/GPU Functional, few straggling tests to fix and check.	2025-01-03 03:16:42 -07:00
remsky	e4d8e74738	WIP, Functional for CPU: Updated for ONNX runtime support, Dockerfile and TTS Service	2025-01-03 00:53:41 -07:00
remsky	53cf71c151	-Removed commit lock on HF repo -Warm start added to model initialization -Layer caching tweaks to dockerfile	2025-01-01 17:38:22 -07:00
remsky	4123ab0891	Refactor TTS API and enhance testing setup with coverage and logging improvements	2024-12-31 02:55:51 -07:00
remsky	c11a6ea6ea	Enhance TTS API with logging, voice pack loading, and schema updates	2024-12-31 01:57:00 -07:00
remsky	8ce8334345	- Complete TTS endpoint replacement with OpenAI compatible -Removed output directory, and update configuration settings - Added benchmarking for entire novel	2024-12-31 01:52:16 -07:00
remsky	60a19bde43	- SQLAlchemy integration for TTS queue management - Model pre-loading and database initialization in the FastAPI app lifespan.	2024-12-30 13:21:17 -07:00
remsky	ce0ef3534a	Add initial implementation of Kokoro TTS API with Docker GPU support - Set up FastAPI application with TTS service - Define API endpoints for TTS generation and voice listing - Implement Pydantic models for request and response schemas - Add Dockerfile and docker-compose.yml for containerization - Include example usage and benchmark results in README	2024-12-30 04:17:50 -07:00

21 commits