Kokoro-FastAPI

mirror of https://github.com/remsky/Kokoro-FastAPI.git synced 2025-04-13 09:39:17 +00:00

Author	SHA1	Message	Date
Fireblade	da1e280805	fix tests	2025-02-11 21:30:41 -05:00
remsky	8ed2f2afb6	Add model listing and retrieval endpoints with tests	2025-02-09 20:55:21 -07:00
remsky	d73ed87987	Update handling in generate_captioned_speech to stream immediately, templink for caption file, and add unit tests for captioned speech generation	2025-02-09 20:26:59 -07:00
remsky	a91e0fe9df	Ruff check + formatting	2025-02-09 18:32:17 -07:00
remsky	a0dc870f4a	-fix voice selection not matching language phonemes -added voice language override parameter	2025-02-08 01:29:15 -07:00
Fireblade2534	429c959b22	fixed test case	2025-02-07 18:44:48 +00:00
remsky	ac7947b51a	Refactor Docker configurations for GPU and CPU, update test paths, and remove deprecated tests	2025-02-06 23:43:26 -07:00
remsky	165ffccd01	Remove voice manager tests and update Dockerfiles for improved dependency management and user permissions	2025-02-06 04:23:08 -07:00
remsky	d3741d0d99	v1_0 full migration, captions, gpu, cpu, webui updates	2025-02-05 00:46:01 -07:00
remsky	fb22264edc	Enhance audio generation and download handling; add test audio generation script	2025-01-30 22:56:23 -07:00
remsky	f61f79981d	-Add debug endpoint for system stats -Adjust headers, generate from phonemes, etc	2025-01-30 04:44:04 -07:00
remsky	2e318051f8	Add clear text button and enhance temporary file management - Introduced a "Clear Text" button in the web interface for user convenience. - Updated temporary file management settings in the configuration. - Added new debug endpoints for system and storage information. - Improved logging levels for better debugging insights.	2025-01-29 18:29:02 -07:00
remsky	9867fc398f	WIP: v1_0_0 migration	2025-01-28 13:52:57 -07:00
remsky	ba577d348e	Enhance web player information, adjust text chunk size, update audio wave settings, and implement OpenAI model mappings	2025-01-23 04:11:31 -07:00
remsky	8e8f120a3e	Update configuration to disable local voice saving, enhance voice validation logic, and remove deprecated test file	2025-01-23 02:00:46 -07:00
remsky	df4cc5b4b2	-Adjust testing framework for new model -Add web player support: include static file serving and HTML interface for TTS	2025-01-22 21:11:47 -07:00
Richard Roberson	d51d861861	add AAC audio format and test	2025-01-17 21:43:10 -07:00
remsky	22752900e5	Ruff checks, ci fix	2025-01-13 20:15:46 -07:00
remsky	387653050b	refactor: streamline audio normalization process and update tests	2025-01-13 18:56:49 -07:00
remsky	926ea8cecf	Refactor Docker configurations and update test mocks for development routers	2025-01-10 22:03:16 -07:00
remsky	e8c1284032	Ruff format + fix	2025-01-09 18:41:44 -07:00
remsky	4b521f9bf0	- Added GenerateFromPhonemesRequest model to text_schemas.py - Refactored TTS model initialization methods in tts_gpu.py and tts_cpu.py - Added custom logger configuration in main.py - Deprecated text_processing router -> development route	2025-01-09 07:20:14 -07:00
remsky	a0a85f5ef0	-add email handling, minor additional URL processing, tests	2025-01-08 03:13:17 -07:00
remsky	d7e8a5c953	Adjusting aiofiles implementation, testing	2025-01-07 04:30:02 -07:00
remsky	130b084cce	- Added support for combining voices via any endpoint - Updated the `process_voices` function to handle both string and list formats for voice input.	2025-01-07 03:50:08 -07:00
remsky	fddf26c905	Added tested, slight changes to regex	2025-01-07 00:18:44 -07:00
remsky	720c1fb97d	-update soundfile version -alignment with streaming standards -audio processing config settings -more comprehensive model warmup -minor model improvements -enhancing testing, benchmarking -cool ascii logo	2025-01-06 03:32:41 -07:00
remsky	e799f0c7c1	WIP: basic tests on OpenAI streaming compatibility	2025-01-04 18:09:23 -07:00
remsky	7df2a68fb4	- CPU ONNX + PyTorch CUDA, functional - Incorporated text processing module as service, towards modularization and optimizations - Added text processing router for phonemization - Enhanced benchmark statistics with real-time speed metrics	2025-01-03 17:54:17 -07:00
remsky	9496a3a63f	WIP: CPU/GPU Functional, few straggling tests to fix and check.	2025-01-03 03:16:42 -07:00
remsky	e4d8e74738	WIP, Functional for CPU: Updated for ONNX runtime support, Dockerfile and TTS Service	2025-01-03 00:53:41 -07:00
remsky	40894449da	added output audio tests, validation	2025-01-02 15:36:53 -07:00
remsky	f051984805	Ruff Check + Format	2025-01-01 21:50:41 -07:00
remsky	53cf71c151	-Removed commit lock on HF repo -Warm start added to model initialization -Layer caching tweaks to dockerfile	2025-01-01 17:38:22 -07:00
remsky	05e1e30c47	- modified voice loading to copy on init - adjustments to the combine voices functionality - error handling and analysis	2024-12-31 18:55:26 -07:00
remsky	4123ab0891	Refactor TTS API and enhance testing setup with coverage and logging improvements	2024-12-31 02:55:51 -07:00
remsky	c11a6ea6ea	Enhance TTS API with logging, voice pack loading, and schema updates	2024-12-31 01:57:00 -07:00
remsky	8ce8334345	- Complete TTS endpoint replacement with OpenAI compatible -Removed output directory, and update configuration settings - Added benchmarking for entire novel	2024-12-31 01:52:16 -07:00
remsky	175daea325	Added basic pytest on the fastapi side	2024-12-30 13:25:30 -07:00

39 commits