Kokoro-FastAPI

mirror of https://github.com/remsky/Kokoro-FastAPI.git synced 2025-08-05 16:48:53 +00:00

Author	SHA1	Message	Date
remsky	280291f2aa	ci: update Docker configurations to use pre-built images and UI services	2025-01-04 20:59:30 -07:00
remsky	e799f0c7c1	WIP: basic tests on OpenAI streaming compatibility	2025-01-04 18:09:23 -07:00
remsky	0e9f77fc79	WIP: open ai compatible streaming	2025-01-04 17:55:36 -07:00
remsky	f1eb1d9590	First streaming attempt	2025-01-04 17:54:54 -07:00
remsky	a65fd61cdb	Update README.md	2025-01-04 03:07:30 -07:00
remsky	65bf15f153	ci: update docker workflow to only build on releases	2025-01-04 02:50:45 -07:00
remsky	fe114c3367	Merge pull request #7 from remsky/feat/onnx-inference Feat/onnx inference Added some optimization options for ONNX Refactoring phonemizer/tokenizer services Cleaned up benchmarking and check scripts Added auto-wav validationd	2025-01-04 02:48:21 -07:00
remsky	76e8b07a92	Allow ONNX support optimizations for CPU inference and update benchmarking scripts; modify README for clarity on performance metrics	2025-01-04 02:46:27 -07:00
remsky	93aa205da9	Enhance ONNX optimization settings and add validation script for TTS audio files	2025-01-04 02:14:46 -07:00
remsky	7df2a68fb4	- CPU ONNX + PyTorch CUDA, functional - Incorporated text processing module as service, towards modularization and optimizations - Added text processing router for phonemization - Enhanced benchmark statistics with real-time speed metrics	2025-01-03 17:54:17 -07:00
remsky	9496a3a63f	WIP: CPU/GPU Functional, few straggling tests to fix and check.	2025-01-03 03:16:42 -07:00
remsky	e4d8e74738	WIP, Functional for CPU: Updated for ONNX runtime support, Dockerfile and TTS Service	2025-01-03 00:53:41 -07:00
remsky	f1131b4836	Update GitHub Actions workflow and README for Docker publishing and new badge	2025-01-02 20:27:12 -07:00
remsky	ee5be65596	Merge pull request #6 from dino65-dev/master Enhance Audio Converter	2025-01-02 15:38:37 -07:00
remsky	40894449da	added output audio tests, validation	2025-01-02 15:36:53 -07:00
DINMAY KUMAR BRAHMA	8ccca1fcad	Update audio.py	2025-01-03 00:28:59 +05:30
DINMAY KUMAR BRAHMA	e9014e620e	Merge branch 'remsky:master' into master	2025-01-02 21:37:01 +05:30
remsky	d1c3feb8c4	Update README.md	2025-01-02 02:53:41 -07:00
remsky	b3fdf2e2c8	feat: build steps for UI Docker image, tagging	2025-01-02 02:31:37 -07:00
remsky	ff33f5494d	fix: add latest tag to Docker publish workflow	2025-01-02 02:23:46 -07:00
remsky	3ca79c609f	fix: longer timeouts, fix on hf model pull	2025-01-02 01:59:25 -07:00
remsky	7d7da3ca1b	test: Add GitHub Actions workflow for Docker build and publish	2025-01-02 01:32:53 -07:00
remsky	e2ce93cb41	Update README.md	2025-01-01 21:54:28 -07:00
remsky	5a5bafc0c2	Update README.md	2025-01-01 21:53:29 -07:00
remsky	ec2e42b9b8	Merge pull request #4 from remsky/feat/gradio-gui Feat/gradio gui	2025-01-01 21:52:10 -07:00
remsky	f051984805	Ruff Check + Format	2025-01-01 21:50:41 -07:00
remsky	e749b3bc88	Add Gradio web interface + tests	2025-01-01 21:50:00 -07:00
remsky	19321eabb2	Merge branch 'master' into feat/gradio-gui	2025-01-01 17:39:54 -07:00
remsky	53cf71c151	-Removed commit lock on HF repo -Warm start added to model initialization -Layer caching tweaks to dockerfile	2025-01-01 17:38:22 -07:00
remsky	1163beae3a	WIP: Add Gradio interface for Kokoro TTS application with input, model, and output components	2025-01-01 17:34:01 -07:00
DINMAY KUMAR BRAHMA	94b6fc22ea	Update audio.py	2025-01-01 21:11:23 +05:30
remsky	7938de0f4a	Update docker-compose.cpu.yml	2025-01-01 03:41:52 -07:00
remsky	cf61cfa005	Update commit hash to include af_sky	2025-01-01 03:41:23 -07:00
remsky	a672fbc798	Update README.md	2024-12-31 19:11:21 -07:00
remsky	d8de7a4044	Merge branch 'master' of https://github.com/remsky/Kokoro-FastAPI	2024-12-31 19:04:48 -07:00
remsky	18e18c35a7	Enhance voice handling and add voice combination feature in TTS API	2024-12-31 19:04:40 -07:00
remsky	decf9123e7	Merge pull request #2 from eschmidbauer/master Looks great	2024-12-31 19:02:21 -07:00
remsky	05e1e30c47	- modified voice loading to copy on init - adjustments to the combine voices functionality - error handling and analysis	2024-12-31 18:55:26 -07:00
remsky	3b1b0843d6	Update README.md	2024-12-31 14:49:43 -07:00
Emmanuel Schmidbauer	510b01cc90	add ability to combine voices	2024-12-31 10:30:12 -05:00
remsky	ded92c96ad	Update README.md	2024-12-31 05:56:22 -07:00
remsky	0652056107	Add preliminary Docker support for CPU deployment	2024-12-31 04:51:21 -07:00
remsky	32a35f9a5a	Update README.md	2024-12-31 04:29:48 -07:00
remsky	415c592929	Update README.md Added sample audio player	2024-12-31 04:24:09 -07:00
remsky	f7ce5e24c7	Update README.md	2024-12-31 03:53:12 -07:00
remsky	f800c4ecf9	Added mp3 samples	2024-12-31 03:48:26 -07:00
remsky	607df6e03b	Update README and tests to clarify audio format support and enhance documentation	2024-12-31 03:46:31 -07:00
remsky	36606f7234	Refactor Docker setup to use a dedicated model-fetcher service and update schemas for additional voice support	2024-12-31 03:41:45 -07:00
remsky	4123ab0891	Refactor TTS API and enhance testing setup with coverage and logging improvements	2024-12-31 02:55:51 -07:00
remsky	c11a6ea6ea	Enhance TTS API with logging, voice pack loading, and schema updates	2024-12-31 01:57:00 -07:00

1 2 3

116 commits