Commit graph

29 commits

Author SHA1 Message Date
remsky
720c1fb97d -update soundfile version
-alignment with streaming standards
-audio processing config settings
-more comprehensive model warmup
-minor model improvements
-enhancing testing, benchmarking
-cool ascii logo
2025-01-06 03:32:41 -07:00
remsky
76e8b07a92 Allow ONNX support optimizations for CPU inference and update benchmarking scripts; modify README for clarity on performance metrics 2025-01-04 02:46:27 -07:00
remsky
7df2a68fb4 - CPU ONNX + PyTorch CUDA, functional
- Incorporated text processing module as service, towards modularization and optimizations
- Added text processing router for phonemization
- Enhanced benchmark statistics with real-time speed metrics
2025-01-03 17:54:17 -07:00
remsky
f1131b4836 Update GitHub Actions workflow and README for Docker publishing and new badge 2025-01-02 20:27:12 -07:00
remsky
40894449da added output audio tests, validation 2025-01-02 15:36:53 -07:00
remsky
d1c3feb8c4 Update README.md 2025-01-02 02:53:41 -07:00
remsky
e2ce93cb41 Update README.md 2025-01-01 21:54:28 -07:00
remsky
5a5bafc0c2 Update README.md 2025-01-01 21:53:29 -07:00
remsky
e749b3bc88 Add Gradio web interface + tests 2025-01-01 21:50:00 -07:00
remsky
53cf71c151 -Removed commit lock on HF repo
-Warm start added to model initialization
-Layer caching tweaks to dockerfile
2025-01-01 17:38:22 -07:00
remsky
a672fbc798 Update README.md 2024-12-31 19:11:21 -07:00
remsky
18e18c35a7 Enhance voice handling and add voice combination feature in TTS API 2024-12-31 19:04:40 -07:00
remsky
05e1e30c47 - modified voice loading to copy on init
- adjustments to the combine voices functionality
- error handling and analysis
2024-12-31 18:55:26 -07:00
remsky
3b1b0843d6 Update README.md 2024-12-31 14:49:43 -07:00
remsky
ded92c96ad Update README.md 2024-12-31 05:56:22 -07:00
remsky
0652056107 Add preliminary Docker support for CPU deployment 2024-12-31 04:51:21 -07:00
remsky
32a35f9a5a Update README.md 2024-12-31 04:29:48 -07:00
remsky
415c592929 Update README.md
Added sample audio player
2024-12-31 04:24:09 -07:00
remsky
f7ce5e24c7 Update README.md 2024-12-31 03:53:12 -07:00
remsky
607df6e03b Update README and tests to clarify audio format support and enhance documentation 2024-12-31 03:46:31 -07:00
remsky
36606f7234 Refactor Docker setup to use a dedicated model-fetcher service and update schemas for additional voice support 2024-12-31 03:41:45 -07:00
remsky
98e49c0cf9 Update README.md 2024-12-30 06:48:27 -07:00
remsky
6cfb26eb0c Update README.md 2024-12-30 06:27:54 -07:00
remsky
66f3838170 Update README.md 2024-12-30 06:26:10 -07:00
remsky
ca66dfe27e Merge branch 'master' of https://github.com/remsky/Kokoro-FastAPI 2024-12-30 05:29:45 -07:00
remsky
ae6f49c7e1 Update Dockerfile and docker-compose.yml to add versions, specify Kokoro commit 2024-12-30 05:29:35 -07:00
remsky
6d86a8dc78 Update README.md 2024-12-30 04:55:55 -07:00
remsky
aa2df45858 Update README with performance benchmarks and usage examples; add benchmark plotting script 2024-12-30 04:53:29 -07:00
remsky
ce0ef3534a Add initial implementation of Kokoro TTS API with Docker GPU support
- Set up FastAPI application with TTS service
- Define API endpoints for TTS generation and voice listing
- Implement Pydantic models for request and response schemas
- Add Dockerfile and docker-compose.yml for containerization
- Include example usage and benchmark results in README
2024-12-30 04:17:50 -07:00