remsky
720c1fb97d
-update soundfile version
...
-alignment with streaming standards
-audio processing config settings
-more comprehensive model warmup
-minor model improvements
-enhancing testing, benchmarking
-cool ascii logo
2025-01-06 03:32:41 -07:00
remsky
76e8b07a92
Allow ONNX support optimizations for CPU inference and update benchmarking scripts; modify README for clarity on performance metrics
2025-01-04 02:46:27 -07:00
remsky
7df2a68fb4
- CPU ONNX + PyTorch CUDA, functional
...
- Incorporated text processing module as service, towards modularization and optimizations
- Added text processing router for phonemization
- Enhanced benchmark statistics with real-time speed metrics
2025-01-03 17:54:17 -07:00
remsky
f1131b4836
Update GitHub Actions workflow and README for Docker publishing and new badge
2025-01-02 20:27:12 -07:00
remsky
40894449da
added output audio tests, validation
2025-01-02 15:36:53 -07:00
remsky
d1c3feb8c4
Update README.md
2025-01-02 02:53:41 -07:00
remsky
e2ce93cb41
Update README.md
2025-01-01 21:54:28 -07:00
remsky
5a5bafc0c2
Update README.md
2025-01-01 21:53:29 -07:00
remsky
e749b3bc88
Add Gradio web interface + tests
2025-01-01 21:50:00 -07:00
remsky
53cf71c151
-Removed commit lock on HF repo
...
-Warm start added to model initialization
-Layer caching tweaks to dockerfile
2025-01-01 17:38:22 -07:00
remsky
a672fbc798
Update README.md
2024-12-31 19:11:21 -07:00
remsky
18e18c35a7
Enhance voice handling and add voice combination feature in TTS API
2024-12-31 19:04:40 -07:00
remsky
05e1e30c47
- modified voice loading to copy on init
...
- adjustments to the combine voices functionality
- error handling and analysis
2024-12-31 18:55:26 -07:00
remsky
3b1b0843d6
Update README.md
2024-12-31 14:49:43 -07:00
remsky
ded92c96ad
Update README.md
2024-12-31 05:56:22 -07:00
remsky
0652056107
Add preliminary Docker support for CPU deployment
2024-12-31 04:51:21 -07:00
remsky
32a35f9a5a
Update README.md
2024-12-31 04:29:48 -07:00
remsky
415c592929
Update README.md
...
Added sample audio player
2024-12-31 04:24:09 -07:00
remsky
f7ce5e24c7
Update README.md
2024-12-31 03:53:12 -07:00
remsky
607df6e03b
Update README and tests to clarify audio format support and enhance documentation
2024-12-31 03:46:31 -07:00
remsky
36606f7234
Refactor Docker setup to use a dedicated model-fetcher service and update schemas for additional voice support
2024-12-31 03:41:45 -07:00
remsky
98e49c0cf9
Update README.md
2024-12-30 06:48:27 -07:00
remsky
6cfb26eb0c
Update README.md
2024-12-30 06:27:54 -07:00
remsky
66f3838170
Update README.md
2024-12-30 06:26:10 -07:00
remsky
ca66dfe27e
Merge branch 'master' of https://github.com/remsky/Kokoro-FastAPI
2024-12-30 05:29:45 -07:00
remsky
ae6f49c7e1
Update Dockerfile and docker-compose.yml to add versions, specify Kokoro commit
2024-12-30 05:29:35 -07:00
remsky
6d86a8dc78
Update README.md
2024-12-30 04:55:55 -07:00
remsky
aa2df45858
Update README with performance benchmarks and usage examples; add benchmark plotting script
2024-12-30 04:53:29 -07:00
remsky
ce0ef3534a
Add initial implementation of Kokoro TTS API with Docker GPU support
...
- Set up FastAPI application with TTS service
- Define API endpoints for TTS generation and voice listing
- Implement Pydantic models for request and response schemas
- Add Dockerfile and docker-compose.yml for containerization
- Include example usage and benchmark results in README
2024-12-30 04:17:50 -07:00