Commit graph

25 commits

Author SHA1 Message Date
Cong Nguyen
9a9bc4aca9 Added support for MPS on Mac with Apple silicon 2025-03-10 11:58:45 +11:00
remsky
6c234a3b67 Update dependencies, enhance voice management, and add captioned speech support 2025-02-04 19:41:41 -07:00
remsky
2dfad62b5b Update .gitignore and benchmark scripts for GPU support; enhance TTS service handling and session management 2025-01-30 05:47:28 -07:00
remsky
f61f79981d - Add debug endpoint for system stats
- Adjust headers, generate from phonemes, etc.
2025-01-30 04:44:04 -07:00
remsky
2e318051f8 Add clear text button and enhance temporary file management
- Introduced a "Clear Text" button in the web interface for user convenience.
- Updated temporary file management settings in the configuration.
- Added new debug endpoints for system and storage information.
- Improved logging levels for better debugging insights.
2025-01-29 18:29:02 -07:00
remsky
18b15728a8 Refactor web player architecture: separate concerns into PlayerState, VoiceService, and AudioService; update HTML and CSS for new structure; add progress bar styles and ignore unnecessary files. 2025-01-28 03:25:29 -07:00
remsky
ab28a62e86 Refactor inference architecture: remove legacy TTS model, add ONNX and PyTorch backends, and introduce model configuration schemas 2025-01-20 22:42:29 -07:00
Fireblade2534
9d84207c35 Added app to git ignore 2025-01-17 15:38:31 +00:00
remsky
d20da2f92e Default hexxgrad voicepacks added as temporary fix 2025-01-15 09:42:27 +00:00
remsky
58cc7c898a ci: update dependency installation and testing commands; add ebooklib and html2text dependencies 2025-01-13 23:48:13 -07:00
remsky
007b1a35e8 feat: merge master into core/uv-management for v0.1.0
Major changes:
- Baked model directly into Dockerfile for improved deployment
- Switched to uv for dependency management
- Restructured Docker files into docker/cpu and docker/gpu directories
- Updated configuration for better ONNX performance
2025-01-13 19:31:44 -07:00
remsky
f3ba8ad7f3 Merge branch 'master' of https://github.com/remsky/Kokoro-FastAPI 2025-01-12 21:44:45 -07:00
remsky
898231eefd fix: update .gitignore and suppress errant Gradio warnings in app.py 2025-01-12 21:34:13 -07:00
remsky
ce63c8d131 fix: add healthchecks and curl installation to Docker configurations 2025-01-12 20:42:11 -07:00
remsky
5e9a0ae2b1 Update .gitignore to include additional patterns for Python, environment, IDE, and project-specific files 2025-01-12 14:45:26 -07:00
remsky
3d0ca2a8c2 Update Dockerfiles for baked in models, adjustments to cpu/gpu environment splits 2025-01-12 05:23:02 -07:00
remsky
38e0b87320 Initial swap to uv dependency management 2025-01-11 20:00:34 -07:00
remsky
926ea8cecf Refactor Docker configurations and update test mocks for development routers 2025-01-10 22:03:16 -07:00
remsky
4c6cd83f85 Swapped generator to preprocessing 2025-01-04 22:23:59 -07:00
remsky
f1eb1d9590 First streaming attempt 2025-01-04 17:54:54 -07:00
remsky
7df2a68fb4 - CPU ONNX + PyTorch CUDA, functional
- Incorporated text processing module as service, towards modularization and optimizations
- Added text processing router for phonemization
- Enhanced benchmark statistics with real-time speed metrics
2025-01-03 17:54:17 -07:00
remsky
40894449da added output audio tests, validation 2025-01-02 15:36:53 -07:00
remsky
53cf71c151 - Removed commit lock on HF repo
- Warm start added to model initialization
- Layer caching tweaks to Dockerfile
2025-01-01 17:38:22 -07:00
remsky
05e1e30c47 - modified voice loading to copy on init
- adjustments to the combine voices functionality
- error handling and analysis
2024-12-31 18:55:26 -07:00
remsky
ce0ef3534a Add initial implementation of Kokoro TTS API with Docker GPU support
- Set up FastAPI application with TTS service
- Define API endpoints for TTS generation and voice listing
- Implement Pydantic models for request and response schemas
- Add Dockerfile and docker-compose.yml for containerization
- Include example usage and benchmark results in README
2024-12-30 04:17:50 -07:00