Cong Nguyen
9a9bc4aca9
added support for mps on mac with apple silicon
2025-03-10 11:58:45 +11:00
remsky
6c234a3b67
Update dependencies, enhance voice management, and add captioned speech support
2025-02-04 19:41:41 -07:00
remsky
2dfad62b5b
Update .gitignore and benchmark scripts for GPU support; enhance TTS service handling and session management
2025-01-30 05:47:28 -07:00
remsky
f61f79981d
- Add debug endpoint for system stats
...
- Adjust headers, generate from phonemes, etc.
2025-01-30 04:44:04 -07:00
remsky
2e318051f8
Add clear text button and enhance temporary file management
...
- Introduced a "Clear Text" button in the web interface for user convenience.
- Updated temporary file management settings in the configuration.
- Added new debug endpoints for system and storage information.
- Improved logging levels for better debugging insights.
2025-01-29 18:29:02 -07:00
remsky
18b15728a8
Refactor web player architecture: separate concerns into PlayerState, VoiceService, and AudioService; update HTML and CSS for new structure; add progress bar styles and ignore unnecessary files.
2025-01-28 03:25:29 -07:00
remsky
ab28a62e86
Refactor inference architecture: remove legacy TTS model, add ONNX and PyTorch backends, and introduce model configuration schemas
2025-01-20 22:42:29 -07:00
Fireblade2534
9d84207c35
Added app to git ignore
2025-01-17 15:38:31 +00:00
remsky
d20da2f92e
Default hexxgrad voicepacks added as temporary fix
2025-01-15 09:42:27 +00:00
remsky
58cc7c898a
ci: update dependency installation and testing commands; add ebooklib and html2text dependencies
2025-01-13 23:48:13 -07:00
remsky
007b1a35e8
feat: merge master into core/uv-management for v0.1.0
...
Major changes:
- Baked model directly into Dockerfile for improved deployment
- Switched to uv for dependency management
- Restructured Docker files into docker/cpu and docker/gpu directories
- Updated configuration for better ONNX performance
2025-01-13 19:31:44 -07:00
remsky
f3ba8ad7f3
Merge branch 'master' of https://github.com/remsky/Kokoro-FastAPI
2025-01-12 21:44:45 -07:00
remsky
898231eefd
fix: update .gitignore and suppress errant Gradio warnings in app.py
2025-01-12 21:34:13 -07:00
remsky
ce63c8d131
fix: healthchecks and curl installation to Docker configurations
2025-01-12 20:42:11 -07:00
remsky
5e9a0ae2b1
Update .gitignore to include additional patterns for Python, environment, IDE, and project-specific files
2025-01-12 14:45:26 -07:00
remsky
3d0ca2a8c2
Update Dockerfiles for baked in models, adjustments to cpu/gpu environment splits
2025-01-12 05:23:02 -07:00
remsky
38e0b87320
Initial swap to UV dependency management
2025-01-11 20:00:34 -07:00
remsky
926ea8cecf
Refactor Docker configurations and update test mocks for development routers
2025-01-10 22:03:16 -07:00
remsky
4c6cd83f85
Swapped generator to preprocessing
2025-01-04 22:23:59 -07:00
remsky
f1eb1d9590
First streaming attempt
2025-01-04 17:54:54 -07:00
remsky
7df2a68fb4
- CPU ONNX + PyTorch CUDA, functional
...
- Incorporated text processing module as service, towards modularization and optimizations
- Added text processing router for phonemization
- Enhanced benchmark statistics with real-time speed metrics
2025-01-03 17:54:17 -07:00
remsky
40894449da
added output audio tests, validation
2025-01-02 15:36:53 -07:00
remsky
53cf71c151
- Removed commit lock on HF repo
...
- Warm start added to model initialization
- Layer caching tweaks to Dockerfile
2025-01-01 17:38:22 -07:00
remsky
05e1e30c47
- modified voice loading to copy on init
...
- adjustments to the combine voices functionality
- error handling and analysis
2024-12-31 18:55:26 -07:00
remsky
ce0ef3534a
Add initial implementation of Kokoro TTS API with Docker GPU support
...
- Set up FastAPI application with TTS service
- Define API endpoints for TTS generation and voice listing
- Implement Pydantic models for request and response schemas
- Add Dockerfile and docker-compose.yml for containerization
- Include example usage and benchmark results in README
2024-12-30 04:17:50 -07:00