Commit graph

323 commits

Author SHA1 Message Date
Fireblade
cb22aab239 Fix streaming a wav file with captions not reaturning any captions (This is only a problem because wav streaming does not acually work) 2025-02-16 16:49:33 -05:00
Fireblade
e3dc959775 Simplify code so erverything uses AudioChunks 2025-02-16 15:37:01 -05:00
Fireblade
9c0e328318 made it skip text normalization when using other languages as it only supports english 2025-02-16 14:16:18 -05:00
Fireblade
4802128943 Replaced default voice with af_heart as af doesn't exist 2025-02-15 12:36:36 -05:00
Fireblade
8c457c3292 fixed final test 2025-02-15 09:49:15 -05:00
Fireblade
1a6e7abac3 fixed a bunch of tests 2025-02-15 09:40:01 -05:00
Fireblade
1a03ac7464 Fixed some tests 2025-02-14 15:00:47 -05:00
Fireblade
353fe79690 fix small error 2025-02-14 14:39:24 -05:00
Fireblade
842d056552 Merge branch 'streaming-word-timestamps' of https://github.com/fireblade2534/Kokoro-FastAPI into streaming-word-timestamps 2025-02-14 14:36:20 -05:00
Fireblade
9c1ced237b Cleaned up some code and fixed an error in the readme 2025-02-14 14:36:17 -05:00
Fireblade2534
b71bab45d4
Merge branch 'master' into streaming-word-timestamps 2025-02-14 14:32:41 -05:00
Fireblade
34acb17682 Mostly completed work on refractoring a bunch of code as well as streaming word level time stamps 2025-02-14 14:29:47 -05:00
Fireblade
0b5ec320c7 streaming word level time stamps 2025-02-14 13:37:42 -05:00
remsky
b00c9ec28d
Update README.md
Some checks failed
CI / test (3.10) (push) Has been cancelled
2025-02-13 20:38:45 -07:00
Fireblade
4027768920 Started work on allowing streaming word level timestamps as well as transitioning the dev code so it uses a lot more from the open ai endpoint 2025-02-13 18:00:03 -05:00
Fireblade
7772dbc2e4 fixed no stream file writing 2025-02-13 16:12:51 -05:00
remsky
f587309d8f
Update README.md
Some checks are pending
CI / test (3.10) (push) Waiting to run
2025-02-13 03:12:45 -07:00
remsky
97f82c0685
Update README.md 2025-02-13 03:11:11 -07:00
remsky
cfae7db7fc fix: bump up audio quality settings in StreamingAudioWriter
Some checks are pending
CI / test (3.10) (push) Waiting to run
2025-02-13 00:22:14 -07:00
remsky
37ea01eaf9 fix: download_format option for audio response, handling in create_speech 2025-02-13 00:04:21 -07:00
remsky
127aae4fab
Merge pull request #156 from eltociear/patch-1
docs: update README.md
2025-02-12 23:46:05 -07:00
remsky
af654d59aa
Merge pull request #155 from Krurst/master
Update openai_compatible.py to fix lang_code
2025-02-12 23:32:24 -07:00
remsky
f585185404
Update openai_compatible.py 2025-02-12 23:31:47 -07:00
remsky
694b7435f1
Merge branch 'master' into master 2025-02-12 23:31:13 -07:00
remsky
728e18b613
Merge pull request #152 from fireblade2534/fixedstuff
fixed a bunch of stuff
2025-02-12 23:21:12 -07:00
Fireblade
dbf2b99026 Simplifed generate_audio in tts_service mostly working (audio conversion does not work) 2025-02-12 22:42:41 -05:00
Fireblade
5b20602b8e More work on timestamps (Does not maintain accuracy over multiple chunks) 2025-02-12 21:36:35 -05:00
Fireblade2534
6985f6ef99 more work on streaming timestamps (not working weird error) :( 2025-02-12 20:34:55 +00:00
Fireblade2534
91d370d97f More working on streaming timestamps 2025-02-12 17:13:56 +00:00
Fireblade2534
51b6b01589 Fixed not returning enough values 2025-02-12 15:06:11 +00:00
Fireblade
5cc9d140fe WIP 2025-02-11 22:36:19 -05:00
Fireblade
45cdb607e6 WIP 2025-02-11 22:32:10 -05:00
Fireblade
da1e280805 fix tests 2025-02-11 21:30:41 -05:00
remsky
aae90b6d2e
Merge pull request #162 from zucher/master
Some checks are pending
CI / test (3.10) (push) Waiting to run
2025-02-11 17:32:16 -07:00
Fireblade
7cb5957848 added optional pluralization normalization 2025-02-11 19:24:29 -05:00
Fireblade
09de389b29 Added normilization options 2025-02-11 19:09:35 -05:00
Fireblade
8ea8e68b61 Fixed espeak backend erroring while initilizating causing espeak fallback to silently fail 2025-02-11 18:08:36 -05:00
Fireblade2534
84f3b8b4cb
Merge branch 'remsky:master' into fixedstuff 2025-02-11 17:04:20 -05:00
zucher
1e14fd8724 Fix chart ingress issue 2025-02-11 21:02:58 +00:00
remsky
7d4ded6e2e
Merge pull request #157 from zucher/master
Some checks are pending
CI / test (3.10) (push) Waiting to run
Add Helm chart
2025-02-11 12:08:59 -07:00
Vincent Bailleau
d4f248b3a2 Add Helm chart 2025-02-11 19:10:01 +01:00
Ikko Eltociear Ashimine
b6dd9f326b
docs: update README.md
accomodate -> accommodate
2025-02-12 02:11:08 +09:00
Krurst
1cf011b2eb
Update openai_compatible.py to fix lang_code
properly sets lang_code from api request, and applies config default if not set
2025-02-11 23:35:51 +08:00
Fireblade2534
64980b5bc8 made it so bytes vs bits are translated correctly 2025-02-11 15:18:10 +00:00
Fireblade2534
68cb097d9b Merged from orgin/master 2025-02-11 14:05:14 +00:00
remsky
24b31ccbb5 -Fixed espeak engagement on gpu
Some checks are pending
CI / test (3.10) (push) Waiting to run
-Add default voice code setting and update language code resolution logic
2025-02-11 04:49:48 -07:00
Fireblade
737e49a3f9 removed testing start-gpu.bat 2025-02-10 21:49:05 -05:00
Fireblade
ab1c21130e Made the api use the normalizer, fixed the wrong version of espeak, added better normilzation, improved the sentence splitting, fixed some formatting 2025-02-10 21:45:52 -05:00
remsky
9b76ce2071
Create LICENSE 2025-02-10 02:08:51 -07:00
remsky
3f45a506de
Update README.md 2025-02-10 02:04:50 -07:00