Commit graph

394 commits

Author SHA1 Message Date
Fireblade
f415ce7109 don't replace brackets as that is handled in misaki 2025-02-28 21:39:12 -05:00
Fireblade
906cf77a65 preserve custom phenomes 2025-02-28 21:37:46 -05:00
remsky
9c6e72943c
Merge pull request #207 from fireblade2534/master
Some checks failed
CI / test (3.10) (push) Has been cancelled
Fix low quality because audio was being encoded at a lower bitrate
2025-02-27 03:43:38 -07:00
Fireblade
9247bc3a12 notremoved the rate argument which apperently means bitrate 2025-02-26 21:51:00 -05:00
Fireblade
980bc5b4a8 Fix low quality because audio was being encoded at a lower bitrate 2025-02-26 20:52:38 -05:00
blakkd
664451e11c added docker to video group 2025-02-24 02:16:07 +01:00
blakkd
3c5029f801 converted CRLF ending lines to LF ones in api/src/structures/custom_responses.py
let ruff organise the imports
2025-02-24 02:11:48 +01:00
remsky
7d73c3c7ee
Merge pull request #173 from fireblade2534/streaming-word-timestamps
Some checks failed
CI / test (3.10) (push) Has been cancelled
Streaming word timestamps
2025-02-22 23:12:22 -07:00
Fireblade
e6feea78a3 Testing error 2025-02-22 15:29:26 -05:00
Fireblade
5de3cace3b Fix some tests and allow running the docker container offline 2025-02-22 15:17:28 -05:00
Fireblade
c1207f085b Merge remote-tracking branch 'upstream/master' into streaming-word-timestamps 2025-02-22 14:58:28 -05:00
remsky
39cc056fe2
Merge pull request #179 from fireblade2534/normalization-changes
Some checks failed
CI / test (3.10) (push) Has been cancelled
2025-02-21 20:00:15 -07:00
remsky
3fd37b837b
Merge pull request #186 from fireblade2534/Add-.gitattribues-file 2025-02-21 19:59:53 -07:00
remsky
a6defbff18
Merge pull request #171 from randombk/pr-no-reload 2025-02-21 19:59:21 -07:00
Fireblade
c5a3e13670 Converted the stream writer to use pyav 2025-02-19 23:10:51 -05:00
Fireblade
4ee4d36822 Fixes a couple of issues with audio triming and prevents errors with single voice weights 2025-02-18 18:12:49 -05:00
Fireblade
7f15ba8fed Add a .gitattributes 2025-02-18 17:44:03 -05:00
Fireblade
f2b2f41412 fixed wrong varible name bug 2025-02-16 17:07:41 -05:00
Fireblade
cb22aab239 Fix streaming a wav file with captions not reaturning any captions (This is only a problem because wav streaming does not acually work) 2025-02-16 16:49:33 -05:00
Fireblade
e3dc959775 Simplify code so erverything uses AudioChunks 2025-02-16 15:37:01 -05:00
Fireblade
9c0e328318 made it skip text normalization when using other languages as it only supports english 2025-02-16 14:16:18 -05:00
Fireblade
41598eb3c5 better parsing for times and phone numbers 2025-02-15 19:02:57 -05:00
Fireblade
3290bada2e changes to how money and numbers are handled 2025-02-15 17:48:12 -05:00
Fireblade
4802128943 Replaced default voice with af_heart as af doesn't exist 2025-02-15 12:36:36 -05:00
Fireblade
8c457c3292 fixed final test 2025-02-15 09:49:15 -05:00
Fireblade
1a6e7abac3 fixed a bunch of tests 2025-02-15 09:40:01 -05:00
Fireblade
1a03ac7464 Fixed some tests 2025-02-14 15:00:47 -05:00
Fireblade
353fe79690 fix small error 2025-02-14 14:39:24 -05:00
Fireblade
842d056552 Merge branch 'streaming-word-timestamps' of https://github.com/fireblade2534/Kokoro-FastAPI into streaming-word-timestamps 2025-02-14 14:36:20 -05:00
Fireblade
9c1ced237b Cleaned up some code and fixed an error in the readme 2025-02-14 14:36:17 -05:00
Fireblade2534
b71bab45d4
Merge branch 'master' into streaming-word-timestamps 2025-02-14 14:32:41 -05:00
Fireblade
34acb17682 Mostly completed work on refractoring a bunch of code as well as streaming word level time stamps 2025-02-14 14:29:47 -05:00
Fireblade
0b5ec320c7 streaming word level time stamps 2025-02-14 13:37:42 -05:00
David Li
2b9933479a Disable --reload on unicorn/fastapi to avoid pegging a CPU core
Signed-off-by: David Li <jiawei.davidli@gmail.com>
2025-02-13 20:12:53 -08:00
remsky
b00c9ec28d
Update README.md
Some checks failed
CI / test (3.10) (push) Has been cancelled
2025-02-13 20:38:45 -07:00
Fireblade
4027768920 Started work on allowing streaming word level timestamps as well as transitioning the dev code so it uses a lot more from the open ai endpoint 2025-02-13 18:00:03 -05:00
Fireblade
7772dbc2e4 fixed no stream file writing 2025-02-13 16:12:51 -05:00
remsky
f587309d8f
Update README.md
Some checks are pending
CI / test (3.10) (push) Waiting to run
2025-02-13 03:12:45 -07:00
remsky
97f82c0685
Update README.md 2025-02-13 03:11:11 -07:00
remsky
cfae7db7fc fix: bump up audio quality settings in StreamingAudioWriter
Some checks are pending
CI / test (3.10) (push) Waiting to run
2025-02-13 00:22:14 -07:00
remsky
37ea01eaf9 fix: download_format option for audio response, handling in create_speech 2025-02-13 00:04:21 -07:00
remsky
127aae4fab
Merge pull request #156 from eltociear/patch-1
docs: update README.md
2025-02-12 23:46:05 -07:00
remsky
af654d59aa
Merge pull request #155 from Krurst/master
Update openai_compatible.py to fix lang_code
2025-02-12 23:32:24 -07:00
remsky
f585185404
Update openai_compatible.py 2025-02-12 23:31:47 -07:00
remsky
694b7435f1
Merge branch 'master' into master 2025-02-12 23:31:13 -07:00
remsky
728e18b613
Merge pull request #152 from fireblade2534/fixedstuff
fixed a bunch of stuff
2025-02-12 23:21:12 -07:00
Fireblade
dbf2b99026 Simplifed generate_audio in tts_service mostly working (audio conversion does not work) 2025-02-12 22:42:41 -05:00
Fireblade
5b20602b8e More work on timestamps (Does not maintain accuracy over multiple chunks) 2025-02-12 21:36:35 -05:00
Fireblade2534
6985f6ef99 more work on streaming timestamps (not working weird error) :( 2025-02-12 20:34:55 +00:00
Fireblade2534
91d370d97f More working on streaming timestamps 2025-02-12 17:13:56 +00:00