Fireblade
|
226a75e782
|
fixes the low quality fix not working properly
|
2025-02-28 21:57:33 -05:00 |
|
Fireblade
|
9247bc3a12
|
notremoved the rate argument which apperently means bitrate
|
2025-02-26 21:51:00 -05:00 |
|
Fireblade
|
980bc5b4a8
|
Fix low quality because audio was being encoded at a lower bitrate
|
2025-02-26 20:52:38 -05:00 |
|
Fireblade
|
5de3cace3b
|
Fix some tests and allow running the docker container offline
|
2025-02-22 15:17:28 -05:00 |
|
Fireblade
|
c1207f085b
|
Merge remote-tracking branch 'upstream/master' into streaming-word-timestamps
|
2025-02-22 14:58:28 -05:00 |
|
remsky
|
39cc056fe2
|
Merge pull request #179 from fireblade2534/normalization-changes
CI / test (3.10) (push) Has been cancelled
|
2025-02-21 20:00:15 -07:00 |
|
Fireblade
|
c5a3e13670
|
Converted the stream writer to use pyav
|
2025-02-19 23:10:51 -05:00 |
|
Fireblade
|
4ee4d36822
|
Fixes a couple of issues with audio triming and prevents errors with single voice weights
|
2025-02-18 18:12:49 -05:00 |
|
Fireblade
|
7f15ba8fed
|
Add a .gitattributes
|
2025-02-18 17:44:03 -05:00 |
|
Fireblade
|
f2b2f41412
|
fixed wrong varible name bug
|
2025-02-16 17:07:41 -05:00 |
|
Fireblade
|
cb22aab239
|
Fix streaming a wav file with captions not reaturning any captions (This is only a problem because wav streaming does not acually work)
|
2025-02-16 16:49:33 -05:00 |
|
Fireblade
|
e3dc959775
|
Simplify code so erverything uses AudioChunks
|
2025-02-16 15:37:01 -05:00 |
|
Fireblade
|
9c0e328318
|
made it skip text normalization when using other languages as it only supports english
|
2025-02-16 14:16:18 -05:00 |
|
Fireblade
|
41598eb3c5
|
better parsing for times and phone numbers
|
2025-02-15 19:02:57 -05:00 |
|
Fireblade
|
3290bada2e
|
changes to how money and numbers are handled
|
2025-02-15 17:48:12 -05:00 |
|
Fireblade
|
4802128943
|
Replaced default voice with af_heart as af doesn't exist
|
2025-02-15 12:36:36 -05:00 |
|
Fireblade
|
8c457c3292
|
fixed final test
|
2025-02-15 09:49:15 -05:00 |
|
Fireblade
|
1a6e7abac3
|
fixed a bunch of tests
|
2025-02-15 09:40:01 -05:00 |
|
Fireblade
|
1a03ac7464
|
Fixed some tests
|
2025-02-14 15:00:47 -05:00 |
|
Fireblade
|
353fe79690
|
fix small error
|
2025-02-14 14:39:24 -05:00 |
|
Fireblade
|
842d056552
|
Merge branch 'streaming-word-timestamps' of https://github.com/fireblade2534/Kokoro-FastAPI into streaming-word-timestamps
|
2025-02-14 14:36:20 -05:00 |
|
Fireblade
|
9c1ced237b
|
Cleaned up some code and fixed an error in the readme
|
2025-02-14 14:36:17 -05:00 |
|
Fireblade2534
|
b71bab45d4
|
Merge branch 'master' into streaming-word-timestamps
|
2025-02-14 14:32:41 -05:00 |
|
Fireblade
|
34acb17682
|
Mostly completed work on refractoring a bunch of code as well as streaming word level time stamps
|
2025-02-14 14:29:47 -05:00 |
|
Fireblade
|
0b5ec320c7
|
streaming word level time stamps
|
2025-02-14 13:37:42 -05:00 |
|
Fireblade
|
4027768920
|
Started work on allowing streaming word level timestamps as well as transitioning the dev code so it uses a lot more from the open ai endpoint
|
2025-02-13 18:00:03 -05:00 |
|
Fireblade
|
7772dbc2e4
|
fixed no stream file writing
|
2025-02-13 16:12:51 -05:00 |
|
remsky
|
cfae7db7fc
|
fix: bump up audio quality settings in StreamingAudioWriter
CI / test (3.10) (push) Waiting to run
|
2025-02-13 00:22:14 -07:00 |
|
remsky
|
37ea01eaf9
|
fix: download_format option for audio response, handling in create_speech
|
2025-02-13 00:04:21 -07:00 |
|
remsky
|
f585185404
|
Update openai_compatible.py
|
2025-02-12 23:31:47 -07:00 |
|
remsky
|
694b7435f1
|
Merge branch 'master' into master
|
2025-02-12 23:31:13 -07:00 |
|
Fireblade
|
dbf2b99026
|
Simplifed generate_audio in tts_service mostly working (audio conversion does not work)
|
2025-02-12 22:42:41 -05:00 |
|
Fireblade
|
5b20602b8e
|
More work on timestamps (Does not maintain accuracy over multiple chunks)
|
2025-02-12 21:36:35 -05:00 |
|
Fireblade2534
|
6985f6ef99
|
more work on streaming timestamps (not working weird error) :(
|
2025-02-12 20:34:55 +00:00 |
|
Fireblade2534
|
91d370d97f
|
More working on streaming timestamps
|
2025-02-12 17:13:56 +00:00 |
|
Fireblade2534
|
51b6b01589
|
Fixed not returning enough values
|
2025-02-12 15:06:11 +00:00 |
|
Fireblade
|
5cc9d140fe
|
WIP
|
2025-02-11 22:36:19 -05:00 |
|
Fireblade
|
45cdb607e6
|
WIP
|
2025-02-11 22:32:10 -05:00 |
|
Fireblade
|
da1e280805
|
fix tests
|
2025-02-11 21:30:41 -05:00 |
|
Fireblade
|
7cb5957848
|
added optional pluralization normalization
|
2025-02-11 19:24:29 -05:00 |
|
Fireblade
|
09de389b29
|
Added normilization options
|
2025-02-11 19:09:35 -05:00 |
|
Krurst
|
1cf011b2eb
|
Update openai_compatible.py to fix lang_code
properly sets lang_code from api request, and applies config default if not set
|
2025-02-11 23:35:51 +08:00 |
|
Fireblade2534
|
64980b5bc8
|
made it so bytes vs bits are translated correctly
|
2025-02-11 15:18:10 +00:00 |
|
Fireblade2534
|
68cb097d9b
|
Merged from orgin/master
|
2025-02-11 14:05:14 +00:00 |
|
remsky
|
24b31ccbb5
|
-Fixed espeak engagement on gpu
CI / test (3.10) (push) Waiting to run
-Add default voice code setting and update language code resolution logic
|
2025-02-11 04:49:48 -07:00 |
|
Fireblade
|
ab1c21130e
|
Made the api use the normalizer, fixed the wrong version of espeak, added better normilzation, improved the sentence splitting, fixed some formatting
|
2025-02-10 21:45:52 -05:00 |
|
remsky
|
8ed2f2afb6
|
Add model listing and retrieval endpoints with tests
|
2025-02-09 20:55:21 -07:00 |
|
remsky
|
d73ed87987
|
Update handling in generate_captioned_speech to stream immediately, templink for caption file, and add unit tests for captioned speech generation
|
2025-02-09 20:26:59 -07:00 |
|
remsky
|
a91e0fe9df
|
Ruff check + formatting
|
2025-02-09 18:32:17 -07:00 |
|
remsky
|
af0e6dad6e
|
espeak-loader broken link fix, invalid pipeline state
|
2025-02-08 20:36:50 -07:00 |
|