Kokoro-FastAPI

mirror of https://github.com/remsky/Kokoro-FastAPI.git synced 2025-08-05 16:48:53 +00:00

Author	SHA1	Message	Date
Fireblade2534	d67570ab21	Merge pull request #210 from fireblade2534/preserve-custom-phenomes This fix allows for inputing custom pronuncations through text. For example: "This is a test of a [bla bla](/ðɪs ɪz ˈoʊnli ɐ tˈɛst/) system." It ensures that normalization does not affect custom prnouncations	2025-03-02 14:37:07 -05:00
Fireblade2534	43576c4a76	Remove random 1	2025-03-01 12:45:41 -05:00
Fireblade	226a75e782	fixes the low quality fix not working properly	2025-02-28 21:57:33 -05:00
Fireblade	f415ce7109	don't replace brackets as that is handled in misaki	2025-02-28 21:39:12 -05:00
Fireblade	906cf77a65	preserve custom phenomes	2025-02-28 21:37:46 -05:00
Fireblade	9247bc3a12	notremoved the rate argument which apperently means bitrate	2025-02-26 21:51:00 -05:00
Fireblade	980bc5b4a8	Fix low quality because audio was being encoded at a lower bitrate	2025-02-26 20:52:38 -05:00
Fireblade	5de3cace3b	Fix some tests and allow running the docker container offline	2025-02-22 15:17:28 -05:00
Fireblade	c1207f085b	Merge remote-tracking branch 'upstream/master' into streaming-word-timestamps	2025-02-22 14:58:28 -05:00
remsky	39cc056fe2	Merge pull request #179 from fireblade2534/normalization-changes Some checks failed CI / test (3.10) (push) Has been cancelled Details	2025-02-21 20:00:15 -07:00
Fireblade	c5a3e13670	Converted the stream writer to use pyav	2025-02-19 23:10:51 -05:00
Fireblade	4ee4d36822	Fixes a couple of issues with audio triming and prevents errors with single voice weights	2025-02-18 18:12:49 -05:00
Fireblade	7f15ba8fed	Add a .gitattributes	2025-02-18 17:44:03 -05:00
Fireblade	f2b2f41412	fixed wrong varible name bug	2025-02-16 17:07:41 -05:00
Fireblade	cb22aab239	Fix streaming a wav file with captions not reaturning any captions (This is only a problem because wav streaming does not acually work)	2025-02-16 16:49:33 -05:00
Fireblade	e3dc959775	Simplify code so erverything uses AudioChunks	2025-02-16 15:37:01 -05:00
Fireblade	9c0e328318	made it skip text normalization when using other languages as it only supports english	2025-02-16 14:16:18 -05:00
Fireblade	41598eb3c5	better parsing for times and phone numbers	2025-02-15 19:02:57 -05:00
Fireblade	3290bada2e	changes to how money and numbers are handled	2025-02-15 17:48:12 -05:00
Fireblade	4802128943	Replaced default voice with af_heart as af doesn't exist	2025-02-15 12:36:36 -05:00
Fireblade	8c457c3292	fixed final test	2025-02-15 09:49:15 -05:00
Fireblade	1a6e7abac3	fixed a bunch of tests	2025-02-15 09:40:01 -05:00
Fireblade	1a03ac7464	Fixed some tests	2025-02-14 15:00:47 -05:00
Fireblade	353fe79690	fix small error	2025-02-14 14:39:24 -05:00
Fireblade	842d056552	Merge branch 'streaming-word-timestamps' of https://github.com/fireblade2534/Kokoro-FastAPI into streaming-word-timestamps	2025-02-14 14:36:20 -05:00
Fireblade	9c1ced237b	Cleaned up some code and fixed an error in the readme	2025-02-14 14:36:17 -05:00
Fireblade2534	b71bab45d4	Merge branch 'master' into streaming-word-timestamps	2025-02-14 14:32:41 -05:00
Fireblade	34acb17682	Mostly completed work on refractoring a bunch of code as well as streaming word level time stamps	2025-02-14 14:29:47 -05:00
Fireblade	0b5ec320c7	streaming word level time stamps	2025-02-14 13:37:42 -05:00
Fireblade	4027768920	Started work on allowing streaming word level timestamps as well as transitioning the dev code so it uses a lot more from the open ai endpoint	2025-02-13 18:00:03 -05:00
Fireblade	7772dbc2e4	fixed no stream file writing	2025-02-13 16:12:51 -05:00
remsky	cfae7db7fc	fix: bump up audio quality settings in StreamingAudioWriter Some checks are pending CI / test (3.10) (push) Waiting to run Details	2025-02-13 00:22:14 -07:00
remsky	37ea01eaf9	fix: download_format option for audio response, handling in create_speech	2025-02-13 00:04:21 -07:00
remsky	f585185404	Update openai_compatible.py	2025-02-12 23:31:47 -07:00
remsky	694b7435f1	Merge branch 'master' into master	2025-02-12 23:31:13 -07:00
Fireblade	dbf2b99026	Simplifed generate_audio in tts_service mostly working (audio conversion does not work)	2025-02-12 22:42:41 -05:00
Fireblade	5b20602b8e	More work on timestamps (Does not maintain accuracy over multiple chunks)	2025-02-12 21:36:35 -05:00
Fireblade2534	6985f6ef99	more work on streaming timestamps (not working weird error) :(	2025-02-12 20:34:55 +00:00
Fireblade2534	91d370d97f	More working on streaming timestamps	2025-02-12 17:13:56 +00:00
Fireblade2534	51b6b01589	Fixed not returning enough values	2025-02-12 15:06:11 +00:00
Fireblade	5cc9d140fe	WIP	2025-02-11 22:36:19 -05:00
Fireblade	45cdb607e6	WIP	2025-02-11 22:32:10 -05:00
Fireblade	da1e280805	fix tests	2025-02-11 21:30:41 -05:00
Fireblade	7cb5957848	added optional pluralization normalization	2025-02-11 19:24:29 -05:00
Fireblade	09de389b29	Added normilization options	2025-02-11 19:09:35 -05:00
Krurst	1cf011b2eb	Update openai_compatible.py to fix lang_code properly sets lang_code from api request, and applies config default if not set	2025-02-11 23:35:51 +08:00
Fireblade2534	64980b5bc8	made it so bytes vs bits are translated correctly	2025-02-11 15:18:10 +00:00
Fireblade2534	68cb097d9b	Merged from orgin/master	2025-02-11 14:05:14 +00:00
remsky	24b31ccbb5	-Fixed espeak engagement on gpu Some checks are pending CI / test (3.10) (push) Waiting to run Details -Add default voice code setting and update language code resolution logic	2025-02-11 04:49:48 -07:00
Fireblade	ab1c21130e	Made the api use the normalizer, fixed the wrong version of espeak, added better normilzation, improved the sentence splitting, fixed some formatting	2025-02-10 21:45:52 -05:00

1 2 3

145 commits