Commit graph

184 commits

Author SHA1 Message Date
Fireblade2534
ec947a7b54
Merge branch 'master' into volume_setting 2025-06-16 12:43:26 -04:00
Fireblade2534
cd82dd0735 Added a volume multiplier as a request parameter 2025-06-16 16:39:30 +00:00
remsky
1e15433d24
Merge pull request #311 from mbailey/master 2025-06-13 07:58:14 -06:00
Fireblade2534
d7d90cdc9d simplified some normalization and added more tests 2025-06-12 16:00:06 +00:00
Lukin
f7fb9c524a Refactor TTS service tests: Update test_get_voice_path_combined to verify path format for combined voices, removing mock for os.path.join and enhancing assertions for path validation. 2025-06-01 10:35:59 +08:00
Lukin
0b2260602a Update TTS service tests: Enhance test_get_voice_path_combined by mocking os.path.join to ensure correct path generation for combined voices. 2025-06-01 10:28:35 +08:00
Lukin
888e3121ff Refactor text normalization: Move handling of problematic symbols to occur after number and money processing to improve accuracy in text normalization. 2025-06-01 10:18:24 +08:00
Lukin
84d2a4d806 Enhance TTS text processing: Implement pause tag handling in smart_split, allowing for better audio chunk generation with pauses. Update related tests to validate new functionality and ensure compatibility with existing features. 2025-05-30 23:06:41 +08:00
Lukin
ab8ab7d749 Refactor audio processing and text normalization: Update audio normalization to use absolute amplitude threshold, enhance streaming audio writer with MP3 container options, and improve text normalization by stripping spaces and handling special characters to prevent audio artifacts. 2025-05-30 22:52:58 +08:00
Fireblade2534
b89da1ff28 Make the code cleaner and add tests 2025-05-28 14:53:00 +00:00
jiaohuix
9c279f2b5e feat(text): add Chinese punctuation-based sentence splitting for better TTS 2025-05-26 15:30:03 +08:00
JCallicoat
75963c4aeb Add a volume multiplier setting
Allow configuring output volume via multiplier applied to np array of
audio chunk.

Defaults to 1.0 which is no-op.

Fixes #110
2025-05-22 07:02:34 -05:00
Mike Bailey
b0f46276eb
Update paths.py
Use root_dir instead of /app

This was breaking things for me when I started the app from a script that was not in the root_dir
2025-05-11 01:00:41 +10:00
Fireblade2534
243d98e339
Merge branch 'remsky:master' into master 2025-05-09 09:08:55 -04:00
Fireblade2534
1ac9924c3e Added some better safty checks to captioned speech 2025-05-09 13:04:53 +00:00
Fireblade2534
36197b9266
Merge pull request #304 from fireblade2534/Simple-phenome-fixes
Fixed phenomes
2025-05-05 22:03:52 -04:00
Fireblade2534
acee979225 Fixed phenomes 2025-05-02 13:34:08 +00:00
Fireblade2534
c80d4feb69 Fixed formatting 2025-05-01 18:16:36 +00:00
Fireblade2534
f89b76d5d6 Added more tests 2025-05-01 18:02:35 +00:00
Fireblade2534
48e096b441 Massivly improved how number, time, and money normalization 2025-05-01 17:53:01 +00:00
Fireblade2534
d004b6d304
Apply suggestions from copilot
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
2025-04-04 19:06:59 -04:00
Fireblade2534
7f0e06ea6b
Update normalizer.py 2025-04-04 19:06:13 -04:00
remsky
afa879546c CONTRIBUTING + Ruff format 2025-04-04 16:58:07 -06:00
remsky
447f9d360c Ruff check 2025-04-04 16:50:46 -06:00
remsky
65f6b979c3 Enhance temp file handling with error tracking and update Docker Compose to run as non-root user
Some checks failed
CI / test (3.10) (push) Has been cancelled
2025-03-29 17:01:15 -06:00
Fireblade2534
d712308f98 Fixes relating to parsing money and tests. Also readme stuff 2025-03-21 18:03:09 +00:00
Fireblade2534
c24aeefbb2 Aculy fixed tests this time 2025-03-20 19:15:07 +00:00
Fireblade2534
c902b2ca0d probably fix tests 2025-03-20 16:27:18 +00:00
Fireblade2534
8f23bf53a4 Inital test commit of segfault fixes 2025-03-20 16:20:28 +00:00
remsky
0d7570ab50
Merge pull request #240 from fireblade2534/fixes
Some checks are pending
CI / test (3.10) (push) Waiting to run
2025-03-18 04:27:17 -06:00
Fireblade
9f9e9b601e Fixes not returning a download link if streaming is off and return_download_link is true 2025-03-13 16:23:49 -04:00
Fireblade2534
acb7d05515
Merge branch 'master' into master 2025-03-12 11:17:44 -04:00
remsky
e4744f5545
Merge pull request #235 from fireblade2534/fixes 2025-03-12 02:22:04 -06:00
Fireblade
aa403f2070 Adds the ability to subtract voices 2025-03-11 14:28:48 -04:00
Fireblade2534
dafc87ddef
Merge pull request #199 from blakkd/master
Some checks failed
CI / test (3.10) (push) Has been cancelled
converted CRLF ending lines to LF ones in api/src/structures/custom_responses.py
2025-03-10 18:14:29 -04:00
Cong Nguyen
9a9bc4aca9 added support for mps on mac with apple silicon 2025-03-10 11:58:45 +11:00
Fireblade2534
f2c5bc1b71
Merge branch 'remsky:master' into fixes 2025-03-02 21:39:17 -05:00
Fireblade
b3d5f4de08 fixes and corrections to code that didn't cause errors but didn't really make sense 2025-03-02 21:36:34 -05:00
Fireblade2534
d67570ab21
Merge pull request #210 from fireblade2534/preserve-custom-phenomes
This fix allows for inputing custom pronuncations through text. For example: "This is a test of a [bla bla](/ðɪs ɪz ˈoʊnli ɐ tˈɛst/) system." It ensures that normalization does not affect custom prnouncations
2025-03-02 14:37:07 -05:00
Fireblade2534
43576c4a76
Remove random 1 2025-03-01 12:45:41 -05:00
Fireblade
226a75e782 fixes the low quality fix not working properly 2025-02-28 21:57:33 -05:00
Fireblade
f415ce7109 don't replace brackets as that is handled in misaki 2025-02-28 21:39:12 -05:00
Fireblade
906cf77a65 preserve custom phenomes 2025-02-28 21:37:46 -05:00
Fireblade
9247bc3a12 notremoved the rate argument which apperently means bitrate 2025-02-26 21:51:00 -05:00
Fireblade
980bc5b4a8 Fix low quality because audio was being encoded at a lower bitrate 2025-02-26 20:52:38 -05:00
blakkd
3c5029f801 converted CRLF ending lines to LF ones in api/src/structures/custom_responses.py
let ruff organise the imports
2025-02-24 02:11:48 +01:00
Fireblade
5de3cace3b Fix some tests and allow running the docker container offline 2025-02-22 15:17:28 -05:00
Fireblade
c1207f085b Merge remote-tracking branch 'upstream/master' into streaming-word-timestamps 2025-02-22 14:58:28 -05:00
remsky
39cc056fe2
Merge pull request #179 from fireblade2534/normalization-changes
Some checks failed
CI / test (3.10) (push) Has been cancelled
2025-02-21 20:00:15 -07:00
Fireblade
c5a3e13670 Converted the stream writer to use pyav 2025-02-19 23:10:51 -05:00