rvuyyuru2
44c62467ae
Adds support for creating weighted voice combinations
...
Implements a new method to parse weighted voice formulas and generate combined audio outputs based on specified weights.
This enhancement allows for more diverse audio generation by letting users specify multiple voices with respective weights, improving flexibility in voice management.
Updates voice processing logic in the relevant API routes to handle weighted formulas.
2025-01-25 20:54:21 +05:30
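Note: the commit above describes parsing a weighted voice formula and combining voices by weight. As a minimal sketch of that idea (not the code from this commit), the example below assumes a hypothetical `name*weight + name*weight` formula syntax and treats each voice as a plain list of floats. The function names `parse_weighted_formula` and `combine_voices` and the voice names are illustrative assumptions.

```python
from typing import Dict, List, Tuple


def parse_weighted_formula(formula: str) -> List[Tuple[str, float]]:
    """Parse 'voice_a*3 + voice_b*1' into (name, normalized weight) pairs.

    The 'name*weight' syntax is an assumption made for this sketch.
    """
    pairs: List[Tuple[str, float]] = []
    for term in formula.split("+"):
        term = term.strip()
        if not term:
            raise ValueError(f"Empty term in formula: {formula!r}")
        name, sep, weight_text = term.partition("*")
        name = name.strip()
        # A term without an explicit weight defaults to 1.0.
        weight = float(weight_text) if sep else 1.0
        if not name or weight <= 0:
            raise ValueError(f"Invalid term: {term!r}")
        pairs.append((name, weight))
    # Normalize so the weights sum to 1.
    total = sum(weight for _, weight in pairs)
    return [(name, weight / total) for name, weight in pairs]


def combine_voices(formula: str, voices: Dict[str, List[float]]) -> List[float]:
    """Return the weighted sum of the voice vectors named in the formula."""
    weighted = parse_weighted_formula(formula)
    length = len(voices[weighted[0][0]])
    combined = [0.0] * length
    for name, weight in weighted:
        vector = voices[name]
        if len(vector) != length:
            raise ValueError(f"Voice {name!r} has a mismatched length")
        for i, value in enumerate(vector):
            combined[i] += weight * value
    return combined


if __name__ == "__main__":
    # Hypothetical two-dimensional voice vectors, for illustration only.
    demo_voices = {"voice_a": [1.0, 0.0], "voice_b": [0.0, 1.0]}
    print(combine_voices("voice_a*3 + voice_b*1", demo_voices))
```

Normalizing the weights means a formula stays valid whether the weights are written as ratios (3 and 1) or as fractions (0.75 and 0.25).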
remsky
ee1f7cde18
Add async audio processing and semantic chunking support; flattened static audio trimming
2025-01-24 04:06:47 -07:00
remsky
ba577d348e
Enhance web player information, adjust text chunk size, update audio wave settings, and implement OpenAI model mappings
2025-01-23 04:11:31 -07:00
remsky
8e8f120a3e
Update configuration to disable local voice saving, enhance voice validation logic, and remove deprecated test file
2025-01-23 02:00:46 -07:00
remsky
df4cc5b4b2
- Adjust testing framework for new model
...
- Add web player support: include static file serving and HTML interface for TTS
2025-01-22 21:11:47 -07:00
remsky
d50214d3be
Enable ONNX GPU support in Docker configurations and refactor model file handling
2025-01-22 05:00:38 -07:00
remsky
4a24be1605
Refactor and update model loading and configuration: adjust model loading device, add async streaming examples, and remove unused warmup service.
2025-01-22 02:33:29 -07:00
remsky
cf72e4ed2b
Add interruptible streams
2025-01-13 23:25:06 -07:00
remsky
22752900e5
Ruff checks, CI fix
2025-01-13 20:15:46 -07:00
remsky
926ea8cecf
Refactor Docker configurations and update test mocks for development routers
2025-01-10 22:03:16 -07:00
remsky
e8c1284032
Ruff format + fix
2025-01-09 18:41:44 -07:00
remsky
130b084cce
- Added support for combining voices via any endpoint
...
- Updated the `process_voices` function to handle both string and list formats for voice input.
2025-01-07 03:50:08 -07:00
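Note: the commit above says `process_voices` now accepts voice input as either a string or a list. A minimal sketch of that kind of normalization follows. It is not the project's `process_voices`: the helper name `normalize_voice_input` and the `+` separator for string input are assumptions made for illustration.

```python
from typing import List, Union


def normalize_voice_input(voice: Union[str, List[str]]) -> List[str]:
    """Return a clean list of voice names from a string or a list.

    Assumption for this sketch: a string may name several voices
    joined by '+', e.g. 'voice_a+voice_b'.
    """
    if isinstance(voice, str):
        names = voice.split("+")
    elif isinstance(voice, list):
        names = voice
    else:
        raise TypeError("voice must be a string or a list of strings")
    # Drop non-string entries and blank names, and trim whitespace.
    cleaned = [n.strip() for n in names if isinstance(n, str) and n.strip()]
    if not cleaned:
        raise ValueError("no voice names provided")
    return cleaned


if __name__ == "__main__":
    print(normalize_voice_input("voice_a + voice_b"))
    print(normalize_voice_input(["voice_a", "voice_b"]))
```

Both calls return the same list, so later code only has to handle one input shape.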
remsky
720c1fb97d
- Update soundfile version
...
- Alignment with streaming standards
- Audio processing config settings
- More comprehensive model warmup
- Minor model improvements
- Enhanced testing and benchmarking
- Cool ASCII logo
2025-01-06 03:32:41 -07:00
remsky
4c6cd83f85
Swapped generator to preprocessing
2025-01-04 22:23:59 -07:00
remsky
0e9f77fc79
WIP: OpenAI-compatible streaming
2025-01-04 17:55:36 -07:00
remsky
f1eb1d9590
First streaming attempt
2025-01-04 17:54:54 -07:00
remsky
e4d8e74738
WIP, Functional for CPU: Updated for ONNX runtime support, Dockerfile and TTS Service
2025-01-03 00:53:41 -07:00
remsky
f051984805
Ruff Check + Format
2025-01-01 21:50:41 -07:00
remsky
05e1e30c47
- modified voice loading to copy on init
...
- adjustments to the combine voices functionality
- error handling and analysis
2024-12-31 18:55:26 -07:00
Emmanuel Schmidbauer
510b01cc90
add ability to combine voices
2024-12-31 10:30:12 -05:00
remsky
4123ab0891
Refactor TTS API and enhance testing setup with coverage and logging improvements
2024-12-31 02:55:51 -07:00
remsky
c11a6ea6ea
Enhance TTS API with logging, voice pack loading, and schema updates
2024-12-31 01:57:00 -07:00
remsky
8ce8334345
- Complete TTS endpoint replacement with an OpenAI-compatible endpoint
...
- Removed output directory and updated configuration settings
- Added benchmarking for entire novel
2024-12-31 01:52:16 -07:00