Kokoro-FastAPI/api/src/structures/schemas.py

from enum import Enum
from typing import Literal

from pydantic import Field, BaseModel


class TTSStatus(str, Enum):
    PENDING = "pending"
    PROCESSING = "processing"
    COMPLETED = "completed"
    FAILED = "failed"
    DELETED = "deleted"  # For files removed by cleanup


# OpenAI-compatible schemas
class OpenAISpeechRequest(BaseModel):
    """Request body for the OpenAI-compatible speech endpoint."""

    model: Literal["tts-1", "tts-1-hd", "kokoro"] = "kokoro"
    input: str = Field(..., description="The text to generate audio for")
    voice: str = Field(
        default="af",
        description="The voice to use for generation. Can be a base voice or a combined voice name.",
    )
    response_format: Literal["mp3", "opus", "aac", "flac", "wav", "pcm"] = Field(
        default="mp3",
        description="The format to return audio in. Supported formats: mp3, opus, flac, wav. AAC and PCM are not currently supported.",
    )
    speed: float = Field(
        default=1.0,
        ge=0.25,
        le=4.0,
        description="The speed of the generated audio. Select a value from 0.25 to 4.0.",
    )
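
The schema above can be exercised on its own to see which payloads it accepts. The following is a minimal sketch, not part of the repository: it repeats a trimmed copy of `OpenAISpeechRequest` (descriptions mostly omitted) so it runs standalone, and `is_valid_request` is a hypothetical helper introduced only for illustration. Note that the `Literal` type still admits "aac" and "pcm", so any rejection of those formats would have to happen elsewhere; that is an assumption about the rest of the codebase.

```python
from typing import Literal

from pydantic import BaseModel, Field, ValidationError


# Trimmed copy of the schema above, repeated so this sketch runs on its own.
class OpenAISpeechRequest(BaseModel):
    model: Literal["tts-1", "tts-1-hd", "kokoro"] = "kokoro"
    input: str = Field(..., description="The text to generate audio for")
    voice: str = Field(default="af")
    response_format: Literal["mp3", "opus", "aac", "flac", "wav", "pcm"] = Field(
        default="mp3"
    )
    speed: float = Field(default=1.0, ge=0.25, le=4.0)


def is_valid_request(payload: dict) -> bool:
    """Hypothetical helper: True if the payload passes schema validation."""
    try:
        OpenAISpeechRequest(**payload)
        return True
    except ValidationError:
        return False


# Only "input" is required; every other field falls back to its default.
request = OpenAISpeechRequest(input="Hello world")

# The ge/le bounds are inclusive, so 0.25 and 4.0 pass while 5.0 is rejected.
in_range = is_valid_request({"input": "Hello", "speed": 4.0})
out_of_range = is_valid_request({"input": "Hello", "speed": 5.0})

# Values outside the Literal choices fail validation before any audio work starts.
unknown_format = is_valid_request({"input": "Hello", "response_format": "ogg"})
```

Because validation happens when the model is constructed, a FastAPI route that takes this model as its body would typically reject such payloads before the handler runs.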