# Kokoro-FastAPI/docker-compose.yml

name: kokoro-fastapi
services:
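  # Fetches the Kokoro-82M model repo from Hugging Face (via git-lfs) into
  # ./Kokoro-82M and touches a .cloned sentinel file once it is ready.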
  model-fetcher:
    image: datamachines/git-lfs:latest
    environment:
      - SKIP_MODEL_FETCH=${SKIP_MODEL_FETCH:-false}
    volumes:
      - ./Kokoro-82M:/app/Kokoro-82M
    working_dir: /app/Kokoro-82M
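    # First run: clone the model repo; subsequent runs: pull the latest main.
    # SKIP_MODEL_FETCH=true skips both and only creates the sentinel file.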
    command: >
      sh -c "
        if [ \"$$SKIP_MODEL_FETCH\" = \"true\" ]; then
          echo 'Skipping model fetch...' && touch .cloned;
        else
          rm -f .git/index.lock;
          if [ -z \"$(ls -A .)\" ]; then
            git clone https://huggingface.co/hexgrad/Kokoro-82M .
            touch .cloned;
          else
            rm -f .git/index.lock && \
            git checkout main && \
            git pull origin main && \
            touch .cloned;
          fi;
        fi;
        tail -f /dev/null
      "
    healthcheck:
      # Passes once the .cloned sentinel created by the command above exists,
      # so dependent services start only after the model repo is in place.
      test: ["CMD", "test", "-f", ".cloned"]
      interval: 5s
      timeout: 2s
      retries: 300
      start_period: 1s
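
  # Main TTS service: FastAPI server (GPU image) exposing the API on port 8880;
  # waits for model-fetcher to report healthy before starting.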
  kokoro-tts:
    image: ghcr.io/remsky/kokoro-fastapi-gpu:v0.0.5post1
    # Uncomment below (and comment out above) to build from source instead of using the released image
    # build:
    #   context: .
    volumes:
      - ./api/src:/app/api/src
      - ./Kokoro-82M:/app/Kokoro-82M
    ports:
      - "8880:8880"
    environment:
      - PYTHONPATH=/app:/app/Kokoro-82M
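    # Reserve one NVIDIA GPU for this container; assumes the NVIDIA Container
    # Toolkit is installed on the host so Docker can expose the GPU.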
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: 1
              capabilities: [gpu]
    healthcheck:
      test: ["CMD", "curl", "-f", "http://localhost:8880/health"]
      interval: 10s
      timeout: 5s
      retries: 30
      start_period: 30s
    depends_on:
      model-fetcher:
        condition: service_healthy

  # Gradio UI service [Comment out everything below if you don't need it]
  gradio-ui:
    image: ghcr.io/remsky/kokoro-fastapi-ui:v0.0.5post1
    # Uncomment below (and comment out above) to build from source instead of using the released image
    # build:
    #   context: ./ui
    ports:
      - "7860:7860"
    volumes:
      - ./ui/data:/app/ui/data
      - ./ui/app.py:/app/app.py  # Mount app.py for hot reload
    environment:
      - GRADIO_WATCH=True  # Enable hot reloading
      - PYTHONUNBUFFERED=1  # Ensure Python output is not buffered
    depends_on:
      kokoro-tts:
        condition: service_healthy
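
# Usage (a sketch, assuming Docker Compose v2 and an NVIDIA-capable host):
#   docker compose up                         # fetch the model, start the API (:8880) and the Gradio UI (:7860)
#   SKIP_MODEL_FETCH=true docker compose up   # reuse an existing ./Kokoro-82M checkout
#   curl http://localhost:8880/health         # same endpoint the kokoro-tts healthcheck polls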