Commit graph

8 commits

Author SHA1 Message Date
Fireblade
3e6ee65482 Simple fixes and translations 2025-03-08 22:48:52 -05:00
CodePothunter
70c0d506de Add start-gpu.sh script for GPU-enabled FastAPI deployment
- Create GPU-specific startup script
- Set environment variables for GPU and project configuration
- Use uv to install GPU extras and run FastAPI server
2025-03-07 20:24:07 +08:00
CodePothunter
8fe85c3386 Delete start-gpu.sh 2025-03-07 15:20:04 +08:00
CodePothunter
5c8f941f06 Add API authentication and configuration improvements
- Implement OpenAI-compatible API key authentication
- Add configuration options for GPU instances, concurrency, and request handling
- Update README with authentication instructions
- Modify configuration and routing to support optional API key verification
- Enhance system information and debug endpoints to expose authentication status
2025-03-07 11:36:13 +08:00
Fireblade
5de3cace3b Fix some tests and allow running the docker container offline 2025-02-22 15:17:28 -05:00
David Li
2b9933479a Disable --reload on unicorn/fastapi to avoid pegging a CPU core
Signed-off-by: David Li <jiawei.davidli@gmail.com>
2025-02-13 20:12:53 -08:00
remsky
9198de2d95 Refactor project structure and update Dockerfiles for improved organization and dependency management 2025-02-04 05:18:18 -07:00
remsky
90c8f11111 Add CPU and GPU startup scripts; update dependencies in pyproject.toml 2025-01-24 23:46:46 -07:00