- Create GPU-specific startup script
- Set environment variables for GPU and project configuration
- Use uv to install GPU extras and run FastAPI server
- Implement OpenAI-compatible API key authentication
- Add configuration options for GPU instances, concurrency, and request handling
- Update README with authentication instructions
- Modify configuration and routing to support optional API key verification
- Enhance system information and debug endpoints to expose authentication status