Quick-Start Checklist¶
Deploy the HCLS AI Factory in 30 minutes. This checklist extracts the critical steps from the full Deployment Guide.
Prerequisites¶
- Hardware: NVIDIA DGX Spark (or equivalent with GB10 GPU, 128GB unified memory)
- Storage: 500GB+ free space for genomics data and models
- Network: Internet access for pulling containers and reference data
- Software: Docker 24+, Docker Compose 2.20+, Git
Step 1: Clone the Repository¶
Step 2: Environment Setup¶
Required variables:
| Variable | Description |
|---|---|
ANTHROPIC_API_KEY |
Claude API key for RAG chat |
NGC_API_KEY |
NVIDIA NGC key for BioNeMo models |
Step 3: Start Core Services¶
Expected services:
genomics-pipeline— Parabricks + DeepVariantrag-service— Milvus + Claude integrationdrug-discovery— BioNeMo MolMIM + DiffDockgrafana— Monitoring dashboard (port 3000)streamlit— Chat UI (port 8501)
Step 4: Verify GPU Access¶
You should see your GPU(s) listed with available memory.
Step 5: Load Reference Data¶
# Download reference genome (one-time, ~15GB)
./scripts/download_reference.sh
# Load ClinVar and AlphaMissense annotations
./scripts/load_annotations.sh
Step 6: Run a Test Pipeline¶
# Run with synthetic test data
./scripts/run_demo.sh --mock
# Expected output: variant calls in output/demo/
Step 7: Access the UI¶
Open your browser:
| Service | URL | Purpose |
|---|---|---|
| Streamlit Chat | http://localhost:8501 |
Query variants with Claude |
| Grafana | http://localhost:3000 |
Monitor pipeline metrics |
| API Docs | http://localhost:8080/docs |
REST API reference |
Troubleshooting¶
Services won't start¶
GPU not detected¶
Out of memory¶
Reduce batch sizes in .env:
Next Steps¶
- Full deployment: Deployment Guide
- Run the demo: Demo Guide
- Understand the architecture: White Paper
Success Criteria¶
You're ready when:
- All Docker services show
Upstatus - GPU is visible in containers
- Streamlit chat responds to queries
- Grafana shows pipeline metrics
Total time: ~30 minutes (excluding reference data download)
Need help? Open an issue on GitHub.