running-locally: refresh for kestrel 0.3.0 (Apple Silicon + Blackwell + Windows + Jetson Thor)#12
Merged
Merged
Conversation
Photon (kestrel) 0.3.0 ships on four new platforms beyond Linux NVIDIA Ampere–Hopper: Apple Silicon (M-series Macs), Windows AMD64, NVIDIA Blackwell (B200 + RTX PRO 6000), and Jetson Thor (JP7). This rewrite: - Reframes the lead and Requirements section to cover both NVIDIA GPU and Apple Silicon paths. - Renames the supported-hardware section to 'Supported Hardware' and splits it into NVIDIA Server / Datacenter, NVIDIA Workstation / Desktop, Apple Silicon, and NVIDIA Edge subsections — adds B200 to Datacenter, RTX PRO 6000 to Workstation, an Apple Silicon table, and Thor (JP7) to Edge. - Adds an 'Apple Silicon Setup' section (one-line install). - Restructures 'Jetson Setup' into JP7 / Thor (standard PyPI torch + venv-resident nvidia-cu13 + nvpl libs) and JP6 / Orin (NVIDIA's custom torch wheel + /usr/local/cuda LD_LIBRARY_PATH path). - Updates the Performance section to call out concrete B200, H100, and M5 Max throughput numbers. - Updates the 'Hugging Face Transformers' footnote so Apple Silicon is no longer characterized as 'non-NVIDIA hardware where Photon doesn't work' — Photon does work on Apple Silicon now.
Round of polish on the previous commit's running-locally rewrite:
- Drop the standalone 'Apple Silicon Setup' section. It just
repeated 'pip install moondream' under a new heading and ran a
`print(md.__version__)` verify that proves nothing about
whether Photon's Metal backend actually loaded. The supported-
hardware Apple Silicon table already covers the platform; the
one extra fact ('no NVIDIA CUDA, no Triton, no extra setup
beyond pip install') folds in there.
- Merge 'Server / Datacenter' and 'Workstation / Desktop' into
one 'NVIDIA GPU' table. The single-row Workstation table
('RTX PRO 6000 96 GB') read as a content stub.
- Tighten the Thor LD_LIBRARY_PATH bash. The previous
`find | grep -E '/(nvidia|nvpl)/' | tr | sed` pipeline was
overkill — just point at the three actual lib dirs torch needs
(verified on a real Thor box: `nvidia/cu13/lib`, `nvidia/cudnn/lib`,
`nvpl/lib`).
- Replace the multi-line `python3 -c"..."` Jetson verify block
with a one-liner that prints torch version + GPU name, plus a
plain-English note about what to expect ('NVIDIA Thor' / 'Orin')
and what to fix if it doesn't work.
- Replace the prose Performance bullets with a comparison table
pulling concrete numbers from PERFORMANCE.md (B200, H100,
RTX PRO 6000, M5 Max — req/s for both Moondream 2 and 3).
The kestrel docs are the authoritative source for the Jetson install flow (it's the package shipping the LD_LIBRARY_PATH / NVIDIA-CUDA plumbing). The customer-facing version here covers the common path intentionally tightly; for extra troubleshooting bits (cuSPARSELt errors, missing cuda-cupti / libnvtoolsext1 on minimal images, etc.), point readers at the canonical kestrel guide.
The Moondream 3 section says only NVIDIA GPUs with 24GB+ are supported, with 'quantized and Apple Silicon versions coming soon.' True for the Transformers path, but a reader skimming might come away thinking MD3 doesn't run on Apple Silicon at all. Photon (kestrel 0.3.0) does support MD3 on M-series Macs with >=24GB unified memory. Adjust the wording to scope the 'NVIDIA only' claim to the Transformers route specifically, and point readers at Photon for the Apple Silicon path.
The Test deployment workflow was pinned at Node 18, while prod deploy.yml uses Node 24.2.0. `@easyops-cn/docusaurus-search-local` references the `File` global (added in Node 20), so the test job has been failing on every PR since the plugin started using it (visible on PR #6 in April). Bringing test-deploy to the same node-version as prod fixes the test job and removes the prod-vs- test-CI version skew.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Refreshes `docs/running-locally.md` for the kestrel 0.3.0 / Photon release. The previous version was NVIDIA-Ampere-through-Hopper-only and Jetson-Orin-only — kestrel 0.3.0 ships four new platforms simultaneously.
What changed
Cross-PR coordination
Test plan