dgx-spark

Here are 139 public repositories matching this topic...

Avarok-Cybersecurity / atlas

Pure Rust Inference Engine

rust cuda transformers ssm mamba dgx openai-api llm-inference speculative-decoding gb10 nvfp4 dgx-spark

Updated Jun 2, 2026
Rust

spark-arena / sparkrun

sparkrun - launch, manage, and stop LLM inference workloads on NVIDIA DGX Spark systems

inference llama-cpp vllm sglang dgx-spark

Updated Jun 1, 2026
Python

albond / DGX_Spark_Qwen3.5-122B-A10B-AR-INT4

Qwen3.5-122B-A10B on DGX Spark: 28.3 → 51 tok/s (+80%)

cuda lossless mtp speedup performance-optimization vllm autoround dgx-spark qwen3-5 sm121 qwen3-5-122b-a10b

Updated May 9, 2026
Python

eelbaz / dgx-spark-vllm-setup

One-command vLLM installation for NVIDIA DGX Spark with Blackwell GB10 GPUs (sm_121 architecture)

machine-learning ai deep-learning gpu cuda pytorch nvidia arm64 blackwell llm vllm llm-inference gb10 dgx-spark

Updated Oct 28, 2025
Shell

eelbaz / dgx-spark-headless-sunshine

Headless remote desktop setup for NVIDIA DGX Spark using Sunshine streaming

remote-desktop dgx-spark

Updated Oct 25, 2025
Shell

hogeheer499-commits / strix-halo-guide

Reproducible local LLM setup and benchmark evidence for AMD Strix Halo / Ryzen AI MAX+ 395: 63-98.5 t/s direct Qwen MoE, 101.1 t/s MTP.

Updated Jun 2, 2026
Python

AEON-7 / vllm-dflash

DFlash vLLM for DGX Spark — Plug & Play Block-Diffusion Speculative Decoding

docker inference nvidia blackwell llm vllm qwen speculative-decoding block-diffusion nvfp4 dgx-spark dflash

Updated May 1, 2026
Python

jdaln / dgx-spark-inference-stack

Serve the home! Inference stack for your NVIDIA DGX Spark, aka the Grace Blackwell AI supercomputer on your desk. Mostly vLLM-based for now and single-Spark. For the not-so-rich buddies.

docker docker-compose cuda inference self-hosted llama model-serving mlops dgx generative-ai local-llm gb10 dgx-spark

Updated May 23, 2026
Shell

AEON-7 / comfyui-aeon-spark

Bleeding-edge ComfyUI for NVIDIA DGX Spark (GB10/Blackwell/sm_121a). CUDA 13 + SageAttention v3 (sm_121a) + NVFP4 + 14 custom-node packs + Flux 2 Dev / LTX 2.3 22B / ACE-Step v1.5 XL Turbo pre-bundled with abliterated text-encoder paths.

docker flux blackwell comfyui sageattention ltx-video ace-step nvfp4 dgx-spark sm-121a

Updated May 4, 2026
Shell

joeynyc / spark-doctor

Local diagnostic CLI for NVIDIA DGX Spark (GB10). Detects power caps, unified memory pressure, thermal risk, Docker/runtime issues, and validates vLLM/Ollama/llama.cpp/SGLang recipes.

cli nvidia diagnostics dgx llama-cpp vllm local-llm ollama sglang gb10 dgx-spark grace-blackwell nvidia-dgx-spark

Updated May 15, 2026
Python

bjk110 / SPARK_Qwen3.5-122B-A10B-NVFP4

vLLM + Qwen3.5-122B-A10B-NVFP4 on NVIDIA DGX Spark (GB10/SM121) — single-GPU NVFP4 W4A4 with MTP speculative decoding, self-contained Docker build

docker-compose dgx-spark vllm-server

Updated Mar 12, 2026
Python

DanTup / spark-evals

Some benchmark results of small models and quants that fit on DGX Spark

ai benchmarks llms dgx-spark

Updated May 28, 2026
Python

seanGSISG / dgx-spark-sunshine-setup

Headless remote desktop to your DGX Spark in crystal-clear 4K

remote-desktop remote-access sunshine dgx gb10 dgx-spark

Updated May 27, 2026
Shell

theshiphq / claw-spark

One-click AI agent setup for NVIDIA DGX Spark, Jetson, and RTX hardware. OpenClaw + Ollama, fully local.

amd gpu nvidia hetzner claw dgx-spark clawdbot moltbot openclaw nemoclaw

Updated Apr 16, 2026
Shell

Mekopa / whisperx-blackwell

GPU-accelerated WhisperX on NVIDIA Blackwell (SM_121) - DGX Spark compatible

audio docker machine-learning deep-learning gpu cuda pytorch nvidia speech-recognition transcription asr speaker-diarization dgx blackwell pyannote whisperx dgx-spark sm-121

Updated Apr 23, 2026
Python

CoconutMacaroon / blender-arm64

Blender for ARM64 Linux with CUDA/OptiX/Vulkan support

blender nvidia dgx-spark

Updated Mar 26, 2026
Shell

Entrpi / ds4-on-spark

antirez/ds4 (DwarfStar 4) on NVIDIA DGX Spark — install, benchmarks, and roofline analysis. Steady-state decode at ~95% of bandwidth ceiling; MTP and concurrency analyzed.

benchmark cuda inference moe llm gguf gb10 dgx-spark deepseek-v4-flash

Updated May 21, 2026
Shell

wshobson / minimax-dgx-spark

MiniMax M2 inference server for NVIDIA DGX Spark

opencode nvidia minimax nvidia-gpu llamacpp dgx-spark

Updated Jan 24, 2026
Jinja

botAGI / AGmind

Private AI stack in one command - GB10 arm64 dgx-spark

docker ai docker-compose gpu self-hosted nvidia arm64 ai-agents dify rag ai-tools vllm open-webui rag-pipeline ragflow ai-stack dgx-spark agmind

Updated Jun 2, 2026
Shell

parallelArchitect / sparkview

Operator-grade GPU monitor for NVIDIA GPUs with native GB10 / DGX Spark coherent UMA support — PSI pressure, clock detection, ConnectX-7 network layer

python monitoring gpu cuda tui nvidia psi unified-memory gb10 dgx-spark

Updated May 31, 2026
Python
