DGX Spark: Running Production AI on a $3,000 Desktop Local vLLM inference with Hermes Agent, benchmark results, and the economics of running a private LLM node on a desktop. AI · Benchmarks · Infrastructure