After the relatively easy process of setting up the DGX Spark (ok, the Dell, but for ease of reference I’m just calling it a DGX Spark) it was time to configure the software side. A lot comes preinstalled, but there was more to add.

DGX on Stand
DGX on Stand

To make this server as well maintained as possible I decided everything had to be installed via Ansible like we do for our clients. Normally, our excellent infrastructure team would write the playbooks, but I did this over the winter break and I wanted to get more experience with Ansible myself after dabbling with it previously to set up my home network.

More so than learning Ansible, this was really an opportunity to learn Claude Code. My previous experience using ChatGPT to write an R package and using Copilot to develop a Kubernetes cluster left me underwhelmed. But one of my takeaways from that second talk was that people really love Claude Code and I should try it.

Joe Marlo helped me get up to speed with adding agents and skills. This was important, because when I asked Claude Code to generate the skills it did not make the required SKILL.md file and it completely missed the yaml frontmatter, even though it created agents properly.

At first, I was not enjoying myself using Claude. I felt the bird Homer Simpson used to automate his job which was entirely hitting the “y” key for “yes.”

Homer's Simpson's Drinking Bird
Homer’s Simpson’s Drinking Bird

But worse than that existential dread was that it did not write very good code. I felt I could do it better. But then Joe Marlo told me about Context7. It provides an MCP that delivers markdown documentation for just about every software framework out there. Insisting in both the CLAUDE.md file and individual skills that Claude should query the documentation any time it writes code dramatically improved the quality of the code.

Then I used the Claude chat interface to significantly improve the skills for Ansible, Caddy, code reviewing, Docker, Linux administration and unit tests. Importantly, I used the CLAUDE.md file to really stress upon Claude that it should use the skills while writing both the playbooks and the underlying templates such as the Caddyfile and Docker compose files. While waiting for Claude to finish individual tasks I probably spent too much time staring at LinkedIn.

So far I setup:

  • Caddy (in Docker) as a reverse proxy to access the DGX Dashboard which is otherwise only available from localhost or using NVIDIA Sync which I didn’t want my team to need
  • Made zsh the default shell for all new users
  • Created new users for members of my team
  • Added the machine to our Tailscale network (not running in Docker)

Now that I have the framework setup for Claude Code to generate the Ansible playbooks and underlying files, I will hand it off to my team to truly test the capabilities of using Claude Code collaboratively.

Then we will have Claude Code add at least the following:

  • Ollama
  • OpenWebUI
  • DistillKit
  • MergeKit
  • Some kind of containerized IDE beyond the JupyterLab already installed
  • Automated user management along with proper ssh keys for authentication

The other day I saw this tweet and it nicely summarized my feelings.

Tweet about Founders Using Claude Code Over the Winter Break
Tweet about Founders Using Claude Code Over the Winter Break

We even have internal trainings where more experienced users get the less experienced up and running with their agentic workflows. This is a training we will soon be offering our clients too.

After being initially lukewarm, seeing Claude Code in action got me excited to have my team make great use of the tool.

Related Posts

Jared Lander is the Chief Data Scientist of Lander Analytics a New York data science and AI firm, Adjunct Professor at Columbia University, Organizer of the New York Open Statistical Programming meetup and the New York and Government Data Science and AI Conferences and author of R for Everyone.

Posted in AI.
DGX on Stand

We recently got the Dell Pro Max GB10, which is Dell’s version of the Nvidia DGX Spark (yes, we are official Nvidia partners, so come talk to us about setting up a full on Nvidia setup) mini super computer. The specs are identical. Both machines have 128 GB of unified memory and the same GB10 Grace Blackwell superchip, 4 TB of storage and you can still combine two of them together. We got the Dell because I love Dell monitors and I really love my XPS Tablet which is a full on Windows machine and am excited to try out my new XPS laptop with an ARM processor (I do love Windows). It also doesn’t hurt that Dell has really good promotions for Amex Platinum cards, though you should be reading about that at The Points Guy.

The box for this computer was smaller than the box for my new laptop.

The box for the GB10 compared to a laptop box
The box for the GB10 compared to a laptop box

Unpacked, it is evident quite how tiny it is.

DGX on Stand
DGX on Stand

Continue reading

Related Posts

Jared Lander is the Chief Data Scientist of Lander Analytics a New York data science and AI firm, Adjunct Professor at Columbia University, Organizer of the New York Open Statistical Programming meetup and the New York and Government Data Science and AI Conferences and author of R for Everyone.

Posted in AI.