John Pace
  • Home
  • About
  • Contact

​​
​
Data Scientist, husband, father of 3 great daughters, 5x Ironman triathlon finisher, just a normal guy who spent a lot of time in school.
Let’s explore data science, artificial intelligence, machine learning, and other topics together.

DGX station Setup Tips and Tricks

2/17/2020

0 Comments

 
Picture
Picture

​The Nvidia DGX Station is a high-performance AI workstation that enables your Data Science team to get started quickly with the power of a data center in your office. With 4 Nvidia V100 32GB GPUs and  Nvidia Docker preloaded, there is plenty of computing power to allow multiple users to run their most challenging AI training projects. Additionally, the DGX Station allows you to download preconfigured and optimized Docker containers from the Nvidia GPU Cloud (NGC) to further decrease the time it takes to get started.  Our customers use their DGX Stations for projects involving Natural Language Processing (NLP), classification with algorithms such as XGBoost, image analysis of x-rays and MRIs, and predictive analytics.

Picture

The setup for the DGX Station (DGX-S) is a quick process, typically less than 30 minutes from power on until your first nvidia-docker run command.  As you do your install, here are a few tips that can ensure the process is as smooth as possible.  These tips come from our experience with installing and configuring DGX-S.

1.  The Setup Guide says the Ethernet ports are configured for DHCP by default. We have found this is typically not the case, but that they are set to “manual”.  Be sure to edit the /etc/network/interfaces file (or configure the ports using the GUI) and either set the ports to DHCP or a static IP address.  As with any server, best practice is static IP.  Time-saving tip: Make sure you have a monitor, keyboard, and mouse handy when first booting so you can configure the interfaces.  After that, everything can be done with ssh.

​2.  Sometimes the DGX-S will ship with nvidia-docker v1. If so, it needs to be upgraded to nvidia-docker v2.  You can check you nvidia-docker version using:

    
If the command returns 2.0.x, then your system already contains the upgrade to the Nvidia Container Runtime for Docker and no further action is needed.  If it does not return 2.0.x, then perform the steps listed here.

3.  One of the most “creative” errors that can occur is this. It can happen when trying to pull new docker containers from NGC:

    

​“server misbehaving” – someone had fun with that error.  This is typically caused by a DNS issue.  Make sure the DNS servers are correctly configured and it should resolve the problem.  Setting the DNS server to 8.8.8.8 is the most common fix.

​4.  Finally, make sure you remove the piece of foam that is inside the DGX-S that keeps the GPUs from moving during shipping. It’s easy to overlook this step!
​
Picture

If you have any questions, I'd love to answer them. Be sure to follow me on Twitter (@pacejohn) and on LinkedIn (https://www.linkedin.com/in/john-pace-phd-20b87070/).

​#nvidia #dgx #dgxworkstation #artificialintelligence #ai #machinelearning #ngc #v100 #gpu #docker #nvidia-docker
0 Comments



Leave a Reply.

    Archives

    December 2020
    November 2020
    September 2020
    August 2020
    June 2020
    May 2020
    April 2020
    March 2020
    February 2020
    January 2020
    December 2019
    November 2019
    October 2019
    September 2019
    August 2019
    May 2019
    April 2019
    March 2019
    April 2018
    March 2018
    January 2018
    November 2017

    Tweets by pacejohn
Proudly powered by Weebly
  • Home
  • About
  • Contact