Before training deep learning models on your local or remote computer you should make sure you have the latest applicable prerequisites installed. This includes making sure the latest drivers and libraries for your NVIDIA GPU (if you have one). You should also ensure you have installed Python and Python libraries such as NumPy, SciPy, Python support for Visual Studio / Visual Studio Code, and appropriate deep learning frameworks such as Microsoft Cognitive Toolkit (CNTK), TensorFlow, PyTorch, Caffe2, MXNet, Keras, Theano and/or Chainer.
Note
Software introduction in the following subsectons is excerpted from their homepages.
Setting up deep learning and machine learning software as well as their dependencies is not an easy task. After you have installed NVIDIA GPU driver, CUDA, cuDNN and Python, we recommend that you use the one-click installer to install them automatically across Windows, macOS and Linux.
Deep learning frameworks take advantage of NVIDIA GPU to let machines learn at a speed, accuracy, and scale towards true artificial intelligence. If your computer has NVIDIA GPU cards, please visit here or try OS update to install the latest driver.
CUDA is a parallel computing platform and programming model invented by NVIDIA. It enables dramatic increases in computing performance by harnessing the power of the GPU. Currently, CUDA Toolkit 9.0 is required by latest version of deep learning frameworks.
To install CUDA
- Visit this site, download CUDA and install it.
- Make sure to install the CUDA runtime libraries, and then add CUDA binary path to the %PATH% or $PATH environment variable.
- On Windows, this path is "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v9.0\bin" by default.
cuDNN (CUDA Deep Neural Network library) is a GPU-accelerated library of primitives for deep neural networks by NVIDIA. cuDNN v6 is required by latest deep learning frameworks.
To install cuDNN
- Visit here to download and install v7.0.5 for CUDA 9.0 package.
- Ensure to add the directory containing cuDNN binary to the %PATH% or $PATH environment variable.
- On Windows, you can copy cudnn64_7.dll to "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v9.0\bin".
Python has been the primary programming language for deep learning applications. 64-bit Python 3.5 or 3.6 distribution is required, and the latest Python 3.5 is recommended for the best compatibility.
Note
- 32-bit Python is not supported because CNTK, TensorFlow and etc. frameworks do not support it.
- Please run the following command in a terminal. If the return value is "64bit", we are sure that 64-bit Python is installed:
python3 -c "import platform; print(platform.architecture()[0])"
Please add Python directory to the %PATH% or $PATH environment variable. You also need to install pip, which is the package management system to install and manage software packages written in Python. Deep learning frameworks rely on pip for their own installation.
Note
- On Windows, it is preferred to install the Python launcher for yourself only.
- If your Python distribution is installed in the system directory (e.g. the one shipped with Visual Studio 2017), administrative permission is required to install Python packages with pip.
Then, we verify whether Python is installed correctly, and upgrade pip to the latest version. Suppose Python 3.5 is installed, please run the following commands in a terminal:
-
Windows
C:\>python -V Python 3.5.4 C:\>pip3 -V pip 10.0.1 from c:\users\test\appdata\local\programs\python\python35\lib\site-packages (python 3.5) C:\>python -m pip install -U pip
-
macOS
MyMac:~ test$ python3 -V Python 3.5.4 MyMac:~ test$ pip3 -V pip 10.0.1 from /Library/Frameworks/Python.framework/Versions/3.5/lib/python3.5/site-packages (python 3.5) MyMac:~ test$ python3 -m pip install -U pip
-
Linux
test@MyLinux:~$ python3 -V Python 3.5.4 test@MyLinux:~$ pip3 -V pip 10.0.1 from /usr/local/lib/python3.5/dist-packages (python 3.5) test@MyLinux:~$ sudo python3 -m pip install -U pip
Visual Studio is a fully-featured IDE with unparalleled productivity for any dev, any app, and any platform.
Visual Studio provides open-source support for the Python language through the Python development and Data Science workloads (Visual Studio 2017) and the free Python Tools for Visual Studio extension (Visual Studio 2015 and earlier). Learn more about Working with Python in Visual Studio for more details.
If you are students, open-source or individual developers, you can download a free copy of Visual Studio Community 2017.
When Visual Studio 2017 installer starts, please choose Python development and .NET desktop development workloads for Python language and .NET support. If you want to scale out deep learning model training and/or inferencing to the Microsoft Azure such as Azure Machine Learning or Azure Batch AI, please also choose Azure development workload.
Note that a 64-bit Python 3.6 will also be installed by default with Visual Studio 2017. If you have installed Python 3.5, please refer to the following "Setting up the default Python environment" subsection.
When Visual Studio 2015 installer starts, please choose Custom type, and then select Python Tools for Visual Studio. If you want to scale out deep learning model training and/or inferencing to the Microsoft Azure such as Azure Machine Learning or Azure Batch AI, please install Azure SDK.
Users need to setup the default Python environment in Visual Studio for AI projects if there are multiple ones. E.g. Users install Python 3.5 manually, and Visual Studio 2017 Python development workload installs a 64-bit Python 3.6 automatically. Or users create several Anaconda virtual Python environments.
To set the default Python environment globally for Visual Studio, please go to menu Tools > Python > Python Environments (Visual Studio 2017), or Tools > Python Tools > Python Environments (Visual Studio 2015). Then, select e.g. Python 3.5 (64 bit) and click Make this the default environment for new projects button.
Python is fully supported in Visual Studio Code through extensions. Please visit here for more details.
Setting up deep learning and machine learning software as well as their dependencies is not an easy task. We recommend that you use the one-click installer to install them automatically across Windows, macOS and Linux.
-
NumPy is a general-purpose array-processing package designed to efficiently manipulate large multi-dimensional arrays of arbitrary records without sacrificing too much speed for small multi-dimensional arrays.
-
SciPy (pronounced "Sigh Pie") is open-source software for mathematics, science, and engineering, depending on NumPy. Starting from version 1.0.0, SciPy now has official prebuilt wheel package for Windows.
To install NumPy and SciPy, run the following command in a terminal:
pip3 install numpy==1.14.3 scipy==1.1.0
Note
The above command will upgrade existing old or unofficial (e.g. third party packages from http://www.lfd.uci.edu/~gohlke/pythonlibs/ for Windows) NumPy and SciPy to the latest official ones.
Jupyter Notebook is an open-source web application that allows you to create and share documents that contain live code, equations, visualizations and narrative text.
To install Jupyter Notebook, run the following command in a terminal:
pip3 install jupyter nbconvert
Pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.
To install Pandas, run the following command in a terminal:
pip3 install pandas
Matplotlib is a Python 2D plotting library which produces publication quality figures in a variety of hardcopy formats and interactive environments across platforms.
To install Matplotlib, run the following command in a terminal:
pip3 install matplotlib
The Microsoft Cognitive Toolkit is a unified deep learning toolkit that describes neural networks as a series of computational steps via a directed graph. CNTK supports both Python and BrainScript programming languages.
To install CNTK Python package, see how to install CNTK for details.
Note
- CNTK currently does not support macOS.
- CNTK GPU-1bit-SGD version is licensed under a specific 1bit-SGD License which is MORE restrictive, than the major CNTK License.
Briefly, to install CNTK Python package, run the following command in a terminal:
- With GPU
pip3 install cntk-gpu==2.5.1
- Without GPU
pip3 install cntk==2.5.1
Note
We advise that you do not have both cntk and cntk-gpu packages installed simultaneously.
To install CNTK BrainScript package, run the following command in a terminal:
-
Visit here to download the CPU-only or GPU package.
-
Windows
- Decompress the zip file to "%AppData%\Roaming\Microsoft\ToolsForAI\RuntimeSDK". Please create this folder if it does not exist.
- Add "%AppData%\Roaming\Microsoft\ToolsForAI\RuntimeSDK\cntk\cntk" to the %PATH% environment variable.
- Install Microsoft MPI from "%AppData%\Roaming\Microsoft\ToolsForAI\RuntimeSDK\cntk\prerequisites\MSMpiSetup.exe", which is required by CNTK.
- Install Microsoft Visual C++ 2015 Redistributable from "%AppData%\Roaming\Microsoft\ToolsForAI\RuntimeSDK\cntk\prerequisites\VS2015\vc_redist.x64.exe" if it is not installed yet.
-
Linux
- Decompress the zip file to your home directory "~/.toolsforai".
- Add "~/.toolsforai/cntk/cntk/bin" to the $PATH environment variable.
- Install OpenMPI by running the following command in a terminal:
sudo apt-get install libopenmpi-dev
TensorFlow is an open source software library for numerical computation using data flow graphs. Please refer to here for detailed installation.
To install TensorFlow, run the following command in a terminal:
- With GPU
pip3 install tensorflow-gpu==1.5.0
- Without GPU
pip3 install tensorflow==1.5.0
PyTorch is a python package that provides two high-level features:
- Tensor computation (like numpy) with strong GPU acceleration
- Deep Neural Networks built on a tape-based autograd system
To install PyTorch, please run the following command in a terminal:
-
Windows
- With GPU
- Python 3.5 pip3 install http://download.pytorch.org/whl/cu90/torch-0.4.0-cp35-cp35m-win_amd64.whl - Python 3.6 pip3 install http://download.pytorch.org/whl/cu90/torch-0.4.0-cp36-cp36m-win_amd64.whl
- Without GPU
- Python 3.5 pip3 install http://download.pytorch.org/whl/cpu/torch-0.4.0-cp35-cp35m-win_amd64.whl - Python 3.6 pip3 install http://download.pytorch.org/whl/cpu/torch-0.4.0-cp36-cp36m-win_amd64.whl
- With GPU
-
macOS
pip3 install torch==0.4.0
[!NOTE]
macOS binaries don't support CUDA, install from source if CUDA is needed
-
Linux
- With GPU
- Python 3.5 pip3 install http://download.pytorch.org/whl/cu90/torch-0.4.0-cp35-cp35m-linux_x86_64.whl - Python 3.6 pip3 install http://download.pytorch.org/whl/cu90/torch-0.4.0-cp36-cp36m-linux_x86_64.whl
- Without GPU
- Python 3.5 pip3 install http://download.pytorch.org/whl/cpu/torch-0.4.0-cp35-cp35m-linux_x86_64.whl - Python 3.6 pip3 install http://download.pytorch.org/whl/cpu/torch-0.4.0-cp36-cp36m-linux_x86_64.whl
- With GPU
Finally, install torchvision:
pip3 install torchvision==0.2.1
Caffe2 is a lightweight, modular, and scalable deep learning framework. Building on the original Caffe, Caffe2 is designed with expression, speed, and modularity in mind.
Currently, there's no official prebuilt Caffe2 python wheel package available. Please visit here to build from source code.
Note
This site has a third-party Caffe2 0.8.1 Windows wheel package (supports both GPU and CPU).
Apache MXNet (incubating) is a deep learning framework designed for both efficiency and flexibility. It allows you to mix symbolic and imperative programming to maximize efficiency and productivity.
To install MXNet, run the following command in a terminal:
- With GPU
pip3 install mxnet-cu90==1.2.0
- Without GPU
pip3 install mxnet==1.2.0
Keras is a high-level neural networks API, written in Python and capable of running on top of CNTK, TensorFlow or Theano. It was developed with a focus on enabling fast experimentation. Being able to go from idea to result with the least possible delay is key to doing good research.
To install Keras, please run the following command in a terminal:
pip3 install Keras==2.1.6
Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently.
To install Theano, please run the following command in a terminal:
pip3 install Theano==1.0.2
Chainer is a Python-based deep learning framework aiming at flexibility. It provides automatic differentiation APIs based on the define-by-run approach (a.k.a. dynamic computational graphs) as well as object-oriented high-level APIs to build and train neural networks.
To enable CUDA support, install CuPy:
- Linux
pip3 install cupy-cuda90==4.1.0
- Non-Linux
pip3 install cupy==4.1.0
Note
On Windows, you need 2015 version of Microsoft Visual Studio or Microsoft Visual C++ Build Tools to compile CuPy with CUDA. First, open a VS2015 x64 Native Tools Command Prompt or Visual C++ 2015 x64 Native Tools Command Prompt, and then execute the above cupy installation command.
To install Chainer, please run the following command in a terminal:
pip3 install chainer==4.1.0
scikit-learn is a Python module for machine learning built on top of SciPy and distributed under the 3-Clause BSD license.
To install scikit-learn, please run the following command in a terminal:
pip3 install scikit-learn==0.19.1
XGBoost is an optimized distributed gradient boosting library designed to be highly efficient, flexible and portable. It implements machine learning algorithms under the Gradient Boosting framework.
To install XGBoost, please run the following command in a terminal:
- Windows
There is no official prebuilt wheel package for Windows yet.
Please visit here and download a suitable 64-bit package.
pip3 install /download/path/xgboost*win_amd64.whl
- Non-Windows
pip3 install xgboost
LIBSVM is an integrated software for support vector classification, (C-SVC, nu-SVC), regression (epsilon-SVR, nu-SVR) and distribution estimation (one-class SVM). It supports multi-class classification.
To install LIBSVM on Windows, please visit here and download a suitable 64-bit package because there is no official prebuilt wheel package for Windows yet. Then, please run the following command in a terminal:
pip3 install /download/path/libsvm*win_amd64.whl
To install LIBSVM on non-Windows, please build from the source code.
ONNX is the first step toward an open ecosystem that empowers AI developers to choose the right tools as their project evolves. ONNX provides an open source format for AI models. Caffe2, PyTorch, Microsoft Cognitive Toolkit, Apache MXNet and other tools are developing ONNX support.
Note
On non-Windows, please make sure to install the Protobuf compiler and set environment variable ONNX_ML=1 for onnx-ml.
To install ONNX, please run the following command in a terminal:
pip3 install onnx
coremltools contains all supporting tools for CoreML model conversion and validation. This includes Scikit Learn (0.17+), LIBSVM, Caffe, Keras (1.2.2, 2.0.4+) and XGBoost (0.7+). These frameworks should have been installed when you are converting models.
To install coremltools, please run the following command in a terminal:
- Windows
[!NOTE] There is no official prebuilt wheel package for Windows yet. The following method installs Python stuff only.
pip3 install "git+https://github.com/apple/coremltools@v0.8"
- Non-Windows
pip3 install coremltools==0.8
onnxmltools enables you to convert models from different machine learning toolkits into ONNX. Currently the following toolkits (need installation) are supported:
- Apple Core ML
- scikit-learn (subset of models convertible to ONNX)
To install onnxmltools, please run the following command in a terminal:
pip3 install onnxmltools==1.0.0.0
winmltools enables you to convert models from different machine learning toolkits into ONNX for use with Windows Machine Learning.
To install winmltools, please run the following command in a terminal:
pip3 install winmltools==0.1.0.5072
tf2onnx converts a TensorFlow graph to an ONNX graph. tf2onnx is in its early development. Mileage will vary since TensorFlow supports ~4 times the operations that the current ONNX version supports. But standard models seem to be using mostly ops that ONNX does support.
To install tf2onnx, please run the following command in a terminal:
pip3 install "git+https://github.com/onnx/tensorflow-onnx.git@r0.1"
Netron is a viewer for neural network, deep learning and machine learning models.
Netron supports ONNX (.onnx, .pb), Keras (.h5, .keras), CoreML (.mlmodel) and TensorFlow Lite (.tflite). Netron has experimental support for Caffe (.caffemodel), Caffe2 (predict_net.pb), MXNet (-symbol.json), TensorFlow.js (model.json, .pb) and TensorFlow (.pb, .meta).
To install Netron, please visit its release page and download a suitable installer.
In recent years, machine learning and deep learning become very popular in IT industry. There have been plenty of frameworks for users to build their own models. However, they differ with each other greatly on the implementation details. This will inevitably result in that models produced by one framework cannot be reused for subsequent training or inference in another framework, which brings inconvenience and increases cost to users on framework choice.
Model file conversion is a feasible trial towards such challenge. In the above subsections, we introduce several model converters: coremltools, onnxmltools, winmltools and tf2onnx, as well as their installation method.
For Windows users, we recommend that you use the one-click installer to setup these converters. If you wish to install them by yourself, first go to the third-party web site to install unofficial XGBoost and LIBSVM 64-bit Windows packages, and then run the following command in a terminal:
pip3 install tensorflow==1.5.0 scikit-learn==0.19.1 onnx "git+https://github.com/apple/coremltools@v0.8" onnxmltools==1.0.0.0 winmltools==0.1.0.5072 "git+https://github.com/onnx/tensorflow-onnx.git@r0.1"