File Exchange

image thumbnail

GPU Coder Interface for Deep Learning Libraries

Interface for Deep Learning Libraries from GPU Coder


Updated 14 Oct 2020

GPU Coder™ enables generates optimized C++ code for deep learning, embedded vision, and autonomous systems. The generated code calls:
• optimized NVIDIA CUDA libraries and can be used for prototyping on all NVIDIA GPU platforms
• optimized ARM libraries and can be used for prototyping on ARM Mali GPU platforms

You can deploy a variety of trained deep learning networks such as YOLOv2, ResNet-50, SegNet, MobileNet, etc. from Deep Learning Toolbox™ to NVIDIA GPUs. You can generate optimized code for preprocessing and postprocessing along with your trained deep learning networks to deploy complete algorithms.

Supported networks and layers are listed here:

GPU Coder Interface for Deep Learning Libraries provides the ability to customize the generated code by leveraging target specific libraries on the embedded target. With this support package, you can integrate with libraries optimized for specific GPU targets for deep learning such as the TensorRT library for NVIDIA GPUs or ARM Compute Library for ARM Mali GPUs .

GPU Coder Interface for Deep Learning integrates with the following deep learning accelerator libraries and the corresponding GPU architectures:
• cuDNN and TensorRT libraries for NVIDIA GPUs
• ARM Compute Library for ARM Mali GPUs

This hardware support package is functional for R2018b and beyond.

It requires GPU Coder except when using Deep Learning Toolbox’s predict function with the (“Acceleration”,“mex”) name-value pair option.

If you have download or installation problems, please contact Technical Support -

[Updates in R2019b]
• Add code generation support for wordEmbeddingLayer for CuDNN target
• Add VC++ 2019 compiler support for cnncodegen for all targets (cuDNN, TensorRT)
• Add support for ONNX identity layer for all targets (cuDNN, TensorRT, ARM Mali)
• Support codegen for concatenationLayer for cuDNN
• Add support for Crop2dLayer for ARM Mali. This enables support for Fully Convolution Networks for Semantic Segmentation

Comments and Ratings (7)

Wenzhen Yue

Wanbin Song


blank tian


dudu tassa

Bill Chou

MATLAB Release Compatibility
Created with R2018b
Compatible with R2018b to R2020b
Platform Compatibility
Windows macOS Linux

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!