Deploying a Neural Network with Simulink Embedded Coder: Reducing Code Size and Improving Execution Speed

My goal is to deploy a fully connected neural network model to an STM32MP1 microcontroller using Embedded Coder.
To this end, I've adapted the following workflow (any suggestions appreciated):
  1. Transform the neural network from .pth to .onnx file format (via torch.onnx.export),
  2. Import the network into MATLAB using importONNXNetwork() and save it to a .mat file (sketched after this list),
  3. Load this .mat file inside a Simulink Predict block (see image),
  4. Generate C-Code via the Embedded Coder App, and
  5. Build and run the model on the STM32MP1 microcontroller.
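A minimal MATLAB sketch of steps 2-3, assuming the Deep Learning Toolbox Converter for ONNX Model Format support package is installed (the file names are placeholders):

% Step 2: import the ONNX model into MATLAB
net = importONNXNetwork("mlp.onnx", ...           % placeholder file name
    OutputLayerType="regression");                % MLP with continuous outputs

% Step 3 (preparation): save the network to a MAT-file; the Predict
% block can then load it with its "Network" parameter set to
% "Network from MAT-file".
save("mlp_net.mat", "net");
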
The constraints of the microcontroller (RAM) and the task (frequency) are the following:
  • Network Size: Given the training results, the minimum viable network is an MLP with three hidden layers (256x128x64) and I/O = 10/2
  • Code Size: The generated code needs to be < 200 kB in order to fit into the RAM (see the estimate after this list)
  • Network Speed: The generated code needs to run in < 1 ms
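For scale, the weights of the target network alone come close to the RAM budget. A quick back-of-the-envelope check (layer sizes taken from the constraints above):

% Parameter count for the 10 -> 256 -> 128 -> 64 -> 2 fully connected MLP
sizes  = [10 256 128 64 2];
params = sum(sizes(1:end-1) .* sizes(2:end) + sizes(2:end)); % weights + biases
fprintf("Parameters: %d\n", params)                          % 44098
fprintf("single (4 B/param): ~%.0f kB\n", params*4/1024)     % ~172 kB
fprintf("half   (2 B/param): ~%.0f kB\n", params*2/1024)     % ~86 kB

So roughly 172 kB of the 200 kB budget would go to single-precision weights alone, which is also what motivates the float16 idea in the comments below.
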
Currently, with this workflow, I am able to generate and build C code for a smaller network (128x64x32), which now runs on the STM32MP1 with an execution time of around 0.9 ms, while the actual network (256x128x64) fails to build on the microcontroller due to a RAM overflow of 76 kB.
Hence, the two related questions are (in order of priority):
  1. How can the size of the generated C code be reduced (from 276 kB to < 200 kB)?
  2. How can the execution speed of the generated C code be improved? (One candidate for both is sketched below.)
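One avenue that might help with both points is generating code against the ARM Compute Library, since the STM32MP1 has Cortex-A7 (Armv7-A) cores. A hedged MATLAB Coder sketch, as an alternative to the Simulink flow above - mlp_predict is a hypothetical entry-point function wrapping predict(net, x), and the MATLAB Coder Interface for Deep Learning support package is assumed to be installed:

% Library code generation targeting the ARM Compute Library
cfg = coder.config("lib");
cfg.TargetLang = "C++";                          % the ARM Compute flow generates C++
dlcfg = coder.DeepLearningConfig("arm-compute");
dlcfg.ArmArchitecture = "armv7";                 % Cortex-A7 cores on the STM32MP1
cfg.DeepLearningConfig = dlcfg;

% mlp_predict is a hypothetical entry point that calls predict(net, x)
codegen -config cfg mlp_predict -args {zeros(10,1,"single")}
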

Answers (1)

Prateek on 6 Jan 2023
  2 Comments
Julian on 6 Jan 2023
Hi Prateek,
Thanks for offering help on this topic and providing some links to the docs!
Having checked your pointers, I've tested the Model Advisor (#2) and selected different objectives (#3) for improving the generated code, without success so far. While the docs on code generation in general seem comprehensive, they rarely focus on the deployment of neural networks - do you have specific pointers for the code optimization of neural networks as well?
Another idea to reduce the size of the generated C code for the NN could be to use the float16 data type (half precision) instead of the default float32 (single precision), which would essentially cut the weight storage almost in half. However, using onnxmltools.utils.float16_converter to convert the ONNX model to float16 and importing it into MATLAB with importONNXNetwork() results in the following error:
"Error using nnet.internal.cnn.onnx.getDataFromTensorProto
The datatype of initializer 'model._model.a2c_network.sigma' ('FLOAT16') is not supported."
Is MATLAB (and the Coder) even able to handle neural networks with float16 data types at this point in time? An older forum entry said this was under "active development".
Best, Julian


Release

R2022b
