gpucoder.profile
Create an execution profile report for generated CUDA code
Description
gpucoder.profile(
generates an execution profiling report of the CUDA code generated for the design file
func_name
,codegen_inputs
)func_name
. The codegen_inputs
argument specifies the
inputs to the design file. You must install the Embedded Coder® product to generate the profiling report.
Note
The profiling workflow depends on the nvprof
tool from
NVIDIA®. In CUDA® Toolkit v10.1, NVIDIA restricts access to performance counters to admin users. To enable GPU
performance counters for all user accounts, see the instructions in Permission issue with Performance Counters (NVIDIA).
gpucoder.profile(___,
generates an execution profiling report with one or more profiling options specified as a
name-value pair argument.Name,Value
)
Examples
Input Arguments
Version History
See Also
Apps
Functions
Objects
Topics
- GPU Programming Paradigm
- Analyze Execution Profiles of the Generated Code
- GPU Execution Profiling of the Generated Code
- Code Generation by Using the GPU Coder App
- Code Generation Using the Command Line Interface
- Code Generation for Deep Learning Networks by Using cuDNN
- Code Generation for Deep Learning Networks by Using TensorRT