The GPU Cache Generator utility is used to pre-create a cache of neural networks that are used in the operation of detectors.
To install the utility:



After installation, a new window opens with a message about the completion of the utility installation.
By default, the utility window opens after installation is complete. If you don’t want to launch the utility after installation is complete, clear the Launch the GPU cache generator checkbox. |

The utility installation is complete.
When you launch the utility, a window opens asking you to stop the VMS server and other services that use GPU resources. This is required to correctly create an optimal cache. If you don’t stop the VMS server and related services, the utility can continue to work, but in this case the caching result can be less effective.

To view the main interface of the utility, click the Yes, I have closed all applications that use GPU button.
As a result, the main window of the utility is displayed. It is divided into two areas:
The left part displays a list of available detectors.
The right part displays a general list of neural networks for which you can generate a cache.

When you select a specific detector, all neural networks associated with the selected detector are automatically displayed on the right side. And when you select a specific neural network, the corresponding detector is automatically selected on the left side.
The table shows the match between detectors and neural networks:
| Detector | Neural models |
|---|---|
Barcode detection | GeneralNM barcodes |
Equipment detector (PPE) |
|
Fire detector | Fire scanned v1 |
Meta detector |
|
Neurocounter or Stopped object detector |
|
Neurotracker |
|
Pose detector |
|
Person-based privacy masking | General human pose estimation |
Privacy masking detector | Privacy masking origin |
Smoke detector | Smoke scanned v1 |
Water level detector | Water level rule net origin |
Custom neural networks | Allows you to generate a cache of a custom neural network, provided that the neural network can be run on a GPU |
To generate a cache:
In the right part of the main window of the utility, set the checkboxes next to the neural networks for which you want to create a cache.
Click the
button in the lower right part of the window to go to the cache generation settings.
In the window that opens, set the checkbox next to the video card for which the cache is created.
Cache generation is only supported for NVIDIA graphics cards, as TensorRT technology doesn’t support other graphics cards. |
| Parameter | Value | Description | |
|---|---|---|---|
| Enable int8 calibration |
|
By default, the checkbox is clear. To enable the Int8 quantization mode for a neural network, set the checkbox. Neural networks for which the quantization mode is available are included in the neural analytics package along with the *.info file of the same name | |
| |||
| Enable verbose logging mode |
| By default, the checkbox is clear. To enable logging of the process of initialization and cache generation, set the checkbox. Enabling the parameter provides detailed information about the cache generation process but increases the volume of logs and can slow down the generation process
| |
| |||
| The cache will be saved | AXXONGPU | Specify the directory of cache storage
|
Click the
button in the lower right part of the window to proceed to generating a cache for all selected neural networks.
|
Wait for the generation process to complete. If the cache is successfully generated, the created files are available for use by detectors.
Cache generation is complete.
When regenerating the cache for a specific neural network, the system tries to use the existing cache. If the cache is missing or corrupted, a new file is created. |