Description | To train neural networks, you must collect and submit to AxxonSoft videos and images from your actual cameras taken in the same resolution and in the same conditions as in your future application. For example, if you need your neural network to work outdoors, videos must contain all weather conditions (sun, rain, snow, fog, and so on) in at different times of day (morning, afternoon, evening, and night). If the collection requirements for the data submitted for training the neural network model are met, and if you operate the neural network in the conditions that are as similar as possible to the conditions in which the data was collected, we guarantee the overall accuracy (training in operating conditions) of neural network analytics from 90% to 97% and the percentage of false positives of 5-7%. For general standard neural networks (training wasn't performed in operating conditions), we guarantee the overall accuracy of 80-95% and the percentage of false positives of 5-20%. The standard neural networks are located in the C:\Program Files\Common Files\AxxonSoft\DetectorPack\NeuroSDK directory. The requirements can be changed or added to at any time |
General requirements for collected data | - When collecting data, specific requirements for object images, scene, angle, lighting, and video stream must be met for those
detection tools - detectors that you plan to use (see
Configuring detection tools- Detectors).
If it is required to train a neural network in different conditions of time of day, lighting, angle, object types, or weather, you must collect video data in equal proportions for each condition, that is, it must be balanced.
Example. It is necessary to detect a person in the surveillance area at night and during the day. - Data collected correctly:
- four videos of the surveillance area, each five minutes long are submitted for training;
- the object of interest appears in the frame in each video;
- two videos
were night - nighttime conditions, two—in daytime conditions.
- Data collected incorrectly:
- three videos of the surveillance area, each five minutes long are submitted for training;
- the object of interest appears in the frame in each video;
- two fragments
were night - nighttime conditions, one—in daytime conditions
|
Additional requirements for collected data for each neural analytics tool | Detection toolDetector | Requirements |
---|
NeurofilterNeural filter | At least 1000 frames containing different objects of interest in the given scene conditions, and the same number of frames containing no objects (noise frames) | NeurotrackerNeural tracker | 3–5 3–5 minutes of videos containing objects of interest in the given scene conditions. The more greater the number and variability of the situations in the scene, the better | Stopped object detector | Pose DetectionHuman pose detector | 10 seconds of second video of a scene with no persons. At least 100 different persons in the given scene conditions. Attention! Different conditions mean, among others, different positions of a person in a scene (bending, different positions of body parts, and so on) | Equipment detection (PPE)detector | A list of all reference equipment with examples must be collected from the object and coordinated with the analytics manufacturer (see Example of providing a list of valid equipment at the facility for the Equipment detector). 3-5 minutes of videos with personnel in the given scene conditions. Personnel must move and change position positions in the collected videos, as well as remove and put on equipment at intervals of 30 seconds. Since the equipment detection tools detectors are designed for artificial constant lighting, videos in other lighting conditions aren't required | PPE detection Equipment detector VL | Fire Detectiondetector | At least 1000 frames containing different objects of interest in the given scene conditions, and the same number of frames containing no objects (noise frames) | Smoke DetectionObject Presence Detectiondetector | Neural classifier | At least 1000 frames containing different objects of interest in the given scene conditions, and the same number of frames containing no objects (noise frames) |
|