Reduce top memory consumption of dnn module for FP16 precision #13413

Open
dkurt opened this issue Dec 11, 2018 · 4 comments

Comments

@dkurt (Member) commented Dec 11, 2018

For networks executed with DNN_TARGET_OPENCL_FP16 or DNN_TARGET_MYRIAD, we can reduce the top memory consumption of the dnn module by approximately 2x.

Currently, the module loads all weights in FP32 precision and performs an FP32->FP16 conversion for the mentioned targets. For models whose weights are larger than the internal allocations for intermediate blobs, we can reduce the top memory consumption by converting the weights to FP16 during import.
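For illustration, a minimal sketch of the conversion itself, using cv::convertFp16 from the core module (the 1024x1024 weight matrix is a stand-in for real layer weights; the actual change would live inside the importers):

    #include <opencv2/core.hpp>
    #include <iostream>

    int main()
    {
        // Stand-in for layer weights that an importer has loaded in FP32.
        cv::Mat weightsFp32(1024, 1024, CV_32F);
        cv::randu(weightsFp32, -1.0f, 1.0f);

        // Convert to FP16 right away instead of keeping the FP32 copy around.
        cv::Mat weightsFp16;
        cv::convertFp16(weightsFp32, weightsFp16);

        std::cout << "FP32: " << weightsFp32.total() * weightsFp32.elemSize() << " bytes\n"
                  << "FP16: " << weightsFp16.total() * weightsFp16.elemSize() << " bytes\n";

        // The FP16 blob is half the size; releasing the FP32 original early
        // is what lowers the top (peak) memory consumption.
        return 0;
    }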

There are several questions that must be resolved:

  1. Some layers that fail to execute with DNN_BACKEND_OPENCV and DNN_TARGET_OPENCL_FP16 fall back to DNN_TARGET_OPENCL and then to DNN_TARGET_CPU. We need to manage this and keep the FP32 weights for such layers.

  2. Currently it works in this way:

    net = readNet(<model path>);  <-- Importers work here
    net.setPreferableBackend(<backend id>);
    net.setPreferableTarget(DNN_TARGET_OPENCL_FP16);  <-- We specify precision only here
    

    We need to tell the importers about the desired precision earlier.

    2.1. Solution 1: an extra flag to readNet with the target (or backend and target?); see the sketch after this list.
    2.2. Solution 2: create methods such as Net::readFromCaffe and specify the target before import:

    Net net;
    net.setPreferableTarget(DNN_TARGET_OPENCL_FP16);
    net.readFromCaffe(<model path>);
    

    However, this is not as obvious to the user as the first solution.

  3. The precision of this approach won't match the current FP32->FP16 path, because all weight fusions are currently performed in FP32.

  4. It would be great research to study the dynamics of the dnn module's top memory consumption for a single-forward-pass use case across different versions. Several big PRs made changes in this area: opencv/opencv_contrib#1205, #11461 (just the ones I found in my logs), #9389 (closed for a while).
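A rough sketch of how Solution 1 (2.1) could look from the user side; the extra target parameter to readNet is hypothetical and not part of the current API:

    // Hypothetical overload: the target is known at import time, so the
    // importers can convert the weights to FP16 while reading the model:
    //   Net readNet(const String& model, const String& config = "",
    //               const String& framework = "", int targetId = DNN_TARGET_CPU);

    cv::dnn::Net net = cv::dnn::readNet("model.caffemodel", "deploy.prototxt",
                                        "", cv::dnn::DNN_TARGET_OPENCL_FP16);
    net.setPreferableBackend(cv::dnn::DNN_BACKEND_OPENCV);
    // A later setPreferableTarget() call would become redundant for the
    // precision choice, though still useful for selecting the execution device.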

@kunakl07 commented Jan 10, 2020

Is this issue still open? I would like to work on it.

@souradeepmajumdar05 commented Sep 13, 2020

Is anybody working on this issue?

@carrycooldude commented Sep 15, 2020

I want to work on this issue, @dkurt.

@dkurt (Member, Author) commented Sep 15, 2020

Feel free to propose a solution by opening a pull request.
