Edge TPU Compiler: ERROR: quantized_dimension must be in range [0, 1). Was 3

Asked 27/7, 2019 at 17:2 Answered 14/10, 2019 at 20:13

Solved tensorflow quantization tpu google-coral

I'm trying to get a Mobilenetv2 model (retrained last layers to my data) to run on the Google edge TPU Coral.

I've followed this tuturial https://www.tensorflow.org/lite/performance/post_training_quantization?hl=en to do the post-training quantization. The relevant code is:

...
train = tf.convert_to_tensor(np.array(train, dtype='float32'))
my_ds = tf.data.Dataset.from_tensor_slices(train).batch(1)


# POST TRAINING QUANTIZATION
def representative_dataset_gen():
    for input_value in my_ds.take(30):
        yield [input_value]

converter = tf.lite.TFLiteConverter.from_keras_model_file(saved_model_dir)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_dataset_gen
converter.target_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
tflite_quant_model = converter.convert()

I've successfully generated the tflite quantized model but when I run the edgetpu_compiler (followed this page https://coral.withgoogle.com/docs/edgetpu/compiler/#usage) I get this output:

edgetpu_compiler  Notebooks/MobileNetv2_3class_visit_split_best-val- 
acc.h5.quant.tflite

Edge TPU Compiler version 2.0.258810407
INFO: Initialized TensorFlow Lite runtime.
ERROR: quantized_dimension must be in range [0, 1). Was 3.
ERROR: quantized_dimension must be in range [0, 1). Was 3.
ERROR: quantized_dimension must be in range [0, 1). Was 3.
ERROR: quantized_dimension must be in range [0, 1). Was 3.
ERROR: quantized_dimension must be in range [0, 1). Was 3.
ERROR: quantized_dimension must be in range [0, 1). Was 3.
ERROR: quantized_dimension must be in range [0, 1). Was 3.
ERROR: quantized_dimension must be in range [0, 1). Was 3.
ERROR: quantized_dimension must be in range [0, 1). Was 3.
ERROR: quantized_dimension must be in range [0, 1). Was 3.
ERROR: quantized_dimension must be in range [0, 1). Was 3.
ERROR: quantized_dimension must be in range [0, 1). Was 3.
ERROR: quantized_dimension must be in range [0, 1). Was 3.
ERROR: quantized_dimension must be in range [0, 1). Was 3.
ERROR: quantized_dimension must be in range [0, 1). Was 3.
ERROR: quantized_dimension must be in range [0, 1). Was 3.
ERROR: quantized_dimension must be in range [0, 1). Was 3.
Invalid model: Notebooks/MobileNetv2_3class_visit_split_best-val-        
acc.h5.quant.tflite
Model could not be parsed

The input shape of the model is a 3 channel RGB image. Is possible to do full integer quantization on 3 channel images? I couldn't find anything saying that you can't either on TensorFlow and Google Coral documentation.

Caskey answered 27/7, 2019 at 17:2 Comment(4)

I think it is not an issue with the 3 channels. I have a model that uses 3 channels and it works with no problem on the edge tpu (when I convert it with the toco converter with manual min/max flags set). However I get the same error when I use post training qunatization. So there seems to be another issue with this that I also couldn't figure out yet. – Phenosafranine 20/8, 2019 at 17:30

Do you maybe know what the qunatized_dimension means? Then we maybe can figure out where the error is coming from – Phenosafranine 20/8, 2019 at 17:32

According to tensorflow.org/lite/performance/quantization_spec the quantization_dimension denotes the axis on which the scale and zero_point parameters are specified for a tensor. So it is strange that apparently only [0, 1) is allowed for this since I would assume it to be a value between [0, maxTensorDimension]. This really seems like a strange bug of the compiler and not of the converted model itself. – Phenosafranine 20/8, 2019 at 17:44

@Phenosafranine Can you please explain how to find the std_dev and mean values to use in toco? For a MobileNetV2 – Caskey 6/9, 2019 at 14:40

I had similar errors, doing the post training full integer quantization with tf-nightly build 1.15 and the use that .tflite file, compile with edge TPU compiler it should work. my error was solved with this approach.

Same issue was raised in github, you can see it - here

Jacksmelt answered 30/9, 2019 at 7:0 Comment(0)

I have the same problem and the same error message. I retrained MobilenetV2 using tensorflow.keras.applications mobilenetv2. I found that there are some big differences in the TFLite tensors between my model and the Coral's example model(https://coral.withgoogle.com/models/).

First, types of input and output are different. When I convert my tf.keras model to tflite, it contains float type input and output tensors while the example model has an integer type. This is different if I use a command-line conversion and python conversion from tensorflow-lite (https://www.tensorflow.org/lite/convert/). The command-line conversion outputs the integer type io, but python conversion outputs the float type io. (This is really strange.)

Second, there is no Batch normalization(BN) layer in the example model however there are some BNs in Keras MobilenetV2. I think the number of 'ERROR: quantized_dimension must be in range [0, 1). Was 3.' is related to the number of BN because there are 17 BN layers in Keras model.

I'm still struggling with this problem. I'm just going to follow the Coral's retraining example to solve it. (https://coral.withgoogle.com/docs/edgetpu/retrain-detection/)

Earwitness answered 26/8, 2019 at 7:11 Comment(0)

This problem is fixed in tensorflow1.15-rc. Convert your model to TFLite in the new tf version. Then the TFLite model will work in TPU compiler.

And put these lines which make the TFlite model's input and output as an uint8 type. (I think it should be tf.int8 though.)

converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
converter.inference_input_type = tf.uint8
converter.inference_output_type = tf.uint8

Check the link below. https://www.tensorflow.org/lite/performance/post_training_quantization

Earwitness answered 24/9, 2019 at 14:39 Comment(1)

not sure why this was down voted but these 2 lines were crucial: ``` converter.inference_input_type = tf.uint8 converter.inference_output_type = tf.uint8 ``` – Hillock 14/10, 2019 at 20:15

Same issue was raised in github, you can see it - here

Jacksmelt answered 30/9, 2019 at 7:0 Comment(0)

Do you still have this issue after updating to the newest compiler version?

Edge TPU Compiler version 2.0.267685300

Hillock answered 14/10, 2019 at 20:13 Comment(1)

The problem was the tensorflow version not the edge tpu compiller. It worked on tf-nightly build 1.15. – Caskey 15/10, 2019 at 11:55

Recommended topics

Hot tags