I'm doing a neural network prediction with my own datasets using Tensorflow. The first I did was a model that works with a small dataset in my computer. After this, I changed the code a little bit in order to use Google Cloud ML-Engine with bigger datasets to realize in ML-Engine the train and the predictions.
I am normalizing the features in the panda dataframe but this introduces skew and I get poor prediction results.
What I would really like is use the library tf-transform
to normalize the data in the graph. To do this, I would like to create a function preprocessing_fn
and use the 'tft.scale_to_0_1
'. https://github.com/tensorflow/transform/blob/master/getting_started.md
The main problem that I found is when I'm trying to do the predict. I'm looking for internet but I don't find any example of exported model where the data is normalized in the training. In all the examples I found, the data is NOT normalized anywhere.
What I would like to know is If I normalize the data in the training and I send a new instance with new data to do the prediction, how is normalized this data?
¿Maybe in the Tensorflow Data Pipeline? The variables to do the normalization are saved in some place?
In summary: I'm looking for a way to normalize the inputs for my model and then that the new instances also become standardized.