training-data Questions

3

I have been trying to fine-tune GPT2 on the wikitext-2 dataset (just to help myself learn the process) and I am running into a warning message that I have not seen before: "The attention mask ...

2

Solved

I'm trying to train Tesseract 4 with images instead of fonts. In the docs they are explaining only the approach with fonts, not with images. I know how it works, when I use a prior version of T...
Erysipelas asked 11/4, 2017 at 17:47

4

Solved

I was trying to develop some way to convert annotations between formats, and it's quit hard to find information but here I have : This one is PASCAL VOC <width>800</width> <height&gt...
Bloat asked 28/10, 2020 at 21:39

1

Someone I know already has a Google Colab Pro account and I need to use it for a while for training a deep learning model. Can he/she lend their account to me without sharing Google credentials? I ...
Monamonachal asked 3/5, 2021 at 7:13

2

I am trying to automatically extract a scale (scale bar + a number + unit) from an image. Here is an example: It is used to map pixels to real world measurement. I am using PyTesseract (installed ...
Nadabas asked 1/11, 2020 at 13:56

1

Solved

I am using pytorch to train my CNN network. I want to plot my training and validation loss curves to visulize the model performance. How can I plot two curves? I have below code # create a function...
Raffaello asked 10/12, 2022 at 16:4

4

Solved

I Can calculate accuracy after each epoch using this code . But, I want to calculate the accuracy for each class at the end . how can i do that? I have two folders train and val . each folder has 7...

28

Solved

I'm training on codility now. Some tasks I can solve by myself, but with some tasks have problems. Difficulty of this task is <**>. It's medium, but I stalled. Problem: You are given a non-e...
Tussle asked 20/1, 2014 at 20:40

8

Solved

So I have a main folder which contains sub-folders which in turn contains images for the dataset as follows. -main_db ---CLASS_1 -----img_1 -----img_2 -----img_3 -----img_4 ---CLASS_2 -----...
Aerogram asked 7/8, 2019 at 12:5

6

Solved

I am trying to use train_test_split from package scikit Learn, but I am having trouble with parameter stratify. Hereafter is the code: from sklearn import cross_validation, datasets X = iris.data...
Nejd asked 17/1, 2016 at 19:5

4

Solved

I want to separate my data into train and test set, should I apply normalization over data before or after the split? Does it make any difference while building predictive model?
Nonmaterial asked 23/3, 2018 at 7:13

4

Solved

I got the following example from here. train_datagen = ImageDataGenerator( rescale=1./255, shear_range=0.2, zoom_range=0.2, horizontal_flip=True) test_datagen = ImageDataGenerator(rescale=1./...
Hershelhershell asked 29/10, 2018 at 1:3

4

Solved

I am training a classifier over tweets for sentiment analysis purposes. The code is the following: df = pd.read_csv('Trainded Dataset Sentiment.csv', error_bad_lines=False) df.head(5) #TWEET ...
Wryneck asked 25/8, 2017 at 14:29

1

Solved

I am doing a binary classification. May I know how to extract the real indexes of the misclassified or classified instances of the training data frame while doing K fold cross-validation? I found n...

3

Solved

I have a very imbalanced dataset. I used sklearn.train_test_split function to extract the train dataset. Now I want to oversample the train dataset, so I used to count number of type1(my data set h...

4

Solved

My question is simple, what is the validation data passed to model.fit in a Sequential model used for? And, does it affect how the model is trained (normally a validation set is used, for example,...
Kemeny asked 19/9, 2017 at 19:28

7

I'm using R to do machine learning. Following standard machine learning methodology, I would like to randomly split my data into training, validation, and test data sets. How do I do that in R? I ...
Eal asked 17/3, 2016 at 18:34

2

Solved

train_image_gen = image_gen.flow_from_directory('/Users/harshpanwar/Desktop/Folder/train', target_size=image_shape[:2], batch_size=batch_size, class_mode='binary') In the above code snippet wh...

2

I have a ML model which is trained on a million data set (supervised classification on text) , however I want the same model to get trained again as soon as a new data comes in (training data). Thi...

5

Solved

I'm in my OPENCV_ROOT folder and running the following command in Win-x64: opencv_createsamples -bgcolor 0 -bgthresh 0 -maxxangle 1.1 -maxyangle 1.1 maxzangle 0.5 -maxidev 40 -w 80 -h 40 -img AB...
Equipollent asked 24/3, 2014 at 13:0

2

I am looking for the best approach to train on larger-than-memory-data in Keras and currently noticing that the vanilla ImageDataGenerator tends to be slower than I would hope. I have two networks...
Kief asked 10/12, 2016 at 3:24

3

Solved

When developing a neural net one typically partitions training data into Train, Test, and Holdout datasets (many people call these Train, Validation, and Test respectively. Same things, different n...
Standish asked 5/7, 2016 at 21:44

2

I want to draw StackOverflow's logo with this Neural Network: The NN should ideally become [r, g, b] = f([x, y]). In other words, it should return RGB colors for a given pair of coordinates. The F...

5

Solved

Let's say I've read in a textfile using a TextLineReader. Is there some way to split this into train and test sets in Tensorflow? Something like: def read_my_file_format(filename_queue): reader =...
Yearn asked 25/1, 2017 at 19:5

1

Solved

I have a neural network which takes input tensor of dimension (batch_size, 100, 1, 1) and produces an output tensor of dimension (batch_size, 3, 64, 64). I have differing results when using model t...
Laudianism asked 13/8, 2020 at 17:44

© 2022 - 2024 — McMap. All rights reserved.