cross-validation

3

Cross validation in deep neural networks

How do you perform cross-validation in a deep neural network? I know that to perform cross-validation you train it on all folds except one and test it on the excluded fold. Then do this for k f...

tensorflow deep-learning cross-validation

Disk asked 10/6, 2017 at 16:39
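
The procedure this question describes (train on all folds but one, test on the held-out fold, repeat for every fold) can be sketched without any framework. This is only an illustration, not the asker's code: `k_fold_indices`, `cross_validate`, and `dummy_score` are hypothetical names, and in a real deep-learning run the scoring callback would build and fit a fresh model for each fold.

```python
import random


def k_fold_indices(n_samples, k, seed=0):
    """Yield (train_idx, test_idx) pairs for k roughly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # shuffle once, reproducibly
    # Fold i takes every k-th shuffled index starting at offset i.
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test_idx = folds[i]
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train_idx, test_idx


def cross_validate(n_samples, k, train_and_score):
    """Call train_and_score(train_idx, test_idx) per fold; return the mean score."""
    scores = [train_and_score(tr, te) for tr, te in k_fold_indices(n_samples, k)]
    return sum(scores) / len(scores)


def dummy_score(train_idx, test_idx):
    """Placeholder scorer (assumption): stands in for 'fit a model, then evaluate'."""
    return len(test_idx) / (len(train_idx) + len(test_idx))


mean_score = cross_validate(10, 5, dummy_score)
```

Each sample appears in exactly one test fold, so every observation is used for evaluation once and for training k - 1 times.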

4

Python : GridSearchCV taking too long to finish running

I'm attempting to do a grid search to optimize my model but it's taking far too long to execute. My total dataset is only about 15,000 observations with about 30-40 variables. I was successfully ab...

python machine-learning scikit-learn data-science cross-validation

Balanchine asked 3/5, 2022 at 14:51

10

Solved

sklearn cross_val_score() returns NaN values

I'm trying to predict the next customer purchase for my job. I followed a guide, but when I tried to use the cross_val_score() function, it returned NaN values. [Google Colab notebook screenshot] Variables: ...

python nan prediction cross-validation sklearn-pandas

Sorely asked 11/2, 2020 at 15:36

4

Confusing example of nested cross validation in scikit-learn

I'm looking at this example from the scikit-learn documentation: http://scikit-learn.org/0.18/auto_examples/model_selection/plot_nested_cross_validation_iris.html It seems to me that cross-validation i...

scikit-learn cross-validation

Nealon asked 13/12, 2016 at 18:20

3

Nested cross-validation example on Scikit-learn

I'm trying to wrap my head around the example of Nested vs. Non-Nested CV in Sklearn. I checked multiple answers but I am still confused about the example. To my knowledge, a nested CV aims to use a d...

python scikit-learn nested cross-validation grid-search

Fatback asked 6/10, 2017 at 10:18

2

Solved

Evaluating Logistic regression with cross validation

I would like to use cross validation to test/train my dataset and evaluate the performance of the logistic regression model on the entire dataset and not only on the test set (e.g. 25%). These co...

python scikit-learn logistic-regression cross-validation

Observant asked 26/8, 2016 at 9:46

4

Solved

Classification report with Nested Cross Validation in SKlearn (Average/Individual values)

Is it possible to get a classification report from cross_val_score through some workaround? I'm using nested cross-validation and I can get various scores here for a model, however, I would like to s...

machine-learning scikit-learn classification cross-validation

Downey asked 2/3, 2017 at 17:33

5

Solved

How is scikit-learn cross_val_predict accuracy score calculated?

Does cross_val_predict (see doc, v0.18) with the k-fold method shown in the code below calculate accuracy for each fold and then average them, or not? cv = KFold(len(labels), n_folds=20) clf...

python scikit-learn cross-validation

Amari asked 4/1, 2017 at 7:57

1

Solved

Using Sample Weights through metadata routing in scikit-learn in nested cross-validation

I am using the sklearn version "1.4.dev0" to weight samples in the fitting and scoring process as described in this post and in this documentation. https://scikit-learn.org/dev/metadata_r...

python scikit-learn cross-validation

Fabrizio asked 21/11, 2023 at 12:47

4

Solved

ValueError: X has 24 features, but DecisionTreeClassifier is expecting 19 features as input

I'm trying to reproduce this GitHub project on my machine, on Topological Data Analysis (TDA). My steps: get best parameters from a cross-validation output; load my dataset; feature selection; extrac...

python cross-validation decision-tree topological-sort

Sampling asked 15/1, 2021 at 20:17

3

Solved

Not able to use Stratified-K-Fold on multi label classifier

The following code is used to do KFold validation, but I am unable to train the model as it is throwing the error ValueError: Error when checking target: expected dense_14 to have shape (7,) but got array...

keras scikit-learn deep-learning cross-validation

Marcasite asked 26/2, 2019 at 17:19

4

Creating train/test/val split with StratifiedKFold

I'm trying to use StratifiedKFold to create train/test/val splits for use in a non-sklearn machine learning workflow. So, the DataFrame needs to be split and then stay that way. I'm trying to do ...

python pandas scikit-learn cross-validation data-science

Groome asked 20/7, 2017 at 17:54

2

Sklearn: Cross validation for grouped data

I am trying to implement a cross-validation scheme on grouped data. I was hoping to use the GroupKFold method, but I keep getting an error. What am I doing wrong? The code (slightly different from ...

python scikit-learn cross-validation

Explicate asked 1/11, 2016 at 23:06
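
For readers unfamiliar with grouped cross-validation, the defining property is that all samples sharing a group label end up on the same side of every split. The sketch below illustrates that property only; `group_k_fold` is a hypothetical helper with a simple round-robin assignment, not scikit-learn's GroupKFold (which uses its own assignment strategy), and the toy `groups` list is made up.

```python
def group_k_fold(groups, k):
    """Yield (train_idx, test_idx) so that no group spans both train and test."""
    unique = sorted(set(groups))
    if k > len(unique):
        raise ValueError("k cannot exceed the number of distinct groups")
    # Assign each distinct group to a fold round-robin (a simplification).
    fold_of = {g: i % k for i, g in enumerate(unique)}
    for fold in range(k):
        test_idx = [i for i, g in enumerate(groups) if fold_of[g] == fold]
        train_idx = [i for i, g in enumerate(groups) if fold_of[g] != fold]
        yield train_idx, test_idx


# Toy group labels, one per sample (assumption for illustration).
groups = ["a", "a", "b", "b", "c", "c", "c"]
splits = list(group_k_fold(groups, 3))
```

A common source of errors with grouped splitters is passing a group array whose length differs from the number of samples, so it is worth checking that first.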

0

xgb.cv creates Warning: Empty dataset at worker: 0

I am using an XGBClassifier and trying to do a grid search in order to tune some parameters, and I get this warning: WARNING: ../src/learner.cc:1517: Empty dataset at worker: 0 whenever I launch the c...

python cross-validation xgbclassifier

Scald asked 9/2, 2023 at 12:33

2

How to Use KFold Cross Validation Output as CNN Input for Image Processing?

I'm trying to use a Convolutional Neural Network (CNN) for image classification, and I want to use KFold cross-validation for training and testing. I'm new to this and I don't really understand how t...

python image-processing conv-neural-network cross-validation

Binah asked 15/5, 2019 at 19:15

2

Solved

How to compute precision, recall and F1 score of an imbalanced dataset for K-fold cross-validation?

I have an imbalanced dataset for a binary classification problem. I have built a Random Forest classifier and used k-fold cross-validation with 10 folds. kfold = model_selection.KFold(n_splits...

python scikit-learn random-forest cross-validation supervised-learning

Crimmer asked 6/10, 2017 at 4:29
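
One way to approach this question is to pool the confusion counts over all folds and compute the metrics once, which avoids undefined per-fold values when a fold happens to contain few positives. The following is a minimal sketch under that assumption; `pooled_prf` and the toy fold data are illustrative and not taken from the question, and averaging per-fold scores is an equally common alternative.

```python
def pooled_prf(fold_results, positive=1):
    """Compute precision, recall and F1 from (y_true, y_pred) pairs pooled over folds."""
    tp = fp = fn = 0
    for y_true, y_pred in fold_results:
        for t, p in zip(y_true, y_pred):
            if p == positive and t == positive:
                tp += 1  # predicted positive, actually positive
            elif p == positive:
                fp += 1  # predicted positive, actually negative
            elif t == positive:
                fn += 1  # predicted negative, actually positive
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Toy held-out labels and predictions for two folds (made-up data).
folds = [
    ([0, 0, 0, 1], [0, 0, 1, 1]),
    ([0, 0, 1, 0], [0, 0, 0, 0]),
]
precision, recall, f1 = pooled_prf(folds)
```

With imbalanced data, stratified folds are often preferred so that each fold keeps roughly the same class ratio.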

8

Solved

How to extract model hyper-parameters from spark.ml in PySpark?

I'm tinkering with some cross-validation code from the PySpark documentation, and trying to get PySpark to tell me what model was selected: from pyspark.ml.classification import LogisticRegression...

pyspark modeling cross-validation apache-spark-mllib apache-spark-ml

Limey asked 18/4, 2016 at 14:46

2

Solved

Why does calling the KFold generator with shuffle give the same indices?

With sklearn, when you create a new KFold object and shuffle is true, it produces different, newly randomized fold indices. However, every generator from a given KFold object gives the same ind...

python scikit-learn cross-validation

Communicate asked 22/1, 2016 at 6:36

6

Solved

How to split data on balanced training set and test set on sklearn

I am using sklearn for a multi-class classification task. I need to split alldata into train_set and test_set. I want to randomly take the same number of samples from each class. Actually, I am using this funct...

machine-learning scikit-learn svm cross-validation

Bodine asked 18/2, 2016 at 4:13

2

Solved

Saving a cross-validation trained model in Scikit

I have trained a model in scikit-learn using cross-validation and a Naive Bayes classifier. How can I persist this model to later run against new instances? Here is simply what I have, I can get the...

python scikit-learn pickle cross-validation

Confab asked 21/9, 2015 at 17:02

4

Solved

Return coefficients from Pipeline object in sklearn

I've fit a Pipeline object with RandomizedSearchCV: pipe_sgd = Pipeline([('scl', StandardScaler()), ('clf', SGDClassifier(n_jobs=-1))]) param_dist_sgd = {'clf__loss': ['log'], 'clf__penalty': [N...

python machine-learning scikit-learn cross-validation scikit-learn-pipeline

Ossieossietzky asked 8/5, 2017 at 19:56

2

Solved

Using GridSearchCV for RandomForestRegressor

I'm trying to use GridSearchCV for RandomForestRegressor, but always get ValueError: Found array with dim 100. Expected 500. Consider this toy example: import numpy as np from sklearn import ense...

python scikit-learn random-forest cross-validation

Gustation asked 11/1, 2015 at 18:14

6

Solved

Using explicit (predefined) validation set for grid search with sklearn

I have a dataset, which has previously been split into 3 sets: train, validation and test. These sets have to be used as given in order to compare the performance across different algorithms. I wo...

python validation scikit-learn cross-validation

Robenarobenia asked 11/8, 2015 at 18:03

3

Scikit-learn, GroupKFold with shuffling groups?

I was using StratifiedKFold from scikit-learn, but now I also need to account for "groups". There is a nice function, GroupKFold, but my data are very time-dependent. So, similarly to the help, i.e. number o...

python scikit-learn shuffle cross-validation

Reception asked 26/11, 2016 at 14:52

1

How do I get misclassified instances and their indices for each fold cross validation in python?

from sklearn import datasets, linear_model from sklearn.model_selection import cross_val_predict iris = datasets.load_iris() X = iris.data[:150] y = iris.target[:150] lasso = linear_model.Las...

python-3.x data-science cross-validation

Earlie asked 18/3, 2021 at 7:12
