kaggle Questions

2

Solved

What is the difference between feval and eval_metric in xgb.train? Both parameters are only for evaluation purposes. A post from Kaggle gives some insight: https://www.kaggle.com/c/prudential-life-insur...
Kuhn asked 20/10, 2016 at 16:8

2

Solved

The problem is really strange, because that piece of code worked fine with another dataset. The full code: import numpy as np import pandas as pd import xgboost as xgb from sklearn.cross_validation...
Textualist asked 17/3, 2016 at 15:54

1

Solved

Here's code from a Kaggle Titanic competition kernel: grid = sns.FacetGrid(train_df, row='Embarked', size=2.2, aspect=1.6) grid.map(sns.pointplot, 'Pclass', 'Survived', 'Sex', palette='deep') grid.ad...
Grandeur asked 24/10, 2017 at 18:18

2

Solved

By adding PCA to the algorithm, I'm working to improve the 96.5% sklearn kNN prediction score for the Kaggle digit recognition tutorial, yet the new kNN predictions based on the PCA output are horrible, around 23%. ...
Lattie asked 24/1, 2014 at 11:11

0

I just thought that caching is the solution. I am trying to download an extremely big file on a slightly slow and unstable connection. Today I managed to download 12GB at very high financial cost...
Durman asked 8/8, 2017 at 21:58

3

Solved

I am trying to plot a seaborn histogram on a Kaggle notebook in this way: sns.distplot(myseries, bins=50, kde=True) but I get this error: TypeError: slice indices must be integers or None or h...
Lattimore asked 16/5, 2017 at 19:5

2

I looked into the post on the same thing in Python, but I want a solution in R. I'm working on the Titanic dataset from Kaggle, and it looks like this: 'data.frame': 891 obs. of 13 vari...
Self asked 27/6, 2016 at 13:10

5

Solved

I'm taking some AI classes and have learned about some basic algorithms that I want to experiment with. I have gotten access to several data sets containing lots of great real-world data through Ka...
Permanganate asked 18/8, 2011 at 22:20

2

Solved

Trying to learn some stuff, I'm messing around with the global shark attack database on Kaggle and I'm trying to find the best way to lump strings using a lambda function and str.contains. Basical...
Meal asked 9/2, 2017 at 19:35

2

Solved

For a certain Kaggle dataset (rules prohibit me from sharing the data here, but it is readily accessible here), import pandas df_train = pandas.read_csv( "01 - Data/act_train.csv.zip" ) df_train.de...
Sholokhov asked 29/8, 2016 at 8:16

3

Solved

I am examining the imdb movie dataset in kaggle with R. Here is a minimal repro dataset: > movies <- data.frame(movie = as.factor(c("Movie 1", "Movie 2", "Movie 3", "Movie 4")), director =...
Ejector asked 13/9, 2016 at 12:59

1

I am new to all these methods and am trying to get a simple answer to that or perhaps if someone could direct me to a high level explanation somewhere on the web. My googling only returned kaggle s...
Caylacaylor asked 6/2, 2016 at 3:34

1

Solved

I am translating the following Kaggle code into Python 3.4. In the final lines, when outputting a CSV file, predictions_file = open("myfirstforest.csv", "wb") open_file_object = csv.writer(predict...
Pneumonectomy asked 30/1, 2016 at 9:51
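
The excerpt above is cut off, so the actual error is not visible. A common stumbling block when porting this pattern to Python 3, assumed here to be the issue, is that `csv.writer` expects a text-mode file, while `"wb"` opens the file in binary mode. A minimal sketch under that assumption follows; the helper name `write_predictions`, the header columns, and the sample rows are hypothetical and not taken from the original question.

```python
import csv
import os
import tempfile


def write_predictions(path, rows):
    """Write prediction rows to a CSV file under Python 3 (hypothetical helper)."""
    # Python 3's csv module works with text-mode files; newline="" lets the
    # writer control line endings itself and avoids blank lines on Windows.
    with open(path, "w", newline="") as predictions_file:
        writer = csv.writer(predictions_file)
        # Header columns are an assumption for illustration only.
        writer.writerow(["PassengerId", "Survived"])
        writer.writerows(rows)


if __name__ == "__main__":
    # Write to the temp directory so the example does not clutter the working dir.
    out_path = os.path.join(tempfile.gettempdir(), "myfirstforest.csv")
    write_predictions(out_path, [(892, 0), (893, 1)])
```

Using a `with` block also closes the file automatically, which the Python 2 style of calling `close()` by hand makes easy to forget.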

1

I am trying to solve this problem at Kaggle using Spark. The hierarchy of the input is like this: drivers/{driver_id}/trip#.csv e.g., drivers/1/1.csv drivers/1/2.csv drivers/2/1.csv I want to re...
Meda asked 25/6, 2015 at 12:59

1

Solved

Excuse me, I have a question on using caffe for hd data. I am trying to run an example on the Kaggle mnist csv data with the following steps: use h5py to convert it to h5 data. (I use the caffe-example....
Kkt asked 24/7, 2015 at 18:34

1

I'm new to Pandas. I downloaded and installed Anaconda. Then I tried running the following code via the Spyder app: import pandas as pd import numpy as np train = pd.read_csv('/Users/Ben/Document...
Sismondi asked 29/5, 2015 at 0:59

1

Solved

I'm currently practicing R on Kaggle using the Titanic data set. I am using the random forest algorithm. Below is the code: fit <- randomForest(as.factor(Survived) ~ Pclass + Sex + Age_Bucket ...
Corcovado asked 10/5, 2015 at 13:30

1

Solved

I am extracting 30 facial keypoints (x,y) from an input image as per the Kaggle facial keypoints competition. How do I set up caffe to run a regression and produce a 30-dimensional output? Input: 96x9...
Fazio asked 21/1, 2015 at 1:8

1

Solved

I have a folder of JPG images that I'm trying to classify for a kaggle competition. I have seen some code in Python that I think will accomplish this on the forums, but was wondering is it possible...
Pietrek asked 15/12, 2014 at 17:46

2

Solved

I'm in the process of attempting to learn to work with neural networks in R. As a learning problem, I've been using the following problem over at Kaggle: Don't worry, this problem is specifically ...
Zoom asked 3/7, 2013 at 19:57

1

Solved

I was trying to narrow down the number of features really relevant for my classifier using rfecv. This is the code I have written import sklearn import pandas as p import numpy as np import ...
Rambow asked 27/11, 2013 at 5:48

2

Solved

I wrote the following R code that identifies duplicate files in a directory. How can one vectorize the for-loop using the plyr package (or similar)? I would like to achieve a more idiomatic R solut...
Keratogenous asked 27/12, 2012 at 19:50

1

Solved

Let me start by saying I have no experience with R, KNN or data science in general. I recently found Kaggle and have been playing around with the Digit Recognition competition/tutorial. In this tu...
Suntan asked 28/8, 2012 at 5:27

3

Solved

I grabbed the KDD track1 dataset from Kaggle and decided to load a ~2.5GB 3-column CSV file into memory, on my 16GB high-memory EC2 instance: data = np.loadtxt('rec_log_train.txt') the python s...
Rattat asked 22/4, 2012 at 2:35
