data-science Questions
2
Solved
I have the following code to find the mean of the ages in the dataframe.
let df = df! [
"name" => ["panda", "polarbear", "seahorse"],
"age" =&...
Agosto asked 12/11, 2022 at 11:12
2
The 'find in selection' button is missing from VSCode when working with Jupyter Notebooks. It slows down development so I would like to ask if anybody knows how to activate it?
First image shows th...
Kirbykirch asked 4/10, 2021 at 9:45
2
Solved
I am trying to decompose a 3D matrix using python library scikit-tensor. I managed to decompose my Tensor (with dimensions 100x50x5) into three matrices. My question is how can I compose the initia...
Iatrogenic asked 28/9, 2016 at 12:58
5
Solved
Working with census data, I want to replace NaNs in two columns ("workclass" and "native-country") with the respective modes of those two columns. I can get the modes easily:
mode = df.filter(["wo...
Suffrage asked 18/3, 2017 at 4:42
5
Solved
I want to to use numbers to indicate references in footnotes, so I was wondering inside of Jupyter Notebook how can I use superscripts and subscripts?
Herbalist asked 2/9, 2017 at 8:8
6
Solved
I am trying to do a naive Bayes and after loading some data into a dataframe in Pandas, the describe function captures the data I want. I'd like to capture the mean and std from each column of the ...
Permenter asked 27/1, 2019 at 22:45
7
Solved
I have a dataframe that has a row called "Hybridization REF". I would like to filter so that I only get the data for the items that have the same label as one of the items in my lis...
Whitebait asked 11/7, 2017 at 16:45
4
I have used a fresh anaconda install to download and install all the required modules for osnmx library but I got the following error:
Cheeky asked 9/1, 2020 at 6:14
1
I am trying to a clear picture of how they are interconnected and if the use of one always require the use of the other. If you could give a non-technical definition or explanation of each of them,...
Dogbane asked 29/9, 2022 at 9:2
5
Solved
I have dataframe:
subject A_target_word_gd A_target_word_fd B_target_word_gd B_target_word_fd subject_type
1 1 2 3 4 mild
2 11 12 13 14 moderate
And I want to melt it to a dataframe that wi...
Reword asked 1/1, 2020 at 7:44
2
I am using the below command while installing surprise package. I have got error messages while installing and I am not able to understand. I need help to install this package successfully.
pip ins...
Snapper asked 12/1, 2021 at 7:10
4
Solved
I'm reading a book on Data Science for Python and the author applies 'sigma-clipping operation' to remove outliers due to typos. However the process isn't explained at all.
What is sigma clipping?...
Glennglenna asked 14/8, 2017 at 3:16
2
Solved
I find it hard to explain with words what I want to achieve, so please don't judge me for showing a simple example instead. I have a table that looks like this:
main_col
some_metadata
value
...
Sphenoid asked 27/9, 2022 at 13:51
3
Solved
I have a dataset that contains the NBA Player's average statistics per game. Some player's statistics are repeated because of they've been in different teams in season.
For example:
Player Pos Age...
Footling asked 15/8, 2021 at 15:24
2
Solved
I'm sorry if the title of the question is not that clear, I could not sum up the problem in one line.
Here are the simplified datasets for an explanation. Basically, the number of categories in t...
Tuber asked 13/12, 2017 at 6:11
4
I want to get real time predictions using my machine learning model with the help of SageMaker. I want to directly get inferences on my website. How can I use the deployed model for predictions?
Mellman asked 21/11, 2018 at 4:57
9
Solved
Simply speaking, how to apply quantile normalization on a large Pandas dataframe (probably 2,000,000 rows) in Python?
PS. I know that there is a package named rpy2 which could run R in subprocess,...
Mongol asked 21/6, 2016 at 5:1
2
Solved
I want to count the number of unique values in each column and select only those columns which have less than 32 unique values.
I tried using
df.filter(nunique<32)
and
df[[ c for df.column...
Costly asked 24/6, 2019 at 16:27
9
I have small problem when exporting static chart using plotly.
Plotly does not correctly recognize that I have orca installed and I have still error related to missing orca. I try to change the or...
Plasia asked 20/10, 2019 at 14:10
4
Solved
What is the difference between MinMaxScaler() and StandardScaler().
mms = MinMaxScaler(feature_range = (0, 1)) (Used in a machine learning model)
sc = StandardScaler() (In another machine learning ...
Cubby asked 9/7, 2018 at 2:42
5
Solved
I'm trying to calculate the Davies-Bouldin Index in Python.
Here are the steps the code below tries to reproduce.
5 Steps:
For each cluster, compute euclidean distances between each point to the c...
Trophozoite asked 30/12, 2017 at 18:8
2
Solved
I am trying to install the CUDA toolkit in order to be able to use Thundersvm in my personal computer.
However I keep getting the following message in the GUI installer:
"You already have a ne...
Fessler asked 27/1, 2021 at 18:41
2
Solved
When I change/add a variable to my config.py file and then try to import it to my Jupyter Notebook I get:
ImportError: cannot import name 'example_var' from 'config'
config.py:
example_var = 'exa...
Corum asked 20/1, 2021 at 17:8
1
from sklearn import datasets, linear_model
from sklearn.model_selection import cross_val_predict
iris = datasets.load_iris()
X = iris.data[:150]
y = iris.target[:150]
lasso = linear_model.Las...
Earlie asked 18/3, 2021 at 7:12
4
Solved
I'm using google Colab notebook for a project that requires me to plot GPS coordinates on a map. I want to use basemap for this purpose. I tried to import it on the Colab notebook by using
from mpl...
Ozonide asked 10/2, 2019 at 6:38
© 2022 - 2024 — McMap. All rights reserved.