summarize - McMap

5

R dplyr summarise multiple functions to selected variables

I have a dataset for which I want to summarise by mean, but also calculate the max to just 1 of the variables. Let me start with an example of what I would like to achieve: iris %>% group_by(...

r dplyr summarize

Piapiacenza asked 12/12, 2016 at 20:56

4

Solved

Percentage of total counts by group in R

I'm trying to create an output that calculates the percentage of counts, out of total counts (in a data frame), by factor level, but can't seem to figure out how to retain the grouping structure in...

r dplyr summarize

Variegation asked 22/12, 2023 at 19:20

6

Solved

How to interpret dplyr message `summarise()` regrouping output by 'x' (override with `.groups` argument)?

I started getting a new message (see post title) when running group_by and summarise() after updating to dplyr development version 0.8.99.9003. Here is an example to recreate the output: library(...

r dplyr summarize

Slice asked 1/6, 2020 at 20:26

15

How to get summary statistics by group

I'm trying to get multiple summary statistics in R/S-PLUS grouped by categorical column in one shot. I found couple of functions, but all of them do one statistic per call, like aggregate(). data &...

r dplyr stat summarize r-faq

Fagaly asked 23/3, 2012 at 22:4

3

Solved

How to use R dplyr's summarize to count the number of rows that match a criteria?

I have a dataset that I want to summarize. First, I want the sum of the home and away games, which I can do. However, I also want to know how many outliers (defined as more than 300 points) are wit...

r dplyr subset counting summarize

Nations asked 19/4, 2022 at 12:20

2

How to use dplyr to calculate a weighted mean of two grouped variables

I know this must be super easy, but I'm having trouble finding the right dplyr commands to do this. Let's say I want to group a dataset by two variables, and then summarize the count for each row. ...

r dplyr weighted-average summarize split-apply-combine

Extravagancy asked 24/4, 2018 at 1:15

3

R - dplyr Summarize and Retain Other Columns

I am grouping data and then summarizing it, but would also like to retain another column. I do not need to do any evaluations of that column's content as it will always be the same as the group_by ...

r dplyr summarize

Ironhanded asked 23/8, 2016 at 3:58

3

Solved

dplyr: group_by and summarize to collapse (via concatenation) columns of strings that contain NA

I have a relatively straightforward question that I've been unable to find a solution for. Suppose I have the following dataset: ID dummy_var String1 String2 String3 1 0 Tom NA NA 1 1 NA ...

r dplyr summarize

Gavel asked 20/7, 2021 at 18:52

3

Solved

What is the pandas equivalent of dplyr summarize/aggregate by multiple functions?

I want to convert my R code using dplyr package into pandas where I group-by and perform multiple summarizations. Here is my current code: import pandas as pd data = pd.DataFrame( {'col1':[1,1,1,1...

python r pandas group-by summarize

Selffertilization asked 13/8, 2016 at 18:3

4

Solved

Pass column names as strings to group_by and summarize

With dplyr starting version 0.7 the methods ending with underscore such as summarize_ group_by_ are deprecated since we are supposed to use quosures. See: https://cran.r-project.org/web/packages/d...

r dplyr summarize rlang quosure

Bug asked 24/10, 2017 at 19:18

3

Solved

tidyverse summarize multiple columns but show result as rows

I have data where I want to get a bunch of summary statistics for multiple columns with the tidyverse approach. However, utilizing tidyverse's summarize function, it will create each column statist...

r dplyr tidyr summarize

Radbourne asked 27/5, 2020 at 11:40

2

Solved

r summarize_if with multiple conditions

I'm trying to reduce a df of observations to a single observation (single line). I would like to summarize_if is numeric with the mean and if is string or factor with the mode. The code below doesn...

r dplyr mode reduction summarize

Carlyn asked 6/5, 2020 at 15:7

2

Solved

How to use "summarise" from dplyr with dynamic column names?

I am summarizing group means from a table using the summarize function from the dplyr package in R. I would like to do this dynamically, using a column name string stored in another variable. The ...

r dplyr summarize

Itemized asked 30/1, 2020 at 13:25

3

Solved

Summarize with mathematical conditions in dplyr

Building on this question: Summarize with conditions in dplyr I would like to use dplyr to summarize a column based on a mathematical condition (not string matching as in the linked post). I need t...

r dplyr conditional-statements summarize

Ewold asked 5/12, 2019 at 16:18

1

Solved

Using R & dplyr to summarize - group_by, count, mean, sd

I am fairly new to R and even newer to dplyr. I have a small data set comprised of 2 columns - var1 and var2. The var1 column is comprised of num values. The var2 column is comprised of factors wit...

r dplyr summarize

Consummate asked 25/7, 2019 at 4:18

3

Solved

Summarise? Count occurences in column based on another column

I believe this may have a simple solution but I'm having trouble describing what I need to do (and hence what to search for). I think I need the summarize function. My goal output is at the very bo...

r dplyr summarize

Zeniazenith asked 1/3, 2019 at 16:21

2

Solved

summarise returning -inf when using na.rm = TRUE

I recently built a simple R script to summarize three different data frames. Since updating to the newest version of R and R Studio, I am running into an output I haven't seen before when using the...

r dplyr summarize

Infelicity asked 18/9, 2017 at 23:29

1

Solved

Summarize (counts) by column efficiently

I have a big table similar to datadf with 3000 thousand columns and rows, I saw some methods to obtain my expected summary in stack overflow (Frequency of values per column in table), but even the ...

r dataframe data.table summarize

Rienzi asked 11/9, 2018 at 19:53

3

Solved

Pandas: Get per-year counts for Dateranges spanning multiple years

I have a dataframe with records spanning multiple years: WarName | StartDate | EndDate --------------------------------------------- 'fakewar1' 01-01-1990 02-02-1995 'examplewar' 05-01-1990 03-0...

python pandas date-arithmetic summarize

Attestation asked 20/5, 2018 at 23:9

1

Solved

Why does `summarize` drop a group?

I'm fooling around with babynames pkg. A group_by command works, but after the summarize, one of the groups is dropped from the group list. library(babynames) babynames[1:10000, ] %>% group_by(...

r group-by dplyr summarize

Clintonclintonia asked 28/1, 2018 at 17:23

2

Solved

Using dplyr to summarize and keep the same variable name

I have found that data.table and dplyr have differing results when trying to do the same thing. I would like to use dplyr syntax, but have it compute in the way that data.table does. The use case i...

r variables dplyr data.table summarize

Fullfledged asked 20/1, 2018 at 15:32

1

Solved

Find difference between grouped values in dplyr

I want to find the difference between the cases that were observed and those that were not by type of case: set.seed(42) df <- data.frame(type = factor(rep(c("A", "B", "C"), 2)), observed = rep...

r dplyr grouping summarize

Brazee asked 9/10, 2017 at 9:2

3

Solved

Applying group_by and summarise(sum) but keep columns with non-relevant conflicting data?

My question is very similar to Applying group_by and summarise on data while keeping all the columns' info but I would like to keep columns which get excluded because they conflict after groupi...

r group-by tidyverse dplyr summarize

Subbasement asked 3/10, 2017 at 21:13

4

Solved

Define and apply custom bins on a dataframe

Using python I have created following data frame which contains similarity values: cosinFcolor cosinEdge cosinTexture histoFcolor histoEdge histoTexture jaccard 1 0.770 0.489 0.388 0.57500000 0.5...

r dataframe binning summarize

Chibcha asked 15/8, 2012 at 2:50

1

Solved

tidyverse: count number of a specific level when summarizing

I would like, when summarizing after grouping, to count the number of a specific level of another factor. In the working example below, I would like to count the number of "male" levels in each g...

r group-by dplyr tidyverse summarize

Dactylology asked 22/3, 2017 at 14:47

summarize Questions

Recommended topics

Hot tags