tidytext Questions

1

Solved

I want to cluster similar words using R and the tidytext package. I have created my tokens and would now like to convert them to a matrix in order to cluster them. I would like to try out a nu...
Fishwife asked 3/2, 2021 at 15:48

3

I'm doing sentiment analysis on Twitter data, but my tweets are in Spanish, so I can't use tidytext to classify the words. Does anyone know if there is a similar package for Spanish?
Homebody asked 2/11, 2017 at 12:21

5

I have a column with string content temp <- c(NA, NA, "grocery pantry all offers", NA, "grocery offers today low price", "grocery offers today low price", "tide soap", "tide soap bar", "tide ...
Timikatiming asked 13/2, 2020 at 12:48

3

Solved

First off - I am a beginner at programming and R, so excuse me if this is a silly question. I am having trouble viewing more than ten rows in a tibble that is generated from the following cod...
Nelly asked 6/3, 2018 at 2:5

2

I am using the tidytext package in R to do n-gram analysis. Since I analyze tweets, I would like to preserve @ and # to capture mentions, retweets, and hashtags. However, the unnest_tokens function automa...
Ghirlandaio asked 12/6, 2017 at 23:23

2

Solved

This is most likely a stupid question, but I've googled and googled and can't find a solution. I think it's because I don't know the right way to word my question to search. I have a data frame th...
Pansir asked 13/10, 2017 at 16:44

2

I am using this example to conduct sentiment analysis of a collection of txt documents in R. The code is: library(tm) library(tidyverse) library(tidytext) library(glue) library(stringr) library(dp...
Hogg asked 12/6, 2018 at 15:40

2

Solved

I am trying to make a graph with bars ordered by frequency, and also to separate two variables using facets. Words have to be ordered by the value given in the 'n' variable. So, ...
Cauchy asked 16/5, 2018 at 16:51

3

Solved

Is there any way to replace a range of numbers with single numbers in a character string? Numbers can range from n-n, most probably around 1-15; 4-10 is also possible. The range could be indicated wi...
Silvia asked 18/3, 2018 at 3:12

1

Solved

I am trying to work on a tf-idf weighted corpus (where I expect tf to be a proportion by document rather than a simple count). I would expect the same values to be returned by all the classic text mini...
Mangosteen asked 15/2, 2018 at 11:56

1

Solved

I'm trying to use both bigrams and trigrams with tidytext. What code could I use for the token to look for 2 and 3 words? This is the code for using bigrams only: library(tidytext) library(ja...
Angarsk asked 13/8, 2017 at 18:21

1

Solved

I have an R dataframe (df) that looks like this:
blogger; word; n; total
joe; dorothy; 17; 718
paul; sheriff; 10; 354
joe; gray; 9; 718
joe; toto; 9; 718
mick; robin; 9; 607
paul; robin; 9; 354
.....
Gantz asked 18/4, 2017 at 15:2

1

Solved

I have data in a normalised, tidy "long" data structure I want to upload to H2O and if possible analyse on a single machine (or have a definitive finding that I need more hardware and software than...
Profligate asked 27/12, 2016 at 6:26