tidytext Questions
1
Solved
I want to cluster words that are similar using R and the tidytext package.
I have created my tokens and would now like to convert it to a matrix in order to cluster it. I would like to try out a nu...
Fishwife asked 3/2, 2021 at 15:48
3
I'm doing sentiment analysis from twitter but my tweets are on Spanish so I can't use tidytext to classify the words. Does anyone know if there is a similar package for Spanish?
Homebody asked 2/11, 2017 at 12:21
5
I have a column with string content
temp <- c(NA, NA, "grocery pantry all offers", NA, "grocery offers today low price",
"grocery offers today low price", "tide soap", "tide soap bar",
"tide ...
3
Solved
First off - I am a beginner at programming and R, so excuse me if this is a silly question. I am having trouble viewing more than ten rows in a tibble that is generated from the following cod...
2
I am using tidytext package in R to do n-gram analysis.
Since I analyze tweets, I would like to preserve @ and # to capture mentions, retweets, and hashtags. However, unnest_tokens function automa...
Ghirlandaio asked 12/6, 2017 at 23:23
2
Solved
This is most likely a stupid question, but I've googled and googled and can't find a solution. I think it's because I don't know the right way to word my question to search.
I have a data frame th...
2
I am using this example to conduct sentiment analysis of a collection of txt documents in R. The code is:
library(tm)
library(tidyverse)
library(tidytext)
library(glue)
library(stringr)
library(dp...
Hogg asked 12/6, 2018 at 15:40
2
Solved
I am trying to make a graph with ordered bars according to frequency and also using a variable two separate two variables using facets. Words have to be ordered by value given in 'n' variable. So, ...
3
Solved
Is there any way to replace range of numbers wih single numbers in a character string? Number can range from n-n, most probably around 1-15, 4-10 ist also possible.
the range could be indicated wi...
1
Solved
I am trying to work on tf-idf weighted corpus (where I expect tf to be a proportion by document rather than simple count). I would expect the same values to be returned by all the classic text mini...
Mangosteen asked 15/2, 2018 at 11:56
1
Solved
I'm trying to use both a bigram and a trigram using tidytext. What code could I use for the token to look for 2 and 3 words.
This is the code for using bigrams only:
library(tidytext)
library(ja...
1
Solved
I have an R dataframe (df) that looks like this:
blogger; word; n; total
joe; dorothy; 17; 718
paul; sheriff; 10; 354
joe; gray; 9; 718
joe; toto; 9; 718
mick; robin; 9; 607
paul; robin; 9; 354
.....
1
Solved
I have data in a normalised, tidy "long" data structure I want to upload to H2O and if possible analyse on a single machine (or have a definitive finding that I need more hardware and software than...
Profligate asked 27/12, 2016 at 6:26
1
© 2022 - 2024 — McMap. All rights reserved.