tidytext R in spanish - any alternative?
Asked Answered
H

3

5

I'm doing sentiment analysis from twitter but my tweets are on Spanish so I can't use tidytext to classify the words. Does anyone know if there is a similar package for Spanish?

Homebody answered 2/11, 2017 at 12:21 Comment(1)
May be It is not the best option, but I suggest translate to English using translateR package and then classify with tidytext. However if you control a little machine learning algorithms. There are several algorithms of text clustering.Monti
A
4

There are not a lot of good open source options for sentiment lexicons in non-English languages right now, unfortunately. You can request the NRC lexicon in other languages from the authors; it is translated by Google Translate (which of course adds uncertainty but has shown to be mostly OK overall) and the authors say they give it away for research purposes but will charge for commercial use.

Averse answered 2/11, 2017 at 20:7 Comment(0)
S
4

I run into the same issue with Non-English textmining. I found udpipe which is an r package developed by Bnosac. It is a Natural Language Processing toolkit that provides language-agnostic 'tokenization', 'parts of speech tagging', 'lemmatization', 'morphological feature tagging' and 'dependency parsing' of raw text. Beware that there are no sentiment tags in the package. Those you will need to find elsewhere.

It supports a diverse range of non-English languages.

You can find out more on their blog, on the webpage of udpipe or on github

P.S. I have no affiliation with them.

Skylight answered 8/2, 2018 at 14:6 Comment(0)
E
0

The Stanford Core NLP package is on cran and provides also the sentiment for spanish with the get_sentiment function

Edette answered 11/12, 2020 at 22:43 Comment(0)

© 2022 - 2024 — McMap. All rights reserved.