text-classification - 4

2

I am working on a binary classification problem in Weka with a highly imbalanced data set (90% in one category and 10% in the other). I first applied SMOTE (http://www.cs.cmu.edu/afs/cs/project/jai...

machine-learning weka text-classification

Cartel asked 6/8, 2015 at 12:52

1

Solved

python textblob and text classification

I'm trying do build a text classification model with python and textblob, the script is runing on my server and in the future the idea is that users will be able to submit their text and it will be...

python nlp nltk text-classification textblob

Darees asked 24/11, 2015 at 1:35

5

Solved

Detecting random keyboard hits considering QWERTY keyboard layout

The winner of a recent Wikipedia vandalism detection competition suggests that detection could be improved by "detecting random keyboard hits considering QWERTY keyboard layout". Example: woijf qo...

algorithm n-gram qwerty text-classification

Douglassdougy asked 27/9, 2010 at 8:41

2

Large classification document corpus

Can anyone point me to some large corpus that I use for classification? But by large I don't mean Reuters or 20 newsgroups, I'm talking about a corpus of GB size, not 20MB or something like that. ...

dataset classification corpus text-classification

Coherent asked 27/8, 2015 at 10:17

1

Solved

How to use spark Naive Bayes classifier for text classification with IDF?

I want to convert text documents into feature vectors using tf-idf, and then train a naive bayes algorithm to classify them. I can easily load my text files without the labels and use HashingTF() ...

python apache-spark tf-idf text-classification apache-spark-mllib

Conjunctiva asked 26/8, 2015 at 15:43

2

Solved

Testing the NLTK classifier on specific file

The following code run Naive Bayes movie review classifier. The code generate a list of the most informative features. Note: **movie review** folder is in the nltk. from itertools import chain ...

python-2.7 nlp classification nltk text-classification

Leroi asked 27/3, 2015 at 13:34

1

Solved

How to train a naive bayes classifier with pos-tag sequence as a feature?

I have two classes of sentences. Each has reasonably distinct pos-tag sequence. How can I train a Naive-Bayes classifier with POS-Tag sequence as a feature? Does Stanford CoreNLP/NLTK (Java or Pyth...

machine-learning nltk stanford-nlp text-classification naivebayes

Caesarea asked 27/2, 2015 at 11:50

1

Solved

How to classify URLs? what are URLs features? How to select and Extract features from URL

I have just started to work on a Classification problem. Its a two class problem, My Trained model(Machine Learning) will have to decide/predict either to allow a URL or Block it. My Question is v...

url machine-learning classification feature-extraction text-classification

Gaudery asked 20/10, 2014 at 0:22

1

Solved

How to split data (raw text) into test/train sets with scikit crossvalidation module?

I have a large corpus of opinions (2500) in raw text. I would like to use scikit-learn library to split them into test/train sets. What could be the best aproach to solve this task with scikit-lear...

machine-learning scikit-learn classification cross-validation text-classification

Mordancy asked 11/9, 2014 at 17:44

3

Solved

Naive Bayes: Imbalanced Test Dataset

I am using scikit-learn Multinomial Naive Bayes classifier for binary text classification (classifier tells me whether the document belongs to the category X or not). I use a balanced dataset to tr...

python machine-learning classification scikit-learn text-classification

Ovum asked 23/6, 2014 at 13:25

1

Python text processing: AttributeError: 'list' object has no attribute 'lower'

I am new to Python and to Stackoverflow(please be gentle) and am trying to learn how to do a sentiment analysis. I am using a combination of code I found in a tutorial and here: Python - AttributeE...

python csv text-classification

Vespertine asked 23/5, 2014 at 23:26

2

Solved

Lexicon dictionary for synonym words

There are few dictionaries available for natural language processing. Like positive, negative words dictionaries etc. Is there any dictionary available which contains list of synonym for all dict...

dictionary nlp stanford-nlp data-processing text-classification

Auscultate asked 17/5, 2014 at 10:27

3

Dealing with class imbalance in multi-label classification

I've seen a few questions on class imbalance in a multiclass setting. However, I have a multi-label problem, so how would you deal with it in this case? I have a set of around 300k text examples. ...

machine-learning classification text-classification vowpalwabbit

Dorkas asked 9/12, 2013 at 0:55

1

Solved

Scikit learn - fit_transform on the test set

I am struggling to use Random Forest in Python with Scikit learn. My problem is that I use it for text classification (in 3 classes - positive/negative/neutral) and the features that I extract are ...

machine-learning classification scikit-learn random-forest text-classification

Arvonio asked 24/2, 2014 at 20:13

1

Solved

How to rank features by their importance in a Weka classifier?

I use Weka to successfully build a classifier. I would now like to evaluate how effective or important my features are. Fot this I use AttributeSelection. But I don't know how to ouput the differen...

machine-learning nlp weka feature-selection text-classification

Interne asked 21/1, 2014 at 20:5

2

Solved

N-grams vs other classifiers in text categorization

I'm new to text categorization techniques, I want to know the difference between the N-gram approach for text categorization and other classifier (decision tree, KNN, SVM) based text categorization...

machine-learning data-mining classification n-gram text-classification

Matthiew asked 1/12, 2013 at 18:54

1

Naive Bayes probability always 1

I started using sklearn.naive_bayes.GaussianNB for text classification, and have been getting fine initial results. I want to use the probability returned by the classifier as a measure of confiden...

python scikit-learn text-classification

Barbaresi asked 5/8, 2013 at 14:5

1

Solved

Natural Language Processing - Converting Text Features Into Feature Vectors

So I've been working on a natural language processing project in which I need to classify different styles of writing. Assuming that semantic features from texts have already been extracted for me,...

java nlp svm text-classification

Juicy asked 29/5, 2013 at 20:54

text-classification Questions

Recommended topics

Hot tags