word2vec - McMap

5

How to Train GloVe algorithm on my own corpus

I tried to follow this. But some how I wasted a lot of time ending up with nothing useful. I just want to train a GloVe model on my own corpus (~900Mb corpus.txt file). I downloaded the files provi...

nlp stanford-nlp gensim word2vec glove

Statecraft asked 24/2, 2018 at 11:10

5

Solved

How to load a pre-trained Word2vec MODEL File and reuse it?

I want to use a pre-trained word2vec model, but I don't know how to load it in python. This file is a MODEL file (703 MB). It can be downloaded here: http://devmount.github.io/GermanWordEmbeddings...

python file model word2vec gensim

Skyway asked 17/9, 2016 at 16:40

4

Gensim 3.8.0 to Gensim 4.0.0

I have trained a Word2Vec model using Gensim 3.8.0. Later I tried to use the pretrained model using Gensim 4.0.o on GCP. I used the following code: model = KeyedVectors.load_word2vec_format(wv_path...

python nlp gensim word2vec word-embedding

Villous asked 30/3, 2021 at 9:28

2

Solved

What meaning does the length of a Word2vec vector have?

I am using Word2vec through gensim with Google's pretrained vectors trained on Google News. I have noticed that the word vectors I can access by doing direct index lookups on the Word2Vec object ar...

python nlp gensim word2vec

Perrone asked 16/3, 2016 at 11:31

2

Solved

Get weight matrices from gensim word2Vec

I am using gensim word2vec package in python. I would like to retrieve the W and W' weight matrices that have been learn during the skip-gram learning. It seems to me that model.syn0 gives me the f...

python machine-learning nlp word2vec gensim

Foreshadow asked 15/12, 2016 at 11:19

6

How to remove a word completely from a Word2Vec model in gensim?

Given a model, e.g. from gensim.models.word2vec import Word2Vec documents = ["Human machine interface for lab abc computer applications", "A survey of user opinion of computer system response ti...

python dictionary word2vec gensim del

Visser asked 23/2, 2018 at 5:26

5

Getting "__init__() got an unexpected keyword argument 'document'" this error in python I'm working with Word2Vec and gensim

I'm working on project using Word2vec and gensim, model = gensim.models.Word2Vec( documents = 'userDataFile.txt', size=150, window=10, min_count=2, workers=10) model = gensim.model.Word2Vec.lo...

python gensim word2vec

Deaconry asked 7/11, 2018 at 18:49

9

How to fetch vectors for a word list with Word2Vec?

I want to create a text file that is essentially a dictionary, with each word being paired with its vector representation through word2vec. I'm assuming the process would be to first train word2vec...

machine-learning nlp artificial-intelligence word2vec

Chrysolite asked 15/7, 2015 at 20:50

19

gensim error: ImportError: No module named 'gensim'

I trying to import gensim with import gensim but get the following error ImportError Traceback (most recent call last) <ipython-input-5-50007be813d4> in <module>() ----> 1 import g...

python gensim word2vec

Vasiliu asked 12/9, 2017 at 5:33

3

Solved

'utf-8' decode error when loading a word2vec module

I have to use a word2vec module containing tons of Chinese characters. The module was trained by my coworkers using Java and is saved as a bin file. I installed gensim and tries to load the modul...

python nlp gensim word2vec

Lamasery asked 23/12, 2015 at 2:24

5

Solved

gensim word2vec: Find number of words in vocabulary

After training a word2vec model using python gensim, how do you find the number of words in the model's vocabulary?

python neural-network nlp gensim word2vec

Silverman asked 24/2, 2016 at 7:39

4

Solved

TypeError: 'Word2Vec' object is not subscriptable

I am trying to build a Word2vec model but when I try to reshape the vector for tokens, I am getting this error. Any idea ? wordvec_arrays = np.zeros((len(tokenized_tweet), 100)) for i in range(len...

python-3.x jupyter-notebook gensim word2vec

Roxy asked 25/5, 2021 at 12:30

3

Solved

Interpreting negative Word2Vec similarity from gensim

E.g. we train a word2vec model using gensim: from gensim import corpora, models, similarities from gensim.models.word2vec import Word2Vec documents = ["Human machine interface for lab abc compute...

python nlp similarity gensim word2vec

Jos asked 22/2, 2017 at 3:0

1

Can anyone explain how to get BIDMach's Word2vec to work?

In a paper titled, "Machine Learning at the Limit," Canny, et. al. report substantial word2vec processing speed improvements. I'm working with the BIDMach library used in this paper, and cannot f...

machine-learning nlp word2vec

Chaffee asked 1/4, 2017 at 15:25

1

NLP: Pre-processing in doc2vec / word2vec

A few papers on the topics of word and document embeddings (word2vec, doc2vec) mention that they used the Stanford CoreNLP framework to tokenize/lemmatize/POS-tag the input words/sentences: The ...

nlp stanford-nlp word2vec gensim doc2vec

Evangelical asked 29/5, 2018 at 12:3

8

Solved

How to check if a key exists in a word2vec trained model or not

I have trained a word2vec model using a corpus of documents with Gensim. Once the model is training, I am writing the following piece of code to get the raw feature vector of a word say "view". my...

python gensim word2vec

Mistreat asked 18/5, 2015 at 11:24

3

Solved

stopword removing when using the word2vec

I have been trying word2vec for a while now using the gensim's word2vec library. My question is do I have to remove stopwords from my input text? Because, based on my initial experimental results, ...

nlp gensim word2vec

Shult asked 11/1, 2016 at 12:49

5

Word2vec with elasticsearch for texts similarity

I have a large collection of texts, where each text is rapidly growing. I need to implement a similarity search. The idea is to embed each word as word2vec, and represent each text as a normalized...

elasticsearch word2vec

Ciapha asked 23/2, 2017 at 6:45

0

knn search query using python and elasticsearch

I try to do this query with elasticsearch python client : curl -X GET "localhost:9200/articles/_knn_search" -H 'Content-Type: application/json' -d ' { "knn": { "field&quo...

python elasticsearch nlp word2vec elasticsearch-dsl

Wheeled asked 1/6, 2022 at 13:7

6

Solved

Ensure the gensim generate the same Word2Vec model for different runs on the same data

In LDA model generates different topics everytime i train on the same corpus , by setting the np.random.seed(0), the LDA model will always be initialized and trained in exactly the same way. Is i...

python random gensim word2vec word-embedding

Arnett asked 16/1, 2016 at 20:5

6

Solved

I am playing around with FastText, https://pypi.python.org/pypi/fasttext,which is quite similar to Word2Vec. Since it seems to be a pretty new library with not to many built in functions yet, I was...

python nlp word2vec fasttext

Aerophyte asked 13/2, 2017 at 14:33

2

Solved

How to fix "MetadataFetchFailedException: Missing an output location for shuffle"?

If I increase the model size of my word2vec model I start to get this kind of exception in my log: org.apache.spark.shuffle.MetadataFetchFailedException: Missing an output location for shuffle 6 ...

scala apache-spark apache-spark-mllib word2vec

Barron asked 23/4, 2016 at 19:38

3

Solved

CBOW v.s. skip-gram: why invert context and target words?

In this page, it is said that: [...] skip-gram inverts contexts and targets, and tries to predict each context word from its target word [...] However, looking at the training dataset it prod...

nlp tensorflow deep-learning word2vec word-embedding

Halfhour asked 10/7, 2016 at 1:21

3

Solved

Is there any way to get the vocabulary size from doc2vec model?

I am using gensim doc2vec. I want know if there is any efficient way to know the vocabulary size from doc2vec. One crude way is to count the total number of words, but if the data is huge(1GB or mo...

gensim word2vec doc2vec

Hiccup asked 12/1, 2017 at 8:7

2

How to find synonyms based on word2vec

I 'm working on word2vec model using gensim in Python, but I found that the result are the words having the same theme, synonyms are only part of the result. Can I find synonyms of a word based on...

word2vec

Stealthy asked 6/6, 2017 at 9:39

word2vec Questions

Recommended topics

Hot tags