gensim Questions

3

Solved

I am trying to use Gensim, but running import gensim raises this error: Traceback (most recent call last): File "<stdin>", line 1, in <module> File "/usr/local/lib/pyth...
Nielsen asked 5/4, 2024 at 10:1

4

Operating System: macOS Monterey 12.6 Chip: Apple M1 Python version: 3.11.1 I try: pip3 install gensim The install process starts well but fatally fails towards the end while running 'clang'. The ...
Metopic asked 2/1, 2023 at 6:55

5

I tried to follow this. But some how I wasted a lot of time ending up with nothing useful. I just want to train a GloVe model on my own corpus (~900Mb corpus.txt file). I downloaded the files provi...
Statecraft asked 24/2, 2018 at 11:10

1

I am currently using Gensim LDA for topic modeling. While Tuning hyper-parameters I found out that the model always gives negative log-perplexity Is it normal for model to behave like this?? (is it...
Minuend asked 22/7, 2020 at 2:30

1

I really need some help, as I have gone through all the posts and nothing has worked. I get this error when importing gensim and not numpy (numpy is before and works fine). All I want to do is impo...
Kutz asked 23/4, 2021 at 12:15

5

Solved

I want to use a pre-trained word2vec model, but I don't know how to load it in python. This file is a MODEL file (703 MB). It can be downloaded here: http://devmount.github.io/GermanWordEmbeddings...
Skyway asked 17/9, 2016 at 16:40

4

I have trained a Word2Vec model using Gensim 3.8.0. Later I tried to use the pretrained model using Gensim 4.0.o on GCP. I used the following code: model = KeyedVectors.load_word2vec_format(wv_path...
Villous asked 30/3, 2021 at 9:28

2

Solved

I am using Word2vec through gensim with Google's pretrained vectors trained on Google News. I have noticed that the word vectors I can access by doing direct index lookups on the Word2Vec object ar...
Perrone asked 16/3, 2016 at 11:31

5

I want to calculate tf-idf from the documents below. I'm using python and pandas. import pandas as pd df = pd.DataFrame({'docId': [1,2,3], 'sent': ['This is the first sentence','This is the seco...
Clabber asked 2/6, 2016 at 13:28

2

Solved

I am using gensim word2vec package in python. I would like to retrieve the W and W' weight matrices that have been learn during the skip-gram learning. It seems to me that model.syn0 gives me the f...
Foreshadow asked 15/12, 2016 at 11:19

6

Given a model, e.g. from gensim.models.word2vec import Word2Vec documents = ["Human machine interface for lab abc computer applications", "A survey of user opinion of computer system response ti...
Visser asked 23/2, 2018 at 5:26

5

I'm working on project using Word2vec and gensim, model = gensim.models.Word2Vec( documents = 'userDataFile.txt', size=150, window=10, min_count=2, workers=10) model = gensim.model.Word2Vec.lo...
Deaconry asked 7/11, 2018 at 18:49

6

Solved

I built LDA model using Gensim and I want to get the topic words only How can I get the words of the topics only no probabilities and no IDs.words only I tried print_topics() and show_topics() fu...
Stimulative asked 3/10, 2017 at 1:58

19

I trying to import gensim with import gensim but get the following error ImportError Traceback (most recent call last) <ipython-input-5-50007be813d4> in <module>() ----> 1 import g...
Vasiliu asked 12/9, 2017 at 5:33

3

Solved

I have to use a word2vec module containing tons of Chinese characters. The module was trained by my coworkers using Java and is saved as a bin file. I installed gensim and tries to load the modul...
Lamasery asked 23/12, 2015 at 2:24

8

I can't import pyLDAvis. It is installed but for some reason, I can not import it. I tried conda update anaconda pip install --upgrade pip pip install --upgrade jupyter notebook pip install pyLD...
Postdate asked 23/3, 2021 at 8:48

5

Solved

After training a word2vec model using python gensim, how do you find the number of words in the model's vocabulary?
Silverman asked 24/2, 2016 at 7:39

3

Solved

I have downloaded a .bin FastText model, and I use it with gensim as follows: model = FastText.load_fasttext_format("cc.fr.300.bin") I would like to continue the training of the model to adapt i...
Rafaelof asked 29/8, 2018 at 14:47

6

Solved

I am working on code using the gensim and having a tough time troubleshooting a ValueError within my code. I finally was able to zip GoogleNews-vectors-negative300.bin.gz file so I could implement ...
Euphony asked 26/9, 2017 at 18:51

4

Solved

I am trying to build a Word2vec model but when I try to reshape the vector for tokens, I am getting this error. Any idea ? wordvec_arrays = np.zeros((len(tokenized_tweet), 100)) for i in range(len...
Roxy asked 25/5, 2021 at 12:30

2

Here, best_model_lda is an sklearn based LDA model and we are trying to find a coherence score for this model.. coherence_model_lda = CoherenceModel(model = best_lda_model,texts=data_vectorized, d...
Gotthard asked 10/3, 2020 at 8:3

3

Solved

I have included the 2 import statements in my views.py from gensim.summarization.summarizer import summarizer from gensim.summarization import keywords However, even after I installed gensim using...
Liszt asked 17/6, 2021 at 11:49

2

Solved

I´m currently trying to evaluate my topic models with gensim topiccoherencemodel: from gensim.models.coherencemodel import CoherenceModel cm_u_mass = CoherenceModel(model = model1, corpus = corpus...
Nefen asked 30/5, 2018 at 14:34

3

Solved

E.g. we train a word2vec model using gensim: from gensim import corpora, models, similarities from gensim.models.word2vec import Word2Vec documents = ["Human machine interface for lab abc compute...
Jos asked 22/2, 2017 at 3:0

1

A few papers on the topics of word and document embeddings (word2vec, doc2vec) mention that they used the Stanford CoreNLP framework to tokenize/lemmatize/POS-tag the input words/sentences: The ...
Evangelical asked 29/5, 2018 at 12:3

© 2022 - 2025 — McMap. All rights reserved.