nlp Questions

4

I want to fine tune LabSE for Question answering using squad dataset. and i got this error: ValueError: The model did not return a loss from the inputs, only the following keys: last_hidden_state,p...
Barbital asked 9/8, 2022 at 10:43

1

When using the parser or for the matter any of the Annotation in Core NLP, is there a way to access the probability or the margin of error? To put my question into context, I am trying to understa...
Srini asked 22/3, 2013 at 16:9

8

I wrote a text classification program. When I run the program it crashes with an error as seen in this screenshot: ValueError: With n_samples=0, test_size=0.2 and train_size=None, the resulting t...
Passably asked 3/2, 2020 at 16:25

2

Solved

First of all I am using Google colab for the work and I have downloaded nltk stopwords for English with following: nltk.download('stopwords') The download was successful [nltk_data] Downloading...
Lynch asked 14/12, 2018 at 2:56

6

Solved

I have been playing around with oobabooga text-generation-webui on my Ubuntu 20.04 with my NVIDIA GTX 1060 6GB for some weeks without problems. I have been using llama2-chat models sharing memory b...
Negris asked 23/8, 2023 at 16:35

1

First I want to say that I don't have much experience with pytorch, ML, NLP and other related topics, so I may confuse some concepts. Sorry. I downloaded few models from Hugging Face, organized the...
Haggai asked 23/4, 2023 at 10:2

5

I have been working with langchain's chroma vectordb. It has two methods for running similarity search with scores. vectordb.similarity_search_with_score() vectordb.similarity_search_with_relevanc...
Cargill asked 13/7, 2023 at 11:19

3

Is it possible to exclude the contents of footers and headers of a page from a pdf file during extracting the text from it. As these contents are least important and almost redundant. Note: For ex...
Gismo asked 27/8, 2018 at 12:53

6

Solved

I am trying to install transformers using pip pip install transformers after import transformers this error show Neither PyTorch nor TensorFlow >= 2.0 have been found.Models won't be available ...
Moonset asked 13/10, 2020 at 14:49

3

In Spacy 2.x, I use the matcher to find specific tokens in my text corpus. Each rule has an ID ('class-1_0' for example). During parse, I use the callback on_match to handle each match. Is there a ...
Davies asked 26/11, 2017 at 7:21

3

Solved

This is the screen of the original paper: the screen of the paper. I understand the meaning of the paper is that when the value of dot-product is large, the gradient of softmax will get very small....
Ningpo asked 27/2, 2019 at 12:42

2

I'm confused about the technical difference between the two huggingface pipelines TextGeneration and Text2TextGeneration. In the TextGeneration it is stated that: Language generation pipeline usin...
Amortization asked 24/7, 2023 at 22:7

9

Is there a ready-to-use English grammar that I can just load it and use in NLTK? I've searched around examples of parsing with NLTK, but it seems like that I have to manually specify grammar ...
Cluff asked 24/5, 2011 at 19:17

4

Solved

I've been looking to use Hugging Face's Pipelines for NER (named entity recognition). However, it is returning the entity labels in inside-outside-beginning (IOB) format but without the IOB labels....

4

Solved

I'm using the transformers library in Google colab, and When i am using TrainingArguments from transformers library i'm getting Import error with this code: from transformers import TrainingArgumen...

5

Solved

Given a word, which may or may not be a singular-form noun, how would you generate its plural form? Based on this NLTK tutorial and this informal list on pluralization rules, I wrote this simple f...
Kweichow asked 19/9, 2013 at 18:44

8

I already have spaCy downloaded, but everytime I try the nlp = spacy.load("en_core_web_lg"), command, I get this error: OSError: [E050] Can't find model 'en_core_web_lg'. It doesn't seem to be a ...
Elfreda asked 6/6, 2019 at 3:2

3

I have : I have a PDF which are in two-column format.Is there a way to read each PDF according to the two-column format without cropping each PDF individually?
Isolationism asked 11/3, 2019 at 10:43

3

Solved

I am trying build a small text mining tool for my android app. I am checking for a machine learning library that will allow me to cluster, classify etc. Are there any machine learning librar...

4

Given an input word, I want to determine whether it is a noun or not (in case of ambiguity, for instance cook can be a noun or a verb, the word must be identified as a noun). Actually I use the PO...
Runkle asked 19/1, 2015 at 21:36

11

I got the following error when I ran my PyTorch deep learning model in Google Colab /usr/local/lib/python3.6/dist-packages/torch/nn/functional.py in linear(input, weight, bias) 1370 ret = torch.ad...
Seagoing asked 28/4, 2020 at 5:39

4

I am installing layout-parser and following this link. Did not face any issues with the following packages.   pip install layoutparser pip install "layoutparser[effdet]" pip install lay...
Asymmetry asked 6/2, 2023 at 6:19

3

Solved

I have a corpus of sentences that were preprocessed by Stanford's CoreNLP systems. One of the things it provides is the sentence's Parse Tree (Constituency-based). While I can understand a parse tr...
Stephanstephana asked 23/2, 2015 at 13:0

5

Solved

I am new to Spacy and I would like to extract "all" the noun phrases from a sentence. I'm wondering how I can do it. I have the following code: import spacy nlp = spacy.load("en") file = open("E...
Ivy asked 22/2, 2018 at 10:41

3

Solved

I'm using symanto/sn-xlm-roberta-base-snli-mnli-anli-xnli pretrained model from huggingface. My task requires to use it on pretty large texts, so it's essential to know maximum input length. The fo...

© 2022 - 2024 — McMap. All rights reserved.