bert-language-model Questions

4

I want to fine-tune LaBSE for question answering using the SQuAD dataset, and I got this error: ValueError: The model did not return a loss from the inputs, only the following keys: last_hidden_state,p...
Barbital asked 9/8, 2022 at 10:43
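
One likely cause, sketched below as an assumption: the checkpoint was loaded as a bare encoder, which returns only hidden states, so the Trainer has no loss to compute; loading it with a question-answering head fixes that (the checkpoint name is assumed).

    from transformers import AutoTokenizer, AutoModelForQuestionAnswering

    # A bare encoder returns only last_hidden_state etc., so Trainer finds
    # no loss; the QA head adds start/end span logits and a loss.
    model_name = "sentence-transformers/LaBSE"  # assumed checkpoint name
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForQuestionAnswering.from_pretrained(model_name)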

3

Solved

I am trying to fine-tune the BERT language model on my own data. I've gone through their docs, but their tasks don't seem to be quite what I need, since my end goal is embedding text. Here's my code:...
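
A minimal sketch of one common route when the end goal is embeddings rather than a standard task head: fine-tune through the sentence-transformers library on scored text pairs (the model name, pair texts, and label are illustrative assumptions, not the asker's data).

    from torch.utils.data import DataLoader
    from sentence_transformers import SentenceTransformer, InputExample, losses

    # Wraps bert-base-uncased with a mean-pooling layer to produce one
    # vector per text; the single example stands in for real training data.
    model = SentenceTransformer("bert-base-uncased")
    train_examples = [InputExample(texts=["first text", "a similar text"], label=0.9)]
    loader = DataLoader(train_examples, shuffle=True, batch_size=16)
    loss = losses.CosineSimilarityLoss(model)
    model.fit(train_objectives=[(loader, loss)], epochs=1, warmup_steps=10)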

3

I am using the SentenceTransformers library (here: https://pypi.org/project/sentence-transformers/#pretrained-models) for creating embeddings of sentences using the pre-trained model bert-base-nli-...
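
For reference, a minimal usage sketch of that library (the full model name is assumed from the truncated snippet):

    from sentence_transformers import SentenceTransformer

    # encode() returns one fixed-size vector per input sentence.
    model = SentenceTransformer("sentence-transformers/bert-base-nli-mean-tokens")
    embeddings = model.encode(["This is a sentence.", "This is another one."])
    print(embeddings.shape)  # (2, 768)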

5

I am trying to install bertopic and I got this error: pip install bertopic Collecting bertopic > Using cached bertopic-0.11.0-py2.py3-none-any.whl (76 kB) > Collecting hdbscan>=0.8.2...
Deficient asked 29/7, 2022 at 22:16

2

I have installed PyTorch 1.7.1, and it works very well. However, when I try to run this code: import transformers from transformers import BertTokenizer from transformers.models.bert.modeling_bert ...
Limitation asked 31/8, 2023 at 6:36

11

I got the following error when I ran my PyTorch deep learning model in Google Colab /usr/local/lib/python3.6/dist-packages/torch/nn/functional.py in linear(input, weight, bias) 1370 ret = torch.ad...
Seagoing asked 28/4, 2020 at 5:39

2

Solved

I'm fine-tuning a pre-trained BERT model and I have a weird problem: when I'm fine-tuning using the CPU, the code saves the model like this: with the "pytorch_model.bin". But when I use ...

3

I am trying to use a huggingface model (CamelBERT), but I am getting an error when loading the tokenizer: Code: from transformers import AutoTokenizer, AutoModelForMaskedLM tokenizer = AutoTokenize...

6

I am facing the below issue while loading the pretrained BERT model from Hugging Face due to an SSL certificate error. Error: SSLError: HTTPSConnectionPool(host='huggingface.co', port=443): Max retries ex...
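
One common workaround behind a corporate proxy, as a sketch (the certificate path is hypothetical): point the HTTP stack at the proxy's CA bundle instead of disabling verification.

    import os
    from transformers import AutoModel

    # requests, which huggingface_hub uses for downloads, honors this
    # variable when verifying TLS certificates.
    os.environ["REQUESTS_CA_BUNDLE"] = "/path/to/corporate-ca.pem"  # hypothetical path
    model = AutoModel.from_pretrained("bert-base-uncased")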

1

https://colab.research.google.com/drive/11u6leEKvqE0CCbvDHHKmCxmW5GxyjlBm?usp=sharing The setup.py file is in the transformers folder (root directory), but this error occurs when I run !git clone https://gi...

2

I understand that WordPiece is used to break text into tokens. And I understand that, somewhere in BERT, the model maps tokens into token embeddings that represent the meaning of the tokens. But wh...
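
The token-to-embedding map is a learned lookup table inside the model; a small sketch of where it lives in the Hugging Face implementation:

    import torch
    from transformers import AutoTokenizer, BertModel

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
    model = BertModel.from_pretrained("bert-base-uncased")

    # WordPiece ids index a (vocab_size, hidden_size) embedding matrix.
    ids = tokenizer("unaffable", add_special_tokens=False)["input_ids"]
    print(tokenizer.convert_ids_to_tokens(ids))  # e.g. ['una', '##ffa', '##ble']
    vectors = model.embeddings.word_embeddings(torch.tensor(ids))
    print(vectors.shape)  # (number_of_pieces, 768)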

10

We know that BERT has a max token length limit of 512, so if an article is much longer than 512 tokens, such as 10,000 tokens of text, how can BERT be used?
Invariant asked 31/10, 2019 at 3:34
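
One common approach, sketched below (a sliding window, not the only option): split the long text into overlapping 512-token chunks, run BERT on each chunk, and pool the per-chunk outputs downstream.

    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
    long_text = " ".join(["word"] * 10000)  # stand-in for a 10,000-token article

    # return_overflowing_tokens yields every window; stride sets the
    # overlap between consecutive windows.
    enc = tokenizer(long_text, max_length=512, stride=128, truncation=True,
                    padding="max_length", return_overflowing_tokens=True,
                    return_tensors="pt")
    print(enc["input_ids"].shape)  # (num_chunks, 512)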

1

I am using a pre-trained BERT sentence transformer model, as described here https://www.sbert.net/docs/training/overview.html , to get embeddings for sentences. I want to fine-tune these pre-traine...
Alehouse asked 13/10, 2021 at 21:38

2

Solved

I'd like to fix the random seed from the BERTopic library to get reproducible results. Looking at the code of BERTopic I see it uses numpy. Will using np.random.seed(123) be enough, or do I also need t...
Envelop asked 2/3, 2022 at 9:19
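
np.random.seed alone is not enough, because the UMAP step inside BERTopic keeps its own random state; a sketch of pinning it (the other UMAP values mirror BERTopic's defaults, as an assumption):

    from umap import UMAP
    from bertopic import BERTopic

    # Fixing random_state makes runs reproducible; it also disables UMAP's
    # parallelism, so fitting becomes slower.
    umap_model = UMAP(n_neighbors=15, n_components=5, min_dist=0.0,
                      metric="cosine", random_state=123)
    topic_model = BERTopic(umap_model=umap_model)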

2

Solved

I was recently reading the bert source code from the hugging face project. I noticed that the so-called "learnable position encoding" seems to refer to a specific nn.Parameter layer when ...
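
In the Hugging Face implementation it is an ordinary nn.Embedding whose weight (an nn.Parameter) is trained like any other; a quick way to inspect it:

    from transformers import BertModel

    model = BertModel.from_pretrained("bert-base-uncased")
    pos = model.embeddings.position_embeddings
    print(pos)                       # Embedding(512, 768)
    print(pos.weight.requires_grad)  # True: learned, not a fixed sinusoid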

6

I tried to load a pre-trained model by using the BertModel class in PyTorch. I have _six.py under torch, but it still shows module 'torch' has no attribute '_six'. import torch from pytorch_pretrained_b...
Dreg asked 21/5, 2019 at 15:41
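
pytorch_pretrained_bert is the long-deprecated predecessor of the transformers package, and its reliance on torch._six breaks against newer PyTorch; one plausible fix, sketched as an assumption since the snippet is truncated, is switching to the maintained library:

    # The maintained replacement for pytorch_pretrained_bert:
    from transformers import BertModel, BertTokenizer

    tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
    model = BertModel.from_pretrained("bert-base-uncased")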

5

I am working on a machine learning project on Google Colab, and it seems there has recently been an issue when trying to import packages from transformers. The error message says: ImportError: cannot import...

6

I'm trying to make a classifier for the sentiments of texts with a BERT model but am getting ValueError: too many dimensions 'str'. That is the DataFrame for the values of the train data, so they are train_labels 0 not...
Tailback asked 20/1, 2021 at 7:12
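
That error usually means torch.tensor() was handed string labels; mapping them to integers first avoids it (the label names below are guesses from the truncated snippet):

    import torch

    label2id = {"not offensive": 0, "offensive": 1}  # hypothetical mapping
    train_labels = ["not offensive", "offensive", "not offensive"]
    labels = torch.tensor([label2id[l] for l in train_labels])
    print(labels)  # tensor([0, 1, 0])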

3

I'm trying to use the Hugging Face transformers pretrained model bert-base-uncased, but I want to increase dropout. There isn't any mention of this in the from_pretrained method, but Colab ran the object i...
Flowerless asked 21/11, 2020 at 19:14
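
Config keyword arguments can in fact be passed straight to from_pretrained, which forwards anything it does not recognize to the model config; a sketch (the dropout values are illustrative):

    from transformers import BertModel

    # Both dropout probabilities default to 0.1 in bert-base-uncased.
    model = BertModel.from_pretrained(
        "bert-base-uncased",
        hidden_dropout_prob=0.3,
        attention_probs_dropout_prob=0.3,
    )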

2

Solved

I want to build a multi-class classification model for which I have conversational data as input for the BERT model (using bert-base-uncased). QUERY: I want to ask a question. ANSWER: Sure, ask aw...
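
A minimal multi-class setup sketch (the number of labels is illustrative): setting num_labels on the sequence-classification head yields a cross-entropy objective over the classes.

    from transformers import BertTokenizer, BertForSequenceClassification

    tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
    model = BertForSequenceClassification.from_pretrained(
        "bert-base-uncased", num_labels=4)  # 4 conversation classes, illustrative

    inputs = tokenizer("QUERY: I want to ask a question.", return_tensors="pt")
    logits = model(**inputs).logits  # shape (1, 4), one score per class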

3

Solved

In the HuggingFace tokenizer, applying the max_length argument specifies the length of the tokenized text. I believe it truncates the sequence to max_length-2 (if truncation=True) by cutting the ex...
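
That reading is easy to check: with truncation=True, max_length counts the special tokens too, so a single sequence keeps max_length - 2 content tokens between [CLS] and [SEP].

    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
    ids = tokenizer("a very long sentence " * 100,
                    max_length=16, truncation=True)["input_ids"]
    tokens = tokenizer.convert_ids_to_tokens(ids)
    print(len(tokens), tokens[0], tokens[-1])  # 16 [CLS] [SEP]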

1

I know that GPT uses Transformer decoder, BERT uses Transformer encoder, and T5 uses Transformer encoder-decoder. But can someone help me understand why GPT only uses the decoder, BERT only uses en...
Latinalatinate asked 7/5, 2021 at 1:21

6

Solved

For ELMo, FastText and Word2Vec, I'm averaging the word embeddings within a sentence and using HDBSCAN/KMeans clustering to group similar sentences. A good example of the implementation can be see...
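
A self-contained sketch of that pipeline with toy vectors (the lookup table below is a hypothetical stand-in for a real Word2Vec/FastText/ELMo model):

    import numpy as np
    from sklearn.cluster import KMeans

    # Toy word-vector lookup standing in for trained embeddings.
    rng = np.random.default_rng(0)
    word_vectors = {w: rng.normal(size=50)
                    for w in ["the", "cat", "dog", "sat", "ran", "mat"]}

    def sentence_vector(sentence, dim=50):
        # Average the vectors of the in-vocabulary words.
        vecs = [word_vectors[t] for t in sentence.split() if t in word_vectors]
        return np.mean(vecs, axis=0) if vecs else np.zeros(dim)

    sentences = ["the cat sat", "the dog ran", "the cat ran"]
    X = np.stack([sentence_vector(s) for s in sentences])
    labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X)
    print(labels)  # cluster id per sentence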

2

Solved

How is it possible to initialize BERT with random weights? I want to compare the performance of multilingual vs monolingual vs randomly initialized BERT in a masked language modeling task. While in...
Hospitalize asked 20/6, 2021 at 17:57
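
The usual trick is to build the model from a config instead of from_pretrained, keeping the architecture but loading no weights; a sketch:

    from transformers import BertConfig, BertModel

    config = BertConfig.from_pretrained("bert-base-uncased")  # architecture only
    random_bert = BertModel(config)  # randomly initialized weights
    pretrained_bert = BertModel.from_pretrained("bert-base-uncased")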

3

I am using Huggingface BERT for an NLP task. My texts contain names of companies which are split up into subwords. tokenizer = BertTokenizerFast.from_pretrained('bert-base-uncased') tokenizer.encod...
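
One common remedy, sketched with hypothetical company names: register the names as whole tokens and resize the embedding matrix, so the tokenizer stops splitting them into subwords.

    from transformers import BertTokenizerFast, BertModel

    tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
    model = BertModel.from_pretrained("bert-base-uncased")

    tokenizer.add_tokens(["acmecorp", "exampleco"])  # hypothetical names
    model.resize_token_embeddings(len(tokenizer))  # new rows start randomly initialized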
