huggingface-transformers Questions

3

I'm trying to use the Hugging Face pretrained transformers model bert-base-uncased, but I want to increase dropout. There is no mention of this in the from_pretrained method, but Colab ran the object i...
Flowerless asked 21/11, 2020 at 19:14
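
A minimal sketch of one way to raise dropout at load time, relying on from_pretrained forwarding unknown keyword arguments to the model config (the 0.3 value is an arbitrary example):

    from transformers import BertModel

    # Extra kwargs that match config fields override them at load time,
    # so the dropout probabilities can be raised without editing files
    model = BertModel.from_pretrained(
        "bert-base-uncased",
        hidden_dropout_prob=0.3,           # config default is 0.1
        attention_probs_dropout_prob=0.3,  # config default is 0.1
    )
    print(model.config.hidden_dropout_prob)  # 0.3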

3

When importing pipeline from Hugging Face in a Kaggle notebook, from transformers import pipeline throws this error: /opt/conda/lib/python3.10/site-packages/tensorflow_io/python/ops/__init__.py:98: U...
Mckay asked 30/5, 2023 at 9:29
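
The traceback is truncated, but if the failure comes from transformers probing the TensorFlow stack at import time, one hedged workaround is to tell it to skip TF entirely; USE_TF is an environment variable transformers checks, though whether it cures this particular Kaggle error is an assumption:

    import os

    # Must be set before transformers is imported, so the library
    # never tries to import the broken tensorflow_io package
    os.environ["USE_TF"] = "0"

    from transformers import pipeline

    classifier = pipeline("sentiment-analysis")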

3

Solved

I am trying to implement XLNet on Google Colaboratory. But I get the following issue. ImportError: XLNetTokenizer requires the SentencePiece library but it was not found in your environment. ...
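
The error names the missing dependency directly; a sketch of the usual fix on Colab (restart the runtime after installing so the new package is picked up):

    # In a Colab cell: !pip install sentencepiece, then restart the runtime
    from transformers import XLNetTokenizer

    tokenizer = XLNetTokenizer.from_pretrained("xlnet-base-cased")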

2

Solved

I've followed this tutorial (colab notebook) in order to finetune my model. Trying to load my locally saved model model = AutoModelForCausalLM.from_pretrained("finetuned_model") yields K...
Tourist asked 12/6, 2023 at 17:34
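
The error message is cut off, but many fine-tuning notebooks save only a PEFT/LoRA adapter rather than a full model, in which case AutoModelForCausalLM.from_pretrained on the adapter folder fails. A hedged sketch of loading such an adapter ("base_model_name" is a placeholder for whatever checkpoint the notebook started from; that the tutorial used PEFT is an assumption):

    from transformers import AutoModelForCausalLM
    from peft import PeftModel

    # Load the original base checkpoint first, then attach the saved adapter
    base = AutoModelForCausalLM.from_pretrained("base_model_name")
    model = PeftModel.from_pretrained(base, "finetuned_model")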

1

Solved

I am training a sequence-to-sequence model using HuggingFace Transformers' Seq2SeqTrainer. When I execute the training process, it reports the following warning: /path/to/python3.9/site-packages/t...
Volkman asked 13/6, 2023 at 13:12
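
The warning text is truncated, so its cause can't be pinned down here; for reference, a minimal self-contained Seq2SeqTrainer setup looks like the sketch below (t5-small and the toy data are placeholders, not from the question):

    from datasets import Dataset
    from transformers import (
        AutoModelForSeq2SeqLM,
        AutoTokenizer,
        DataCollatorForSeq2Seq,
        Seq2SeqTrainer,
        Seq2SeqTrainingArguments,
    )

    tokenizer = AutoTokenizer.from_pretrained("t5-small")
    model = AutoModelForSeq2SeqLM.from_pretrained("t5-small")

    # Tiny toy corpus so the example runs end to end
    raw = Dataset.from_dict(
        {"src": ["translate English to French: hello"], "tgt": ["bonjour"]}
    )

    def preprocess(example):
        enc = tokenizer(example["src"], truncation=True)
        enc["labels"] = tokenizer(text_target=example["tgt"], truncation=True)["input_ids"]
        return enc

    train_dataset = raw.map(preprocess, remove_columns=["src", "tgt"])

    trainer = Seq2SeqTrainer(
        model=model,
        args=Seq2SeqTrainingArguments(output_dir="out", predict_with_generate=True),
        train_dataset=train_dataset,
        data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
    )
    trainer.train()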

2

Solved

In the Tokenizer documentation from Hugging Face, the call function accepts List[List[str]] and says: text (str, List[str], List[List[str]], optional) — The sequence or batch of sequences to be enco...
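
A sketch showing what the List[List[str]] form is for: each inner list is one pre-tokenized sequence, signalled with is_split_into_words=True:

    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

    # A batch of two pre-tokenized sequences
    batch = [["Hello", "world"], ["Hugging", "Face", "tokenizers"]]

    # is_split_into_words=True tells the tokenizer the inner lists are
    # already-split words of one sequence each
    enc = tokenizer(batch, is_split_into_words=True, padding=True)
    print(enc["input_ids"])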

1

Solved

I need to use pipeline in order to get the tokenization and inference from the distilbert-base-uncased-finetuned-sst-2-english model over my dataset. My data is a list of sentences, for recreation ...
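
A sketch of running that checkpoint over a list of sentences through the pipeline API; the pipeline accepts the list directly and returns one result per sentence:

    from transformers import pipeline

    classifier = pipeline(
        "sentiment-analysis",
        model="distilbert-base-uncased-finetuned-sst-2-english",
    )

    sentences = ["I love this movie.", "The service was terrible."]

    # One {"label": ..., "score": ...} dict comes back per input sentence
    for result in classifier(sentences):
        print(result)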

1

Solved

HuggingFace offers training_args like below. When I use the HF Trainer to train my model, I found that cuda:0 is used by default. I went through the HuggingFace Docs, but still don't know how to specify whi...
External asked 7/6, 2023 at 21:20
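
One common approach, sketched under the assumption that a single specific GPU is wanted: restrict visibility with CUDA_VISIBLE_DEVICES before anything initializes CUDA, and the Trainer will treat that card as cuda:0:

    import os

    # Hide all GPUs except physical device 1; must run before
    # torch/transformers touch CUDA
    os.environ["CUDA_VISIBLE_DEVICES"] = "1"

    from transformers import TrainingArguments

    training_args = TrainingArguments(output_dir="out")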

2

I have the following code import transformers from transformers import pipeline # Load the language model pipeline model = pipeline("text-generation", model="gpt2") # Input se...
Gurge asked 3/6, 2023 at 20:23
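
The snippet's code, completed into a runnable sketch (the prompt and generation settings are placeholders):

    from transformers import pipeline

    # Load the language model pipeline
    generator = pipeline("text-generation", model="gpt2")

    outputs = generator(
        "Once upon a time",
        max_new_tokens=30,       # cap the length of the continuation
        do_sample=True,          # sample instead of greedy decoding
        num_return_sequences=2,  # produce two alternative continuations
    )
    for out in outputs:
        print(out["generated_text"])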

1

I'm trying to install the guanaco language model https://arxiv.org/abs/2305.14314 using pip install guanaco for a text classification model, but I am getting an error. Failed to build guanaco ERROR: Could n...
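
The Guanaco language model from that paper isn't installed via pip; its weights are released on the Hugging Face Hub as QLoRA adapters over a LLaMA base. A hedged sketch (the huggyllama/llama-7b base and timdettmers/guanaco-7b adapter ids are assumptions about which release is wanted):

    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    # Load the LLaMA base model, then attach the Guanaco adapter
    base = AutoModelForCausalLM.from_pretrained("huggyllama/llama-7b")
    model = PeftModel.from_pretrained(base, "timdettmers/guanaco-7b")
    tokenizer = AutoTokenizer.from_pretrained("huggyllama/llama-7b")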

2

Solved

Here is the code block which caused the error training_args = TrainingArguments( output_dir="my_awesome_mind_model", evaluation_strategy="epoch", save_strategy="epoch"...
Danzig asked 11/5, 2023 at 8:28
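
The error itself is cut off; two frequent causes with this exact setup are an outdated accelerate install (recent Trainer versions require accelerate>=0.20.1) and mismatched strategies when load_best_model_at_end=True. A hedged sketch of a consistent configuration:

    # pip install "accelerate>=0.20.1"  # assumption: often the missing piece
    from transformers import TrainingArguments

    training_args = TrainingArguments(
        output_dir="my_awesome_mind_model",
        evaluation_strategy="epoch",  # must match save_strategy whenever
        save_strategy="epoch",        # load_best_model_at_end=True is set
        load_best_model_at_end=True,
    )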

3

Solved

In the HuggingFace tokenizer, applying the max_length argument specifies the length of the tokenized text. I believe it truncates the sequence to max_length-2 (if truncation=True) by cutting the ex...
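
A quick check of that behavior: with truncation=True and max_length=8, BERT's two special tokens occupy two of the eight slots, leaving six content tokens:

    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")

    enc = tokenizer(
        "a very long sentence that will certainly be truncated here",
        truncation=True,
        max_length=8,
    )
    print(tokenizer.convert_ids_to_tokens(enc["input_ids"]))
    # ['[CLS]', 'a', 'very', 'long', 'sentence', 'that', 'will', '[SEP]']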

1

Solved

I have a pre-trained model from facebook/bart-large-mnli I used the Trainer in order to train it on my own dataset. model = BartForSequenceClassification.from_pretrained("facebook/bart-large-m...
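
The question body is truncated, but a common stumbling block when fine-tuning this particular checkpoint is that its classification head carries the three MNLI labels; if one's own dataset has a different label count, the head must be re-initialized (num_labels=5 below is a placeholder):

    from transformers import BartForSequenceClassification

    model = BartForSequenceClassification.from_pretrained(
        "facebook/bart-large-mnli",
        num_labels=5,                  # placeholder for the dataset's label count
        ignore_mismatched_sizes=True,  # drop the 3-way MNLI head and re-init
    )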

2

Solved

How is it possible to initialize BERT with random weights? I want to compare the performance of multilingual vs monolingual vs randomly initialized BERT in a masked language modeling task. While in...
Hospitalize asked 20/6, 2021 at 17:57
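
A sketch of the usual pattern: build the model from a config object instead of from_pretrained, which yields the same architecture with randomly initialized weights:

    from transformers import BertConfig, BertForMaskedLM

    # BertConfig() carries bert-base defaults; no pretrained weights are loaded
    config = BertConfig()
    model = BertForMaskedLM(config)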

6

Solved

I simply tried the sample code from the Hugging Face website: https://huggingface.co/albert-base-v2 from transformers import AlbertTokenizer, AlbertModel tokenizer = AlbertTokenizer.from_pretrained('al...
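
The snippet stops before the error appears; for reference, the model-card example completed into a runnable form (a missing sentencepiece install is a guess at the usual failure with ALBERT's tokenizer):

    # pip install sentencepiece  # the ALBERT tokenizer depends on it
    from transformers import AlbertTokenizer, AlbertModel

    tokenizer = AlbertTokenizer.from_pretrained("albert-base-v2")
    model = AlbertModel.from_pretrained("albert-base-v2")

    inputs = tokenizer("Hello, my dog is cute", return_tensors="pt")
    outputs = model(**inputs)
    print(outputs.last_hidden_state.shape)  # (1, sequence_length, 768)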

1

Solved

I need a model that is able to classify text for an unknown number of classes (i.e. the number might grow over time). The entailment approach for zero-shot text classification seems to be the solut...
Coming asked 9/5, 2023 at 23:1
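
A sketch of why the entailment approach fits a growing label set: the candidate labels are supplied per call, so new classes need no retraining (the labels below are placeholders):

    from transformers import pipeline

    classifier = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")

    # The label list is an inference-time argument and can be extended freely
    labels = ["billing", "technical issue", "account"]
    print(classifier("My invoice shows the wrong amount", candidate_labels=labels))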

4

I'm trying to train a model using a Trainer, according to the documentation (https://huggingface.co/transformers/master/main_classes/trainer.html#transformers.Trainer) I can specify a tokenizer: t...
Lankester asked 24/9, 2020 at 13:13
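
A minimal sketch of passing the tokenizer to the Trainer; doing so lets it pad batches dynamically and saves the tokenizer alongside checkpoints (dataset arguments are omitted for brevity):

    from transformers import (
        AutoModelForSequenceClassification,
        AutoTokenizer,
        Trainer,
        TrainingArguments,
    )

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
    model = AutoModelForSequenceClassification.from_pretrained("bert-base-uncased")

    trainer = Trainer(
        model=model,
        args=TrainingArguments(output_dir="out"),
        tokenizer=tokenizer,  # used for padding and saved with the model
    )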

3

I'm trying to get the sentiments for comments with the help of a Hugging Face sentiment analysis pretrained model. It's returning an error like Token indices sequence length is longer than the specified...
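
That message means a comment exceeds the model's 512-token window. One hedged fix is to forward truncation to the tokenizer through the pipeline call:

    from transformers import pipeline

    classifier = pipeline(
        "sentiment-analysis",
        model="distilbert-base-uncased-finetuned-sst-2-english",
    )

    very_long_comment = "great product, would buy again " * 200

    # truncation=True is passed through to the tokenizer, clipping the
    # input to the model's maximum length instead of erroring
    print(classifier(very_long_comment, truncation=True))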

3

I am using Huggingface BERT for an NLP task. My texts contain names of companies which are split up into subwords. tokenizer = BertTokenizerFast.from_pretrained('bert-base-uncased') tokenizer.encod...
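
A sketch of inspecting how a company name is split and mapping the pieces back to words via the fast tokenizer's word_ids():

    from transformers import BertTokenizerFast

    tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")

    enc = tokenizer("Accenture hired me")
    print(tokenizer.convert_ids_to_tokens(enc["input_ids"]))
    # e.g. ['[CLS]', 'acc', '##ent', '##ure', 'hired', 'me', '[SEP]']

    # word_ids() maps each subword back to its source word index,
    # so company-name pieces can be regrouped downstream
    print(enc.word_ids())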

1

How to add new tokens to an existing Huggingface AutoTokenizer? Canonically, there's this tutorial from Huggingface https://huggingface.co/learn/nlp-course/chapter6/2 but it ends on the note of "...
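
A sketch of the add-to-existing-vocabulary route (as opposed to training a new tokenizer from scratch): add_tokens extends the vocab in place, and the model's embedding matrix must then be resized to match:

    from transformers import AutoModelForMaskedLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
    model = AutoModelForMaskedLM.from_pretrained("bert-base-uncased")

    # Returns how many of the proposed tokens were actually new
    num_added = tokenizer.add_tokens(["deeplearning", "huggingface"])

    # The new embedding rows are randomly initialized and learned
    # during fine-tuning
    model.resize_token_embeddings(len(tokenizer))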

1

Solved

I'm trying to finetune the Facebook BART model, I'm following this article in order to classify text using my own dataset. And I'm using the Trainer object in order to train: training_args = Traini...
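
A sketch of the dataset side that the Trainer consumes in such a setup, with placeholder texts and labels standing in for the questioner's data:

    from datasets import Dataset
    from transformers import BartTokenizer

    tokenizer = BartTokenizer.from_pretrained("facebook/bart-large")

    # Placeholder rows; the real dataset supplies its own texts and labels
    raw = Dataset.from_dict({"text": ["good film", "dull film"], "label": [1, 0]})

    def tokenize(batch):
        return tokenizer(batch["text"], truncation=True, padding="max_length", max_length=64)

    train_dataset = raw.map(tokenize, batched=True)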

3

I am not able to import LLaMATokenizer. Any solution for this problem? I am using the code of this repo: https://github.com/zphang/transformers/tree/llama_push and trying to load the models and toke...
Lachance asked 1/4, 2023 at 17:51
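
In released transformers (4.28 and later, which merged that fork's work upstream) the class uses Llama casing rather than LLaMA, so the import spelling is usually the culprit:

    # LLaMATokenizer only exists in the pre-merge fork; upstream uses:
    from transformers import LlamaTokenizer

    # huggyllama/llama-7b is a placeholder for whichever converted
    # checkpoint is actually available locally or on the Hub
    tokenizer = LlamaTokenizer.from_pretrained("huggyllama/llama-7b")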

1

Solved

Following this HuggingFace Anonymisation Tutorial, using PyTorch 2.0.0 and transformers 4.28.1. Running the code as it is, I get an error over the custom pipeline: def anonymize(text): ents = pipe(...
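
The error is truncated, but for reference, a self-contained version of that anonymization pattern, assuming a token-classification pipeline with aggregated entities (dslim/bert-base-NER is a placeholder model choice):

    from transformers import pipeline

    pipe = pipeline("ner", model="dslim/bert-base-NER", aggregation_strategy="simple")

    def anonymize(text):
        ents = pipe(text)
        # Replace spans right-to-left so earlier offsets stay valid
        for ent in sorted(ents, key=lambda e: e["start"], reverse=True):
            text = text[:ent["start"]] + f"[{ent['entity_group']}]" + text[ent["end"]:]
        return text

    print(anonymize("John Smith works at Accenture in Berlin."))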

1

I have a train dataset of size 4107. DatasetDict({ train: Dataset({ features: ['input_ids'], num_rows: 4107 }) valid: Dataset({ features: ['input_ids'], num_rows: 498 }) }) In my training ...
Superstition asked 13/4, 2023 at 7:6
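
The question body is cut off, but a frequent point of confusion with a 4107-row train split is how the Trainer derives its step count; a sketch of the arithmetic, with assumed hyperparameters:

    import math

    num_rows = 4107                   # from the DatasetDict above
    per_device_batch_size = 8         # assumption
    gradient_accumulation_steps = 4   # assumption
    num_epochs = 3                    # assumption

    effective_batch = per_device_batch_size * gradient_accumulation_steps
    steps_per_epoch = math.ceil(num_rows / effective_batch)  # 129
    print(steps_per_epoch * num_epochs)                      # 387 total steps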

3

Solved

I currently have my trainer set up as: training_args = TrainingArguments( output_dir=f"./results_{model_checkpoint}", evaluation_strategy="epoch", learning_rate=5e-5, per_de...
Heterodox asked 18/6, 2022 at 19:46
