Issues: huggingface/transformers
Batch is empty when fine-tuning flan-t5 using LoRA
#31357
opened Jun 10, 2024 by
MorenoLaQuatra
2 of 4 tasks
Object Detection Pipeline only outputs first element when batching
#31356
opened Jun 10, 2024 by
simonschoenhofen
4 tasks
[CI] 2 Cohere tests are failing and skipped for now
Good First Issue
Tests
#31351
opened Jun 10, 2024 by
ydshieh
Low retrieval and generation performance if evaluate rag model using consolidate_rag_checkpoint initialized with BART-LARGE as generator
#31349
opened Jun 10, 2024 by
Rakin061
2 of 4 tasks
AttributeError: 'NllbTokenizerFast' object has no attribute 'lang_code_to_id'
#31348
opened Jun 10, 2024 by
rajanish4
1 of 4 tasks
convert_data2vec_audio_original_pytorch_checkpoint_to_pytorch.py works for data2vec 1.0 checkpoint but not data2vec 2.0
Audio
#31346
opened Jun 10, 2024 by
Bogsamurai
3 of 4 tasks
Implementation Issue of Phi3SuScaledRotaryEmbedding
Feature request
#31339
opened Jun 10, 2024 by
ryan-minato
Support saving models trained with DeepSpeed in Trainer callbacks
DeepSpeed
Feature request
trainer
#31338
opened Jun 9, 2024 by
dwyatte
model_kwargs is None when generation_config is passed as a dict instead of generation.GenerationConfig
#31328
opened Jun 8, 2024 by
AADeLucia
MixtralFlashAttention2 subscripts position_ids before checking if it is None
#31326
opened Jun 7, 2024 by
Luke20000429
2 of 4 tasks
Using a single 'RecurrentGemmaRglru' layer - "Trying to backward through the graph a second time" Error
#31324
opened Jun 7, 2024 by
talrub
2 of 4 tasks
Language modeling examples do not show how to do multi-gpu training / fine-tuning
#31323
opened Jun 7, 2024 by
csiefer2
2 of 4 tasks
[GGUF] Support new architectures/ quantisation schemes in Transformers
contributions-welcome
#31314
opened Jun 7, 2024 by
Vaibhavs10
AutoModelForCausalLM.from_pretrained silently fails
#31306
opened Jun 7, 2024 by
gpetters-amd
4 tasks
merge_and_unload for a quantized model ruins its quality
Quantization
#31293
opened Jun 6, 2024 by
Aktsvigun
2 of 4 tasks
Having a function to verify if checkpoint is valid
Feature request
#31283
opened Jun 6, 2024 by
Bfault
Constraints in constrained beam search can be satisfied by the inputs.
Generation
#31281
opened Jun 6, 2024 by
zawedcvg
2 of 4 tasks
Stuck on Initializing Transformers Model with FSDP (Fully Sharded Data Parallel) using meta device
#31278
opened Jun 6, 2024 by
jiangjiadi
2 of 4 tasks
While using the integration of bitsandbytes, Error shows: name 'torch' is not defined
#31273
opened Jun 6, 2024 by
46319943
2 of 4 tasks
'FastSpeech2ConformerConfig' object has no attribute 'model_config'
Audio
#31270
opened Jun 6, 2024 by
spencerchubb
1 of 4 tasks