-
Notifications
You must be signed in to change notification settings - Fork 4.9k
Issues: hiyouga/LLaMA-Factory
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Ascend卡上无法训练deepseek模型 是否支持呢
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4361
opened Jun 18, 2024 by
sweetning0809
1 task done
PPO使用zero3加载全参训练的奖励模型,奖励模型加载失败。
bug
Something isn't working
pending
This problem is yet to be addressed
#1790
opened Dec 11, 2023 by
Luoxiaohei41
Have we added VeRA (Vector Based Random Matrix Adaption) , it recently got published at ICLR 2024
pending
This problem is yet to be addressed
#2238
opened Jan 18, 2024 by
Akshay1-6180
奖励模型断点续训报错
good first issue
Good for newcomers
pending
This problem is yet to be addressed
#2351
opened Jan 26, 2024 by
zhanglv0209
有计划支持LoRAMoE吗?
pending
This problem is yet to be addressed
#2749
opened Mar 8, 2024 by
luyuntao92
1 task done
昇腾多卡训练问题
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#3810
opened May 19, 2024 by
1737686924
1 task done
对于微调分类任务,如何在使用api inference时获取输出标签置信分数
enhancement
New feature or request
pending
This problem is yet to be addressed
#3932
opened May 28, 2024 by
xhdu
1 task done
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning
enhancement
New feature or request
pending
This problem is yet to be addressed
#3970
opened May 29, 2024 by
backroom-coder
MODPO: Multi-Objective Direct Preference Optimization
enhancement
New feature or request
pending
This problem is yet to be addressed
#3973
opened May 30, 2024 by
AlexYoung757
Feature suggestion: cutoff_len could optionally drop too long examples from dataset.
enhancement
New feature or request
pending
This problem is yet to be addressed
#3995
opened May 30, 2024 by
s4s0l
用openai库 请求时,流式请求时缺stream_options={"include_usage": True}的处理,用于计算流式tokens
pending
This problem is yet to be addressed
#3998
opened May 30, 2024 by
sasicDHH
1 task done
sft+freeze训练internlm2-base-7b报错,RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4101
opened Jun 6, 2024 by
1737686924
1 task done
Ideas behind sharing parameters of policy model and value model?
enhancement
New feature or request
pending
This problem is yet to be addressed
#1563
opened Nov 19, 2023 by
MagiaSN
请问是否会在框架内集成RLOO算法,最新的online RLHF?
enhancement
New feature or request
pending
This problem is yet to be addressed
#4287
opened Jun 14, 2024 by
ArcherShirou
1 task done
Function tool calling inference without llama-factory openai style api.
pending
This problem is yet to be addressed
#4364
opened Jun 18, 2024 by
svjack
1 task done
关于npu训练模型总结以及疑问
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4388
opened Jun 20, 2024 by
sweetning0809
1 task done
[PPU]大佬有对ppu环境进行过测试么
pending
This problem is yet to be addressed
#4606
opened Jun 28, 2024 by
willionZS
1 task done
fsdp + DPO + fullyfintune会报错
bug
Something isn't working
pending
This problem is yet to be addressed
#4608
opened Jun 28, 2024 by
qy1026
1 task done
RuntimeError: "addmm_impl_cpu_" not implemented for 'Half' 华为910 命令行推理报错
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4622
opened Jun 30, 2024 by
apachemycat
1 task done
qwen2 72b 910b lora后merge生成的权重 推理失败
npu
This problem is related to NPU devices
pending
This problem is yet to be addressed
#4659
opened Jul 3, 2024 by
wphtrying
1 task done
Enable Contamination-Free Packaging Method During Pretraining
pending
This problem is yet to be addressed
#4744
opened Jul 9, 2024 by
kostum123
1 task done
使用vllm时支持bitsandbytes量化
pending
This problem is yet to be addressed
#4751
opened Jul 10, 2024 by
JJJJerry
FSDP-QLora w/ DeepSeek-v2-lite dones't work on 4 GPUs
bug
Something isn't working
pending
This problem is yet to be addressed
#4785
opened Jul 12, 2024 by
Jiayi-Pan
1 task done
the cutoff of multimodal input sequence
enhancement
New feature or request
pending
This problem is yet to be addressed
#6891
opened Feb 11, 2025 by
JJJYmmm
1 task done
Previous Next
ProTip!
no:milestone will show everything without a milestone.