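In the first stage I train on LAION-2B caption data with the ViT, the MLP projector, and the vision expert unfrozen, and the large language model frozen. During training the loss decreases slowly at first but then starts to rise, and once it has risen the model turns out to have collapsed. I have already ruled out problems with the training data, and lowering the learning rate did not help either. What could be causing this?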
What learning rate did you set?
I have tried learning rates of both 1e-5 and 1e-6. In both cases the loss first decreases and then rises.
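To narrow down where the loss turns upward, it can help to locate the first step at which a smoothed loss climbs clearly above its best value so far, and then inspect the checkpoint and batches around that step. The sketch below is only an illustration in plain Python: the function name `find_divergence_step` and its parameters `beta`, `tolerance`, and `warmup` are hypothetical and not part of this project's code, and the default values are arbitrary assumptions.

```python
import math


def find_divergence_step(losses, beta=0.9, tolerance=0.2, warmup=5):
    """Return the first step where the smoothed loss looks divergent, else None.

    Hypothetical helper, not from this repository. A step is flagged when the
    loss is NaN/inf, or when the exponential moving average (EMA) of the loss
    exceeds the lowest EMA seen on earlier steps by more than `tolerance`
    (a relative margin, e.g. 0.2 means 20%).
    """
    ema = None
    best = float("inf")
    for step, loss in enumerate(losses):
        # A non-finite loss is treated as divergence immediately.
        if not math.isfinite(loss):
            return step
        # Smooth the raw loss so a single noisy batch does not trigger a flag.
        ema = loss if ema is None else beta * ema + (1 - beta) * loss
        # Skip the comparison for the first `warmup` steps.
        if step >= warmup and ema > best * (1 + tolerance):
            return step
        best = min(best, ema)
    return None


if __name__ == "__main__":
    # Made-up loss values for illustration only: falling, then rising.
    logged = [5.0, 4.0, 3.0, 2.5, 2.0, 1.8, 1.6, 1.5, 1.5, 3.0, 5.0, 8.0, 8.0, 8.0]
    print("first suspicious step:", find_divergence_step(logged))
```

Because of the smoothing, the flagged step lags a little behind the true turning point, so it is worth looking at the steps just before it as well. If the training loop is written in PyTorch, logging the value returned by `torch.nn.utils.clip_grad_norm_` alongside the loss may also show whether gradient norms grow before the loss does.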