Details Regarding Post-Training #29

Doctor-James · 2024-10-21T12:03:22Z

How is the post-training for the two tasks of multimodal understanding and image generation conducted? Is it done jointly like in Show-O, or are they trained separately? Also, what are the approximate total number of training samples and the ratio between the two tasks?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Details Regarding Post-Training #29

Details Regarding Post-Training #29

Doctor-James commented Oct 21, 2024

Details Regarding Post-Training #29

Details Regarding Post-Training #29

Comments

Doctor-James commented Oct 21, 2024