This is a training script I wrote to fine-tune LLMs on my workstation with four 4090s ... 'first_exhausted': stop when a dataset runs out of examples. 'all_exhausted': stop when all ...
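As a rough illustration of the two stopping options named above, here is a minimal pure-Python sketch of round-robin interleaving. The function name `interleave`, its arguments, and the use of plain lists are hypothetical and are not taken from the script. The sketch also assumes that 'all_exhausted' restarts shorter datasets from the beginning until the longest one has run out (oversampling); the script's actual behaviour may differ.

```python
def interleave(datasets, stopping_strategy="first_exhausted"):
    """Round-robin over several lists of examples (illustrative sketch only).

    'first_exhausted': stop once the shortest dataset has run out.
    'all_exhausted':   assumed here to mean "keep going until every dataset
                       has run out at least once", restarting shorter
                       datasets from their beginning in the meantime.
    """
    if stopping_strategy not in ("first_exhausted", "all_exhausted"):
        raise ValueError(f"unknown stopping_strategy: {stopping_strategy!r}")
    if not datasets or any(len(d) == 0 for d in datasets):
        return []

    lengths = [len(d) for d in datasets]
    # Number of full passes over the list of datasets.
    if stopping_strategy == "first_exhausted":
        rounds = min(lengths)
    else:
        rounds = max(lengths)

    # In each round, take one example from every dataset in turn.
    # The modulo wraps around datasets that are shorter than `rounds`.
    return [d[r % len(d)] for r in range(rounds) for d in datasets]


if __name__ == "__main__":
    small = [0, 1, 2]
    large = [10, 11, 12, 13]
    print(interleave([small, large], "first_exhausted"))
    print(interleave([small, large], "all_exhausted"))
```

With 'first_exhausted' every example is seen at most once, so larger datasets are truncated. With 'all_exhausted', under the assumption above, smaller datasets are repeated instead.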
If you would rather run the dependent examples yourself, you need to modify the checkpoint paths in the training scripts accordingly. Checkpoints and TensorBoard logs from newly executed examples are also ...