According to https://huggingface.co/datasets/SimpleBerry/OpenLongCoT-SFT, there are five columns of the datasets, problem, response, ground truth, query and reasoning, how do you organize it for finetuning training?
For example, what dose <start_of_father_id>-1<end_of_father_id><start_of_local_id>0<end_of_local_id><start_of_thought> implies?
According to https://huggingface.co/datasets/SimpleBerry/OpenLongCoT-SFT, there are five columns of the datasets, problem, response, ground truth, query and reasoning, how do you organize it for finetuning training?
For example, what dose <start_of_father_id>-1<end_of_father_id><start_of_local_id>0<end_of_local_id><start_of_thought> implies?