Question about the SFT data format

According to https://huggingface.co/datasets/SimpleBerry/OpenLongCoT-SFT, the dataset has five columns: *problem*, *response*, *ground truth*, *query*, and *reasoning*. How do you organize them for fine-tuning?

For example, what does `<start_of_father_id>-1<end_of_father_id><start_of_local_id>0<end_of_local_id><start_of_thought><problem>` mean?
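
To make the question concrete, here is a minimal sketch of one possible reading of that prefix. It assumes (my guess, not confirmed by the dataset card) that `father_id` is the id of a parent node, with `-1` marking the root, and that `local_id` is the id of the current node. The helper name `parse_prefix` is hypothetical and only for illustration.

```python
import re

# Matches the tag prefix quoted in the question.
# Interpreting the two numbers as parent/node ids is an assumption.
TAG_RE = re.compile(
    r"<start_of_father_id>(-?\d+)<end_of_father_id>"
    r"<start_of_local_id>(\d+)<end_of_local_id>"
    r"<start_of_thought>"
)

def parse_prefix(text):
    """Return (father_id, local_id, rest) if text starts with the tag prefix, else None."""
    m = TAG_RE.match(text)
    if m is None:
        return None
    return int(m.group(1)), int(m.group(2)), text[m.end():]

sample = (
    "<start_of_father_id>-1<end_of_father_id>"
    "<start_of_local_id>0<end_of_local_id>"
    "<start_of_thought><problem>"
)
print(parse_prefix(sample))  # (-1, 0, '<problem>')
```

Is this reading correct, and is this the format the model sees during fine-tuning?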