Now I have download the datasets and changed it into images by using ffmpeg, but I do not know how to set the ground truth. Thanks a lot.