I want to train joint detection and segmentation, but I don't know how to build the training dataset. Can you tell me the format of the data?