I am interested in investigating results with and without the image branch of the network. As far as I am concerned you do not provide checkpoint which utilizes such a branch, so I trained the network enabling this branch, on my own.
First of all, I want to know if I am correct that corner_acc.npy, topdown_acc.npy which you provide within your example are the weights which correspond to pretraining of DRN and HG networks, thus we utilize them as-is for both training and validation/testing purposes.
Next I want to point out a possible bug related to image features which is the following. I realized that within RecordWriterTango.py, RGB values are scaled twice.
One time during loading:
|
color = color.astype(np.float32) / 255 |
Second time during processing:
|
points[:, 3:] = points[:, 3:] / 255 - 0.5 |
This results to RGB values within a range
(-0.5, -0.49607843)
It can be reproduced with the following snippet (similar for training records)
record_filepath ='Tango_val.tfrecords'
dataset = getDatasetVal([record_filepath], '', True, 1)
iterator = dataset.make_one_shot_iterator()
input_dict, gt_dict = iterator.get_next()
pt = input_dict['points'][0]
points = tf.Session().run([pt])[0]
RGB = points[:, 3:6]
print(np.amin(RGB), np.amax(RGB))
Could such issue affect how the values are interpreted by the network? Is this related to already pretrained weights?
Thanks in advance
I am interested in investigating results with and without the image branch of the network. As far as I am concerned you do not provide checkpoint which utilizes such a branch, so I trained the network enabling this branch, on my own.
First of all, I want to know if I am correct that corner_acc.npy, topdown_acc.npy which you provide within your example are the weights which correspond to pretraining of DRN and HG networks, thus we utilize them as-is for both training and validation/testing purposes.
Next I want to point out a possible bug related to image features which is the following. I realized that within RecordWriterTango.py, RGB values are scaled twice.
One time during loading:
FloorNet/RecordWriterTango.py
Line 99 in e7bd879
Second time during processing:
FloorNet/RecordWriterTango.py
Line 334 in e7bd879
This results to RGB values within a range
(-0.5, -0.49607843)
It can be reproduced with the following snippet (similar for training records)
Could such issue affect how the values are interpreted by the network? Is this related to already pretrained weights?
Thanks in advance