The classic SGG metrics of
are used to benchmark PSG models.
Notice that PSG grounds objects with segmentation,
a successful recall requires both subject and object to have mask-based IOU larger than 0.5
compared to their ground-truth counterparts, with the correct classification on every
position in the S-V-O triplet. The panoptic segmentation evaluation protocol
can be also considered as an auxiliary metric.