SCoRD: Visual Relation Predictor
Our group has produced an auto-regressive model for the purpose of enhanced Scene Graph Generation (SGG). Here we leverage our WACV 2024 paper: SCoRD: Subject-Conditional Relation Detection With Text-Augmented Data. In this paper we proposed a new approach called Subject-Conditional Relation Detection (SCoRD) aimed at predicting all relations of a given subject within a scene, including the locations of these relations. Leveraging the Open Images dataset, we presented the OIv6-SCoRD benchmark to challenge existing models with a shift in the distribution of subject-relation-object triplets between training and testing phases. The novel contribution includes an auto-regressive model that, given a subject, predicts its relations to other objects and their spatial locations as a sequence of tokens. Try uploading an image below to see the SCoRD model in action.
Here we leverage our WACV 2024 paper: SCoRD: Subject-Conditional Relation Detection With Text-Augmented Data to generate token sequences relating all objects in an images to a given subject.