File(s) not publicly available
Image Graph Production by Dense Captioning
conference contribution
posted on 2023-02-23, 04:46 authored by A Sahba, A Das, P Rad, M JamshidiImage captioning is the process of analyzing an image and generating a textual description according to objects and actions in the image. Thus, both image processing and natural language understanding are required for an image captioning system. Applications of image captioning can vary from assisting visually impaired people to detecting fake news in social media. One of significant utilizations of image captioning would be the detection of particular actions in images. In this paper, we use image captioning to produce a textual description from an image. Then we exploit a natural language processing algorithm to extract main components in the produced description. Finally we generate a general graph according to detected components in descriptions of the image. The generated graph shows objects and pairwise relationship between them along with their attributes. Thus, it can be used to determine if there is any particular relation in a sequence of input images.