ARTICLE | doi:10.20944/preprints202207.0070.v1
Subject: Mathematics & Computer Science, Artificial Intelligence & Robotics Keywords: scene recognition; object detection; scene classification; TF-IDF
Online: 5 July 2022 (08:38:17 CEST)
Indoor scene recognition and semantic information can be helpful for social robots. Recently, in the field of indoor scene recognition, researchers have incorporated object-level information and shown improved performances. This paper demonstrates that scene recognition can be performed solely using object-level information in line with these advances. A state-of-the-art object detection model was trained to detect objects typically found in indoor environments and then used to detect objects in scene data. These predicted objects were then used as features to predict room categories. This paper successfully combines approaches conventionally used in computer vision (YOLO) and Term Frequency-Inverse Document Frequency (TF-IDF). These approaches could be further helpful in the field of embodied research and dynamic scene classification, which we elaborate on.