Search | Preprints.org

Preprint ARTICLE | doi:10.20944/preprints201905.0160.v1

Siamese Neural Network Based Apperance Model for Multi-Target

Mohib Ullah

Subject: Computer Science And Mathematics, Computer Science Keywords: Siamese neural network, appearance model, contrastive loss, cross entropy.

Online: 13 May 2019 (13:32:25 CEST)

Show abstract| Download PDF| Share

Working Paper ARTICLE

Unsupervised Visual Representation Learning for Indoor Scenes with a Siamese ConvNet and Graph Constraints

Mengyun Liu, Ruizhi Chen, Haojun Ai, Yujin Chen, Deren Li

Subject: Computer Science And Mathematics, Robotics Keywords: indoor scene recognition; unsupervised representation learning; Siamese network; graph constraints

Online: 19 March 2019 (13:11:09 CET)

Show abstract| Download PDF| Share

Indoor scene recognition has great significance for intelligent applications such as mobile robots, location-based services (LBS) and so on. Wherever we are or whatever we do, we are under a specific scene. The human brain can easily discern a scene with a quick glance. However, for a machine to achieve this purpose, on one hand, it often requires plenty of well-annotated data which is time-consuming and labor-intensive. On the other hand, it is hard to learn effective visual representations due to large intra-category variation and inter-categories similarity of indoor scenes. To solve these problems, in this paper, we adopted an unsupervised visual representation learning method which can learn from unlabeled data with a Siamese Convolutional Neural Network (Siamese ConvNet) and graph-based constraints. Specifically, we first mined relationships between unlabeled samples with a graph structure. And then, these relationships can be used as supervision for representation learning with a Siamese network. In this method, firstly, a k-NN graph would be constructed by taking each image as a node in the graph and its k nearest neighbors are linked to form the edges. Then, with this graph, cycle consistency and geodesic distance would be considered as criteria for positive and negative pairs mining respectively. In other words, by detecting cycles in the graph, images with large differences but in the same cycle can be considered as same category (positive pairs). By computing geodesic distance instead of Euclidean distance from one node to another, two nodes with large geodesic distance can be regarded as in different categories (negative pairs). After that, visual representations of indoor scenes can be learned by a Siamese network in an unsupervised manner with the mined pairs as inputs. In order to evaluate the proposed method, we tested it on two scene-centric datasets, MIT67 and Places365. Experiments with different number of categories have been conducted to excavate the potential of proposed method. The results demonstrated that semantic visual representations for indoor scenes can be learned in this unsupervised manner. In addition, with the learned visual representations, indoor scene recognition models trained with the learned representations and a few of labeled samples can achieve competitive performance compared to the state-of-the-art approaches.

Preprint ARTICLE | doi:10.20944/preprints202010.0526.v1

Animal Sound Classification Using Dissimilarity Spaces

Loris Nanni, Sheryl Brahnam, Alessandra Lumini, Gianluca Maguolo

Subject: Computer Science And Mathematics, Algebra And Number Theory Keywords: audio classification; dissimilarity space; siamese network; ensemble of classifiers; pattern recognition; animal audio

Online: 26 October 2020 (13:57:01 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202108.0094.v1

Closing the Performance Gap between Siamese Networks for Dissimilarity Image Classification and Convolutional Neural Networks

Loris Nanni, Giovanni Minchio, Sheryl Brahnam, Davide Sarraggiotto, Alessandra Lumini

Subject: Computer Science And Mathematics, Discrete Mathematics And Combinatorics Keywords: Siamese networks; Ensemble of classifiers; Loss function; Discrete cosine transform

Online: 3 August 2021 (15:49:22 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202304.1268.v1

Using EfficientNet-B7 (CNN), Variational Auto Encoder (VAE) and Siamese Twins’ Networks to Evaluate Human Exercises as Super Objects in a TSSCI Images

Yoram Segal, Ofer Hadar, Lenka Lhotska

Subject: Public Health And Healthcare, Physical Therapy, Sports Therapy And Rehabilitation Keywords: Keywords: OpenPose (OP); MediaPipe (MP); Rehabilitation; Tree Structure Skeleton Image (TSSI); Tree Structure Skeleton Color Image (TSSCI); Variational Auto Encoder (VAE); Siamese twins Neural Network; Simulator; Human body movements

Online: 30 April 2023 (07:10:50 CEST)

Show abstract| Download PDF| Share

Search Results

5 articles found