Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Mask Data Priming Network for Automatic Check-Out

Version 1 : Received: 12 June 2020 / Approved: 14 June 2020 / Online: 14 June 2020 (12:51:26 CEST)

How to cite: Xiao, Z.; Zhao, J.; Sun, G. Mask Data Priming Network for Automatic Check-Out. Preprints 2020, 2020060170. https://doi.org/10.20944/preprints202006.0170.v1 Xiao, Z.; Zhao, J.; Sun, G. Mask Data Priming Network for Automatic Check-Out. Preprints 2020, 2020060170. https://doi.org/10.20944/preprints202006.0170.v1

Abstract

Auto checkout has received more and more attention in recent years and this system automatically generates a shopping bill by identifying the picture of the products purchased by the customers. However, the system is challenged by the domain adaptation problem, where each image of the training set contains only one commodity, whereas the test set is a collection of multiple commodities. The existing solution to this problem is to resynthesize the training images to enhance the training set. Then the composite images are rendered using CycleGAN to make the image distribution of the training set and the test set more similar. However, we find that the detection boxes given by the ground truth of the common dataset contain a large part of the background area, the area will affect the training process as noise. To solve this problem, we propose a mask data priming method. Specifically, we redo the large scale Retail Product Checkout (RPC) dataset and add segmentation annotation information to each item in the training set image based on the original dataset using pixel-level annotation. Secondly, a new network structure is proposed in which we train the network using joint learning of detectors and counters, and fine-tune the detection network by filtering out suitable images from the test set. Experiments on the RPC dataset have shown that our method yields better results. we used an approach that reached 81.87% compared to 56.68% for the baseline approach which demonstrates that pixel-level information helps to improve the detection results of the network.

Keywords

object detection; semantic segmentation; computer vision; automatic check-out

Subject

Computer Science and Mathematics, Computer Science

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.