Submitted:
11 February 2025
Posted:
13 February 2025
You are already at the latest version
Abstract
Keywords:
1. Introduction
| Dataset Name | Type | Theme | Num objs | Real objects? |
|---|---|---|---|---|
| PSB [5] | Meshes | General | 1814 | |
| 3DNet [6] | Meshes | General | 3433 | |
| KIT database [7] | Textured meshes, stereo RGB images | Household | 145 | |
| LINEMOD [2] | Textured meshes, RGB images with poses | General | 15 | |
| BigBIRD [3] | Textured meshes, RGB-D images | Household | 125 | |
| ModelNet [8] | Meshes | General | 151k | |
| Rutgers APC [9] | Textured meshes, RGB-D images with poses | Household | 25 | |
| ShapeNetCore [10] | Meshes with WordNet annotations | General | 51k | |
| YCB [1] | Textured meshes, RGB-D images, shopping list | Daily life | 77 | ✓ |
| ACRV [4] | Textured meshes, shopping list | Household | 42 | ✓ |
| MVTec ITODD [11] | Meshes, RGB-D images with poses | Industrial | 28 | |
| T-LESS [12] | Textured meshes, RGB-D images with poses | Industrial | 30 | |
| RBO [13] | Articulated meshes, RGB-D images | Articulation | 14 | |
| TUD-L & TYO-L [14] | Textured meshes, RGB-D images with poses | Varied lighting | 24 | |
| ContactDB [15] | Meshes with contact maps, RGB-D & thermal images | Contact | 3750 | 3D printable |
| HomebrewedDB [16] | Textured meshes, RGB-D images with poses | Household, industry, toy | 33 | |
| EGAD [17] | Meshes, 3D printing instructions | Generated | 2282 | 3D printable |
| Household Cloth Object Set [18] | Meshes, microscopic images, object details | Cloth | 27 | ✓ |
| ABO [19] | Textured meshes with metadata including WordNet annotations, RGB images, physically-based renders | Amazon.com household | 7953 | |
| AKB-48 [20] | Articulated meshes | Articulation | 2037 | |
| GSO [21] | High-quality textured meshes with metadata | Household | 1030 | |
| HOPE [22] | Textured meshes, shopping list | Toy Grocery | 28 | ✓ |
| MP6D [23] | Meshes, RGB-D images | Industrial | 20 | |
| ObjectFolder [24,25] | Neural representations that can be used to generate visual appearance, impact sounds and tactile data | Multisensory | 1000 | |
| PCPD [26] | RGB-D images | Power grid | 10 | |
| TransCG [27] | Meshes, RGB-D images | Transparent | 51 | |
| Ours | Textured meshes, shopping list | Supermarket | 50 | ✓ |
2. Related Works
3. Supermarket Object Set
3.1. Object Choices
3.2. Cost
3.3. Commonality
3.4. Shape, Size and Weight
3.5. Variety
3.6. Data Collection Methodology
3.7. Metadata
4. Conclusions
Funding
Data Availability Statement
References
- Calli, B.; Singh, A.; Walsman, A.; Srinivasa, S.; Abbeel, P.; Dollar, A.M. The ycb object and model set: Towards common benchmarks for manipulation research. In Proceedings of the International Conference on Advanced Robotics (ICAR); 2015. [Google Scholar]
- Hinterstoißer, S.; Lepetit, V.; Ilic, S.; Holzer, S.; Bradski, G.R.; Konolige, K.; Navab, N. Model Based Training, Detection and Pose Estimation of Texture-Less 3D Objects in Heavily Cluttered Scenes. In Proceedings of the Asian Conference on Computer Vision; 2012. [Google Scholar]
- Singh, A.; Sha, J.; Narayan, K.S.; Achim, T.; Abbeel, P. BigBIRD: A large-scale 3D database of object instances. In Proceedings of the 2014 IEEE International Conference on Robotics and Automation (ICRA); 2014; pp. 509–516. [Google Scholar] [CrossRef]
- Leitner, J.; Tow, A.W.; Sünderhauf, N.; Dean, J.E.; Durham, J.W.; Cooper, M.; Eich, M.; Lehnert, C.; Mangels, R.; McCool, C.; et al. The ACRV picking benchmark: A robotic shelf picking benchmark to foster reproducible research. In Proceedings of the 2017 IEEE International Conference on Robotics and Automation (ICRA); 2017; pp. 4705–4712. [Google Scholar] [CrossRef]
- Shilane, P.; Min, P.; Kazhdan, M.; Funkhouser, T. The princeton shape benchmark. In Proceedings of the IEEEProceedings Shape Modeling Applications; 2004. [Google Scholar]
- Wohlkinger, W.; Aldoma, A.; Rusu, R.B.; Vincze, M. 3DNet: Large-scale object class recognition from CAD models. In Proceedings of the 2012 IEEE International Conference on Robotics and Automation; 2012; pp. 5384–5391. [Google Scholar] [CrossRef]
- Kasper, A.; Xue, Z.; Dillmann, R. The KIT object models database: An object model database for object recognition, localization and manipulation in service robotics. The International Journal of Robotics Research 2012, 31, 927–934. [Google Scholar] [CrossRef]
- Wu, Z.; Song, S.; Khosla, A.; Yu, F.; Zhang, L.; Tang, X.; Xiao, J. 3d shapenets: A deep representation for volumetric shapes. In Proceedings of the IEEE conference on computer vision and pattern recognition; 2015. [Google Scholar]
- Rennie, C.; Shome, R.; Bekris, K.E.; de Souza, A.F. A Dataset for Improved RGBD-Based Object Detection and Pose Estimation for Warehouse Pick-and-Place. IEEE Robotics and Automation Letters 2015, 1, 1179–1185. [Google Scholar] [CrossRef]
- Chang, A.X.; Funkhouser, T.; Guibas, L.; Hanrahan, P.; Huang, Q.; Li, Z.; Savarese, S.; Savva, M.; Song, S.; Su, H.; et al. ShapeNet: An Information-Rich 3D Model Repository. Technical Report arXiv:1512.03012 [cs.GR], Stanford University — Princeton University — Toyota Technological Institute at Chicago, 2015. arXiv:1512.03012.
- Drost, B.; Ulrich, M.; Bergmann, P.; Härtinger, P.; Steger, C. Introducing MVTec ITODD — A Dataset for 3D Object Recognition in Industry. In Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops (ICCVW); 2017; pp. 2200–2208. [Google Scholar] [CrossRef]
- Hodaň, T.; Haluza, P.; Obdržálek, Š.; Matas, J.; Lourakis, M.; Zabulis, X. T-LESS: An RGB-D Dataset for 6D Pose Estimation of Texture-less Objects. IEEE Winter Conference on Applications of Computer Vision (WACV) 2017. [Google Scholar]
- Martín-Martín, R.; Eppner, C.; Brock, O. The RBO dataset of articulated objects and interactions. The International Journal of Robotics Research 2019, 38, 1013–1019. [Google Scholar] [CrossRef]
- Hodaň, T.; Michel, F.; Brachmann, E.; Kehl, W.; Glent Buch, A.; Kraft, D.; Drost, B.; Vidal, J.; Ihrke, S.; Zabulis, X.; et al. BOP: Benchmark for 6D Object Pose Estimation. European Conference on Computer Vision (ECCV) 2018. [Google Scholar]
- Brahmbhatt, S.; Ham, C.; Kemp, C.C.; Hays, J. ContactDB: Analyzing and Predicting Grasp Contact via Thermal Imaging. 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019; 8701–8711. [Google Scholar]
- Kaskman, R.; Zakharov, S.; Shugurov, I.; Ilic, S. HomebrewedDB: RGB-D Dataset for 6D Pose Estimation of 3D Objects. In Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW), Los Alamitos, CA, USA, oct 2019; pp. 2767–2776. [Google Scholar] [CrossRef]
- Morrison, D.; Corke, P.; Leitner, J. Egad! an evolved grasping analysis dataset for diversity and reproducibility in robotic manipulation. IEEE Robotics and Automation Letters 2020, 5, 4368–4375. [Google Scholar] [CrossRef]
- Garcia-Camacho, I.; Borràs, J.; Calli, B.; Norton, A.; Alenyà, G. Household Cloth Object Set: Fostering Benchmarking in Deformable Object Manipulation. IEEE Robotics and Automation Letters 2022, 7, 5866–5873. [Google Scholar] [CrossRef]
- Collins, J.; Goel, S.; Deng, K.; Luthra, A.; Xu, L.; Gundogdu, E.; Zhang, X.; Yago Vicente, T.F.; Dideriksen, T.; Arora, H.; et al. ABO: Dataset and Benchmarks for Real-World 3D Object Understanding. CVPR 2022. [Google Scholar]
- Liu, L.; Xu, W.; Fu, H.; Qian, S.; Han, Y.J.; Lu, C. AKB-48: A Real-World Articulated Object Knowledge Base. 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022; 14789–14798. [Google Scholar]
- Downs, L.; Francis, A.; Koenig, N.; Kinman, B.; Hickman, R.; Reymann, K.; McHugh, T.B.; Vanhoucke, V. Google Scanned Objects: A High-Quality Dataset of 3D Scanned Household Items. In Proceedings of the 2022 International Conference on Robotics and Automation (ICRA). IEEE Press; 2022; pp. 2553–2560. [Google Scholar] [CrossRef]
- Tyree, S.; Tremblay, J.; To, T.; Cheng, J.; Mosier, T.; Smith, J.; Birchfield, S. 6-DoF Pose Estimation of Household Objects for Robotic Manipulation: An Accessible Dataset and Benchmark. In Proceedings of the International Conference on Intelligent Robots and Systems (IROS); 2022. [Google Scholar]
- Chen, L.; Yang, H.; Wu, C.; Wu, S. MP6D: An RGB-D Dataset for Metal Parts’ 6D Pose Estimation. IEEE Robotics and Automation Letters 2022, 7, 5912–5919. [Google Scholar] [CrossRef]
- Gao, R.; Si, Z.; Chang, Y.Y.; Clarke, S.; Bohg, J.; Fei-Fei, L.; Yuan, W.; Wu, J. ObjectFolder 2.0: A Multisensory Object Dataset for Sim2Real Transfer. In Proceedings of the CVPR. 2022. [Google Scholar]
- Gao, R.; Chang, Y.Y.; Mall, S.; Fei-Fei, L.; Wu, J. ObjectFolder: A Dataset of Objects with Implicit Visual, Auditory, and Tactile Representations. In Proceedings of the CoRL; 2021. [Google Scholar]
- Liu, X.; Li, S.; Liu, X. A Multiform Power Components Dataset for Robotic Maintenance in Power Grid. In Proceedings of the 2022 International Conference on Advanced Robotics and Mechatronics (ICARM); 2022; pp. 1116–1121. [Google Scholar] [CrossRef]
- Fang, H.; Fang, H.S.; Xu, S.; Lu, C. TransCG: A Large-Scale Real-World Dataset for Transparent Object Depth Completion and a Grasping Baseline. IEEE Robotics and Automation Letters 2022, 7, 7383–7390. [Google Scholar] [CrossRef]
- Schönberger, J.L.; Frahm, J.M. Structure-from-Motion Revisited. In Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR); 2016. [Google Scholar]
- Schönberger, J.L.; Zheng, E.; Pollefeys, M.; Frahm, J.M. Pixelwise View Selection for Unstructured Multi-View Stereo. In Proceedings of the European Conference on Computer Vision (ECCV); 2016. [Google Scholar]
- Lowe, D.G. Distinctive image features from scale-invariant keypoints. International journal of computer vision 2004, 60, 91–110. [Google Scholar] [CrossRef]
- Fischler, M.A.; Bolles, R.C. Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography. Commun. ACM 1981, 24, 381–395. [Google Scholar] [CrossRef]
- Kazhdan, M.; Hoppe, H. Screened Poisson Surface Reconstruction. ACM Trans. Graph. 2013, 32. [Google Scholar] [CrossRef]
- Zhang, Z. Iterative Point Matching for Registration of Free-Form Curves and Surfaces. Int. J. Comput. Vision 1994, 13, 119–152. [Google Scholar] [CrossRef]
| 1 | |
| 2 |




| Category | Object | Mass (g) | Dims (mm) | Category | Object | Mass (g) | Dims (mm) |
|---|---|---|---|---|---|---|---|
| Smallboxes | Alcohol wipes | 45 | 66×107×30 | Regularcylinders(food) | Baked beans | 485 | 109×75 |
| Ibuprofen | 18 | 125×49×19 | Condensed milk | 445 | 84×75 | ||
| Medistrips | 13 | 74×96×22 | Diced tomatoes | 471 | 109×75 | ||
| Tampons | 95 | 107×54×54 | Stacked chips | 183 | 233×72 | ||
| Smallboxes(food) | Electrolyte tablets | 94 | 151×38×38 | Tuna | 126 | 39×68 | |
| Gravy mix | 450 | 80×191×43 | Irregularcylinders | Gel nail polish remover | 127 | 40×124×40 | |
| Jelly | 97 | 94×66×29 | Nail polish remover | 134 | 58×146×38 | ||
| Soup box | 89 | 110×149×30 | Sunscreen roll on | 105 | 107×45 | ||
| Largeboxes | Aluminium foil | 162 | 312×51×52 | Sunscreen tube | 116 | 178×36 | |
| Disposable gloves | 373 | 220×97×49 | Toothbrush case | 24 | 30×208×20 | ||
| Resealable bags | 196 | 237×91×46 | Irregularcylinders(food) | BBQ sauce | 632 | 225×68 | |
| Soap box | 505 | 184×62×54 | Burger sauce | 477 | 78×156×58 | ||
| Toothpaste | 169 | 206×40×52 | Hazelnut spread | 441 | 82×100×65 | ||
| Largeboxes(food) | Cookies | 474 | 149×211×68 | Noodle cup | 94 | 106×95 | |
| Crisp bread | 158 | 162×135×66 | Salt | 829 | 205×83 | ||
| Milk | 1074 | 92×197×58 | Largeobjects | Bathroom cleaner | 654 | 112×258×53 | |
| Rice bars | 170 | 156×156×39 | Dishwasher powder | 1128 | 140×220×65 | ||
| Tea | 129 | 159×83×61 | Dishwashing liquid | 510 | 91×209×43 | ||
| Water crackers | 161 | 230×60×60 | Intimate wash | 289 | 67×179×40 | ||
| Regularcylinders | Bubbles | 121 | 102×42 | Toilet cleaner | 761 | 91×246×55 | |
| Chest rub ointment | 123 | 68×60 | Packets | Crackers | 128 | 265×77×48 | |
| Disinfectant bleach | 1458 | 293×85 | Digestive biscuits | 214 | 280×85×38 | ||
| Fish flakes | 70 | 115×60 | Milk biscuits | 221 | 175×64×45 | ||
| Glowsticks | 96 | 220×30 | Scotch finger biscuits | 257 | 170×75×44 | ||
| Rubbish bags | 262 | 131×56 | Wafers | 140 | 217×79×23 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).