Preprint Article Version 2 Preserved in Portico This version is not peer-reviewed

HTC-Grasp: A Hybrid Transformer-CNN Architecture for Robotic Grasp Detection

Version 1 : Received: 21 February 2023 / Approved: 22 February 2023 / Online: 22 February 2023 (09:11:59 CET)
Version 2 : Received: 11 March 2023 / Approved: 13 March 2023 / Online: 13 March 2023 (04:00:50 CET)

A peer-reviewed article of this Preprint also exists.

Zhang, Q.; Zhu, J.; Sun, X.; Liu, M. HTC-Grasp: A Hybrid Transformer-CNN Architecture for Robotic Grasp Detection. Electronics 2023, 12, 1505. Zhang, Q.; Zhu, J.; Sun, X.; Liu, M. HTC-Grasp: A Hybrid Transformer-CNN Architecture for Robotic Grasp Detection. Electronics 2023, 12, 1505.

Abstract

We introduce a novel hybrid Transformer-CNN architecture for robotic grasp detection, designed to enhance the accuracy of grasping unknown objects. Our proposed architecture has two key designs. Firstly, we develop a hierarchical transformer as the encoder, incorporating the external attention to effectively capture the correlation features across the data. Secondly, the decoder is constructed with cross-layer connections to efficiently fuse multi-scale features. Channel attention is introduced in the decoder to model the correlation between channels and to adaptively recalibrate the channel correlation feature response, thereby increasing the weight of the effective channels. Our method is evaluated on the Cornell and Jacquard public datasets, achieving an image-wise detection accuracy of 98.3% and 95.8% on each dataset, respectively. Additionally, we achieve object-wise detection accuracy of 96.9% and 92.4% on the same datasets. A physical experiment is also performed using the Elite 6Dof robot, with a grasping accuracy rate of 93.3%, demonstrating the proposed method's ability to grasp unknown objects in real-world scenarios. The results of this study show that our proposed method outperforms other state-of-the-art methods.

Keywords

Robotic Grasp; Transformer; attentional mechanism

Subject

Computer Science and Mathematics, Robotics

Comments (1)

Comment 1
Received: 13 March 2023
Commenter: Xueying Sun
Commenter's Conflict of Interests: Author
Comment: We enriched the experimental data and reorganized the expression of some statements.
+ Respond to this comment

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 1
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.