Submitted:
30 June 2023
Posted:
04 July 2023
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Hybrid Classical-Quantum Networks
2.1. Vairiational Quantum Circuits
2.2. Tensor Quantum Circuits

3. Hybrid Classical-Quantum Transferring CNN
| Algorithm Hybrid classical-quantum transferring CNN |
|
4. Evaluation an Results
4.1. Data Profile
4.2. Evaluation Criteria
4.3. Experimental Setup
4.4. Performance Comparison



| Method | 20/80 | 50/50 |
| VGG-VD16[38] | 86.59 | 89.64 |
| GoogLeNet [38] | 83.44 | 86.39 |
| AlexNet+MSCP [39] | 88.99 | 92.36 |
| VGG-VD16+MSCP [39] | 91.52 | 94.42 |
| AlexNet+SPP [40] | 87.44 | 91.45 |
| RADCNet [41] | 88.12 | 92.53 |
| AlexNet+SAFF [42] | 87.51 | 91.83 |
| VGG-VD16+SAFF [42] | 90.25 | 93.83 |
| AlexNet+RIR [43] | 91.95 | 94.56 |
| VGG-VD16+RIR [43] | 93.34 | 95.57 |
| ResNet50+RIR [43] | 94.95 | 96.48 |
| DCNN [44] | 90.82 | 96.89 |
| CBAM [45] | 94.66 | 96.90 |
| MSCP [46] | 92.21 | 96.56 |
| Two-Stream Fusion [47] | 92.32 | 94.58 |
| RTN [48] | 92.44 | _ |
| GCFs+LOFs [49] | 92.48 | 96.85 |
| CapsNet [50] | 91.63 | 94.74 |
| ARCNet [51] | 88.75 | 93.1 |
| SCCov [52] | 93.12 | 96.1 |
| KFBNet [53] | 95.50 | 97.40 |
| GBNet [54] | 92.20 | 95.48 |
| MG-CAP [55] | 93.34 | 96.12 |
| EAM [56] | 94.26 | 97.06 |
| EAM [56] | 93.64 | 96.62 |
| F2BRBM [57] | 96.05 | 96.97 |
| MBLANet [58] | 95.60 | 97.14 |
| GRMANet [59] | 95.43 | 97.39 |
| IDCCP [60] | 94.80 | 96.95 |
| MSANet [61] | 93.53 | 96.01 |
| CTNet [62] | 96.25 | 97.70 |
| LSENet [63] | 94.41 | 96.36 |
| DFAGCN [64] | _ | 94.88 |
| MGML-FENet [65] | 96.45 | 98.60 |
| ESD-MBENet-v1[66] | 96.20 | 98.85* |
| ESD-MBENet-v2[66] | 96.39 | 98.40 |
| SeCo-ResNet-50[67] | 93.47 | 95.99 |
| RSP-ViTAEv2-S [68] | 96.91* | 98.22 |
| ISP(ViT) [69] | 96.24 | 97.95 |
| ISSP(ViT) [70] | 95.82 | 97.98 |
| RingMo(ViT-200W-200E) [71] | 96.54 | 98.38 |
| ISP(Swin) [72] | 96.24 | 98.03 |
| ISSP(Swin) [73] | 96.54 | 97.95 |
| RingMo(Swin-200W-200E) [71] | 96.90 | 98.34 |
| Ours | 97.33 | 98.82 |
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- S. Otgon; M. Datcun, B. Classification of Remot Sensing Images With Parameterized Quantum Gates. IEEE Geoscience and Remote Sensing Letters, 2022; Volume 19, pp. 154–196.
- Xiao Xiang Zhu; Devis Tuia; . Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources. IEEE Geoscience and Remote Sensing Magazine, 2022, Volume:5,Issue:4, pp.8-36. phrase indicating stage of publication (submitted; accepted; in press). [CrossRef]
- Author 1, A.B. (University, City, State, Country); Author 2, C. (Institute, City, State, Country). Personal communication, 2012.
- Weiquan Wang, Yushi Chen, . Transferring CNN With Adaptive Learning for Remote Sensing Scene Classification, IEEE Transactions on Geoscience and Remote Sensing, Volume:60, 2022. [CrossRef]
- Gong Chen, Jun Wei. Remote Sensing Image Scene Classification: Benchmark and State of the Art. Proceedings of the IEEE, Volume: 105, Issue: 10, October 2017.
- Y. LeCun, Y. Bengio, and G. Hinton. “Deep learning,” Nature, vol. 521, no. 7553, pp. 436-444, 2015.
- C. Broni-Bediako, Y. Murata, and L. G.B. Mormille, “Searching for CNN Architectures for Remote Sensing Scene Classification,” IEEE Transactions on Geoscience and Remote Sensing, vol. No, pp. ,2021. [CrossRef]
- X. X.Zhu et al., “Deep learning in remote sensing :A comprehensive review and list of resources,” IEEE Geosci. Remote Sens. Mag., vol. 5, no.4, pp. 8-36, Dec. 2017. [CrossRef]
- Krizhevsky, I. Sutskever, and G. E. Hintion, “ImageNet classification with deep convolutional neural neural networks,” in Advances in Neural Information Processing Systems. Red Hook, NY, USA: Curran Associates, 2012, pp.1097-1105.
- L. Ma, Y. Liu, X. Zhang, and Y. Ye, “Deep learning in remote sensing applications: A meta-analysis and review,” ISPRS J. Photogramm. Remote Sens., vol. 152, pp. 166-177, Jun.2019. [CrossRef]
- Y. Sun, B. Xue, M. Zhang, and G.G. Yen, “Completely automated CNN architecture design based on blocks,” IEEE Trans. Neural Netw. Learn. Syst., vol. 31, no.4.pp. 1242-1254, Apr. 2019. [CrossRef]
- Krizhevsky, I. Sutskever, and G.E. Hinton, “ImageNet classification with deep convolutional neural networks,” in Proc. Adv. Neural Inf. Process. Syst. (NIPS), vol.25. Stateline, NV, USA, Dec. 2012, pp. 1097-1105.
- M. D. Zeiler and R. Fergus, “Visualizing and understanding convolutional networks,” in Proc. Eur. Conf. Comput. Vis., Sep. 2014, pp.818-833.
- C.Szegedy et al., “Going deeper with convolutions,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2015, pp. 1-9.
- K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), Jun. 2016, pp.770-778. [CrossRef]
- J.A.Richards and X. Jia, “Sources and characteristics of remote sensing image data,” in Remote Sensing Digital Image Analysis Image Analysis:An Introduction. Berlin, Germany: Springer 1999, pp. 1-38.
- J. Coelho, “ Solve any Image Classification Problem Quickly and Easily”, https://github.com/pmarcelino/blog.
- G. Cheng,X. Xie,and J. Han, “Remote Sensing Image Scene Classification Meets Deep Learning: Challenges, Methods, Benchmarks, and Opportunities”, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 13, Jun. 2020, pp.3735 - 3756.
- P. Helber, B. Bischke and A. Dengel, “Eurosat: A novel dataset and deep learning benchmark for land use and land cover classification”, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2019. [CrossRef]
- P. Helber, B. Bischke and A. Dengel, “Introducing EuroSAT: A Novel Dataset and Deep Learning Benchmark for Land Use and Land Cover Classification”, IGARSS 2018-2018 IEEE International Geoscience and Remote Sensing Symposium,2018,pp.204-207. [CrossRef]
- P.W. Shor, SIAM J. Comput. 26,1484(1997).
- M.A. Nielsen and I. L. Chuang, Quantum Computation and Quantum Information.(Cambridge University Press, Cambridge,2010). [CrossRef]
- Y.Jeong,and J. Yu,“Bulk scanning method of a heavy metal concentration in tailings of a gold mine using SWIR hyperspectral imaging system,”Int. J. Appl. Earth Obs.,vol.102, pp. Oct. 2021.
- T. Yue, Y. Liu, Z. Du, “Quantum machine learning of eco-environmental surfaces.” Science Bulletin 67, pp.1031-1033. 2022. [CrossRef]
- R. Uddien Shaik, A. Unni and W. Zeng, “Quantum Based Pseudo-Labelling for Hyperspectral Imagery: A Simple and Efficient Semi-Supervised Learning Method for Machine Learning Classifiers”, Remote Sens. 2022, 14(22). [CrossRef]
- Mari, T. R. Bromley and J. Izaac, “Transfer learning in hybrid classical-quantum neural networks.”arXiv preprint arXiv:1912.08278,2019. [CrossRef]
- J. Qi, J. Tejedor, “Classical-to-Quantum Transfer Learning for Spoken Command Recognition Based on Quantum Neural Networks.”arXiv preprint arXiv:2110.08689,ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),2022. [CrossRef]
- V. Dunjko, & H. J. Briegel. Machine learning & artificial intelligence in the quantum domain: a review of recent progress. Rep. Prog. Phys. 81, 074001(2018). [CrossRef]
- A. Peruzzo, J. McClean, P. Shadbolt, M.-H. Yung, X.-Q. Zhou, P. J. Love, A. Aspuru-Guzik, and J. L. OBrien, Nature Communications 5, 4213 (2014).
- E. Farhi, J. Goldstone, and S. Gutmann, arXiv:1411.4028 [quant-ph] (2014), arXiv: 1411.4028.
- X. Cai, D. Z. Li, X. Liu, “Experimental Realization of a Quantum Support Vector Machine,” PhysRevLett.114.140504, 2015.
- V. Havlíček, A. D. Córcoles,“Supervised learning with quantum-enhanced feature spaces,” Nature, volume 567, pp.209–212. 2019.
- J. Deng, W. Dong, and R. Socher. ImageNet: A large-scale hierarchical image database. In 2009 IEEE Conference on Computer Vision and Pattern Recognition, pp.248-255. IEEE, 2009.
- K. He, X. Zhang, and S. Pradhan. Quanvolutional neural networks: powering image recognition with quantum circuits. Quantum Machine Intelligence, 2(1), Feb. 2020.
- C. Szegedy, S. Ioffe, V.Vanhoucke, and A. Alemi. Inception-v4, inception-resnet and the image recognition. arXiv preprint arXiv:1409.1556,2014.
- C. Szegedy, W.Liu, Y.Jia, P.Sermanet, S.Reed, D.Anguelov, D.Erhan, V.Vanhoucke, and A.Rabinovich. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 1-9,2015.
- C.Segedy, V.Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna,. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2818-2826, 2016.
- Xia, G.S.; Hu, J.; Hu, F.; Shi, B.; Bai, X.; Zhong, Y.; Zhang, L.; Lu, X. AID: A benchmark data set for performance evaluation of aerial scene classification. IEEE Trans. Geosci. Remote Sens. 2017, 55, 3965–3981. [Google Scholar] [CrossRef]
- He, N.; Fang, L.; Li, S.; Plaza, A.; Plaza, J. Remote sensing scene classification using multilayer stacked covariance pooling. IEEE Trans. Geosci. Remote Sens. 2018, 56, 6899–6910. [Google Scholar] [CrossRef]
- Han, X.; Zhong, Y.; Cao, L.; Zhang, L. Pre-trained alexnet architecture with pyramid pooling and supervision for high spatial resolution remote sensing image scene classification. Remote Sens. 2017, 9, 848. [Google Scholar] [CrossRef]
- Bi, Q.; Qin, K.; Zhang, H.; Li, Z.; Xu, K. RADC-Net: A residual attention based convolution network for aerial scene classification. Neurocomputing 2020, 377, 345–359. [Google Scholar] [CrossRef]
- Cao, R.; Fang, L.; Lu, T.; He, N. Self-Attention-Based Deep Feature Fusion for Remote Sensing Scene Classification. IEEE Geosci. Remote. Sens. Lett. 2021, 18, 43–47. [Google Scholar] [CrossRef]
- Qi Kunlun, Yang Chao, Hu Chuli, Shen Yonglin, Shen Shengyu, Wu Huayi. Rotation invariance regularization for remote sensing image scene classification with convolutional neural networks. Remote Sensing, 2021, 13(4), 569.
- G. Cheng, C. Yang, X. Yao, L. Guo, and J. Han, “When deep learning meets metric learning: Remote sensing image scene classification via learning discriminative cnns,” IEEE transactions on geoscience and remote sensing, vol. 56, no. 5, pp. 2811–2821, 2018. [CrossRef]
- S. Woo, J. Park, J.-Y. Lee, and I. S. Kweon, “Cbam: Convolutional block attention module,”in Proceedings of the European conference on computer vision (ECCV), pp. 3–19, 2018.
- N. He, L. Fang, S. Li, A. Plaza, and J. Plaza, “Remote sensing scene classification using multilayer stacked covariance pooling,” IEEE Transactions on Geoscience and Remote Sensing, vol. 56, no. 12, pp. 6899–6910, 2018. [CrossRef]
- Y. Yu and F. Liu,“A two-stream deep fusion framework for high-resolution aerial scene classification,” Computational intelligence and neuroscience, vol. 2018, 2018. [CrossRef]
- Z. Chen, S. Wang, X. Hou, L. Shao, and A. Dhabi, “Recurrent transformer network for remote sensing scene categorisation.,”in BMVC, vol. 266, 2018.
- D. Zeng, S. Chen, B. Chen, and S. Li, “Improving remote sensing scene classification by integrating globalcontext and local-object features,” Remote Sensing, vol. 10, no. 5, p. 734, 2018. [CrossRef]
- W. Zhang, P. Tang, and L. Zhao, “Remote sensing image scene classification using cnn-capsnet,” Remote Sensing, vol. 11, no. 5, p. 494, 2019.
- Q. Wang, S. Liu, J. Chanussot, and X. Li, “Scene classification with recurrent attention of vhr remote sensing images,” IEEE Transactions on Geoscience and Remote Sensing, vol. 57, no. 2, pp. 1155–1167, 2018. [CrossRef]
- N. He, L. Fang, S. Li, J. Plaza, and A. Plaza, “Skipconnected covariance network for remote sensing scene classification,” IEEE transactions on neural networks and learning systems, vol. 31, no. 5, pp. 1461–1474, 2019. [CrossRef]
- F. Li, R. Feng, W. Han, and L. Wang, “High-resolution remote sensing image scene classification via key filter bank based on convolutional neural network,”IEEE Transactions on Geoscience and Remote Sensing, vol. 58, no. 11, pp. 8077–8092, 2020. [CrossRef]
- H. Sun, S. Li, X. Zheng, and X. Lu, “Remote sensing scene classification by gated bidirectional network,”IEEE Transactions on Geoscience and Remote Sensing, vol. 58, no. 1, pp. 82–96, 2019. [CrossRef]
- S. Wang, Y. Guan, and L. Shao, “Multi-granularity canonical appearance pooling for remote sensing scene classification,” IEEE Transactions on Image Processing, vol. 29, pp. 5396–5407, 2020. [CrossRef]
- Z. Zhao, J. Li, Z. Luo, J. Li, and C. Chen, “Remote sensing image scene classification based on an enhanced attention module,” IEEE Geoscience and Remote Sensing Letters, vol. 18, no. 11, pp. 1926–1930, 2020.
- X. Zhang, W. An, J. Sun, H. Wu, W. Zhang, and Y. Du, “Best representation branch model for remote sensing image scene classification,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 14, pp. 9768–9780, 2021.
- H. Chen, Z. Qi, and Z. Shi, “Remote sensing image change detection with transformers,” IEEE Transactions on Geoscience and Remote Sensing, 2021. [CrossRef]
- B. Li, Y. Guo, J. Yang, L. Wang, Y. Wang, and W. An,“Gated recurrent multiattention network for vhr remote sensing image classification,” IEEE Transactions on Geoscience and Remote Sensing, 2021.
- S. Wang, Y. Ren, G. Parr, Y. Guan, and L. Shao,“Invariant deep compressible covariance pooling for aerial scene categorization,” IEEE Transactions on Geoscience and Remote Sensing, vol. 59, no. 8, pp. 6549–6561, 2020.
- G. Zhang, W. Xu, W. Zhao, C. Huang, E. N. Yk, Y. Chen, and J. Su, “A multiscale attention network for remote sensing scene images classification,” IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 14, pp. 9530–9545, 2021.
- P. Deng, K. Xu, and H. Huang, “When cnns meet vision transformer: A joint framework for remote sensing scene classification,” IEEE Geoscience and Remote Sensing Letters, vol. 19, pp. 1–5, 2021. [CrossRef]
- Q. Bi, K. Qin, H. Zhang, and G.-S. Xia, “Local semantic enhanced convnet for aerial scene recognition,” IEEE Transactions on Image Processing, vol. 30, pp. 6498–6511, 2021. [CrossRef]
- K. Xu, H. Huang, P. Deng, and Y. Li, “Deep feature aggregation framework driven by graph convolutional network for scene classification in remote sensing,”IEEE Transactions on Neural Networks and Learning Systems, 2021. [CrossRef]
- Q. Zhao, S. Lyu, Y. Li, Y. Ma, and L. Chen, “Mgml: Multigranularity multilevel feature ensemble network for remote sensing scene classification,” IEEE Transactions on Neural Networks and Learning Systems, 2021. [CrossRef]
- Q. Zhao, Y. Ma, S. Lyu, and L. Chen, “Embedded selfdistillation in compact multi-branch ensemble network for remote sensing scene classification,” arXiv preprint arXiv:2104.00222, 2021.
- Ma˜nas, A. Lacoste, X. Giro-i Nieto, D. Vazquez, and P. Rodriguez, “Seasonal contrast: Unsupervised pre-training from uncurated remote sensing data,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9414–9423, 2021.
- D. Wang, J. Zhang, B. Du, G.-S. Xia, and D. Tao, “An empirical study of remote sensing pretraining,” IEEE Transactions on Geoscience and Remote Sensing, 2022.
- Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly, et al., “An image is worth 16 × 16 words: Transformers for image recognition at scale,”arXiv preprint arXiv:2010.11929, 2020.
- G. Cheng, C. Yang, X. Yao, L. Guo, and J. Han, “When deep learning meets metric learning: Remote sensing image scene classification via learning discriminative cnns,” IEEE transactions on geoscience and remote sensing, vol. 56, no. 5, pp. 2811–2821, 2018.
- X. Sun, P.Wang, W. Lu, Z. Zhu, X. Lu, Q. He, J. Li, X. Rong, Z. Yang, H. Chang, Q. He, G. Yang, R. Wang, J. Lu and K. Fu. RingMo: A remote sensing foundation model with masked image modeling. IEEE Transactions on Geoscience and Remote Sensing[J]. 2022. [CrossRef]
- Z. Liu, Y. Lin, Y. Cao, H. Hu, Y. Wei, Z. Zhang, S. Lin, and B. Guo, “Swin transformer: Hierarchical vision transformer using shifted windows,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 10012–10022, 2021.
- Z. Xie, Z. Zhang, Y. Cao, Y. Lin, J. Bao, Z. Yao, Q. Dai,and H. Hu, “Simmim: A simple framework for masked image modeling,” arXiv preprint arXiv:2111.09886, 2021. [CrossRef]
- Pointer. Programming PyTorch for Deep Learning. Creating and Deploying Deep Learning. O'Reilly, 2019-11-4.
- V. Bergholm et al. PennyLane: Automatic differentiation of hybrid quantum-classical computations. 2018. arXiv:1811.04968.




| Method(%) | 10/90 | 20/80 | Number of Parameters |
| BoVW(SVM, SIFT, k=10) | 54.54 | 56.13 | _ |
| BoVW(SVM, SIFT, k=100) | 63.07 | 64.80 | _ |
| BoVW(SVM, SIFT, k=500) | 65.62 | 67.26 | _ |
| CNN(two layers) | 75.88 | 79.84 | 50.5K |
| ResNet-50 | 75.06 | 88.53 | 25.6M |
| GoogleNet | 77.37 | 90.97 | 6.8M |
| Ours | 95.81 | 96.62 | 21.28 M |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).