Journal of Computer and Communications

Volume 12, Issue 9 (September 2024)

ISSN Print: 2327-5219   ISSN Online: 2327-5227

Google-based Impact Factor: 1.98  Citations  

Research on PolSAR Image Classification Method Based on Vision Transformer Considering Local Information

  XML Download Download as PDF (Size: 6217KB)  PP. 22-38  
DOI: 10.4236/jcc.2024.129002    62 Downloads   323 Views  

ABSTRACT

In response to the problem of inadequate utilization of local information in PolSAR image classification using Vision Transformer in existing studies, this paper proposes a Vision Transformer method considering local information, LIViT. The method replaces image patch sequence with polarimetric feature sequence in the feature embedding, and uses convolution for mapping to preserve image spatial detail information. On the other hand, the addition of the wavelet transform branch enables the network to pay more attention to the shape and edge information of the feature target and improves the extraction of local edge information. The results in Wuhan, China and Flevoland, Netherlands show that considering local information when using Vision Transformer for PolSAR image classification effectively improves the image classification accuracy and shows better advantages in PolSAR image classification.

Share and Cite:

Zhang, M. , Wang, A. , Du, X. , Wang, X. and Wu, Y. (2024) Research on PolSAR Image Classification Method Based on Vision Transformer Considering Local Information. Journal of Computer and Communications, 12, 22-38. doi: 10.4236/jcc.2024.129002.

Cited by

No relevant information.

Copyright © 2025 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.