1. Introduction
With the development of infrared small and small targets, it plays a crucial role in detecting extremely weak targets in the fields of space collision warning, target exploration and accurate tracking by ground-based remote sensing [1-3]. However, due to the change of background intensity and the influence of many factors such as atmospheric refractive index, the target is looming, which seriously interferes with the ground-based detection mission for small and small targets [4-5]. However, the efficiency of the existing dim target algorithm needs to be improved, which is of great significance in the field of detecting extremely weak targets [6-9].
Due to the limited utilization information of single-frame processing method, some studies have extended single-frame processing method to the time domain [10-14]. The commonly used research methods in this scenario are mainly time-space combination method [
15], that is, the combination of single-frame processing method and multi-frame accumulation method, which mainly includes time-space contrast method, time-space tensor method and other time-space combination methods.
For the time-spatial domain comparison method, in the contrast field, Li Y et al. [
16] used local adaptive comparison operation and the method of generating trajectory prediction graph to separate the target. Deng L et al. [
17] proposed a spatio-temporal local contrast filter (STLCF) to enhance the contrast of the target. Du P et al. [
18] analyzed the contrast difference between three frames and the gray intensity difference in multiple directions to segment the target of interest. Du P et al. [
19] also proposed a uniformly weighted local contrast method, which uses contrast and surrounding weighted uniformity features to achieve target enhancement and background suppression. Pang D et al. [
20] calculated spatial variance significance mapping and temporal gray significance mapping in the spatiotemporal domain, and then fused the two features to segment dim targets. Uzair et al. [
21] proposed a spatial contrast operator for the total difference index of center surround in order to segment the target. All these methods are to extend the spatial local contrast algorithm to the time domain.
For the time-space tensor method, some researchers directly stack several consecutive frames of images to build space-time tensors. For example, Sun Y et al. [
22] built an infrared block tensor model and introduced weighted tensor kernel norm and total variational regularization to the low-rank background part, so that the background edge can be accurately decomposed to the low-rank background part. Zhang P et al. [
23] took local complexity as the weighted parameter of the sparse foreground part of the space-time tensor to ensure that the background corners could enter the low-rank background part as much as possible. Zhu H et al. [
24] used morphological filtering results to weight the sparse foreground part of the space-time tensor. Hu Y et al. [
25] proposed a multi-frame spatio-temporal patch tensor model, using spatio-temporal sampling and Laplacian method to pinpoint the rank of the tensor.
In addition, there are other time-space integration methods that are also used to enhance weak moving targets. Sun X et al. [
26] proposed a dynamic programming algorithm (DPA) to improve the visual quality of targets by using energy accumulation and adaptive fusion along the target trajectory. Zhang Y et al. [
27] proposed a precise trajectory extraction (PTE) method. In terms of the preprocessing effect based on the maximum projection method, the multi-frame enhancement of the target is realized through the precise offset position of the target. Fan X et al. [
28] proposed an improved anisotropic filtering method, and enhanced the target by using improved higher-order cumulants (IHQS) obtained from motion templates in multiple directions. Ren X et al. [
29] proposed a three-dimensional cooperative filtering and Spatial inversion (3DCFSI) method to suppress complex background, and used energy accumulation based on signal enhancement to achieve target enhancement. Ma T et al. [
30] proposed the energy compensation accumulation (ECA) method in order to solve the subpixel motion of the target. The target signal-to-noise ratio is improved and the noise background is suppressed. In order to improve the target of extremely low SNR, Ni Y et al. [
31] put forward the attitude-correlated frames adding (ACFA) method, which correlated continuous images to achieve the improvement of the target SNR.
Although the time-space combination method has achieved a series of achievements in the field of enhancing dim targets in infrared images, there are still some problems that need to be solved urgently:
1) Mainly affected by the poor tracking accuracy of the detector platform, platform jitter and atmospheric disturbance, the target in a certain area of the field of view occurs random tiny movement over time, and it is difficult to determine the specific position of the target, resulting in difficult to effectively superposition the target energy, which poses a huge challenge to the target detection.
2) Due to the great distance of observation, the target occupies only a few pixels on the imaging plane, which is usually represented as a point target feature. There are no texture, color or shape features, and the gray level of the target is not very different from the surrounding background, and it is easy to mix with the background in the detection process, resulting in a low signal to noise ratio of the target.
3) Due to the influence of non-uniform dynamic background, dynamic noise and other factors, the energy response of the target in the imaging plane also changes dynamically, which further leads to the reduction of the SNR of the target in some frames. Moreover, in the process of energy accumulation, the energy growth of the dynamic background is likely to exceed the energy growth of the target, making the target further submerged in the background.
Through the existing studies, it is found that anisotropy can adapt well to the suppression of dynamic background. In this paper, anisotropic filtering based on WY (WYAF) distribution function and multi-scale energy concentration accumulation (MECA) method are proposed to solve the above problems.
This paper is divided into five parts, the second part analyzes the infrared image model in detail, the third part puts forward our detailed method, the fourth part is the experimental proof, and the fifth part is the final summary.