ARTICLE | doi:10.20944/preprints202206.0390.v1
Subject: Mathematics & Computer Science, Information Technology & Data Management Keywords: Object detection; Feature fusion network; Multiple feature selection; Angle prediction; Pixel Attention Mechanism
Online: 29 June 2022 (03:09:52 CEST)
The object detection task is usually affected by complex backgrounds. In this paper, a new image object detection method is proposed, which can perform multi-feature selection on multi-scale feature maps. By this method, a bidirectional multi-scale feature fusion network is designed to fuse semantic features and shallow features to improve the detection effect of small objects in complex backgrounds. When the shallow features are transferred to the top layer, a bottom-up path is added to reduce the number of network layers experienced by the feature fusion network, reducing the loss of shallow features. In addition, a multi-feature selection module based on the attention mechanism is used to minimize the interference of useless information on subsequent classification and regression, allowing the network to adaptively focus on appropriate information for classification or regression to improve detection accuracy. Because the traditional five-parameter regression method has severe boundary problems when predicting objects with large aspect ratios, the proposed network treats angle prediction as a classification task. The experimental results on the DOTA dataset, the self-made DOTA-GF dataset and the HRSC 2016 dataset show that, compared with several popular object detection algorithms, the proposed method has certain advantages in detection accuracy.
ARTICLE | doi:10.20944/preprints201809.0463.v1
Subject: Engineering, Industrial & Manufacturing Engineering Keywords: Protrusion, illumination, height, effective pixel, gray level, teaching
Online: 24 September 2018 (15:02:56 CEST)
Protrusive defects on the color filter of thin-film transistor (TFT) liquid crystal displays (LCDs) frequently damage the valuable photomask. An fast method using side-view illuminations associated with digital charge-couple devices (CCDs) to detect the protrusive defect in the four substrates, which are the black matrix (BM), red, green, and blue. Between the photomask and substrate, the depth of field (DOF) is normally 300 μm for the proximity-type aligner; we select the four substrates to evaluate the detectability in the task. The experiment is capable of detecting measurements of 300 μm and even lower than 100 μm can be assessed successfully. The maximum error of the measurement is within 6% among the four samples. Furthermore, the uncertainty analysis of three standard deviations is conducted. Thus, the method is cost-effective to prevent damage for valuable photomasks in the flat panel display industry.
ARTICLE | doi:10.20944/preprints202212.0270.v1
Subject: Mathematics & Computer Science, Information Technology & Data Management Keywords: image encryption; high pixel density; neural networks; quantum random walk
Online: 15 December 2022 (06:50:34 CET)
This paper proposes an encryption scheme for high pixel density images. Based on the application of the quantum random walk algorithm, the Long short-term memory (LSTM) can effectively solve the problem of low efficiency of the quantum random walk algorithm in generating large-scale pseudorandom matrices, and further improve the statistical properties of the pseudorandom matrices required for encryption. The LSTM is then divided into columns and fed into the LSTM in order for training. Due to the randomness of the input matrix, the LSTM cannot be trained effectively, so the output matrix is predicted to be highly random. The LSTM prediction matrix of the same size as the key matrix is generated based on the pixel density of the image to be encrypted, which can effectively complete the encryption of the image. In the statistical performance test, the proposed encryption scheme achieves an average information entropy of 7.9992, an average number of pixels changed rate (NPCR) of 99.6231%, an average uniform average change intensity (UACI) of 33.6029% and an average correlation of 0.0032. Finally, various noise simulation tests are also conducted to verify its robustness in real-world applications where common noise and attack interference are encountered.
SHORT NOTE | doi:10.20944/preprints202202.0281.v1
Subject: Earth Sciences, Oceanography Keywords: Sub-pixel mapping; Super-resolution mapping; Downscaling; Gulf of California
Online: 22 February 2022 (16:07:26 CET)
The quantification of sea surface temperature (SST) through space platforms has revolutionized how we obtain information at a global level. However, the main disadvantage of obtaining SST with satellite images consists of its inherent coarse spatial resolution. One solution could be the use of downscaling algorithms to create sequences of matrices at a higher resolution. We used the same SST data source from the MODIS-Aqua sensor at three spatial resolutions of 9 km, 4.5 km, and 1 km in the Gulf of California. Based on an open-source algorithm, the original SST images were downscaled to 4.5 km, 1 km, 500 m, 250 m, and 125 m per pixel scales. Results indicate a strong linear relationship between the original SST-MODIS data and the modeled data for all spatial resolutions. This study demonstrates the feasibility of an open-source downscaling algorithm to enhance the spatial resolution of SST images in a marginal sea.
ARTICLE | doi:10.20944/preprints202201.0352.v1
Subject: Earth Sciences, Geoinformatics Keywords: Per-pixel classification confidence; spatial pattern; image classification; accuracy assessment; interpolation method
Online: 24 January 2022 (11:53:46 CET)
Obtaining classification confidence at the pixel level is a challenging task for accuracy assessment in remote sensing image classification. Among the various methods for estimating classification confidence at the pixel level, interpolation-based methods have drawn special attention in the literature. Even though they have been widely recognized in the literature, their usefulness has not been rigorously evaluated. This paper conducts a comprehensive evaluation of three interpolation-based methods: local error matrix method, bootstrap method, and geostatistical method. We applied each of the three methods to three representative datasets with different spatial resolutions, spectral bands, and the number of classes. We then derive the estimated classification confidence and true classification confidence and compared the results with each other using both exploratory data analysis (bi-histogram) and statistical analysis (Willmott's d and Binned classification quality). The results indicate that the three interpolation methods provide some interesting insights on various aspects of estimating per-pixel classification confidence. Unfortunately, the interpolation assumes that classification confidence is smooth across the space, which is usually not true in practice. In other words, interpolation-based methods have limited practical use.
Subject: Physical Sciences, Acoustics Keywords: Single-pixel; spectroscopy; near-infrared; DMD; multiplexing; spectral coding; sub-millisecond; compressive measurement
Online: 31 July 2021 (15:10:23 CEST)
In this contribution, we present a high-speed multiplex grating spectrometer based on a spectral coding approach that is founded on principles of compressive sensing. The spectrometer employs a single-pixel InGaAs detector to measure the signals encoded by an amplitude spatial light modulator (digital micromirror device, DMD). This approach leads to a speed advantage and multiplex sensitivity advantage atypical for standard dispersive systems. Exploiting the 18.2 kHz pattern rate of the DMD, we demonstrate 4.2 ms acquisition times for full spectra with a bandwidth of 450 nm (5250 cm-1 – 4300 cm-1; 1.9 µm – 2.33 µm). Due to the programmability of the DMD, spectral regions of interest can be chosen freely, thus reducing acquisition times further, down to the sub-millisecond regime. The adjustable resolving power of the system accessed by means of computer simulations is discussed, quantified for different measurement modes, and verified by comparison with a state-of-the-art Fourier-transform infrared spectrometer. We show measurements of characteristic polymer absorption bands in different operation regimes of the spectrometer. The theoretical multiplex advantage of 8 was experimentally verified by comparison of the noise behavior of the spectral coding approach and a standard line-scan approach.
ARTICLE | doi:10.20944/preprints201608.0069.v1
Subject: Earth Sciences, Environmental Sciences Keywords: Rubber (Hevea brasiliensis) plantation; phenology; Xishuangbanna; Landsat; object-based approach; pixel-based approach
Online: 6 August 2016 (11:54:28 CEST)
Effectively mapping and monitoring rubber plantation is still changing. Previous studies have explored the potential of phenology features for rubber plantation mapping through a pixel-based approach (pixel-based phenology approach). However, in fragmented mountainous Xishuangbanna, it could lead to noises and low accuracy of resultant maps. In this study, we investigated the capability of an integrated approach by combining phenology information with an object-based approach (object-based phenology approach) to map rubber plantations in Xishuangbanna. Moderate Resolution Imaging Spectroradiometer (MODIS) data were firstly used to acquire the temporal profile and phenological features of rubber plantations and natural forests, which delineates the time windows of defoliation and foliation phases. Landsat images were then used to extract a phenology algorithm comparing three different approaches: pixel-based phenology, object-based phenology, and extended object-based phenology to separate rubber plantations and natural forests. The results showed that the two object-based approaches achieved higher accuracy than the pixel-based approach, having overall accuracies of 96.4%, 97.4%, and 95.5%, respectively. This study proved the reliability of a phenology-based rubber mapping in fragmented landscapes with a distinct dry/cool season using Landsat images. This study indicated that the object-based phenology approaches can effectively improve the accuracy of the resultant maps in fragmented landscapes.
ARTICLE | doi:10.20944/preprints201706.0009.v1
Subject: Earth Sciences, Other Keywords: Sentinel-2; remote sensing; European Space Agency; Copernicus; continental; cloud-free; composite; darkest pixel; maximum NDVI
Online: 2 June 2017 (05:03:53 CEST)
The processing of cloud free geo-referenced imagery is one of the preliminary processing step of any land application. This letter describe the methodology developed to obtain a seamless cloud free composite of Africa for 2016 using Sentinel-2A data at 10 meters resolution freely available from the European Space Agency. The method is based on an hybrid method resulting from the merging of the two most robust time series methods namely the "darkest pixel" and the "maximum NDVI" previously developed with AVHRR time series.
ARTICLE | doi:10.20944/preprints201712.0108.v1
Subject: Mathematics & Computer Science, Artificial Intelligence & Robotics Keywords: Plant phenotyping, Plant pixel classification, Colour space, , Gaussian mixture model, Earth mover distance, Variance ratio, Plant segmentation.
Online: 15 December 2017 (16:52:23 CET)
Segmentation of a region of interest is an important pre-processing step for many colour image analysis techniques. Similarly segmentation of plant in digital images is an important preprocessing step in phenotying plants by image analysis. In this paper we present an analytical study to statistically determine the suitability of colour space representation of an image to best detect plant pixels and separate them from background pixels. Our hypothesis is that the colour space representation in which the separation of the distributions representing plant pixels and background pixels is maximized would be the best for detection of plant pixels. The two classes of pixels are modelled as a Gaussian mixture model (GMM). In our GM modelling we don't make any prior assumption about the number of Gaussians in the model. Rather a constant bandwidth mean-shift filter is used to cluster the data and the number of clusters and hence the number of Gaussians is automatically determined. Here we have analysed following representative colour spaces like $RGB$, $rgb$, $HSV$, $Ycbcr$ and $CIE-Lab$. This is because these colour spaces represent several other similar colour spaces and also an exhaustive study of all the colour space will be too voluminous. We also analyse the colour space feature from the two-class variance ratio perspective and compare the results of our hypothesis with this metric. The dataset for this empirical study consist of 378 digital images of plants and their manual segmentation. Dataset consist of various species of plants (arabidopsi, tobacco, wheat, rye grass etc.) imaged under different lighting conditions, indoor and outdoor, controlled and uncontrolled background. In results we obtain better segmentation of the plants in $HSV$ colour space, which is supported by its Earth mover distance (EMD) on the GMM distribution of plant and background pixels.
ARTICLE | doi:10.20944/preprints201712.0192.v1
Subject: Mathematics & Computer Science, Information Technology & Data Management Keywords: convolutional ceural network; Gaofen 2 remote sensing image; remote sensing image segmentation; convolutional encode neural networks model (CENN); categorical learning; per-pixel segmentation; farmland; woodland
Online: 28 December 2017 (02:54:12 CET)
It is very difficult to accurately divide farmland and woodland in Gaofen 2 (GF-2) remote sensing image, because their single plant coverage is very small, and their spectra are very similar. The ratio of spatial resolution and one plant’s coverage area must be fully taken into account when designing the Convolutional Neural Network structure for extracting them from GF-2 image. We establish a Convolutional Encode Neural Networks model (CENN), The first layer has two sets of convolution kernels to learn the characteristics of farmland and woodland respectively, while the second layer is the encoder to encode the characteristics by transfer function, which can map the results to the corresponding category number. In the training stage, samples of farmland, woodland, and other categories are categorically used to train CENN, as soon as training is accomplished, CENN would acquire enough ability to accurately extract farmland and woodland from remote sensing images. The final extraction result is obtained by implementing per-pixel segmentation of images used to train the CENN. CENN is compared and analyzed with others such as Deep Belief Network (DBN), Full Convolutional Network (FCN), Deeplab Model. The results of experiments show that CENN can more accurately mine the characteristics of farmland and woodland, and it achieves its goal of extracting farmland and woodland with high precision from GF-2 images.
ARTICLE | doi:10.20944/preprints201905.0260.v1
Subject: Earth Sciences, Oceanography Keywords: Synthetic aperture radar (SAR); along-track interferometry (ATI); sub-pixel offset tracking (sPOT); COSMO-SkyMed (CSK); staring spotlight (ST); micro-motion (m-m); vibrations; frequency modes
Online: 21 May 2019 (11:33:59 CEST)
This research aims to estimate the micro-motion (m-m) of ships. The problem of motion and m-m detection of targets is usually solved using synthetic aperture radar (SAR) along-track interferometry (ATI) which is observed employing two radars spatially distanced by a baseline extended in the azimuth direction. This paper is proposing a new approach where the m-m estimation of ships, occupying thousands of pixels, is measured processing the information given by sub-pixel tracking generated during the coregistration process of several re-synthesized time-domain and overlapped sub-apertures. The SAR products are generated splitting the raw data, according to a small-temporal baseline strategy, observed by one single wide-band staring spotlight (ST) SAR image. The predominant vibrational modes of different ships are estimated and results are promising to extend this application in performing surveillance also of land-based industries activities. Experiments are performed processing one ST SAR image observed by the COSMO-SkyMed satellite system.