Computer Science and Mathematics

Sort by

Article
Computer Science and Mathematics
Signal Processing

Yutaka Yoshida

,

Kiyoko Yokoyama

Abstract: Accurate R-peak detection in electrocardiogram (ECG) signals is fundamental for cardiovascular analysis. However, most existing methods address differences in sampling frequency (fs) through signal resampling or transfer learning, which may alter the temporal definition of annotated events. In this study, we propose a fs consistent framework for ECG R-peak detection that avoids both resampling and retraining. The proposed method is based on low-sampling morphological learning combined with physiological temporal constraints (PTC). A lightweight classifier (Extreme Gradient Boosting) is trained on 128 Hz ECG data (MIT-BIH Normal Sinus Rhythm Database, XGB) to learn local morphological structures, and feature extraction is defined in milliseconds with time-normalized derivatives to ensure consistency across fs. The trained model is directly applied to higher- fs datasets (360 Hz, 500 Hz, and 1000 Hz) without modification. Final peak locations are determined through deterministic processing, including PTC and local snap processing. Experimental results demonstrated that the proposed method achieved stable detection performance across multiple sampling frequencies. When evaluated in a sample-wise manner, the proposed method achieved mean F1-scores of 0.885 on MIT-BIH Arrhythmia Database (360 Hz), 0.848 on Lobachevsky University Electrocardiography Database (LUDB, 500 Hz, sinus rhythm), 0.837 on LUDB (500 Hz, arrhythmia), and 0.953 on PTB Diagnostic ECG Database (1000 Hz), without any resampling or retraining. The integration of probabilistic candidate detection and deterministic temporal alignment enables consistent peak localization under cross-frequency conditions. These findings demonstrate that augmenting machine learning with deterministic decision mechanisms provides a principled framework for fs -consistent ECG peak detection.

Article
Computer Science and Mathematics
Signal Processing

Nahed H. Solouma

,

Michael R. Gardner

,

Noura Negm

,

Sadeq S. AlSharfi

Abstract: Optical imaging is among the safest and most highly impactful biomedical imaging modalities. Aberration in the optical imaging systems leads to distorted images. This distortion is almost nonlinear and hence affects the relative size, intensity and appearance of image details. Image aberration has many types with some or all of them can be imposed on the image based on the quality of the imaging system and/or surrounding conditions. Many approaches have been introduced to remove or minimize aberration from optical images. If the transfer function of an imaging system and the function of the noise added during the imaging process are known, then an ideal image can be obtained from the image produced by this system. The point spread function (PSF) of the optical imaging system is the image it produces for a point object. PSF is the observable form of the transfer function. The transfer function itself is the exit pupil function or typically the system aberration. The nonlinearity and multiplicity of the aberration imposed on the image together with the added noise makes it difficult to obtain the transfer function from the degraded images. In this work, optimization and global search techniques are utilized in an iterative image restoration algorithm. The proposed technique updates an initially suggested solution of transfer function by optimizing the aberration coefficients. The final solution of the transfer function and hence the PSF is reached when the optimum restored image is obtained. The proposed algorithm is validated by a testing image and then its performance is assessed by a set of aberrated images with different degradation. The results obtained in this work showed 100% success rate to obtain the PSF.

Article
Computer Science and Mathematics
Signal Processing

Siyuan Liu

,

Hangcheng Wu

,

Cheng Sun

,

Yuanbin Qiu

,

Haoliang Wu

,

Yucong Wei

,

Yang Lv

,

Zheng Yang

Abstract: This paper proposes a novel Topological Data Analysis (TDA) pipeline to extract robust structural features from functional near-infrared spectroscopy (fNIRS) signals for the classification of Alzheimer's Disease (AD) stages. Alzheimer's disease is increasingly understood as a disconnection syndrome, where the disruption of functional brain net-works precedes gross anatomical atrophy. However, traditional graph-theoretic ap-proaches rely on arbitrary connectivity thresholds, which can obscure critical multi-scale topological information and are sensitive to noise. To address this, our framework lev-erages Persistent Homology (PH) to analyse the topological evolution of brain networks across a continuous range of scales. By modeling 48-channel hemoglobin concentration time-series as high-dimensional point clouds via Granger causality metrics, we construct filtration sequences of Vietoris-Rips complexes. The resulting topological invari-ants—specifically 0-dimensional connected components, 1-dimensional loops, and 2-dimensional voids—are captured in Persistence Diagrams and subsequently vectorized into Persistence Images (PIs) using Gaussian kernel smoothing. This transformation enables the integration of complex topological features into standard machine learning workflows. Our experimental results on 284 recordings demonstrate that this topolo-gy-driven feature extraction method yields high discriminative power, achieving 77% accuracy in multi-class diagnosis (NC vs. MCI vs. AD). This study validates the efficacy of TDA as a sophisticated signal processing tool for revealing intrinsic neurodegenerative patterns in hemodynamic data, offering a potential non-invasive biomarker for early detection.

Article
Computer Science and Mathematics
Signal Processing

Rongyan Zhou

,

Weijie Tan

,

Meng Li

,

Baosheng Wang

Abstract: This article investigates sensor placement strategies for three-dimensional (3-D) time-of arrival (TOA)-based target localization in Underwater Acoustic Sensor Networks (UASNs). To mitigate underestimation of localization accuracy in complex marine environments, the actual acoustic ray propagation time is derived, and the TOA measurement variance is estimated using the ray acoustic propagation model. These formulations enable a novel 3-D TOA measurement model, and the trace of Cramér-Rao lower bound (CRLB) for this model serves as the optimization criterion for sensor placement. The MinMax k-Means algorithm is proposed to determine the optimal sensor placement by minimizing the average of the trace of CRLB. Extensive numerical simulations are conducted to demonstrate the effectiveness of the placement strategies.

Article
Computer Science and Mathematics
Signal Processing

Wei Li

,

Jiazhu Li

,

Shuyu Wang

,

Yan Chen

,

Jian Chen

Abstract: The health status of rolling bearings is critical to the normal operation of rotating machinery. To effectively extract vibration signal features and accurately identify different fault types, a novel method based on enhanced composite multi-scale slope entropy (ECMSE) and a honey badger algorithm-optimized kernel extreme learning machine (HBA-KELM) is proposed. Specifically, ECMSE integrates high-order differences into the composite multi-scale framework to capture high-frequency information while preserving low-frequency characteristics, thereby enhancing the discriminability of time-series representations. Meanwhile, an average coarse-graining strategy is incorporated to achieve a more comprehensive characterization of the signals. The extracted features are then input into the HBA-KELM classifier for fault identification. Experiments conducted on two public and private rolling bearing datasets demonstrate that our method achieves superior performance in distinguishing different fault types and damage levels compared with several existing approaches.

Article
Computer Science and Mathematics
Signal Processing

Fernando Martín-Rodríguez

,

Mónica Fernández-Barciela

,

Ainhoa Morales-Fernández

,

María Marante-Boado

Abstract: This paper evaluates the practical use of the Three-Dimensional Discrete Cosine Transform (3D-DCT) for video and volumetric image compression. While one- and two-dimensional DCT transforms are widely used in modern multimedia standards, their three-dimensional extension has received limited attention in real coding systems. In this work, a complete 3D-DCT–based encoder is developed by extending a JPEG-like pipeline to operate on three-dimensional data blocks. The proposed approach processes groups of video frames as 3D cubes and applies a separable 3D-DCT followed by quantization, coefficient serialization, and entropy coding. Unlike conventional video codecs that rely on motion estimation and compensation, the proposed system exploits temporal redundancy directly through the transform domain, resulting in a simpler coding structure with reduced algorithmic complexity. Different configurations are evaluated, including alternative methods for computing the 3D quantization matrix, serialization schemes, and variable block depth along the temporal dimension.Experimental results obtained from multiple test videos and volumetric medical datasets, including CT and MRI studies, demonstrate that the proposed method achieves competitive compression ratios while maintaining good reconstruction quality measured in terms of Peak Signal-to-Noise Ratio (PSNR). The results indicate that 3D-DCT provides a flexible and computationally simple solution for both video compression and three-dimensional medical image coding, particularly in applications where implementation simplicity and frame-level accessibility are important.

Article
Computer Science and Mathematics
Signal Processing

Xuchao Gao

,

Mingqiang Li

,

Kai Guan

,

Jianjun Ge

Abstract: To address the high computational complexity and insufficient real-time performance of traditional multi-radar trajectory planning methods in complex electromagnetic interference environments, this paper proposes an imitation learning-based trajectory planning method for multi-radar systems. This method designs a trajectory policy neural network architecture based on multiple semantic information. It proposes a training data construction method with coverage rate as the optimization objective. Then the trajectory policy neural network is trained by using an imitation learning algorithm with an auxiliary target. Simulation results show that the proposed method achieves an average coverage rate of 93.95%, and improves the single-step decision efficiency by a factor of 6.7 compared with heuristic-based trajectory optimization methods.

Article
Computer Science and Mathematics
Signal Processing

Lin Li

,

Jichun Zhu

,

Mingxing Jiang

,

Jingli Fang

Abstract: With the increasing demand for high-quality imaging in consumer electronics, image aesthetic assessment (IAA) has been widely applied to electronic cameras and display devices. Although the deformable attention mechanism has been introduced into IAA due to its perceptual capabilities, enabling models to refine attention regions by learning interest points and their corresponding offsets, existing methods often lack guidance from aesthetic composition features during the offset generation process, which limits their performance in aesthetic evaluation tasks. To address this issue, we propose a Graph Neural Network (GNN)-guided deformable attention module that incorporates composition information into the generation of interest points by modeling image features as graphs and applying GNN to guide interest point selection. In addition, we design an improved Transformer model that employs neighborhood attention to further enhance IAA performance. We evaluate the proposed model on two aesthetic datasets, AVA and TAD66K, and the experimental results demonstrate its effectiveness in improving overall model performance.

Article
Computer Science and Mathematics
Signal Processing

Sapthak Mohajon Turjya

Abstract: This paper presents a hybrid model for the control of brain-computer interfaces (BCIs) for Metaverse environments, with the goal of advancing the capabilities of such interfaces beyond the traditional motor imagery (MI) or P300-based brain-computer interfaces. This hybrid model utilizes P300 for virtual devices' interaction and MI for navigation and movement imagination in the Metaverse, with each EEG modality being dedicated to a particular control state and state changes being made sequentially based on the context of the interaction. In the simulated experiment, the imagined movement of the left and right hands is used for rotational navigation, while discrete devices' actions use P300 responses under a five-stimulus oddball paradigm. In the performance evaluation, the paper shows that the hybrid model, with the use of MI and P300 under a single BCI, achieves accuracy comparable to single-mode BCIs, with advantages over existing BCI systems regarding their capabilities for interaction and adaptability, thus proving the effectiveness of hybrid control for achieving dynamic and flexible Metaverse interactions.

Article
Computer Science and Mathematics
Signal Processing

Ricardo Bernárdez-Vilaboa

,

Juan E. Cedrún-Sánchez

,

Silvia Burgos-Postigo

,

Rut González-Jiménez

,

Carla Otero-Currás

,

F. Javier Povedano-Montero

Abstract: Background: Sensor-based systems and virtual reality (VR) technologies provide new opportunities for the objective, technology-driven assessment and training of visuo-motor performance in applied contexts such as sport. Methods: This study examined the effects of an integrated visual training program combining stroboscopic stimula-tion, VR-based vergence exercises, and instrumented reaction-light tasks in adolescent handball players. Twenty-eight youth athletes completed two baseline assessments separated by six weeks, followed by a six-session training program integrated into reg-ular team practice. Sensor-derived outcome measures included dynamic accommoda-tive performance, simple and choice visual reaction times, peripheral-field response metrics, binocular alignment, stereoscopic depth perception, and basic oculomotor function. Results: Compared with both baseline measurements, the intervention pro-duced selective improvements in accommodative facility—particularly near–far fo-cusing speed—and in multiple reaction-time conditions involving manual and deci-sion-based responses. Specific peripheral-field locations showed increased response scores, whereas binocular alignment, AC/A ratio, near phoria, and stereoscopic acuity remained unchanged. Conclusions: These findings indicate that technology-supported visual training protocols incorporating sensor-based reaction systems and VR stimuli can be associated with measurable adaptations in dynamic visuomotor processing while preserving fundamental binocular vision parameters.

Article
Computer Science and Mathematics
Signal Processing

Vadim A. Nenashev

,

Renata I. Chembarisova

,

Aleksandr R. Bestugin

,

Vladimir P. Kuzmenko

,

Sergey A. Nenashev

Abstract: Recently, when forming radar video frames for surface mapping, group-interacting compact onboard radar systems (CORS) are increasingly being utilized. In this context, for the cooperative functioning of the group, each compact radar should use its own unique marked signal as the probing signal. This signal must be distinguishable in the common channel and should not destructively affect the probing signals emitted by other radars within the group. This organization allows for associating the marked signals reflected from the underlying surface with specific CORS in the group. This requirement arises from the fact that each compact onboard radar in the group emits a single probing signal and then receives all the reflected signals from the surface that were emitted by the other CORS in the group. Such an organization of the group-based system of technical vision requires the search for and study of specialized marked code structures used for phase modulation of probing signals to identify them in the shared radar channel. The study focuses on the search for new complex M-sequences with lower sidelobe levels of the normalized autocorrelation function compared to traditional M-sequences. This is achieved by replacing the traditional alphabet of positive and negative ones with an asymmetric set consisting of complex numbers. Using numerical methods and computer simulations, optimal complex values of the sequence with a minimum level of sidelobes in the autocorrelation function are determined. In addition to correlation properties, the phase-modulated signals generated based on the new marked sequences are also investigated. The results obtained open up new possibilities for the construction of a group-based technical vision system, enabling cooperative surface probing with each CORS in the interacting group.

Review
Computer Science and Mathematics
Signal Processing

Christos Kalogeropoulos

,

Seferina Mavroudi

,

Konstantinos Theofilatos

Abstract:

Electroencephalography (EEG) has transitioned from a subjective observational method into a data-intensive analytical field that utilises sophisticated algorithms and mathematical modeling. This progression encompasses developments in signal preprocessing, artifact removal, and feature extraction techniques including time-domain, frequency-domain, time-frequency, and nonlinear complexity measures. To provide a holistic foundation for researchers, this review begins with the neurophysiological basis, recording technique and clinical applications of EEG, while maintaining its primary focus on the diverse methods used for signal analysis. It offers an overview of traditional mathematical techniques used in EEG analysis, alongside contemporary, state-of-the-art methodologies. Machine Learning (ML) and Deep Learning (DL) architectures, such as Support Vector Machines (SVMs), Convolutional Neural Networks (CNNs), and transformer models, have been employed to automate feature learning and classification across diverse applications. We conclude that the next generation of EEG analysis will likely converge into Neuro-Symbolic architectures, synergising the generative power of foundation models with the rigorous interpretability of signal theory.

Article
Computer Science and Mathematics
Signal Processing

Seokwon Yeom

Abstract: Drone localization is essential for various purposes such as navigation, autonomous flight, and object tracking. However, this task is challenging when satellite signals are unavailable. This paper addresses vision-only localization of flying drones through op-timal window velocity fusion. Multiple optimal windows are derived from a piecewise linear regression (segment) model of the image-to-real world conversion function. Each window serves as a template to estimate the drone's instantaneous velocity. The multiple velocities obtained from multiple optimal windows are integrated by two fusion rules: one is a weighted average for lateral velocity, and the other is a winner-take-all decision for longitudinal velocity. In the experiments, a drone performed a total of six short-range (about 800 m to 2 km) and high maneuvering flights in rural and urban areas. Four flights in rural areas consist of a forward-backward straight flight, a forward-backward zigzag flight (a snake path), a square path with three banked turns, and a free flight that includes both banked turns and zigzags. Two flights in urban areas are a straight outbound flight and a forward-backward straight flight. The performance was evaluated through the root mean squared error (RMSE) and drift error of the ground-truth trajectory and the rig-id-body rotated vision-only trajectory. The proposed image-based method has been shown to achieve flight errors of a few meters to tens of meters, which corresponds to around 3% of the flight length.

Article
Computer Science and Mathematics
Signal Processing

Arturo Tozzi

Abstract: Many machine-learning tasks involve structured data whose geometry, local feature distributions and global organization interact in ways that are not well captured by existing methods based on vectorization, graph metrics or homological signatures. We introduce Fiber Bundle Learning (FBL), a topological framework that represents each data sample as a discrete fiber bundle and extracts a classification signature combining persistent homology, local feature geometry and gluing structure. FBL builds a base space from the coarse geometry of each object, models local feature patches as fibers and estimates transition maps between neighboring fibers to construct a discrete connection. From this representation, FBL computes a set of invariants: persistent homology of the base, fibers and total space; holonomy obtained by transporting fiber states along cycles; curvature-like quantities measuring transition inconsistency; discrete analogues of characteristic classes. These components are assembled into a fixed-length feature vector that can be used with any standard classifier. We show that FBL yields a signature with three desirable theoretical properties: stability under perturbations of geometry and local features; invariance under isometries and global fiber reparameterizations; robustness to sampling noise. Our synthetic experiments show that FBL distinguishes twisted from untwisted bundles with identical homology, a distinction classical topological methods fail to capture. Additional tests quantify the system’s resistance to noise, its invariance to geometric transformations and the contribution of each signature component. Taken together, our results indicate that representing data through fiber-bundle structure may provide an effective tool for classifying complex, multi-level objects.

Article
Computer Science and Mathematics
Signal Processing

Ioannis Dologlou

Abstract: A new method to estimate future samples in time series data is presented and it is compared against the well known technique ESPRIT. It exploits the null space of the Hankel matrix of the data allowing the prediction of future samples with better accuracy and confidence. Moreover a generalization of the algorithm is derived that also applies to multichannel signals. Both cases with and without cross-channel coupling are considered and different algorithms are presented. The method is fully deterministic with comparable computational complexity to ESPRIT. Testing involves 4000 randomly chosen data sets with variable spectral characteristics.

Article
Computer Science and Mathematics
Signal Processing

Bálint Maczák

,

Adél Zita Hordós

,

Gergely Vadai

Abstract: Actigraphy quantifies human locomotor activity by measuring wrist acceleration with wearable devices at relatively high rates and converting it into lower-temporal-resolution activity values; however, the computational implementations of this data compression differ substantially across manufacturers. Building on our previous work, where we ex-amined how dissimilarly the various activity determination methods we generalized can quantify the same movements through correlation analysis, we investigated here how these methods (e.g., digital filtering, data compression) influence nonparametric circadian rhythm analysis and sleep–wake scoring. In addition to our generalized actigraphic framework, we also emulated the use of specific devices commonly employed in such sleep-related studies by applying their methods to raw actigraphic acceleration data we collected to demonstrate, through concrete real-life examples, how methodological choices may shape analytical outcomes. Additionally, we assessed whether nonparametric indi-cators could be derived directly from acceleration data without compressing them into ac-tivity values. Overall, our analysis revealed that all these analytical approaches of the sleep-wake cycle can be substantially affected by the manufacturer dependent actigraphic methodology, with the observed effects traceable to distinct steps of the signal processing pipeline, underscoring the necessity of cross manufacturer harmonization from a clini-cally oriented perspective.

Article
Computer Science and Mathematics
Signal Processing

Grzegorz Szwoch

Abstract: Automatic speech recognition in a scenario with multiple speakers in a reverberant space, such as a small courtroom, often requires multiple sensors. This leads to a problem of crosstalk that must be removed before the speech-to-text transcription is performed. The proposed method uses Acoustic Vector Sensors to acquire audio streams. Speaker detection is performed using statistical analysis of the direction of arrival. This information is then used to perform source separation. Next, speakers’ activity in each channel is analyzed, and signal fragments containing direct speech and crosstalk are identified. Crosstalk is then suppressed using a dynamic gain processor, and the resulting audio streams may be passed to a speech recognition system. The algorithm was evaluated using a custom set of speech recordings. An increase in SI-SDR value over the unprocessed signal was achieved: 7.54 dB and 19.53 dB for the algorithm with and without the source separation stage, respectively. The algorithm is intended for application in multi-speaker scenarios requiring speech-to-text transcription, such as court sessions or conferences.

Article
Computer Science and Mathematics
Signal Processing

Ioannis Dologlou

Abstract: A new method to estimate future samples in time series data is presented and it is compared against the well known technique ESPRIT. It exploits the null space of the Hankel matrix of the data allowing the prediction of future samples with better accuracy and confidence. In a more general sense the notion of null space refers to the set of eigenvectors of the data Hankel matrix which are associated with the smallest eigenvalues. The method is fully deterministic with comparable computational complexity to ESPRIT. Testing involves 4000 randomly chosen data sets with variable spectral characteristics.

Article
Computer Science and Mathematics
Signal Processing

Lyu Minhui

,

Jiang Rongjun

Abstract: Distributed optical fiber sensing technologies, particularly Φ-OTDR, have been extensively applied in vibration monitoring of critical infrastructure, including highways and pipelines. This is attributed to their capabilities of long-distance monitoring, high spatiotemporal coverage, and ease of deployment. Nevertheless, the monitoring data encompasses a mixture of information such as cable structural coupling, initial vibration states, and multi-modal environmental excitations. Consequently, effective separation and extraction of these signals are crucial for practical implementations. Notably, multi-modal weak signal analysis has emerged as a significant technical challenge in this field. Building upon the Φ-OTDR DVS fiber-environment coupled vibration observation model, this research introduces an innovative scenario-oriented analytical framework that integrates a combined multi-head attention mechanism. This advancement enables precise extraction of multi-modal weak signals within complex environments. Empirical validation utilizing measured data from a 30 km optical cable installed along an urban ring road has confirmed the framework’s exceptional performance across various scenarios. These include road surface roughness detection, construction machinery detection and localization, and vehicle trajectory recognition. The study reveals that the attention mechanism effectively concentrates on scenario-relevant signals, thereby substantially enhancing the analytical real-time performance. Overall, the proposed framework offers a versatile and real-time solution for DVS signal processing in intricate scenarios.

Article
Computer Science and Mathematics
Signal Processing

Zhuolei Chen

,

Wenbin Wu

,

Renshu Wang

,

Manshu Liang

,

Weihao Zhang

,

Shuning Yao

,

Wenquan Hu

,

Chaojin Qing

Abstract: Unmanned aerial vehicle (UAV)-assisted wireless communication systems often employ the carrier aggregation (CA) technique to alleviate the issue of insufficient bandwidth. However, in high-mobility UAV communication scenarios, the dynamic channel characteristics pose significant challenges to channel estimation (CE). Given these challenges, integrated sensing and communication (ISAC), which combines communication and sensing functionalities, has emerged as a promising solution to enhance CE accuracy for UAV systems. Meanwhile, the dominant line-of-sight (LoS) characteristics inherent in UAV scenarios present a valuable opportunity for further exploitation. To this end, this paper proposes a LoS and echo sensing-based CE scheme for CA-enabled UAV-assisted communication systems. Firstly, LoS sensing and echo sensing techniques are employed to acquire sensing-assisted prior information. Subsequently, the obtained prior information is utilized to refine the CE of the primary component carrier (PCC) in CA, thereby improving the accuracy of channel parameter estimation for the PCC. Based on the path-sharing property between PCC and secondary component carriers (SCCs), a three-stage scheme is proposed to reconstruct the channel of SCCs. In Stages I and II, the path-sharing property is exploited to reconstruct the LoS and non-line-of-sight (NLoS) paths of the SCCs in the delay-Doppler (DD) domain, respectively. Finally, an iterative procedure is applied to enhance the initial reconstruction and further recover non-shared transmission paths between PCC and SCCs. Simulation results demonstrate that the proposed method effectively enhances the CE accuracy for both PCC and SCCs. Furthermore, the proposed scheme exhibits robustness against parameter variations.

of 8

Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated