ARTICLE | doi:10.20944/preprints201703.0028.v1
Subject: Computer Science And Mathematics, Information Systems Keywords: GPS trajectory; GPS sensor; trajectory similarity measure; spatial-temporal data
Online: 6 March 2017 (06:51:37 CET)
With the rapid spread of handheld smart devices with built-in GPS, trajectory data from GPS sensors have grown explosively. Trajectory data have spatio-temporal characteristics and carry rich information. Trajectory data processing techniques can mine patterns of human activity and vehicle movement in intelligent transportation systems. The trajectory similarity measure is one of the most important issues in trajectory data mining (clustering, classification, frequent pattern mining, etc.). Unfortunately, the main similarity measure algorithms for trajectory data have been found to be inaccurate, highly sensitive to the sampling method, and poorly robust to noisy data. To solve these problems, three distances and their corresponding computation methods are proposed in this paper. The point-segment distance decreases sensitivity to the point sampling method. The prediction distance optimizes the temporal distance using the features of trajectory data. The segment-segment distance introduces the trajectory shape factor into the similarity measurement to improve accuracy. The three distances are integrated with the traditional dynamic time warping (DTW) algorithm to propose a new segment-based dynamic time warping algorithm (SDTW). The experimental results show that SDTW achieves about 57%, 86%, and 31% better accuracy than the longest common subsequence algorithm (LCSS), the edit distance on real sequence algorithm (EDR), and DTW, respectively, and that its sensitivity to noisy data is lower than that of those algorithms.
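The point-segment, prediction, and segment-segment distances are the paper's contributions; the DTW backbone they plug into is standard. A minimal sketch of classic DTW over 1-D sequences (SDTW replaces the pointwise cost below with the segment-based distances described above):

```python
def dtw(a, b):
    """Classic dynamic time warping distance between two
    1-D sequences, with absolute-difference point cost."""
    n, m = len(a), len(b)
    INF = float("inf")
    # cost[i][j] = DTW distance between a[:i] and b[:j]
    cost = [[INF] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],      # insertion
                                 cost[i][j - 1],      # deletion
                                 cost[i - 1][j - 1])  # match
    return cost[n][m]
```

Because warping lets one point align with several, identical sequences sampled at different rates still get distance 0, which is exactly the sampling sensitivity DTW already mitigates and SDTW improves on.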
ARTICLE | doi:10.20944/preprints201806.0440.v1
Subject: Computer Science And Mathematics, Computational Mathematics Keywords: clustering; spatial data; grid-based k-prototypes; data mining; sustainability
Online: 27 June 2018 (10:21:22 CEST)
Data mining plays a critical role in sustainable decision-making. The k-prototypes algorithm is one of the best-known algorithms for clustering both numeric and categorical data. Despite this, clustering a large number of spatial objects with mixed numeric and categorical attributes is still inefficient due to its high time complexity. In this paper, we propose efficient grid-based k-prototypes algorithms, GK-prototypes, which achieve high performance for clustering spatial objects. The first proposed algorithm utilizes both the maximum and minimum distances between cluster centers and a cell, which can remove unnecessary distance calculations. The second proposed algorithm extends the first by exploiting spatial dependence, i.e., the tendency of spatial data to be more similar the closer the objects are. Each cell has a bitmap index which stores the categorical values of all objects in the cell for each attribute. This bitmap index can improve performance when the categorical data are skewed. Our evaluation experiments showed that the proposed algorithms achieve better performance than the existing pruning technique for the k-prototypes algorithm.
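The cell-based pruning operates on top of the standard k-prototypes mixed-attribute distance. A minimal sketch of that distance (gamma is the usual weight balancing numeric against categorical dissimilarity; the grid pruning itself is the paper's contribution and not shown):

```python
def kproto_distance(x_num, x_cat, c_num, c_cat, gamma=1.0):
    """Mixed-attribute distance used by k-prototypes:
    squared Euclidean over the numeric attributes plus gamma
    times the number of categorical mismatches."""
    num = sum((a - b) ** 2 for a, b in zip(x_num, c_num))
    cat = sum(1 for a, b in zip(x_cat, c_cat) if a != b)
    return num + gamma * cat
```

The pruning idea is then to skip this computation for every object in a cell whenever the cell's maximum possible distance to one center is smaller than its minimum possible distance to all others.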
ARTICLE | doi:10.20944/preprints202309.1016.v1
Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Imbalanced data; Data preprocessing; Sampling; Tomek Links; DTW
Online: 14 September 2023 (14:00:42 CEST)
Purpose: To alleviate the data imbalance problem caused by subjective and objective factors, scholars have developed various data preprocessing algorithms, among which undersampling algorithms are widely used because they are fast and efficient. However, when the number of samples in some categories of a multi-class dataset is too small to be processed by sampling, or the number of minority class samples is only 1 to 2, traditional undersampling algorithms are weakened. Methods: This study selects 9 multi-class time series datasets with extremely few samples, fully considers the characteristics of time series data, and uses a three-stage algorithm to alleviate the data imbalance problem. Stage one: random oversampling with disturbance terms increases the number of sample points. Stage two: SMOTE (Synthetic Minority Oversampling Technique) oversampling on this basis. Stage three: using the dynamic time warping distance between sample points, identify the Tomek Link sample points at the class boundaries and clean up the boundary noise. Results: This study proposes a new sampling algorithm. On the 9 multi-class time series datasets with extremely few samples, the new sampling algorithm is compared with four classic undersampling algorithms, ENN (Edited Nearest Neighbours), NCR (Neighborhood Cleaning Rule), OSS (One Side Selection) and RENN (Repeated Edited Nearest Neighbours), using macro accuracy, recall and F1-score as evaluation indicators.
The results show that on FiftyWords, the dataset with the most categories and the fewest minority class samples among the 9 selected, the accuracy of the new sampling algorithm is 0.7156, far surpassing ENN, RENN, OSS and NCR; its recall, at 0.7261, is also better than that of the four undersampling algorithms used for comparison; and its F1-score is higher by 200.71%, 188.74%, 155.29% and 85.61% relative to ENN, RENN, OSS and NCR, respectively. On the other 8 datasets, the new sampling algorithm also scores well. Conclusion: The new algorithm proposed in this study can effectively alleviate the data imbalance problem of multi-class time series datasets with many categories and few minority class samples, while cleaning up the boundary noise between classes.
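Stage three's Tomek Link cleaning can be sketched independently of the oversampling stages. A minimal, hedged implementation: a Tomek link is a pair of mutual nearest neighbours with different class labels; the metric is a pluggable parameter here, so the DTW distance the study uses could be passed in:

```python
def tomek_links(X, y, dist):
    """Return index pairs (i, j) that form Tomek links:
    mutual nearest neighbours carrying different labels."""
    n = len(X)
    nn = []
    for i in range(n):
        # nearest neighbour of sample i under the given metric
        j = min((k for k in range(n) if k != i),
                key=lambda k: dist(X[i], X[k]))
        nn.append(j)
    links = set()
    for i in range(n):
        j = nn[i]
        if nn[j] == i and y[i] != y[j]:
            links.add((min(i, j), max(i, j)))
    return sorted(links)
```

The boundary cleaning step then removes (or relabels) the members of each returned pair, since mutual nearest neighbours with different labels sit on a noisy class boundary.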
ARTICLE | doi:10.20944/preprints202112.0452.v1
Subject: Computer Science And Mathematics, Data Structures, Algorithms And Complexity Keywords: Electrical Resistance Tomography (ERT); Raw Data Processing; Inline Swirl Separator; Geometrical Parameter Extraction
Online: 28 December 2021 (14:42:44 CET)
Electrical Resistance Tomography (ERT) has been used in the literature to monitor gas-liquid separation. However, the image reconstruction algorithms used in these studies take a considerable amount of time to generate the tomograms, far above the time scales of the flow inside the inline separator; as a consequence, the technique is not fast enough to capture all the relevant dynamics of the process, which is vital for control applications. This article proposes a new strategy, based on the physics behind the measurement and simple logic, to monitor the separation with high temporal resolution by minimizing both the amount of data and the calculations required to reconstruct one frame of the flow. To demonstrate its potential, the electronics of an ERT system are used together with a high-speed camera to measure the flow inside an inline swirl separator. For the 16-electrode system used in this study, only 12 measurements are required to reconstruct the whole flow distribution with the proposed algorithm, 10 times fewer than the minimum number of ERT measurements (120). In terms of computational effort, the technique was shown to be 1000 times faster than solving the inverse problem non-iteratively via the Gauss-Newton approach, one of the computationally cheapest techniques available. Therefore, this novel algorithm has the potential to achieve measurement speeds on the order of 10⁴ times the ERT speed in the context of inline swirl separation, pointing to flow measurements at around 10 kHz while keeping the average estimation error below 6 mm in the worst-case scenario.
ARTICLE | doi:10.20944/preprints202111.0440.v1
Subject: Engineering, Control And Systems Engineering Keywords: time series; NMP algorithm; anomalies; data mining; similarities in time series; clustering
Online: 23 November 2021 (17:51:42 CET)
Time series data are significant and are derived from temporal data: real numbers representing values collected regularly over time. Time series underlie many types of data but are prone to anomalies. We introduce a hybrid algorithm named the novel matrix profile (NMP) to solve the all-pairs similarity search problem for time series data. The proposed NMP inherits features from two state-of-the-art algorithms: the Scalable Time series Anytime Matrix Profile (STAMP) and the Scalable Time series Ordered-search Matrix Profile (STOMP). The proposed algorithm caches the output in an easy-to-access fashion for single- and multidimensional data. The NMP algorithm can be used on large data sets, generates approximate solutions of high quality in a reasonable time, and can handle several data mining tasks. It is implemented on a Python platform. To determine its effectiveness, it is compared with the state-of-the-art matrix profile algorithms, i.e., STAMP and STOMP. The results confirm that the proposed NMP provides higher accuracy than the compared algorithms.
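For context, the matrix profile that STAMP, STOMP, and the proposed NMP all compute can be stated as a naive quadratic-time sketch (the real algorithms use z-normalized distances and much faster update schemes; plain Euclidean distance is used here for brevity):

```python
import math

def matrix_profile(ts, m):
    """Naive O(n^2 * m) matrix profile: for each length-m
    subsequence of ts, the Euclidean distance to its nearest
    non-overlapping match elsewhere in the series."""
    n = len(ts) - m + 1
    subs = [ts[i:i + m] for i in range(n)]
    profile = []
    for i in range(n):
        best = math.inf
        for j in range(n):
            if abs(i - j) < m:      # exclusion zone: skip trivial self-matches
                continue
            best = min(best, math.dist(subs[i], subs[j]))
        profile.append(best)
    return profile
```

Low profile values flag repeated motifs, while high values flag discords, which is why the matrix profile supports the anomaly detection and other data mining tasks mentioned above.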
ARTICLE | doi:10.20944/preprints201810.0660.v1
Subject: Computer Science And Mathematics, Information Systems Keywords: dynamic time warping; pattern matching trading system; time series data; sliding window
Online: 29 October 2018 (07:03:51 CET)
The futures market plays a significant role in investors' hedging and speculation. Although various models and instruments have been developed for real-time trading, it is difficult to realize a profit by processing and trading a vast amount of real-time data. This study proposes a real-time index futures trading strategy based on the pattern of KOSPI 200 index futures time series data. We construct a pattern matching trading system (PMTS) based on a dynamic time warping algorithm that recognizes patterns in morning market data movements and determines the afternoon's clearing strategy. We adopt 13 and 27 representative patterns and run simulations over various parameter ranges to find the optimal ones. Our experimental results show that the PMTS provides stable and effective trading strategies at relatively low trading frequencies. Investor communities that sustain financial markets can invest more efficiently by using the PMTS; in this sense, the system developed in this paper is a sustainable investment technique that helps financial markets achieve efficient sustainability.
ARTICLE | doi:10.20944/preprints201808.0540.v1
Subject: Engineering, Industrial And Manufacturing Engineering Keywords: circular economy; remanufacturing; fuel cells; data-driven; systems dynamics
Online: 31 August 2018 (05:31:03 CEST)
Remanufacturing is a viable option to extend the useful life of an end-of-use product or its parts, ensuring sustainable competitive advantages in the current global economic climate. Challenges typical of remanufacturing persist despite its many benefits. According to the European Remanufacturing Network, a key challenge, highlighted in a 2015 survey of 188 European remanufacturers, is the lack of accurate, timely and consistent product knowledge. With more data being produced by electric and hybrid vehicles, this adds to the information complexity already experienced in remanufacturing. Real-time, accurate remanufacturing is therefore difficult to implement on the shop floor, and no papers focus on this within an electric and hybrid vehicle environment. To address this problem, this paper attempts to (1) identify the parameters/variables needed for fuel cell remanufacturing by means of interviews; (2) rank the variables by Pareto analysis; (3) develop a causal loop diagram for the identified parameters/variables to visualise their impact on remanufacturing; and (4) model a simple stock-and-flow diagram to simulate and understand data- and information-driven schemes in remanufacturing.
ARTICLE | doi:10.20944/preprints202306.1378.v1
Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Data Generation; Anomaly Data; User Behavior Generation; Big Data
Online: 19 June 2023 (16:31:37 CEST)
The rising importance of Big Data in modern information analysis rests on vast quantities of user data, but sufficient data can only be collected in certain data-gathering contexts. In many cases a domain is too novel, too niche, or too sparsely collected to adequately support Big Data tasks. To remedy this, we have created the ADG Engine, which generates additional data that follows the trends and patterns of the data already collected. Using a database structure that tracks users across different activity types, the ADG Engine can use all available information to maximize the authenticity of the generated data. Our efforts are particularly geared towards data analytics: abnormalities in the data are identified, and the user can generate normal and abnormal data at custom ratios. In situations where it would be impractical or impossible to expand the available dataset by collecting more data, it is still possible to move forward with algorithmically expanded datasets.
ARTICLE | doi:10.20944/preprints201810.0253.v1
Subject: Computer Science And Mathematics, Computer Science Keywords: adaptive filtering; set-membership filtering; affine projection; data censoring; big data; outliers
Online: 12 October 2018 (04:57:08 CEST)
In this paper, the set-membership affine projection (SM-AP) algorithm is utilized to censor non-informative data in big data applications. To this end, the probability distribution of the additive noise signal and the steady-state excess mean-squared error (EMSE) are employed to estimate the threshold parameter of the single-threshold SM-AP (ST-SM-AP) algorithm, aiming to attain the desired update rate. Furthermore, by defining an acceptable range for the error signal, the double-threshold SM-AP (DT-SM-AP) algorithm is proposed to detect very large errors due to irrelevant data such as outliers. The DT-SM-AP algorithm can censor both non-informative and irrelevant data in big data applications, and it can improve the misalignment and convergence rate of the learning process with high computational efficiency. The simulation and numerical results corroborate the superiority of the proposed algorithms over traditional algorithms.
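The censoring idea behind set-membership algorithms can be illustrated with the simpler set-membership NLMS update rather than the paper's SM-AP (which reuses several past input vectors); this single-vector sketch keeps only the thresholded, data-selective update that does the censoring:

```python
def sm_nlms_step(w, x, d, gamma, eps=1e-12):
    """One set-membership NLMS update: the weight vector w is
    updated only when the a priori error exceeds the threshold
    gamma; otherwise the datum is censored at almost no cost."""
    e = d - sum(wi * xi for wi, xi in zip(w, x))
    if abs(e) <= gamma:
        return w, False                    # non-informative datum: skip
    mu = 1.0 - gamma / abs(e)              # just enough to re-enter the set
    norm = sum(xi * xi for xi in x) + eps  # eps guards a zero input vector
    step = mu * e / norm
    return [wi + step * xi for wi, xi in zip(w, x)], True
```

With a well-chosen gamma, most samples fall inside the acceptable error set and trigger no update, which is how these algorithms trade a negligible accuracy loss for a large reduction in computation.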
ARTICLE | doi:10.20944/preprints202008.0254.v1
Subject: Computer Science And Mathematics, Information Systems Keywords: feature selection; k-means; silhouette measure; clustering; big data; fault classification; sensor data; time-series data
Online: 11 August 2020 (06:26:43 CEST)
Feature selection is a crucial step for overcoming the curse of dimensionality in data mining. This work proposes Recursive k-means Silhouette Elimination (RkSE), a new unsupervised feature selection algorithm that reduces dimensionality in univariate and multivariate time-series datasets: k-means clustering is applied recursively to select the cluster-representative features, with the silhouette measure applied to each cluster and a user-defined threshold serving as the feature selection or elimination criterion. The proposed method is evaluated on multi-sensor readings from a hydraulic test rig in two fashions: (1) reducing the dimensionality of a multivariate classification problem using various classifiers of different functionalities; (2) classification of univariate data in a sliding-window scenario, where RkSE is used as a window compression method, reducing the window dimensionality by selecting the best time points in each window. The results are validated using 10-fold cross-validation and compared to classification performed directly with no feature selection. Additionally, a new taxonomy for k-means-based feature selection methods is proposed. The experimental results and observations in the two comprehensive experiments demonstrate the capabilities and accuracy of the proposed method.
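The silhouette measure at the heart of RkSE scores how well a point sits in its cluster. A minimal sketch of the per-point coefficient (the recursive k-means wrapper and the threshold logic are the paper's contribution and not shown):

```python
def silhouette(point, same, other_clusters, dist):
    """Silhouette coefficient of one point: (b - a) / max(a, b),
    where a is the mean distance to the other members of its own
    cluster (`same`) and b the mean distance to the nearest other
    cluster. Values near 1 mean the point is well placed."""
    a = sum(dist(point, p) for p in same) / len(same)
    b = min(sum(dist(point, p) for p in c) / len(c)
            for c in other_clusters)
    return (b - a) / max(a, b)
```

In an RkSE-style scheme, features whose silhouette falls below the user-defined threshold would be eliminated, and k-means would then be re-run on the survivors.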
Subject: Computer Science And Mathematics, Computer Science Keywords: big data; data integration; EVMS; construction management
Online: 30 October 2020 (15:35:00 CET)
In today's information age, data are becoming more and more important. While other industries have achieved tangible improvements by applying cutting-edge information technology, the construction industry still lags far behind. Cost, schedule, and performance control are three major functions in the project execution phase. Alongside their individual importance, cost-schedule integration has been a significant challenge in the construction industry over the past five decades. Although much effort has been put into its development, no method is established in construction practice. The purpose of this study is to propose a new method for integrating cost and schedule data using big data technology. The proposed algorithm is designed to provide data integrity and flexibility in the integration process, a considerable reduction in the time needed to build and change the database, and practical use on a construction site. The proposed method may transform, in a data-friendly way, the current situation in which field engineers regard information management as a troublesome task.
ARTICLE | doi:10.20944/preprints201806.0365.v1
Subject: Engineering, Control And Systems Engineering Keywords: ARIMA model; data forecasting; multi-objective genetic algorithm; regression model
Online: 24 June 2018 (07:48:49 CEST)
The aim of this study has been to develop a novel two-level multi-objective genetic algorithm (GA) to optimize time series forecasting of data for fans used in road tunnels by the Swedish Transport Administration (Trafikverket). Level 1 performs the forecasting of time series cost data, while level 2 evaluates the forecasts. Level 1 implements either a multi-objective GA based on the ARIMA model or a multi-objective GA based on the dynamic regression model. Level 2 utilises a multi-objective GA based on different forecasting error rates to identify a proper forecast. Our method is compared with using the ARIMA model alone. The results show the drawbacks of time series forecasting using only the ARIMA model, and the two-level model further shows the drawbacks of forecasting with a multi-objective GA based on the dynamic regression model; a multi-objective GA based on the ARIMA model produces better forecasts. In level 2, five forecasting accuracy functions help select the best forecast. Selecting a proper forecasting methodology is based on the averages of the forecasted data, the historical data, the actual data and the polynomial trends. The forecasted data can be used for life cycle cost (LCC) analysis.
ARTICLE | doi:10.20944/preprints202308.1170.v1
Subject: Computer Science And Mathematics, Information Systems Keywords: research data management; FAIR; file structure; file crawler; semantic data model
Online: 16 August 2023 (11:05:47 CEST)
Although other methods exist to store and manage data in modern information technology, the standard solution is the file system. Therefore, keeping well-organized file structures and file system layouts can be key to a sustainable research data management infrastructure. However, file structures alone lack several capabilities that are important for FAIR data management; the two most striking are insufficient visualization of data and inadequate possibilities for searching and getting an overview. Research data management systems (RDMS) can fill this gap, but many do not support the simultaneous use of the file system and the RDMS. This simultaneous use can have many benefits, but keeping data in the RDMS synchronized with the file structure is challenging. Here, we present concepts that allow file structures and semantic data models (in an RDMS) to be kept synchronous. Furthermore, we propose a specification in YAML format that allows for a structured and extensible declaration and implementation of a mapping between the file system and the data models used in semantic research data management. Implementing these concepts will facilitate the re-use of specifications for multiple use cases. Furthermore, the specification can serve as machine-readable and, at the same time, human-readable documentation of specific file system structures. We demonstrate our work using the open-source RDMS CaosDB.
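To make the idea concrete, a mapping specification of this kind might look as follows; this is purely an illustration, and the key names below are hypothetical rather than the actual CaosDB syntax:

```yaml
# Hypothetical file-tree-to-data-model mapping (illustrative keys only)
- match: "ExperimentalData/*/scan_*.dat"   # glob over the file structure
  record_type: Scan                        # semantic type in the RDMS
  properties:
    date: "{parent.name}"                  # derived from the directory name
    raw_file: "{path}"                     # link back to the file system
```

A crawler reading such a declaration could walk the file tree, create or update the corresponding records in the RDMS, and thereby keep the two views synchronous.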
ARTICLE | doi:10.20944/preprints201708.0040.v2
Subject: Engineering, Transportation Science And Technology Keywords: spatial clustering; sweep-circle; Gestalt theory; data stream
Online: 24 August 2017 (10:53:05 CEST)
An adaptive spatial clustering (ASC) algorithm is proposed in this study, which employs sweep-circle techniques and a dynamic threshold setting based on Gestalt theory to detect spatial clusters. The proposed algorithm can automatically discover clusters in one pass, rather than through modification of an initial model (for example, a minimal spanning tree, Delaunay triangulation, or Voronoi diagram). It can quickly identify arbitrarily shaped clusters while adapting efficiently to the non-homogeneous density characteristics of spatial data, without requiring prior knowledge or parameters. The proposed algorithm is also well suited to spatial clustering of dynamic data streams in large data sets.
Subject: Environmental And Earth Sciences, Oceanography Keywords: 3DVAR; data assimilation; cost function; Sylvester equation
Online: 5 December 2019 (10:36:30 CET)
Three-dimensional variational data assimilation, or analysis (3DVAR), is one of the most classical methods for providing initial values for numerical models. In this method, the background error covariance and observational error covariance matrices have large dimensions, so it is difficult to invert them or to reduce their order without information loss. Using the Sylvester equation, and on the basis of a new linear regression, a new cost function for 3DVAR is given. For an m×n first-guess field, the cost function yields an approximate 1 − (m² + n²)/(mn × mn) reduction for m > 1 and n > 1. The results of the numerical experiments show that this algorithm performs no worse than the old 3DVAR cost function.
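For reference, the standard 3DVAR cost function that the proposed variant modifies has the well-known form (here x_b is the background or first-guess field, y the observations, H the observation operator, and B and R the background and observational error covariance matrices whose inversion the abstract discusses):

```latex
J(\mathbf{x}) = \tfrac{1}{2}\,(\mathbf{x}-\mathbf{x}_b)^{\mathsf T}\mathbf{B}^{-1}(\mathbf{x}-\mathbf{x}_b)
              + \tfrac{1}{2}\,(H\mathbf{x}-\mathbf{y})^{\mathsf T}\mathbf{R}^{-1}(H\mathbf{x}-\mathbf{y})
```

The analysis is the x minimizing J, which is why the size and invertibility of B and R dominate the cost of the method.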
ARTICLE | doi:10.20944/preprints202105.0390.v1
Subject: Computer Science And Mathematics, Algebra And Number Theory Keywords: Multilayer perceptron neural network; regression model; backpropagation; missing data; imputation method
Online: 17 May 2021 (14:35:18 CEST)
Missing observations constitute one of the most important issues in data analysis in applied research studies. Their magnitude and structure impact parameter estimation in modeling, with important consequences for decision-making. This study aims to evaluate the efficiency of imputation methods combined with the backpropagation algorithm in a nonlinear regression context. The evaluation is conducted through a simulation study including sample sizes (50, 100, 200, 300 and 400) with different missing data rates (10, 20, 30, 40 and 50%) and three missingness mechanisms (MCAR, MAR and MNAR). Four imputation methods (Last Observation Carried Forward, Random Forest, Amelia and MICE) were used to impute the datasets before making predictions with backpropagation. A 3-MLP model was used, varying the activation functions (Logistic-Linear, Logistic-Exponential, TanH-Linear and TanH-Exponential), the number of nodes in the hidden layer (3 to 15) and the learning rate (20 to 70%). Analysis of the network's performance criteria (R², r and RMSE) revealed good performance when it is trained with TanH-Linear functions, 11 nodes in the hidden layer and a learning rate of 50%. MICE and Random Forest were the most appropriate methods for data imputation. These methods can support a missing rate of up to 50% with an optimal sample size of 200.
ARTICLE | doi:10.20944/preprints202006.0063.v1
Subject: Computer Science And Mathematics, Probability And Statistics Keywords: COVID-19; Real-Time Tracker; Common Symptoms; Data Visualization; Hypothesis Testing; ARIMA Time-Series Forecast; Penalized Logistic Regression
Online: 7 June 2020 (07:44:48 CEST)
While the COVID-19 outbreak was reported to have first originated in Wuhan, China, it was declared a Public Health Emergency of International Concern (PHEIC) on 30 January 2020 by the WHO and had spread to over 180 countries by the time this paper was composed. As the disease spread around the globe, it evolved into a worldwide pandemic, endangering the state of global public health and becoming a serious threat to the global community. To combat and prevent the spread of the disease, all individuals should be well informed of the rapidly changing state of COVID-19. Toward this objective, a COVID-19 real-time analytical tracker has been built to provide the latest status of the disease and relevant analytical insights. The real-time tracker is designed to cater to a general audience without advanced statistical aptitude. It aims to communicate insights through straightforward and concise data visualizations that are supported by sound statistical foundations and reliable data sources. This paper discusses the major methodologies used to generate the insights displayed on the real-time tracker, including real-time data retrieval, normalization techniques, ARIMA time-series forecasting, and logistic regression models. In addition to introducing the details and motivations of these methodologies, the paper presents some key COVID-19 findings derived with them.
ARTICLE | doi:10.20944/preprints202303.0031.v1
Subject: Computer Science And Mathematics, Data Structures, Algorithms And Complexity Keywords: Data instances, Real time systems, k-means algorithm, Agglomerative hierarchical algorithm, Similarity measure, merge function
Online: 2 March 2023 (04:15:10 CET)
Anomaly detection in real-time data is accepted as a vital research area, and clustering has been applied effectively for this purpose. As the datasets are real-time, the time at which the data are generated is also important. In this article, we introduce a mixture of partitioning and agglomerative hierarchical approaches to detect anomalies in such datasets. It is a two-phase method that follows a partitioning approach first and an agglomerative hierarchical approach second. The dataset can have mixed attributes; a unified metric defined on mixed attributes is used in phase 1 and again for merging similar clusters in phase 2. We also keep track of the time attribute of each data instance, so phase 1 produces clusters together with their lifetimes. In phase 2, when similar clusters with overlapping cores are merged, their lifetimes are superimposed, producing fuzzy time intervals; each merged cluster thus has an associated fuzzy lifetime. Data instances that belong to sparse clusters, or to no cluster at all, can be treated as anomalies. The efficacy of the algorithms can be established both by complexity analysis and by experimental studies.
ARTICLE | doi:10.20944/preprints202307.0925.v1
Subject: Environmental And Earth Sciences, Other Keywords: vector watermarking; vector copyright protection; vector geographic data; copyright protection; digital watermarking; zero-watermarking
Online: 13 July 2023 (10:53:23 CEST)
Vector geographic data play an important role in the natural resources and environment sector and in other location information services. They are also among the costlier data types to create, owing to the difficulty of surveying, collection, and authorization. The rapid development of the Internet has created many advantages in the distribution, exploitation, and use of vector geographic data, but it also gives rise to many problems such as duplication, redistribution, forgery, and illegal data use. Data theft on the Internet is becoming more and more sophisticated, and the number of violations is increasing, showing the urgent need to research and develop an effective solution to protect the copyright of vector geographic data and prevent them from being illegally collected and used. Among the major studies and solutions, digital watermarking emerges as an effective method and an active research area for copyright protection. Toward a good solution for copyright protection of vector geographic data, our study proposes a new algorithm with three main contributions: (1) generating short, pseudo-random, meaningful watermarks to increase robustness and to enable both automated and visual manual verification; (2) building a uniformly distributed mapping between the vertex coordinates and the watermark bit indexes to increase the robustness of the watermarks; and (3) integrating two types of watermarks, namely spatial-domain watermarking and zero-watermarking, to resist the most common attacks on geographic vector data. The algorithm also works on all types of vector geographic data, including points, polylines, and polygons.
ARTICLE | doi:10.20944/preprints202306.1589.v1
Subject: Engineering, Bioengineering Keywords: data management; cloud computing; RESTful API; eye-tracking; web portal
Online: 22 June 2023 (10:28:01 CEST)
The rapid development of technology has led to the implementation of data-driven systems whose performance heavily relies on the amount and type of the data itself. In recent decades, in the field of bioengineering data management, among others, eye-tracking data have become one of the most interesting and essential components of many medical, psychological, and engineering research applications. However, despite the wide use of eye-tracking data in many studies and applications, a significant gap remains in the literature regarding real-time data collection and management, which imposes strong constraints on the reliability and accuracy of on-time results. To address this gap, this study introduces a system that enables the collection, processing, real-time streaming, and storage of eye-tracking data. The system is developed using the Java programming language, the WebSocket protocol, and Representational State Transfer (REST), improving the efficiency of transferring and managing eye-tracking data. Results were computed in two test conditions, i.e., local and online scenarios, within a time window of 100 seconds, by comparing the time delay between the two scenarios. Although preliminary, the results showed significantly improved performance in managing real-time data transfer. Overall, this system can benefit the research community by providing real-time data transfer and storage, enabling more extensive studies using eye-tracking data.
ARTICLE | doi:10.20944/preprints201806.0367.v1
Subject: Computer Science And Mathematics, Hardware And Architecture Keywords: kinect; depth calibration; RGB-D; media art; skeletal joint data
Online: 24 June 2018 (11:19:41 CEST)
Kinect is a device that has been widely used in many areas since its release in 2010. The Kinect SDK was announced in 2011 and has been used well beyond its original purpose as a game controller. In particular, it has been adopted by a number of artists in digital media art because it is inexpensive and has a fast recognition rate. However, there is a problem: Kinect creates 3D coordinates from a single 2D RGB image for the x and y values and a single depth image for the z value. Because a Cartesian XY coordinate system and a spherical Z coordinate system are used in combination, a distance-dependent depth error is generated, which makes real-time rotation recognition and coordinate correction difficult and significantly limits installations for interactive media art. This paper proposes a real-time calibration method that expands the Kinect recognition range for practical use in digital media art. The proposed method can recognize the viewer accurately by calibrating coordinates in any direction in front of the viewer. 3,400 datasets acquired from the experiment were measured in five stances, taken and recorded every 0.5 s: the 1 m attention stance, 1 m hands-up stance, 2 m attention stance, 2 m hands-up stance, and 2 m hands-half-up stance. The experimental results showed that the accuracy rate improved by about 11.5% compared with front measurement data obtained with the reference Kinect installation method.
ARTICLE | doi:10.20944/preprints202005.0101.v1
Subject: Computer Science And Mathematics, Information Systems Keywords: chronic dialysis; administrative data; hospital discharge records; ambulatory specialty visits; case definition; algorithm
Online: 6 May 2020 (15:26:06 CEST)
Background: Administrative healthcare databases are widespread and are often standardized with regard to their content and data coding, thus they can also be used as data sources for surveillance and epidemiological research. Chronic dialysis requires patients to frequently access hospital and clinic services, causing a heavy burden to healthcare providers. This also means that these patients are routinely tracked in administrative databases, yet very few case definitions for their identification are currently available. The aim of this study was to develop two algorithms derived from administrative data for identifying incident chronic dialysis patients and to test their validity against the reference standard of the regional dialysis registry. Methods: The algorithms are based on data retrieved from hospital discharge records (HDR) and ambulatory specialty visits (ASV) to identify incident chronic dialysis patients in an Italian region. Subjects are included if they have at least one event in the HDR or ASV databases with an ICD-9-CM dialysis-related diagnosis or procedure code in the study period. Exclusion criteria comprise non-residents, prevalent cases, and patients undergoing temporary dialysis, and are evaluated only on ASV data by the first algorithm and on both ASV and HDR data by the second. We validated the algorithms against the Emilia-Romagna regional dialysis registry by searching for incident patients in 2014. Results: Algorithm 1 identified 680 patients and Algorithm 2 identified 676 patients initiating dialysis in 2014, compared to 625 patients included in the regional dialysis registry. Sensitivity for the two algorithms was 90.8% and 88.4% respectively, positive predictive value was 84.0% and 82.0%, and percentage agreement was 77.4% and 74.1%. Conclusions: These results suggest that administrative data have high sensitivity and positive predictive value for the identification of incident chronic dialysis patients.
Algorithm 1, which showed higher accuracy and has a simpler case definition, can be used in place of regional dialysis registries where they are absent or insufficiently developed, or to improve the accuracy and timeliness of existing registries.
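The inclusion/exclusion logic of such a case definition can be sketched as follows. The ICD-9-CM code set and the record fields are illustrative assumptions for the sketch, not the exact criteria of the two algorithms in the paper.

```python
# Hedged sketch of an administrative-data case definition: include subjects
# with at least one dialysis-related ICD-9-CM code in HDR or ASV records,
# then drop non-residents, prevalent cases, and temporary dialysis.
# Codes and record fields are illustrative assumptions.
DIALYSIS_CODES = {"39.95", "54.98", "585.6", "V45.11"}  # assumed code set

def incident_chronic_dialysis(subjects):
    """subjects: list of dicts with keys 'id', 'resident', 'prior_dialysis',
    'temporary', and 'codes' (set of ICD-9-CM codes from HDR/ASV)."""
    cases = []
    for s in subjects:
        if not s["codes"] & DIALYSIS_CODES:
            continue                      # no qualifying HDR/ASV event
        if not s["resident"] or s["prior_dialysis"] or s["temporary"]:
            continue                      # exclusion criteria
        cases.append(s["id"])
    return cases
```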
ARTICLE | doi:10.20944/preprints202303.0062.v1
Subject: Computer Science And Mathematics, Applied Mathematics Keywords: Underground space, information detection, fractional differentiation, high accuracy remote data
Online: 3 March 2023 (08:37:27 CET)
The quality of underground space information has become a major problem endangering the safety of underground spaces. Currently, the main methods for high-precision, long-distance transmission of detection information are radar and optical methods. In practical applications, however, we found that the radar method suffers from large energy loss and poor anti-jamming ability, which limit the accuracy and distance of information transmission. The optical method is strongly affected by the weather and can only be applied to static objects above ground, so it is limited in its application objects and operating environment. More importantly, current high-precision remote information detection methods are limited to overground objects and are not applicable to the detection of the various information data in underground space. In this study, we analyze the spectral properties of the fractional differential operator and find that it is suitable for studying non-linear, non-causal, and non-stationary signals. Applying the theory of fractional calculus to data processing, we establish a mathematical model for the remote transmission and high-precision detection of information based on fractional differentiation, realizing high-precision, long-distance information detection. By fusing this long-distance, high-accuracy detection model with stratum data processing, a mathematical model providing long-distance, high-accuracy data was established. Its effectiveness for underground space information detection was verified through application in engineering practice.
ARTICLE | doi:10.20944/preprints202307.2154.v1
Subject: Engineering, Telecommunications Keywords: Mine Internet of Things (MIoT); post-disaster reconstruction; opportunistic routing (OR); data transmission; energy efficient; routing void
Online: 2 August 2023 (04:44:01 CEST)
The Mine Internet of Things (MIoT), as a key technology for reconstructing post-disaster communication networks, enables safety monitoring and control of the affected roadways. However, due to the challenging underground mine environment, the MIoT suffers from severe signal attenuation, vulnerable nodes, and limited energy, which result in low network reliability of the post-disaster MIoT. To improve transmission reliability and reduce energy consumption, a directional-area-forwarding-based energy-efficient opportunistic routing (DEOR) scheme for the post-disaster MIoT is proposed. DEOR defines a forwarding zone (FZ) for each node to route packets toward the sink. The candidate forwarding set (CFS) is constructed from the nodes within the FZ that satisfy the energy constraint and the neighboring node degree constraint. The nodes in the CFS are prioritized based on a routing quality evaluation that takes the local attributes of nodes into consideration, such as the directional angle, transmission distance, and residual energy. DEOR adopts a recovery mechanism to address the issue of void nodes. The simulation results validate that the proposed DEOR outperforms ORR, OBRN and ECSOR in terms of energy consumption, average hop count, packet delivery rate, and network lifetime.
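A prioritization rule of the kind described can be sketched as a weighted score over the local attributes. The linear form, the weights, and the radio range below are assumptions for illustration, not DEOR's actual routing quality metric.

```python
import math

# Illustrative scoring of candidate forwarders in a DEOR-style scheme:
# candidates are ranked by a weighted mix of directional angle toward the
# sink, transmission distance (progress), and residual energy. Weights and
# the linear combination are assumed, not taken from the paper.
def candidate_priority(node, sender, sink, w_angle=0.4, w_dist=0.3, w_energy=0.3):
    """node/sender/sink: dicts with 'x', 'y'; node also has 'energy' in [0,1].
    Higher score = higher forwarding priority."""
    to_sink = (sink["x"] - sender["x"], sink["y"] - sender["y"])
    to_node = (node["x"] - sender["x"], node["y"] - sender["y"])
    dot = to_sink[0] * to_node[0] + to_sink[1] * to_node[1]
    norm = math.hypot(*to_sink) * math.hypot(*to_node)
    angle_score = dot / norm if norm else 0.0        # cos of directional angle
    max_range = 100.0                                # assumed radio range
    dist_score = min(math.hypot(*to_node) / max_range, 1.0)  # favor progress
    return w_angle * angle_score + w_dist * dist_score + w_energy * node["energy"]
```

Sorting the candidate forwarding set by this score yields the forwarding priority order; a node pointing straight at the sink with full energy outranks a perpendicular, depleted one.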
ARTICLE | doi:10.20944/preprints202306.0570.v1
Subject: Engineering, Other Keywords: optical fiber data communication system; EML; PAM4; Volterra; DFE
Online: 8 June 2023 (03:00:45 CEST)
A novel simplified Volterra-structure algorithm combined with decision feedback equalization (DFE) is proposed for intensity-modulation direct-detection (IM-DD) short-distance optical fiber communication systems. Based on this algorithm, the damage to four-level pulse amplitude modulation (PAM-4) signals caused by device bandwidth limitations and dispersion during transmission is compensated. Experiments carried out with a 25 GHz electro-absorption modulated laser (EML) show that PAM-4 signals can be transmitted over 10 km of standard single-mode fiber (SSMF). The 112 Gbps and 128 Gbps signals reach the error-rate thresholds of KP4-FEC (BER = 2×10⁻⁴) and HD-FEC (BER = 3.8×10⁻³), respectively. The simplified principle and process of the proposed Volterra-based equalization algorithm are presented. Experimental results show that the algorithm complexity is reduced by 75%, providing effective theoretical support for the commercial application of this algorithm.
ARTICLE | doi:10.20944/preprints202201.0229.v1
Subject: Public Health And Healthcare, Public Health And Health Services Keywords: FAIR principles; Multimorbidity; Mortality; Research data management; Pathfinder case study; Privacy-Preserving Distributed Data Mining.
Online: 17 January 2022 (13:04:03 CET)
The current availability of electronic health records represents an excellent research opportunity on multimorbidity, one of the most relevant public health problems nowadays. However, it also poses a methodological challenge due to the current lack of tools to access, harmonize, and reuse research datasets. In FAIR4Health, a European Horizon 2020 project, a workflow to implement the FAIR (Findability, Accessibility, Interoperability, and Reusability) principles on health datasets was developed, along with two tools aimed at facilitating the transformation of raw datasets into FAIR ones and the preservation of data privacy. As part of this project, we conducted a multicentric retrospective observational study to apply the aforementioned FAIR implementation workflow and tools to five European health datasets for research on multimorbidity. We applied a federated frequent pattern growth association algorithm to identify the most frequent combinations of chronic diseases and their association with mortality risk. We identified several clinically plausible multimorbidity patterns consistent with the literature, some of which were strongly associated with mortality. Our results show the usefulness of the solution developed in FAIR4Health to overcome difficulties in data management and highlight the importance of implementing a FAIR data policy to accelerate responsible health research.
ARTICLE | doi:10.20944/preprints201611.0033.v1
Subject: Computer Science And Mathematics, Information Systems Keywords: genetic algorithms; parallel computation; computational complexity; algorithms; optimization techniques; traveling salesman problem; NP-Hard problems; Berlin-52 data set; machine learning; linear regression
Online: 7 November 2016 (04:57:46 CET)
This paper examines the correlation between the number of computer cores and the performance of parallel genetic algorithms. The objective is to determine a complementary linear polynomial equation that represents the relation between the number of parallel processes and the optimum solutions. This relation is modeled as an optimization function f(x) able to produce many simulation results, and f(x) outperforms the genetic algorithm itself. A comparison between the genetic algorithm and the optimization function is carried out, and the optimization function also provides a model to speed up the genetic algorithm. The optimization function is a complementary transformation that maps a given TSP instance to a linear form without changing the roots of the polynomials.
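For context on the baseline being sped up, a minimal genetic algorithm for a small TSP instance can be sketched as below. The operators (order crossover, swap mutation, truncation selection) and parameters are generic textbook choices, not the configuration used in the paper.

```python
import random

# Minimal genetic algorithm for TSP: order crossover, swap mutation,
# truncation selection. Generic textbook operators, assumed for the sketch.
def tour_length(tour, dist):
    return sum(dist[tour[i]][tour[(i + 1) % len(tour)]] for i in range(len(tour)))

def crossover(p1, p2):
    """Order crossover (OX): copy a slice of p1, fill the rest from p2."""
    a, b = sorted(random.sample(range(len(p1)), 2))
    hole = set(p1[a:b])
    filler = [c for c in p2 if c not in hole]
    return filler[:a] + p1[a:b] + filler[a:]

def ga_tsp(dist, pop_size=40, generations=200, mut_rate=0.2):
    n = len(dist)
    pop = [random.sample(range(n), n) for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda t: tour_length(t, dist))
        survivors = pop[:pop_size // 2]           # keep the better half
        children = []
        while len(survivors) + len(children) < pop_size:
            child = crossover(*random.sample(survivors, 2))
            if random.random() < mut_rate:        # swap mutation
                i, j = random.sample(range(n), 2)
                child[i], child[j] = child[j], child[i]
            children.append(child)
        pop = survivors + children
    return min(pop, key=lambda t: tour_length(t, dist))
```

On a toy instance of four cities at the corners of a unit square, this reliably finds the perimeter tour of length 4.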
ARTICLE | doi:10.20944/preprints201707.0089.v1
Subject: Engineering, Chemical Engineering Keywords: air contaminant dispersion; data assimilation; particle filter; expectation-maximization algorithm; UAV
Online: 31 July 2017 (11:02:27 CEST)
The precise prediction of air contaminant dispersion is essential to air quality monitoring and to the emergency management of contaminant gas leakage incidents in chemical industry parks. Conventional atmospheric dispersion models can seldom give precise predictions due to inaccurate input parameters. To improve the prediction accuracy of the dispersion model, two data assimilation methods (one based on the standard particle filter alone, the other combining the particle filter with the expectation-maximization algorithm) are proposed to assimilate UAV observations into the atmospheric dispersion model. Two emission cases, differing in the dimension of the state variables, are considered. To test the performance of the proposed methods, experiments corresponding to the two emission cases were designed and implemented. The results show that the particle filter can effectively estimate the model parameters and improve the accuracy of model prediction when the dimension of the state variables is low. In contrast, when the dimension of the state variables becomes higher, the combination of the particle filter with the expectation-maximization algorithm performs better in parameter estimation accuracy and warm-up time. The data assimilation methods are therefore able to effectively support air quality monitoring and emergency management in chemical industry parks.
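The particle-filter half of the approach can be illustrated with a minimal bootstrap filter that estimates a single emission-rate-like parameter from noisy observations. The linear "dispersion model", the prior range, and the noise levels are placeholders for the sketch; neither the paper's dispersion model nor its EM extension is reproduced here.

```python
import random, math

# Minimal bootstrap particle filter estimating one model parameter from
# noisy observations: weight by likelihood, resample, jitter. The forward
# model and all noise parameters are illustrative assumptions.
def particle_filter(observations, model, n_particles=500, obs_std=0.5):
    particles = [random.uniform(0.0, 10.0) for _ in range(n_particles)]
    for y in observations:
        # weight particles by the Gaussian observation likelihood
        weights = [math.exp(-((y - model(q)) ** 2) / (2 * obs_std ** 2))
                   for q in particles]
        total = sum(weights) or 1e-300
        weights = [w / total for w in weights]
        # resample in proportion to the weights
        particles = random.choices(particles, weights=weights, k=n_particles)
        # small jitter to avoid sample degeneracy
        particles = [q + random.gauss(0, 0.1) for q in particles]
    return sum(particles) / n_particles   # posterior mean of the parameter
```

With a forward model y = 2q and observations generated at q = 3, the posterior mean converges near 3 after a few assimilation steps.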
ARTICLE | doi:10.20944/preprints202307.0452.v2
Subject: Engineering, Aerospace Engineering Keywords: Reduced order models; Higher order singular value decomposition; Health monitoring; Aeroengines; Predictive maintenance; Degradation parameters; Sensors scaling; Turbine inlet temperature; Gradient-like methods; Noisy data
Online: 11 July 2023 (08:22:53 CEST)
A reduced order model is developed to monitor aeroengine condition in real time (defining degradation from a baseline state) using data collected by specific sensors. The reduced model is constructed by applying higher order singular value decomposition plus interpolation to appropriate data organized in tensor form. These data are obtained using a detailed engine model that takes the engine physics into account. Thus, the method synergically combines the advantages of data-driven (fast online operation) and model-based (the engine physics is accounted for) condition monitoring methods. Using this reduced order model as a surrogate of the engine model, two gradient-like condition monitoring tools are constructed. The first tool is extremely fast and able to precisely compute on the fly the turbine inlet temperature, which is a paramount parameter for engine performance, operation, and maintenance, and can only be roughly estimated by the engine instrumentation in civil aviation. The second tool is not as fast (but still reasonably inexpensive) and precisely computes both the engine degradation and the turbine inlet temperature at which the sensor data were acquired. These tools are robust against random noise added to the sensor data and can be straightforwardly applied to other mechanical systems.
ARTICLE | doi:10.20944/preprints202306.0490.v1
Subject: Computer Science And Mathematics, Computer Science Keywords: Data flow testing; higher-order mutation testing; “ProbSubsumes”; “ProbBetter”
Online: 7 June 2023 (08:22:34 CEST)
Data-flow and higher-order mutation are white-box testing techniques. To our knowledge, no work has been proposed to compare data flow and higher-order mutation. This paper compares the all def-uses data-flow and second-order mutation criteria. This comparison investigates the subsumption relation between the two criteria and evaluates the effectiveness of test data developed for each. To compare the two criteria, a set of test data satisfying each criterion is generated and used to explore whether one criterion subsumes the other and to assess the effectiveness of the test set developed for one methodology in terms of the other. The results showed that the mean mutation coverage ratio of the all du-pairs adequate test cover is 80.9%, and the mean data flow coverage ratio of the 2nd-order mutant adequate test cover is 98.7%. Consequently, 2nd-order mutation “ProbSubsumes” the all du-pairs data flow criterion. The failure detection efficiency of mutation (98%) is significantly better than that of data flow (86%). Consequently, 2nd-order mutation testing is “ProbBetter” than all du-pairs data flow testing. In contrast, the test suite for 2nd-order mutation is larger than that for all du-pairs.
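The cross-coverage measurement behind such a "ProbSubsumes" verdict can be sketched abstractly: run each criterion-adequate suite against the other criterion's requirements and compare the mean coverage ratios. Modeling requirements as sets and the 0.95 threshold are assumptions of this sketch, not definitions from the paper.

```python
# Abstract sketch of cross-coverage measurement: requirements (du-pairs or
# mutants) are opaque items, and covers(test, req) says whether a test
# exercises a requirement. Threshold is an illustrative assumption.
def coverage_ratio(test_suite, requirements, covers):
    """Fraction of requirements hit by at least one test in the suite."""
    hit = {r for r in requirements for t in test_suite if covers(t, r)}
    return len(hit) / len(requirements) if requirements else 1.0

def prob_subsumes(ratio_a_on_b, ratio_b_on_a, threshold=0.95):
    """Criterion A 'ProbSubsumes' B if A-adequate tests nearly cover B
    while B-adequate tests do not nearly cover A."""
    return ratio_a_on_b >= threshold and ratio_b_on_a < threshold
```

Plugging in the abstract's ratios (98.7% vs. 80.9%) yields the reported one-way subsumption.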
ARTICLE | doi:10.20944/preprints202003.0298.v1
Subject: Computer Science And Mathematics, Information Systems Keywords: Data Mining; Breast Cancer; Hybrid Feature Selection; Machine learning; Support Vector Machine; Optimize Genetic Algorithm; boosting algorithms
Online: 19 March 2020 (11:13:15 CET)
Breast cancer is a significant health issue across the world. It is the most widely diagnosed cancer in women, and early-stage diagnosis and therapy increase patient safety. This paper proposes a combined hybrid feature selection model based on a boosted optimized genetic algorithm (CHFS-BOGA) to forecast breast cancer. This hybrid feature selection approach combines the advantages of three filter feature selection approaches with an optimized genetic algorithm (OGA) to select the best features, improving the performance and scalability of the classification process. We propose the OGA by improving the initial population generation and the genetic operators, using the results of the filter approaches as prior information and using the C4.5 decision tree classifier as the fitness function instead of probability and random selection. The authors collected the updated Wisconsin dataset from the UCI machine learning repository, with a total of 569 rows and 32 columns. The dataset was evaluated using the Explorer interface of the Weka open-source data mining software. The results show that the proposed hybrid feature selection approach significantly outperforms the single filter approaches and principal component analysis (PCA) for optimum feature selection, and the selected characteristics are good indicators for prediction. The highest accuracy achieved before (CHFS-BOGA) using support vector machine (SVM) classifiers was 97.3%. The highest accuracy after (CHFS-BOGA-SVM) was 98.25% on a 70.0% train / remainder test split, and 100% on the full training set. Moreover, the area under the receiver operating characteristic (ROC) curve was equal to 1.0. These results show that the proposed CHFS-BOGA-SVM system was able to accurately classify breast tumors as malignant or benign.
ARTICLE | doi:10.20944/preprints202306.0192.v1
Subject: Computer Science And Mathematics, Computer Vision And Graphics Keywords: trademarks; data protection; artificial intelligence; image processing; trademark retrieval
Online: 2 June 2023 (11:37:01 CEST)
CNN-based off-the-shelf features have shown themselves to be a good baseline for trademark retrieval. However, in recent years the computer vision field has been transitioning from CNNs to a new architecture, the Vision Transformer. In this paper, we investigate the performance of off-the-shelf features extracted with vision transformers and explore the effects of pre-processing, post-processing, and pre-training on big datasets. We propose a method for the joint use of global and local features, which leverages the best aspects of both approaches. Experimental results on the METU Trademark Dataset show that off-the-shelf features extracted with ViT-based models outperform off-the-shelf features from CNN-based models. The proposed method achieves an mAP of 31.23, surpassing previous state-of-the-art results. We assume that the proposed approach to trademark similarity evaluation will improve the protection of such data with the help of artificial intelligence methods. Moreover, this approach will allow one to identify cases of unfair use of such data and form an evidence base for litigation.
ARTICLE | doi:10.20944/preprints201610.0067.v1
Subject: Computer Science And Mathematics, Applied Mathematics Keywords: point information gain; Rényi entropy; data processing
Online: 17 October 2016 (11:35:13 CEST)
We generalize the point information gain (PIG) and derived quantities, i.e., the point information gain entropy (PIE) and point information gain entropy density (PIED), to the case of the Rényi entropy, and simulate the behavior of PIG for typical distributions. We also use these methods for the analysis of multidimensional datasets. We demonstrate the main properties of PIE/PIED spectra for real data using several example images, and discuss further possible applications in other fields of data processing.
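A minimal sketch of the underlying quantities: the Rényi entropy of a discrete histogram, and a point information gain computed as the entropy change when one occurrence of a value is removed. The discretization and the sign convention are assumptions of this sketch, not the paper's exact definitions.

```python
import math

# Rényi entropy of a histogram of counts, and a PIG-style quantity: the
# change in entropy when one occurrence of `value` is removed. Sign
# convention and discretization are illustrative assumptions.
def renyi_entropy(counts, alpha):
    n = sum(counts.values())
    if alpha == 1.0:     # Shannon limit of the Rényi family
        return -sum((c / n) * math.log(c / n) for c in counts.values())
    s = sum((c / n) ** alpha for c in counts.values())
    return math.log(s) / (1.0 - alpha)

def point_information_gain(counts, value, alpha=2.0):
    """Entropy of the full histogram minus entropy without one `value`."""
    without = dict(counts)
    without[value] -= 1
    if without[value] == 0:
        del without[value]
    return renyi_entropy(counts, alpha) - renyi_entropy(without, alpha)
```

For a balanced two-bin histogram the collision entropy (alpha = 2) equals ln 2, and removing a point from either bin lowers the entropy, giving a positive gain.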
Subject: Computer Science And Mathematics, Information Systems Keywords: microaggregation; k-anonymity; privacy; data utility
Online: 23 July 2019 (11:42:34 CEST)
With a data revolution underway for some time, there is increasing demand for formal privacy protection mechanisms that are not overly destructive. In this context, microaggregation is a popular high-utility approach designed to satisfy the well-known k-anonymity criterion while applying low distortion to the data. However, standard performance metrics are commonly based on mean squared error, which hardly captures the utility degradation related to a specific application domain of the data. In this work, we evaluate the performance of k-anonymous microaggregation in terms of the loss in classification accuracy of machine-learned models built from perturbed data. Systematic experimentation is carried out on four microaggregation algorithms tested over four datasets. The empirical utility of the resulting microaggregated data is assessed using the learning algorithm that obtains the highest accuracy on the original data. Validation tests are performed on a test set of non-perturbed data. The results confirm k-anonymous microaggregation as a high-utility privacy mechanism in this context, and distortion based on mean squared error as a poor predictor of practical utility. Finally, we corroborate the beneficial effects for empirical utility of exploiting the statistical properties of data when constructing privacy-preserving algorithms.
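Microaggregation itself can be illustrated with a toy fixed-size heuristic: repeatedly group the record furthest from the centroid with its k-1 nearest neighbors and replace each group by its mean, so every published record is shared by at least k subjects. This simplified MDAV-style sketch is for illustration and is not one of the four algorithms evaluated in the paper.

```python
# Toy fixed-size microaggregation on numeric records (MDAV-style heuristic).
# Output rows are group centroids, each repeated for every group member, so
# each distinct row occurs at least k times (k-anonymity on these columns).
# Note: output order follows group formation, not the input order.
def microaggregate(records, k):
    def centroid(rows):
        return [sum(c) / len(rows) for c in zip(*rows)]
    def dist2(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    remaining = list(records)
    out = []
    while len(remaining) >= 2 * k:
        c = centroid(remaining)
        far = max(remaining, key=lambda r: dist2(r, c))   # extreme record
        group = sorted(remaining, key=lambda r: dist2(r, far))[:k]
        g = centroid(group)
        out.extend([g] * len(group))
        for r in group:
            remaining.remove(r)
    if remaining:                                          # final group
        g = centroid(remaining)
        out.extend([g] * len(remaining))
    return out
```

The distortion this introduces is exactly what mean-squared-error metrics measure, and what the classification-accuracy evaluation above measures instead.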
ARTICLE | doi:10.20944/preprints202308.1237.v1
Subject: Engineering, Transportation Science And Technology Keywords: data mining; data extraction; data science; cost infrastructure projects
Online: 17 August 2023 (09:25:22 CEST)
Context: Despite the effort put into developing standards for structuring construction costs and the strong interest in the field, most construction companies still perform data gathering and processing manually. This provokes inconsistencies, differing classification criteria, and misclassifications, and makes the process very time-consuming, particularly on big projects. Additionally, the lack of standardization makes cost estimation and comparison tasks very difficult. Objective: To create a method to extract and organize construction cost and quantity data into a consistent format and structure, enabling rapid and reliable digital comparison of the content. Method: The approach consists of two steps: first, the system applies data mining to review the input document and determine how it is structured, based on the position, format, sequence, and content of descriptive and quantitative data; second, the extracted data is processed and classified with a combination of data science and expert knowledge to fit a common format. Results: A large variety of information from real historical projects has been successfully extracted and processed into a common format with 97.5% accuracy, using a subset of 5,770 assets located in 18 different files, building a solid base for analysis and comparison. Conclusion: A robust and accurate method was developed for extracting hierarchical project cost data into a common machine-readable format to enable rapid and reliable comparison and benchmarking.
ARTICLE | doi:10.20944/preprints201610.0012.v1
Subject: Biology And Life Sciences, Biochemistry And Molecular Biology Keywords: data exchange; resource donations; text mining
Online: 5 October 2016 (15:08:32 CEST)
Bio-molecular reagents like antibodies required in experimental biology are expensive, and their effectiveness, among other things, is critical to the success of an experiment. Although such resources are sometimes donated by one investigator to another through personal communication, there is, to our knowledge, no previous study on the extent of such donations, nor a central platform that directs resource seekers to donors. In this paper, we describe, to our knowledge, a first attempt at building a web portal, titled Bio-Resource Exchange, that attempts to bridge this gap between resource seekers and donors in the domain of experimental biology. Users on this portal can request or donate antibodies, cell lines, and DNA constructs. This resource could also serve as a crowd-sourced database of resources for experimental biology. Further, in order to index donations outside of our portal, we mined scientific articles to find instances of antibody donations and attempted to extract information about these donations at the finest granularity. Specifically, we extracted the name of the donor, his or her affiliation, and the name of the antibody for every donation by parsing the acknowledgements sections of articles. To extract annotations at this level, we propose two approaches: a rule-based algorithm and a bootstrapped relation learning algorithm. The algorithms extracted donor names, affiliations, and antibody names with average accuracies of 57% and 62%, respectively. We also created a dataset of 50 expert-annotated acknowledgements sections that will serve as a gold-standard dataset to evaluate extraction algorithms in the future. Contact: email@example.com, firstname.lastname@example.org Database URL: http://tonks.dbmi.pitt.edu/brx Supplementary information: Supplementary data are available at Database online.
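The rule-based route can be sketched with a single hand-written pattern that pulls (antibody, donor) pairs out of an acknowledgements sentence. This one pattern is an illustrative assumption and is far simpler than the paper's rule set, which also extracts affiliations.

```python
import re

# One illustrative rule: "<X> antibody/antibodies (was/were) (kindly)
# provided/donated/gifted by <Capitalized Name>". A real rule set would
# need many such patterns plus affiliation extraction.
DONATION_RE = re.compile(
    r"(?P<antibody>[\w-]+\s+antib(?:ody|odies))\s+(?:was|were)?\s*"
    r"(?:kindly\s+)?(?:provided|donated|gifted)\s+by\s+"
    r"(?P<donor>(?:Dr\.|Prof\.)?\s*[A-Z][\w.]*(?:\s+[A-Z][\w.]*)*)",
)

def extract_donations(text):
    """Return (antibody, donor) pairs found in an acknowledgements text."""
    return [(m.group("antibody"), m.group("donor").strip())
            for m in DONATION_RE.finditer(text)]
```

The capitalized-token run ends the donor name at the first lowercase word, which roughly bounds names before phrases like "of the University of …".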
ARTICLE | doi:10.20944/preprints201612.0077.v1
Subject: Computer Science And Mathematics, Data Structures, Algorithms And Complexity Keywords: rule based models; gene expression data; bayesian networks; parsimony
Online: 15 December 2016 (08:21:24 CET)
The comprehensibility of good predictive models learned from high-dimensional gene expression data is attractive because it can lead to biomarker discovery. Several good classifiers provide comparable predictive performance but differ in their abilities to summarize the observed data. We extend a Bayesian Rule Learning (BRL-GSS) algorithm, previously shown to be a significantly better predictor than other classical approaches in this domain. It searches a space of Bayesian networks using a decision tree representation of its parameters with global constraints, and infers a set of IF-THEN rules. The number of parameters, and therefore the number of rules, is combinatorial in the number of predictor variables in the model. We relax these global constraints to a more generalizable local structure (BRL-LSS). BRL-LSS entails a more parsimonious set of rules because it does not have to generate all combinatorial rules; the search space of local structures is much richer than the space of global structures. We design BRL-LSS with the same worst-case time complexity as BRL-GSS while exploring a richer and more complex model space. We measure predictive performance using the area under the ROC curve (AUC) and accuracy, and model parsimony by the average number of rules and variables needed to describe the observed data. We evaluate the predictive and parsimony performance of BRL-GSS, BRL-LSS, and the state-of-the-art C4.5 decision tree algorithm using 10-fold cross-validation on ten microarray gene-expression diagnostic datasets. In these experiments, we observe that BRL-LSS is similar to BRL-GSS in terms of predictive performance while generating a much more parsimonious set of rules to explain the same observed data. BRL-LSS also needs fewer variables than C4.5 to explain the data with similar predictive performance.
We also conduct a feasibility study to demonstrate the general applicability of our BRL methods on the newer RNA sequencing gene-expression data.
ARTICLE | doi:10.20944/preprints202306.0974.v1
Subject: Engineering, Industrial And Manufacturing Engineering Keywords: Reliability estimation; EM algorithm; Censored data; Weibull distribution; Industrial equipment; Maintenance optimization; Failure analysis; Proactive maintenance
Online: 14 June 2023 (07:50:43 CEST)
Centrifugal pumps are widely employed in the oil refinery industry due to their efficiency and effectiveness in fluid transfer applications. The reliability of pumps plays a pivotal role in ensuring uninterrupted plant productivity and safe operations. Analysis of failure history data shows that bearings are critical components in oil refinery pump groups. However, traditional reliability estimation theories may not apply when data is limited or subject to right censoring. This paper addresses the complexity of estimating the Weibull distribution parameters using the maximum-likelihood method under these conditions. The likelihood equation lacks an explicit analytical solution, necessitating numerical methods for its resolution. The approach presented in this article leverages the Expectation-Maximization (EM) algorithm to estimate the Weibull distribution parameters. This method provides more accurate estimates of failure rates and probabilities by accounting for limited and censored data. The findings are demonstrated through a case study showcasing the practical application of the proposed approach.
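To make the estimation target concrete, the sketch below maximizes the right-censored Weibull likelihood directly rather than via EM: for a fixed shape k the scale has the closed-form profile λ^k = Σ tᵢ^k / d (d = number of observed failures), leaving a one-dimensional search over k. This is a standard textbook route shown for illustration, not the EM procedure proposed in the paper.

```python
import math

# Direct maximum-likelihood estimation of Weibull (shape k, scale lambda)
# from right-censored data: closed-form profile of the scale plus a grid
# search over the shape. Grid bounds are illustrative assumptions.
def weibull_censored_mle(times, censored, shapes=None):
    """times: observed times; censored[i] is True if right-censored."""
    d = sum(1 for c in censored if not c)          # number of failures
    shapes = shapes or [0.1 * i for i in range(5, 51)]  # k in [0.5, 5.0]
    best = None
    for k in shapes:
        lam = (sum(t ** k for t in times) / d) ** (1.0 / k)
        ll = sum(-((t / lam) ** k) for t in times)  # survival terms, all units
        ll += sum(math.log(k) - k * math.log(lam) + (k - 1) * math.log(t)
                  for t, c in zip(times, censored) if not c)  # density terms
        if best is None or ll > best[0]:
            best = (ll, k, lam)
    return best[1], best[2]   # (shape, scale) estimates
```

On simulated data with shape 1.5 and scale 2.0, censored at t = 3, the estimates land close to the true values; EM becomes attractive when the censoring structure is more complex than this.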
Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Artificial intelligence; machine learning; real-time probabilistic data; for cyber risk; super forecasting; red teaming;
Online: 12 April 2021 (12:18:14 CEST)
Multiple governmental agencies and private organisations have made commitments for the colonisation of Mars. Such colonisation requires complex systems and infrastructure that could be very costly to repair or replace in cases of cyber-attacks. This paper surveys deep learning algorithms, IoT cyber security and risk models, and established mathematical formulas to identify the best approach for developing a dynamic and self-adapting system for predictive cyber risk analytics supported by Artificial Intelligence and Machine Learning and real-time intelligence in edge computing. The paper presents a new mathematical approach for integrating concepts of cognition engine design, edge computing, and Artificial Intelligence and Machine Learning to automate anomaly detection. This engine instigates a step change by applying Artificial Intelligence and Machine Learning embedded at the edge of IoT networks, to deliver safe and functional real-time intelligence for predictive cyber risk analytics. This will enhance capacities for risk analytics and assist in the creation of a comprehensive and systematic understanding of the opportunities and threats that arise when edge computing nodes are deployed, and when Artificial Intelligence and Machine Learning technologies are migrated to the periphery of the internet and into local IoT networks.
ARTICLE | doi:10.20944/preprints201805.0120.v1
Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: cost prediction of substation projects; improved least square support vector machine; wolf pack algorithm; data inconsistency rate
Online: 8 May 2018 (05:01:45 CEST)
Accurate and stable cost forecasting of substation projects is of great significance in ensuring the economic construction and sustainable operation of power engineering projects. In this paper, a forecasting model based on the improved least squares support vector machine (ILSSVM) optimized by the wolf pack algorithm (WPA) is proposed to improve the accuracy and stability of cost forecasting for substation projects. Firstly, the optimal features are selected through the data inconsistency rate (DIR), which helps reduce redundant input vectors. Secondly, the wolf pack algorithm is used to optimize the parameters of the improved least squares support vector machine. Lastly, the WPA-DIR-ILSSVM cost forecasting method is established. In this paper, 88 substation projects in different regions from 2015 to 2017 are chosen for training tests to verify the validity of the model. The results indicate that the new hybrid WPA-DIR-ILSSVM model presents better accuracy, robustness, and generality in cost forecasting of substation projects.
REVIEW | doi:10.20944/preprints202103.0216.v1
Subject: Computer Science And Mathematics, Information Systems Keywords: machine learning; deep learning; artificial intelligence; data science; data-driven decision making; predictive analytics; intelligent applications;
Online: 8 March 2021 (12:55:59 CET)
In the current age of the Fourth Industrial Revolution (4IR or Industry 4.0), the digital world has a wealth of data, such as Internet of Things (IoT) data, cybersecurity data, mobile data, business data, social media data, health data, etc. To intelligently analyze these data and develop the corresponding real-world applications, knowledge of artificial intelligence (AI), particularly machine learning (ML), is the key. Various types of machine learning algorithms, such as supervised, unsupervised, semi-supervised, and reinforcement learning, exist in the area. In addition, deep learning, which is part of a broader family of machine learning methods, can intelligently analyze data on a large scale. In this paper, we present a comprehensive view of these machine learning algorithms that can be applied to enhance the intelligence and capabilities of an application. Thus, this study's key contribution is explaining the principles of different machine learning techniques and their applicability in various real-world application areas, such as cybersecurity, smart cities, healthcare, business, agriculture, and many more. We also highlight the challenges and potential research directions based on our study. Overall, this paper aims to serve as a reference point not only for application developers but also for decision-makers and researchers in various real-world application areas, particularly from a technical point of view.
ARTICLE | doi:10.20944/preprints202103.0753.v1
Subject: Computer Science And Mathematics, Algebra And Number Theory Keywords: unsupervised feature selection; histogram-valued data; compactness; hierarchical conceptual clustering; multi-role measure; visualization
Online: 31 March 2021 (07:53:39 CEST)
This paper presents an unsupervised feature selection method for multi-dimensional histogram-valued data. We define a multi-role measure, called the compactness, based on the concept size of given objects and/or clusters described by a fixed number of equal-probability bin-rectangles. In each step of clustering, we agglomerate objects and/or clusters so as to minimize the compactness of the generated cluster. This means that the compactness plays the role of a similarity measure between the objects and/or clusters to be merged. Minimizing the compactness is equivalent to maximizing the dissimilarity of the generated cluster, i.e., concept, against the whole concept in each step. In this sense, the compactness also plays the role of a cluster quality measure. We further show that the average compactness of each feature with respect to objects and/or clusters over several clustering steps is useful as a feature effectiveness criterion. Features having small average compactness are mutually covariate and are able to detect geometrically thin structures embedded in the given multi-dimensional histogram-valued data. We obtain a thorough understanding of the given data by visualization using dendrograms and scatter diagrams with respect to the selected informative features. We illustrate the effectiveness of the proposed method using an artificial data set and real histogram-valued data sets.
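The agglomeration principle above (at each step, merge whichever pair of objects/clusters minimizes a size measure of the resulting cluster) can be sketched in miniature. This is an illustrative analogue only: the 1-D bounding-interval length `span` stands in for the paper's compactness over equal-probability bin-rectangles, and the toy points are hypothetical.

```python
def span(cluster):
    """Bounding-interval length of a set of 1-D points: a crude 'concept size'."""
    return max(cluster) - min(cluster)

def agglomerate(points, n_clusters):
    """Greedily merge the pair of clusters whose union has the smallest span."""
    clusters = [[p] for p in points]
    while len(clusters) > n_clusters:
        best = None  # (merged span, index i, index j)
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                s = span(clusters[i] + clusters[j])
                if best is None or s < best[0]:
                    best = (s, i, j)
        _, i, j = best
        merged = clusters[i] + clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)] + [merged]
    return clusters
```

With points `[0, 0.1, 0.2, 5, 5.1]` and two target clusters, the greedy merges recover the two tight groups, mirroring how minimizing the size measure acts as a similarity criterion.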
ARTICLE | doi:10.20944/preprints202304.0959.v1
Subject: Engineering, Mechanical Engineering Keywords: machine learning; mechanical damage detection; pipelines; physics-informed datasets; simulations; welding detection; CNN structure optimization; sensing system; data classification performance and noise robustness
Online: 26 April 2023 (04:59:54 CEST)
This study proposes a machine-learning-based framework for detecting mechanical damage in pipelines, utilizing physics-informed datasets collected from simulations of mechanical damage. The framework provides an effective workflow from dataset generation to damage detection and identification for three types of pipeline events: welds, clamps, and corrosion defects. While the study initially focused on optimizing the CNN structure using various advanced optimizers, it also investigated the impact of sensing systems on data classification and the effect of noise on classification performance. The analysis highlights the importance of selecting the appropriate sensing system for the specific application. The authors also found that the proposed framework is robust to experimentally relevant levels of noise, suggesting its applicability in real-world scenarios where noise is present. Overall, this study contributes to the development of a more reliable and effective method for detecting mechanical damage in pipelines: the proposed framework provides an effective workflow for damage detection and identification, and the findings on the impact of sensing systems and noise add to its robustness and reliability.
ARTICLE | doi:10.20944/preprints202305.1636.v1
Subject: Computer Science And Mathematics, Information Systems Keywords: Multiple extended targets; Data association; Tracklets; Min-cost network flow; Intermittent measurements
Online: 23 May 2023 (10:17:47 CEST)
The main problem in multiple extended target tracking is distinguishing the origins of the measurements. Associating measurements with their possible origins within a target's extent is difficult, especially under occlusions or in detection blind zones, which cause intermittent measurements. To solve this problem, a hierarchical network-based tracklets data association algorithm is proposed. At the low level, a min-cost network flow model is used to extract possible tracklets from the divided measurement set. At the high level, the trajectories are estimated from the tracks produced by the low-level network. The experimental results show that the hierarchical network-based tracklets data association algorithm outperforms the JPDA and RFS-based methods when measurements are intermittently unavailable.
ARTICLE | doi:10.20944/preprints202104.0142.v1
Subject: Physical Sciences, Acoustics Keywords: atomic data; inner-shell photoionization; atomic nitrogen ion
Online: 5 April 2021 (14:22:55 CEST)
High-resolution K-shell photoionization cross-sections for the C-like atomic nitrogen ion (N+) are reported in the 398 eV (31.15 Å) to 450 eV (27.55 Å) energy (wavelength) range. The results were obtained from absolute ion-yield measurements using the SOLEIL synchrotron radiation facility for spectral bandpasses of 65 meV or 250 meV. In the photon energy region 398 eV - 403 eV, 1s⟶2p autoionizing resonance states dominated the cross-section spectrum. Analyses of the experimental profiles yielded resonance strengths and Auger widths. In the 415 eV - 440 eV photon region, 1s⟶1s2s²2p²(⁴P)np and 1s⟶1s2s²2p²(²P)np resonances forming well-developed Rydberg series up to n=7 and n=8, respectively, were identified in both the single and double ionization spectra. Theoretical photoionization cross-section calculations, performed using the R-matrix plus pseudo-states (RMPS) method and the multiconfiguration Dirac-Fock (MCDF) approach, were benchmarked against these high-resolution experimental results. Comparison of the state-of-the-art theoretical work with the experimental studies allowed the identification of new resonance features. Resonance strengths, energies and Auger widths (where available) are compared quantitatively with the theoretical values. Contributions from excited metastable states of the N+ ions were carefully considered throughout.
ARTICLE | doi:10.20944/preprints202001.0274.v1
Subject: Computer Science And Mathematics, Mathematical And Computational Biology Keywords: bioinformatics; computational genomics; computational medicine; data science; data visualization; parallel processing; grid computing; fog computing
Online: 24 January 2020 (10:26:26 CET)
Conventional data visualization software has greatly improved the efficiency of mining and visualizing biomedical data. However, applying a grid computing approach to such visualization can hypothetically increase research opportunities, at the cost of added complexity. This paper first presents data visualization examples in conventional networks, then goes into greater detail about more complex techniques for leveraging parallel processing architectures. These techniques include an attempt to build a basic generative adversarial network (GAN) in order to increase the statistical pool of biomedical data for analysis, as well as an introduction to the project utilizing the decentralized-internet SDK. The paper thus moves from the conventional examples to the deeper experimentation and its self-contained results.
ARTICLE | doi:10.20944/preprints202301.0522.v1
Subject: Computer Science And Mathematics, Data Structures, Algorithms And Complexity Keywords: autonomous vehicle; data set; multidriver; biometric
Online: 28 January 2023 (07:55:36 CET)
The development of autonomous vehicles is becoming increasingly popular, and gathering real-world data is considered a valuable task. Many datasets have been published recently in the autonomous vehicle sector, with synthetic datasets gaining particular interest due to availability and cost. For a real implementation and correct evaluation of vehicles at higher levels of autonomy, it is also necessary to consider human interaction, which is precisely what existing datasets lack. This article presents the UPCT dataset, a public dataset containing high-quality multimodal data obtained with state-of-the-art sensors and equipment installed onboard the UPCT's CICar autonomous vehicle. The dataset includes data from a variety of perception sensors, including 3D LiDAR, cameras, IMU, GPS and encoders, as well as driver biometric data and driver behaviour questionnaires. In addition to the dataset, the software developed for data synchronisation and processing has been made available. The quality of the dataset was validated using an end-to-end neural network model with multiple inputs to obtain speed and steering wheel angle, obtaining very promising results.
ARTICLE | doi:10.20944/preprints202110.0362.v1
Subject: Computer Science And Mathematics, Probability And Statistics Keywords: 3D reconstruction; 3D data smoothing; mesh simplification; high resolution micro-CT images
Online: 25 October 2021 (15:34:27 CEST)
Three-dimensional reconstruction plays an important role in assisting doctors and surgeons in diagnosing the healing progress of bone defects. Common three-dimensional reconstruction methods include surface and volume rendering. As the focus here is on the shape of the bone, volume rendering is omitted. Many improvements have been made to surface rendering methods such as Marching Cubes and Marching Tetrahedra, but few towards real-time or near real-time surface rendering for large medical images, or towards studying the effects of different parameter settings for these improvements. Hence, in this study, an attempt at near real-time surface rendering for large medical images is made. Different parameter values are tested to study their effect on reconstruction accuracy, reconstruction and rendering time, and the number of vertices and faces. The proposed improvement, involving three-dimensional data smoothing with a Gaussian convolution kernel of size 0.5 and a mesh simplification reduction factor of 0.1, is the best parameter value combination for achieving a good balance between high reconstruction accuracy, low total execution time, and a low number of vertices and faces. It successfully increased the reconstruction accuracy by 0.0235%, decreased the total execution time by 69.81%, and decreased the number of vertices and faces by 86.57% and 86.61%, respectively.
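As a one-dimensional analogue of the three-dimensional data smoothing step described above, the sketch below builds a normalised Gaussian convolution kernel (using the abstract's value of 0.5 for sigma; the `radius` truncation parameter is a hypothetical choice, not taken from the study) and applies it to a signal with clamped borders.

```python
import math

def gaussian_kernel(sigma, radius):
    """Discrete Gaussian weights on [-radius, radius], normalised to sum to 1."""
    weights = [math.exp(-(i * i) / (2.0 * sigma * sigma))
               for i in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]

def smooth(signal, sigma=0.5, radius=2):
    """Convolve a 1-D signal with the Gaussian kernel, clamping at the borders."""
    kernel = gaussian_kernel(sigma, radius)
    n = len(signal)
    out = []
    for i in range(n):
        acc = 0.0
        for k, w in zip(range(-radius, radius + 1), kernel):
            j = min(max(i + k, 0), n - 1)  # clamp index at the edges
            acc += w * signal[j]
        out.append(acc)
    return out
```

Because the kernel sums to one, flat regions are preserved while isolated spikes (e.g. voxel noise) are attenuated, which is what makes the subsequent mesh simplification cheaper.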
ARTICLE | doi:10.20944/preprints202304.0222.v1
Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Global models; Deep learning; Data partitioning; Time series features; Model complexity; Intermittent demand; Retail
Online: 11 April 2023 (10:41:55 CEST)
Global models have been developed to tackle the challenge of forecasting sets of series that are related or share similarities, but not for heterogeneous datasets. Various methods of partitioning by relatedness have been introduced to enhance the similarities within the set, resulting in improved forecasting accuracy but often at the cost of a reduced sample size, which could be harmful. To shed light on how the relatedness between series impacts the effectiveness of global models in real-world demand forecasting problems, we perform an extensive empirical study using the M5 competition dataset. We examine cross-learning scenarios driven by the product hierarchy commonly employed in retail planning, which allow global models to capture interdependencies across products and regions more effectively. Our findings show that global models outperform state-of-the-art local benchmarks by a considerable margin, indicating that they are not inherently more limited than local models and can handle unrelated time series data effectively. The accuracy of data partitioning approaches increases as the size of the data pools and the models' complexity decrease. However, there is a trade-off between data availability and data relatedness. Smaller data pools lead to increased similarity among time series, making it easier to capture cross-product and cross-region dependencies, but this comes at the cost of a reduced sample, which may not be beneficial. Finally, it is worth noting that the successful implementation of global models for heterogeneous datasets can significantly impact forecasting practice.
ARTICLE | doi:10.20944/preprints202112.0070.v1
Subject: Engineering, Control And Systems Engineering Keywords: Exoplanets Detection; Deep learning; Real and Simulated Data
Online: 6 December 2021 (12:36:42 CET)
Scientists and astronomers have attached great importance to the task of discovering new exoplanets, even more so if they are in the habitable zone. To date, more than 4300 exoplanets have been confirmed by NASA using various discovery techniques, including planetary transits, in addition to the use of various databases provided by space and ground-based telescopes. This article proposes the development of a deep learning system for detecting planetary transits in Kepler Telescope lightcurves. The approach builds on related work from the literature and is enhanced for validation with real lightcurves. A CNN classification model is trained on a mixture of real and synthetic data, and validated only on real data distinct from those used in the training stage. The best ratio of synthetic data is determined by applying an optimisation technique and a sensitivity analysis. The precision, accuracy and true positive rate of the best model obtained are determined and compared with other similar works. The results demonstrate that the use of synthetic data in the training stage can improve transit detection performance on real lightcurves.
ARTICLE | doi:10.20944/preprints202008.0074.v1
Subject: Computer Science And Mathematics, Probability And Statistics Keywords: data mining; cardiovascular diseases; cluster analysis; principal component analysis
Online: 4 August 2020 (03:56:19 CEST)
Cardiovascular disease is the number one cause of death in the world: according to the WHO, around 31% of deaths worldwide are caused by cardiovascular diseases, and more than 75% of these deaths occur in developing countries. Patients with cardiovascular disease generate many medical records that can be used for further patient management. This study aims to develop a data mining method that groups patients with cardiovascular disease in order to determine the level of patient complications in two clusters. The method applied is principal component analysis (PCA), which reduces the dimensions of the large available dataset, combined with cluster analysis implementing the K-Medoids algorithm. Data reduction with PCA produced five new components with a cumulative proportion of variance of 0.8311. The five new components were used for cluster formation with the K-Medoids algorithm, resulting in two clusters with a silhouette coefficient of 0.35. The combination of data reduction by PCA with the K-Medoids clustering algorithm is a new way of grouping data of patients with cardiovascular disease based on the level of patient complications in each generated cluster.
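A minimal sketch of the K-Medoids alternating loop that follows the PCA step in studies like this one; it is not the paper's implementation, and the deterministic seeding, plain Euclidean distance, and toy 2-D points are illustrative assumptions (it also assumes no cluster ever becomes empty).

```python
def euclid(a, b):
    """Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5

def kmedoids(points, k, dist, iters=20):
    """Naive K-Medoids: alternate nearest-medoid assignment and medoid update."""
    medoids = points[:k]  # deterministic seed: the first k points
    clusters = []
    for _ in range(iters):
        # assign each point to its nearest medoid
        clusters = [[] for _ in range(k)]
        for p in points:
            idx = min(range(k), key=lambda i: dist(p, medoids[i]))
            clusters[idx].append(p)
        # update: each medoid becomes the member minimising total in-cluster distance
        new_medoids = [min(c, key=lambda m: sum(dist(m, q) for q in c))
                       for c in clusters]
        if new_medoids == medoids:
            break  # converged
        medoids = new_medoids
    return medoids, clusters
```

Unlike K-Means, the cluster centers are always actual data points, which is why K-Medoids is less sensitive to outliers in skewed medical data.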
ARTICLE | doi:10.20944/preprints202204.0068.v1
Subject: Computer Science And Mathematics, Computational Mathematics Keywords: Functional Data Analysis; Image Processing; Brain Imaging; Neuroimaging; Computational Neuroscience; Data Science
Online: 8 April 2022 (03:21:06 CEST)
Functional Data Analysis (FDA) is a relatively new field of statistics dealing with data expressed in the form of functions. FDA methodologies can be easily extended to the study of imaging data, an application proposed in Wang et al. (2020), where the authors settle the mathematical groundwork and properties of the proposed estimators. This methodology allows for the estimation of mean functions and simultaneous confidence corridors (SCC), also known as simultaneous confidence bands, for imaging data and for the difference between two groups of images. This is especially relevant for the field of medical imaging, as one of the most common research setups consists of the comparison between two groups of images: a pathological set against a control set. FDA applied to medical imaging presents at least two advantages over previous methodologies: it avoids loss of information in complex data structures and avoids the multiple comparison problem arising from traditional pixel-to-pixel comparisons. Nonetheless, computing times for this technique have only been explored in reduced and simulated setups (Arias-López et al., 2021). In the present article, we apply this procedure to a practical case with data extracted from open neuroimaging databases, and then measure computing times for the construction of Delaunay triangulations and for the computation of mean functions and SCCs for one-group and two-group approaches. The results suggest that previous research has been too conservative in its parameter selection and that computing times for this methodology are reasonable, confirming that this method should be further studied and applied to the field of medical imaging.
ARTICLE | doi:10.20944/preprints202011.0297.v1
Subject: Computer Science And Mathematics, Mathematics Keywords: regression; time point data; modelling
Online: 10 November 2020 (10:00:37 CET)
In this paper, we present a regression-based modelling approach to analyse multiple-series MTC data. A typical application of this modelling approach includes three steps: first, define a model that approximates the relationship between gene expression and experimental factors, with parameters incorporated to address the research interest; second, use least-squares and estimating-equation methods to estimate the parameters and their corresponding standard errors; third, compute test statistics, P-values and NFD as measures of statistical significance. The advantages of this approach are as follows. First, it addresses the research interest in a specific, precise way, and maximally uses all the data and other relevant information. Second, it accounts for both systematic and random variation associated with the data, and the results of such an analysis provide not only gene-specific information relevant to the research goal but also its reliability, thereby helping investigators make better decisions for subsequent studies. Third, this approach is very flexible, and can easily be extended to other kinds of MTC studies or other microarray experiments by formulating different models based on the experimental design of the studies.
COMMUNICATION | doi:10.20944/preprints201803.0054.v1
Subject: Computer Science And Mathematics, Information Systems Keywords: data feature selection; data clustering; travel time prediction
Online: 7 March 2018 (13:30:06 CET)
In recent years, governments have applied intelligent transportation system (ITS) techniques to provide several convenience services (e.g., a garbage truck app) for residents. This study proposes a garbage truck fleet management system (GTFMS) together with data feature selection and data clustering methods for travel time prediction. A GTFMS includes mobile devices (MDs), on-board units, a fleet management server, and a data analysis server (DAS). When a user requests the arrival time of a garbage truck via an MD, the DAS performs the data feature selection and data clustering procedures to analyse the travel time of the garbage truck. The proposed methods cluster the records of travel time and reduce variation in order to improve travel time prediction. After predicting the travel time and arrival time, the predicted information is sent to the user's MD. In the experimental environment, the results showed that the accuracies of the previous method and the proposed method are 16.73% and 85.97%, respectively. Therefore, the proposed data feature selection and data clustering methods can be used to predict the stop-to-stop travel time of garbage trucks.
ARTICLE | doi:10.20944/preprints202302.0362.v2
Subject: Engineering, Bioengineering Keywords: Wearable devices; Wearable sensors; Data glove; Biomechatronic design; Hand kinematics; Joint measurement; Flex sensors; Biomedical engineering
Online: 27 February 2023 (10:40:17 CET)
For technical or medical applications, knowledge of the exact kinematics of the human hand is key to utilizing its capability to handle and manipulate objects and to communicate with other humans or machines. The optimal relationship between the number of measurement parameters, measurement accuracy, and the complexity, usability and cost of the measuring system is hard to find. Biomechanical assumptions, the concepts of a biomechatronic system, the mechatronic design process, and commercially available components are used to develop a sensorized glove. The proposed wearable can measure 14 of the 15 angular values of a simplified hand model introduced in this paper. Additionally, five contact pressure values at the fingertips and inertial data of the whole hand with six degrees of freedom are gathered. Due to the modular design and a hand size examination based on anthropometric parameters, the concept of the wearable is applicable to a large variety of hand sizes and adaptable to different use cases. Validations show a combined root-mean-square error of 0.99° to 2.38° for the measurement of all joint angles at one finger, surpassing the human perception threshold and the current state of the art in science and technology for comparable systems.
ARTICLE | doi:10.20944/preprints202305.0390.v1
Subject: Public Health And Healthcare, Other Keywords: exploratory data analysis; non-parametric statistics; skewed data; survival analysis; repeated measures.
Online: 6 May 2023 (08:32:28 CEST)
Outliers can influence regression model parameters and change the direction of the estimated effect, over-estimating or under-estimating the strength of the association between a response variable and an exposure of interest. Identifying visit-level outliers in longitudinal data with continuous time-dependent covariates is important, especially when the distribution of such a variable is highly skewed at follow-up visits. The primary objective was to identify potential outliers at follow-up visits using the interquartile range (IQR) statistic, motivated by a large TEDDY dietary longitudinal and time-to-event dataset with continuous time-varying vitamin B12 intake as the exposure of interest and time to developing Islet Autoimmunity (IA) as the response variable. The IQR method was also applied to simulated data. To assess the impact of the outliers detected by the IQR method, the data were analyzed using a Cox proportional hazards model with a robust sandwich estimator. Partial residual diagnostic plots were used to detect highly influential outliers. Results showed how some of the detected outliers had a large influence on the Cox regression model and changed both the direction of hazard ratios and the strength of association with the risk of developing IA. In conclusion, the IQR method is useful for identifying potential visit-level outliers, which can then be further investigated.
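The IQR rule described above amounts to the classic Tukey fences; in this sketch, the linear-interpolation quartile scheme and the k=1.5 multiplier are conventional choices, not necessarily the exact ones used in the study.

```python
def quartiles(values):
    """Lower and upper quartiles via linear interpolation on the sorted sample."""
    s = sorted(values)
    def q(p):
        pos = p * (len(s) - 1)
        lo = int(pos)
        frac = pos - lo
        hi = min(lo + 1, len(s) - 1)
        return s[lo] + frac * (s[hi] - s[lo])
    return q(0.25), q(0.75)

def iqr_outliers(values, k=1.5):
    """Flag values outside the Tukey fences Q1 - k*IQR and Q3 + k*IQR."""
    q1, q3 = quartiles(values)
    iqr = q3 - q1
    lo, hi = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lo or v > hi]
```

Applied per follow-up visit, this flags intake values far outside the bulk of the visit's distribution, which can then be checked for influence on the Cox model.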
ARTICLE | doi:10.20944/preprints201905.0158.v1
Subject: Medicine And Pharmacology, Other Keywords: blockchain; biomedical data managing; DWT; keyword search; data sharing.
Online: 13 May 2019 (13:30:37 CEST)
A crucial role is played by personal biomedical data when it comes to maintaining proficient access to health records by patients as well as health professionals. However, it is difficult to get a unified view of health data that are scattered across various health center/hospital sections. To be specific, health records are distributed across many places and cannot easily be found in integrated form. In recent years, blockchain has been regarded as a promising solution for achieving individual biomedical information sharing in a secure, privacy-preserving way, owing to its immutability. This research work puts forward a blockchain-based management scheme that helps to improve the interpretation of electronic biomedical systems. In this scheme, two blockchains form the base: the second blockchain algorithm generates a secure sequence for the hash key produced by the first blockchain algorithm. An adaptivity feature enables the algorithm to use multiple data types and to combine various biomedical images and text records. All the data, including keywords, digital records and patient identities, are encrypted with a private key, with keyword searching capability, so as to maintain data privacy preservation, access control and protected search. The obtained results, which show low latency (less than 750 ms) at 400 requests/second, indicate its suitability for several health care units such as hospitals and clinics.
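The immutability the scheme relies on comes from hash chaining, which a minimal single-chain sketch can illustrate; the paper's dual-blockchain design with encrypted keyword search is considerably more elaborate, and the field names below are hypothetical.

```python
import hashlib
import json

def make_block(record, prev_hash):
    """Minimal hash-chained block: the record plus the previous block's hash."""
    payload = json.dumps({"record": record, "prev": prev_hash}, sort_keys=True)
    return {"record": record, "prev": prev_hash,
            "hash": hashlib.sha256(payload.encode()).hexdigest()}

def verify_chain(chain):
    """Recompute every hash and check each block points at its predecessor."""
    for i, block in enumerate(chain):
        payload = json.dumps({"record": block["record"], "prev": block["prev"]},
                             sort_keys=True)
        if block["hash"] != hashlib.sha256(payload.encode()).hexdigest():
            return False  # the record was altered after hashing
        if i > 0 and block["prev"] != chain[i - 1]["hash"]:
            return False  # the chain link is broken
    return True
```

Tampering with any stored record invalidates its hash and every link after it, which is the property that makes an append-only ledger of biomedical records auditable.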
ARTICLE | doi:10.20944/preprints202202.0134.v1
Subject: Medicine And Pharmacology, Immunology And Allergy Keywords: DOLAVI; Dolutegravir; Lamivudine; Real World Data; HIV
Online: 9 February 2022 (10:45:33 CET)
Background: The objectives were to determine the real-life effectiveness and safety of dual therapy (DT) with dolutegravir (50 mg/QD) plus lamivudine (300 mg/QD) in a multiple-tablet regimen (MTR) in naïve PLHIV followed up for 48 weeks, and to evaluate the compliance and satisfaction of patients. Material and methods: Open, single-arm, multicenter, non-randomized clinical trial from May 2019 through September 2020 with 48-week follow-up. Results: The study included 88 PLHIV (91% male) with a mean age of 35.9 years; 76.1% were MSM. Mean baseline CD4 was 516.4 cells/uL, with a viral load (VL) of 104,828 cop/mL, and 11.4% were in AIDS stage. DT started within 7 days of the first specialist consultation in all patients, and on the same day in 84.1%; 3.4% had baseline resistance mutations (K103N, V106I+E138A, and V108I); 12.5% were lost to follow-up. At week 48, 86.3% had VL <50 cop/mL by intention-to-treat analysis and 98.7% by per-protocol (PP) analysis. Virological failure (VF) was recorded in 1.1%, with no resistance mutation. One blip was detected in 5.2%, without VF. Three patients reported anxiety, dizziness, and cephalgia, respectively, at week 4 and one insomnia at week 24; none reported adverse events at week 48. Mean weight was 4 kg higher at 48 weeks (p=0.0001) and abdominal circumference 3 cm larger at 24 weeks (p=0.022). No forgetfulness occurred in 98.7% of patients. Patient satisfaction was 90/100 at 4, 24, and 48 weeks. Conclusion: Real-world data demonstrate that dolutegravir plus lamivudine in MTR is effective, safe, and satisfactory, moderately increasing weight and abdominal circumference, and administrable in a test-and-treat strategy.
ARTICLE | doi:10.20944/preprints202209.0341.v1
Subject: Social Sciences, Decision Sciences Keywords: Real State; Regressors; Artificial Intelligence; Machine Learning; Data-informed; Boston
Online: 22 September 2022 (10:33:09 CEST)
Real estate market analysis and place-based decision-making can both benefit from an understanding of house price development. Although considerable interest has been devoted to housing price modelling, the assessment of house price fluctuation still requires further comparative study. Housing price prediction is challenging, as the contributing factors are quite dynamic and subject to a variety of regulating elements. Understanding future housing market trends not only builds customers' investment trust but also enables financial support to be planned more realistically in advance. In this study, a comprehensive data-informed framework is developed to investigate and anticipate real estate house prices using historical data by combining explanatory features. We examined about 500 houses in the Boston area as a case study and discussed how the increase in housing prices could vary with each of the contributing components. Fourteen Machine Learning (ML) regressors are applied to the dataset, leading to a comparative study of the accuracy of all the models. The ML-based regressors forecast real estate home prices as a function of thirteen influencing factors. The most informative features were also selected by applying the Permutation Feature Importance technique to all the features. The study provides a comprehensive tool for evaluating the robustness and efficiency of ML models for housing price predictions. The results highlighted Random Forest as the best model, with an R2 of 0.88, and Voting Regressor as the second highest rated model, with an R2 of 0.87. The results of the multivariate exploratory data analysis also implied that the average number of rooms and the percentage of the lower-status population have the most significant impact on the price range predictions.
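Ranking regressors by the coefficient of determination, as in the comparative study above, reduces to computing R² = 1 − SS_res/SS_tot per model; a small sketch with hypothetical model names and predictions:

```python
def r_squared(y_true, y_pred):
    """Coefficient of determination: R^2 = 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_tot = sum((y - mean) ** 2 for y in y_true)
    ss_res = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    return 1.0 - ss_res / ss_tot

def rank_models(y_true, predictions):
    """Return (model name, R^2) pairs sorted best-first."""
    scores = {name: r_squared(y_true, pred)
              for name, pred in predictions.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
```

R² of 1.0 means perfect prediction, while always predicting the mean scores 0.0, which makes it a natural yardstick for comparing fourteen heterogeneous regressors on one dataset.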
ARTICLE | doi:10.20944/preprints202208.0224.v1
Subject: Engineering, Automotive Engineering Keywords: VR-XGBoost; K-VDTE; ETC data; ESAs; data mining
Online: 12 August 2022 (03:53:23 CEST)
To scientifically and effectively evaluate the service capacity of expressway service areas (ESAs) and improve the management level of ESAs, we propose a method for the recognition of vehicles entering ESAs (VeESAs) and the estimation of vehicle dwell times using ETC data. First, the ETC data and their advantages are described in detail, and cleaning rules are designed according to the characteristics of the ETC data. Second, we establish feature engineering according to the characteristics of VeESAs and propose the XGBoost-based VeESA recognition (VR-XGBoost) model. Having studied the driving rules in depth, we construct a kinematics-based vehicle dwell time estimation (K-VDTE) model. Field validation in Part A/B of the Yangli ESA using real ETC transaction data demonstrates that our proposal outperforms the current state of the art. Specifically, in Part A and Part B, the recognition accuracies of VR-XGBoost are 95.9% and 97.4%, respectively; the mean absolute errors (MAEs) of dwell time are 52 s and 14 s, respectively; and the root mean square errors (RMSEs) are 69 s and 22 s, respectively. In addition, the confidence level for controlling the MAE of dwell time within 2 minutes is more than 97%. This work can effectively identify VeESAs and accurately estimate dwell times, providing a reference and theoretical basis for the service capacity evaluation and layout optimization of ESAs.
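The reported error metrics (MAE and RMSE of dwell time, plus the share of errors within a 2-minute tolerance) are straightforward to compute; a small sketch with hypothetical dwell times in seconds:

```python
def mae(actual, predicted):
    """Mean absolute error."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)

def rmse(actual, predicted):
    """Root mean square error (penalises large errors more than MAE)."""
    return (sum((a - p) ** 2 for a, p in zip(actual, predicted))
            / len(actual)) ** 0.5

def within_tolerance(actual, predicted, tol_s=120):
    """Share of predictions whose absolute error is at most tol_s seconds."""
    hits = sum(1 for a, p in zip(actual, predicted) if abs(a - p) <= tol_s)
    return hits / len(actual)
```

RMSE is always at least as large as MAE, so the gap between the two (e.g. 69 s vs 52 s in Part A) indicates how much a few large dwell-time errors dominate.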
REVIEW | doi:10.20944/preprints202207.0141.v1
Subject: Medicine And Pharmacology, Oncology And Oncogenics Keywords: review; real-world evidence; real-world data; randomized controlled trials; registry; digital health technology; early drug approval
Online: 8 July 2022 (11:09:58 CEST)
Real-world evidence (RWE) is increasingly involved in the early benefit assessment of medicinal drugs. It is expected that RWE will help to speed up approval processes, comparable to RWE developments in vaccine research during the COVID-19 pandemic. Definitions of RWE are diverse, marking the highly fluid status of this field. So far, RWE comprises information produced from data routinely collected on patients' health status and/or the delivery of health care from various sources other than traditional clinical trials. These sources can include electronic health records, claims, patient-generated data including in home-use settings, data from mobile devices, as well as patient, product and disease registries. The aim of the present update was to review the RWE developments and guidelines, mainly in the U.S., the UK, Europe and Germany, during the last decade. RWE has already been included in various approval procedures of regulatory authorities, reflecting its actual acceptance and growing importance in evaluating and accelerating new therapies. However, since RWE research is still in a transition process and a number of gaps in this field have been identified, more guidance and a consented definition are necessary to increase the implementation of real-world data.
ARTICLE | doi:10.20944/preprints202306.1477.v1
Subject: Medicine And Pharmacology, Medicine And Pharmacology Keywords: Ceftobiprole; sepsis; older; Real-World Data; OPAT
Online: 21 June 2023 (04:10:08 CEST)
Background: Ceftobiprole is a fifth-generation cephalosporin approved in Europe solely for the treatment of community-acquired and nosocomial pneumonia. The objective was to analyze the use of ceftobiprole medocaril (Cefto-M) in Spanish clinical practice in patients with infection, in hospital or on outpatient parenteral antimicrobial therapy (OPAT). Methods: This retrospective, observational, multicenter study included patients treated from September 1, 2021 to December 31, 2022. Results: 249 individuals were enrolled, aged 66.6±15.4 years, 59.4% male, with a Charlson index of 4 (IQR 2-6); 13.7% had COVID-19, and 4.8% were in an intensive care unit (ICU). The most frequent type of infection was respiratory (55.8%), followed by skin and soft tissue infection (21.7%). Cefto-M was administered as empirical treatment in 67.9% of patients, as monotherapy for 7 days (5-10) in 53.8% of cases. Infection-related mortality was 11.2%. The highest mortality rates were for ventilator-associated pneumonia (40%) and infections due to methicillin-resistant S. aureus (20.8%) and Pseudomonas aeruginosa (16.1%). Mortality-related factors were age (OR: 1.1, 95% CI [1.04-1.16]), ICU admission (OR: 42.02, 95% CI [4.49-393.4]), and sepsis/septic shock (OR: 2.94, 95% CI [1.01-8.54]). Conclusions: In real life, Cefto-M is a safe antibiotic, with only half of prescriptions in respiratory infections, mainly administered as rescue therapy in pluripathological patients with severe infectious diseases.
ARTICLE | doi:10.20944/preprints202106.0738.v1
Subject: Environmental And Earth Sciences, Atmospheric Science And Meteorology Keywords: time series; homogenization; ACMANT; observed data; data accuracy
Online: 30 June 2021 (13:08:39 CEST)
The removal of non-climatic biases, so-called inhomogeneities, from long climatic records needs sophisticated statistical methods. One principle is that the differences between a candidate series and its neighbour series are usually analysed instead of the candidate series directly, in order to neutralize the possible impact of regionally common natural climate variation on the detection of inhomogeneities. In most homogenization methods, two main kinds of time series comparison are applied: composite reference series or pairwise comparisons. In composite reference series, the inhomogeneities of neighbour series are attenuated by averaging the individual series, and the accuracy of homogenization can be improved by iteratively refining the composite reference series. By contrast, pairwise comparisons have the advantage that coincidental inhomogeneities affecting several station series in a similar way can be identified with higher certainty than with composite reference series. In addition, homogenization with pairwise comparisons tends to yield the most accurate regional trend estimations. A new time series comparison method is presented here, which combines pairwise comparisons and composite reference series in a way that unifies their advantages. This comparison method is embedded in the ACMANT homogenization method and tested on large, commonly available monthly temperature test datasets.
ARTICLE | doi:10.20944/preprints202306.1667.v1
Subject: Environmental And Earth Sciences, Space And Planetary Science Keywords: GNSS time series; time length; missing data; noise analysis; velocity estimation
Online: 23 June 2023 (11:42:43 CEST)
The noise model selection criterion has a significant impact on identifying the stochastic noise properties of any GNSS daily coordinate time series. The low-frequency random walk (RW) noise existing in these time series can lead to overestimation of the tectonic rate, so accurately detecting the random walk component is of great significance. This study focuses on a noise model estimation criterion (BIC_tp) derived from the AIC and the BIC by introducing a 2π factor; it is more sensitive to abnormal steps (random jumps). Using observation data from 72 GNSS stations from 1992 to 2022, together with simulated data, four combined noise models are used to explore the impacts of time series length (ranging from 2 to 24 years) and data loss (between 2% and 30%) on noise model selection and velocity estimation. The results show that as the time length increases, the selected optimal noise model and the estimated uncertainty of the tectonic trend under different data gaps gradually converge. When the time length is short (less than 8 years), the FNRWWN, FNWN, and PLWN models can be mistakenly identified as GGMWN models, thereby affecting the accuracy of the estimated station velocity parameters. When the time length reaches 12 years, the RW noise component is more likely to be detected, and as the time length increases further, the impact of RW on velocity uncertainty weakens. Finally, we conclude that for a time series with a minimum length of 12 years, both the selection of the optimal stochastic noise model and the estimation of the velocity parameters are reliable.
ARTICLE | doi:10.20944/preprints201801.0077.v1
Subject: Engineering, Electrical And Electronic Engineering Keywords: UAVs sensor fusion; EKF; real data analysis; system design
Online: 9 January 2018 (07:47:45 CET)
This paper presents a methodology for designing sensor fusion parameters using real navigation performance indicators in UAVs based on the PixHawk flight controller and its peripherals. The methodology and the selected performance indicators allow finding the best parameters for the fusion system of a given sensor configuration and a predefined real mission. The selected real platform is described with emphasis on the available sensors and data processing software, and an experimental methodology is proposed to characterize the sensor data fusion output and determine the best choice of parameters, using quality measurements of the tracking output with performance metrics that do not require ground truth.
COMMUNICATION | doi:10.20944/preprints202301.0335.v2
Subject: Computer Science And Mathematics, Information Systems Keywords: Cloud Computing; Data Protection; Secure Communication; Middleware; Protocols
Online: 30 January 2023 (09:24:01 CET)
In recent years, Cloud Computing and Big Data have been considered the most attractive areas revolutionizing the IT world. The Cloud Computing paradigm allows running proprietary or hard-to-port applications outside their original software environment on one or more virtual hardware platforms. We therefore develop techniques that secure communication between the communicating Cloud entities. These techniques must take several factors into account, because the data transmitted in this type of environment are proprietary and of significant size, and conventional data security techniques are not suitable for today's cloud usage. Hence, the main goal of this work is to define an adaptable architecture and propose a scalable system that supports cloud services. We define feasible security solutions dedicated to the Cloud computing context in order to robustly protect data stored in the Cloud, focusing more precisely on NoSQL databases. We also propose a secure solution based on the blockchain, which has powerful features such as decentralization, autonomy, security, reliability, and transparency.
ARTICLE | doi:10.20944/preprints202302.0211.v1
Subject: Engineering, Electrical And Electronic Engineering Keywords: FPGA; DAQ; Data concentration; Beneš network
Online: 13 February 2023 (09:10:55 CET)
The concentration of data from multiple links into a single output is an essential task performed by High-Energy Physics (HEP) Data Acquisition Systems (DAQs). At high and varying data rates, combined with the large width of the concentrator's output interface, this task is non-trivial. This paper presents a concentrator based on the Beneš network, which provides efficient concentration without using a high-frequency clock internally. It guarantees that empty data are eliminated and does not disturb the time-ordering of the data even if data rates differ significantly between inputs. Additionally, it is well suited to FPGA implementation: it is based on simple data-routing primitives and may be fully pipelined.
ARTICLE | doi:10.20944/preprints201904.0281.v1
Subject: Computer Science And Mathematics, Information Systems Keywords: Cluster computing; Big Data; Spark; Hadoop
Online: 25 April 2019 (11:22:27 CEST)
The article provides detailed information about the cluster computing technologies Hadoop and Apache Spark. An experimental task, processing logistic regression with these technologies, is considered. Findings from the performance comparison of Hadoop and Apache Spark cluster computing are presented and substantiated.
ARTICLE | doi:10.20944/preprints202308.0307.v1
Subject: Computer Science And Mathematics, Mathematics Keywords: conditional distribution function; asymptotic normality; conditional hazard function; quasi-associated; functional data
Online: 3 August 2023 (10:53:17 CEST)
The objective of this study is to examine a nonparametric estimator, using the kernel approach, of the conditional distribution function of a scalar response variable given a random variable taking values in a separable real Hilbert space. The observations are dependent on one another in a quasi-associated fashion. We establish the pointwise almost-complete consistency, with rates, of this estimator under some broad conditions. The study's major objective is to investigate the convergence rate of the proposed estimator and its application to the convergence rate and asymptotic normality of the hazard function. The asymptotic normality of the developed estimator is established precisely. Simulation studies were conducted to investigate the behavior of the asymptotic property in finite samples.
ARTICLE | doi:10.20944/preprints202305.0049.v1
Subject: Computer Science And Mathematics, Computer Science Keywords: concurrent systems; Mazurkiewicz traces; interval order; Petri net with data
Online: 2 May 2023 (03:13:37 CEST)
Studying concurrent systems means sorting the read/write and control events of the system into an order, whether partial or strict. Because of the concurrency of events and the duration of write events, events may overlap, which requires interval order analysis. The goal is a single sequence of beginnings and endings that represents the entire stratified order structure as well as all equivalent interval order observations. Mazurkiewicz traces and comtraces are stratified traces, but they cannot describe interval traces. This summary states the problem (Petri net interval traces with data), the previous work (Mazurkiewicz traces, comtraces), the reasons they cannot solve it, and our own solution (DPN: Petri nets with data, interval traces). This paper focuses on a BE (Beginnings and Endings) sequence representing an equivalence class of runs: our goal is to have one BE sequence represent the entire stratified order structure, that is, all equivalent observable interval orders.
ARTICLE | doi:10.20944/preprints202008.0626.v1
Subject: Engineering, Civil Engineering Keywords: multispectral lidar; single-photon lidar; building data; 3D reconstruction
Online: 28 August 2020 (08:49:07 CEST)
This paper investigates building data from multispectral and single-photon Lidar systems. Multispectral datasets from the individual channels and from fused channels were explored. The multispectral and single-photon Lidar data were compared across multiple aspects: data acquisition geometry, number of echoes, intensity, density, resolution, data defects, noise level, and absolute and relative accuracy. In addition, we explored the performance of the multispectral and single-photon data for roof plane detection on eight complex/stylish buildings to investigate the suitability of these data for 3D building reconstruction. The building data from the single-photon and multispectral Lidar systems were evaluated against reference building vector data with an accuracy of better than 5 cm. The advantages and disadvantages of both technologies and their applications in the urban building environment are discussed.
ARTICLE | doi:10.20944/preprints202307.1942.v1
Subject: Engineering, Bioengineering Keywords: exponential modes; visible modes; hidden modes; data limitations; input-output data; mechanistic model; model distinguishability; invariant 2-dimensional manifolds
Online: 28 July 2023 (13:12:25 CEST)
The particulars of stimulus-response experiments done on dynamic biosystems clearly limit what one can learn and validate about their structural interconnectivity (topology), even when the collected kinetic output data are perfect (noise-free). As always, available access ports and other data limitations rule. For linear systems, exponential modes, visible and hidden, play an important role in understanding data limitations, embodied in what we call dynamical signatures in the data. We show here how to circumscribe and analyze modal response data in compartmental model structures, so that modal analysis can be used constructively in systems biology model building, for nonlinear (NL) as well as linear biosystems. We do this by developing and exploiting the modal basis for dynamical signatures in hypothetical (perfect) input-output data associated with a structural model, one that includes inputs and outputs explicitly. The methodology establishes model dimensionality (size, complexity) from particular data sets; helps select among multiple candidate models (model distinguishability); helps in designing new experiments to extract "hidden" structure; and helps to simplify (reduce) models to their essentials. For NL biosystems, the results are not as comprehensive, but they are similarly informative about dominant dynamical properties and are unified with linear models on invariant 2-dimensional manifolds in phase space. Some automation of these highly technical aspects of biomodeling is also introduced.
ARTICLE | doi:10.20944/preprints202307.1117.v1
Subject: Computer Science And Mathematics, Information Systems Keywords: history; endowments; query model; digital data; physical data
Online: 17 July 2023 (15:11:18 CEST)
Historical and endowment properties differ from heritage and cultural properties in that they are governed by a unique set of laws that waqf recipients must abide by. Entrusted property usually takes the form of buildings, land, or valuables, whose preservation is not limited in time as long as the property can be utilized. Reliable information technology is needed to ensure the security of both digital and physical data, while the rapid development of information technology demands openness of information, which is a challenge in itself. The objectives of this study include examining the collection of historical and endowment databases, the relationship between digital and physical data, and the management organizations involved. We design a query model to display the data and then analyze whether the data conform to the rules of waqf management. The results are expected to reconcile digital and physical data, with any discrepancies becoming findings for further analysis.
CONCEPT PAPER | doi:10.20944/preprints201810.0724.v2
Subject: Social Sciences, Political Science Keywords: Social-Ecological System; Water security; Governance; Institution; Learning; Data-Cube
Online: 22 November 2018 (14:47:31 CET)
The Social-Ecological Systems (SES) framework is valuable for exploring and understanding social and ecological interactions and pathways in water governance. Yet it lacks a robust understanding of change. We argue that an analytical and methodological approach to engaging with global changes in SES is critical to strengthening the scope and relevance of the SES framework. Relying on SES and resilience thinking, we propose an institutional and cognitive model of change, in which institutions and natural resource systems co-evolve, to provide a dynamic understanding of SES that stands on three causal mechanisms: the institutional complexity trap, the rigidity trap, and learning processes. We illustrate how Data Cube technology could overcome current limitations and offer reliable avenues to test hypotheses about the dynamics of social-ecological systems and water security by combining spatial and temporal data with no major technical requirements for users.
ARTICLE | doi:10.20944/preprints202010.0093.v1
Subject: Computer Science And Mathematics, Algebra And Number Theory Keywords: Functional data; Local linear estimation; Asymptotic normality; Conditional hazard function
Online: 6 October 2020 (09:18:49 CEST)
In this work, we treat a prediction problem via the conditional hazard function of a scalar response variable Y given a functional random variable X, using the local linear technique. The main purpose of this paper is to investigate the asymptotic normality of the nonparametric estimator of the conditional hazard function under some general conditions. A simulation study, conducted to assess finite sample behavior, demonstrates the superiority of our method over the standard kernel method.
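For context, the conditional hazard function targeted above is, in the standard nonparametric setting, the ratio of the conditional density to the conditional survival function; this is the textbook definition, not a formula taken from the paper itself:

h(y \mid x) \;=\; \frac{f(y \mid x)}{1 - F(y \mid x)}, \qquad F(y \mid x) < 1,

where f(y|x) and F(y|x) denote the conditional density and conditional distribution function of Y given X = x. Local linear and kernel estimators of h differ in how they plug in estimates of f and F.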
REVIEW | doi:10.20944/preprints202211.0161.v1
Subject: Computer Science And Mathematics, Information Systems Keywords: High Performance Computing (HPC); big data; High Performance Data Analytics (HPDA); convergence; data locality; spark; Hadoop; design patterns; process mapping; in-situ data analysis
Online: 9 November 2022 (01:38:34 CET)
Big data has revolutionised science and technology, leading to the transformation of our societies. High Performance Computing (HPC) provides the necessary computational power for big data analysis using artificial intelligence methods. Traditionally, HPC and big data focused on different problem domains and grew into two different ecosystems. Efforts have been underway for the last few years to bring the best of both paradigms into converged HPC and big data architectures. Designing HPC and big data converged systems is a hard task requiring careful placement of data, analytics, and other computational tasks such that the desired performance is achieved with the least amount of resources. Energy efficiency has become the biggest hurdle in the realisation of HPC, big data, and converged systems capable of delivering exascale and beyond performance. Data locality is a key parameter of HPDA system design, as moving even a byte costs dearly in both time and energy as system size increases. Performance in terms of time and energy is the most important factor for users, particularly energy, due to it being the major hurdle in high performance system design and the increasing focus on environmentally sustainable green energy systems. Data locality is a broad term that encapsulates different aspects, including bringing computations to data, minimizing data movement by efficient exploitation of cache hierarchies, reducing intra- and inter-node communications, locality-aware process and thread mapping, and in-situ and in-transit data analysis. This paper provides an extensive review of the cutting edge on data locality in HPC, big data, and converged systems. We review the literature on data locality in HPC, big data, and converged environments and discuss challenges, opportunities, and future directions.
Subsequently, using the knowledge gained from this extensive review, we propose a system architecture for future HPC and big data converged systems. To the best of our knowledge, there is no such review on data locality in converged HPC and big data systems.
HYPOTHESIS | doi:10.20944/preprints201808.0127.v1
Subject: Medicine And Pharmacology, Oncology And Oncogenics Keywords: Big Data; Systems Models; Cancer metabolism; Cancer personalized treatment; Drug Discovery
Online: 6 August 2018 (15:09:15 CEST)
Coordinated sets of extremely numerous digital data on a given social or economic event are treated with Artificial Intelligence tools to obtain reasonably accurate, valuable predictions. The same approach, applied to biomedical issues such as how to choose the right drug to completely cure a given cancer patient, does not reach satisfactory results. It is the "organized biological complexity" that requires a different systems approach: integrating, in an Augmented Intelligence strategy, statistical computations on digital data, network construction from "omics" findings, well-designed mathematical models, and new experiments in an iterative pathway to reconstruct the "logic" beneath the "organized complexity," as shown here for Systems Metabolomics of cancer. On this basis, new diagnostic approaches able to identify precision drug treatments, as well as a new discovery strategy for more effective anti-cancer drugs, are described.
ARTICLE | doi:10.20944/preprints202111.0029.v1
Subject: Social Sciences, Decision Sciences Keywords: Real-world fuel consumption rate; machine learning; big data; light-duty vehicle; China
Online: 2 November 2021 (09:40:05 CET)
Private vehicle travel is the most basic mode of transportation, and effective control of the real-world fuel consumption rate of light-duty vehicles plays a vital role in promoting sustainable economic development and achieving a green, low-carbon society. Therefore, the impact factors of individual carbon emissions must be elucidated. This study builds five different models to estimate the real-world fuel consumption rate of light-duty vehicles in China. The results reveal that the Light Gradient Boosting Machine (LightGBM) model performs better than the linear regression, Naïve Bayes regression, neural network regression, and decision tree regression models, with a mean absolute error of 0.911 L/100 km, a mean absolute percentage error of 10.4%, a mean square error of 1.536, and an R squared (R2) of 0.642. This study also assesses a large number of factors, from which the three most important are extracted: the reference fuel consumption rate, engine power, and light-duty vehicle brand. Furthermore, a comparative analysis reveals that the vehicle factors with the greatest impact on real-world fuel consumption rate are vehicle brand, engine power, and engine displacement. Average air pressure, average temperature, and sunshine time are the three most important climate factors.
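The error metrics reported above (MAE, MAPE, MSE, R2) can be sketched in plain Python; the fuel-consumption numbers in the example are hypothetical placeholders, not the study's data:

```python
def mae(y_true, y_pred):
    """Mean absolute error."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def mape(y_true, y_pred):
    """Mean absolute percentage error, in percent."""
    return 100.0 * sum(abs((t - p) / t) for t, p in zip(y_true, y_pred)) / len(y_true)

def mse(y_true, y_pred):
    """Mean square error."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)

def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_t = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_t) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

# Hypothetical fuel-consumption observations and predictions (L/100 km).
y_true = [7.2, 8.5, 6.9, 9.1]
y_pred = [7.0, 8.9, 7.1, 8.8]
```

Lower MAE/MAPE/MSE and higher R2 indicate a better fit, which is how the LightGBM model is compared against the four baselines above.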
ARTICLE | doi:10.20944/preprints202308.1087.v1
Subject: Medicine And Pharmacology, Epidemiology And Infectious Diseases Keywords: Cytomegalovirus; prophylaxis; allogeneic hematopoietic cell transplantation; real-world data
Online: 15 August 2023 (09:28:45 CEST)
Prevention and management of cytomegalovirus (CMV) reactivation are important to improve the outcome of allogeneic hematopoietic cell transplantation (allo-HCT) recipients. The aim of this study was to analyze real-world data on the incidence and characteristics of CMV infections up to 1 year after allo-HCT under 100-day letermovir prophylaxis. A single-center retrospective study was conducted between November 2020 and October 2021. During the study period, 358 patients underwent allo-HCT, 306 of whom received letermovir prophylaxis. The cumulative incidence of clinically significant CMV infection (CS-CMVi) was 11.4%, 31.7%, and 36.9% at 14 weeks, 24 weeks, and 1 year post-HCT, respectively. In multivariate analysis, the risk of CS-CMVi increased with graft-versus-host disease (GVHD) ≥ grade 2 (adjusted odds ratio 3.640 [2.036–6.510]; P < 0.001). One-year non-relapse mortality was significantly higher in patients with letermovir-breakthrough CS-CMVi than in those with subclinical CMV reactivation who continued letermovir (P = 0.002). There were 18 (15.9%) refractory CMV infections in this study population. In summary, letermovir prophylaxis is effective in preventing CS-CMVi until day 100, with incidence increasing after cessation of letermovir. GVHD remains a significant risk factor in the letermovir prophylaxis era. Further research is needed to establish individualized management strategies, especially in patients with significant GVHD or letermovir-breakthrough CS-CMVi.
ARTICLE | doi:10.20944/preprints202109.0191.v1
Subject: Computer Science And Mathematics, Information Systems Keywords: Process science; Data science; Concept drift detection; Branching frequency changes
Online: 10 September 2021 (15:44:14 CEST)
Business processes continuously evolve in order to adapt to changes driven by various factors. One important process drift perspective yet to be investigated is the detection of branching condition changes in the process model; none of the existing process drift detection methods focus on such changes. Existing branching condition detection methods do not take changes within the process into account, so their results are inadequate to represent changes in the decision criteria of the process. In this paper, we present a method that can detect branching condition changes in process models. The method takes both process models and event logs as input and translates event logs into decision sequences for change-point detection. The proposed method is evaluated on simulated event logs.
ARTICLE | doi:10.20944/preprints201811.0632.v1
Subject: Engineering, Electrical And Electronic Engineering Keywords: fault line detection; data fusion; non-artificial setting; sound distance; fault distance; resonant grounding system
Online: 30 November 2018 (10:38:18 CET)
Detecting the fault line in a timely and accurate manner when a single-phase-to-earth fault occurs in a resonant grounding system is still a focus of research. This paper presents a new approach to fault line detection based on data fusion that requires no manually set threshold. First, a fault criterion based on the interphase difference energy ratio and the time-frequency correlation coefficient of each line is proposed. Subsequently, a coordinate system is established with the interphase difference energy ratio as the X axis and the time-frequency correlation coefficient as the Y axis, and the Euclidean distance algorithm is used to obtain the characteristic distance of each line by fusing the two-dimensional information. Finally, the sound (healthy) distance and the fault distance of each line are compared to discriminate the fault line. Electromagnetic Transients Program (EMTP) simulation results and an adaptability analysis confirm the effectiveness and reliability of the proposed scheme.
ARTICLE | doi:10.20944/preprints201710.0111.v1
Subject: Engineering, Electrical And Electronic Engineering Keywords: energy storage systems; charging profile; capacity loss; data-driven modeling
Online: 17 October 2017 (04:29:19 CEST)
Energy storage systems (ESSs) are penetrating various sections of the power system through different applications. An ESS can be used either as a buffer for intermittent renewable energy sources or as stand-alone distributed storage for load shifting. ESSs use different types of storage devices, such as lead-acid batteries, lithium-ion batteries, flow batteries, and super-capacitors; hybrid ESSs consisting of a few types of storage devices are also common in practice. Accurately determining the load demand of such ESSs at various instants (the charging profile) is indispensable in most cases. Capacity loss is a common phenomenon that occurs in all types of storage devices because of ageing, and it has to be accounted for when determining the charging profile of storage devices for better accuracy. Data-driven modeling is an attractive approach for determining the load demand of an ESS because of the valuable data made available by smart grid technologies. In this paper, the application of different types of data-driven models to predict the current charging profile of an ESS from previous charging profiles is examined. The proposed method can leverage existing data from the smart grid and is a black-box modeling approach.
ARTICLE | doi:10.20944/preprints202106.0330.v1
Subject: Medicine And Pharmacology, Immunology And Allergy Keywords: Chemotherapy; Radiotherapy; Cognitive dysfunction; Big data; Cohort studies; Survival analysis
Online: 14 June 2021 (07:51:57 CEST)
Background: We aimed to assess the risk of chemotherapy- and radiotherapy-related cognitive impairment in colorectal cancer patients. Methods: We randomly selected 40% of colorectal cancer patients from the Korean National Health Insurance Database (NHID), 2004-2018 (N=148,848). Patients with one or more ICD-10 diagnostic codes for dementia or mild cognitive impairment were defined as cognitive impairment cases. Patients who were aged 18 or younger, were diagnosed with cognitive impairment before colorectal cancer (N=8,225), or did not receive primary resection (N=45,320) were excluded. The effects of each chemotherapy agent on cognitive impairment were estimated; we additionally estimated the effect of radiotherapy in rectal cancer patients. Time-dependent competing risk Cox regression was conducted to estimate overall and age-specific hazard ratios (HRs) separately for colon and rectal cancer. Results: In colon cancer, capecitabine and irinotecan were associated with higher risk of cognitive impairment, while 5-fluorouracil was not. In rectal cancer, no chemotherapy agent increased the risk of cognitive impairment, nor did radiotherapy. The hazardous association of irinotecan was larger in elderly patients than in their younger counterparts. Conclusion: Heterogeneous associations between various chemotherapy agents and cognitive impairment were observed. Elderly patients were more vulnerable to possible adverse cognitive effects. Radiotherapy did not increase the risk of cognitive impairment.
ARTICLE | doi:10.20944/preprints201810.0601.v1
Subject: Engineering, Civil Engineering Keywords: support vector machine; travelling time; intelligent transportation system; artificial fish swarm algorithm; big data
Online: 25 October 2018 (10:48:45 CEST)
Freeway travelling time is affected by many factors, including traffic volume, adverse weather, accidents, and traffic control. We employ a multi-source data-mining method to analyze freeway travelling time, collecting toll data, weather data, traffic accident disposal logs, and other historical data for freeway G5513 in Hunan province, China. Using a Support Vector Machine (SVM), we propose a travelling time model based on these databases; the SVM model can capture the nonlinear relationship between travelling time and those factors. To improve the precision of the SVM model, we apply the Artificial Fish Swarm algorithm to optimize the SVM model parameters: the kernel parameter σ, the ε-insensitive loss function parameter ε, and the penalty parameter C. We compared the optimized SVM model with a Back Propagation (BP) neural network and a common SVM model, using the historical data collected from freeway G5513. The results show that the accuracy of the optimized SVM model is 17.27% and 16.44% higher than those of the BP neural network model and the common SVM model, respectively.
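The three tuned parameters enter the standard SVR formulation as the RBF kernel width σ, the ε-insensitive tube width, and the penalty C on training errors. A minimal sketch of that ε-insensitive loss and the primal objective it feeds, with made-up residuals; this illustrates the standard SVR objective the paper relies on, not its fitted model:

```python
def eps_insensitive_loss(residuals, eps):
    """Sum of max(0, |r| - eps): errors inside the eps-tube cost nothing."""
    return sum(max(0.0, abs(r) - eps) for r in residuals)

def svr_objective(w_norm_sq, residuals, C, eps):
    """Primal SVR objective: flatness term plus C-weighted tube violations."""
    return 0.5 * w_norm_sq + C * eps_insensitive_loss(residuals, eps)

# Hypothetical travel-time prediction residuals (minutes).
residuals = [0.1, -0.3, 0.05, 0.6]
```

The Artificial Fish Swarm algorithm then searches over (σ, ε, C) for the combination minimizing validation error, with σ entering through the RBF kernel exp(-||x - x'||² / (2σ²)) used to fit the model.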
REVIEW | doi:10.20944/preprints202304.0075.v1
Subject: Social Sciences, Demography Keywords: human migration; prediction; methods; artificial intelligence; data; uncertainty
Online: 6 April 2023 (07:12:19 CEST)
As a fundamental and strategic issue facing human society, human migration is a key factor affecting the development of countries and cities under constantly changing population numbers. The fuzziness of the spatiotemporal attributes of human migration limits the pool of open-source data for human migration prediction, leading to a relative lag in research on prediction algorithms. This study expands the definition of human migration research, reviews the progress of research into human migration prediction, and classifies and compares human migration algorithms based on open-source data. It also explores the critical uncertainty factors restricting the development of human migration prediction. In combination with artificial intelligence and big data technology, the paper concludes with specific suggestions and countermeasures aimed at making human migration prediction research better serve economic and social development and national strategy.
ARTICLE | doi:10.20944/preprints201709.0085.v1
Subject: Engineering, Control And Systems Engineering Keywords: Data Distribution; Multi-Path; RPL; Wireless Sensor Network
Online: 18 September 2017 (17:36:09 CEST)
The RPL protocol is a routing protocol for low-power and lossy networks. In such networks, energy is a very scarce resource, so many studies focus on minimizing global energy consumption. End-to-end latency is another important performance indicator, but existing research tends to focus on energy consumption and ignore the end-to-end delay of data transmission. In this paper, we propose an energy-balancing routing protocol that maximizes the surviving time of constrained nodes by keeping the energy consumed by each node close to that of the others. We also propose a multi-path forwarding route based on cache utilization: data are sent to the sink node through different parent nodes with certain probabilities, rather than only through the preferred parent node, thus avoiding buffer overflow and reducing end-to-end delay. Finally, the two algorithms are combined to accommodate different application scenarios. The experimental results show that the three proposed schemes improve the reliability of routing, extend the lifetime of the network, reduce the end-to-end delay, and reduce the number of DAG reconfigurations.
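The probabilistic multi-path forwarding described above can be sketched as follows; the buffer values and the weighting rule are hypothetical, chosen only to show how traffic spreads away from a congested preferred parent.

```python
import random

def pick_parent(parents, buffer_free, rng=random.Random(42)):
    # Weight each candidate parent by its free-buffer fraction, so
    # congested parents are chosen less often (a hypothetical policy
    # sketching cache-utilization-based multi-path forwarding).
    weights = [buffer_free[p] for p in parents]
    r = rng.random() * sum(weights)
    acc = 0.0
    for p, w in zip(parents, weights):
        acc += w
        if r <= acc:
            return p
    return parents[-1]

# Three candidate parents; "A" is the RPL-preferred parent but its
# buffer is nearly full, so traffic spreads to "B" and "C".
buffer_free = {"A": 0.1, "B": 0.6, "C": 0.3}
counts = {"A": 0, "B": 0, "C": 0}
for _ in range(10000):
    counts[pick_parent(["A", "B", "C"], buffer_free)] += 1
print(counts)
```

Over 10,000 packets the split follows the buffer weights, so the nearly full parent "A" carries the least traffic.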
ARTICLE | doi:10.20944/preprints202101.0235.v1
Subject: Environmental And Earth Sciences, Atmospheric Science And Meteorology Keywords: forest resources; forest and tree species distribution; machine learning; multi-sensor data fusion; National Forest Inventory data
Online: 12 January 2021 (17:35:56 CET)
Mapping forest extent and forest cover classification are important for the assessment of forest resources in socio-economic as well as ecological terms. Novel developments in the availability of remotely sensed data, computational resources, and advances in areas of statistical learning have enabled fusion of multi-sensor data, often yielding superior classification results. Most former studies of nemoral forests fusing multi-sensor and multi-temporal data have been limited in spatial extent and typically to a simple classification of landscapes into major land cover classes. We hypothesize that multi-temporal, multi-sensor data will have a specific strength in the further classification of nemoral forest landscapes owing to the distinct seasonal patterns of the phenology of broadleaves. This study aimed to classify the Danish landscape into forest/non-forest and further into forest types (broadleaved/coniferous) and species groups, using a cloud-based approach based on multi-temporal Sentinel-1 and Sentinel-2 data and machine learning (random forest) trained with National Forest Inventory (NFI) data. Mapping of non-forest and forest resulted in producer accuracies of 99% and 90%, respectively. The mapping of forest types (broadleaf and conifer) within the forested area resulted in producer accuracies of 95% for conifer and 96% for broadleaf forest. Tree species groups were classified with producer accuracies ranging from 34% to 74%. Species groups with coniferous species were the least confused, whereas the broadleaf groups, especially oak, had higher error rates. The results are applied in the Danish national accounting of greenhouse gas emissions from forests, resource assessment, and assessment of forest biodiversity potentials.
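The producer accuracies reported above are per-class omission-error rates computed from a confusion matrix (correct pixels divided by the reference total for that class). A minimal sketch with made-up counts:

```python
import numpy as np

# Rows: reference class, columns: mapped class (hypothetical counts)
labels = ["non-forest", "conifer", "broadleaf"]
cm = np.array([
    [990,   5,   5],   # non-forest reference pixels
    [ 20, 950,  30],   # conifer reference pixels
    [ 15,  25, 960],   # broadleaf reference pixels
])

# Producer accuracy: correct / reference total per class (omission view)
producer = np.diag(cm) / cm.sum(axis=1)
# User accuracy: correct / mapped total per class (commission view)
user = np.diag(cm) / cm.sum(axis=0)

for name, p, u in zip(labels, producer, user):
    print(f"{name}: producer={p:.2%} user={u:.2%}")
```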
ARTICLE | doi:10.20944/preprints202012.0728.v1
Subject: Biology And Life Sciences, Biochemistry And Molecular Biology Keywords: omics data; hierarchical clustering; noise quantification
Online: 29 December 2020 (14:02:28 CET)
Identifying groups that share common features among datasets through clustering analysis is a typical problem in many fields of science, particularly in post-omics and systems biology research. In this respect, quantifying how well a measure can cluster or organize intrinsic groups is important, since there is currently no statistical evaluation of how ordered a clustered vector is, or how much noise is embedded in it. Much of the literature focuses on how well the clustering algorithm orders the data, offering several external and internal statistical measures; but no measure has been developed to statistically quantify the noise in a vector arranged by a clustering algorithm, i.e., how much of the clustering is due to randomness. Here, we present a quantitative methodology, based on autocorrelation, to assess this problem.
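A minimal sketch of the autocorrelation idea: a vector well ordered by clustering has high lag-1 autocorrelation, while a randomly arranged one does not. The toy data and the lag-1 statistic are illustrative, not the authors' exact methodology.

```python
import numpy as np

def lag1_autocorr(v):
    # Lag-1 autocorrelation: high when neighbouring entries are similar,
    # i.e. when the vector is well ordered by the clustering.
    v = np.asarray(v, float)
    v = v - v.mean()
    return (v[:-1] * v[1:]).sum() / (v * v).sum()

rng = np.random.default_rng(0)
# Three intrinsic groups with some measurement noise
groups = np.repeat([0.0, 5.0, 10.0], 50) + rng.normal(0, 0.5, 150)

ordered = np.sort(groups)            # as if clustering recovered the groups
shuffled = rng.permutation(groups)   # as if clustering were random

print(round(lag1_autocorr(ordered), 3), round(lag1_autocorr(shuffled), 3))
```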
ARTICLE | doi:10.20944/preprints202108.0516.v1
Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: machine learning; time; naive bayes classification; recurrent neural networks; Twitter; social media data; automatic classification
Online: 27 August 2021 (11:23:50 CEST)
Machine learning (ML) is increasingly useful as data grow in volume and accessibility, as it can perform tasks (e.g. categorisation, decision making, anomaly detection) through experience and without explicit instruction, even when the data are too vast, complex, highly variable, or full of errors to be analysed in other ways. Thus, ML is well suited to natural language, images, and other complex and messy data available in large and growing volumes. Selecting an ML algorithm depends on many factors, as algorithms vary in the supervision needed, tolerable error levels, and ability to account for order or temporal context, among many other things. Importantly, ML methods for explicitly ordered or time-dependent data struggle with errors or data asymmetry. Most data are at least implicitly ordered, potentially allowing a hidden 'arrow of time' to affect non-temporal ML performance. This research explores the interaction of ML and implicit order by training two ML algorithms on Twitter data before performing automatic classification tasks under conditions that balance the volume and complexity of the data. Results show that performance was affected, suggesting that researchers should carefully consider time when selecting appropriate ML algorithms, even when time is only implicitly included.
ARTICLE | doi:10.20944/preprints202011.0451.v1
Subject: Computer Science And Mathematics, Algebra And Number Theory Keywords: Explainable AI; Cluster Analysis; Swarm Intelligence; Machine Learning System; High-Dimensional Data Visualization; Decision Trees
Online: 17 November 2020 (14:01:33 CET)
The understanding of water quality and its underlying processes is important for the protection of aquatic environments, and this study had the rare opportunity of access to a domain expert. Hence, an explainable AI (XAI) framework is proposed that is applicable to multivariate time series and produces explanations interpretable by a domain expert. In three steps, the XAI combines a data-driven choice of a distance measure with explainable cluster analysis through supervised decision trees. The multivariate time series consists of water quality measurements, including nitrate, electrical conductivity, and twelve other environmental parameters. The relationships between water quality and the environmental parameters are investigated by identifying similar days within a cluster and dissimilar days between clusters. The XAI does not depend on prior knowledge about the data structure, and its explanations tend to be contrastive. The relationships in the data can be visualized by a topographic map representing the high-dimensional structures. Two comparable decision-based XAIs were unable to provide meaningful and relevant explanations from the multivariate time series data. Open-source code in R for the three steps of the XAI framework is provided.
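The pipeline of clustering followed by a supervised explanation can be sketched as below; k-means and a single-split rule stand in for the paper's actual distance learning and decision trees, and the water-quality values are invented.

```python
import numpy as np

def kmeans(X, k=2, iters=50):
    # Deterministic init: first and last rows as starting centers
    centers = X[[0, len(X) - 1]]
    for _ in range(iters):
        labels = np.argmin(((X[:, None] - centers[None]) ** 2).sum(-1), axis=1)
        centers = np.array([X[labels == j].mean(0) for j in range(k)])
    return labels

def best_rule(X, labels, names):
    # One-split "decision tree": the single feature/threshold that best
    # separates the clusters, yielding a human-readable explanation.
    best = (None, None, 0.0)
    for j in range(X.shape[1]):
        for t in np.unique(X[:, j])[:-1]:
            left, right = labels[X[:, j] <= t], labels[X[:, j] > t]
            # Accuracy when each side predicts its majority cluster
            acc = (np.bincount(left).max() + np.bincount(right).max()) / len(labels)
            if acc > best[2]:
                best = (names[j], t, acc)
    return best

rng = np.random.default_rng(1)
# Hypothetical daily water-quality features: [nitrate, conductivity]
X = np.vstack([rng.normal([1.0, 300], [0.2, 20], (40, 2)),
               rng.normal([4.0, 600], [0.2, 20], (40, 2))])
labels = kmeans(X)
feat, thr, acc = best_rule(X, labels, ["nitrate", "conductivity"])
print(feat, round(acc, 2))
```

The printed rule ("days with nitrate below the threshold fall in one cluster") is the kind of contrastive, expert-readable explanation the framework aims for.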
ARTICLE | doi:10.20944/preprints202308.2174.v2
Subject: Environmental And Earth Sciences, Remote Sensing Keywords: Time-series; data availability; aggregation; long-term analyses
Online: 1 September 2023 (10:10:24 CEST)
Landsat and Sentinel-2 data archives provide ever-increasing amounts of satellite data for studying land cover and land use change (LCLUC) over the past four decades. However, the availability of cloud-, shadow-, and snow-free observations varies spatially and temporally due to climate and satellite data acquisition schemes. This spatio-temporal heterogeneity poses a major issue for some time-series analysis approaches, but can be addressed with pixel-based compositing, which generates temporally equidistant cloud-free or near-cloud-free synthetic images. Although much consideration is given to methods identifying the ‘best’ pixel value for each composite, determining the aggregation period receives less attention and is often done arbitrarily, or based on expert intuition. Here, we evaluated data compositing windows ranging from five days to one year for 1984-2021 Landsat and 2015-2021 Sentinel-2 time series across Europe. We considered separate and joint use of both data archives and analyzed the spatio-temporal availability of composites during each calendar year and pixel-specific growing season. We reported mean annual composite availability, investigating differences among biogeographical regions; checked the feasibility of pan-European analyses for three LCLUC applications based on annual, monthly, and 10-day composites; and analyzed the shortest feasible compositing window ensuring ≥50% temporal data availability and interpolation of the remaining composites for individual years and across a variety of medium- and long-term time windows. Our results highlighted low data coverage in the 1980s, 1990s, and in 2012, as well as spatial variability in data availability driven by climate and orbit overlaps, which altogether impact the spatio-temporal consistency of medium- and long-term time series, limiting the feasibility of some LCLUC analyses.
We demonstrated that prior to 2011, monthly composites ensured overall 50-62% data coverage for each calendar year, and ~75% afterwards, with a further increase to ~82% when Landsat and Sentinel-2 were combined. The temporal consistency of monthly composites was overall low, and temporal interpolation, filling up to 50% of missing data each year and across a time window of interest, ensured the feasibility of analyses. Applications based on shorter-than-monthly composites were challenging without joining the Landsat and Sentinel-2 archives after 2015, and beyond the Mediterranean biogeographical region. Using pixel-specific growing-season data typically boosted data availability in most geographies and diminished most of the latitudinal differences, but complete time series with sub-monthly compositing windows were still restricted to the most recent years and required data interpolation. Overall, our analyses provided a detailed assessment of Landsat and Sentinel-2 data availability over Europe and, based on selected application examples, highlighted the often lacking spatio-temporal consistency of time series with sub-monthly compositing windows and long time periods, which might hinder the feasibility of some LCLUC applications.
ARTICLE | doi:10.20944/preprints202305.1245.v1
Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Deep neural networks; Activation functions; Multiclass classification; Time-Series prediction; Reuters data; Energy trade value
Online: 17 May 2023 (11:11:21 CEST)
Deep learning has been applied in many areas and has had a significant impact on applications that address real-life challenges. The success of deep learning in a wide range of areas is due in part to the use of activation functions, which are particularly effective at solving non-linear problems. Activation functions are a key focus for researchers in artificial intelligence who aim to improve the performance of neural networks. This article provides a comprehensive explanation and comparison of different activation functions, with a specific focus on the arc tangent and its variations. The paper presents experimental results showing that variations of the arc tangent using irrational numbers such as pi, the golden ratio, and Euler’s number, as well as a self-arctan function, produce promising results. Having experimented with the promising activation functions on two different problems and datasets, we found that different irrationals work well for different problems: arctan ϕ mostly gives the best results for multiclass classification, and arctan e gives the best results for time-series prediction problems. The paper focuses on a multiclass classification problem applied to the Reuters Newswire dataset and a time-series prediction problem on Türkiye's energy trade value to show the impacts of activation functions.
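The activation family discussed above can be sketched as below; the exact formulas (a constant-scaled arctan, and x·arctan(x) for "self-arctan") are our reading of the abstract, not the paper's code.

```python
import numpy as np

def arctan_act(x, c=1.0):
    # Arc-tangent activation with the input scaled by an irrational
    # constant c (pi, the golden ratio phi, or Euler's number e).
    return np.arctan(c * x)

def self_arctan(x):
    # "Self-arctan": the input multiplied by its own arc tangent.
    return x * np.arctan(x)

x = np.linspace(-3, 3, 7)
phi = (1 + 5 ** 0.5) / 2
for name, c in [("arctan_pi", np.pi), ("arctan_phi", phi), ("arctan_e", np.e)]:
    y = arctan_act(x, c)
    # Like tanh, each variant is bounded, here by (-pi/2, pi/2)
    print(name, np.round(y, 3))
print("self_arctan", np.round(self_arctan(x), 3))
```

Scaling by a larger constant steepens the curve near zero while keeping the same saturation bounds, which is the knob the compared variants turn.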
ARTICLE | doi:10.20944/preprints201608.0232.v2
Subject: Medicine And Pharmacology, Pulmonary And Respiratory Medicine Keywords: mHealth; ODK scan; mobile health application; digitizing data collection; data management processes; paper-to-digital system; technology-assisted data management; treatment adherence
Online: 2 September 2016 (03:17:38 CEST)
The present grievous tuberculosis situation can be improved by efficient case management and timely follow-up evaluations. With the advent of digital technology, this can be achieved through quick summarization of patient-centric data. The aim of our study was to assess the effectiveness of the ODK Scan paper-to-digital system during a three-month testing period. A sequential, explanatory mixed-method research approach was employed to elucidate technology use. Training, smartphones, the application, and 3G-enabled SIMs were provided to the four field workers. At the beginning, baseline measures of the data management aspects were recorded and later compared with endline measures to see the impact of ODK Scan. Additionally, at the end, users’ feedback was collected regarding app usability, user interface design, and workflow changes. In total, 122 patients’ records were retrieved from the server and analysed for quality. It was found that ODK Scan correctly recognized 99.2% of multiple-choice bubble responses and 79.4% of numerical digit responses. However, the overall quality of the digital data was lower than that of manually entered data. Using ODK Scan, a significant time reduction was observed in data aggregation and data transfer activities; however, data verification and form filling took more time. Interviews revealed that field workers saw value in using ODK Scan but were concerned about its time-consuming aspects. Therefore, it is concluded that minimal disturbance to the existing workflow, continuous feedback, and value additions are important considerations for the implementing organization to ensure technology adoption and workflow improvements.
ARTICLE | doi:10.20944/preprints201701.0080.v1
Subject: Engineering, Electrical And Electronic Engineering Keywords: wind turbine; failure detection; SCADA data; feature extraction; mutual information; copula
Online: 17 January 2017 (11:21:58 CET)
More and more works adopt machine learning techniques with supervisory control and data acquisition (SCADA) systems for wind turbine anomaly or failure detection. While parameter selection is important for modelling a wind turbine's health condition, only a few papers have been published focusing on this issue, and in those papers the interconnections among sub-components in a wind turbine are used to address the problem. However, relying merely on the interconnections for decision making is sometimes too general to provide a parameter list that accounts for the differences between SCADA datasets. In this paper, a method is proposed to provide more detailed suggestions on parameter selection based on mutual information. Moreover, after proving that a copula, a multivariate probability distribution for which the marginal probability distribution of each variable is uniform, is capable of simplifying the estimation of mutual information, an empirical-copula-based mutual information estimation method (ECMI) is introduced for application. A real SCADA dataset is then adopted to test the method, and the results show the effectiveness of the ECMI in providing parameter selection suggestions when physical knowledge is not accurate enough.
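The empirical-copula step can be sketched as a rank transform followed by a plug-in mutual-information estimate. The binning and the synthetic SCADA-like variables are illustrative assumptions, not the paper's ECMI implementation.

```python
import numpy as np

def empirical_copula(u):
    # Rank-transform each column to (0,1): the empirical copula
    # observations, which make the MI estimate invariant to any
    # monotone rescaling of the raw SCADA channels.
    n = len(u)
    return (np.argsort(np.argsort(u, axis=0), axis=0) + 1) / (n + 1)

def mutual_information(x, y, bins=8):
    # Histogram plug-in MI (in nats) on the copula scale
    c = empirical_copula(np.column_stack([x, y]))
    pxy, _, _ = np.histogram2d(c[:, 0], c[:, 1], bins=bins, range=[[0, 1], [0, 1]])
    pxy /= pxy.sum()
    px, py = pxy.sum(1), pxy.sum(0)
    nz = pxy > 0
    return float((pxy[nz] * np.log(pxy[nz] / np.outer(px, py)[nz])).sum())

rng = np.random.default_rng(0)
wind_speed = rng.uniform(3, 15, 2000)
power = wind_speed ** 3 + rng.normal(0, 50, 2000)   # strongly dependent
ambient = rng.uniform(0, 30, 2000)                  # independent of power

mi_dep = mutual_information(wind_speed, power)
mi_ind = mutual_information(ambient, power)
print(round(mi_dep, 2), round(mi_ind, 2))
```

A parameter-selection rule would then keep channels whose MI with the target exceeds some threshold, as the wind-speed channel does here.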
ARTICLE | doi:10.20944/preprints201701.0079.v1
Subject: Engineering, Electrical And Electronic Engineering Keywords: accessibility; offshore; operation and maintenance; weather condition; Markov chain; data visualization
Online: 17 January 2017 (11:17:32 CET)
For offshore wind power generation, accessibility is one of the main factors with a great impact on operation and maintenance, due to weather constraints on marine transportation. This paper presents a framework to explore the accessibility of an offshore site. First, several maintenance types are defined and taken into account. Next, a data visualization procedure is introduced to provide insight into the distribution of access periods over time. Then, a rigorous mathematical method based on a finite-state Markov chain is proposed to assess the accessibility of an offshore site from the maintenance perspective. Five years of weather data from a marine site are used to demonstrate the applicability and the outcomes of the proposed method. The main findings show that the proposed framework is effective in investigating accessibility at different time scales and is able to capture the patterns of the distribution of the access periods. Moreover, based on the developed Markov chain, the average waiting time for a certain access period can be estimated. With more information on the maintenance of an offshore wind farm, the expected production loss due to time delay can be calculated.
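A two-state weather chain illustrates the kind of finite-state Markov model described above; the transition probabilities are placeholders, not values from the paper's five-year dataset.

```python
import numpy as np

# Two-state weather chain: state 0 = site accessible, 1 = inaccessible.
# Transition probabilities are made-up placeholders.
P = np.array([[0.9, 0.1],
              [0.3, 0.7]])

# Stationary distribution: left eigenvector of P for eigenvalue 1,
# i.e. the long-run fraction of time spent in each state
vals, vecs = np.linalg.eig(P.T)
pi = np.real(vecs[:, np.argmax(np.real(vals))])
pi /= pi.sum()

# Mean sojourn in the inaccessible state (geometric holding time):
# 1 / P(leave the state), the "average waiting time" for access
mean_wait = 1 / P[1, 0]

print(np.round(pi, 3), round(mean_wait, 3))
```

With these placeholder numbers the site is accessible 75% of the time, and a maintenance crew waits on average about 3.3 time steps for weather to clear.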
ARTICLE | doi:10.20944/preprints201806.0082.v1
Subject: Engineering, Energy And Fuel Technology Keywords: wind energy; wind turbines; supervisory control and data acquisition; retrofitting; performance evaluation
Online: 6 June 2018 (10:17:12 CEST)
Wind turbine upgrades have been spreading in recent years in the wind energy industry, with the aim of optimizing the efficiency of wind kinetic energy conversion. These interventions have material and labor costs, and it is therefore fundamental to estimate the production improvement realistically. Further, retrofitting wind turbines sited in harsh environments might exacerbate the stresses to which the turbines are subjected and consequently affect their residual lifetime. This work deals with a case of retrofitting: the testing ground is a multi-megawatt wind turbine from a wind farm sited in very complex terrain. The blades have been optimized by installing vortex generators and passive flow control devices. The complexity of this test case, dictated by the environment and by the features of the available data set, inspires the formulation of a general method for estimating production upgrades, based on multivariate linear modeling of the power output of the upgraded wind turbine. The method is a distinctive part of the outcome of this work because it is generalizable to the study of any wind turbine upgrade and is adaptable to the features of the available data sets. In particular, applying this model to the test case of interest, it emerges that the upgrade increases the annual energy production of the wind turbine by an amount on the order of 2%. This quantity is of the same order of magnitude as, albeit non-negligibly lower than, the estimate based on the assumption of ideal wind conditions. Therefore, complex wind conditions might affect the efficiency of wind turbine upgrades, and it is important to estimate their impact using data from wind turbines operating in the real environment of interest.
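The estimation logic (fit a multivariate linear baseline on pre-upgrade data, then compare post-upgrade production against the baseline's prediction) can be sketched on synthetic data; the features, coefficients, and the 2% injected gain are all invented.

```python
import numpy as np

rng = np.random.default_rng(0)

def features(n):
    # Hypothetical operation variables: wind speed and ambient temperature,
    # expanded with a quadratic wind term for the multivariate linear model
    ws = rng.weibull(2, n) * 8
    temp = rng.uniform(0, 25, n)
    return np.column_stack([np.ones(n), ws, ws ** 2, temp])

true_beta = np.array([5, 20, 6, -1.5])

# "Pre-upgrade" period: fit a multivariate linear model of power output
Xpre = features(2000)
power_pre = Xpre @ true_beta + rng.normal(0, 10, 2000)
beta, *_ = np.linalg.lstsq(Xpre, power_pre, rcond=None)

# "Post-upgrade" period: actual output is 2% above the pre-upgrade baseline
Xpost = features(2000)
power_post = 1.02 * (Xpost @ true_beta) + rng.normal(0, 10, 2000)

# Upgrade gain: actual post-upgrade energy vs. the baseline model's prediction
gain = power_post.sum() / (Xpost @ beta).sum() - 1
print(f"estimated upgrade gain: {gain:.1%}")
```

Because the baseline model is conditioned on the actually observed wind, the estimate reflects the real site conditions rather than ideal ones.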
ARTICLE | doi:10.20944/preprints202304.0079.v1
Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Deep neural networks; Activation functions; Multi-class classification; Time-Series prediction; Reuters data; Energy trade value
Online: 6 April 2023 (08:55:22 CEST)
Deep learning has been applied in many areas and has had a significant impact on applications that address real-life challenges. The success of deep learning in a wide range of areas is due in part to the use of activation functions, which are particularly effective at solving non-linear problems. Activation functions are a key focus for researchers in artificial intelligence who aim to improve the performance of neural networks. This article provides a comprehensive explanation and comparison of different activation functions, with a specific focus on the arc tangent and its variations. The paper presents experimental results showing that variations of the arc tangent using irrational numbers such as pi, the golden ratio, and Euler’s number, as well as a self-arctan function, produce promising results. Having experimented with the promising activation functions on two different problems and datasets, we found that different irrationals work well for different problems: arctan ϕ mostly gives the best results for multiclass classification, and arctan e gives the best results for time-series prediction problems. The paper focuses on a multi-class classification problem applied to the Reuters Newswire dataset and a time-series prediction problem on Türkiye's energy trade value to show the impacts of activation functions.
ARTICLE | doi:10.20944/preprints202104.0745.v1
Subject: Computer Science And Mathematics, Algebra And Number Theory Keywords: green effect; pipelines; remote monitoring; data analysis; machine learning; time series
Online: 28 April 2021 (10:39:31 CEST)
The extensive but remote oil and gas fields of the United States, Canada, and Russia require the construction and operation of extremely long pipelines. Global warming and local heating effects lead to rising soil temperatures and thus a reduction in the sub-grade capacity of the soils; this causes changes in the spatial positions and forms of the pipelines, consequently increasing the number of accidents. Oil operators are compelled to monitor the soil temperature along the routes of remote pipelines in order to perform remedial measures in time. They are therefore seeking methods for the analysis of voluminous diagnostic information. To forecast soil temperatures at different depths, we propose compiling a multidimensional dataset; computing descriptive statistics; selecting uncorrelated time series; generating synthetic features; robustly scaling the temperature series; and tuning an additive regression model to forecast soil temperatures.
ARTICLE | doi:10.20944/preprints201703.0213.v1
Subject: Engineering, Transportation Science And Technology Keywords: travel time predictability; multiple entropy; travel time series; vehicle trajectory data
Online: 28 March 2017 (17:22:03 CEST)
With the great development of intelligent transportation systems (ITS), travel time prediction has attracted the attention of many researchers, and a large number of prediction methods have been developed. However, the predictability of travel time series, the basic premise for travel time prediction, has received less attention than the methodology. Based on an analysis of the complexity of travel time series, this paper defines travel time predictability to express the probability of correct travel time prediction and proposes an entropy-based method to measure the upper bound of travel time predictability. Multiscale entropy is employed to quantify the complexity of travel time series, and the relationships between entropy and the upper bound of travel time predictability are presented. Empirical studies are conducted with vehicle trajectory data from an express road section. The effects of time scale, tolerance, and series length on entropy and travel time predictability are analysed, and some valuable suggestions about the accuracy of travel time predictability are discussed. Finally, comparisons between travel time predictability and actual prediction results from two prediction models, ARIMA and BPNN, are conducted. Experimental results demonstrate the validity and reliability of the proposed travel time predictability measure.
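Sample entropy, the building block of the multiscale entropy used above, can be sketched as follows; the parameters m and r and the toy series are illustrative, not the paper's settings. A regular series (low entropy) is more predictable than a noisy one (high entropy).

```python
import numpy as np

def sample_entropy(x, m=2, r=0.2):
    # SampEn: -log of the chance that template sequences matching for m
    # points (within tolerance r*std) also match for m+1 points. Lower
    # entropy means a more regular, hence more predictable, series.
    x = np.asarray(x, float)
    r = r * x.std()
    def count(mm):
        templ = np.lib.stride_tricks.sliding_window_view(x, mm)
        d = np.abs(templ[:, None, :] - templ[None, :, :]).max(-1)
        return (d <= r).sum() - len(templ)  # exclude self-matches
    return -np.log(count(m + 1) / count(m))

rng = np.random.default_rng(0)
t = np.arange(500)
periodic = np.sin(2 * np.pi * t / 25)   # regular "travel time" pattern
noisy = rng.normal(size=500)            # unpredictable series

print(round(sample_entropy(periodic), 2), round(sample_entropy(noisy), 2))
```

Computing this over coarse-grained versions of the series (averaging blocks of 2, 3, ... samples) gives the multiscale entropy profile the paper relates to the predictability upper bound.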
ARTICLE | doi:10.20944/preprints201801.0139.v1
Subject: Environmental And Earth Sciences, Remote Sensing Keywords: data logger; environmental monitoring network; open source; submersible; under-water; critical zone observatory; cave; Yucatan Peninsula; vadose hydrology; subterranean karst estuary
Online: 16 January 2018 (10:40:15 CET)
A low-cost data logging platform is presented for environmental monitoring projects that provides long-term operation in remote or submerged environments. Three premade “breakout boards” from the open-source Arduino ecosystem are assembled into the core of the platform. The components are selected based on low-cost and ready availability, making the loggers easy to build and modify without specialized tools, or a significant background in electronics. Power optimization techniques are explained. The platform has proven to be highly reliable, and capable of operating for more than a year on standard AA batteries. The flexibility of the system is illustrated with two ongoing field studies recording drip rates in a cave, and water flow in a flooded cave system.