Search | Preprints.org

Working Paper ARTICLE

Predicting Spatial Crime Occurrences through an Efficient Ensemble-Learning Model

Yasmine Lamari, Bartol Freskura, Anass Abdessamad, Sarah Eichberg, Simon de Bonviller

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Crime prediction; Ensemble Learning; Machine Learning; Regression

Online: 14 September 2020 (00:53:30 CEST)

Show abstract| Download PDF| Supplementary Files| Share

Preprint ARTICLE | doi:10.20944/preprints202010.0550.v2

Conditional Mixture Model and Its Application for Regression Model

Loc Nguyen

Subject: Computer Science And Mathematics, Probability And Statistics Keywords: expectation maximization (EM) algorithm; finite mixture model; conditional mixture model; regression model; adaptive regressive model (ARM)

Online: 28 October 2020 (11:18:04 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202310.0432.v1

Economic Analysis of a Model For Forecasting Public Expenditures And Revenues

Ivan Milojevic, Miloš Krstić, Nemanja Pantic, Milan Mihajlovic

Subject: Computer Science And Mathematics, Applied Mathematics Keywords: European Union; public revenues; public expenditures; regression analysis

Online: 8 October 2023 (10:08:59 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202401.1281.v1

Probabilistic Forecasting of Lightning Strikes over Continental US and Alaska: Model Development and Verification

Ned Nikolov, Phillip Bothwell, John Snook

Subject: Environmental And Earth Sciences, Atmospheric Science And Meteorology Keywords: lightning; model; logistic regression; forecast; prediction; wildfire; probability

Online: 17 January 2024 (08:38:15 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202011.0363.v1

QSAR Model for Predicting the Cannabinoid Receptor 1 Binding Affinity and Dependence Potential of Synthetic Cannabinoids

Wonyoung Lee, So-Jung Park, Ji-Young Hwang, Kwang-Hyun Hur, Yong Sup Lee, Jongmin Kim, Xiaodi Zhao, Aekyung Park, Kyung Hoon Min, Choon-Gon Jang, Hyun-Ju Park

Subject: Chemistry And Materials Science, Analytical Chemistry Keywords: cannabinoid receptor 1; synthetic cannabinoids; quantitative structure-activity relationship; multiple linear regression; partial least squares regression; dependence and abuse potential

Online: 13 November 2020 (07:19:36 CET)

Show abstract| Download PDF| Supplementary Files| Share

Preprint ARTICLE | doi:10.20944/preprints202002.0069.v1

Gaussian Process Prediction Model to Estimate Excess Adsorption Capacity of Supercritical CO₂

Narjes Nabipour, Sultan Noman Qasem, Amir Mosavi, Shahab Shamshirband

Subject: Computer Science And Mathematics, Applied Mathematics Keywords: coal; supercritical CO2; Gaussian process regression; machine learning; adsorption model

Online: 5 February 2020 (14:09:33 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202302.0083.v2

Predictive Models for Dissolved Oxygen in an Urban Lake by Regression Analysis and Artificial Neural Network

Abu Selim, Shah Newaz Alam Shuvo, Mohammad Moniruzzaman, M. M. Islam, Sakifa Shah, Md Ohiduzzaman

Subject: Environmental And Earth Sciences, Environmental Science Keywords: Multilinear Regression; Dissolve Oxygen; Modeling; Machine Learning; Levenberg–Marquardt algorithm; ANN; Urban Lake

Online: 27 February 2023 (07:25:06 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints201803.0093.v1

A Closed-Form Error Model of Straight Lines for Improved Data Association and Sensor Fusing

Volker Sommer

Subject: Engineering, Control And Systems Engineering Keywords: linear regression; covariance matrix; data association; sensor fusing; SLAM

Online: 13 March 2018 (04:06:56 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202307.0405.v1

Selection of A New Biasing Parameter for the Jackknife Kibria-Lukman Estimator for the Negative Binomial Regression Model

Oranye Henrietta E, Adejuwon Samuel O, Arum Kingsley C, Ugwuowo Fidelis I, Ugah Tobias E, Adegoke Taiwo M, Sule Omeiza B

Subject: Computer Science And Mathematics, Applied Mathematics Keywords: Jackknife; Kibria-Lukman; estimator; Maximum Likelihood; Negative Binomial regression

Online: 6 July 2023 (08:58:10 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202310.0871.v1

Modeling and Prediction of Thermal Deformation Errors in Fiber Optic Gyroscopes Based on the TD-Model

Jintao Xu, Ailing Tian, Hui Liu, Ying Liu

Subject: Engineering, Aerospace Engineering Keywords: fiber optic gyroscope; thermal errors; prediction model; overfitting; biased regression

Online: 13 October 2023 (08:18:22 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202309.1143.v1

Estimation of Threshold Age for Cerebral Decline Using Sigmoidal Growth Model in Cross-Sectional Imaging Study

Namhee Kim, Moonseong Heo, Roman Fleysher, Malka Z. Sears, Michael L. Lipton

Subject: Public Health And Healthcare, Public Health And Health Services Keywords: aging; sigmoidal growth function; nonlinear regression; threshold estimation; fractional anisotropy

Online: 18 September 2023 (14:27:39 CEST)

Show abstract| Download PDF| Supplementary Files| Share

Preprint ARTICLE | doi:10.20944/preprints202306.1048.v1

Multiple Linear Regression Method to Constitute Permeability Model of Liquid Microcapsule Used for Electrolytic Co-deposition

Xiuqing Xu, Fagen Li, Xuehui Zhao, Fang Yang

Subject: Chemistry And Materials Science, Materials Science And Technology Keywords: Liquid microcapsule; Multiple linear regression; Permeability experiments; Permeability model; Predictive capacity

Online: 14 June 2023 (12:27:15 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202011.0266.v1

Conditional Mixture Model for Modeling Attributed Dyadic Data

Loc Nguyen

Subject: Computer Science And Mathematics, Algebra And Number Theory Keywords: dyadic data; co-occurrence data; attributed dyadic data (ADD); mixture model; conditional mixture model (CMM); regression model

Online: 9 November 2020 (08:48:40 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202104.0592.v1

A Discrete Gamma Model Approach to Flexible Count Regression Analysis: Maximum Likelihood Inference

Chénagnon Frédéric Tovissodé, Romain Lucas Glèlè Kakaï

Subject: Computer Science And Mathematics, Discrete Mathematics And Combinatorics Keywords: Flexible count regression; balanced discrete gamma distribution; deviance statistic; latent equidispersion; likelihood ratio

Online: 22 April 2021 (08:55:29 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202401.2062.v1

Estimation of Buildings’ Energy Efficiency via a Surrogate Regression Model

Luis G. R. Santos, Ido Nevat, Jordan Ivanchev, Mathias Niffeler

Subject: Engineering, Civil Engineering Keywords: Surrogate Model; Multiple Linear Regression; Energy Efficiency; Machine Learning; Energy use Intensity; Building Energy; Data Generation.

Online: 30 January 2024 (04:07:08 CET)

Show abstract| Download PDF| Share

Building energy demand impacts a myriad of interconnected economic, societal, and environmental aspects. As a result, Buildings Energy Models (BEM) play an important role in the process of urban design and planning. While previous studies have investigated the effects of building interventions on energy efficiency, their applicability may be limited due to the BEM’s high computational complexity. This limits their ability to systematically study important aspects of energy demand on a large scale. The development of Machine Learning Models (MLM) allows to design the required detailed analysis and solutions, while reducing the computational burden, making MLM attractive for urban designers. The capability of MLM to generalize well for multiple contexts (in our case, multiple buildings) is a crucial contributor to their applicability. However, the validation process in a wider context is often overlooked, therefore its generalization capabilities are not quantified. In this paper, we present a framework to train and validate a surrogate model derived from a physics-based BEM. Our method employs a Multiple Linear Regression model to predict Energy Use Intensity (EUI) for office buildings in Singapore using 36 input parameters (covariates), based on a training dataset of 23,000 samples. Model validation is performed by comparing the results of the Surrogate Model (SM) to a widely used BEM for a sample of 120 buildings. Our results indicate that the SM has an accuracy of NRMSE of 13%, NMBE of −3.56%, and R2 of 0.92, which suggests it can effectively and accurately predict building EUI. We also conduct a sensitivity analysis, which indicates that the parameters associated with internal loads and internal space usage are the most influential. Additionally, we present a reduced order model trained with only the 11 most influential parameters, which exhibits negligible loss in accuracy compared to the full SM while providing reduced complexity. Finally, we demonstrate an application of our SM to evaluate energy efficiency under uncertainty scenarios. The analytically derived results indicate a potential reduction of EUI of offices in Singapore from 227kWh/m2 to 99kWh/m2 by altering the building parameters that were identified as most influential.

Preprint ARTICLE | doi:10.20944/preprints202108.0178.v1

Detection of Influential Observations in Spatial Regression Model Based on Outliers and Bad Leverage Classification

Ali Mohammed Baba, Habshah Midi, Mohd Bakri Adam, Nur Haizum Bint Abd Rahman

Subject: Computer Science And Mathematics, Probability And Statistics Keywords: Spatial regression model; Influential observation; Outlier; Leverage; prediction residual; Masking and swamping; Diagnostic

Online: 9 August 2021 (07:57:56 CEST)

Show abstract| Download PDF| Share

Preprint COMMUNICATION | doi:10.20944/preprints202308.0893.v2

A Soft Sensor Model based on ISOA-GPR Weighted Ensemble Learning for Marine Lysozyme Fermentation Process

Na Lu, Bo Wang, Xiang lin Zhu

Subject: Engineering, Control And Systems Engineering Keywords: marine lysozyme; seagull optimization algorithm; Gaussian process regression; soft sensor; gray correlation analysis

Online: 17 October 2023 (14:05:08 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202403.1392.v1

A Completed Proved of Rapid Nondestructive Prediction Model of Wood Chip Biomass Higher Heating Value Ready for Industrial Updating

Bijendra Shrestha, Thitima Phanomsophon, Zenisha Shrestha, Jetsada Posom, Panmanas Sirisomboon, Bim Prasad Shrestha, Pimpen Pornchaloempong, Hidayah Ariffin

Subject: Engineering, Energy And Fuel Technology Keywords: Biomass energy; Higher Heating Value; Near-infrared spectroscopy; Partial least squares regression

Online: 22 March 2024 (15:01:39 CET)

Show abstract| Download PDF| Share

Nepal, primarily an agricultural country, heavily relies on agricultural residue and fuelwood for daily energy requirements. In 2022, total energy consumption was 640 PJ, with traditional sources accounting for 64.17%, and fuelwood comprising 58.53% of total fuel consumption. The estimated potential supply of agricultural residue is 26 million tonnes, yielding about 442 million GJ of energy in 2021. Biomass trading often emphasizes volume or weight, necessitating a rapid and non-destructive assessment of energy properties for mutual benefit, aiding in the identification, management, and utilization of biomass sources. In this study, 200 biomass samples were collected in two batches (126 and 74 samples) from various locations in Nepal. Using Partial Least Squares Regression (PLSR), a model was developed correlating higher heating value (HHV) from a bomb calorimeter and spectral data from Fourier Transform near-infrared spectroscopy (FT-NIRS) (3595 – 12,489 cm-1) sensor. PLSR models incorporated raw spectra, eight preprocessing techniques, the multi-preprocessing five-range method, and a genetic algorithm. Outliers in the first batch were identified, and the first batch divided into an 80% calibration set and a 20% validation set, while the second batch was designated as an unknown sample set. The optimum PLSR model, utilizing first derivative preprocessing, improved accuracy by 6.77%, with coefficients of determination in the calibration set, validation set, and unknown set as 0.9694, 0.9578, and 0.8089, respectively. Root mean square errors were 132.4790 J/g, 189.4800 J/g, and 360.8845 J/g for the calibration set (RMSEC), validation set (RMSEP), and unknown set (RMSEUN), respectively. The prediction to deviation ratio (RPD) for the validation set and unknown set was 4.9 and 2.4, respectively. The cross validation model of combined sample data of every sets showed the R2CV of 0.95 and RPDCV of 4.6 indicating the model could serve as a reliable and swift non-destructive alternative for evaluating biomass HHV using NIRS and ready for updating for industrial use. However, incorporating a larger number of representative samples is crucial to enhance accuracy and develop a more comprehensive global model for predicting biomass HHV.

Preprint ARTICLE | doi:10.20944/preprints201702.0032.v1

Estimating Recreational Value of Foy's Lake: An Application of Travel Cost Count Data Model for Truncated Zeros

Md. Touhidul Alam, Anis-Ul-Ekram Chowdury, Md. Sajib Hossian

Subject: Business, Economics And Management, Economics Keywords: individual travel cost method; zero truncated poisson regression model; endogenous stratification; consumer surplus

Online: 10 February 2017 (11:10:04 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202012.0650.v1

Evaluating and Zoning Flood Susceptibility Using Curve Number (CN) Logistic and Hydrological Regression Model (Case Study of Kalateh Qanbar Drainage Basin, Nishabur)

Mahnaz Naemitabar, Mohammad Ali Zangeneh Asadi, Abolghasem Amirahmadi, Leila Goli Mokhtari

Subject: Environmental And Earth Sciences, Atmospheric Science And Meteorology Keywords: flood proneness; zoning, CN hydrologic model; curve number (CN); logistic regression

Online: 25 December 2020 (10:36:39 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202402.0198.v1

Extending a Global Climate-Population Model to Simulate Impacts on Human Well-Being

Jack Homer

Subject: Social Sciences, Geography, Planning And Development Keywords: Climate change; well-being; population displacement; governance; tipping points; statistical regression; global modeling; simulation; system dynamics

Online: 5 February 2024 (03:43:30 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints201801.0275.v1

Development of a Regional Lidar-Derived Forest Inventory Model with Bayesian Model Averaging for use in Ponderosa Pine and Mixed Conifer Forests in Arizona and New Mexico, USA

Karis Tenneson, Matthew S. Patterson, Thomas Mellin, Mark Nigrelli, Peter Joria, Brent Mitchell

Subject: Environmental And Earth Sciences, Environmental Science Keywords: forest biomass; aboveground biomass; airborne lidar; monitoring; regional forest inventory; variable selection; Bayesian model averaging; multiple linear regression

Online: 30 January 2018 (04:05:36 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints201607.0056.v1

Dynamic simulation of urban expansion based on cellular automata and logistic regression model: Case study of Hyrcanian region of Iran

Meisam Jafari, Seyed Masoud Monavari, Hamid Majedi, Ali Asghar Alesheikh, Mirmasoud Kheirkhah Zarkesh

Subject: Environmental And Earth Sciences, Environmental Science Keywords: Land use change; urban sprawl; Logistic regression; Markov chain; Cellular automata; Gilan Province

Online: 18 July 2016 (11:53:16 CEST)

Show abstract| Download PDF| Share

Although, promotion of urbanization culture in recent decades has made inevitable development of cities in the world, however, the development can be guided in a direction that leave, to the extent possible, minimum socioeconomic and environmental impacts. For this, it is required to first forecast auto-spreading orientation of cities and suburbs in rural areas over time and then avoid shapeless growth of cities. This paper is an attempt to develop a dynamic hybrid model based on logistic regression (LR), Markov chain (MC), and cellular automata (CA) for prediction of future urban sprawl in fast-growing cities. The model was developed using 12 widely-used urban development criteria, whose significant coefficient was determined by logistic regression, and validated by relative operating characteristic (ROC) analysis. The validated model was run in Guilan, a tourist province in northern Iran with a very high rate of urban development. For this, changes in the area of urban land use were detected over the period of 1989 to 2013 and then, future sprawl of the province was forecasted by the years 2025 and 2037. The analysis results revealed that the area of urban land use was increased by more than 1.7 % from 36012.5 ha in 1989 to 59754.8 ha in 2013, and the area of Caspian Hyrcanian forestland was reduced by 31628 ha. The results also predicted an alarming increase in the rate of urban development in the province by the years 2025 and 2037, during which urban land use is predicted to develop 0.9 % and 1.38 %, respectively. The development pattern is expected to be uneven and scattered, without following any particular direction. The development will occur close to the existing or newly-formed urban basements as well as around major roads and commercial areas. This development, if not controlled, will lead to the loss of 13863 ha of Hyrcanian forests and if the trend continues, 21013 ha of Hyrcanian forests and 20208 ha of Barren/open lands are expected to be destroyed by the year 2037. In general, the proposed model is an efficient tool for the support of urban planning decisions and facilitates the process of sustainable development of cities by providing decision-makers with an overview on future development of cities where the growth rate is very fast.

Preprint CASE REPORT | doi:10.20944/preprints202403.0899.v1

A Study on the Factors for the Occurrence of Vacant Homes in Medium Cities and Characteristics of Each Types – Focusing on Asan City, Chungcheongnam-Do

Jeonghyeon Choi, Seung-Seok Han, Myung-je Woo

Subject: Social Sciences, Urban Studies And Planning Keywords: vacant home; poisson; negative binomial regression; spatial regression

Online: 15 March 2024 (07:48:10 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202306.1849.v1

Prediction Mechanical Properties of Magnesium Matrix Composites with Regression Models by Machine Learning

Song-Jeng Huang, Yudhistira Adityawardhana, Jeffry Sanjaya

Subject: Engineering, Mechanical Engineering Keywords: Machine Learning; Regression Model; XGBoost Regression; Yield Strength

Online: 27 June 2023 (05:25:11 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints201812.0237.v1

Recovery of Differential Equations from Impulse Response Time Series Data for Model Identification and Feature Extraction

Merten Stender, Sebastian Oberst, Norbert Hoffmann

Subject: Engineering, Mechanical Engineering Keywords: signal processing; sparse regression; system identification; impulse response; optimization; feature generation; structural dynamics; time series classification

Online: 19 December 2018 (16:21:41 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202311.0145.v1

Predictive Model for Occurrence of Febrile Neutropenia after Chemotherapy in Patients with Diffuse Large B-Cell Lymphoma: A Multicenter, Retrospective, Observational Study

Masaya Morimoto, Yuma Yokoya, Kikuaki Yoshida, Hideki Kosako, Yoshikazu Hori, Toshiki Mushino, Shinobu Tamura, Reiko Ito, Ryosuke Koyamada, Takuya Yamashita, Shinichiro Mori, Nobuyoshi Mori, Sachiko Ohde

Subject: Medicine And Pharmacology, Hematology Keywords: febrile neutropenia; chemotherapy; diffuse large B-cell lymphoma; outcomes; multivariate logistic regression model

Online: 2 November 2023 (10:24:29 CET)

Show abstract| Download PDF| Supplementary Files| Share

Preprint ARTICLE | doi:10.20944/preprints201907.0118.v1

Noise Disturbances and Calls for Police Service in València (Spain): a Logistic Model with Spatial and Temporal Effects

Lia Seguí, Adina Iftimi, Álvaro Briz-Redón, Lucía Martínez-Garay, Francisco Montes

Subject: Computer Science And Mathematics, Probability And Statistics Keywords: noise disturbances; residents complaints; logistic regression; spatio-temporal effects; socio-demographic and environmental effects; GIS

Online: 8 July 2019 (12:42:05 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints201807.0215.v1

High-dimensional Probabilistic Fingerprinting in Wireless Sensor Networks based on a Multivariate Gaussian Mixture Model

Yan Li, Simon Williams, Bill Moran, Allison Kealy, Guenther Retscher

Subject: Engineering, Electrical And Electronic Engineering Keywords: multivariate gaussian mixture model (MVGMM); multivariate linear regression; expectation-maximization imputation; WiFi localization; hidden markov model (HMM)

Online: 12 July 2018 (08:24:06 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202307.0679.v1

A Deep Learning Architecture for Detecting SQL Injection Attacks based on RNN Autoencoder Model

Maha Alghawazi, Daniyal Alghazzawi, Suaad Alarifi

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: SQL injection attacks; Recurrent neural network (RNN) autoencoderANN; CNN; Decision Tree; Naïve Bayes; SVM; Random Forest; Logistic Regression

Online: 11 July 2023 (10:53:24 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202306.0891.v1

Enhancing Rock Fragmentation Assessment in Mining Blasting through Machine Learning Algorithms: An Effective Approach

Angesom Gebretsadik, Rahul Kumar, Hajime Ikeda, Mujahid Ali, Yewuhalashet Fissha, Arvind Kumar Mishra, Yemane Kide, Mohammed Mnzool, Enas Ali, E. E. Gomaa

Subject: Engineering, Mining And Mineral Processing Keywords: Fragmentation; Artificial neural network; Random Forest regression; Support vector regression; XG Boost Regression; Sensitivity analysis

Online: 13 June 2023 (08:04:17 CEST)

Show abstract| Download PDF| Share

In a limestone quarry mine, fragmentation is a crucial outcome of blasting operations. The optimization of blasting operations greatly benefits from the prediction of rock fragmentation. The main factors that affect fragmentation are rock mass characteristics, blast geometry, and explosive properties. This paper is a step towards the implementation of machine learning and deep learning algorithms for predicting the extent of fragmentation (in percentage) in opencast mining. Various parameters can affect fragmentation. But, in this paper initially, ten parameters (spacing, drill hole diameter, burden, average bench height, powder factor, number of holes, charge per delay, uniaxial compressive strength, specific drilling, and stemming) are collected to train the model. However, due to a weak correlation with rock fragmentation, drill diameter, Average bench height, compressive strength, stemming, and charge per delay are eliminated to reduce model complexity. A total of 219 data sets having five input features i.e., the number of holes, spacing, burden, specific drilling, and powder factor are used to develop the models. To predict rock fragmentation due to blasting in limestone quarry mines, both machine learning models (Random Forest Regression (Bagging), Support Vector Regression, and XG Boost Regression (Boosting)), as well as a deep learning model (Neural Network Regression), are applied to develop a model that can optimize the prediction of fragmentation. The Artificial neural network model optimization showed that the model with architecture 64-32-16-1 can perform well giving MSE (mean squared error) values of 41.32 and 28.59 on training and test data respectively. The R2 value for both training and test is 0.83. Random Forest regression is also performing well compared to SVR and XG boost with the MSE value 12.37 and 9.89 on training and testing data respectively. Here, the R2 value for both sets are 94%. Based on the permutation importance and Shapely plot values, the powder factor has the highest impact, and the burden has the lowest impact on fragmentation.

Preprint ARTICLE | doi:10.20944/preprints202310.1467.v1

A Multi-Layer Neural Network Model “SR-KS” of Human Activity Recognition for the Problem of Similar Human Activity

QianCheng Tan, Yonghui Qin, Rui Tang, Sixuan Wu, JIng Cao

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: body-worn sensors; multi layer classifier; random forest; kernel fisher discriminant analysis; SVM; stepwise regression

Online: 23 October 2023 (16:18:56 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202002.0200.v1

Regression Medians and Uniqueness

Yijun Zuo

Subject: Computer Science And Mathematics, Probability And Statistics Keywords: uniqueness: regression depth; maximum depth estimator; regression median; robustness

Online: 15 February 2020 (14:51:15 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints201612.0139.v1

A Kinematic Model to Compensate the Structural Deformations in Machine Tools Using Fibre Bragg Grating (FBG) Sensors

Francesco Aggogeri, Alberto Borboni, Rodolfo Faglia, Angelo Merlo, Nicola Pellegrini

Subject: Engineering, Mechanical Engineering Keywords: kinematic model; fiber Bragg grating; deformations; machine tools calibration; predicted model; multiple regression analysis; finite element analysis

Online: 29 December 2016 (07:39:26 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints201907.0341.v1

Impact of Socio-Economic Factors and Indwelling Mosquito Control on Malaria Prevalence among Pregnant Women in Nigeria Using Logistic Regression Model

Monday Osagie Adenomon, Osazee Femi Obazee, Eric Vance

Subject: Computer Science And Mathematics, Probability And Statistics Keywords: malaria; indwelling malaria control; insecticide treated net (ITN); pregnancy; socio-economic; logistic regression; odds ratio

Online: 30 July 2019 (14:40:53 CEST)

Show abstract| Download PDF| Share

Malaria is endemic in Nigeria and remains a major public health problem, taking its greatest toll on children under age 5 and pregnant women, although it is preventable, treatable, and curable. This study investigates the Impact of socio-economic factors and indoor mosquito control on malaria prevalent among pregnant women in Nigeria using logistic regression. To achieve this, secondary data obtained from 2015 Nigeria Malaria Indicator survey, executed by the National Malaria Elimination Programme (NMEP) and the National Population Commission (NPopC), with a nationally representative sample of more than 8,000 consisting of 7,745 households. The results from the logistic regression with odds ratio revealed that pregnant women are more like to be affected by malaria fever (though not significant) compared to women that are not pregnant. The income levels of the household does not significant reduce the incidence of malaria fever among pregnant women in Niger. Concerning the malaria presenting measure, only dwelling sprayed by private company significantly reduce the incidence of malaria fever among pregnant women (P-value=0.020<0.05) compared to dwelling sprayed by government and NGOs and also to Insecticide Treated Net. Also pregnant women in the urban centers are less likely to have malaria fever compared to pregnant women in rural communities in Nigeria. Also, pregnant women with atleast a secondary school level of education are less likely to be affected by malaria fever compared to pregnant women with no formal education. The fitted logistic model passed the goodness-of-test fit; the classification test for the logistic model was correctly classified at about 67.02%. Therefore, this study recommends that government and NGOs should intensify their efforts in the area of dwelling spraying, awareness campaign of the danger of malaria fever among pregnant women and infants, engaged in effective distribution of insecticide treated net in order to reduce the incidence of malaria fever among pregnant women living in rural communities in Nigeria.

Preprint REVIEW | doi:10.20944/preprints202311.0156.v1

Advances in Tilapia farming: Statistical Analysis of the Use of Probiotics and Assessment of their Potentials and Challenges

Wellison Amorim Pereira, Iara Reis, Alejandro Villasante, Carolina Ramírez, Sara Franco, Carlos M.N. Mendonça, Danielle de Carla Dias, Leonardo Tachibana, Attilio Converti, Abdel-Fattah M. El-Sayed, Jaime Romero, Elias Figueroa Villalobos, Ricardo Pinheiro de Souza Oliveira

Subject: Biology And Life Sciences, Aquatic Science Keywords: tilapia; probiotics; linear regression analysis; hierarchical regression analysis; Pearson correlation

Online: 2 November 2023 (10:29:36 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202309.2134.v1

Dendrometric Relationships and Biomass in Commercial Plantations of Dipteryx spp. in the Eastern Amazon

Lucas Sérgio de Sousa Lopes, Daniela Pauletto, Emeli Susane Costa Gomes, Ádria Fernandes da Silva, Thiago Gomes de Sousa Oliveira, Jéssica Aline Godinho da Silva, Diego Damázio Baloneque, Lucieta Guerreiro Martorano

Subject: Biology And Life Sciences, Forestry Keywords: carbon; crown diameter; regression

Online: 30 September 2023 (05:42:45 CEST)

Show abstract| Download PDF| Share

Preprint REVIEW | doi:10.20944/preprints202110.0207.v1

Transfer Learning of Clinical Outcomes with Molecular Data, Principles and Perspectives

Axel Kowald, Israel Barrantes, Steffen Möller, Daniel Palmer, Hugo Murua Escobar, Georg Fuellen

Subject: Biology And Life Sciences, Biochemistry And Molecular Biology Keywords: transfer learning; classification; regression

Online: 13 October 2021 (16:28:59 CEST)

Show abstract| Download PDF| Share

Accurate transfer learning of clinical outcomes, e.g., of the effects and side effects of drugs or other interventions, from one cellular context to another (in-vitro versus ex-vivo versus in-vivo, or across tissues), between cell-types, developmental stages, omics modalities or species, is considered tremendously useful. Ultimately, it may avoid most drug development failing in translation, despite large investments in the preclinical stages, which includes animal experiments requiring careful justification. Thus, when transferring a prediction task from a source (model) domain to a target domain, what counts is the high quality of the predictions in the target domain, requiring molecular states or processes common to both source and target that can be learned by the predictor, reflected by latent variables. These latent variables may form a compendium of knowledge that is learned in the source, to enable predictions in the target; usually, there are few, if any, labeled target training samples to learn from. Transductive learning then refers to the learning of the predictor in the source domain, transferring its outcome label calculations to the target domain, considering the same task. Inductive learning considers cases where the target predictor is performing a different yet related task as compared to the source predictor, making some labeled target data necessary. Often, there is also a need to first map the variables in the input/feature spaces (e.g. of gene names to orthologs) and/or the variables in the output/outcome spaces (e.g. by matching of labels). Transfer across omics modalities also requires that the molecular information flow connecting these modalities is sufficiently conserved. Only one of the methods for transfer learning we reviewed offers an assessment of input data, suggesting that transfer learning is unreliable in certain cases. Moreover, source domains feature their very own particularities, and transfer learning should consider these, e.g., as differences in pharmacokinetics, drug clearance or the microenvironment. In light of these general considerations, we here discuss and juxtapose various recent transfer learning approaches, specifically designed (or at least adaptable) to predict clinical (human in-vivo) outcomes based on molecular data, towards finding the right tool for a given task, and paving the way for a comprehensive and systematic comparison of the suitability and accuracy of transfer learning of clinical outcomes.

Working Paper ARTICLE

Drivers of Electricity Poverty in Spanish Dwellings: A Quantile Regression Approach

Rafael de Arce, Ramón Mahía

Subject: Business, Economics And Management, Economics Keywords: electricity poverty; quantile regression

Online: 18 September 2020 (09:40:45 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202312.1619.v1

Uncovering Undiagnosed Hypertension Rates in Bangladeshi Adults: Insights from Advanced Statistical Models

Tanjim Siddiquee, Syed Ejaz Ahmed

Subject: Public Health And Healthcare, Public Health And Health Services Keywords: Undiagnosed Hypertension; Logistic Regression; Log-Binomial Regression; Machine Learning; Cross-Sectional Studies

Online: 21 December 2023 (06:23:32 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202201.0441.v1

Adaptive Batch Size Selection in Active Learning for Regression

Anthony Faulds

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Active learning (AL); batch mode; expected model change; linear regression; nonlinear regression

Online: 28 January 2022 (15:03:10 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202401.0645.v1

Spline Regression Mathematical Model for Obtaining a Sustainable Management in Young Beech (Fagus sylvatica L.) Stands

Ghiţă Cristian Crainic, Mircea Curila, Sorin Curila, Anamaria Supuran, Alexandru Mihai Bica

Subject: Environmental And Earth Sciences, Sustainable Science And Technology Keywords: sustainable forest management; spacing of the stands; forest productive potential; silvicultural interventions; density of the stands; growing space; crown diameter; cubic spline regression

Online: 9 January 2024 (02:50:00 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202008.0448.v1

An Integrated Approach to Air Passenger Index Prediction: Mutual Information Principle and Support Vector Regression (MI-Svr) Blended Model

Honglin Xiong, Chongjun Fan, Collins Opoku ANTWI, yun yang, Xiaomao fan

Subject: Computer Science And Mathematics, Information Systems Keywords: airport operation and management; air passenger index(API) prediction; machine learning(ML); mutual information(MI); support vector regression (SVR); K-Means

Online: 20 August 2020 (08:31:36 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202008.0392.v1

An Integrated Approach to Air Passenger Index Prediction: Mutual Information Principle and Support Vector Regression (MI-SVR) Blended Model

Honglin Xiong, Chongjun Fan, Collins Opoku Antwi, Yun Yang, Xiaomao Fan

Subject: Computer Science And Mathematics, Information Systems Keywords: airport operation and management; air passenger index(API) prediction; machine learning(ML); mutual information(MI); support vector regression (SVR); K-Means

Online: 18 August 2020 (16:25:20 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202402.0615.v3

Long-Term Effect of Fire on Leaf Production in a Southwest Missouri Oak Woodland

Sanjeev Sharma, D Alexander Wait, Puskar Khanal, Akeem Ajao

Subject: Environmental And Earth Sciences, Ecology Keywords: Ecosystem; woodland; regression; precipitation; saplings

Online: 14 February 2024 (04:04:54 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202208.0222.v1

Factors Associated With Mortality With Tuberculosis Diagnosis in Indigenous Population in Peru 2015-2019

Hoover Leon, Oriana Rivera-Lozada, Elvis Siprian Castro-Alzate, RULA Aylas-Salcedo, Robinson Pacheco López, Cesar Antonio Bonilla-Asalde

Subject: Medicine And Pharmacology, Epidemiology And Infectious Diseases Keywords: Tuberculosis; Mortality; Indigenous; Logistic Regression

Online: 11 August 2022 (12:00:20 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202011.0297.v1

Linear Regression Analysis for Time-Point Datasets

Janardan Patil, Li Len, Abhinav Bharat, Xi Li

Subject: Computer Science And Mathematics, Mathematics Keywords: regression; time point data; modelling

Online: 10 November 2020 (10:00:37 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202403.0470.v1

A New Computational Algorithm for Assessing Overdispersion in Machine Learning Count Models with Python

Luiz Paulo Lopes Fávero, Alexandre Duarte, Helder Prado Santos

Subject: Computer Science And Mathematics, Data Structures, Algorithms And Complexity Keywords: Count data; Machine learning; Negative binomial regression; Overdispersion; Poisson regression; Python; Vuong Test; Zero inflation

Online: 8 March 2024 (09:30:50 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202307.0288.v1

Predicting Idiosyncratic Volatility from Stock Market Trade Records: A Machine Learning Approach

Nasrin Seifi, Hassan S. Shavarani

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Idiosyncratic Volatility Estimation/Prediction; Machine Learning; Deep learning Based Regression; Tree-Based Regression; Artificial Intelligence

Online: 6 July 2023 (02:14:16 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202310.0202.v1

Stability Analysis and Identification of High-Yielding Amaranth Accessions for Varietal Development Under Various Agroecologies of Malawi

Mvuyeni Nyasulu, Sibongile Zimba Chimzinga, Moses Maliro, Rowland Maganizo Kamanga, Rudoviko Galileya Medison, Abel Sefasi

Subject: Biology And Life Sciences, Agricultural Science And Agronomy Keywords: amaranth; environmental index; linear regression; stability

Online: 4 October 2023 (05:04:02 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202008.0139.v1

Copper Price Prediction Using Support Vector Regression Technique

Gabriel Astudillo, Raúl Carrasco, Christian Fernández-Campusano, Máx Chacón

Subject: Engineering, Industrial And Manufacturing Engineering Keywords: copper price; prediction; support vector regression

Online: 6 August 2020 (08:26:35 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202008.0058.v1

Analysis of the Characteristics of Residential Function in the Mountainous Cities (Case Study: Rwandz City – Erbil Governorate – Iraq)

Kamaran Mahmood

Subject: Environmental And Earth Sciences, Geography Keywords: Rwandz; residential function; GIS; correlation; regression

Online: 3 August 2020 (00:37:42 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints201902.0135.v1

Modelling Recovery Rates for Non-performing Loans

Anthony Bellotti, Hui Ye

Subject: Business, Economics And Management, Finance Keywords: recovery rates; beta regression; credit risk

Online: 14 February 2019 (11:30:03 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints201809.0499.v1

Understanding Landscape Influences on Aquatic Fauna across the Central and Southern Appalachians

Richard Daniel Hanks, Paul B. Leonard, Robert F. Baldwin

Subject: Biology And Life Sciences, Ecology, Evolution, Behavior And Systematics Keywords: aquatics; modeling; boosted regression trees; appalachians

Online: 26 September 2018 (05:23:02 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints201712.0032.v1

Bayesian Energy Measurement and Verification Analysis

Herman Carstens, Xiaohua Xia, Sarma Yadavalli

Subject: Engineering, Energy And Fuel Technology Keywords: statistics; uncertainty; regression; sampling; outlier; probabilistic

Online: 6 December 2017 (06:36:02 CET)

Show abstract| Download PDF| Share

Preprint COMMUNICATION | doi:10.20944/preprints202111.0549.v1

PCR, PLS, or OPLS Evaluation of different regression techniques for hypothesis generation

Avani Ahuja

Subject: Computer Science And Mathematics, Applied Mathematics Keywords: Principal Component Regression, Partial Least Squares, Orthogonal Partial Least Squares, multivariate regression, hypothesis generation, Parkinson’s disease

Online: 29 November 2021 (15:42:03 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202312.0671.v2

Has COVID-19 Affected DTP3 Vaccination in the Americas?

Ines Aguinaga-Ontoso, Sara Guillén-Aguinaga, Laura Guillén-Aguinaga, Rosa Alas-Brun, Enrique Aguinaga-Ontoso, Esperanza Rayón-Valpuesta, Francisco Guillén-Grima

Subject: Public Health And Healthcare, Health Policy And Services Keywords: DTP vaccine; America; COVID-19; Vaccine coverage; Joinpoint regression; Health care system; Vaccination rates; Trends; Segmented regression

Online: 28 December 2023 (05:35:03 CET)

Show abstract| Download PDF| Share

Background: In America, vaccine-related deaths constitute a significant contributor to child mortality. An essential means of reducing this is through broad vaccine coverage. The COVID-19 pandemic has posed a potential disruption to vaccine coverage due to its effects on the healthcare system. Objectives: This study aims to evaluate the impact of the COVID-19 pandemic on DTP3 vaccination coverage in the Americas, investigating trends from 2012 to 2022 to identify significant changes, regional disparities, and the overall effect of the pandemic on progress towards global immunization targets. Methods: This study used the coverage data for the third dose of the Diphtheria, Tetanus, and Pertussis Vaccine (DTP3) pulled from UNICEF databases spanning 2012 to 2022. We conducted a JoinPoint regression to identify points of significant trend changes. The annual percentage change (APC) and 95% confidence intervals (95% CI) were calculated for America and its regions. We also used segmented regression analysis. Using the Chi-square test, we compared DTP3 vaccination coverage for each country between 2019 and 2022. Results: Overall, America saw a decrease in vaccine coverage during this period, with an APC of -1.4 (95% CI -1.8.; -1.0). This trend varied across regions. In North America, the decrease was negligible (-0.1% APC). South America showed the steepest decrease, with an APC of -2.5%. Central America also signif-icantly declined, with an APC of -1.3%. Our findings suggest a concerning trend of declining DTP vaccination rates in the Americas, exacerbated in certain regions, in the wake of the COVID-19 pandemic. The absolute decrease in vaccine coverage in the Americas was -4 % between 2019 and 2022, with the most significant drop observed in Central America (-7 %). However, six countries reported increased vaccination rates post-COVID-19, led by Brazil, with a 7% increase. Conversely, twenty-two countries registered a decline in DTP3 vaccine coverage, with the average decrease being -7.37%. This decline poses a significant challenge to achieving the WHO's target of 90% coverage for the third dose of DTP by 2030, as evidenced by the reduction in the number of countries meeting this target from 2019 to 2022. Conclusions: The COVID-19 pandemic has impacted vaccine coverage in America, leading to a decrease, especially across Central America.

Preprint ARTICLE | doi:10.20944/preprints201907.0351.v1

Modeling Daily Pan Evaporation in Humid Climates Using Gaussian Process Regression

Sevda Shabani, Saeed Samadianfard, Mohammad Taghi Sattari, Shahab Shamshirband, Amir Mosavi, Tibor Kmet, Annamária R. Várkonyi-Kóczy

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: evaporation; meteorological parameters; Gaussian process regression; support vector regression; machine learning modeling; hydrology; prediction; data science; hydroinformatics

Online: 31 July 2019 (10:58:29 CEST)

Show abstract| Download PDF| Share

Evaporation is one of the main processes in the hydrological cycle, and it is one of the most critical factors in agricultural, hydrological, and meteorological studies. Due to the interactions of multiple climatic factors, the evaporation is a complex and nonlinear phenomenon; therefore, the data-based methods can be used to have precise estimations of it. In this regard, in the present study, Gaussian Process Regression (GPR), Nearest-Neighbor (IBK), Random Forest (RF) and Support Vector Regression (SVR) were used to estimate the pan evaporation (PE) in the meteorological stations of Golestan Province, Iran. For this purpose, meteorological data including PE, temperature (T), relative humidity (RH), wind speed (W) and sunny hours (S) collected from the Gonbad-e Kavus, Gorgan and Bandar Torkman stations from 2011 through 2017. The accuracy of the studied methods was determined using the statistical indices of Root Mean Squared Error (RMSE), correlation coefficient (R) and Mean Absolute Error (MAE). Furthermore, the Taylor charts utilized for evaluating the accuracy of the mentioned models. The outcome indicates that the optimum state of Gonbad-e Kavus, Gorgan and Bandar Torkman stations, Gaussian Process Regression (GPR) with the error values of 1.521, 1.244, and 1.254, the Nearest-Neighbor (IBK) with error values of 1.991, 1.775, and 1.577, Random Forest (RF) with error values of 1.614, 1.337, and 1.316, and Support Vector Regression (SVR) with error values of 1.55, 1.262, and 1.275, respectively, have more appropriate performances in estimating PE. It found that GPR for Gonbad-e Kavus Station with input parameters of T, W and S and GPR for Gorgan and Bandar Torkmen stations with input parameters of T, RH, W, and S had the most accurate performances and proposed for precise estimation of PE. Due to the high rate of evaporation in Iran and the lack of measurement instruments, the findings of the current study indicated that the PE values might be estimated with few easily measured meteorological parameters accurately.

Preprint ARTICLE | doi:10.20944/preprints202401.1809.v1

Sustainable and Optimized Production in an Aluminum Extrusion Process

Filipe Ferrá, Aldina Correia, Fátima De Almeida, Eliana Costa e Silva

Subject: Engineering, Industrial And Manufacturing Engineering Keywords: Aluminium; Extrusion; Scrap; Sustainability; Multiple Linear Regression

Online: 25 January 2024 (08:01:31 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202307.1405.v1

Quantitating Wastewater Characteristic Parameters Using Neural Network Regression Modeling on Spectral Reflectance

Dhan Lord B. Fortela, Armani Travis, Ashley P. Mikolajczyk, Wayne Sharp, Emmanuel Revellame, William Holmes, Rafael Hernandez, Mark Zappi

Subject: Engineering, Chemical Engineering Keywords: neural network regression; wastewater quality; spectral reflectance

Online: 20 July 2023 (10:44:00 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202305.1678.v1

Divided Continent: Unraveling the Roots of Income Polarization in Europe

Michele Fabiani

Subject: Business, Economics And Management, Economics Keywords: Europe; Income Distrubution; Relative Distribution; RIF-regression

Online: 24 May 2023 (03:34:42 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202305.0792.v1

Exploring the Factors Influencing the Impact of the COVID-19 Pandemic on Global Shipping: A Case Study of the Baltic Dry Index

Cheng-Wen Chang, Ming-Hsien Hsueh, Chia-Nan Wang, Cheng-Chun Huang

Subject: Business, Economics And Management, Business And Management Keywords: Baltic Dry Index; Covid-19; Stepwise Regression

Online: 11 May 2023 (05:11:46 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202305.0096.v1

Exploring the Symmetry of Curvilinear Regression Models for Enhancing the Analysis of Fibrates Drug Activity through Molecular Descriptors

Suha Wazzan, Nurten Urlu Ozalan

Subject: Computer Science And Mathematics, Mathematics Keywords: Topological indices; Fibrates; Curvilinear regression; QSPR analysis

Online: 3 May 2023 (04:48:22 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202205.0417.v1

Predicting COVID-19 Infections in Eswatini Using the Maximum Likelihood Estimation Method

Sabelo Nick Dlamini, Wisdom Mdumiseni Dlamini, Ibrahima Socé Fall

Subject: Medicine And Pharmacology, Epidemiology And Infectious Diseases Keywords: COVID-19; Eswatini; risk mapping; Poisson regression

Online: 31 May 2022 (11:04:12 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202107.0139.v1

Core Elements Towards Circularity: Evidence From the European Countries

Olga Lingaitienė, Aurelija Burinskienė

Subject: Business, Economics And Management, Accounting And Taxation Keywords: circularity; waste streams; circular approaches; regression equation

Online: 6 July 2021 (11:40:19 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202012.0321.v1

Relationship of Trace Metal Covariates and pH Distribution in Groundwater within Gold mining and Non-Gold mining Areas in Ghana

Frederick Armah, Arnold Paintsil, Michael Adu, David Oscar Yawson, Justice Odoi

Subject: Environmental And Earth Sciences, Atmospheric Science And Meteorology Keywords: quantile regression; groundwater; environmental; multivariate; metals; health

Online: 14 December 2020 (10:13:09 CET)

Show abstract| Download PDF| Share

One of the most important defining characteristics of groundwater quality is pH as it fundamentally controls the amount and chemical form of many organic and inorganic solutes in groundwater. Groundwater data are frequently characterized by a wide degree of variability of the factors which possibly influence pH distribution. For this reason, it is challenging to link the spatio-temporal dynamics of pH to a single environmental factor by the ordinary least squares regression technique of the conditional mean. In this study, quantile regression was used to estimate the response of pH to nine environmental factors (As, Cd, Fe, Mn, Pb, turbidity, electrical conductivity, total dissolved solids and nitrates). Results of 25%, 50%, 75% quantile regression and ordinary least squares (OLS) regression were compared. The standard regression of the conditional means (OLS) underestimated the rates of change of pH due to the selected factors in comparison with the regression quantiles. The effect of arsenic increased for sampling locations with higher pH values (higher quantiles) likewise the influence of Pb and Mn. However, the effects of Cd and Fe decreased for sampling locations in higher quantiles. It can be concluded that these detected heterogeneities would be missed if this study had focused exclusively on the conditional means of the pH values. Consequently, quantile regression provides a more comprehensive account of possible spatio-temporal relationships between environmental covariates in groundwater. This study is one of the first to apply this technique on groundwater systems in sub-Saharan Africa. The approach is useful and interesting and has broad application for other mining environments especially tropical low-income countries where climatic conditions can drive rapid cycling or transformations of pollutants. It is also pertinent to geopolitical contexts where regulatory; monitoring and management capacities are weak and where mining pollution of groundwater largely occur.

Preprint REVIEW | doi:10.20944/preprints202312.1938.v1

Machine Learning-Based Regression Models for State of Charge Estimation in Hybrid Electric Vehicles: A Review

Arash Mousaei, Yahya Naderi

Subject: Engineering, Electrical And Electronic Engineering Keywords: Hybrid Electric Vehicles (HEVs); State of Charge (SOC); MACHINE LEARNING; Support Vector Regression (SVR); Neural Network Regression (NNR)

Online: 26 December 2023 (10:05:23 CET)

Show abstract| Download PDF| Share

Preprint REVIEW | doi:10.20944/preprints202111.0310.v1

A Literature Review of Semi-functional Partial Linear Regression Models

Mohammad Fayaz

Subject: Computer Science And Mathematics, Probability And Statistics Keywords: Functional Data Analysis (FDA); Hybrid Data; Semi-Functional Partial Linear Regression Model (SFPLR); Partial Functional Linear Regression; Literature Review

Online: 17 November 2021 (15:21:19 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints201910.0238.v1

Hybrid Machine Learning Model of Support Vector Machine and Fruit Fly Optimization Algorithm for Prediction of Remaining Service Life of Flexible Pavement

Nader Karballaeezadeh, Adrienn Dineva, Amir Mosavi, Narjes Nabipour, Shahaboddin Shamshirband, Danial Mohammadzadeh

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: hybrid machine learning model; transportation infrastructure; flexible pavement; remaining service life prediction; pavement condition index; support vector regression; fruit fly optimization algorithm (foa); gene expression programming (gep); svr-foa

Online: 20 October 2019 (17:11:10 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202404.0517.v1

Analysis of Immediate Effects of the COVID-19 Pandemic on Small and Medium-Sized Enterprises (SMEs) in Rwanda: Firm-Level Data Analysis

Emmanuel Munyemana, Joseph K. Mung’atu, Charles Ruranga

Subject: Business, Economics And Management, Econometrics And Statistics Keywords: wood; firm; COVID-19 and SMEs; regression models

Online: 8 April 2024 (08:46:54 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202403.0425.v1

Experimental Study on the Dowel-Bearing Strength of Bambusa Blumeana

Cres Dan Jr. Omictin Bangoy, Jedelle Yu Falcon, Hannah Amyrose Fajardo Lorenzo, Steven Royce Austria Zeng, Lessandro Estelito O Garciano, Carlo Joseph D Cacanando

Subject: Engineering, Civil Engineering Keywords: Bamboo; Dowel-Bearing Strength; Sustainable Construction; Multivariate Regression

Online: 8 March 2024 (04:48:17 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202402.0923.v1

A New Landscape for Southwest Ozark Missouri Woodlands?

Akeem Ajao, Sanjeev Sharma, D Alexander Wait, Puskar Khanal

Subject: Environmental And Earth Sciences, Ecology Keywords: woodland; regression; t-test; specific leaf weight; decomposition

Online: 18 February 2024 (11:49:46 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202401.1978.v1

Study of the Influence of Data Volume on the Quality of Regression to Restore the Distribution of Temperatures inside Tissue during Hyperthermia

Evgeny Kostyuchenko, Elena Amletova

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Hyperthermia, Regression, Data reduction, Decision Tree, Random Forest

Online: 29 January 2024 (09:52:17 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202312.0938.v2

Potential of Machine Learning for Predicting Sleep Disorders: A Comprehensive Analysis of Regression and Classification Models

Raed Alazaidah, Ghassan Samara, Mohammad Aljaidi, Mais Haj Qasem, Ayoub Alsarhan, Mohammed Alshammari

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: classification; learning strategies; machine learning; sleep disorders; regression

Online: 14 December 2023 (03:08:25 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202312.0092.v1

Data Augmentation for Regression Machine Learning Problems in High Dimensions

Clara Guilhaumon, Nicolas Hascoët, Francisco Chinesta, Marc Lavarde, Fatima Daim

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Active Learning; Design of experiments; Regression; s-PGD

Online: 1 December 2023 (15:04:37 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202311.1782.v1

Efficiency of Micro and Small Wood-Processing Enterprises in the EU – Evidence From DEA and Fractional Regression Analysis

Nikolay Neykov, Mariana Sedliačiková, Petar Antov, Marek Potkány, Emil Kitchoukov, Aureliu-Florin Hălălișan, Natália Poláková

Subject: Business, Economics And Management, Economics Keywords: DEA; wood processing enterprises; small enterprises; fractional regression

Online: 28 November 2023 (07:49:48 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202311.1435.v1

Impact of Exchange Rate Volatility on South Africa's Exports: New Evidence from NARDL and STR Models

Dumisani Pamba

Subject: Business, Economics And Management, Finance Keywords: Exchange Rate Volatility; Exports; NARDL; Smooth Threshold Regression

Online: 22 November 2023 (13:48:53 CET)

Show abstract| Download PDF| Share

Preprint REVIEW | doi:10.20944/preprints202310.1913.v3

Design of Photovoltaic System for Green Manufacturing by using Statistical Design of Experiments

Debo Brata Paul Argha, Md Ashik Ahmed

Subject: Engineering, Civil Engineering Keywords: Solar PV system; Regression Model; DOE; Solar energy; Fossil fuels

Online: 9 November 2023 (10:58:47 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202308.0823.v1

An Adaptive Partial Least Square Regression Approach for Classifying Chicken Egg Fertility by Hyperspectral Imaging

Adeyemi Olutoyin Adegbenjo, Li Liu, Michael O. Ngadi

Subject: Engineering, Bioengineering Keywords: chicken egg fertility; classification; PLS regression; hyperspectral imaging

Online: 10 August 2023 (08:59:12 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202211.0227.v1

Variation Tendency of CVD Risk Factors in Blood According to Age and Relative Grip Strength among Different Populations Based on Bayesian Probabilistic Approach: A Cross-Sectional Study from Jiangsu Province’s Data of China Physical Fitness Surveillance

Komnivoc Tharmo, SB Feng

Subject: Medicine And Pharmacology, Orthopedics And Sports Medicine Keywords: Bayesian; cardiovascular disease; CVD; cross-sectional; logistic regression

Online: 14 November 2022 (01:55:06 CET)

Show abstract| Download PDF| Supplementary Files| Share

Background: Cardiovascular disease (CVD) has been one of the leading causes of death and disability-adjusted life years lost worldwide. Blood pressure, lipid, and cholesterol are good predictors of CVD risk and correspond upon age and physical fitness. However, few studies have explored the variation trend of CVD risk factors across different populations upon age and their muscle strength. Objective: to analysis the variation tendency of CVD risk factors in blood according to age and relative grip strength among different populations. Method: 25363 participants were recruited in this cross-sectional study and 24709 were included in the analysis. A logistic regression and a Bayesian probabilistic analysis based on Markov Chain Monte Carlo (MCMC) Modeling is conducted to build probability prediction models of hypertension, hyperlipidemia, and hypercholesterolemia according to age, relative grip strength, body weight conditions, and physical activity levels. Results: 1) age might be the main influence factor of hypertension, which is regarded as one of the primary CVD risk factors. However, although keeping a high level of physical activity might have positive effect on preventing hypertension because that individuals with normal body weight and higher physical activity shows a lower probability of being diagnosed with hypertension, it might could not prevent individuals from getting hypertension with age. 2) After 60, individuals of normal body weight seem more likely to have hyperlipidemia than those are overweight or obese. 3) Larger relative grip strength might not be able to offset the negative effects of obesity, overweight and physical inactivity on hyperlipidemia. 4) The probability of getting hypercholesterolemia varies less with age and relative grip strength. Conclusion: Body weight management and keeping high levels of physical activity are recommended at any age. It might benefit to increase some bodyweight after 60 years old.

Preprint REVIEW | doi:10.20944/preprints202210.0391.v1

Application of Computational Intelligence Methods in Agricultural Soil-Machine Interaction : A Review

Chetan Badgujar, Dania Martinez Figueroa, Sanjoy Das, Daniel Flippo

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Tillage; Traction; Compaction; Neural networks; Support vector regression

Online: 26 October 2022 (02:07:19 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202106.0533.v1

Pharmacy Impact on Vaccination Progress Using Machine Learning Approach

Samir Bandyopadhyay, Shawni Dutta, Upasana Mukherjee

Subject: Medicine And Pharmacology, Immunology And Allergy Keywords: COVID-19; Vaccine; Prediction; Regression; Ensemble learning; AdaBoost

Online: 22 June 2021 (08:30:30 CEST)

Show abstract| Download PDF| Share

Working Paper ARTICLE

Statistical Methods to Support Difficult Diagnoses

Guenter F. Pilz, Frank Weber, Werner G. Mueller, Juergen R. Schaefer

Subject: Medicine And Pharmacology, Immunology And Allergy Keywords: Diagnosing designs; rare diseases; statistics; regression; block designs

Online: 2 June 2021 (12:14:34 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202103.0586.v1

Prediction of Natural Volatile Organic Compounds Emitted by Bamboo Groves in Urban Forests

Yeji Choi, Geonwoo Kim, Sujin Park, Eunsoo Kim, Soojin Kim

Subject: Environmental And Earth Sciences, Atmospheric Science And Meteorology Keywords: NVOC; phytoncide; bamboo grove; monoterpene; microclimate; regression analysis

Online: 24 March 2021 (13:10:25 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202008.0329.v2

Predictors of Death Rate During the COVID-19 Pandemic

Ian Feinhandler, Benjamin Cilento, Brad Beauvais, Jordan Harrop, Lawrence Fulton

Subject: Medicine And Pharmacology, Epidemiology And Infectious Diseases Keywords: COVID-19; Geospatial Regression; Health Disparities; Public Health

Online: 11 September 2020 (09:48:57 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints201906.0291.v1

Factors Enhancing Serum Syndecan-1 Concentrations: A Large-Scale Comprehensive Medical Examination

Kazumasa Oda, Hideshi Okada, Akio Suzuki, Hiroyuki Tomita, Ryo Kabayashi, Kazuyuki Sumi, Kodai Suzuki, Chihiro Takada, Takuma Ishihara, Keiko Suzuki, Soichiro Kano, Kohei Kondo, Yuki Iwashita, Hirohisa Yano, Ryogen Zaikokuji, So Sampei, Tetsuya Fukuta, Yuichiro Kitagawa, Haruka Okamoto, Takatomo Watanabe, Tomonori Kawaguchi, Takao Kojima, Fumiko Deguchi, Nagisa Miyazaki, Noriaki Yamada, Tomoaki Doi, Takahiro Yoshida, Hiroaki Ushikoshi, Shozo Yoshida, Genzou Takemura, Shinji Ogura

Subject: Medicine And Pharmacology, Internal Medicine Keywords: endothelial disorders; glycocalyx injury; syndecan-1; nonlinear regression

Online: 28 June 2019 (07:42:18 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints201811.0096.v1

Machine Learning Models for Sales Time Series Forecasting

Bohdan M. Pavlyshenko

Subject: Computer Science And Mathematics, Information Systems Keywords: machine learning; stacking; forecasting; regression; sales; time series

Online: 5 November 2018 (09:54:54 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints201608.0025.v2

The Role of Natural Factors on Major Climate Variability in Northern Winter

Indrani Roy

Subject: Environmental And Earth Sciences, Atmospheric Science And Meteorology Keywords: solar variability; NAO; ENSO; volcanic eruptions; multiple regression

Online: 17 May 2017 (06:27:16 CEST)

Show abstract| Download PDF| Share

Preprint COMMENT | doi:10.20944/preprints201608.0166.v1

Regional Inequality in Underdeveloped Areas: A Case Study of Guizhou Province in China

Wei Sun, Xiaona Lin, Yutian Liang, Lu Li

Subject: Social Sciences, Geography, Planning And Development Keywords: Regional inequality; Multilevel regression; Markov chain; Guizhou Province

Online: 17 August 2016 (12:58:58 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202304.1023.v1

Ordinal Logistic Regression as a Tool To Estimate the Risk of Escalating Outcomes. An Application to Vehicle Crash Data

Charalambos Gnardellis, Venetia Notara, Georgia Tzamalouka, Maria Papadakaki, Joannes Chliaoutakis

Subject: Social Sciences, Safety Research Keywords: vehicle crash data; collision risk; ordinal logistic regression; multinomial logistic regression; proportional odds model (POM); partial proportional odds model (PPOM)

Online: 27 April 2023 (04:02:49 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202404.0080.v1

Second Moment/Order Approximations by Kernel Smoothers with Application to Volatility Estimation

León Beleña Lamor, Ernesto Curbelo Benitez, Luca Martino, Valero Laparra Perez-Muelas

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Quantile regression; kernel smoothers; times series; heteroscedasticity; nearest neighbours

Online: 2 April 2024 (02:31:54 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202403.1703.v1

Smart Delivery Assignment Problem through Machine Learning and the Munkres Algorithm

Juan Pablo Vásconez, Elias Schotborgh, Ingrid Nicole Vásconez, Viviana Moya, Andrea Pilco, Oswaldo Menéndez, Robert Guamán-Rivera, Leonardo Guevara

Subject: Engineering, Transportation Science And Technology Keywords: Smart delivery; Machine learning; Regression model; Munkres optimization algorithm

Online: 28 March 2024 (08:17:23 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202312.0131.v1

Factors Affecting Properties of Polymer Grouted Sands

Costas Anagnostopoulos, Vassilios Aggelidis

Subject: Engineering, Civil Engineering Keywords: epoxy resin; grout; creep; strength; permeability; porosity; regression analysis

Online: 5 December 2023 (06:08:16 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202311.0350.v1

A Korean Cattle Weight Prediction Approach Using 3D Segmentation-Based Feature Extraction and Regression Machine Learning from Incomplete 3D Shapes Acquired from Real Farm Environments

Chang Gwon Dang, Seung Soo Lee, Mahboob Alam, Sang Min Lee, Mi Na Park, Ha-Seung Seong, Min Ki Baek, Van Thuan Pham, Jae Gu Lee, Seungkyu Han

Subject: Computer Science And Mathematics, Computer Vision And Graphics Keywords: 3D segmentation; feature extraction; regression machine learning; weight estimation

Online: 6 November 2023 (11:20:30 CET)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202310.0938.v1

Analysis and Modelling of the Process of Skin Onion Peeling by the Method of Blowing Compressed Air

Paweł Woźniak, Agata Bieńczak, Stanisław Nosal, Joanna Piepiórka-Stepuk, Monika Sterczyńska

Subject: Engineering, Mechanical Engineering Keywords: onion; peeling; compressed air; skin; waste; non-linear regression

Online: 16 October 2023 (09:11:18 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202309.0755.v1

Comparison of Three Interstitial Glucose Level Prediction Models for People with Diabetes

Othmar Moser, Harald Sourij, Sergejs Lobanovs, Svjatoslavs Kistkins, Timurs Mihailovs, Valdis Pīrāgs, Dmitrijs Bļizņuks

Subject: Medicine And Pharmacology, Endocrinology And Metabolism Keywords: diabetes; CGM; hypoglycemia; hyperglycemia; prediction; ARIMA; logistic regression; LSTM

Online: 12 September 2023 (16:53:51 CEST)

Show abstract| Download PDF| Share

Background: Novel technologies like continuous glucose monitor (CGM) systems are improving diabetes management by means of real-time sensor glucose levels, retrospective course of glucose and trend arrows. Continuous Glucose Monitoring (CGM) offers real-time alerts for (prognostic) hypo- and hyperglycemia, fast dropping or increasing glucose, and hence improving glycaemia under unstable conditions like during meals, physical activity and exercise management. Complex CGM systems challenge people with diabetes and health care professionals in interpreting rapid changes, sensor delay (~10-minute difference between interstitial and plasma glucose), and malfunctions. Enhanced prediction models are necessary for optimal insulin dosing, daily activities, and especially for future fully closed-loop systems. Methods: The aim of this study was to investigate the efficacy of three different predictive models for glucose responses: 1) an autoregressive integrated moving average model (ARIMA), 2) logistic regression, 3)and long short-term memory networks (LSTM), in predicting glucose levels after 15 minutes and one hour. We compared and evaluated the performance of these models in predicting hypoglycemia (<70 mg/dL), euglycemia (70-180 mg/dL), and hyperglycemia (>180 mg/dL). In more detail, by assessing metrics such as precision, recall, F1-score, and accuracy, we specifically assessed which model provided the most accurate and reliable predictions for glucose levels Results: As expected, ARIMA showed the worst accuracy especially predicting hypoglycaemia withing 1-hour (7.3%). The accuracy of the logistic regression model, predicting hypoglycemia during the first 15 min was higher (98%), comparing to LSTM (88%). However, the LSTM model (87%) exceeded the accuracy of hypoglycemia prediction of the logistic regression (83%) during an hour prognosis. The same pattern observed in hyperglycemia - ARIMA model (60%, 1 hour), logistic regression (96%, 15 minutes) and LSTM (85%, 1 hour) Conclusions: These findings suggest that different models may have varying strengths and weaknesses in predicting glucose levels, and the choice of model should be carefully considered based on the specific requirements and context of the clinical application. The logistic regression model was more accurate for the next 15 minutes, especially predicting hypoglycemia. However, the LSTM model exceeded logistic regression for the next one hour prediction. Future research could explore hybrid models or ensemble approaches that combine the strengths of multiple models to further improve the accuracy and reliability of glucose predictions.

Preprint ARTICLE | doi:10.20944/preprints202309.0302.v1

Adaptive Synthesized Control for Solving The Optimal Control Problem

Askhat Diveev, Elizaveta Shmalko

Subject: Computer Science And Mathematics, Robotics Keywords: stabilization; symbolic regression; synthesized control; evolutionary computations; quadcopter model

Online: 5 September 2023 (10:11:12 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202308.1978.v1

Using LLM Models and Explainable ML to Analyse Biomarkers at Single Cell Level for Improved Understanding of Diseases

Jonas Elsborg, Marco Salvatore

Subject: Biology And Life Sciences, Life Sciences Keywords: biomarker, LLM, interpretability, scRNA-seq, machine learning, symbolic regression

Online: 30 August 2023 (03:53:31 CEST)

Show abstract| Download PDF| Share

Preprint ARTICLE | doi:10.20944/preprints202308.0314.v1

Modelled Multidecadal Trends of Lightning and (Very) Large Hail in Europe and North America (1950–2021)

Francesco Battaglioli, Pieter Groenemeijer, Tomas Pucik, Mateusz Taszarek, Uwe Ulbrich, Henning Rust

Subject: Environmental And Earth Sciences, Atmospheric Science And Meteorology Keywords: Hail; Lightning; Climate change; Regression analysis; Trends; Reanalysis data

Online: 3 August 2023 (10:07:40 CEST)

Show abstract| Download PDF| Share

Preprint REVIEW | doi:10.20944/preprints202303.0401.v1

The Scientific Record: Examining Some of the Claims and Counterclaims in the MMR Saga

Jacob M. Puliyel

Subject: Computer Science And Mathematics, Applied Mathematics Keywords: Strawman fallacy; UK General Medical Council; autism; regression; MMR

Online: 22 March 2023 (14:39:30 CET)

Show abstract| Download PDF| Share

Search Results

416 articles found