Search | Preprints.org

Vegetation identification using remote sensing is essential for understanding terrestrial ecosystems and monitoring climate change. By providing high-resolution images of the Earth's surface from satellites, aircraft, and drones, this technology has revolutionized the way scientists study and understand vegetation patterns, yielding information on the distribution, structure, and health of plant communities in different regions and biomes. However, identifying vegetation in multispectral and hyperspectral images remains a difficult, time-consuming, and costly process. Traditional methods for vegetation mapping involve the interpretation of spectral indices such as the Normalized Difference Vegetation Index (NDVI), although they often suffer from limitations such as manual interpretation and the limited spatial resolution of satellite images. Furthermore, machine learning techniques, including artificial neural networks, decision trees, support vector machines, and random forests, have shown promising results in identifying and classifying vegetation cover from hyperspectral images. However, because they require personnel with specialized expertise, their application on a large scale is still challenging. On the other hand, correlation methods between curves have emerged as a powerful and simple tool for comparing the spectral signatures of hyperspectral image pixels and efficiently classifying vegetation in a study area. Pearson's correlation coefficient and the spectral angle mapper are among the most common correlation methods used in vegetation mapping. These correlation methods, among many others available, offer an easily implemented and computationally efficient approach, making them particularly useful for applications in developing countries or regions with limited resources. In this article, a comparative study of correlation/distance metrics was conducted for the detection of vegetation pixels in hyperspectral images. The study compared five distance and/or correlation metrics: direct correlation, cosine similarity, normalized Euclidean distance, Bray-Curtis distance, and Pearson correlation. The two metrics that yielded the best accuracy in vegetation pixel detection were direct correlation and Pearson correlation. Based on the selected methods, a vegetation detection algorithm was implemented and validated on a hyperspectral image of the Manga neighborhood in Cartagena de Indias, Colombia. The spectral library was used for image processing, while the numpy and scipy libraries in the Python programming language were used to compute the correlations. Both the study's approach and the implemented algorithm aim to serve as a reference for conducting detection studies of various material types in hyperspectral images using open-access programming platforms.
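To make the idea of comparing spectral signatures concrete, the following is a minimal sketch of three of the metrics named in the abstract (Pearson correlation, cosine similarity, and Bray-Curtis distance), written in plain Python rather than with the numpy and scipy libraries the abstract mentions. The reference spectrum, the pixel values, the function names, and the 0.9 threshold are all illustrative assumptions and are not taken from the paper.

```python
import math

def pearson(a, b):
    """Pearson correlation coefficient between two equal-length spectra."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    dev_a = [x - mean_a for x in a]
    dev_b = [y - mean_b for y in b]
    num = sum(x * y for x, y in zip(dev_a, dev_b))
    den = math.sqrt(sum(x * x for x in dev_a) * sum(y * y for y in dev_b))
    return num / den

def cosine_similarity(a, b):
    """Cosine of the angle between two spectra treated as vectors."""
    num = sum(x * y for x, y in zip(a, b))
    den = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return num / den

def bray_curtis(a, b):
    """Bray-Curtis distance: 0 for identical spectra, larger when they differ."""
    num = sum(abs(x - y) for x, y in zip(a, b))
    den = sum(abs(x + y) for x, y in zip(a, b))
    return num / den

def classify_pixels(pixels, reference, threshold=0.9):
    """Flag a pixel as vegetation when its Pearson correlation with the
    reference spectrum reaches the (assumed) threshold."""
    return [pearson(p, reference) >= threshold for p in pixels]

# Hypothetical five-band reflectances: low in the visible bands, high in the near infrared.
reference = [0.05, 0.08, 0.04, 0.45, 0.50]
vegetation_like = [0.06, 0.09, 0.05, 0.40, 0.48]
water_like = [0.10, 0.08, 0.06, 0.03, 0.02]

print(classify_pixels([vegetation_like, water_like], reference))  # [True, False]
```

In a real workflow each pixel of the hyperspectral cube would be compared against a reference signature in the same way, and the threshold would be tuned against ground-truth data.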

Preprint ARTICLE | doi:10.20944/preprints201805.0143.v1

Robust Template-Based Watermarking for DIBR 3D Images

Wook-Hyung Kim, Jong-Uk Hou, Han-Ul Jang, Heung-Kyu Lee

Subject: Computer Science And Mathematics, Computer Science Keywords: depth-image-based rendering (DIBR); 3D content; curvelet transform; 1D-discrete cosine transform (1D-DCT); template watermark; DIBR watermarking

Online: 9 May 2018 (09:00:10 CEST)

Preprint ARTICLE | doi:10.20944/preprints202407.2201.v1

Combustion Control of Ship’s Oil-Fired Boilers based on Prediction of Flame Images

Chang-Min Lee

Subject: Engineering, Control And Systems Engineering Keywords: Combustion control; Emission prediction; IMC-based PI Control; Real-Time Control; Performance assessment

Online: 26 July 2024 (15:04:57 CEST)

Preprint ARTICLE | doi:10.20944/preprints202404.1691.v1

Text Analytics on YouTube Comments for Food Products

Maria Tsiourlini, Katerina Tzafilkou, Dimitrios Karapiperis, Christos Tjortjis

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: plant-based products; hedonic food products; sentiment analysis; text analytics; youtube comments; machine learning

Online: 26 April 2024 (08:15:37 CEST)

Preprint ARTICLE | doi:10.20944/preprints202004.0032.v1

Indoor Positioning Using PnP Problem on Mobile Phone Images

Hana Kubíčková, Karel Jedlička, Radek Fiala, Daniel Beran

Subject: Environmental And Earth Sciences, Remote Sensing Keywords: indoor positioning system; image-based positioning system; computer vision; SIFT; feature detection; feature description; cell phone camera; PnP problem; projection matrix; epipolar geometry; OpenCV

Online: 3 April 2020 (11:59:48 CEST)

Preprint ARTICLE | doi:10.20944/preprints202308.0549.v1

Impact of Phantom Size on Low-Energy Virtual Monoenergetic Images of Three Dual-Energy CT Platforms

Joël Greffier, Claire Van Ngoc Ty, Isabelle Fitton, Julien Frandon, Jean-Paul Beregi, Djamel Dabli

Subject: Medicine And Pharmacology, Other Keywords: Dual-energy; Multidetector Computed tomography; Task-based image quality assessment; Split-filter

Online: 8 August 2023 (03:33:22 CEST)

Preprint ARTICLE | doi:10.20944/preprints202305.0007.v1

An Advanced Approach on Enhancing Accommodation Maintenance Via Text-Based Customer Review Analysis

Tharindu Wickramasinghe, Pumudu Fernando

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Machine Learning; Classification; Natural Language Processing; Text-based Customer Review Analysis; Sentiment Analysis; Deep Learning

Online: 1 May 2023 (03:21:05 CEST)

Preprint REVIEW | doi:10.20944/preprints202304.1035.v1

A Review on Deriving Maintenance of Accommodations Via Text-based Feedback Analysis

Tharindu Wickramasinghe, Pumudu Fernando

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Machine Learning; Classification; Natural Language Processing; Text-based Customer Review Analysis; Sentiment Analysis; Deep Learning

Online: 27 April 2023 (04:30:17 CEST)

Preprint ARTICLE | doi:10.20944/preprints202402.0693.v1

SFT For Improved Text-to-SQL Translation

Ankit Agrahari, Puneet Kumar Ojha, Abhishek Gautam, Parikshit Singh

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Text-to-sql

Online: 13 February 2024 (03:07:37 CET)

Preprint ARTICLE | doi:10.20944/preprints202306.0643.v1

Spectral Characterization, Microscopic Images, and Antimicrobial Activity of Chenodeoxycholic Acid Complexes with Zn(II), Mg(II), and Ca(II) Ions

Abdulrahman A. Almehizia, Mohamed A. Al-Omar, Ahmed M. Naglah, Mashooq A. Bhat, Fhdah S. Alanazi, Fatimah A. Alotaibi, Moamen S. Refat, Abdel Majid A. Adam

Subject: Chemistry And Materials Science, Applied Chemistry Keywords: Chenodeoxycholic acid; Metal-based complex; Spectroscopy; Microscopic images; Antimicrobial test

Online: 8 June 2023 (11:28:03 CEST)

Preprint ARTICLE | doi:10.20944/preprints202211.0249.v1

Deep Learning-Based Screening of Urothelial Carcinoma in Whole Slide Images of Liquid-Based Cytology Urine Specimens

Masayuki Tsuneki, Makoto Abe, Fahdi Kanavati

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: urothelial carcinoma; urine; liquid-based cytology; deep learning; cancer screening; whole slide image

Online: 14 November 2022 (09:31:16 CET)

Preprint ARTICLE | doi:10.20944/preprints202106.0157.v1

Object-Based Land Use and Land Cover Change Detection in Multi Temporal Remote-Sensing Images

Dan Li, Runjie Jin, Jiali Gu, Runqiu Huang, Jiaping Wu

Subject: Environmental And Earth Sciences, Atmospheric Science And Meteorology Keywords: Land use and land cover; Classification; Object-based change detection; Multi-temporal image analysis; Landsat; Tiaoxi

Online: 7 June 2021 (09:27:22 CEST)

Preprint DATASET | doi:10.20944/preprints202311.1183.v1

S-LIGHT: Synthetic Dataset for the Separation of Diffuse and Specular Reflection Images

Sangho Jo, Ohtae Jang, Chaitali Bhattacharyya, Minjun Kim, Taeseok Lee, Yewon Jang, Haekang Song, Hyeokmin Kwon, Saebyeol Do, Sungho Kim

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Single Image based Deep Learning Model, Specular Highlight Removal, Reflection Removal, Synthetic Dataset, Multi-Scale Normalized Cross Correlation (MS-NCC)

Online: 21 November 2023 (10:00:38 CET)

Preprint REVIEW | doi:10.20944/preprints202010.0649.v2

Modern Clinical Text Mining: A Guide and Review

Bethany Percha

Subject: Computer Science And Mathematics, Information Systems Keywords: text mining; natural language processing; electronic health records; clinical text; machine learning

Online: 3 February 2021 (10:31:14 CET)

Preprint ARTICLE | doi:10.20944/preprints202310.0286.v1

Natural Language Processing-based Method for Clustering and Analysis of Movie Reviews and Classification by Genre

Fernando González, Miguel Torres-Ruiz, Guadalupe Rivera-Torruco, Liliana Chanona-Hernández, Rolando Quintero

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Text document clustering; K-means; TF-IDF; NLP; Text vectorization; machine learning; movie reviews

Online: 5 October 2023 (11:57:13 CEST)

Preprint ARTICLE | doi:10.20944/preprints202203.0329.v1

Plagiarism Detection in the Bengali Language: A Text Similarity-Based Approach

Satyajit Ghosh, Aniruddha Ghosh, Bittaswer Ghosh, Abhishek Roy

Subject: Computer Science And Mathematics, Analysis Keywords: Plagiarism Detection; Plagiarism checker for Bengali text; Bengali Literature Corpus; OCR in Bengali text

Online: 24 March 2022 (09:36:56 CET)

Preprint ARTICLE | doi:10.20944/preprints202302.0017.v1

Digital Mental Health Service Engagement Changes during Covid-19 in Children and Young People across the UK: Presenting Concerns, Service Activity, and Access by Gender, Ethnicity, and Deprivation

Duleeka Knipe, Santiago De Ossorno Garcia, Louisa Salhi, Lily Mainstone-Cotton, Aaron Sefi, Ann John

Subject: Social Sciences, Psychology Keywords: Digital Mental Health; deprivation; service activity; Mental health concerns; ethnicity; time-series analysis; Covid-19; Text-based online therapy

Online: 2 February 2023 (01:30:05 CET)

The adoption of digital health technologies accelerated during Covid-19, with concerns over the equity of access due to digital exclusion. Using data from a text-based online mental health service for children and young people, we explore the impact of the pandemic on service access and presenting concerns, and whether access differed by sociodemographic characteristics (gender, ethnicity and deprivation). We used interrupted time-series models to assess whether there was a change in the level and rate of service use during the Covid-19 pandemic (April 2020-April 2021) compared to pre-pandemic trends (June 2019-March 2020). Routinely collected data from 61221 service users were extracted for observation; these represented half of the service population, as only users who consented to share their data were included. The majority of users identified as female (74%) and White (80%), with an age range between 13 and 20 years. There was evidence of a sudden increase (13%) in service access at the start of the pandemic (RR 1.13, 95% CI 1.02, 1.25), followed by a reduced rate (from 25% to 21%) of engagement during the pandemic compared to pre-pandemic trends (RR 0.97, 95% CI 0.95, 0.98). There was a sudden increase in almost all presenting issues apart from physical complaints. There was evidence of a step increase in the number of contacts for Black/African/Caribbean/Black British (38% increase; 95% CI: 1%-90%) and White ethnic groups (14% increase; 95% CI: 2%-27%), and of a sudden increase in service use at the start of the pandemic for the most (58% increase; 95% CI: 1%-247%) and least (47% increase; 95% CI: 6%-204%) deprived areas. During the pandemic, contact rates decreased, and referral sources changed at the start. Findings on access and service activity align with other studies observing reduced service utilisation. The lack of differences by deprivation level and ethnicity at lockdown suggests a need to explore equity of access to the anonymous service. The study provides unique insights into changes in digital mental health use during Covid-19 in the UK.

Preprint ARTICLE | doi:10.20944/preprints202003.0313.v3

Small-Object Detection in Remote Sensing Images with End-to-End Edge-Enhanced GAN and Object Detector Network

Jakaria Rabbi, Nilanjan Ray, Matthias Schubert, Subir Chowdhury, Dennis Chao

Subject: Computer Science And Mathematics, Computer Vision And Graphics Keywords: object detection; faster region-based convolutional neural network (FRCNN); single-shot multibox detector (SSD); super-resolution; remote sensing imagery; edge enhancement; satellites

Online: 29 April 2020 (13:33:56 CEST)

Preprint ARTICLE | doi:10.20944/preprints202406.1696.v1

Speech-to-Text Conventional Myanmar (Burmese) Language Recognition System

Halawati Binti Abd Jalil, Pyae Sone Phyo, Md Amin Ullah Sheikh, Riskhan Basheer

Subject: Computer Science And Mathematics, Computer Science Keywords: Speech to Text; NLP; Language Recognition

Online: 24 June 2024 (13:49:30 CEST)

Preprint ARTICLE | doi:10.20944/preprints202403.0147.v1

A Method to Classify Texts Based on Sentiment Analysis and Machine Learning

Claudia Corona López, Jesus Urias Piña, Rafael Lahoz-Beltra

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Sentiment analysis; text classification; machine learning

Online: 5 March 2024 (05:10:29 CET)

Preprint ARTICLE | doi:10.20944/preprints202306.2163.v1

The Impact of Digital Media on Event-Related Perception

Stefano Calabrese

Subject: Arts And Humanities, Literature And Literary Theory Keywords: Event; transmedia storytelling; action; expanded text

Online: 30 June 2023 (03:08:15 CEST)

Preprint ARTICLE | doi:10.20944/preprints202110.0033.v1

KOMBAT: Knowledgebase of Microbes’ Battling Agents for Therapeutics

Anasuya Bhargav, Srijanee Gupta, Surabhi Seth, Sweety James, Firdaus Fatima, Pratibha Chaurasia, Srinivasan Ramachandran

Subject: Biology And Life Sciences, Immunology And Microbiology Keywords: Antibiotic resistance; text mining; therapy; database

Online: 4 October 2021 (08:58:52 CEST)

Preprint ARTICLE | doi:10.20944/preprints202011.0646.v1

Online Multilingual Hate Speech Detection: Experimenting with Hindi and English Social Media

Neeraj Vashistha, Arkaitz Zubiaga

Subject: Computer Science And Mathematics, Computer Science Keywords: social media; hate speech; text classification

Online: 25 November 2020 (14:12:07 CET)

Preprint ARTICLE | doi:10.20944/preprints201610.0012.v1

Bio-Resource Exchange: Study of Prevalence of Antibody Donation and Development of a Web Portal to Facilitate it

Sandeep Subramanian, Madhavi Ganapathiraju

Subject: Biology And Life Sciences, Biochemistry And Molecular Biology Keywords: data exchange; resource donations; text mining

Online: 5 October 2016 (15:08:32 CEST)

Preprint ARTICLE | doi:10.20944/preprints202204.0303.v2

Linguistic Markers of Intercultural Competence in Student Blogs

Hilde Hanegreefs, Mark Pluymaekers, Ankie Hoefnagels

Subject: Social Sciences, Language And Linguistics Keywords: Blogging; intercultural competence; international learning outcomes; reflective writing; reflection; text analysis; text mining; psycholinguistics; linguistic markers

Online: 8 March 2023 (10:07:17 CET)

Preprint ARTICLE | doi:10.20944/preprints202008.0033.v1

Detecting Suspicious Texts Using Machine Learning Techniques

Omar Sharif, Mohammed Moshiul Hoque, A. S. M. Kayes, Raza Nowrozy, Iqbal H. Sarker

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Natural Language Processing; Suspicious Text Detection; Bengali Language Processing; Machine Learning; Text Classification; Feature Extraction; Suspicious Corpora

Online: 2 August 2020 (14:38:13 CEST)

Preprint ARTICLE | doi:10.20944/preprints202212.0478.v1

Implementing Computer Vision Techniques to Recognize American Sign Language (ASL) Hand Signals

Tauheed Khan Mohd, Alvaro Martin Grande, Rodrigo E. Ayala, Stuart Isteefano

Subject: Computer Science And Mathematics, Information Systems Keywords: Datasets, Neural Networks, Hand Detection, Text Tagging

Online: 26 December 2022 (07:30:24 CET)

Preprint ARTICLE | doi:10.20944/preprints202211.0017.v1

Performance Comparison of TTS Models for Brazilian Portuguese to Establish a Baseline

Wilmer Lobato, Felipe Farias, William Cruz, Marcellus Amadeus

Subject: Computer Science And Mathematics, Computer Science Keywords: text-to-speech; naturalness; intelligibility; Brazilian Portuguese

Online: 1 November 2022 (04:37:04 CET)

Working Paper ARTICLE

DASTEX: a New Readability Formula based on Semantic Complexity of Text

Mohammad Reza Besharati, Mohammad Izadi

Subject: Computer Science And Mathematics, Algebra And Number Theory Keywords: Semantic Complexity; Semantics; Text Complexity; Readability Formulae

Online: 6 September 2021 (13:33:34 CEST)

Preprint ARTICLE | doi:10.20944/preprints202010.0057.v1

Automatic electronic invoice classification using machine learning models

Chiara Bardelli, Alessandro Rondinelli, Ruggero Vecchio, Silvia Figini

Subject: Business, Economics And Management, Accounting And Taxation Keywords: multiclass classification; text mining; accounting control system

Online: 5 October 2020 (09:05:53 CEST)

Preprint ARTICLE | doi:10.20944/preprints201802.0001.v1

Building a Domain Ontology in the Process of Linguistic Analysis of Text Resources

Nadezhda Yarushkina, Aleksey Filippov, Vadim Moshkin, Yuri Egorov

Subject: Computer Science And Mathematics, Computer Science Keywords: domain ontology; semantic analysis; linguistics, text resources

Online: 1 February 2018 (03:08:47 CET)

Preprint ARTICLE | doi:10.20944/preprints202402.0332.v1

Switching Self-Attention Text Classification Model with Innovative Reverse Positional Encoding for Right-To-Left Languages: A Focus on Arabic Dialects

Laith H. Baniata, Sangwoo Kang

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Switching Self-Attention; Reverse Positional Encoding (RPE) method; Text Classification (SA); Right-to-Left Text; five-polarity; ITL

Online: 6 February 2024 (05:20:40 CET)

Preprint ARTICLE | doi:10.20944/preprints202107.0277.v1

Detection of Cervical Cancer Cells in Whole Slide Images Using Deformable and Global Context Aware Faster RCNN-FPN

Xia LI, Zhenhao Xu, Xi Shen, Yongxia Zhou, Binggang Xiao, Tie-Qiang Li

Subject: Medicine And Pharmacology, Oncology And Oncogenics Keywords: Cervical cancer; Pap smear test; whole slide image (WSI); feature pyramid network (FPN); global context aware (GCA); region based convolutional neural networks (R-CNN); Region Proposal Network (RPN).

Online: 12 July 2021 (23:05:34 CEST)

Preprint ARTICLE | doi:10.20944/preprints202111.0023.v1

AI-Crime Hunter: An AI Mixture of Experts for Crime Discovery on Twitter

Niloufar Shoeibi, Nastaran Shoeibi, Guillermo Hernández, Pablo Chamoso, Juan Manuel Corchado

Subject: Engineering, Control And Systems Engineering Keywords: Twitter; Social Media Analysis; User Behavior Mining; Crime Detection; Feature Extraction; Graph Analysis; Natural Language Processing; Text Classification; Aspect-based Sentiment Analysis; DistilBERT

Online: 1 November 2021 (15:25:19 CET)

Preprint ARTICLE | doi:10.20944/preprints202405.1123.v1

Abstractive Summarization Model for Summarizing Scientific Article

Mehtap Ülker, Ahmet Bedri Özer

Subject: Computer Science And Mathematics, Computer Science Keywords: Text summarization; Abstractive method; SciBERT; SciIE; Graph transformer

Online: 16 May 2024 (18:21:24 CEST)

Preprint ARTICLE | doi:10.20944/preprints202404.1073.v1

The Effect of Ambient Illumination and Text Color on Visual Fatigue Under Negative Polarity

Qiangqiang Fan, Jinhan Xie, Yang Wang, Zhaoyang Dong

Subject: Engineering, Industrial And Manufacturing Engineering Keywords: visual fatigue; ambient illumination; Negative polarity; text color

Online: 17 April 2024 (11:26:44 CEST)

Preprint ARTICLE | doi:10.20944/preprints202307.0462.v1

WHORU: Improving Abstractive Dialogue Summarization with Personal Pronoun Resolution

Tingting Zhou

Subject: Computer Science And Mathematics, Computer Science Keywords: text summarization; abstractive dialogue summarization; personal pronoun resolution

Online: 7 July 2023 (10:15:01 CEST)

Preprint ARTICLE | doi:10.20944/preprints202305.1326.v1

Image E-book Guidance for Improving Urinary Catheter Discomfort, Self-Efficacy, and Pain in Postoperative Patients

Hsin-Shu Huang, Hsin-Yuan Fang

Subject: Biology And Life Sciences, Behavioral Sciences Keywords: image e-book guidance; text guidance; self-efficacy

Online: 18 May 2023 (10:23:37 CEST)

Preprint ARTICLE | doi:10.20944/preprints202304.0935.v1

A Study on Generating Webtoons using Multilingual Text-to-Image Models

Kyungho Yu, Hyungho Ju, Jeongin Kim, Chanjun Chun, Pankoo Kim

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Multilingual BERT; Text-to-image; DCGAN; Webtoon; GAN

Online: 26 April 2023 (03:16:07 CEST)

Preprint ARTICLE | doi:10.20944/preprints202105.0601.v1

A Study on Ways to Improve Mobile RPG Using Big Data Text Mining

DongHyun Youm, JungYoon Kim

Subject: Computer Science And Mathematics, Algebra And Number Theory Keywords: Mobile RPG; Big Data; Text Mining; Topic Modeling

Online: 25 May 2021 (10:21:36 CEST)

Preprint ARTICLE | doi:10.20944/preprints202102.0120.v1

Integrating Text Mining and Balanced Scorecard Techniques to Investigate the Association between CEO Message of Homepage Words and Financial Status: Emphasis on Hospitals

Hyung Jong Na, Kun Chang Lee, Sung Tae Kim

Subject: Business, Economics And Management, Business And Management Keywords: Homepage words; Financial ratio; Text-mining; Balanced scorecard

Online: 3 February 2021 (15:07:40 CET)

(1) Background: The CEO message on a hospital homepage contains various content, such as the hospital's future vision, promises to customers, upgraded services and public activities. The CEO’s message includes non-financial as well as financial information about the organization. It also provides useful information not only on the organization's goals and vision but also on its performance and future strategies. This study aims to investigate associations between the CEO’s messages on hospital homepages and financial status. We used the balanced scorecard framework to analyze which content on a hospital's homepage is related to the hospital's various financial ratios. (2) Methods: We adopt a text mining method to extract significantly repeated keywords from the CEO’s message on each hospital website, and we classify these keywords using the balanced scorecard framework. To examine the relationship between keywords in the CEO’s message and the hospital’s financial ratios, a t-test is conducted on the difference in the mean TF-IDF (term frequency-inverse document frequency) of the homepage content and its relationship with the perspectives of the balanced scorecard framework. (3) Results: According to empirical results on 65 samples collected from local hospitals, there are some significant relationships between the qualitative content of the hospital's homepage and the quantitative financial ratios that indicate profitability, activity, leverage, liquidity, and transfer to essential business fund (EBF) income. (4) Conclusions: The introduction section of a homepage is the most accessible to customers, containing the aims and ideals of hospitals and reflecting their values and visions [1]. In addition, with regard to financial status, hospitals can either emphasize financial strength or focus on other areas to mask weak financial information. This study highlights the importance of disclosure on hospital websites, which can be inferred from the financial status of the hospital. It also highlights the need for harmonization between quantitative data (financial statements) and qualitative data (CEO’s messages). (5) Implications: To the best of our knowledge, this paper is the first attempt to investigate the relation between the text of hospital homepages and hospital financial ratios using text mining and the balanced scorecard framework. Hospitals play a crucial part in a country’s welfare and healthcare backbone. Nevertheless, in many countries, the hospital sector tends to remain a source of critical fiscal deficits due to ineffective and sloppy management. We expect that the results of this paper can provide hospital managers with useful information.
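As an illustration of the TF-IDF weighting mentioned in the Methods, here is a minimal sketch of one common variant (relative term frequency multiplied by the natural log of the inverse document frequency). The abstract does not state which weighting scheme or tokenizer the authors used, so the formula, the `tf_idf` function name, and the three sample messages are assumptions made for illustration only.

```python
import math
from collections import Counter

def tf_idf(documents):
    """Return one {term: score} dict per document.

    score = (term count / document length) * ln(number of documents / documents containing the term)
    """
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)

    # Document frequency: in how many documents does each term occur?
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    scores = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        scores.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores

# Hypothetical CEO messages, reduced to a few keywords each.
messages = [
    "patient care quality",
    "patient growth revenue",
    "patient community service",
]
scores = tf_idf(messages)

# A term present in every message carries no weight; a term unique to one message scores highest.
print(scores[1]["patient"], round(scores[1]["revenue"], 4))
```

Under this scheme a word that appears in every message scores zero, which is why TF-IDF tends to surface the vocabulary that distinguishes one hospital's message from the others.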

Preprint BRIEF REPORT | doi:10.20944/preprints201811.0527.v1

Mapping the Literature on Nutritional Interventions in Cognitive Health: A Data-Driven Approach

Erin I. Walsh, Nicolas Cherbuin

Subject: Biology And Life Sciences, Food Science And Technology Keywords: citation network analysis; text mining; nutrition intervention; cognition

Online: 21 November 2018 (13:50:28 CET)

Preprint ARTICLE | doi:10.20944/preprints201811.0206.v1

Towards identifying author confidence in biomedical articles

Daniela Gifu, Mihaela Onofrei, Diana Trandabat

Subject: Computer Science And Mathematics, Other Keywords: Biomedical libraries; author’s confidence; writing styles; text analysis

Online: 8 November 2018 (11:01:24 CET)

Preprint ARTICLE | doi:10.20944/preprints201810.0338.v1

Improving the Accuracy in Text Classification Methodology in Light of Modelling the Latent Semantic Relations

Nina Rizun, Yurii Taranenko, Wojciech Waloszek

Subject: Computer Science And Mathematics, Information Systems Keywords: text classification; topic modelling; latent semantic analysis; latent dirichlet allocation; hierarchical sentiment dictionary; contextually-oriented hierarchical corpus; text tonality; evaluation

Online: 16 October 2018 (07:55:35 CEST)

Preprint ARTICLE | doi:10.20944/preprints202208.0451.v1

Employing a Multilingual Transformer Model for Segmenting Unpunctuated Arabic Text

Abdullah M. Alshanqiti, Sami Albouq, Ahmad B. Alkhodre, Abdallah Namoun, Emad Nabil

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: text splitting; text tokenization; transfer learning; mask-fill prediction; NLP linguistic rules; missing punctuations; cross-lingual BERT model; Masked Language Modeling

Online: 26 August 2022 (05:19:39 CEST)

Preprint ARTICLE | doi:10.20944/preprints202407.2098.v1

Arabic X (formerly Twitter) Sentiment Analysis using Skip grams method

Rasha M. AlEidan

Subject: Computer Science And Mathematics, Computer Science Keywords: Sentiment; Arabic; machine learning; Text analysis; Tweet; Social Network

Online: 26 July 2024 (09:09:58 CEST)

Preprint ARTICLE | doi:10.20944/preprints202406.0665.v1

The Impact of Pause and Filler Words Encoding on Dementia Detection with Contrastive Learning

Reza Soleimani, Shengjie Guo, Katarina Haley, Adam Jacks, Edgar Lobaton

Subject: Engineering, Electrical And Electronic Engineering Keywords: Dementia; Contrastive learning; Deep learning; Text classification, LLMs, NLP

Online: 11 June 2024 (11:39:31 CEST)

Preprint ARTICLE | doi:10.20944/preprints202404.1579.v1

RB-GAT: A Text Classification Model Based on RoBERTa-BiGRU with Graph ATtention Network

Shaoqing Lv, Jungang Dong, Chichi Wang, Xuanhong Wang, Zhiqiang Bao

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: word embedding; RoBERTa; BiGRU; text classification; multi-head GAT

Online: 25 April 2024 (15:11:24 CEST)

Preprint ARTICLE | doi:10.20944/preprints202305.0990.v1

Research on Multilingual News Clustering Based on Cross-Language Word Embeddings

Lin Wu, Rui Li, Wong-Hing Lam

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: news; cross-language word embedding; LDA model; text clustering

Online: 15 May 2023 (07:29:01 CEST)

Working Paper ARTICLE

WATS-SMS: A T5-based French Wikipedia Abstractive Text Summarizer for SMS

Jean Louis Ebongue Kedieng Fendji, Désiré Manuel Taira, Marcellin Atemkeng, Adam Musa Ali

Subject: Computer Science And Mathematics, Algebra And Number Theory Keywords: Text summarization; Fine-tuning; Transformers; SMS; Gateway; French Wikipedia.

Online: 14 September 2021 (10:48:55 CEST)

Working Paper ARTICLE

Learning by Injection: Attention Embedded Recurrent Neural Network for Amharic Text-image Recognition

Birhanu Belay, Tewodros Habtegebrial, Gebeyehu Belay, MIllion Mesheshsa, Marcus Liwicki, Didier Stricker

Subject: Computer Science And Mathematics, Algebra And Number Theory Keywords: Amharic script; Attention mechanism; OCR; Encoder-decoder; Text-image

Online: 15 October 2020 (13:42:28 CEST)

Preprint ARTICLE | doi:10.20944/preprints201812.0306.v1

Scene to Text Conversion and a Cymatics Based Configurable Text Perception

Saeed Mian Qaisa

Subject: Engineering, Electrical And Electronic Engineering Keywords: cymatics; text detection and recognition; optical character recognition (OCR)

Online: 25 December 2018 (13:52:31 CET)

Preprint ARTICLE | doi:10.20944/preprints202407.2014.v1

AI vs. Human: Decoding Text Authenticity with Transformers

Daniela Gifu, Covaci Silviu-Vasile

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: large language models; natural language processing; content creation; text authenticity

Online: 25 July 2024 (07:29:49 CEST)

Preprint ARTICLE | doi:10.20944/preprints202407.0802.v1

Smart Governance Tool's Design to Monitor the Commitments of Bio-Business Licensing in Indonesia

Muhammad Mahreza Maulana, Arif Imam Suroso, Yani Nurhadryani, Kudang Boro Seminar

Subject: Computer Science And Mathematics, Information Systems Keywords: smart city; smart governance tool; system design; text summarization; prototype

Online: 10 July 2024 (04:25:39 CEST)

Preprint ARTICLE | doi:10.20944/preprints202405.1040.v1

(HTBNet) Arbitrary Shape Scene Text Detection with Binarization of Hyperbolic Tangent and Cross Entropy

Zhao Chen

Subject: Computer Science And Mathematics, Computer Vision And Graphics Keywords: Scene Text Detection; binarization; hyperbolic tangent; MSCA; FMCS; cross entropy

Online: 15 May 2024 (13:19:51 CEST)

Preprint ARTICLE | doi:10.20944/preprints202404.0744.v1

Analyzing the Possibilities of Using the Scilit Platform to Identify Current Energy Efficiency and Conservation Issues

Boris Chigarev

Subject: Engineering, Energy And Fuel Technology Keywords: energy efficiency, energy conservation, Scilit, bibliometric record analysis, text clustering

Online: 10 April 2024 (14:57:33 CEST)

Purpose of publication: (1) to prepare bibliometric data exported from the Scilit platform on energy efficiency and conservation for further analysis aimed at identifying relevant research topics; (2) to identify potential issues in processing data exported from the Scilit platform; (3) to give colleagues the opportunity to use the prepared data and examples of their analysis for independent research on topical issues of energy efficiency and energy conservation using materials provided by the Scilit platform. Research materials: Files in CSV and RIS formats exported from Scilit for the query "energy conservation OR efficiency" in Common Fields [Title, Abstract, Keyword], using the filters Content Type → Journal Article; Year → 2021-2023; Subject → Industrial Engineering (29.8K), Energy and Fuel Technology (9.8K), Manufacturing Engineering (9.2K). A total of 30K records sorted by relevance (10K for each year) were exported. Data are current as of 14-03-2024. Methods: Preprocessing of the title, annotation, and keyword field texts using lemmatization dictionaries collected on GitHub, removal of keywords taken from GATE and spaCy, and "manual" editing. Using VOSviewer to analyze publication topics by clustering keywords based on their co-occurrence. Using Scimago Graphica to build bubble diagrams. Applying the GSDMM algorithm to cluster bibliometric records by title and annotation texts, with a dictionary for this algorithm created from the keyword field. Using the Carrot2 demo version and the NMF algorithm for a more detailed analysis of the topics of the record clusters obtained from GSDMM. Results: The results are presented as initial and interim tables and graphs obtained in the course of this study. The full tables are provided as references to the attached materials. Supplementary material for this preprint is available on figshare:
Chigarev, Boris (2024). Supplementary material for preprint "Analyzing the Possibilities of Using the Scilit Platform to Identify Current Energy Efficiency and Conservation Issues". figshare. Dataset. https://doi.org/10.6084/m9.figshare.25574058.v1
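
The keyword co-occurrence counting that underlies this kind of keyword clustering can be sketched as follows. This is a minimal illustration in Python and not the VOSviewer implementation; the function name `cooccurrence_counts` and the sample records are hypothetical and are not taken from the Scilit export.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(records):
    """Count how often each unordered keyword pair appears in the same record."""
    counts = Counter()
    for keywords in records:
        # Normalise case and whitespace, and deduplicate within a record.
        unique = sorted({k.strip().lower() for k in keywords if k.strip()})
        # Sorting makes each pair canonical, so (a, b) and (b, a) are the same key.
        counts.update(combinations(unique, 2))
    return counts


# Hypothetical keyword lists (assumption: one list per bibliometric record).
records = [
    ["energy efficiency", "energy conservation", "text clustering"],
    ["Energy Efficiency", "text clustering"],
    ["energy conservation", "energy efficiency"],
]
pair_counts = cooccurrence_counts(records)
for pair, n in pair_counts.most_common():
    print(pair, n)
```

Pairs with higher counts would be linked more strongly in a co-occurrence map; the clustering step itself is left out of this sketch.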

Preprint REVIEW | doi:10.20944/preprints202403.0064.v1

Enabling Public Security Text-Based Analytics: A Survey to Outline Research Directions

Victor Diogho Heuer De Carvalho, Robério José Rogério Dos Santos, Thyago Celso Cavalcante Nepomuceno, Thiago Poleto

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Text mining; Public security; Survey; Applications; Opportunities; Future Research Directions

Online: 1 March 2024 (18:30:13 CET)

Preprint ARTICLE | doi:10.20944/preprints202312.0093.v1

Analytical Experimentations with Midjourney Architectural Virtual Lab: Defining Some Major Current Limits in AI-generated Representations of Islamic Architectural Heritage

Ahmad Sukkar, Mohamed W. Fareed, Moohammed Wasim Yahia, Salem Buhashima Abdalla, Iman Ibrahim, Khaldoun Abdul Karim Senjab

Subject: Engineering, Architecture, Building And Construction Keywords: Islamic architecture; architectural visualization; intangible heritage; text-to-image generation

Online: 1 December 2023 (21:29:02 CET)

Preprint ARTICLE | doi:10.20944/preprints202306.0642.v1

Adverse Crosstalk Between Extracellular Matrix Remodeling and Ferroptosis in Basal Breast Cancer

Christophe Desterke, Emma Cosialls, Yao Xiang, Rima Elhage, Clémence Duruel, Yunhua Chang-Marchand, Ahmed Hamaï

Subject: Biology And Life Sciences, Cell And Developmental Biology Keywords: basal breast cancer; extracellular matrix remodeling; ferroptosis; transcriptome; text mining

Online: 8 June 2023 (11:21:03 CEST)

(1) Background: Breast cancer is a frequent, heterogeneous disorder diagnosed in women and a major cause of mortality among them because of rapid metastasis and disease recurrence. Ferroptosis can inhibit breast cancer cell growth, improve sensitivity to chemotherapy and radiotherapy, and inhibit distant metastases, so it potentially acts on the tumor micro-environment; (2) Methods: Text-mining results from the ferroptosis/extracellular matrix (ECM) remodeling literature were integrated into a breast cancer transcriptome cohort according to distant relapse-free survival (DRFS) under adjuvant therapy (anthracycline + taxanes), and also into MDA-MB-231 transcriptome functional experiments with ferroptosis activation (GSE173905); (3) Results: Ferroptosis/ECM remodeling text mining identified 910 associated genes cited in at least 10 articles. Univariate Cox analyses of the breast cancer cohort (GSE25066) selected 252 individually significant genes, 171 of which had an adverse expression. Functional enrichment of these 171 adverse genes predicted basal breast cancer signatures. By text mining, some of the significant adverse ferroptosis genes shared citations in the domain of ECM remodeling: TNF, IL6, SET, CDKN2A, EGFR, HMGB1, KRAS, MET, LCN2, HIF1A, and TLR4. A molecular score based on the expression of these eleven genes was predictive of worst-prognosis breast cancer at the univariate level: basal subtype, short DRFS, high grade values 3 and 4, negative estrogen and progesterone receptors, and nodal stages 2 and 3. This eleven-gene signature was validated as regulated by ferroptosis inducers (erastin and RSL3) in the triple-negative breast cancer cellular model MDA-MB-231; (4) Conclusions: Crosstalk between ECM remodeling and ferroptosis functionalities made it possible to define a molecular score, which was characterized as an independent adverse parameter in the prognosis of breast cancer patients.
The gene signature of this molecular score was validated as regulated by the ferroptosis activators erastin and RSL3. This molecular score could be promising for evaluating the ECM impact of ferroptosis-targeted therapies in breast cancer.

Preprint ARTICLE | doi:10.20944/preprints202210.0247.v1

The New Version of the Anddigest Tool with Improved AI-Based Short Names Recognition

Timofey V Ivanisenko, Pavel S Demenkov, Nikolay A. Kolchanov, Vladimir A. Ivanisenko

Subject: Social Sciences, Library And Information Sciences Keywords: Text-mining; ANDDigest; ANDSystem; Named entity recognition; Machine learning; PubMedBERT

Online: 18 October 2022 (04:29:17 CEST)

Preprint ARTICLE | doi:10.20944/preprints202106.0482.v3

Fighting the COVID-19 Infodemic in News articles and False Publications: The NeoNet Text Classifier, a Supervised Machine Learning Algorithm

Mohammad AR Abdeen, Ahmed Abdeen Hamed, Xindong Wu

Subject: Computer Science And Mathematics, Algebra And Number Theory Keywords: COVID-19 Infodemic; Text Classification; TFIDF Features; Network Training modes; Supervised Learning; Misinformation; News Classification; False Publications; PubMed; Anomaly Detection

Online: 26 July 2021 (12:06:04 CEST)

Preprint ARTICLE | doi:10.20944/preprints202103.0738.v1

Impact of the Coronavirus Pandemic on Science and Society: Insights from Temporal Bibliometric Networks

Ramya Gupta, Abhishek Prasad, Suresh Babu, Gitanjali Yadav

Subject: Computer Science And Mathematics, Analysis Keywords: bibliometry; coronavirus; text and data mining; SARS; MERS; COVID-19

Online: 31 March 2021 (17:30:56 CEST)

Preprint ARTICLE | doi:10.20944/preprints202103.0380.v1

Issues and Agendas of Pandemic Crisis Management: A Text Analysis of World Economic Forum COVID-19 Reports

Hyundong Nam, Taewoo Nam

Subject: Business, Economics And Management, Accounting And Taxation Keywords: COVID-19; pandemic crisis; crisis management; text mining; network analysis

Online: 15 March 2021 (12:34:01 CET)

Preprint ARTICLE | doi:10.20944/preprints201809.0466.v1

Topological Signature of 19th Century Novelists: Persistence Homology in Context-Free Text Mining

Shafie Gholizadeh, Armin Seyeditabari, Wlodek Zadrozny

Subject: Computer Science And Mathematics, Information Systems Keywords: topological data analysis; text mining; computational topology; style; persistent homology

Online: 24 September 2018 (15:33:02 CEST)

Preprint ARTICLE | doi:10.20944/preprints202107.0200.v1

SEDIQA: Sound Emitting Document Image Quality Assessment in a Reading Aid for the Visually Impaired

Jane Courtney

Subject: Engineering, Electrical And Electronic Engineering Keywords: image quality assessment; image quality metrics; NR-IQAs; D-IQA; OCR accuracy; OCR prediction; OCR improvements; visual aids; visually impaired; reading aids; document images; text-based images

Online: 8 July 2021 (13:21:49 CEST)

Preprint REVIEW | doi:10.20944/preprints201607.0012.v1

Analysis of Access Control Methods in Cloud Computing

Madhura Mulimani, Rashmi Rachh

Subject: Computer Science And Mathematics, Information Systems Keywords: role-based access control; attribute-based access control; attribute-based encryption

Online: 8 July 2016 (10:12:21 CEST)

Preprint ARTICLE | doi:10.20944/preprints202406.0613.v1

Research on the Classification of Work Orders based on BERT and Feature Fusion

Xiong Yun Peng, Lian Guo Chen, Kuo Jun Cao

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: Text classification; Ensemble learning; Hybrid neural network model; Feature fusion; BERT

Online: 11 June 2024 (08:07:15 CEST)

Preprint ARTICLE | doi:10.20944/preprints202404.0563.v1

Development of Context-based Sentiment Classification for Intelligent Stock Market Prediction

Nurmaganbet Smatov, Ruslan Kalashnikov, Amandyk Kartbayev

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: sentiment analysis; neural networks; stock price prediction; text-mining; deep learning.

Online: 8 April 2024 (15:45:25 CEST)

Preprint ARTICLE | doi:10.20944/preprints202402.1432.v1

Differences in CEO Communication Strategies between High and Low Performing Firms in the Global Auto Parts Industry

Yunseok Hong, Keuntae Cho

Subject: Business, Economics And Management, Business And Management Keywords: CEO communication; innovation management; network analysis; text mining; auto parts industry

Online: 26 February 2024 (12:59:38 CET)

Preprint ARTICLE | doi:10.20944/preprints202311.1241.v1

The Role of the Energy Sector in Contributing to Sustainability Development Goals: A Text Mining Analysis of Literature

Luísa Cagica Carvalho, Márcia R. C. Santos

Subject: Business, Economics And Management, Business And Management Keywords: Energy Sector; Circular Economy; Sustainable Development Goals; SDG; Text Mining; VOSviewer

Online: 20 November 2023 (13:56:34 CET)

Preprint ARTICLE | doi:10.20944/preprints202311.0818.v1

Transformer Text Classification Model for Arabic Dialects that Utilizes Inductive Transfer

Laith H. Baniata, Sangwoo Kang

Subject: Computer Science And Mathematics, Computer Science Keywords: transformer; inductive transfer; text classification; Arabic dialects; positional encoding; 5-polarity

Online: 13 November 2023 (12:11:04 CET)

Preprint ARTICLE | doi:10.20944/preprints202309.1106.v1

Research on Safety Risk Transfer in Subway Construction Based on Text Mining and Complex Networks

Kunpeng Wu, Jianshe Zhang, Yanlong Huang, Hui Wang, Hujun Li, Huihua Chen

Subject: Engineering, Architecture, Building And Construction Keywords: Text mining; apriori algorithm; complex network model; subway construction; risk transfer

Online: 18 September 2023 (12:58:50 CEST)

Subway construction often takes place in a complex natural and human-machine operating environment, which makes it more prone to safety accidents that can cause substantial casualties and monetary losses. It is therefore necessary to investigate the safety risks of subway construction. The existing literature on the identification and assessment of subway construction safety risks (SCSR) is susceptible to the influence of subjective factors. Moreover, although existing studies have explored the interrelationships between different risks, they usually analyze the interrelationships of single risks, do not study risk chain transfer relationships, and fail to identify the key paths of risk transfer. This paper therefore combines text mining, association rules, and complex networks to mine subway construction safety incident reports in depth and explore the risk transfer process. First, text mining is used to identify subway construction safety risks. Then, association rules are introduced to explore the causal relationships among safety risks. Finally, the key safety risks and important transfer paths of subway construction safety accidents (SCSA) are obtained from the complex network model. The results show that (a) improper safety management, unimplemented safety subject responsibilities, violation of operation rules, an imperfect safety responsibility system, and insufficient safety education and training are the key safety risks in SCSA; (b) two shorter key risk transfer paths can be obtained in the subway construction safety network: insufficient safety education and training → lower safety awareness → violation of operation rules → safety accidents; and insufficient safety checks or hidden trouble investigations → violation of operation rules → safety accidents; (c) during risk transfer, the risk can be controlled by controlling the key nodes or cutting off the transfer path.
The results of the study provide new ideas and methods for SCSR identification and influence element mining, which help safety managers propose accurate subway construction safety risk control measures.
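
The association-rule step mentioned in this abstract can be illustrated with the two standard rule measures, support and confidence. The sketch below is a minimal Python example under stated assumptions: the function name `rule_metrics`, the risk labels, and the four sample reports are hypothetical, and the code is not the authors' apriori implementation.

```python
def rule_metrics(reports, antecedent, consequent):
    """Return (support, confidence) for the rule antecedent -> consequent.

    support    = share of all reports that contain both risks
    confidence = share of reports containing the antecedent that also
                 contain the consequent
    """
    total = len(reports)
    with_antecedent = [r for r in reports if antecedent in r]
    with_both = [r for r in with_antecedent if consequent in r]
    support = len(with_both) / total if total else 0.0
    confidence = len(with_both) / len(with_antecedent) if with_antecedent else 0.0
    return support, confidence


# Hypothetical incident reports, each reduced to a set of identified risks.
reports = [
    {"insufficient training", "low safety awareness", "rule violation"},
    {"insufficient training", "low safety awareness"},
    {"insufficient checks", "rule violation"},
    {"insufficient training", "rule violation"},
]
support, confidence = rule_metrics(
    reports, "insufficient training", "low safety awareness"
)
print(f"support={support:.2f}, confidence={confidence:.2f}")
```

Rules that pass chosen support and confidence thresholds could then serve as directed edges in a risk network, which is where a transfer path such as training → awareness → violation would come from.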

Preprint ARTICLE | doi:10.20944/preprints202309.0744.v1

Social Aspects in Energy Research & Social Science Journal Publications for 2019-2023. Bibliometric Analysis

Boris Chigarev

Subject: Social Sciences, Library And Information Sciences Keywords: subjects of publications; bibliometric analysis; text clustering; energy transition; social issues

Online: 12 September 2023 (10:54:09 CEST)

Preprint ARTICLE | doi:10.20944/preprints202302.0077.v1

aeroBERT-Classifier: Classification of Aerospace Requirements using BERT

Archana Tikayat Ray, Bjorn F. Cole, Olivia J. Pinon Fischer, Ryan T. White, Dimitri N. Mavris

Subject: Computer Science And Mathematics, Computer Science Keywords: Requirements Engineering; Natural Language Processing; NLP; BERT; Requirements Classification; Text Classification

Online: 6 February 2023 (02:26:56 CET)

Preprint REVIEW | doi:10.20944/preprints202212.0064.v1

Application of Natural Language Processing (NLP) in Detecting and Preventing Suicide Ideation: A Systematic Review

Abayomi Arowosegbe, Tope Oyelade

Subject: Medicine And Pharmacology, Psychiatry And Mental Health Keywords: Natural language processing; NLP; Text mining; Suicide; Suicide-Ideation; Mental Health

Online: 5 December 2022 (07:34:30 CET)

Introduction: Around a million people are reported to die by suicide every year, and because of the stigma associated with the nature of the death, this figure is usually assumed to be an underestimate. Suicide may be prevented if prompt intervention is taken to mitigate risk. Machine learning and artificial intelligence-based modelling, such as natural language processing (NLP) and other text analytics approaches, has the potential to become a major technique for the detection, diagnosis, and treatment of people with mental health issues. The primary aims of this research are to determine whether NLP techniques have been utilised in the field of suicide prevention and, if so, whether they were effective and what their limitations were. Methods: PubMed, EMBASE, MEDLINE, PsycInfo, and Global Health databases were searched for studies that reported use of NLP for suicide ideation or self-harm. Thematic analysis was used to synthesise and analyse the included studies. Findings were reported using the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) statement, and the Mixed Methods Appraisal Tool (MMAT) was used to assess paper quality. Results: The preliminary search of five databases generated 387 results. Removal of duplicates resulted in 158 potentially suitable studies. Twenty papers were finally included in this review. Discussion: Studies show that combining structured and unstructured data in NLP data modelling yielded more accurate results than utilizing either alone. Also, to reduce suicides, people with mental problems must be continuously and passively monitored. Further, NLP and other machine learning/artificial intelligence technologies can be used to address health inequities, and electronic health records provide valuable data for creating suicide risk tools. Finally, online, social media, and smartphone applications can be leveraged to detect people with suicide ideation.
Conclusion: The use of artificial intelligence and machine learning opens new avenues for considerably guiding risk prediction and advancing suicide prevention frameworks. The review's analysis of the included research revealed that the use of NLP may result in low-cost and effective alternatives to existing resource-intensive methods of suicide prevention. To summarise, there is substantial evidence that NLP is useful in identifying people who have suicide ideation.

Preprint ARTICLE | doi:10.20944/preprints202111.0344.v1

Extraction of the Relations between Significant Pharmacological Entities in Russian-Language Internet Reviews on Medications

Alexander Sboev, Anton Selivanov, Ivan Moloshnikov, Roman Rybka, Artem Gryaznov, Sanna Sboeva, Gleb Rylkov

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: pharmacological text corpus; automatic relation extraction; natural language processing; deep learning

Online: 19 November 2021 (10:40:10 CET)

Preprint ARTICLE | doi:10.20944/preprints201904.0170.v1

Mapping Research in Assisted Reproduction Worldwide

D. García, Francesco Alessandro Massucci, Alessandro Mosca, Ismael Ràfols, A. Rodríguez, R. Vassena

Subject: Medicine And Pharmacology, Pediatrics, Perinatology And Child Health Keywords: topic modelling; latent dirichlet allocation; text mining; assisted reproduction; ART; IVF

Online: 15 April 2019 (12:25:12 CEST)

Study question: What are the current trends of research in Human Assisted Reproduction around the world? Summary answer: The USA is the leading country, followed by the UK, China, France and Italy. The largest research area is “laboratory techniques”, although other areas such as “public health”, “quality, ethics and law” and “female factor” are gaining ground worldwide. What is known already: Scientific research, especially in health and medical sciences, aims at addressing specific needs that society (and, especially, patients) perceives as pressing. One of the main challenges for policymakers and research funders alike is therefore to align research priorities with societal needs. We can thus think of research agendas in terms of a demand side (societal needs) and a supply side (research outputs). Research output in Human Assisted Reproduction has expanded in the past years, as indicated by the increasing number of scientific publications in indexed journals in this area. Nevertheless, no map of research related to assisted reproduction has been produced so far, which hinders the identification of potential areas of improvement and need. Study design, size, duration: 26,000+ scientific publications (articles, letters, and reviews) on Human Assisted Reproduction produced worldwide between 2005 and 2016 were analyzed. These publications were indexed in PubMed or obtained from the reference lists of indexed publications included in the analysis. Participants/materials, setting, methods: The corpus of publications was obtained by combining the MeSH terms “Reproductive techniques”, “Reproductive medicine”, “Reproductive health”, “Fertility”, “Infertility”, and “Germ cells”. It was then analyzed by means of text mining algorithms (Topic Modeling (TM) based on Latent Dirichlet Allocation (LDA)) in order to obtain the main topics of interest. Finally, these categories were analyzed across world regions and time.
Main results and the role of chance: We identified 44 main topics, which were further grouped into 11 macro categories, from largest to smallest: “laboratory techniques”, “male factor”, “quality, ethics and law”, “female factor”, “public health and infectious diseases”, “basic research and genetics”, “pregnancy complications and risks”, “general infertility and ART”, “psychosocial aspects”, “cancer”, and “research methodology”. The USA was the leading country in number of publications, followed by the UK, China, France and Italy. Interestingly, research content in high-income countries is fairly homogeneous across macro-categories, and it is dominated by “laboratory techniques” in Western and Southern Europe, and by “quality, ethics and law” in North America, Australia and New Zealand. In middle-income countries we observe that research is mainly performed on “male factor”, and noticeably less on “female factor”. Finally, research on “public health and infectious diseases” predominates in low-income countries. Regarding the temporal evolution of research, “laboratory techniques” is the most abundant topic on a yearly basis, and relatively constant over time. However, since production in most of the other categories is increasing, the relative contribution of this research category is actually decreasing. Publication is especially increasing in “public health and infectious diseases” (in all world regions, but especially in low-income countries), “quality, ethics and law” (high-income countries), and “female factor” (middle-income countries). Limitations, reasons for caution: Three main factors might limit the robustness of our work: the textual corpus analyzed is based on abstracts and titles; the stochastic algorithms applied may produce slightly differing results at each run, which limits reproducibility; and the topics obtained require interpretation.
Wider implications of the findings: This study should prove beneficial in the design of research strategies and policies that foster the alignment between supply (assisted reproduction research) and demand (society). Study funding/competing interest(s): PTQ-14-06718 of the Spanish MINECO Torres Quevedo programme (FAM).

Preprint ARTICLE | doi:10.20944/preprints201810.0678.v1

Unstructured Text in EMR Improves Prediction of Death after Surgery in Children

Oguz Akbilgic, Ramin Homayouni, Kevin Heinrich, Max Raymond Langham, Jr, Robert Lowell Davis

Subject: Medicine And Pharmacology, Pediatrics, Perinatology And Child Health Keywords: post-operative death; unstructured data; logistic regression; text mining; surgery outcome

Online: 29 October 2018 (11:46:18 CET)
