Preprint
Review

This version is not peer-reviewed.

A Comprehensive Review of Automatic Methods for Suicidal Ideation Detection

Submitted:

12 February 2025

Posted:

14 February 2025

You are already at the latest version

Abstract

Suicide is a complex health concern that affects not only individuals but society as a whole. Its prevention is possible but to be effective it has to consider the medical and technological resources available for its evaluation. Nowadays, computational advancements have enabled new forms of study of this phenomenon. Particularly, the study of suicidal ideation using computer-based techniques primarily involves two main technical approaches: text-based classification and deep learning. Furthermore, it heavily relies on creating and using custom datasets with social media textual data. However, some publications have utilized public information (e.g. records from a particular healthcare provider) in their studies. In this paper, a comprehensive overview of current advancements in automatic suicidal ideation detection, focusing on computer science techniques from 2020 to 2024, is provided. Particularly, it evaluates existing and innovative methodologies, datasets, and limitations in the field to give a proper analysis based on the PRISMA methodology of the current state-of-the-art of this specific task.

Keywords: 
;  ;  ;  

1. Introduction

Suicide, despite not being considered a disease, it can be classified as a convoluted global health concern [1,2]. As revealed by the World Health Organization, an estimate of over 700,000 suicides per year happen worldwide [3]. Particularly, in 2019 over one in every 100 deaths was the result of suicide [4]. Moreover, in that year, around 77% of global self-murders occurred in low- and middle-income regions, like the Southern Cone of the American continent [4,5]. Namely, countries within this sub-territory such as Argentina, Chile, or Uruguay, despite the availability of intervention techniques to prevent suicide, still have a high suicide mortality rate with an average of 20,046 deaths from 2015 to 2019 [5].
Despite being one of the most investigated causes of human mortality with a huge volume of literature and mortality statistics, very little is still understood about suicide [6,7]. Namely, such an important health complication lacks a universally accepted definition in current research (eidem). For instance, Émile Durkheim, one of the first individuals to study this phenomenon, defined it as a classification of all deaths resulting from the victim’s own actions, distinguishing them from the notion of attempt due to the direct or indirect accomplishment of the expected mortal result [8]. Moreover, Bailey explains it as the intentional act of causing a human being’s own death [9]. In contrast, Douglas interprets it as a strategic solution by an individual to existential problems, whereas Chávez-Hernández and Leenaars perceive suicide as a universal and tangible manifestation of the struggles between human passions, biological basis, and cultural environment [8,10]. These final rationales suggest that this is a multidimensional problem whose complexity is influenced and determined by various causes, such as physiological, intrapsychic, historical, political, philosophical, and socio-cultural contexts [8,11,12]. Moreover, such risk factors hint at a general rationality and intentionality of meaning for these actions due to the establishment of a set of common detectable characteristics that shape how communities and individuals respond to the issue [8,13]. However, these may not be universal since the motivation for a convoluted outcome such as suicide depends on varied components that act together, which vary from one individual to another [14,15]. Particularly, the lack of a common suicide victim model makes these fail to capture the emergence of a suicide crisis [16,17]. Thus, the necessary, but insufficient, nature of the influence of risk factors in suicide suggests that there is still a lack of a mechanism for explaining how these general elements are linked to suicidal mortality [15,18].
For this reason, suicide is considered a multidimensional and multifaceted complex event whose understanding involves knowledge and insights drawn from multiple disciplines [12]. For instance, mixed research (qualitative and quantitative approaches), large-scale epidemiological surveys, and problem-solving therapy are examples of techniques from Suicidology, Epidemiology, and Psychology, respectively, that, in conjunction, have explored in recent times the origin, development, and outcome of suicidal activity [17,19,20]. Moreover, the broader perceptions of suicide derived from the contribution of different fields of study have expanded empirical evidence and knowledge of the phenomenon since its proper interpretation results complicated due to the multiple phenotypes that must be considered across the suicide spectrum [1,17,19]. These phenotypes include [1]:
  • Suicidal ideation: Commonly defined as thoughts about ending one’s own life. Further, literature has acknowledged its broad conceptualization by contemplating it as a set of contemplations and behaviors within a continuum of suicide potential [16,21,22].
  • Suicide attempt: Defined as a self-injurious behavior with inferred or actual intent to die. In addition, Yerly highlights the multifaceted nature of suicide by illustrating this concept as the intentional actions of ending one’s own life without achieving a fatal result [23].
  • Death by suicide: Defined as the mortal outcome of a suicide attempt.
Notably, suicidal ideation is cataloged as one of the strongest predictors of suicide attempts as it logically precedes a mortal endeavor or completed suicide [24,25]. However, literature still fails to determine when ideation is transformed into action since there is a lack of consensus about this transition [26,27]. Particularly, some studies suggest that this progression is facilitated by co-occurring psychiatric conditions that increase distress (eg. panic disorder or post-traumatic stress disorder) [1]. Other theories emphasize suicidality as a single phenomenon in need of a single overarching explanation [27]. Neither the approach, there is yet a loose collection of empirical facts unconnected by theory that need to illuminate the likelihood of suicidal behavior based on the attempter-ideator distinction [15,18,27].
Moreover, it exists limited progress derived from the paucity of research on the basic phenomenology of suicidal ideation [25]. Contemporary investigations couched it within the distinction between active (a latent intention that leads to the creation of specific plans to commit suicide) and passive (formulation of suicidal thinking without a particular plan) deliberation. Because of this, it is important to highlight the nuanced differences between suicidal thoughts and suicidal ideation since many studies have a limited empirical basis founded on the interchangeability of these concepts derived from the few explorations of the latent structure of explicit rationalizations (eidem). Suicidal thoughts are any episodic, with a quick onset and short duration, ideas or ruminations about the possibility of ending one’s own life [28]. Alternatively, suicidal ideation results in a conceptualization with a more generalized scope since it implies the progression of these contemplations to consolidate a two-factor structure based on suicidal desire and resolved planning/preparation [25,27]. Thus, a clear differentiation between notions, as well as improved knowledge of suicidality’s meta-analysis, are critical for more robustly designed prevention strategies [19,27].
Notably, current prevention strategies flaw to identify which subjects with suicidal behavior are at risk of acting on their thoughts and which ones aren’t since literature has prioritized the quantification of suicide rather than its understanding [7,17,26]. Particularly, the methodologies derived from diverse understandings about suicide (highlighting suicidal ideation phenotype) like ideation-to-action frameworks such as: the Interpersonal Theory of Suicide, the Integrated Motivational-Volitional Model, and the Three-Step Theory) have pushed techniques for researching and practically assessing the phenomenon with varying degrees of success [1,27,29]. For instance, self-reports and clinical interviews based on scales (e.g. Scale for Suicide Ideation from 1979 [24] and the Columbia–Suicide Severity Rating Scale from 2011 [30]) are the most common methods employed in the field and yet there is the need for improved strategies for its assessment [14,31,32]. Moreover, with human life shaping into digital interactions, these evaluations have not been able to suit the online context due to the need for the analysis of clinical professionals, costly research efforts, and an overall lack of comprehension of the full spectrum of suicidal ideation [32,33].
Therefore, efforts such as the SIDA scale have tried to adapt these traditional approaches to the web-based era [32]. Nonetheless, in recent times the more observable changes in suicidal ideation research have been done by the application of artificial intelligence (AI) since its identification relies on both: traditional and advanced statistical methods [34,35]. Namely, the usage of methods within the branches of Machine Learning (ML), Deep Learning (DL), and Natural Language Processing (NLP) on large datasets provides the ability to detect new patterns of factors regarding the phenomenon by the inclusion of a wider array of factors/variables to, in the future, automatically monitor and respond to harmful behaviors in real time [36,37]. In particular, the application of these advanced methods is currently based on [38]:
  • Text-based Classification: Based on Feature Engineering, the goal of this approach is to determine, via text-based interactions (e.g. social media posts), if a person has suicidal ideation. Such task is tackled using:
    -
    Tabular Features: This is structured data (statistical information or questionnaire responses) that can be employed as features for regression or classification analysis.
    -
    General Text Features: To handle unstructured data -that with no systematic order [39]- general features like N-gram features, knowledge-based features, syntactic features, context features, and class-specific features, are extracted via self-made vocabulary or dictionaries.
    -
    Affective Characteristics: Probably the most experimental approach since it combines specific psychological techniques (e.g. manual emotion categories) with the previous computational methods to perform a fine-grained sentiment analysis over suicide notes or blog streams. As a consequence of this, it is expected to better distinguish between a potentially suicidal person and a healthy one.
  • Deep Learning: Taking advantage of automatic feature learning, this approach utilizes advanced methods based on Neural Networks to boost predictive performance. Consequently, these techniques indicate a more robust preliminary success in suicidal ideation understanding compared to traditional statistical methodologies as demonstrated by Haque R, Islam N., et al [40].
Hence, these technologies have highlighted their value in predicting suicide risk by enabling low-cost and efficient alternatives to conventional techniques [41,42].
Acknowledging the broader nature of this phenomenon, the present paper aims to develop a systematic assessment of current limitations, approaches, and the impact that computational techniques have in suicidal ideation detection. Namely, this research has the objective to: (i) provide valuable recommendations or insights for future studies from the synthesis and analysis of contemporary efforts focused on a technological perspective, and (ii) explore the diversification of this topic’s literature based on the language of the data employed to assess different AI-based methodologies, predicting English as the most popular and suggesting that the subject should be further explored in more vocabularies, specifically in Spanish. Consequently, this document is organized as follows: Section 2 introduces the methodology employed for collecting investigations. Section 3 institutes a concise summary of the selected research’s characteristics, methods, metrics, and results. Moreover, Section 4 institutes a discussion of these works to understand the current state of computer-based approaches for automatic suicidal ideation detection. Finally, Section 5 establishes the author’s comment on feasible technological trends for this problem and opportunities to conclude the compilation.

2. Methodology

The scoping review of investigations utilizing Natural Language Processing technologies, alongside other computational-based techniques, to detect individuals with suicidal ideation was conducted adhering to the standards of the PRISMA Statement [43]. Notably, this approach aided the present qualitative study in increasing transparency and the quality of the reporting of publications (eidem).

2.1. Search Strategy

For this systematic review, the collection of scientific efforts related to suicidal ideation detection using computational-based approaches was made using a search query with the following filter criteria:
  • Limitation to peer-reviewed journal publications/articles.
  • Publication date between 2020 and 2024 (inclusive).
  • Use of a similar syntax based on the PICo framework applied by Arowosegbe., et al, for the searched index terms [42], making a special inquiry in the top 6 most popular languages according to Webber [44]:
    (("Machine Learning" OR ML OR "Deep Learning" OR DL OR "Natural Language Processing" OR NLP OR "Artificial Intelligence" OR AI) AND (Suicidality OR "Suicidal Ideation" OR Ideation OR Suicide) AND (English OR Spanish OR French OR Russian OR Arabic OR Chinese))

2.2. Databases

The search for suitable literature for this research was made using five scientific databases: 1)Springer Link; 2)Scopus; 3)Science Direct; 4)IEEE; and 5)MDPI. The main reason behind the election of such repositories is because of the grand Computer Science content available and the author’s ease of access.

2.3. Eligibility and Inclusion criteria

All peer-reviewed journal publications published between 2020 and 2024 were included. This range was chosen because, as mentioned by Eiberg., et al, the manifestation of this phenomenon online is constantly evolving due to technological developments [45]. Additional inclusion criteria were: 1) presentation of comparative studies between methods for suicidal ideation detection, 2) introduction of new AI-based detection methodologies, and 3) discussion of original frameworks applicable to real-life scenarios and supplemented by one or more AI-based techniques. Excluded from this review were conference papers, conference proceedings, conference reviews, books, book chapters, and reviews. Furthermore, literature related to measuring the effect of an intervention, presenting guidelines, case reports, and clinical studies, was removed as it did not include any computational-based approaches. Finally, research with full-text submissions available in languages other than English or Spanish was omitted.

2.4. Screening

A single reviewer screened titles, abstracts, and conclusions for eligibility. In particular, to meet the inclusion criteria defined above, the author briefly analyzed the abstract of each article using tools like Elicit, Scispace, and ChatPDF. When eligibility could not be determined with the obtained information, a full-text analysis was performed. Moreover, articles that appeared more than once in the complete collection were removed.

2.5. Analysis

As suggested by Arowosegbe., et al, a reflective thematic analysis (RTA) was adopted for this paper to generate a grounded discussion based on the findings from the collected studies [42]. Further, the assessment literature based on RTA, as exemplified by Byrne, resulted beneficial to the declared objective of the present research since it facilitated a flexible interpretation of qualitative data due to the identification and analysis of themes in the compiled information [42,46].

3. Results

3.1. Study Selection

As seen in the PRISMA flow diagram depicted in Figure 1, 49 publications were identified in the selected databases. Originally, the preliminary search with the criteria described in Section 2 produced 317 results. After an initial screening using each article’s title and abstract, 256 records were excluded due to 3 main reasons: 1) wrong study (not being associated with computer-based approaches); 2) wrong theme of investigation; and 3) not being publicly available for a complete individual assessment. Finally, a full review of 49 studies was developed once duplicated entries (12) were removed.

3.2. Study Characteristics

Since this research focused more on the language in which each dataset was originally written for the selected publications, an origin country analysis was omitted. In this sense, most investigations (n = 33 , 67.3 % ) utilized information written in English as seen in Figure 2.
A few exceptions, as seen in Figure 3, employed a combination of English and another language (n = 3 in total: n = 2 , 4.1 % for English/Spanish; and n = 1 , 2.0 % for English/Thai). Further, both figures highlight the fact that Chinese was the second most used language in the utilized sets (n = 5 , 10.2 % ) whereas both, Arab and Spanish, were the third most popular language (n = 3 , 6.1 % ). From the language popularity list from Webber [44], Russian and French did not appear nor were mentioned in any of the obtained literature (n = 0 , 0 % ).
As shown in Figure 4, recent literature was mainly based on ML approaches (n = 20 , 40.8 % ) or a combination of ML and DL techniques (n = 16 , 32.7 % ). However, despite the recent public consciousness of DL [47], a limited amount of publications focused their efforts on exploiting it (n = 12 , 24.5 % ). In addition, a single paper employed evolutionary algorithms (EA) for researching suicidal ideation (n = 1 , 2.0 % ).
Moreover, Figure 5 shows that most of the literature used Twitter as their primary source of data (n = 13 , 27 % ), followed by Reddit (n = 9 , 19 % ), custom sets (n = 7 , 14 % ), clinical or clinically-reviewed sets (n = 5 , 10 % ), and a combination of Reddit and Twitter (n = 3 , 6 % ).
Finally, almost every publication had classification as the main task to accomplish. Only a single document had message assessment as its main technical objective. However, Figure 6 highlights that 10 investigations focused on specific tasks apart from the generalized classification concept.
A more detailed summary of the characteristics of each included study (n = 49) is outlined in Table 1. Some information presents acronyms for some of the techniques employed. Hence, the following glossary is provided:
  • LR (ML Classifier): Logistic Regression
  • RF (ML Classifier): Random Forest
  • NB (ML Classifier): Naïve Bayes
  • GB (ML Classifier): Gradient Boosting
  • DT (ML Classifier): Decision Tree
  • SVM (ML Classifier): Support Vector Machine
  • SVC (ML Classifier): Support Vector Classifier
  • KNN (ML Classifier): K-Nearest Neighbor
  • MNB (ML Classifier): Multinomial Naive Bayes
  • SGD (ML Classifier): Stochastic Gradient Descent
  • ANN (ML Classifier): Artificial Neural Network
  • MLP (DL Classifier): Multilayer Perceptron
  • CNN (DL Classifier): Convolutional Neural Network
  • RNN (DL Classifier): Recurrent Neural Network
  • GRU (DL Classifier): Gated Recurrent Unit
  • LSTM (DL Classifier): Long Short-Term Memory
  • BERT (DL Classifier): Bidirectional Encoder Representation from Transformers
Preprints 149230 i001Preprints 149230 i002Preprints 149230 i003Preprints 149230 i004Preprints 149230 i005Preprints 149230 i006Preprints 149230 i007Preprints 149230 i008Preprints 149230 i009

3.3. Discussion

AI and its different areas of study (ML, DL, NLP) have impacted how various fields study suicide by aiding in the application and development of sophisticated statistical methods for the identification of potential suicide risk and suicidal ideation [35]. Thus, literature constantly tries to improve prediction accuracy to refine suicide care overall [36,37]. In that sense, and as demonstrated in Section 3, most investigations related to the topic employ canonical NLP features, like TF-IDF, in conjunction with ML classifiers, such as SVM or Logistic Regression, to perform an analysis based on extracted features [38]. For instance, Valeriano K, Condori-Larico A., et al, one of the few recent research focused on the use of Spanish data and the first to use these technical approaches in such language, prepared a phrase classification model to detect suicidal ideation in social networks automatically via TF-IDF andWord2vec, for pre-processing, and SVM and Logistic Regression classifiers [74].
Furthermore, automatic detection of suicidal ideation has started to explore automatic feature learning with deep learning models, in particular, CNN and LSTM [38]. For example, Fabra J, Martínez A., et al, created a new platform that enables the detection of suicide risk by combining emotional processing systems, clustering techniques, and the LSTM classification technique [71].
However, as suggested by Table 1, as well as by Figure 7, a significant amount of research has focused on employing both of these approaches (ML and DL classifiers). Notably, Abdulsalam A, Alhothali A., et al, recently created a new Arabic suicidal tweet dataset to evaluate ML and DL techniques for suicide detection in Arabic, satisfying the need for corpora in such language [69].
Notably, by making a special inquiry in the presence of research in the top 6 most popular languages according to Webber [44], English is still the predominant language in which suicide is being analyzed as it has access to ample ground truth information derived from expert diagnostics of individuals with suicidal ideation or depression [96]. As seen in Figure 8, throughout 2020 and 2024, such language was predominantly used in the datasets employed to classify this condition automatically with an outstanding difference between other languages. Multilingual approaches, as those proposed by Noraset T, Chatrinan K., et al; Wellington R, Gómez J., et al; and Pool-Cen J, Martínez H., et al [60,72,81], have used Anglo-Saxon data, along with language-specific information to detect this phenomenon in other linguistic contexts (Spanish or Thai). However, these approaches are not easily feasible due to costly effort derived from the excessive features needed for semantic representation of multiple linguistic resources [97].
In contrast, Chinese, the second most popular language for this particular task and the most spoken language in the world [98], presents a vast discrepancy to English when the amount and constancy of publications are compared over the years. Similarly, the Spanish and Arab idioms suffer from this absence of representation with even less presence in recent literature than its Asian counterpart. This deficiency, as well as the limited exploitation of techniques in languages highlighted in Figure 9, suggest the existence of several potential research branches yet to be acknowledged that would benefit extensive non-explored numerous geographic and digital linguistic populations, as is the case of the Spanish language.
Moreover, as suggested by the works included in Section 3, as a whole, the utilization of these advanced techniques has allowed the expansion of methodologies in which suicide is studied from a computer-based perspective.
Namely, from 20202 to 2024, as shown in Figure 10, the utilization of advanced technical tools in research years has facilitated the development of custom datasets, frameworks, models and specialized solutions to enhance comprehension in the efficacy of practical applications of automatic suicidal ideation detection systems. For example, Rentz D, Heckler W., et al, created a proprietary system (S-Care) for suicidal ideation detection and assistance that integrated external data sources (like hardware information and social media information) for enabling a daily tracking of a user [49]. Other examples are the works presented by Renjith S, Abraham A., et al; and Pillai A, Nepal S., et al, where they: i) explored new technological improvements for suicide detection by the application of social media data and a novel LSTM-Attention-CNN model; and ii) performed experiments to explore the generalizability of speech-based suicidal ideation detection, evaluating at the same time unsupervised (UDA) and semi-supervised (SSDA) domain adaptation to assess performance mitigation due to distribution shifts, respectively [55,80].
Additionally, as hinted in Section 3, the understanding of the current state-of-the-art of how literature studies suicide as a computationally detectable phenomenon via the application of such techniques can be expanded under the constant investigation of two main research themes: i) Suicidal ideation & Suicide; and ii) Depression & other Mental Disorders.

3.4. Suicidal Ideation & Suicide

As previously mentioned, suicide is a complex multifaceted event with multiple phenotypes that can and must be considered across its spectrum, such as suicidal ideation [1,12]. The focused study on the manifestations of the phenomenon [mainly done by technological methods] is a legitimate avenue that has increased its understanding [12].
Notably, ML and DL techniques have aided in the identification of suicide risk factors and expressions not only in isolation but also while integrating complex interactions across variables [35]. Significantly, researchers have exploited the vast amount of data passively and actively created by social media users to: i) present new classifiers based on modern pre-processing and ensemble techniques for suicidal ideation detection [50,67,93]; ii) Enhance the accuracy of detection by finetuning specific parameters or mechanisms of existing models [57,70,92,94]; iii) Compare and identify which techniques are the most effective for the identification of suicidal ideation [48,56,68,90]; iv) Create new sets that could expand the research of the phenomenon on more languages or expand current ones with synthetic information [59,69,89].
Moreover, clinical data has also been used (with less frequency than social media information) to endorse and refine suicide predictive algorithms according to a specific population, like Guiñazú M, González M., et al approach that validated a custom model (PDRM) for specific Chilean characteristics [35,52]. Furthermore, this type of data source has enabled a more diverse exploration of suicide as a phenomenon as it allows the analysis of speech-based manifestation, the evaluation of physiological characteristics (e.g. brain magnetic resonance imaging) of potentially suicidal patients, as well as the identification of linguistic features that could indicate suicidal ideation in clinical transcripts of a specific language [78,80,86].
Consequently, it can be said that the predictive analysis derived from these approaches has enabled the recognition of at-risk individuals, their likelihood of a future suicide attempt, and, overall, improved suicide management in areas of prevention, evaluation, diagnostics, treatment, and followup strategies [35].

3.5. Depression & other Mental Disorders

Currently, as complex as the phenomenon itself, some literature about suicide has suggested equating it to a cause, namely disorder of the mind [7]. Meaning that death by suicide is a consequence of a particular mental illness (e.g. Depression) and, thus its prevention revolves around the interpretation of a peculiar clinical condition (eidem). Furthermore, this specific approach categorizes and emphasizes the understanding of the type of factors that contribute to suicide risk1, as these might facilitate or increase vulnerability for developing suicidal ideation or behavior [1]. Namely, medical factors such as anxiety, depression, impulse-control disorders, self-harm, eating disorders, or post-traumatic stress disorder have become relevant risk factors for suicide, making its consideration a relevant research opportunity for automatic evaluation of data [1,8].
In that sense, more recent literature regarding the automatic detection of suicidal ideation has been complemented by the analysis of these particular disorders. Namely, the identification of relevant terms in common social posts has been improved by the utilization of domain-specific corpora from mental health issues [77]. Additionally, from this perspective, computer-based investigations have expanded beyond the basic textual analysis of data to incorporate more complex relations via the assessment of temporal information of individual users in social media [61]. Also, it has enabled the creation of sophisticated intervention strategies to assure instantaneous mood evaluation or to improve response time to crisis messages, as the ones proposed by Dogrucu A, Perucic A., et al, and Swaminathan A, López I., et al, respectively [84,85].
Nonetheless, risk factors are not universal as an individual who experiences either suicidal ideation or attempted suicide, does not necessarily die by suicide [1,14]. This suggests that approaching the automatic detection of suicide cannot be solely dependent on the evaluation of a mental disorder as these are not the same [7]. Thus, research should also be complemented with culturally informed insights to avoid the exacerbation of this phenomenon and ultimately develop robust methodologies for improved preventive strategies [7,14].

3.6. Limitations

From 2020 to 2024, each research presented unique limitations that affected their development as the complexity of automatically assessing suicidal ideation is recently being studied. However, it was possible to summarize what the most common constraints are currently being discussed in ongoing literature, and that represent a valuable insight for further research regarding this particular theme:
  • Limited dataset size for more robust experimentation development like expanded statistical analysis or the application of cross-validation [62,90,94].
  • Created custom models lacked interpretability in clinical-based scenarios, affecting their reproducibility and real-life applications [58,79,89].
  • Datasets contained insufficient and imbalanced data that made it harder for some models to capture semantic relationships and affected the detection [66,67,76].
  • Datasets that used social media data lacked ground truth values and, as a whole, the source can affect reliability. This approach was mainly made due to the restrained access to clinical-assessed data [51,69,73].
  • Data did not consider demographic or culturally specific characteristics of the language [59,64,83].
  • Absence of domain-specific corpora in languages apart from English [36,64,74].
Despite being external constraints, each of the presented limitations is and should still be addressed in future investigations exploiting new technologies and methodologies to continue the development of solutions for this phenomenon. For example, more recent research, like the one presented by Ghanadian H, Nejadgholi I., et al, focuses on the expansion of data with the implementation of IA-based content created with Large Language Models [89]. Another example is the application of curated datasets that have been previously used or collected in past related studies to continue the exploration of new approaches [54,56,68,93], or the use of specific regional data to comprehend suicide [52] particularly.

4. Conclusion

Suicide, and more particularly Suicidal Ideation, is an issue whose understanding relies on the exploitation of different strategies ranging in numerous fields [99]. Particularly, the identification of potential suicide risk and suicidal ideation relies on traditional and advanced statistical methods [35]. Hence, it is through technology-driven approaches that literature is allowed to encompass different and more complex manifestations of suicidal ideation in a more digital era by enabling efficient alternatives to conventional techniques [35].
Currently, the most popular techniques in related publications during the last years have been: 1) text-bassed classification, with ML models like SVM, ANN, CRF, Logistic Regression, Naïve Bayes, Random Forest, XGBoost, Decision Trees and KNN; and 2) deep learning, with models like DNN, LSTM, CNN, BERT and Ensemble Learning (learning based on the combination of two or more models [100]). Each of these advanced statistical methods has enabled the computational study of suicide through its broad spectrum considering phenotypes like suicidal ideation and potential risk factors such as mental illnesses, depression being the most studied. Moreover, these applications derived in the creation of solutions ranging from custom ML or DL models for classification; to former applications and proto-frameworks for testing its potential effectiveness in real-life scenarios.
Nonetheless, before AI can be integrated into clinical practice and more robust predictive frameworks, there are still limitations to be addressed [35]. Namely, current literature is mostly centered in Anglo-Saxon contexts, leaving the implementation of these novel methods out of other languages. As highlighted in Figure 8 and in Figure 9, there is a considerable gap in research that exploits ML or DL for detecting suicidal ideation in the most popular languages of the world according toWebber [44]. This also highlights the impediment of not having investigations that explore the influence of cultural characteristics manifested in a particular idiom. Another barrier that constrains modern-day investigations is the limited access to ground-truth data and the reliability of social media information. This has affected the overall amount of entries in datasets for robustness and more accurate results in experimentation processes.
Hence, to aid in the constant tackling of knowledge gaps to improve the treatment and prevention strategies of this phenomenon, this document suggests that future research, based on the analysis elaborated in this investigation, should be focused on:
  • Detecting implicit suicidal manifestations and ambiguous emotions in sentences.
  • Improve the integration of user-profiles and linguistic features for social media data to improve suicide ideation detection.
  • Expand the inclusion of parameters that add demographic context to data such as gender and location.
  • Data from social media should include images, videos, and emoticons in its analysis to provide a complete assessment of potential suicidal risk.
  • Explore language diversity beyond English in models, as well as cross-lingual frameworks.
  • Address data sparsity and imbalance challenges.
  • Validate results through large-scale research with larger datasets, ground-truth clinical data, and specialized vocabulary.
  • Continue the exploration of the impact of risk factors in the detection of suicidal ideation automatically.
  • Test models generalization for potential clinical detection applications.
  • Continue exploring the performance of novel techniques and datasets from various social media sites for this task.
  • Continue incorporating a timeline analysis in social media data to improve the identification of individual risk cases.

Author Contributions

Conceptualization, Francisco Ariel Arenas Enciso; methodology, Mahdi Zareei; formal analysis, Alejandro de León Languré; investigation,Francisco Ariel Arenas Enciso; resources, Rajesh Roshan Biswa; writing—original draft preparation, Francisco Ariel Arenas Enciso; writing—review and editing, Mahdi Zareei and Alejandro de León Languré, Enrique Garcia-Ceja; supervision, Mahdi Zareei; funding acquisition, Rajesh Roshan Biswa. All authors have read and agreed to the published version of the manuscript. Conflicts of Interest: The authors declare no conflicts of interest.

References

  1. Turecki, G.; Brent, D.A.; Gunnell, D.; O’Connor, R.C.; Oquendo, M.A.; Pirkis, J.; Stanley, B.H. Suicide and suicide risk. Nature Reviews Disease Primers 2019, 5, 74. [Google Scholar] [CrossRef] [PubMed]
  2. Gabilondo, A.; Muela, A.; Belarra, B.; De Sayas, A.; García, J.; López, P.; Reich, H.; Iruin, Á. Evaluación de BIZI, nuevo programa en línea en español para prevenir el suicidio desde la comunidad. Revista Panamericana de Salud Pública 2024, 48, 1. [Google Scholar] [CrossRef]
  3. Organization, W.H. World Suicide Prevention Day 2023, 2023.
  4. for Suicide Prevention, T.I.A. Global Suicide Statistics, 2023.
  5. Organization, P.A.H. Suicide Mortality in the Americas – Regional Report 2015-2019; Pan American Health Organization, 2021. [CrossRef]
  6. SM Masango, S.R.; Motojesi, A. Suicide and suicide risk factors: A literature review. South African Family Practice 2008, 50, 25–29. [Google Scholar] [CrossRef]
  7. Pridmore, S.; Varbanov, S.; Aleksandrov, I.; Shahtahmasebi, S. Social Attitudes to Suicide and Suicide Rates. Open Journal of Social Sciences 2016, 04, 39–58. [Google Scholar] [CrossRef]
  8. Minayo, M.C.d.S.; Cavalcante, F.G.; Souza, E.R.d. Methodological proposal for studying suicide as a complex phenomenon. Cadernos de saude publica 2006, 22, 1587–1596. [Google Scholar] [CrossRef]
  9. Bailey, R.K.; Patel, T.C.; Avenido, J.; Patel, M.; Jaleel, M.; Barker, N.C.; Khan, J.A.; All, S.; Jabeen, S. Suicide: current trends. Journal of the National Medical Association 2011, 103, 614–617. [Google Scholar] [CrossRef]
  10. Chávez-Hernández, A.M.; Leenaars, A.A. Edwin S Shneidman y la suicidología moderna. Salud mental 2010, 33, 355–360. [Google Scholar]
  11. for Disease Control, C. ; Prevention. Facts about suicide, 2023.
  12. Leenaars, A.A. Suicide: A Multidimensional Malaise 1996. 26, 221–236. [CrossRef]
  13. Suicide, I.o.M.U.C.o.P.; of Adolescent, P. ; Adult.; Goldsmith, S.K.; Pellmar, T.C.; Kleinman, A.M.; Bunney, W.E., Society and Culture. In Reducing Suicide: A National Imperative; National Academies Press (US), 2002.
  14. Systematic review of risk factors for suicide and suicide attempt among psychiatric patients in Latin America and Caribbean.
  15. Mościcki, E.K. IDENTIFICATION OF SUICIDE RISK FACTORS USING EPIDEMIOLOGIC STUDIES. Psychiatric Clinics of North America 1997, 20, 499–517. [Google Scholar] [CrossRef]
  16. Harmer, B.; Lee, S.; Duong, T.; Saadabadi, A. Suicid al ideation 2020.
  17. Kral, M.J.; Links, P.S.; Bergmans, Y. Suicide studies and the need for mixed methods research. Journal of Mixed Methods Research 2012, 6, 236–249. [Google Scholar] [CrossRef]
  18. Hood-Williams, J. Studying suicide. Health & Place 1996, 2, 167–177. [Google Scholar] [CrossRef]
  19. Sareen, J.; Isaak, C.; Katz, L.Y.; Bolton, J.; Enns, M.W.; Stein, M.B. Promising strategies for advancement in knowledge of suicide risk factors and prevention. American journal of preventive medicine 2014, 47, S257–S263. [Google Scholar] [CrossRef] [PubMed]
  20. Vadon, N.B.; Calati, R.; Gonda, X. Evidence-based psychotherapeutic methods of prevention and intervention in suicidal behaviour. Neuropsychopharmacologia Hungarica: a Magyar Pszichofarmakologiai Egyesulet lapja= official journal of the Hungarian Association of Psychopharmacology 2022, 24, 170–179. [Google Scholar] [PubMed]
  21. Botega, N.J.; Barros, M.B.d.A.; Oliveira, H.B.d.; Dalgalarrondo, P.; Marín-León, L. Suicidal behavior in the community: prevalence and factors associated with suicidal ideation. Brazilian Journal of Psychiatry 2005, 27, 45–53. [Google Scholar] [CrossRef]
  22. Cooper-Patrick, L.; Crum, R.M.; Ford, D.E. Identifying suicidal ideation in general medical patients. Jama 1994, 272, 1757–1762. [Google Scholar] [CrossRef] [PubMed]
  23. DiKtaş Yerli, G. SUICIDE AS A SOCIOLOGICAL FACT AND ITS DIMENSION IN TURKEY 2023. p. 52. [CrossRef]
  24. Beck, A.T.; Kovacs, M.; Weissman, A. Assessment of suicidal intention: the Scale for Suicide Ideation. Journal of consulting and clinical psychology 1979, 47, 343. [Google Scholar] [CrossRef]
  25. Wastler, H.M.; Khazem, L.R.; Ammendola, E.; Baker, J.C.; Bauder, C.R.; Tabares, J.; Bryan, A.O.; Szeto, E.; Bryan, C.J. An empirical investigation of the distinction between passive and active ideation: Understanding the latent structure of suicidal thought content. Suicide and Life-Threatening Behavior 2023, 53, 219–226. [Google Scholar] [CrossRef]
  26. Dhingra, K.; Boduszek, D.; O’Connor, R.C. Differentiating suicide attempters from suicide ideators using the Integrated Motivational–Volitional model of suicidal behaviour. Journal of affective disorders 2015, 186, 211–218. [Google Scholar] [CrossRef]
  27. Klonsky, E.D.; Saffer, B.Y.; Bryan, C.J. Ideation-to-action theories of suicide: a conceptual and empirical update. Current opinion in psychology 2018, 22, 38–43. [Google Scholar] [CrossRef] [PubMed]
  28. Kleiman, E.M.; Nock, M.K. Real-time assessment of suicidal thoughts and behaviors. Current Opinion in Psychology 2018, 22, 33–37. [Google Scholar] [CrossRef]
  29. Marchionatti, L.E.; Magalhães, P.V.d.S. On suicidal ideation: the need for inductive methodologies to advance the field. Trends in psychiatry and psychotherapy 2023, 45, e20230655. [Google Scholar] [CrossRef]
  30. Posner, K.; Brown, G.K.; Stanley, B.; Brent, D.A.; Yershova, K.V.; Oquendo, M.A.; Currier, G.W.; Melvin, G.A.; Greenhill, L.; Shen, S.; et al. The Columbia–Suicide Severity Rating Scale: initial validity and internal consistency findings from three multisite studies with adolescents and adults. American journal of psychiatry 2011, 168, 1266–1277. [Google Scholar] [CrossRef]
  31. Terrill, D.R.; Rodriguez-Seijas, C.; Zimmerman, M. Assessing suicidal ideation using a brief self-report measure. Psychiatry Research 2021, 297, 113737. [Google Scholar] [CrossRef] [PubMed]
  32. Van Spijker, B.A.; Batterham, P.J.; Calear, A.L.; Farrer, L.; Christensen, H.; Reynolds, J.; Kerkhof, A.J. The Suicidal Ideation Attributes Scale (SIDAS): Community-based validation study of a new scale for the measurement of suicidal ideation. Suicide and Life-Threatening Behavior 2014, 44, 408–419. [Google Scholar] [CrossRef] [PubMed]
  33. Giddens, J.M.; Sheehan, K.H.; Sheehan, D.V. The Columbia-suicide severity rating scale (C–SSRS): has the “gold standard” become a liability? Innovations in clinical neuroscience 2014, 11, 66. [Google Scholar]
  34. Gupta, M.; Ramar, D.; Vijayan, R.; Gupta, N. Artificial intelligence tools for suicide prevention in adolescents and young adults. Adolescent Psychiatry 2022, 12, 1–10. [Google Scholar] [CrossRef]
  35. Fonseka, T.M.; Bhat, V.; Kennedy, S.H. The utility of artificial intelligence in suicide risk prediction and the management of suicidal behaviors. Australian & New Zealand Journal of Psychiatry 2019, 53, 954–964. [Google Scholar]
  36. Halsband, A.; Heinrichs, B. AI, suicide prevention and the limits of beneficence. Philosophy & Technology 2022, 35, 103. [Google Scholar]
  37. Dhelim, S.; Chen, L.; Ning, H.; Nugent, C. Artificial intelligence for suicide assessment using Audiovisual Cues: a review. Artificial Intelligence Review 2023, 56, 5591–5618. [Google Scholar] [CrossRef]
  38. Ji, S.; Pan, S.; Li, X.; Cambria, E.; Long, G.; Huang, Z. Suicidal ideation detection: A review of machine learning methods and applications. IEEE Transactions on Computational Social Systems 2020, 8, 214–226. [Google Scholar] [CrossRef]
  39. Thomas, D.A. Searching for significance in unstructured data: Text mining with Leximancer. European Educational Research Journal 2014, 13, 235–256. [Google Scholar] [CrossRef]
  40. Haque, R.; Islam, N.; Islam, M.; Ahsan, M.M. A Comparative Analysis on Suicidal Ideation Detection Using NLP, Machine, and Deep Learning. Technologies 2022, 10, 57. [Google Scholar] [CrossRef]
  41. Barua, P.D.; Vicnesh, J.; Lih, O.S.; Palmer, E.E.; Yamakawa, T.; Kobayashi, M.; Acharya, U.R. Artificial intelligence assisted tools for the detection of anxiety and depression leading to suicidal ideation in adolescents: a review. Cognitive Neurodynamics 2022, pp. 1–22.
  42. Arowosegbe, A.; Oyelade, T. Application of Natural Language Processing (NLP) in Detecting and Preventing Suicide Ideation: A Systematic Review. International Journal of Environmental Research and Public Health 2023, 20, 1514. [Google Scholar] [CrossRef] [PubMed]
  43. Page, M.J.; Moher, D.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. PRISMA 2020 explanation and elaboration: updated guidance and exemplars for reporting systematic reviews. BMJ 2021, 372. [Google Scholar] [CrossRef]
  44. Weber, G. The World’s 10 Most Influential Languages. AATF National Bulletin 2008, 24. [Google Scholar]
  45. Holm, M.E.; Østergaard, L.D.; Aamund, K.; Jørgensen, K.; Midtgaard, J.; Vinberg, M.; Nordentoft, M. What methods are used in research of firsthand experiences with online self-harming and suicidal behavior? A scoping review. Nordic Journal of Psychiatry 2024, 78, 165–180. [Google Scholar] [CrossRef]
  46. Byrne, D. A worked example of Braun and Clarke’s approach to reflexive thematic analysis. Quality & Quantity 2022, 56, 1391–1412. [Google Scholar] [CrossRef]
  47. Hatcher, W.G.; Yu, W. A Survey of Deep Learning: Platforms, Applications and Emerging Research Trends. IEEE Access 2018, 6, 24411–24432. [Google Scholar] [CrossRef]
  48. Santoso, M.S.; Suryadi, J.J.; Marchellino, K.; Nabiilah, G.Z.; Rojali. A Comparative Analysis of Decision Tree and Support Vector Machine on Suicide Ideation Detection. Procedia Computer Science 2023, 227, 518–523. [CrossRef]
  49. Rentz, D.M.; Heckler, W.F.; Barbosa, J.L.V. A computational model for assisting individuals with suicidal ideation based on context histories. Universal Access in the Information Society 2023. [Google Scholar] [CrossRef]
  50. K, S.; M, E.; V, A. A Hybrid Attention Based Deep Learning System for Suicidal Ideation Detection in Social Media 2024. [CrossRef]
  51. Chadha, A.; Kaushik, B. A Hybrid Deep Learning Model Using Grid Search and Cross-Validation for Effective Classification and Prediction of Suicidal Ideation from Social Network Data. New Generation Computing 2022, 40, 889–914. [Google Scholar] [CrossRef]
  52. Guiñazú, M.F.; González, M.; Ruiz, R.B.; Hernández, V.; Diez, S.B.; Velásquez, J.D. A novel depression risk prediction model based on data fusion from Chilean National Health Surveys to diagnose risk depression among patients with mood disorders. Information Fusion 2023, 100, 101960. [Google Scholar] [CrossRef]
  53. Ben Hassine, M.A.; Abdellatif, S.; Ben Yahia, S. A novel imbalanced data classification approach for suicidal ideation detection on social media. Computing 2022, 104, 741–765. [Google Scholar] [CrossRef]
  54. Qorich, M.; El Ouazzani, R. Advanced deep learning and large language models for suicide ideation detection on social media. Progress in Artificial Intelligence 2024, 13, 135–147. [Google Scholar] [CrossRef]
  55. Renjith, S.; Abraham, A.; Jyothi, S.B.; Chandran, L.; Thomson, J. An ensemble deep learning technique for detecting suicidal ideation from posts in social media platforms. Journal of King Saud University - Computer and Information Sciences 2022, 34, 9564–9575. [Google Scholar] [CrossRef]
  56. Saha, S.; Dasgupta, S.; Anam, A.; Saha, R.; Nath, S.; Dutta, S. An Investigation of Suicidal Ideation from Social Media Using Machine Learning Approach. Baghdad Science Journal 2023, 20, 1164. [Google Scholar] [CrossRef]
  57. Baghdadi, N.A.; Malki, A.; Magdy Balaha, H.; AbdulAzeem, Y.; Badawy, M.; Elhosseini, M. An optimized deep learning approach for suicide detection through Arabic tweets. PeerJ Computer Science 2022, 8, e1070. [Google Scholar] [CrossRef] [PubMed]
  58. Tang, H.; Miri Rekavandi, A.; Rooprai, D.; Dwivedi, G.; Sanfilippo, F.M.; Boussaid, F.; Bennamoun, M. Analysis and evaluation of explainable artificial intelligence on suicide risk assessment. Scientific Reports 2024, 14, 6163. [Google Scholar] [CrossRef]
  59. Hassib, M.; Hossam, N.; Sameh, J.; Torki, M. AraDepSu: Detecting Depression and Suicidal Ideation in Arabic Tweets Using Transformers. In Proceedings of the Proceedings of the The Seventh Arabic Natural Language Processing Workshop (WANLP), Abu Dhabi, United Arab Emirates (Hybrid), 2022; p. 302–311. [CrossRef]
  60. Acuña Caicedo, R.W.; Gómez Soriano, J.M.; Melgar Sasieta, H.A. Assessment of supervised classifiers for the task of detecting messages with suicidal ideation. Heliyon 2020, 6, e04412. [Google Scholar] [CrossRef]
  61. García-Noguez, L.R.; Tovar-Arriaga, S.; Paredes-García, W.J.; Ramos-Arreguín, J.M.; Aceves-Fernandez, M.A. Automatic classification of depressive users on Twitter including temporal analysis. Network Modeling Analysis in Health Informatics and Bioinformatics 2023, 12, 38. [Google Scholar] [CrossRef]
  62. Zhang, T.; Schoene, A.M.; Ananiadou, S. Automatic identification of suicide notes with a transformer-based deep learning model. Internet Interventions 2021, 25, 100422. [Google Scholar] [CrossRef]
  63. Sarsam, S.M.; Al-Samarraie, H.; Alzahrani, A.I.; Mon, C.S.; Shibghatullah, A.S. Characterizing Suicide Ideation by Using Mental Disorder Features on Microblogs: A Machine Learning Perspective. International Journal of Mental Health and Addiction 2022. [Google Scholar] [CrossRef]
  64. Lee, D.; Park, S.; Kang, J.; Choi, D.; Han, J. Cross-Lingual Suicidal-Oriented Word Embedding toward Suicide Prevention. In Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, Online; 2020; pp. 2208–2217. [Google Scholar] [CrossRef]
  65. Stapelberg, N.J.; Randall, M.; Sveticic, J.; Fugelli, P.; Dave, H.; Turner, K. Data mining of hospital suicidal and self-harm presentation records using a tailored evolutionary algorithm. Machine Learning with Applications 2021, 3, 100012. [Google Scholar] [CrossRef]
  66. Li, Z.; Zhou, J.; An, Z.; Cheng, W.; Hu, B. Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data. Entropy 2022, 24, 442. [Google Scholar] [CrossRef] [PubMed]
  67. Li, Z.; Cheng, W.; Zhou, J.; An, Z.; Hu, B. Deep learning model with multi-feature fusion and label association for suicide detection. Multimedia Systems 2023, 29, 2193–2203. [Google Scholar] [CrossRef]
  68. Aldhyani, T.H.H.; Alsubari, S.N.; Alshebami, A.S.; Alkahtani, H.; Ahmed, Z.A.T. Detecting and Analyzing Suicidal Ideation on Social Media Using Deep Learning and Machine Learning Models. International Journal of Environmental Research and Public Health 2022, 19, 12635–12635. [Google Scholar] [CrossRef] [PubMed]
  69. Abdulsalam, A.; Alhothali, A.; Al-Ghamdi, S. Detecting Suicidality in Arabic Tweets Using Machine Learning and Deep Learning Techniques. Arabian Journal for Science and Engineering 2024. [Google Scholar] [CrossRef]
  70. Rabani, S.T.; Ud Din Khanday, A.M.; Khan, Q.R.; Hajam, U.A.; Imran, A.S.; Kastrati, Z. Detecting suicidality on social media: Machine learning at rescue. Egyptian Informatics Journal 2023, 24, 291–302. [Google Scholar] [CrossRef]
  71. Fabra, J.; Martínez-Martínez, A.B.; Aragón, I.; López-Del-Hoyo, Y.; Pérez-Yus, C.; Oliván, B. Detecting Suicide Risk Through Twitter 2020.
  72. Pool-Cen, J.; Carlos-Martínez, H.; Hernández-Chan, G.; Sánchez-Siordia, O. Detection of Depression-Related Tweets in Mexico Using Crosslingual Schemes and Knowledge Distillation. Healthcare 2023, 11, 1057. [Google Scholar] [CrossRef]
  73. Ramírez-Cifuentes, D.; Freire, A.; Baeza-Yates, R.; Puntí, J.; Medina-Bravo, P.; Velazquez, D.A.; Gonfaus, J.M.; Gonzàlez, J. Detection of Suicidal Ideation on Social Media: Multimodal, Relational, and Behavioral Analysis. Journal of Medical Internet Research 2020, 22, e17758. [Google Scholar] [CrossRef]
  74. Valeriano, K.; Condori-Larico, A.; Sulla-Torres, J. Detection of Suicidal Intent in Spanish Language Social Networks using Machine Learning. International Journal of Advanced Computer Science and Applications 2020, 11. [Google Scholar] [CrossRef]
  75. Tadesse, M.M.; Lin, H.; Xu, B.; Yang, L. Detection of Suicide Ideation in Social Media Forums Using Deep Learning. Algorithms 2019, 13, 7. [Google Scholar] [CrossRef]
  76. Kodati, D.; Tene, R. Emotion mining for early suicidal threat detection on both social media and suicide notes using context dynamic masking-based transformer with deep learning. Multimedia Tools and Applications 2024. [Google Scholar] [CrossRef]
  77. Ramirez-Cifuentes, D.; Largeron, C.; Tissier, J.; Baeza-Yates, R.; Freire, A. Enhanced Word Embedding Variations for the Detection of Substance Abuse and Mental Health Issues on Social Media Writings. IEEE Access 2021, 9, 130449–130471. [Google Scholar] [CrossRef]
  78. Huang, R.; Yi, S.; Chen, J.; Chan, K.Y.; Chan, J.W.Y.; Chan, N.Y.; Li, S.X.; Wing, Y.K.; Li, T.M.H. Exploring the Role of First-Person Singular Pronouns in Detecting Suicidal Ideation: A Machine Learning Analysis of Clinical Transcripts. Behavioral Sciences 2024, 14, 225. [Google Scholar] [CrossRef] [PubMed]
  79. Ge, F.; Jiang, J.; Wang, Y.; Yuan, C.; Zhang, W. Identifying Suicidal Ideation Among Chinese Patients with Major Depressive Disorder: Evidence from a Real-World Hospital-Based Study in China. Neuropsychiatric Disease and Treatment 2020, Volume 16, 665–672. [Google Scholar] [CrossRef]
  80. Pillai, A.; Nepal, S.K.; Wang, W.; Nemesure, M.; Heinz, M.; Price, G.; Lekkas, D.; Collins, A.C.; Griffin, T.; Buck, B.; et al. Investigating Generalizability of Speech-based Suicidal Ideation Detection Using Mobile Phones. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies 2023, 7, 1–38. [Google Scholar] [CrossRef]
  81. Noraset, T.; Chatrinan, K.; Tawichsri, T.; Thaipisutikul, T.; Tuarob, S. Language-agnostic deep learning framework for automatic monitoring of population-level mental health from social networks. Journal of Biomedical Informatics 2022, 133, 104145. [Google Scholar] [CrossRef] [PubMed]
  82. Pan, W.; Wang, X.; Zhou, W.; Hang, B.; Guo, L. Linguistic Analysis for Identifying Depression and Subsequent Suicidal Ideation on Weibo: Machine Learning Approaches. International Journal of Environmental Research and Public Health 2023, 20, 2688. [Google Scholar] [CrossRef]
  83. Malko, A.; Duenser, A.; Kangas, M.; Mollá-Aliod, D.; Paris, C. Message similarity as a proxy to repetitive thinking: Associations with non-suicidal self-injury and suicidal ideation on social media. Computers in Human Behavior Reports 2023, 11, 100320. [Google Scholar] [CrossRef]
  84. Dogrucu, A.; Perucic, A.; Isaro, A.; Ball, D.; Toto, E.; Rundensteiner, E.A.; Agu, E.; Davis-Martin, R.; Boudreaux, E. Moodable: On feasibility of instantaneous depression assessment using machine learning on voice samples with retrospectively harvested smartphone and social media data. Smart Health 2020, 17, 100118. [Google Scholar] [CrossRef]
  85. Swaminathan, A.; López, I.; Mar, R.A.G.; Heist, T.; McClintock, T.; Caoili, K.; Grace, M.; Rubashkin, M.; Boggs, M.N.; Chen, J.H.; et al. Natural language processing system for rapid detection and intervention of mental health crisis chat messages. npj Digital Medicine 2023, 6, 213. [Google Scholar] [CrossRef]
  86. Lee, D.Y.; Byeon, G.; Kim, N.; Son, S.J.; Park, R.W.; Park, B. Neuroimaging and natural language processing-based classification of suicidal thoughts in major depressive disorder. Translational Psychiatry 2024, 14, 276. [Google Scholar] [CrossRef] [PubMed]
  87. Ramachandran, A.; Gadwe, A.; Poddar, D.; Satavalekar, S.; Sahu, S. Performance Evaluation of Different Machine Learning Techniques using Twitter Data for Identification of Suicidal Intent. In Proceedings of the 2020 International Conference on Electronics and Sustainable Communication Systems (ICESC), Coimbatore, India; 2020; pp. 223–227. [Google Scholar] [CrossRef]
  88. Rawat, B.P.S.; Kovaly, S.; Yu, H.; Pigeon, W. ScAN: Suicide Attempt and Ideation Events Dataset. In Proceedings of the Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Seattle, United States, 2022. [CrossRef]
  89. Ghanadian, H.; Nejadgholi, I.; Osman, H.A. Socially Aware Synthetic Data Generation for Suicidal Ideation Detection Using Large Language Models. IEEE Access 2024, 12, 14350–14363. [Google Scholar] [CrossRef]
  90. Priyamvada, B.; Singhal, S.; Nayyar, A.; Jain, R.; Goel, P.; Rani, M.; Srivastava, M. Stacked CNN - LSTM approach for prediction of suicidal ideation on social media. Multimedia Tools and Applications 2023, 82, 27883–27904. [Google Scholar] [CrossRef]
  91. Ji, S.; Li, X.; Huang, Z.; Cambria, E. Suicidal ideation and mental disorder detection with attentive relation networks. Neural Computing and Applications 2022, 34, 10309–10319. [Google Scholar] [CrossRef]
  92. J, A.K.; Trueman, T.E.; A K, A. Suicidal risk identification in social media. Procedia Computer Science 2021, 189, 368–373. [Google Scholar] [CrossRef]
  93. Goel, R.; Digalwar, M. Suicidal Thought Detection using Max Voting Ensemble Technique. Procedia Computer Science 2024, 235, 2587–2598. [Google Scholar] [CrossRef]
  94. Chatterjee, M.; Kumar, P.; Samanta, P.; Sarkar, D. Suicide ideation detection from online social media: A multi-modal feature based technique. International Journal of Information Management Data Insights 2022, 2, 100103. [Google Scholar] [CrossRef]
  95. Ghosh, S.; Ekbal, A.; Bhattacharyya, P. VAD-assisted multitask transformer framework for emotion recognition and intensity prediction on suicide notes. Information Processing & Management 2023, 60, 103234. [Google Scholar] [CrossRef]
  96. Pool-Cen, J.; Carlos-Martínez, H.; Hernández-Chan, G.; Sánchez-Siordia, O. Detection of Depression-Related Tweets in Mexico Using Crosslingual Schemes and Knowledge Distillation. In Proceedings of the Healthcare. MDPI, 2023, Vol. 11, p. 1057.
  97. Valeriano, K.; Condori-Larico, A.; Sulla-Torres, J. Detection of suicidal intent in Spanish language social networks using machine learning. International Journal of Advanced Computer Science and Applications 2020, 11. [Google Scholar] [CrossRef]
  98. Schools, M.L. , 2023.
  99. Rajesh Kumar, E.; Rama Rao, K.; Nayak, S.R.; Chandra, R. Suicidal ideation prediction in twitter data using machine learning techniques. Journal of Interdisciplinary Mathematics 2020, 23, 117–125. [Google Scholar] [CrossRef]
  100. Lones, M.A. How to avoid machine learning pitfalls: a guide for academic researchers. arXiv preprint arXiv:2108.02497 2021.
Figure 1. PRISMA Flow Diagram for identification of studies related to suicidal ideation
Figure 1. PRISMA Flow Diagram for identification of studies related to suicidal ideation
Preprints 149230 g001
Figure 2. Language popularity in datasets used in collected literature
Figure 2. Language popularity in datasets used in collected literature
Preprints 149230 g002
Figure 3. Collected literature count based on Language
Figure 3. Collected literature count based on Language
Preprints 149230 g003
Figure 4. Technical approach popularity in collected literature
Figure 4. Technical approach popularity in collected literature
Preprints 149230 g004
Figure 5. Dataset origin popularity in collected literature
Figure 5. Dataset origin popularity in collected literature
Preprints 149230 g005
Figure 6. Main task counting per publication
Figure 6. Main task counting per publication
Preprints 149230 g006
Figure 7. Main task counting per publication
Figure 7. Main task counting per publication
Preprints 149230 g007
Figure 8. Number of publications per year by language
Figure 8. Number of publications per year by language
Preprints 149230 g008
Figure 9. Percentage of usage of different technological approaches (ML & DL) for suicide and suicidal ideation detection in different languages
Figure 9. Percentage of usage of different technological approaches (ML & DL) for suicide and suicidal ideation detection in different languages
Preprints 149230 g009
Figure 10. Principal research outcomes per year
Figure 10. Principal research outcomes per year
Preprints 149230 g010
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2025 MDPI (Basel, Switzerland) unless otherwise stated