Introduction:
The COVID-19 pandemic, caused by the SARS-CoV-2 virus, has led to a global health crisis, affecting millions of individuals worldwide. As of today, over 776 million cases and 7 million deaths have been reported globally, as regularly updated by the World Health Organization (WHO) [
1]. While the majority of individuals recover from COVID-19 without long-term complications, an increasing number continue to experience persistent symptoms, a condition known as Long COVID or Post-Acute Sequelae of SARS-CoV-2 infection (PASC). Long COVID represents a diverse set of symptoms, including cognitive dysfunction (e.g., brain fog), fatigue, respiratory distress, joint pain, and other neurological symptoms, which can significantly impact an individual's quality of life.
The Emergence of Long Covid:
Long COVID has emerged as a significant public health concern, affecting individuals across different demographic groups, including those who had mild or asymptomatic COVID-19 infections. According to data from the Centers for Disease Control and Prevention (CDC) [
2], up to 25.6% of adults who have had COVID-19 report persistent symptoms, which can last for months or even longer. These lingering effects not only burden healthcare systems but also affect economic productivity, as individuals struggle to return to work or engage in daily activities. The mechanisms behind Long COVID are not fully understood, but it is believed to involve a combination of factors such as viral persistence, immune system dysregulation, and prolonged inflammation [
3].
Demographic Factors and Risks:
Several demographic factors play a crucial role in determining the likelihood of developing Long COVID. Age is a significant factor, with older adults (especially those over 65 years) being more susceptible to severe acute outcomes from COVID-19 due to age-related declines in immune function and the presence of comorbidities such as diabetes and cardiovascular diseases [
4]. However, studies indicate that middle-aged adults (35-49 years) are at a heightened risk for Long COVID, showing higher prevalence rates of symptoms compared to both younger and older age groups [
5].
Socioeconomic and Racial Disparities:
Socioeconomic status significantly influences both the likelihood of initial infection and the development of Long COVID. Individuals from lower socioeconomic backgrounds, particularly those in marginalized communities, often face greater exposure to COVID-19 due to factors such as crowded living conditions, inadequate healthcare access, and pre-existing health conditions such as obesity, diabetes, and cardiovascular diseases [
4]. Moreover, racial disparities are evident in the prevalence of Long COVID, with Hispanic adults exhibiting the highest rates of persistent symptoms (8.3%), followed by White and Black adults. Asian adults report the lowest prevalence of Long COVID, although the reasons for this disparity remain unclear and may be influenced by healthcare access, socioeconomic factors, or cultural differences [
5].
Prevalence and Recovery of Long COVID
Recent surveys, such as those from the CDC [
5], have shown that Long COVID symptoms are less common in individuals residing in large metropolitan areas, likely due to better healthcare infrastructure and access to preventative measures like vaccination. The recovery timeline for Long COVID symptoms varies greatly, with some individuals experiencing prolonged symptoms for several months to a year or longer. Cognitive symptoms such as brain fog, characterized by memory issues, difficulty concentrating, and mental fatigue, are particularly prevalent, and these cognitive difficulties often interfere with work and daily activities. Long COVID, particularly in terms of cognitive and neurological deficits, can severely affect an individual's ability to function normally, even after the initial infection has subsided [
3].
Aim of the Study:
This study aims to analyze the demographic influences on the severity and duration (timeline) of SARS-CoV-2 infection. The symptom categories were centered primarily on physiological and cognitive effects. The study seeks to examine how demographic factors, including sex, age, vaccination status, and ethnicity, affect the development of Long COVID in individuals. In addition, this study analyzes the connections between symptoms through Exploratory Factor Analysis (EFA), Pearson Correlation, and other forms of statistical analysis to understand how specific symptoms are correlated.
Ultimately, this study aims to contribute to a greater understanding of Long COVID. It also provides insights into the prevalence of PASC that can shape global public health efforts to address and, eventually, treat PASC.
Methodology:
Figure 1.
Outlines the sequential steps of our research process, including data collection and analysis.
Figure 1.
Outlines the sequential steps of our research process, including data collection and analysis.
Figure 2.
Participants eligibility pipeline. From the flow chart displayed, the eligibility to eliminate the biases associated with pre-existing health conditions.
Figure 2.
Participants eligibility pipeline. From the flow chart displayed, the eligibility to eliminate the biases associated with pre-existing health conditions.
Table 1.
Symptom analysis on 14 symptoms assessed with descriptions providing insights into their relevance to Post-Acute Sequelae of SARS-CoV-2 (PASC). Categorizes symptoms into three domains: physiological, cognitive, and working memory.
Table 1.
Symptom analysis on 14 symptoms assessed with descriptions providing insights into their relevance to Post-Acute Sequelae of SARS-CoV-2 (PASC). Categorizes symptoms into three domains: physiological, cognitive, and working memory.
This study employed a descriptive research design using a survey strategy to gather data on the physiological and cognitive effects of PASC [38, 39]. This method was chosen because it allows for a comprehensive analysis using a larger sample size. Participants were recruited through posted flyers, word of mouth, and online platforms targeting individuals who fit the eligibility criteria. The eligibility criteria required every participant to not have any pre-existing health conditions, were from the ages 18 - 55, and tested positive for Covid-19. The effects of pre-existing health conditions could cause a lot of variability to our results, so to decrease the confounding variables we ensured that every participant didn’t have any preexisting health conditions such as chronic pain, or autoimmune diseases and weren’t pregnant as that also can cause symptoms intervening with Long Covid. For example, when answering questions in our questionnaire about symptom severity, if those with preexisting health conditions participated, we wouldn’t know if their responses were because of COVID-19 or those health conditions. We had to limit the ages of participants to 55 due to natural health decline from old age. If we had many senior citizen responses being compared to younger participants, we can’t determine if the results are simply because of age or because of health decline of those who are older than 55. The online self-reporting survey consisted of 64 questions and was built on the HIPAA-compliant platform, JotForm.
Power analysis was not conducted due to the exploratory nature of this research and the constraints associated with participant recruitment. Future studies with larger samples are planned to validate findings.
All participants consented before the study and were informed about the aim and purpose of it. An internal Institutional Review Board has reviewed and approved the research proposal and the data privacy.
With ethical guidelines, trigger warnings were implemented before the survey sections containing potentially distressing content.
The 64 questions evaluated 14 symptoms and were categorized into physiological, cognitive, and working memory. The physiological [
38] category encompassed breathlessness, cranial nerve dysfunction, cough, anosmia, dysgeusia, fatigue, sleep, and throat sensitivity. The cognitive category contained suicidal thoughts, depression, behavior, stress, working memory, and fibromyalgia. These specific symptoms were included to get a comprehensive view of each category and to investigate symptoms associated with PASC in previous studies [
39] to confirm these findings.
The primary focus of this study is to gain a more in-depth understanding of brain fog [
38]. Our study explored the impact of Long-COVID on working memory, in addition to the psychological impact.
Demographics, including location, age, ethnicity, and vaccination status, were collected through a self-reporting survey to observe patterns between symptoms experienced and an individual's demographic information. Specific eligibility requirements include all of the participants residing in various states across the United States, between the ages of 18 and 55 years old, infected and diagnosed with COVID-19 in the US, have not been pregnant while exposed to COVID-19, and no underlying critical health conditions like cancer or autoimmune diseases.
Table 2.
Demographic characteristics of participants. Data including gender, race, age, and vaccination status were collected for all participants.
Table 2.
Demographic characteristics of participants. Data including gender, race, age, and vaccination status were collected for all participants.
The sampling method used was a mix of random and convenience sampling. Participants’ data was de-identified before participating in the self-reporting survey to protect participant confidentiality.
Preliminary data analysis was initiated upon completion of the pilot study. The pilot study data helped establish the scoring process for the severity and timeline of each symptom experienced by participants. This study compiled many different types of questions including a selection of statements, each describing a different intensity level of the specified symptom. In addition, a scale from one to ten was used to determine the severity of each symptom experienced. Participants were asked to describe their symptoms from feeling it once in a while to feeling it regularly. Selection statements, specifying how many days of the week a participant felt the symptom, were used to determine the timeline value. The severity scale was from 0, never experiencing the symptom, to 3, experiencing intensely. The timeline scale was from 0 (never experiencing the symptom) to 4 (enduring for a prolonged time). If a participant indicated that they never experienced the symptom, a 0 on the severity scale, then the timeline value would automatically be 0 as well. In addition to the scoring scale, averages were calculated for both physiological symptom severity/timeline and cognitive severity/timeline by grouping sub-symptoms and taking the mean. Pearson correlation, a statistical measure used to assess the strength and direction of the linear relationship between two variables, was employed to evaluate the connections among all the symptoms.
Categories containing more than one question, had the scores averaged to calculate the overall severity. For example, the severity of working memory had scores added up from the 3 questions, then divided by three to obtain the mean of the values noted. The pilot study of data analysis led to the usage of automatic scoring in JotForm, eliminating human error in calculations.
Before completing the self-reporting survey, participants filled out a form, determining their eligibility to participate in the study. If one met the criteria for eligibility, one completed the final survey; if not, they were removed from the system. Using this method eliminated the need to delete ineligible participants, removing the possibility of human error. Data scrubbing was conducted to eliminate redundant data, if a participant mistakenly turned in the form twice. Sometimes participants did not answer every question, causing incorrect averages for the severity of each symptom experienced; all their data throughout the self-reporting survey would then have to be scrubbed. This was done to ensure all of the data collected was not through accidental or randomly clicking, and the survey was carefully filtered.
A statistical data analysis was performed using t-test: Paired Two Sample for Means [
46] and ANOVA: Two Factor Without Replication Tests [
47]. Pearson correlations were calculated between each symptom’s severity and timeline and different symptoms. R, a statistical analysis platform, was used to process and analyze the data for a more thorough analysis. EFA was conducted to uncover latent constructs underlying the symptoms of Long COVID. The analysis utilized principal axis factoring with varimax rotation, as the data met sampling adequacy criteria.
Results:
In this study, the results were analyzed through two correlations: the severity of symptoms and the timeline of symptoms. Pearson correlation and correlation matrix heatmaps were performed to analyze and visualize the results. Upon assessing symptom severity, it was determined that the Pearson correlation coefficient (r) between the severity of depression and working memory was 0.73, indicating a strong positive correlation.
Figure 3.
Two Pearson Correlation heat maps representing the strength of correlations between a) severity of symptoms and b) timeline of symptoms. The color of a square signifies the extent to which two symptoms are correlated: dark red signifies a strong positive correlation (r > 0.5), light red signifies a weak correlation (r ≤ 0.5), and white signifies no correlation (r ≈ 0).
Figure 3.
Two Pearson Correlation heat maps representing the strength of correlations between a) severity of symptoms and b) timeline of symptoms. The color of a square signifies the extent to which two symptoms are correlated: dark red signifies a strong positive correlation (r > 0.5), light red signifies a weak correlation (r ≤ 0.5), and white signifies no correlation (r ≈ 0).
Table 3.
Severity and Timeline mean and standard deviation for each symptom.
Table 3.
Severity and Timeline mean and standard deviation for each symptom.
Sleep was the most affected symptom of COVID-19 in the study, with the mean severity being 1.155 on a scale of 0-3. In correspondence with the highest severity–sleep—the highest timeline was also sleep, with a mean of 2-4 weeks. Throughout the data, sleep had the highest severity and timeline, with 68% of participants dealing with sleep changes, demonstrating that most individuals affected by COVID-19 faced disturbances or changes in their sleep patterns.
Unlike sleep, the factor, of suicidal thoughts, had the lowest severity found in this study, with a mean of 0.06 on the same scale of 0-3. Similar to the lowest severity for suicidal thoughts, the lowest timeline of the study was also represented by this factor with the mean being 0-1 weeks. Approximately 0.04% of participants in our study dealt with suicidal thoughts during their time affected by COVID-19, revealing that it was one of the least common symptoms of the virus.
The physiological severity for both males and females encompasses eight symptom categories. Upon examination, the median and distribution were not statistically different. The median value for physiological severity of both males and females was 0.47. However, for the physiological timeline, biological females exhibited a much greater standard deviation (0.72) than males (0.52), despite a similar median value. The median value for physiological severity in males and females was 0.5.
Females had a median score of 0.267 for mental severity, compared to 0.033 for males [Refer to Supplemental Figure 18]. The standard deviation for females was also greater (0.41 vs. 0.28 for males). Regarding the mental timeline, the difference was even more pronounced. The median score for females was 0.4 and 0.1 for males [Refer to Supplemental Figure 18]. Similarly, the standard deviation for the mental timeline score for females was 0.91 as opposed to 0.47 for males.
For stress severity (
Figure 4a), vaccinated individuals showed a considerably lower distribution of severity scores, with a median score significantly lower than that of unvaccinated individuals. The unvaccinated group had a wider distribution as well, indicating greater variability and high central tendency, suggesting that unvaccinated individuals were more likely to experience higher stress levels post-COVID-19 infection. The violin plot for sleep severity [
Figure 4b] reveals a broader range of scores and a higher median for unvaccinated individuals, while the vaccinated sample has a more condensed distribution.
Figure 4.
Violin Plots the distributions of individual symptom severities for Stress, Sleep, Behavior, Breathlessness, Throat Sensitivity, and Depression for vaccinated and unvaccinated individuals.
Figure 4.
Violin Plots the distributions of individual symptom severities for Stress, Sleep, Behavior, Breathlessness, Throat Sensitivity, and Depression for vaccinated and unvaccinated individuals.
In Behavior Severity [
Figure 4c], the vaccinated group exhibited lower severity and less variability than the unvaccinated group, implying lower behavioral disturbances due to vaccination. This same trend is mirrored for Throat Sensitivity Severity [
Figure 4e], but not for Breathlessness [
Figure 4d], where the median and distribution are much more similar, with the vaccinated group having a larger range.
Lastly, depression severity [
Figure 4f] was significantly lower among the vaccinated group, with a narrower distribution of scores centered on a lower median severity score. Unvaccinated individuals exhibited a wider and more variable distribution, showing a higher tendency towards more severe depressive symptoms post-COVID-19 infection.
Analysis of the data presented in
Figure 4 indicates that vaccinated individuals generally report lower severity scores with more consistent distributions compared to unvaccinated individuals. This trend is seen for both physical symptoms, such as sleep and throat sensitivity, and cognitive symptoms, such as stress and depression.
The analysis also included investigating the differences between COVID-19 booster shot status (no booster, one, two, and three boosters) and symptom severity, specifically Depression, Behavior, Stress, and Chronic Pain.
For Depression Severity, those with no boosters had a mean score of 0.357, while those with three boosters scored 0 for both severity and timeline. For Behavior, the no-booster group had a mean severity score of 0.571, while the three-booster group scored 0.111. Stress Timeline averaged 1.143 for no boosters and 0.667 for three boosters.
The two-booster group showed higher scores than the one-booster group for Depression Timeline, Behavior Severity, Behavior Timeline, Stress Severity, and Chronic Pain Severity, indicating worsened symptoms after the third booster. However, only 8% of participants received three boosters, with 3 participants contracting COVID-19 after the third booster.
Overall, severity scores decreased from 0.319 for no boosters to 0.1629 for three boosters, and timeline scores decreased from 0.714 to 0.444, suggesting a downward trend with increasing boosters.
Table 4.
Shows the distribution of when participants were infected relative to vaccination status.
Table 4.
Shows the distribution of when participants were infected relative to vaccination status.
The majority of participants (67.3%) were infected with COVID-19 after they received at least their 2nd dose of the vaccine. Of the remaining 32.7% who were infected prior to their second dose, 1.8% were infected after their first dose of a single-dose vaccine. Male participants had a higher likelihood of being infected prior to completing their first complete course of vaccination (2 doses for double-dose vaccine, 1 dose for single-dose vaccine), with 35% of males being infected before a complete course compared to 29% of females being infected before a complete course of vaccination. However, the variability in vaccination efficacy between sexes may be a result of the sample. The likelihood of participants getting infected after the 2nd booster shot was significantly lower than in the other categories, likely because of a lower rate of vaccination for the second booster (45.5%) and the timing of the second booster release.
Figure 5.
Exploratory Factor Analysis for Timeline of Symptoms.
Figure 5.
Exploratory Factor Analysis for Timeline of Symptoms.
EFA identified four factors for symptom severity: respiratory and fatigue symptoms (e.g., cough, breathlessness), psychological and behavioral disturbance (e.g., brain fog, stress), sensory dysfunction (e.g., dysgeusia, anosmia), and depression (e.g., depression, suicidal thoughts).
We classified Factor 1 as respiratory and fatigue symptoms, which include symptoms such as cough (loading = 0.9), sleep disturbances (loading = 0.7), throat sensitivity (loading = 0.6), fatigue (loading = 0.5), and breathlessness (loading = 0.4), reflects a group of symptoms associated with both respiratory and fatigue-related problems. The high loading value of cough indicates that it is a key symptom within this factor, which aligns with the known respiratory effects of Long-COVID. This reveals that patients in this cluster disproportionately suffer from impacts on breathing and energy levels.
The psychological impact of Long COVID is highlighted through the second factor, psychological and behavioral disturbance, characterized by behavior (loading = 1.0) and stress (loading = 0.8). This factor shows the significant mental strain, impacting patients’ behavior and mental health. The loading of 1.0 for behavior shows a strong correlation with the overall factor, suggesting that Long COVID patients have profound behavior disruptions.
Factor 3, Sensory Dysfunction, included the timeline for the three symptoms of Dysgeusia (loading = 0.8), Anosmia (loading = 0.7), and Cranial Nerve Dysfunction (loading = 0.4). All of these symptoms are sensory nerve-related conditions, and the high loading values for Dysgeusia and Anosmia and their high correlation value indicate these symptoms are closely tied together.
The fourth factor that we categorized the symptoms into was Depression, which encompasses depression (loading = 0.9) and suicidal thoughts (loading = 0.4), which also exhibited a high correlation in early analysis. The significant loading for depression emphasizes the serious mental implications of Long-COVID, suggesting these patients may benefit from comprehensive neuropsychiatric evaluation and support.
Figure 6.
Exploratory Factor Analysis for Severity of Symptoms.
Figure 6.
Exploratory Factor Analysis for Severity of Symptoms.
EFA identified four factors for symptom duration: psychological and cognitive (e.g., stress, working memory), respiratory and fatigue (e.g., cough, sleep), sensory dysfunction (e.g., dysgeusia, anosmia), and depression (e.g., depression, suicidal thoughts). These categories, for the most part, aligned with the factors constructed for symptom severity, reinforcing the strong correlation between the severity and duration of the symptom experienced.
We classified factor 1 as Psychological and Cognitive Strain, which is characterized by symptoms of behavioral severity (loading = 0.9), stress (loading = 0.8), and working memory (loading = 0.6). This grouping contrasts with the timeline analysis, which categorized behavioral and psychological symptoms separately from cognitive impairments. The severity analysis indicates that elevated stress and behavioral disruptions are closely linked to cognitive strain, suggesting that mental health challenges may worsen cognitive symptoms in patients with Long-COVID.
Factor 2, Respiratory and Fatigue Severity, includes cough (loading = 0.8), throat sensitivity (loading = 0.6), breathlessness (loading = 0.5), fatigue (loading = 0.4), sleep issues (loading = 0.4), and cranial nerve dysfunction (loading = 0.4). This factor is aligned with the respiratory cluster found in the timeline analysis; however, when taking severity into account, cough stands out as a particularly significant symptom due to its high loading. The lengthy duration of respiratory symptoms highlights the substantial physical burden these symptoms place on patients.
Factor 3, labeled as Sensory Dysfunction, includes Dysgeusia (loading = 1) and Anosmia (loading = 0.7), indicating alterations in taste and smell perception in Long-COVID patients. These sensory impairments can impact patients' quality of life, potentially worsening psychological distress and complicating recovery efforts.
Lastly, Factor 4, labeled as Depression, consists of depression (loading = 0.7), and suicidal thoughts (loading = 0.6). Although suicidal thoughts indicate a major area of worry, their slightly lower loading indicates that depression is a core component of this factor. This connection calls for a more thorough investigation of the ways in which clinical populations may experience depression, especially in light of other illnesses.
Conclusion
This study aimed to investigate the prevalence, severity, and demographic influences on symptoms associated with Long COVID (PASC).
The exploratory factor analysis identified four primary symptom clusters for Long COVID, both for the symptom severity and timeline: (1) Respiratory and Fatigue Symptoms (e.g., cough, fatigue, breathlessness), including significant respiratory impacts; (2) Psychological and Behavioral Disturbance (e.g., stress, behavioral changes), which highlighted the mental strain of Long COVID; (3) Sensory Dysfunction (e.g., dysgeusia, anosmia), indicating that neurological impacts on sensory pathways are still prevalent post-infection; and (4) Depression (e.g., depression, suicidal ideation), which highlighted neuropsychiatric effects. These findings demonstrate the multifaceted impact of Long COVID and underscore the need for multidisciplinary management strategies addressing physical, psychological, and neurological health.
The findings indicate that sleep disturbances were the most prevalent symptom, with a mean severity score of 1.15 and a duration of 2–4 weeks. These conclusions align with previous studies (e.g., Ayigebusi et al., 2021), which highlight sleep disruptions as a common long-term consequence of COVID-19. On the other hand, suicidal thoughts were the least reported symptom, with a mean severity of 0.06, suggesting it is less commonly associated with Long COVID in the studied population.
Demographic factors played a significant role in the severity of the symptoms experienced. Female participants consistently reported higher severity scores for both physiological and cognitive symptoms compared to males, corroborating findings by Bai et al. (2021) that women tend to experience more severe post-infection symptoms. Age also influenced symptom reporting, with adults aged 35–49 demonstrating a higher likelihood of experiencing Long COVID, which may reflect age-related immune responses or occupational exposure risks.
As hypothesized, vaccination was a critical variable, with vaccinated participants reporting lower severity scores across symptoms such as stress, sleep disturbances, and depressive symptoms. This finding highlights the importance of vaccination in mitigating the long-term effects of COVID-19, although the presence of breakthrough infections highlights the need for continued surveillance and tailored interventions.
Overall, this study contributes to the growing literature on COVID-19, identifying demographic disparities and emphasizing Long COVID's complex, multifactorial nature. However, limitations, such as the overrepresentation of female and Asian-American participants and reliance on self-reported data, suggest caution in generalizing the findings to the broader population. Future research should utilize a larger, more diverse sample and explore the influence of SARS-CoV-2 variants and socioeconomic factors on symptom severity and duration. The findings of this study could inform public health strategies and underscore the importance of tailored interventions for Long COVID based on demographic factors and symptom severity.