Submitted:
28 June 2023
Posted:
28 June 2023
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Work
3. Proposed Method
- Acquire a large amount of Japanese text data from Twitter.
- Terms related to child care are selected, and two types of texts (tweets) containing those terms and profiles are collected. We created a set of texts divided into children’s developmental stages.
- Using SVMs, etc (the NB,LR,ANN, XGboost, RF, decision trees and SVM), and the neural language model BERT, we build a classification model that predicts numbers from “0” to “6” from sentences.
3.1. Data collection
3.2. Preprocessing
3.3. Classification Method
3.3.1. SVM classification method
3.3.2. Classification method by BERT
4. Experimental setup and results
4.1. Classification method by BERT
4.1.1. Data used in experiments
4.1.2. Experiment details
4.2. Experimental results
4.2.1. The classification results of the created classifier A.
4.2.2. Results of classifier B using BERT
4.2.3. Results by classifier
- The accuracy rate for Decision Tree was low at 43%, but there was not much difference for the others.
- RF and SVM showed stable and high accuracy.
- F1 had the highest RF at 60%.
- For analysis of Tweet text data classification results,
- Accuracy rate was generally lower than the accuracy rate of Profile. Here, F1 in Decision Tree was the highest at 36%.
5. Conclusions
Acknowledgments
References
- Khodabakhsh, M.; Fani, H.; Zarrinkalam, F.; Bagheri, E. October. In Predicting personal life events from streaming social content. In Proceedings of the 27th ACM international conference on information and knowledge management, Torino, Italy, 22–26 October 2018; pp. 1751–1754. [Google Scholar]
- Saha, K.; Seybolt, J.; Mattingly, S.M.; Aledavood, T.; Konjeti, C.; Martinez, G.J.; Grover, T.; Mark, G.; De Choudhury, M. What Life Events are Disclosed on Social Media, How, When, and By Whom? In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems, Yokohama, Japan, 8–13 May 2021.
- Choudhury, M.D.; Massimi, M. “She said yes!” Liminality and Engagement Announcements on Twitter. In Proceedings of the iConference 2015, Newport Beach, CA, USA; 2015; pp. 1–13. [Google Scholar]
- Burke, M.; Kraut, R. Using Facebook after Losing a Job: Differential Benefits of Strong and Weak Ties. In Proceedings of the 2013 Conference Computer Supported Cooperative Work and Social Computing (CSCW 2013), San Antonio, TX, USA; 2013; pp. 1419–1430. [Google Scholar]
- Yoshida, M.; Kita, K. Mining Numbers in Text: A Survey; IntechOpen, 2021. [Google Scholar]
- Bakalov, A.; Fuxman, A.; Talukdar, P.P.; Chakrabarti, S. SCAD: collective discovery of attribute values. In Proceedings of the 20th international conference on World wide web, Hyderabad, India, 28 March–1 April 2011; pp. 447–456. [Google Scholar]
- Sitaula, C.; Shahi, T.B. Multi-channel CNN to classify nepali covid-19 related tweets using hybrid features. arXiv 2022, arXiv:2203.10286. [Google Scholar]
- Shahi, T.B. and Sitaula, C., Natural language processing for Nepali text: a review. Artificial Intelligence Review 2022, 55, 3401–3429. [Google Scholar] [CrossRef]
- Devlin, J.; Chang, M.W.; Lee, K.; Toutanova, K. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv 2018, arXiv:1810.04805. [Google Scholar]
- Yamamoto, K.; Shimada, K. Acquisition of Periodic Events with Person Attributes. In Proceedings of the 2020 International Conference on Asian Language Processing (IALP), Kuala Lumpur, 4–6 December 2020. [Google Scholar]
- Zhang, X.; Ramachandran, D.; Tenney, I.; Elazar, Y.; Roth, D. Do Language Embeddings capture Scales? arXiv 2020, arXiv:2010.05345. [Google Scholar]

| Search words used to collect text | Search terms used to collect profiles |
|---|---|
| child,0-year-old child One year old,1-year-old, Two years old,2-years old, Three years old,3-years old, Four years old,4-years old, Five years old, 5-years old, Six years old, 6 -years old, zero-year-old | raising children, Dad, Mother, Father, my dad, my mum, raising children, child |
| Old | Part of profile data collection |
|---|---|
| 0 | I am a housewife raising a *-year-old child. |
| 6 | Full support for Hiroshima Cup in Tokyo. Takuya Kimura was my all-time favorite player. Mother of a * -year-old child. She is from Hiroshima Prefecture. She lives in Tokyo. |
| 2 | around four-working mother who is raising *-year-old child. I like music, Hanshin Tigers, Daichi Miura, and animals. |
| 4 | I live with Sunhora and Ayanogo.Adult.I’m quietly doing cosplay. I’m a mom of *-year-old child. RT is too much.\n Archive ID: *52993 |
| 1 | An adventurer who explores cognitive science and sign language with a focus on linguistics.Specializes in cognitive linguistics.Postdoc in R.With *-year-old child. Working mama first grader. http:\/\/ask.fm\/rhetorico |
| 3 | With * year-old daughter,Shinma who is taking care of her family.\nTwitter is still not working. |
| 5 | Daily mumblings of an unfortunate rotten adult who likes anime, manga and sometimes games. Childcare (*Year-old child) Mutters here and there. Currently pregnant with second child. (Scheduled for the second half of October) Gestational diabetes, hospitalization for threatened premature labor, etc. I’m already full_ (:3” ∠) _ |
| Old | Part of original tweet texts |
|---|---|
| 4 | I’m coming Tokyo Disney, and when I saw the Coconut tree*year old boy said, “Mom! This is Hawaii!” and I laughed. |
| 5 | *Year-old boy often shakes his head. Isn’t the ex-nurse mother-in-law a tic ... https:\/\/t.co\/KstYS2lcJ1 |
| 0 | My daughter’s album has stopped until *-year-old 5 months ...that’s bad... After all, I haven’t had time since I returned to work. |
| 3 | @0246monpe in May become *-year-old! In the pitch-black room, I can hear the *year-olds humming (groaning?) (’ー`)\n I’m depressed that it will be crowded, but I’m looking forward to it and I have to take him (^◇^;) |
| 1 | @Alice_ssni*-year-old \n started crawling. he can get over my mother’s body too. \n Even when my mother was lying on her back, I was able to come up to π and drink. \n thank you for the meal. Itadakimasu has a high probability of being done by yourself (when you wearing an apron then understand). |
| 2 | RT @FururiMama98: Parenting is hard and painful. Now, I have a *-year-old daughter, but it’s really hard. It’s really hard to kill myself and keep looking at others and hugging them. Even after chasing, cuteness: frustration = about 1:9. He is also an unbalanced eater, and is always overturned with toys around the house. How can I rest and comfort myself... |
| 6 | RT @unikunmama: 🎉happy birthday unnie🎂\n I turned *-year-old today (o^^o) Let’s have fun together from now on, https:\/\/t.co\/ybW9AcS4oP |
| data | age_0 | age_1 | age_2 | age_3 | age_4 | age_5 | age_6 | total |
|---|---|---|---|---|---|---|---|---|
| Profile | 206 | 572 | 255 | 170 | 80 | 85 | 207 | 1575 |
| Tweet- | 46 | 200 | 212 | 205 | 105 | 129 | 56 | 953 |
| True label | |||
|---|---|---|---|
| positive | negative | ||
| Prediction label by SVM | positive | (A)True positive | (B)False positive |
| negative | (C)False negative | (D)True negative | |








| All class | Results of profile data | Result of the tweet data | ||||||
|---|---|---|---|---|---|---|---|---|
| Classifier | Precision | Recall | F1 | Accuracy | Precision | Recall | F1 | Accuracy |
| NB | 0.62 | 0.54 | 0.57 | 0.54 | 0.28 | 0.24 | 0.24 | 0.24 |
| LR | 0.74 | 0.53 | 0.47 | 0.53 | 0.40 | 0.20 | 0.10 | 0.20 |
| ANN | 0.54 | 0.54 | 0.54 | 0.54 | 0.24 | 0.24 | 0.24 | 0.24 |
| XGBoost | 0.62 | 0.54 | 0.57 | 0.54 | 0.36 | 0.26 | 0.28 | 0.26 |
| RF | 0.79 | 0.56 | 0.60 | 0.56 | 0.46 | 0.28 | 0.32 | 0.28 |
| Decision Tree | 0.61 | 0.43 | 0.49 | 0.43 | 0.88 | 0.26 | 0.36 | 0.26 |
| SVM | 0.70 | 0.57 | 0.55 | 0.57 | 0.35 | 0.31 | 0.26 | 0.31 |
| BERT | 0.70 | 0.61 | 0.63 | 0.61 | 0.34 | 0.31 | 0.31 | 0.31 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).