Submitted:
01 March 2024
Posted:
01 March 2024
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Related Work
2.1. Fishing Tourism
- the performance of the daily fishing process, accompanied by an explanation of the process to passengers
- encouraging the active, safe participation of visitors in the whole process of fishing and with the opportunity to be engaged at marine sport activities
- informing tourists about the fishing activity and fishing tradition
- visits to beaches, underwater caves, and boat trips
- possibility of diving for fishing and observation of marine flora and fauna
- contact with local flavors and traditional cooking of the catch
- on-site tasting and sale of traditional fishing products
- overnight accommodation and catering services in fishermen’s houses or other “fisherman’s style” establishments
- Have an overall length of up to 15 meters.
- They must be equipped with a professional fishing license for fishing with gear other than bottom trawls with nets and boat-drawn gillnets.
- They must meet the requirements of professional tourist vessels under the relevant laws.
- Carry up to 12 passengers.
- Be equipped with a certificate of seaworthiness stating the number of passengers they can carry, the extent of the voyages, and the relevant “Orders - Instructions” without requiring the issue of special or other certificates.
- There shall be a special waiting area for all passengers to be safely accommodated during fishing operations without obstructing them.
- Comply with the rules laid down by the legislation in force at the time concerning the safety of navigation, manning, hygiene, and the suitability of the fishing vessel for the embarkation of passengers.
2.2. The Experience Realm
- 1.
- Entertainment: Usually, this experience is passively gained where the viewer is not directly involved in the “performance” of the entertainment. e.g., participation in the theatre, cinema, concerts, parades, nightclubs, etc. carnivals and carnivals, and folklore festivals as spectators.
- 2.
- Education: This type is the result of active participation and absorption of the material element that a person has been exposed to. The presentation by speech at a conference of thematic modules that are simultaneously a professional e.g., a doctor, can be considered an experience of this module classification.
- 3.
- Aesthetics: The category of this type of experience is based on both exciting and passive enjoyment. A classic example of this type of experience is the understanding and inner search for the stimuli evoked by a series of specific themes of artistic works that are exhibited in a gallery or in an exhibition of unique exhibits in a museum.
- 4.
- Escape: This type exists when you are immersed in an activity that is actively engaged in by stakeholders who are transported into a new state of experience, e.g., role-playing to enhance relationship building in a working group that as partners take on the role of role models playing the role of experts to solve a crisis problem, e.g., due to an epidemic.
- Active (act) and Passive (accept) participation, respectively, in the first axis and
- Basic adaptation/absorption and Total immersion, respectively, in the second axis
2.3. Natural Language Processing and the Tourism Domain
2.4. User Profiling in Tourism-Related Platforms
3. Materials and Methods
3.1. Pipeline Overview
3.2. Dataset Collection
3.3. Dataset Annotation
4. Results
4.1. Extracting Sentiment, Emotion, and Descriptive Insights of Tourists and Businesses from Linguistic Cues
4.1.1. Linguistics Insights
- Tourists went on boat trips mostly for fishing and actively participated in the process (fishing experience, caught fish, etc.).
- The overall experience of fishing boat trips is highly recommended as tourists mention that they had a “great time” and “fantastic day”.
- Tourists appreciate the beauty of the natural environment by leaving positive comments about the “crystal waters”, the sea, the fresh fish, etc.
- Tourists highlight the hospitality and the skills of the crew and business owners.
- Tourists overall put emphasis on commenting on the offered services.
- Food and restaurants are at the top of tourists’ attention.
- Tourists often make positive comments about services, with phrases such as “really good”, “really nice”, “well worth”, “great food”, etc.
- Tourists in practice write reviews for businesses in order to recommend or not services and experiences.
- Reviewer Badges: These badges are graded starting from the “New Reviewer” (1 review) to the “Top Contributor” (more than 50 reviews). Figure 6a shows the distribution of tourists who reviewed fishing tourism business based on their “Reviewer Badges”. Interestingly, approximately half the users belong to the “New Reviewer” category, meaning that they only joined the TripAdvisor platform to positively review the respective fishing tourism business.
- Expertise Badges: These badges showcase the unique knowledge of the users. For example, if a user publishes multiple reviews in a single category – hotels, restaurants, or attractions - they will be assigned the respective “Expertise Badge”. Figure 6b shows the distribution of tourists who reviewed the fishing tourism business based on their “Expertise Badges”. We notice that the users who review fishing tourism businesses also tend to review hotels and attractions, as well as luxury and boutique hotels, and B&B and Inns at a smaller scale.
- Passport Badge: This badge recognizes users for being world travelers. Once they have added reviews for places in at least two destinations, they start collecting such graded badges. Figure 7 shows that most users who have reviewed fishing tourism businesses have only reviewed destinations in limited locations. This is not surprising as almost half of users are “New Reviewers” as mentioned previously.
- Explorer Badge: This badge is assigned to users who are amongst the first to review a hotel, restaurant, or attraction in a given language. Our results indicate that 1 out of 3 users who have reviewed a fishing tourism business own this badge, meaning that they are trailblazers in the tourism domain, seeking out-of-the-beaten-path experiences.
4.1.2. Sentiment Analysis and Emotion Extraction
- Insights for tourists. The distribution of the overall sentiments found in user reviews in our corpus, reveals that there is a tendency to express neutral to positive comments about tourist venues and experiences. A surprising finding is that even for lower ratings, the detected sentiment polarity is rather positive than negative (see Figure 8a).
- Insights for businesses. The distribution of the sentiments found in reviews for fishing businesses, as shown in Figure 9, reflects the overall satisfaction of customers with the provided services. We expected a positive sentiment after the aforementioned linguistic insights and the high ratings fishing tourism businesses received.
- Insights for general tourist reviews. We collected a set of 11,861 reviews from the profiles of users in our dataset. Given that a percentage of 90.8% of these reviews are written in English, we found 30,649 occurrences of affective concepts based on WordNet affect (see Table 2 for distributions). Emotion analysis shows that users are more likely to express positive emotions in their reviews, specifically surprise and joy, while negative emotions are expressed less frequently.
- Insights for fishing tourism businesses. Concerning the affective concepts found in users’ reviews for fishing businesses, based on the linguistic and sentiment analysis preceded, we expect a high percentage of positive emotions in the total of 3,506 emotion terms found (see Table 3 for distributions). Indeed, positive emotions and surprise dominate, indicating customer satisfaction above expectation (prevalence of surprise emotion).
4.1.3. Topic Detection
4.2. User Profiling Aspects: Gender, Age and Marital Status
4.2.1. Gender Classification
- Gender estimation based on name: We train a naive Bayes model in order to get an estimation of each user’s gender based on their username according to names lexicons with female and male names. This is assumed a powerful feature for gender detection.
- Sentiment score: The aggregated score of reviews sentiment.
- Syntax features: Number of part of speech tags (Adjectives, Nouns, Verbs, and Adverbs).
- Language vectors: The frequency vectors of words used by each user. We constructed TF-IDF and n-grams vectors reflecting the importance of words and phrases in a collection of documents based on preprocessed textual data.
4.2.2. Age Classifier
- Structure features: Refer to the structural use of language. These features include the number of words in each review, the number of characters, the number of words in a sentence, and the number of exclamatories.
- Syntax features: Refer to the number of parts of speech tags (Adjectives, Nouns, Verbs, and Adverbs).
- Sentiment score: The aggregated score of reviews sentiment.
- Readability features: Refer to the level of the text complexity. We included: a) Flesch reading ease, indicating how easy is a text to read, b) Smog index, estimating the years of education needed to understand a piece of writing, c) Flesch–Kincaid grade, indicating the average student in that grade level can read the text, d) Coleman Liau index, gauging the understandability of a text, e) automated readability index, assessing the understandability of a text, f) Dale Chall readability score, providing a numeric gauge of the comprehension difficulty that readers come upon when reading a text, g) difficult words, indicating how many difficult words used in a text, h) gunning fog, estimating the years of formal education a person needs to understand the text on the first reading.
- Language vectors: The frequency vectors of words used by each user. We constructed TF-IDF and n-grams vectors reflecting the importance of words and phrases in a collection of documents based on preprocessed textual data.
4.2.3. Marital Status Detection
4.3. The 4Es of Experience Economy
5. Discussion & Conclusions
- Cultural Exchange: Fishing tourism can facilitate cultural exchange between tourists and local fishing communities, leading to mutual understanding and preservation of traditional practices. This implies the importance of promoting respectful interactions and cultural sensitivity.
- Community Livelihoods: The theoretical implications involve examining how fishing tourism affects the livelihoods and well-being of local communities. Sustainable fishing tourism can enhance income diversification and improve quality of life.
- Tourism Management: Theoretical discussions revolve around determining the carrying capacity of fishing tourism destinations to ensure that environmental and social impacts are kept within sustainable limits.
- Stakeholder Engagement: Effective stakeholder engagement is crucial for managing fishing tourism. The implications include the need for collaboration among governments, local communities, tour operators, and conservation organizations.
- Environmental Ethics: Theoretical implications extend to ethical discussions about catch-and-release practices, the welfare of targeted fish species, and the broader ecological consequences of fishing tourism.
- Conservation and Research: Fishing tourism can provide opportunities for scientific research, such as studying fish populations, migration patterns, and ecosystem dynamics. Theoretical implications emphasize the role of fishing tourism in advancing marine conservation efforts.
- Education and Outreach: Fishing tourism can serve as a platform for educating tourists and the public about the importance of marine conservation, fostering a sense of responsibility and support for preserving aquatic ecosystems.
- Climate Change Adaptation: Theoretical implications may explore how fishing tourism destinations need to adapt to changing climate conditions, such as shifts in fish distribution and abundance, and how these changes could affect tourism experiences and local economies.
Author Contributions
Funding
Institutional Review Board Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| API | Application Programming Interface |
| DUTH | Democritus University of Thrace |
| HCMR | Hellenic Centre for Marine Research |
| HTML | HyperText Markup Language |
| LDA | Latent Dirichlet Allocation |
| NLP | Natural Language Processing |
| POS | Part Of Speech |
| SMOTE | Synthetic Minority Oversampling Technique |
| TF-IDF | Term Frequency - Inverse Document Frequency |
| XML | Extensible Markup Language |
References
- Jafari, J. Encyclopedia of tourism; Routledge, 2002.
- Centre for the Promotion of Imports from developing countries. The European market potential for sports tourism. Ministry of Foreign Affairs of the Netherlands 2021. [Google Scholar]
- Pine, B.J.; Pine, J.; Gilmore, J.H. The experience economy: work is theatre & every business a stage. Harvard Business Press, 1999. [Google Scholar]
- Ali-Knight, J. The role of niche tourism products in destination development. PhD thesis, Edinburgh Napier University, 2010. [Google Scholar]
- Holbrook, M.B.; Hirschman, E.C. The experiential aspects of consumption: Consumer fantasies, feelings, and fun. Journal of consumer research 1982, 9, 132–140. [Google Scholar] [CrossRef]
- Pine, B.J.; Gilmore, J.H.; et al. Welcome to the experience economy; Vol. 76, Harvard Business Review Press Cambridge, MA, USA, 1998.
- Pikkemaat, B.; Weiermair, K. The aesthetic (design) orientated customer in tourism-implications for product development. In Proceedings of the EIASM 10th international product development management conference. EIASM Brussels, 2003, Vol. 825; p. 839.
- Menner, T.; Höpken, W.; Fuchs, M.; Lexhagen, M. Topic detection: identifying relevant topics in tourism reviews. In Information and Communication Technologies in Tourism 2016; Springer, 2016; pp. 411–423. [Google Scholar]
- Rossetti, M.; Stella, F.; Zanker, M. Analyzing user reviews in tourism with topic models. Information Technology & Tourism 2016, 16, 5–21. [Google Scholar]
- Afzaal, M.; Usman, M.; Fong, A.C.; Fong, S. Multiaspect-based opinion classification model for tourist reviews. Expert Systems 2019, 36, e12371. [Google Scholar] [CrossRef]
- Afzaal, M.; Usman, M.; Fong, A. Predictive aspect-based sentiment classification of online tourist reviews. Journal of Information Science 2019, 45, 341–363. [Google Scholar] [CrossRef]
- Yu, C.; Zhu, X.; Feng, B.; Cai, L.; An, L. Sentiment Analysis of Japanese Tourism Online Reviews. Journal of Data and Information Science 2019, 4, 89–113. [Google Scholar] [CrossRef]
- Marrese-Taylor, E.; Velásquez, J.D.; Bravo-Marquez, F. A novel deterministic approach for aspect-based opinion mining in tourism products reviews. Expert Systems with Applications 2014, 41, 7764–7775. [Google Scholar] [CrossRef]
- Kirilenko, A.P.; Stepchenkova, S.O.; Kim, H.; Li, X. Automated sentiment analysis in tourism: Comparison of approaches. Journal of Travel Research 2018, 57, 1012–1025. [Google Scholar] [CrossRef]
- Fang, B.; Ye, Q.; Kucukusta, D.; Law, R. Analysis of the perceived value of online tourism reviews: Influence of readability and reviewer characteristics. Tourism Management 2016, 52, 498–506. [Google Scholar] [CrossRef]
- Shin, S.; Du, Q.; Xiang, Z. What’s vs. how’s in online hotel reviews: Comparing information value of content and writing style with machine learning. In Information and Communication Technologies in Tourism 2019; Springer, 2019; pp. 321–332. [Google Scholar]
- Leal, F.; González-Vélez, H.; Malheiro, B.; Burguillo, J.C. Semantic profiling and destination recommendation based on crowd-sourced tourist reviews. In Proceedings of the International Symposium on Distributed Computing and Artificial Intelligence. Springer; 2017; pp. 140–147. [Google Scholar]
- Guerreiro, J.; Rita, P. How to predict explicit recommendations in online reviews using text mining and sentiment analysis. Journal of Hospitality and Tourism Management 2020, 43, 269–272. [Google Scholar] [CrossRef]
- Dong, X.; Li, T.; Song, R.; Ding, Z. Profiling users via their reviews: an extended systematic mapping study. Software and Systems Modeling 2020, pp. 1–21.
- Madke, N.B.; Kulkarni, M.Y.; Mule, M.R.; Lakade, M.A.; Kulkarni, M.S. User Profile Based Behavior Identificaton Using Data Mining Technique 2018.
- Farnadi, G.; Tang, J.; De Cock, M.; Moens, M.F. User profiling through deep multimodal fusion. In Proceedings of the Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 2018, ; pp. 171–179.
- Alkan, O.; Daly, E. User Profiling from Reviews for Accurate Time-Based Recommendations. arXiv preprint arXiv:2006.08805 2020.
- Kavitha, S.; Jobi, V.; Rajeswari, S. Tourism recommendation using social media profiles. In Artificial Intelligence and Evolutionary Computations in Engineering Systems; Springer, 2017; pp. 243–253.
- Missaoui, S.; Viviani, M.; Faiz, R.; Pasi, G. A language modeling approach for the recommendation of tourism-related services. In Proceedings of the Proceedings of the Symposium on Applied Computing, 2017,; pp. 1697–1700.
- Fazzolari, M.; Petrocchi, M. A study on online travel reviews through intelligent data analysis. Information Technology & Tourism 2018, 20, 37–58. [Google Scholar]
- Jivani, A.G.; et al. A comparative study of stemming algorithms. Int. J. Comp. Tech. Appl 2011, 2, 1930–1938. [Google Scholar]
- Ekman, P.; Sorenson, E.R.; Friesen, W.V. Pan-cultural elements in facial displays of emotion. Science 1969, 164, 86–88. [Google Scholar] [CrossRef]
- Loria, S. textblob Documentation. Release 0.15 2018, 2. [Google Scholar]
- Likas, A.; Vlassis, N.; Verbeek, J.J. The global k-means clustering algorithm. Pattern recognition 2003, 36, 451–461. [Google Scholar] [CrossRef]
- Ramage, D.; Hall, D.; Nallapati, R.; Manning, C.D. Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora. In Proceedings of the Proceedings of the 2009 conference on empirical methods in natural language processing, 2009,; pp. 248–256.
- Kelaiaia, A.; Merouani, H.F. Clustering with probabilistic topic models on arabic texts: a comparative study of LDA and K-means. Int. Arab J. Inf. Technol. 2016, 13, 332–338. [Google Scholar]
- Raschka, S.; Mirjalili, V. Python machine learning second edition, 2017.
- Marquardt, J.; Farnadi, G.; Vasudevan, G.; Moens, M.F.; Davalos, S.; Teredesai, A.; De Cock, M. Age and gender identification in social media. Proceedings of CLEF 2014 Evaluation Labs 2014, 1180, 1129–1136. [Google Scholar]
- Weren, E.R.; Kauer, A.U.; Mizusaki, L.; Moreira, V.P.; de Oliveira, J.P.M.; Wives, L.K. Examining multiple features for author profiling. Journal of information and data management 2014, 5, 266–266. [Google Scholar]
- Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: synthetic minority over-sampling technique. Journal of artificial intelligence research 2002, 16, 321–357. [Google Scholar] [CrossRef]
- Yfantidou, G.; Matarazzo, M. The future of sustainable tourism in developing countries. Sustainable development 2017, 25, 459–466. [Google Scholar] [CrossRef]
- Falcó, C.; Moeller, H.V. Optimal spatial management in a multiuse marine habitat: Balancing fisheries and tourism. Natural Resource Modeling 2022, 35, e12309. [Google Scholar] [CrossRef]
| 1 | |
| 2 | |
| 3 | |
| 4 |



















| Rating | Counts-percentage | avg #words |
|---|---|---|
| 1 | 6087 (51.3%) | 82.3 |
| 2 | 15 (0.1%) | 59.1 |
| 3 | 46 (0.3%) | 118.4 |
| 4 | 4795 (40.4%) | 95.0 |
| 5 | 919 (7.0%) | 81.8 |
| Emotion | Percentage (counts) |
|---|---|
| positive emotion | 56.5% (17,322) |
| negative emotion | 4.8% (1,482) |
| other emotion | 5.8% (1,798) |
| joy | 7.3% (2,239) |
| surprise | 20.4% (6,255) |
| anger | 0.05% (176) |
| disgust | 0.01% (32) |
| fear | 1.1% (345) |
| sadness | 3.2% (993) |
| Emotion | Percentage (counts) |
|---|---|
| positive emotion | 53.4% (1,875) |
| negative emotion | 3.6% (128) |
| other emotion | 5.0% (178) |
| joy | 4.5% (161) |
| surprise | 30.2% (1,061) |
| anger | 0.1% (6) |
| disgust | 0.2% (8) |
| fear | 0.5% (21) |
| sadness | 1.9% (68) |
| Classifier | Training error | Test error | |
|---|---|---|---|
| lr | acc. | 0.84 | 0.73 |
| f1 | 0.83 | 0.72 | |
| rf | acc. | 1.0 | 0.71 |
| f1 | 1.0 | 0.68 | |
| sgd | acc. | 0.40 | 0.42 |
| f1 | 0.23 | 0.25 |
| Classifier | Training error | Test error | |
|---|---|---|---|
| lr | acc. | 0.47 | 0.46 |
| f1 | 0.42 | 0.42 | |
| rf | acc. | 1.0 | 0.62 |
| f1 | 1.0 | 0.59 | |
| sgd | acc. | 0.44 | 0.46 |
| f1 | 0.32 | 0.34 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).