Sort by
Machine Learning-Based Computer Vision for Depth Camera-Based Physiotherapy Movement Assessment: A Systematic Review
Yafeng Zhou,
Fadilla ’Atyka Nor Rashid,
Marizuana Mat Daud,
Mohammad Kamrul Hasan,
Wangmei Chen
Posted: 12 December 2024
A Multi-Stage Prompt Framework for High-Quality News Summarization with Large Language Models
Salma Ali,
Arthit Wongsawat
News summarization is a critical task in natural language processing (NLP) due to the increasing volume of information available online. Traditional extractive summarization methods often fail to capture the nuanced and contextual nature of news content, leading to a growing interest in using large language models (LLMs) like GPT-4 for more sophisticated, abstractive summarization tasks. However, LLMs face challenges in maintaining factual consistency and accurately reflecting the core content of news articles. This research addresses these challenges by proposing a novel prompt engineering method designed to guide LLMs, specifically GPT-4, in generating high-quality news summaries. Our approach utilizes a multi-stage prompt framework that ensures comprehensive coverage of essential details and incorporates an iterative refinement process to improve summary coherence and relevance. To enhance factual accuracy, we include built-in validation mechanisms using entailment-based metrics and question-answering techniques. Experiments conducted on a newly collected dataset of diverse news articles demonstrate the effectiveness of our approach, showing significant improvements in summary quality, coherence, and factual accuracy
News summarization is a critical task in natural language processing (NLP) due to the increasing volume of information available online. Traditional extractive summarization methods often fail to capture the nuanced and contextual nature of news content, leading to a growing interest in using large language models (LLMs) like GPT-4 for more sophisticated, abstractive summarization tasks. However, LLMs face challenges in maintaining factual consistency and accurately reflecting the core content of news articles. This research addresses these challenges by proposing a novel prompt engineering method designed to guide LLMs, specifically GPT-4, in generating high-quality news summaries. Our approach utilizes a multi-stage prompt framework that ensures comprehensive coverage of essential details and incorporates an iterative refinement process to improve summary coherence and relevance. To enhance factual accuracy, we include built-in validation mechanisms using entailment-based metrics and question-answering techniques. Experiments conducted on a newly collected dataset of diverse news articles demonstrate the effectiveness of our approach, showing significant improvements in summary quality, coherence, and factual accuracy
Posted: 12 December 2024
Tensor Derivative in Curvilinear Coordinates
Sourangshu Ghosh
Posted: 12 December 2024
Weighted Reproducing Kernel Property on Banach Spaces
Saeed Hashemi Sababe,
Nader Biranvand
Weighted Reproducing Kernel Banach Spaces (WRKBS) extend kernel theory by incorporating weights to enhance modeling flexibility. This paper defines WRKBS, explores their theoretical foundations, and demonstrates their effectiveness in regression, classification, and clustering. Numerical experiments validate their advantages in structured data modeling and symmetry-aware learning. Applications span computer vision, physics-based modeling, and graph-based learning, with future directions in scalable algorithms and deep learning integration.
Weighted Reproducing Kernel Banach Spaces (WRKBS) extend kernel theory by incorporating weights to enhance modeling flexibility. This paper defines WRKBS, explores their theoretical foundations, and demonstrates their effectiveness in regression, classification, and clustering. Numerical experiments validate their advantages in structured data modeling and symmetry-aware learning. Applications span computer vision, physics-based modeling, and graph-based learning, with future directions in scalable algorithms and deep learning integration.
Posted: 12 December 2024
Enhancing Construction Safety: A deep-learning based Personal Protective Equipment Detection System
Daniel Oluwatise Owolabi,
Desmond Moru
Posted: 12 December 2024
Cloud-Based License Plate Recognition: A Comparative Approach Using YOLO Versions 5,7, 8 and 9 Object Detection
Christine Bukola Asaju,
Pius Olawale Owolawi,
Chuling Tu,
Etienne Van Wyk
Posted: 12 December 2024
Discretization of Dynamical Systems Based on Observations
Eugene Kagan,
Alexander Novoselsky
Posted: 12 December 2024
You Only Attack Once: Single-step DeepFool Algorithm
Jun Li,
Yanwei Xu,
Yaocun Hu,
Yongyong Ma,
Xin Yin
Posted: 12 December 2024
A Comparative Study of Supervised Machine Learning for Effective Bots Accounts Detection on Kaggle
Daniel Oluwatise Owolabi,
Pius Onobhayedo
Kaggle is an online platform for data scientists, machine learning engineers, and researchers to access datasets, compete in machine learning competitions, collaborate with other data scientists, and develop and showcase their data science skills. Bot accounts can cause a variety of issues, including inflating the popularity of certain content artificially, simulating user activity to affect rankings or ratings, spreading spam, stealing data, or carrying out cyberattacks. Despite Kaggle's prominent focus on data science and its robust community of data scientists, the platform has been notably neglected in terms of addressing the pervasive issue of bot activity within the platform. Recognizing this gap, this study embarks on a comparative investigation of supervised machine learning algorithms tailored for detecting bot accounts effectively within the Kaggle ecosystem. The dataset consists of 799 users, of which 400 were labeled as bots, and 399 were labeled as real users. The study found that the Random Forest classification algorithm had the best evaluation metrics compared to other algorithms used in detecting bots. Feature importance analysis was also conducted to identify the most relevant features in differentiating between bot and real accounts. Overall, the study provides a useful framework for identifying bot accounts on Kaggle, which can be applied in other similar platforms to improve their user verification and security systems.
Kaggle is an online platform for data scientists, machine learning engineers, and researchers to access datasets, compete in machine learning competitions, collaborate with other data scientists, and develop and showcase their data science skills. Bot accounts can cause a variety of issues, including inflating the popularity of certain content artificially, simulating user activity to affect rankings or ratings, spreading spam, stealing data, or carrying out cyberattacks. Despite Kaggle's prominent focus on data science and its robust community of data scientists, the platform has been notably neglected in terms of addressing the pervasive issue of bot activity within the platform. Recognizing this gap, this study embarks on a comparative investigation of supervised machine learning algorithms tailored for detecting bot accounts effectively within the Kaggle ecosystem. The dataset consists of 799 users, of which 400 were labeled as bots, and 399 were labeled as real users. The study found that the Random Forest classification algorithm had the best evaluation metrics compared to other algorithms used in detecting bots. Feature importance analysis was also conducted to identify the most relevant features in differentiating between bot and real accounts. Overall, the study provides a useful framework for identifying bot accounts on Kaggle, which can be applied in other similar platforms to improve their user verification and security systems.
Posted: 12 December 2024
MixCFormer: A CNN-Transformer Hybrid with Mixup Augmentation for Enhanced Finger Vein Attack Detection
Zhaodi Wang,
Shuqiang Yang,
Huafeng Qin,
Yike Liu,
Junqiang Wang
Finger vein recognition has gained significant attention for its importance in enhancing security, safeguarding privacy, and ensuring reliable liveness detection. As a foundation of vein recognition systems, vein detection faces challenges including low feature extraction efficiency, limited robustness, and a heavy reliance on real-world data. Additionally, environmental variability and advancements in spoofing technologies further exacerbate data privacy and security concerns. To address these challenges, this paper proposes MixCFormer, a hybrid CNN-Transformer architecture that incorporates Mixup data augmentation to improve the accuracy of finger vein liveness detection and reduce dependency on large-scale real datasets. First, The MixCFormer model applies baseline drift elimination, morphological filtering, and Butterworth filtering techniques to minimize the impact of background noise and illumination variations, thereby enhancing the clarity and recognizability of vein features. Next, finger vein video data is transformed into feature sequences, optimizing feature extraction and matching efficiency, effectively capturing dynamic time-series information and improving discrimination between live and forged samples. Furthermore, Mixup data augmentation is used to expand sample diversity and decrease dependency on extensive real datasets, thereby enhancing the model’s ability to recognize forged samples across diverse attack scenarios. Finally, the CNN and Transformer architecture leverages both local and global feature extraction capabilities to capture vein feature correlations and dependencies. Residual connections improve feature propagation, enhancing the stability of feature representations in liveness detection. Rigorous experimental evaluations demonstrate that MixCFormer achieves a detection accuracy of 99.51% on finger vein datasets, significantly outperforming existing methods.
Finger vein recognition has gained significant attention for its importance in enhancing security, safeguarding privacy, and ensuring reliable liveness detection. As a foundation of vein recognition systems, vein detection faces challenges including low feature extraction efficiency, limited robustness, and a heavy reliance on real-world data. Additionally, environmental variability and advancements in spoofing technologies further exacerbate data privacy and security concerns. To address these challenges, this paper proposes MixCFormer, a hybrid CNN-Transformer architecture that incorporates Mixup data augmentation to improve the accuracy of finger vein liveness detection and reduce dependency on large-scale real datasets. First, The MixCFormer model applies baseline drift elimination, morphological filtering, and Butterworth filtering techniques to minimize the impact of background noise and illumination variations, thereby enhancing the clarity and recognizability of vein features. Next, finger vein video data is transformed into feature sequences, optimizing feature extraction and matching efficiency, effectively capturing dynamic time-series information and improving discrimination between live and forged samples. Furthermore, Mixup data augmentation is used to expand sample diversity and decrease dependency on extensive real datasets, thereby enhancing the model’s ability to recognize forged samples across diverse attack scenarios. Finally, the CNN and Transformer architecture leverages both local and global feature extraction capabilities to capture vein feature correlations and dependencies. Residual connections improve feature propagation, enhancing the stability of feature representations in liveness detection. Rigorous experimental evaluations demonstrate that MixCFormer achieves a detection accuracy of 99.51% on finger vein datasets, significantly outperforming existing methods.
Posted: 12 December 2024
of 803