Preprint Article Version 1 Preserved in Portico This version is not peer-reviewed

Addressing the Selection Bias in Voice Assistance: Training Voice Assistance Model in Python with Equal Data Selection

Version 1 : Received: 26 December 2022 / Approved: 30 December 2022 / Online: 30 December 2022 (02:12:34 CET)

How to cite: Piya, K.; Shrestha, S.; Frank, C.; Jebessa, E.; Khan Mohd, T. Addressing the Selection Bias in Voice Assistance: Training Voice Assistance Model in Python with Equal Data Selection. Preprints 2022, 2022120567. https://doi.org/10.20944/preprints202212.0567.v1 Piya, K.; Shrestha, S.; Frank, C.; Jebessa, E.; Khan Mohd, T. Addressing the Selection Bias in Voice Assistance: Training Voice Assistance Model in Python with Equal Data Selection. Preprints 2022, 2022120567. https://doi.org/10.20944/preprints202212.0567.v1

Abstract

In recent times, voice assistants have become a part of our day-to-day lives, allowing information retrieval by voice synthesis, voice recognition, and natural language processing. These voice assistants can be found in many modern-day devices such as Apple, Amazon, Google, and Samsung. This project is primarily focused on Virtual Assistance in Natural Language Processing. Natural Language Processing is a form of AI that helps machines understand people and create feedback loops. This project will use deep learning to create a Voice Recognizer and use Commonvoice and data collected from the local community for model training using Google Colaboratory. After recognizing a command, the AI assistant will be able to perform the most suitable actions and then give a response. The motivation for this project comes from the race and gender bias that exists in many virtual assistants. The computer industry is primarily dominated by the male gender, and because of this, many of the products produced do not regard women. This bias has an impact on natural language processing. This project will be utilizing various open-source projects to implement machine learning algorithms and train the assistant algorithm to recognize different types of voices, accents, and dialects. Through this project, the goal to use voice data from underrepresented groups to build a voice assistant that can recognize voices regardless of gender, race, or accent. Increasing the representation of women in the computer industry is important for the future of the industry. By representing women in the initial study of voice assistants, it can be shown that females play a vital role in the development of this technology. In line with related work, this project will use first-hand data from the college population and middle-aged adults to train voice assistants to combat gender bias.

Keywords

Voice Assistance, Machine Learning, Virtual Assistance, Artificial Intelligence, Selection Bias, Sample Population, Python 3.10, Pyttsx3, PyTorch, JSON

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.