Preprint Article Version 2 Preserved in Portico This version is not peer-reviewed

Plan for Constructing DataDiscoveryLab: Creating DataBases for Well-Rounded Searches

Version 1 : Received: 20 April 2023 / Approved: 27 April 2023 / Online: 27 April 2023 (10:36:36 CEST)
Version 2 : Received: 27 April 2023 / Approved: 2 May 2023 / Online: 2 May 2023 (04:13:23 CEST)

How to cite: Keskinoglu, E. Plan for Constructing DataDiscoveryLab: Creating DataBases for Well-Rounded Searches. Preprints 2023, 2023041074. https://doi.org/10.20944/preprints202304.1074.v2 Keskinoglu, E. Plan for Constructing DataDiscoveryLab: Creating DataBases for Well-Rounded Searches. Preprints 2023, 2023041074. https://doi.org/10.20944/preprints202304.1074.v2

Abstract

The abundance of information in academic articles, reports, and studies can make it challenging for researchers to gain insights from the existing literature. To address this issue, there is a growing demand for tools that can help researchers effectively parse and analyze large volumes of data. One such tool is DataDiscoveryLab, a software system that utilizes computer vision algorithms and NLP techniques to parse academic articles into text and figures, creating three separate databases. These databases allow researchers to quickly identify articles that may be relevant to their research questions, gain a deeper understanding of the research presented, and analyze visual data. The integration of article mining and computer vision in the DataDiscoveryLab software system provides researchers with a powerful tool for navigating the vast amount of scientific literature available today. Yet, as we will discuss in the latter papers these databases’ purpose is to create a bridge between researchers’ data and practically unlimited scientific publications. Yet, in this article, we will discuss how we plan to do that, and our efforts on integrating deep learning modes. After all, unlike already existing AI models, DataDiscoveryLab can be their combination and the first Generative AI in academia that can encompass every part of the natural sciences.

Keywords

data analysis; computer vision algorithms; visual data; natural language processing; scientific research

Subject

Computer Science and Mathematics, Artificial Intelligence and Machine Learning

Comments (1)

Comment 1
Received: 2 May 2023
Commenter: Elbek Keskinoglu
Commenter's Conflict of Interests: Author
Comment: More detailed for future use cases of the platform.
+ Respond to this comment

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 1
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.