Preprint Data Descriptor Version 1 NOT YET PEER-REVIEWED

688,112 Statistical Results: Content Mining Psychology Articles for Statistical Test Results

  1. Department of Methodology and Statistics, Tilburg University,Warandelaan 2, 5037AB Tilburg, The Netherlands
Version 1 : Received: 22 August 2016 / Approved: 23 August 2016 / Online: 23 August 2016 (10:33:53 CEST)

A peer-reviewed article of this Preprint also exists.

Hartgerink, C.H.J. 688,112 Statistical Results: Content Mining Psychology Articles for Statistical Test Results. Data 2016, 1, 14. Hartgerink, C.H.J. 688,112 Statistical Results: Content Mining Psychology Articles for Statistical Test Results. Data 2016, 1, 14.

Journal reference: Data 2016, 1, 14
DOI: 10.3390/data1030014

Abstract

In this data deposit, I describe a dataset that is the result of content mining 167,318 published articles for statistical test results. As a result of this content mining, 688,112 results from 50,845 articles were extracted. In order to provide a comprehensive set of data, the statistical results are supplemented with metadata from the article they originate from. The dataset is provided in a comma separated file (CSV) in long-format. For each of the 688,112 results, 20 variables are included, of which seven are article metadata and 13 pertain to the individual statistical results (e.g., reported and recalculated p-value). A five-pronged approach was taken to generate the dataset: (i) collect journal lists, (ii) spider journal pages for articles, (iii) download articles, (iv) add article metadata, and (v) mine articles for statistical results.

Subject Areas

nhst; p-values; apa; content mining; tdm; errors

Readers' Comments and Ratings (0)

Discuss and rate this article
Views 1375
Downloads 468
Comments 0
Metrics 0
Discuss and rate this article

×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.