Working Paper Dataset Version 2 This version is not peer-reviewed

Beyond History and “on a Roll”: The List of the most Well-Studied Human Protein Structures and Overall Trends in the Protein Data Bank

Version 1 : Received: 27 August 2020 / Approved: 30 August 2020 / Online: 30 August 2020 (10:59:17 CEST)
Version 2 : Received: 16 October 2020 / Approved: 16 October 2020 / Online: 16 October 2020 (12:11:04 CEST)

How to cite: Li, Z.; Buck, M. Beyond History and “on a Roll”: The List of the most Well-Studied Human Protein Structures and Overall Trends in the Protein Data Bank. Preprints 2020, 2020080655 Li, Z.; Buck, M. Beyond History and “on a Roll”: The List of the most Well-Studied Human Protein Structures and Overall Trends in the Protein Data Bank. Preprints 2020, 2020080655

Abstract

Of the roughly 20,000 canonical human protein sequences, as of September 15, 2020, 6,937 proteins have had their full or partial, medium- to high-resolution structures determined by x-ray crystallography or other methods. Which of these proteins dominate the Protein Data Bank (the PDB) and why? In this paper, we list the 273 top human protein structures based on the number of their PDB entries. This set of proteins accounts for more than 40% of all available human PDB entries and represent past trends as well as current status for protein structural biology. We briefly discuss the relationship which some of the prominent protein structures have with protein research as a whole and mention their relevance to human diseases. The top-10 soluble and membrane proteins are all well-known (most of their first structures being deposited more than 30 years ago). Overall, there is no dramatic change in recent trends in the PDB. Remarkably, the number of structure depositions has grown nearly exponentially over the last 10 or more years (with a doubling time of 7 yrs for proteins from all organisms). Growth in human protein structures is slightly faster (at 5.9 yrs, while E.Coli and Mouse+Rat protein structures accumulate more slowly, Zebrafish protein structures are growing most, at a doubling every 3.7 years, albeit starting from only approx. 100 structure entries in 2010). The information may be informative to senior scientists but also inspire researchers who are new to protein science, providing the year 2020 snap-shot for the state of protein structural biology.

Subject Areas

mostly studied human protein structure; human diseases; biophysics

Comments (1)

Comment 1
Received: 16 October 2020
Commenter: Zhenlu Li
Commenter's Conflict of Interests: Author
Comment: Add more references and more analysis on PDB statistics of model organisms.
+ Respond to this comment

We encourage comments and feedback from a broad range of readers. See criteria for comments and our diversity statement.

Leave a public comment
Send a private comment to the author(s)
Views 0
Downloads 0
Comments 1
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.