Disentangling the evolution of MEDLINE bibliographic database: A complex network perspective.

Bibliographic databases Complex networks MEDLINE Network evolution Science of science

Journal

Journal of biomedical informatics
ISSN: 1532-0480
Titre abrégé: J Biomed Inform
Pays: United States
ID NLM: 100970413

Informations de publication

Date de publication:
01 2019
Historique:
received: 01 06 2018
revised: 20 11 2018
accepted: 28 11 2018
pubmed: 12 12 2018
medline: 14 3 2020
entrez: 12 12 2018
Statut: ppublish

Résumé

Scientific knowledge constitutes a complex system that has recently been the topic of in-depth analysis. Empirical evidence reveals that little is known about the dynamic aspects of human knowledge. Precise dissection of the expansion of scientific knowledge could help us to better understand the evolutionary dynamics of science. In this paper, we analyzed the dynamic properties and growth principles of the MEDLINE bibliographic database using network analysis methodology. The basic assumption of this work is that the scientific evolution of the life sciences can be represented as a list of co-occurrences of MeSH descriptors that are linked to MEDLINE citations. The MEDLINE database was summarized as a complex system, consisting of nodes and edges, where the nodes refer to knowledge concepts and the edges symbolize corresponding relations. We performed an extensive statistical evaluation based on more than 25 million citations in the MEDLINE database, from 1966 until 2014. We based our analysis on node and community level in order to track temporal evolution in the network. The degree distribution of the network follows a stretched exponential distribution which prevents the creation of large hubs. Results showed that the appearance of new MeSH terms does not also imply new connections. The majority of new connections among nodes results from old MeSH descriptors. We suggest a wiring mechanism based on the theory of structural holes, according to which a novel scientific discovery is established when a connection is built among two or more previously disconnected parts of scientific knowledge. Overall, we extracted 142 different evolving communities. It is evident that new communities are constantly born, live for some time, and then die. We also provide a Web-based application that helps characterize and understand the content of extracted communities. This study clearly shows that the evolution of MEDLINE knowledge correlates with the network's structural and temporal characteristics.

Identifiants

pubmed: 30529574
pii: S1532-0464(18)30227-2
doi: 10.1016/j.jbi.2018.11.014
pii:
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

101-113

Informations de copyright

Copyright © 2018 Elsevier Inc. All rights reserved.

Auteurs

Andrej Kastrin (A)

Institute of Biostatistics and Medical Informatics, Faculty of Medicine, University of Ljubljana, Vrazov trg 2, SI-1000 Ljubljana, Slovenia. Electronic address: andrej.kastrin@mf.uni-lj.si.

Dimitar Hristovski (D)

Institute of Biostatistics and Medical Informatics, Faculty of Medicine, University of Ljubljana, Vrazov trg 2, SI-1000 Ljubljana, Slovenia. Electronic address: dimitar.hristovski@mf.uni-lj.si.

Articles similaires

United Kingdom Humans Databases, Bibliographic Information Storage and Retrieval MEDLINE
1.00
Medical Subject Headings Machine Learning PubMed Algorithms
Natural Language Processing Data Mining Knowledge Discovery PubMed Search Engine
Humans COVID-19 MEDLINE United Kingdom SARS-CoV-2

Classifications MeSH