| Literature DB >> 28490825 |
Nees Jan van Eck1, Ludo Waltman1.
Abstract
Clustering scientific publications in an important problem in bibliometric research. We demonstrate how two software tools, CitNetExplorer and VOSviewer, can be used to cluster publications and to analyze the resulting clustering solutions. CitNetExplorer is used to cluster a large set of publications in the field of astronomy and astrophysics. The publications are clustered based on direct citation relations. CitNetExplorer and VOSviewer are used together to analyze the resulting clustering solutions. Both tools use visualizations to support the analysis of the clustering solutions, with CitNetExplorer focusing on the analysis at the level of individual publications and VOSviewer focusing on the analysis at an aggregate level. The demonstration provided in this paper shows how a clustering of publications can be created and analyzed using freely available software tools. Using the approach presented in this paper, bibliometricians are able to carry out sophisticated cluster analyses without the need to have a deep knowledge of clustering techniques and without requiring advanced computer skills.Entities:
Keywords: CitNetExplorer; Citation; Clustering; VOSviewer
Year: 2017 PMID: 28490825 PMCID: PMC5400793 DOI: 10.1007/s11192-017-2300-7
Source DB: PubMed Journal: Scientometrics ISSN: 0138-9130 Impact factor: 3.238
Statistics for the data set of astronomy and astrophysics publications
| No. of publications | 111,616 |
| No. of journals | 59 |
| No. of cited references | 4,311,953 |
| No. of citation relations between publications in the data set | 929,364 |
| No. of citation relations in CitNetExplorer | 925,540 |
Parameters and statistics for the different clustering solutions
| Level | Resolution | Min. cluster size | No. of clusters | Avg. no. of pub. per cluster | No. of pub. smallest cluster | No. of pub. largest cluster |
|---|---|---|---|---|---|---|
| 1 | 1.8 | 500 | 22 | 4628.5 | 794 | 14,873 |
| 2 | 3.0 | 250 | 42 | 2424.5 | 253 | 9395 |
| 3 | 10.0 | 150 | 115 | 885.5 | 176 | 2891 |
| 4 | 40.0 | 50 | 434 | 234.6 | 50 | 1080 |
Brief summary of the 22 level 1 clusters
| Cluster | No. of pub. | Terms |
|---|---|---|
| 1 | 14,873 | Galaxy cluster; galaxy; early type galaxy; abell; high redshift |
| 2 | 8954 | Dark energy; inflation; wmap; cosmic microwave background; cosmology |
| 3 | 7998 | Solar flare; coronal mass ejection; solar corona; solar cycle; sunspot |
| 4 | 7483 | Brown dwarf; protoplanetary disk; extrasolar planet; planet; exoplanet |
| 5 | 5704 | Molecular cloud; region; dark cloud; protostar; dense core |
| 6 | 5597 | Globular cluster; globular cluster system; metal poor star; star cluster; omega centauri |
| 7 | 5363 | Qcd; lattice; decay; finite temperature; lattice qcd |
| 8 | 5211 | Cern lhc; lhc; dark matter annihilation; leptogenesis; higgs boson |
| 9 | 5179 | Ultraluminous x ray source; cygnus x; x ray binary; microquasar; integral |
| 10 | 3904 | Quasinormal mode; hawking radiation; ads; higher dimension; wormhole |
| 11 | 3527 | Supernova remnant; pulsar; psr; magnetar; radio pulsar |
| 12 | 3413 | Asteroid; comet; body problem; trans neptunian object; centaur |
| 13 | 3392 | Eta carinae; asteroseismology; ap star; peculiar star; binary |
| 14 | 3355 | Titan; mars; venus; mercury; europa |
| 15 | 3182 | Grb; gamma ray burst; afterglow; type ia supernovae; short gamma ray burst |
| 16 | 3156 | Lisa; gravitational wafe; numerical relativity; gravitational wave detector; gravitational wave burst |
| 17 | 2625 | Blazar; ultra high energy cosmic ray; bl lacertae object; pks; bl lac object |
| 18 | 2228 | Iri; cluster observation; ionosphere; magnetosheath; low latitude |
| 19 | 2088 | Cataclysmic variable; white dwarf; nova; superoutburst; dwarf novae |
| 20 | 1963 | Loop quantum gravity; loop quantum cosmology; quantum gravity; lorentz; lorentz violation |
| 21 | 1839 | Planetary nebulae; symbiotic star; planetary nebula ngc; central star; planetary nebula |
| 22 | 794 | Pioneer; lense thirring effect; teleparallel gravity; equivalence principle; iau |
Extended summary of the 22 level 1 clusters
| Cluster | No. of pub. | Terms, standardized terms, journals, and most frequently cited publication |
|---|---|---|
| 1 | 14,873 (14.6%) |
|
| 2 | 8954 (8.8%) |
|
| 3 | 7998 (7.9%) |
|
| 4 | 7483 (7.3%) |
|
| 5 | 5704 (5.6%) |
|
| 6 | 5597 (5.5%) |
|
| 7 | 5363 (5.3%) |
|
| 8 | 5211 (5.1%) |
|
| 9 | 5179 (5.1%) |
|
| 10 | 3904 (3.8%) |
|
| 11 | 3527 (3.5%) |
|
| 12 | 3413 (3.4%) |
|
| 13 | 3392 (3.3%) |
|
| 14 | 3355 (3.3%) |
|
| 15 | 3182 (3.1%) |
|
| 16 | 3156 (3.1%) |
|
| 17 | 2 625 (2.6%) |
|
| 18 | 2228 (2.2%) |
|
| 19 | 2088 (2.1%) |
|
| 20 | 1963 (1.9%) |
|
| 21 | 1839 (1.8%) |
|
| 22 | 794 (0.8%) |
|
Fig. 1CitNetExplorer visualization of the 100 most frequently cited publications in level 1 clusters 1, 2, 3, and 4. Colors indicate the level 1 cluster to which a publication belongs. (Color figure online)
Fig. 2CitNetExplorer visualization of the 100 most frequently cited publications in level 1 cluster 2. Colors indicate the level 3 cluster to which a publication belongs. (Color figure online)
Fig. 3VOSviewer visualization of the 22 level 1 clusters and their citation relations. An interactive version of the visualization is available online at http://goo.gl/968hLw
Fig. 4VOSviewer term map visualization for level 1 cluster 3. The visualization shows 1420 terms extracted from the titles and abstracts of the publications belonging to the cluster. The strongest co-occurrence relations between terms are shown as well. An interactive version of the visualization is available online at http://goo.gl/sotbF1