Clustering More than Two Million Biomedical Publications: Comparing the Accuracies of Nine Text-Based Similarity Approaches

dc.contributor.authorBoyack, Kevin W.
dc.contributor.authorNewman, David
dc.contributor.authorDuhon, Russell J.
dc.contributor.authorKlavans, Richard
dc.contributor.authorPatek, Michael
dc.contributor.authorBiberstine, Joseph R.
dc.contributor.authorSchijvenaars, Bob
dc.contributor.authorSkupin, André
dc.contributor.authorMa, Nianli
dc.contributor.authorBörner, Katy
dc.date.accessioned2011-09-12T18:12:22Z
dc.date.available2011-09-12T18:12:22Z
dc.date.issued2011
dc.description.abstractWe investigate the accuracy of different similarity approaches for clustering over two million biomedical documents. Clustering large sets of text documents is important for a variety of information needs and applications such as collection management and navigation, summary and analysis. The few comparisons of clustering results from different similarity approaches have focused on small literature sets and have given conflicting results. Our study was designed to seek a robust answer to the question of which similarity approach would generate the most coherent clusters of a biomedical literature set of over two million documents.en
dc.identifier.citationBoyack, Kevin W., Newman, David, Duhon, Russell Jackson, Klavans, Richard, Patek, Michael, Biberstine, Joseph R., Schijvenaars, Bob, Skupin, Andre, Ma, Nianli & Börner, Katy. (2011). Clustering More Than Two Million Biomedical Publications: Comparing the Accuracies of Nine Text-Based Similarity Approaches. PLoS ONE. Vol. 6, 1-11.en
dc.identifier.urihttps://hdl.handle.net/2022/13472
dc.language.isoen_USen
dc.publisherPublic Library of Scienceen
dc.relation.isversionofhttps://doi.org/10.1371/journal.pone.0018029en
dc.titleClustering More than Two Million Biomedical Publications: Comparing the Accuracies of Nine Text-Based Similarity Approachesen
dc.typeArticleen

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
clustering_more_than_two_million_biomed_pub.pdf
Size:
1.08 MB
Format:
Adobe Portable Document Format
Description:
Main Article
Can’t use the file because of accessibility barriers? Contact us with the title of the item, permanent link, and specifics of your accommodation need.