A comparison of MeSH terms and CiteULike social tags as metadata for the same items

Danielle H. Lee, Titus Schleyer

Research output: Chapter in Book/Report/Conference proceedingConference contribution

7 Citations (Scopus)

Abstract

In this paper, we examine the degree of difference between two types of metadata for biomedical articles generated by different groups of people. The first type of metadata is social tags, which are assigned to articles by their readers using uncontrolled vocabulary. The second type is index terms, which are assigned by professionally trained indexers and domain experts using a controlled vocabulary. When the two kinds of metadata are assigned to the same item, we may expect that they overlap to a large extent and could substitute for one another. In this study, we compared social tags and index terms for a set of papers that appear both in CiteULike and MEDLINE, and assessed their differences. Due to the idiosyncratic nature of social tags, we preprocessed the tags through normalization, stop-word removal, stemming and spell-checking. Our results show that social tags and Medical Subject Heading (MeSH) index have little overlap and embody largely heterogeneous understanding of items.

Original languageEnglish (US)
Title of host publicationIHI'10 - Proceedings of the 1st ACM International Health Informatics Symposium
Pages445-448
Number of pages4
DOIs
StatePublished - 2010
Externally publishedYes
Event1st ACM International Health Informatics Symposium, IHI'10 - Arlington, VA, United States
Duration: Nov 11 2010Nov 12 2010

Other

Other1st ACM International Health Informatics Symposium, IHI'10
CountryUnited States
CityArlington, VA
Period11/11/1011/12/10

Fingerprint

Medical Subject Headings
Controlled Vocabulary
Vocabulary
MEDLINE
Metadata

Keywords

  • citeulike
  • controlled vocabulary
  • folksonomy
  • mesh term
  • metadata
  • social tags

ASJC Scopus subject areas

  • Health Informatics
  • Health Information Management

Cite this

Lee, D. H., & Schleyer, T. (2010). A comparison of MeSH terms and CiteULike social tags as metadata for the same items. In IHI'10 - Proceedings of the 1st ACM International Health Informatics Symposium (pp. 445-448) https://doi.org/10.1145/1882992.1883060

A comparison of MeSH terms and CiteULike social tags as metadata for the same items. / Lee, Danielle H.; Schleyer, Titus.

IHI'10 - Proceedings of the 1st ACM International Health Informatics Symposium. 2010. p. 445-448.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Lee, DH & Schleyer, T 2010, A comparison of MeSH terms and CiteULike social tags as metadata for the same items. in IHI'10 - Proceedings of the 1st ACM International Health Informatics Symposium. pp. 445-448, 1st ACM International Health Informatics Symposium, IHI'10, Arlington, VA, United States, 11/11/10. https://doi.org/10.1145/1882992.1883060
Lee DH, Schleyer T. A comparison of MeSH terms and CiteULike social tags as metadata for the same items. In IHI'10 - Proceedings of the 1st ACM International Health Informatics Symposium. 2010. p. 445-448 https://doi.org/10.1145/1882992.1883060
Lee, Danielle H. ; Schleyer, Titus. / A comparison of MeSH terms and CiteULike social tags as metadata for the same items. IHI'10 - Proceedings of the 1st ACM International Health Informatics Symposium. 2010. pp. 445-448
@inproceedings{516c7c9c89f245e3b71bc18fe92cf551,
title = "A comparison of MeSH terms and CiteULike social tags as metadata for the same items",
abstract = "In this paper, we examine the degree of difference between two types of metadata for biomedical articles generated by different groups of people. The first type of metadata is social tags, which are assigned to articles by their readers using uncontrolled vocabulary. The second type is index terms, which are assigned by professionally trained indexers and domain experts using a controlled vocabulary. When the two kinds of metadata are assigned to the same item, we may expect that they overlap to a large extent and could substitute for one another. In this study, we compared social tags and index terms for a set of papers that appear both in CiteULike and MEDLINE, and assessed their differences. Due to the idiosyncratic nature of social tags, we preprocessed the tags through normalization, stop-word removal, stemming and spell-checking. Our results show that social tags and Medical Subject Heading (MeSH) index have little overlap and embody largely heterogeneous understanding of items.",
keywords = "citeulike, controlled vocabulary, folksonomy, mesh term, metadata, social tags",
author = "Lee, {Danielle H.} and Titus Schleyer",
year = "2010",
doi = "10.1145/1882992.1883060",
language = "English (US)",
isbn = "9781450300308",
pages = "445--448",
booktitle = "IHI'10 - Proceedings of the 1st ACM International Health Informatics Symposium",

}

TY - GEN

T1 - A comparison of MeSH terms and CiteULike social tags as metadata for the same items

AU - Lee, Danielle H.

AU - Schleyer, Titus

PY - 2010

Y1 - 2010

N2 - In this paper, we examine the degree of difference between two types of metadata for biomedical articles generated by different groups of people. The first type of metadata is social tags, which are assigned to articles by their readers using uncontrolled vocabulary. The second type is index terms, which are assigned by professionally trained indexers and domain experts using a controlled vocabulary. When the two kinds of metadata are assigned to the same item, we may expect that they overlap to a large extent and could substitute for one another. In this study, we compared social tags and index terms for a set of papers that appear both in CiteULike and MEDLINE, and assessed their differences. Due to the idiosyncratic nature of social tags, we preprocessed the tags through normalization, stop-word removal, stemming and spell-checking. Our results show that social tags and Medical Subject Heading (MeSH) index have little overlap and embody largely heterogeneous understanding of items.

AB - In this paper, we examine the degree of difference between two types of metadata for biomedical articles generated by different groups of people. The first type of metadata is social tags, which are assigned to articles by their readers using uncontrolled vocabulary. The second type is index terms, which are assigned by professionally trained indexers and domain experts using a controlled vocabulary. When the two kinds of metadata are assigned to the same item, we may expect that they overlap to a large extent and could substitute for one another. In this study, we compared social tags and index terms for a set of papers that appear both in CiteULike and MEDLINE, and assessed their differences. Due to the idiosyncratic nature of social tags, we preprocessed the tags through normalization, stop-word removal, stemming and spell-checking. Our results show that social tags and Medical Subject Heading (MeSH) index have little overlap and embody largely heterogeneous understanding of items.

KW - citeulike

KW - controlled vocabulary

KW - folksonomy

KW - mesh term

KW - metadata

KW - social tags

UR - http://www.scopus.com/inward/record.url?scp=78650942027&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=78650942027&partnerID=8YFLogxK

U2 - 10.1145/1882992.1883060

DO - 10.1145/1882992.1883060

M3 - Conference contribution

AN - SCOPUS:78650942027

SN - 9781450300308

SP - 445

EP - 448

BT - IHI'10 - Proceedings of the 1st ACM International Health Informatics Symposium

ER -