Display Settings:

Format

Send to:

Choose Destination
    Proc Natl Acad Sci U S A. 2004 Apr 6;101 Suppl 1:5220-7. Epub 2004 Mar 12.

    Mixed-membership models of scientific publications.

    Source

    Department of Statistics, School of Social Work, and Center for Statistics and the Social Sciences, University of Washington, Seattle, WA 98195, USA. elena@stat.washington.edu

    Abstract

    PNAS is one of world's most cited multidisciplinary scientific journals. The PNAS official classification structure of subjects is reflected in topic labels submitted by the authors of articles, largely related to traditionally established disciplines. These include broad field classifications into physical sciences, biological sciences, social sciences, and further subtopic classifications within the fields. Focusing on biological sciences, we explore an internal soft-classification structure of articles based only on semantic decompositions of abstracts and bibliographies and compare it with the formal discipline classifications. Our model assumes that there is a fixed number of internal categories, each characterized by multinomial distributions over words (in abstracts) and references (in bibliographies). Soft classification for each article is based on proportions of the article's content coming from each category. We discuss the appropriateness of the model for the PNAS database as well as other features of the data relevant to soft classification.

    PMID:
    15020766
    [PubMed - indexed for MEDLINE]
    PMCID:
    PMC387299
    Free PMC Article

    Images from this publication.See all images (1) Free text

    Fig. 1.

      Supplemental Content

      Icon for HighWire Press Icon for PubMed Central

      Save items

      loading

      Recent activity

      Your browsing activity is empty.

      Activity recording is turned off.

      Turn recording back on

      See more...
      Write to the Help Desk