122
views
0
recommends
+1 Recommend
0 collections
    4
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Identifying personal genomes by surname inference.

      Read this article at

      ScienceOpenPublisherPubMed
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Sharing sequencing data sets without identifiers has become a common practice in genomics. Here, we report that surnames can be recovered from personal genomes by profiling short tandem repeats on the Y chromosome (Y-STRs) and querying recreational genetic genealogy databases. We show that a combination of a surname with other types of metadata, such as age and state, can be used to triangulate the identity of the target. A key feature of this technique is that it entirely relies on free, publicly accessible Internet resources. We quantitatively analyze the probability of identification for U.S. males. We further demonstrate the feasibility of this technique by tracing back with high probability the identities of multiple participants in public sequencing projects.

          Related collections

          Author and article information

          Journal
          Science
          Science (New York, N.Y.)
          American Association for the Advancement of Science (AAAS)
          1095-9203
          0036-8075
          Jan 18 2013
          : 339
          : 6117
          Affiliations
          [1 ] Whitehead Institute for Biomedical Research, 9 Cambridge Center, Cambridge, MA 02142, USA.
          Article
          339/6117/321
          10.1126/science.1229566
          23329047
          f9b888fc-550c-41c6-af4a-3f55e8fe30aa
          History

          Comments

          Comment on this article