• Most White Americans’ DNA Can Be Identified Through Genealogy Databases - The New York Times
    https://www.nytimes.com/2018/10/11/science/science-genetic-genealogy-study.html

    The genetic genealogy industry is booming. In recent years, more than 15 million people have offered up their DNA (a cheek swab, some saliva in a test tube) to services such as 23andMe and Ancestry.com in pursuit of answers about their heritage. In exchange for a genetic fingerprint, individuals may find a birth parent, long-lost cousins, perhaps even a link to Oprah or Alexander the Great.

    But as these registries of genetic identity grow, it’s becoming harder for individuals to retain any anonymity. Already, 60 percent of Americans of Northern European descent — the primary group using these sites — can be identified through such databases whether or not they’ve joined one themselves, according to a study published today in the journal Science.

    Within two or three years, 90 percent of Americans of European descent will be identifiable from their DNA, researchers found. The science-fiction future, in which everyone is known whether or not they want to be, is nigh.

    The study's results were eye-opening. The researchers found that a DNA sample from an American of Northern European heritage could be tracked successfully to a third-cousin distance of its owner in 60 percent of cases. A comparable analysis on the MyHeritage site had similar results. (The analysis focused on Americans of Northern European background because 75 percent of the users on GEDmatch and other genealogy sites belong to that demographic.)

    Some experts have raised questions about the study’s methodology. Its sample size was small, and it didn’t factor in that more than one match is often required to identify a suspect.

    CeCe Moore, a genetic genealogist with Parabon, a forensic consulting firm, also expressed worry in an email that the Science paper may obscure the difficulty involved in puzzling out someone’s identity; it takes a highly skilled expert to build a family tree from the initial genetic clues.

    Still, she said, the takeaway of the study “is not news to us.” In recent months Ms. Moore has been involved in a dozen murder and sexual assault cases that used GEDmatch to identify suspects. Of the 100 crime-scene profiles that her firm had uploaded to GEDmatch by May, half were obviously solvable, she said, and 20 were “promising.”

    “I think it’s a strong and convincing paper,” said Graham Coop, a population genetics researcher at the University of California, Davis. In a blog post in May, Dr. Coop calculated just how lucky investigators had been in the Golden State Killer case. He reached a statistical conclusion similar to that of Dr. Erlich, the lead author of the Science study: society is not far from being able to identify 90 percent of people through the DNA of their cousins in genealogical databases.

    “This is this moment of, wow, oh, this opens up a lot of possibilities, some of which are good and some are more questionable,” he said.

    In an alarming result, the Science study found that a supposedly “anonymized” genetic profile taken from a medical data set could be uploaded to GEDmatch and positively identified. This shows that an individual’s private health data might not be so private after all.

    #Génomique #ADN #Vie_privée