Kandiru t1_ishg2ww wrote
Reply to comment by promonk in When it's said 99.9% of human DNA is the same in all humans, is this referring to only coding DNA or both coding and non-coding DNA combined? by PeanutSalsa
Say a certain position is a A for 90% of people, but a C for 10%. The A variant is more common than the C.
So when the reference had previously had a C there, in a later version it's often been changed to the most frequent base.
promonk t1_ishu3eb wrote
I get that. What I'm curious about is sampling. 90% of which population? Is it 90 of some college-age kids being paid a hundred bucks for a cheek swab? Or is it drawn from a broad swathe of demographics and locations?
emfts t1_isiaa4s wrote
The first human reference genome (from the human genome project) was a group of people from all over, random volunteers.
You can read all about it here:
https://www.genome.gov/12513430/2004-release-ihgsc-describes-finished-human-sequence
Kandiru t1_isimfb1 wrote
The 1000 genome project used populations around the world
http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/README_populations.md
Has a list of the ones used.
[deleted] t1_ishhjrb wrote
[removed]
[deleted] t1_isiu3kr wrote
[removed]
Viewing a single comment thread. View all comments