PNAS -- Harpending et al. 95 (4): 1961

This contribution is part of the special series of Inaugural Articles by members of the National Academy of Sciences elected on April 30, 1996.

Evolution / Anthropology
Genetic traces of ancient demography

Henry C. Harpending^*, Mark A. Batzer, Michael Gurven^§, Lynn B. Jorde^¶, Alan R. Rogers^*, and Stephen T. Sherry

^* Department of Anthropology, University of Utah, Salt Lake City, UT 84112; Departments of Pathology and Biometry and Genetics, Stanley S. Scott Cancer Center, Neuroscience Center of Excellence, Louisiana State University Medical Center, New Orleans, LA 70112; ^§ Department of Anthropology, University of New Mexico, Albuquerque, NM 87104; and ^¶ Department of Human Genetics, University of Utah School of Medicine, Salt Lake City, UT 84112

Contributed by Henry C. Harpending, December 10, 1997

	ABSTRACT

Patterns of gene differences among humans contain information about the demographic history of our species. Haploid loci like mitochondrial DNA and the nonrecombining part of the Y chromosome show a pattern indicating expansion from a population of only several thousand during the late middle or early upper Pleistocene. Nuclear short tandem repeat loci also show evidence of this expansion. Both mitochondrial DNA and the Y chromosome coalesce within the last several hundred thousand years, and they cannot provide information about the population before their coalescence. Several nuclear loci are informative about our ancestral population size during nearly the whole Pleistocene. They indicate a small effective size, on the order of 10,000 breeding individuals, throughout this time period. This genetic evidence denies any version of the multiregional model of modern human origins. It implies instead that our ancestors were effectively a separate species for most of the Pleistocene.

	ARTICLE

When and where did modern humans evolve? This question remains the focus of much scientific controversy. Traditionally, answers were sought in the human fossil record, which tells us that upright bipedal hominids who made stone tools have occupied much of the temperate Old World for 0.5-1.5 million years. The earlier forms of these hominids are usually called Homo erectus, whereas the later forms, with larger brains and more sophisticated tool kits, are called Archaic Homo sapiens. The Neandertals of Europe are the most familiar of these archaics. In Europe they were replaced by modern humans over several millennia about 40,000 years ago. In Indonesia archaics may have persisted until as recently as 25,000 years ago (1).

Although fossils provide unique and invaluable information, they are very limited in quantity and quality. Huge gaps remain in the human fossil record, and it is difficult to assess, for example, whether there was continuity between archaic and modern humans. Genetic data, in contrast, are easy to collect, and they are accumulating rapidly. Ancient demographic events have left imprints that can be detected in present-day gene differences. Our purpose is to write an accessible summary of current genetic research about human population history.

Genetic methods and data are providing fresh perspectives on a long-standing debate about the origins of our species, which, in its simple form, can be summarized as two competing hypotheses. The multiregional hypothesis suggests that modern humans evolved directly from archaic forms in several different locations in the Old World. Gene flow among these populations, combined with natural selection for advantageous genes, maintained genetic homogeneity of the species. Under this hypothesis, our species had hundreds of thousands, perhaps millions, of ancestors for most of the last million years. Without a large population, gene flow among populations distributed widely over the temperate and tropical Old World would have been impossible.

The other hypothesis is called variously the Garden of Eden, the Noah's Ark, or the single origin model. According to this hypothesis, a specific population ancestral to modern humans underwent demographic expansion and populated those parts of the world occupied by archaics and then beyond into northern parts of Eurasia and eventually the New World. The contribution of archaic populations to the modern gene pool was negligible. The number of our ancestors just before the expansion ("origin") of modern humans was small, only several thousand breeding adults.

A clear difference between these two hypotheses is the implied size of the past human population. If the size of the human population had been large throughout much of its history, extant genetic variation should be substantial. Conversely, a small human population would result in relatively little genetic variation. Many genetic systems provide reassuringly congruent estimates: all indicate that human genetic variation is relatively low and that the approximate "effective" size (i.e., the number of breeding adults) of humans is on the order of 10,000 (2, 3). Because several thousands or even tens of thousands of humans could not have occupied the whole temperate Old World, genetic data provide strong support for the single origin hypothesis. Archaeological and skeletal evidence also generally support some version of the single origin hypothesis (4, 5).

The genetic relationship between modern humanity and the world population of archaics at, say, a half million years ago is still unspecified. If the small effective size of humans reflects a transient but drastic reduction in size of a large population, and subsequent recovery, then before the reduction, the number of our ancestors was large, and a graph of our population history looks like an hourglass. By contrast, if we are descended from a subpopulation of archaic humans that was effectively a separate species for the last million years or so, then the graph of our population history is a bottleneck, a short bottle and a very long neck.

The hourglass hypothesis posits that there was a contraction in the number of our ancestors at some time during the Pleistocene, perhaps before the last interglacial, but specifies that the small ancestral population that later expanded had been part of a network of gene flow over the whole temperate Old World occupied by archaic humans before the contraction. In other words, if the genes in the small founding population of modern humans were traced backward in time, they would be dispersed over a large part of the Old World in a population of hundreds of thousands to millions, as in the multiregional hypothesis. Humanity's small apparent effective size is the result of loss of genetic diversity during the contraction.

The long-neck hypothesis, in contrast, posits that the ancestral population was small during most of the Pleistocene, for the last million years or so, and that genes in this population traced backward in time were restricted to the range of the particular species of archaic humans that were our ancestors. The essential difference between the two hypotheses is the effective size before the constriction in the middle or upper Pleistocene. They have different consequences for the shape of trees of descent of nuclear genes. Below we show that there is no support in current genetic data for the hourglass hypothesis, whereas the long-neck hypothesis finds strong support.

Estimating Human Effective Size

Effective Size and Census Size. Effective size is the breeding size of an abstract population, and relating effective to census size of human populations is complicated.

The standard model treats a population as a collection of genes that give birth each generation to a Poisson-distributed number of progeny in such a way that the overall number of genes in the population, N, remains constant from one generation to the next. Under this model, many genes become extinct because they have no progeny. If we pick two genes at random from the population, the probability that they had the same parent (i.e., that they coalesce) is just 1/N, so the expected waiting time until they coalesce is N generations. This can serve as a definition of effective size at a single time, or of the long-term effective size if population size and breeding structure do not change. Felsenstein (6) showed that the effective size of human populations is about one-half the census size. This fraction may have been higher before the evolution of our long postreproductive life span.
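The 1/N coalescence probability and the N-generation expected waiting time can be checked numerically. The sketch below is a minimal illustration, assuming a haploid Wright-Fisher population in which each gene draws its parent uniformly at random from the N genes of the previous generation (a common stand-in for the Poisson-progeny model with constant N). The function names, the value N = 20, and the number of replicates are illustrative choices, not taken from the article.

```python
import random

def pairwise_coalescence_time(n_genes, rng):
    """Count generations back until two sampled genes share a parent."""
    generations = 0
    while True:
        generations += 1
        # Each lineage independently picks one of the n_genes parents,
        # so the two lineages meet with probability 1 / n_genes.
        if rng.randrange(n_genes) == rng.randrange(n_genes):
            return generations

def mean_coalescence_time(n_genes, replicates=20000, seed=1):
    """Average the pairwise coalescence time over many replicate pairs."""
    rng = random.Random(seed)
    total = sum(pairwise_coalescence_time(n_genes, rng)
                for _ in range(replicates))
    return total / replicates

# With N = 20 the average should come out close to 20 generations.
print(mean_coalescence_time(20))
```

The waiting time is geometric with success probability 1/N, which is why its mean is N.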

When effective size changes over time, then long-term effective size is usually closer to the minimum than to the average effective size. In some simple cases, long-term effective size is the harmonic mean of the changing instantaneous effective size, and this may be a useful heuristic.
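A small numerical example shows why the harmonic mean sits near the minimum. The sketch below is illustrative only: the function name and the size history (nine generations at 1,000 and one at 10) are made-up values chosen to show the effect.

```python
def harmonic_mean_size(sizes):
    """Harmonic mean of per-generation effective sizes."""
    return len(sizes) / sum(1.0 / n for n in sizes)

# Hypothetical history: one generation of 10 among nine generations of 1,000.
history = [1000] * 9 + [10]
arithmetic = sum(history) / len(history)   # 901.0
harmonic = harmonic_mean_size(history)     # about 91.7
print(arithmetic, harmonic)
```

A single small generation pulls the harmonic mean down by roughly a factor of ten here, while barely moving the arithmetic mean.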

Population breeding structure can change effective size in significant ways. If a population is subdivided into partially isolated subpopulations, then the effective size is greater than that of a random mating population because the waiting time to coalescence of genes is increased by the time they spend in different subpopulations. At the level of subdivision among human populations today, this effect would be minor, elevating the ratio of effective to census size by 10% or 15%.

If subpopulations frequently go extinct and are replaced by members of a neighboring subpopulation, then effective size is reduced. In the extreme case where there is almost no gene flow among subpopulations and where descendants of a single subpopulation ultimately replace all others, effective size over time can be closer to the size of a single subpopulation than to the size of the whole population. If something like this happened in our evolution (7), the effective size of the founding subpopulation is exactly what we want to estimate. If there were any substantial gene flow among subpopulations during the replacement process, the effective size over time would reflect the size of the whole population rather than that of a single subpopulation.

Estimates from Mean Pairwise Difference and Segregating Sites. The simplest genetic evidence about our demographic history is from estimates of overall human effective size. There are two standard approaches to estimating effective size. Neither estimates size per se; each estimates the product of size and mutation rate, because these two parameters are almost always confounded in population genetic models. An exception is the estimate from human-specific Alu insertions described below.

The familiar way to estimate size uses DNA sequences. The mean time to coalescence of pairs of sequences is N generations, so the total path length between them is 2N generations. With the infinite sites assumption, according to which every mutation occurs at a new nucleotide position, the expected number of differences between two sequences is 2Nu, where u is the mutation rate for the whole sequence, that is, the per-nucleotide rate multiplied by the sequence length. This method requires knowledge of the mutation rate. When the infinite sites assumption is violated, it is often necessary to correct this mean pairwise difference estimate for repeated mutations (8, 9).
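The pairwise estimate amounts to dividing the mean pairwise difference by 2u. The sketch below is a minimal illustration under the infinite sites assumption, with no correction for repeated mutations. The function names, the toy sequences, and the mutation rate of 1e-4 per sequence per generation are hypothetical values chosen for the example.

```python
from itertools import combinations

def mean_pairwise_difference(sequences):
    """Average number of differing sites over all pairs of aligned sequences."""
    pairs = list(combinations(sequences, 2))
    total = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return total / len(pairs)

def effective_size_from_pairwise(sequences, u):
    """Solve E[differences] = 2 * N * u for N.

    u is the mutation rate for the whole sequence per generation.
    """
    return mean_pairwise_difference(sequences) / (2.0 * u)

toy_sample = ["AAAA", "AAAT", "AATT"]   # pairwise differences: 1, 2, 1
print(mean_pairwise_difference(toy_sample))
print(effective_size_from_pairwise(toy_sample, u=1e-4))
```

Because only the product Nu is estimated, any error in the assumed u passes directly into the estimate of N.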

An alternate method of estimating N from DNA sequences relies on the overall branch length of the genealogical tree of a sample of n genes and on the infinite sites assumption. The genealogy of a sample of n genes can be divided into $n - 1$ epochs during which there are $n$, $n - 1$, $n - 2$, ..., 2 ancestors of the sample in the population. The expected branch length at each epoch, the product of the number of lines present and the duration of the epoch, has a simple form under the constant size hypothesis. The oldest epoch, when there were two genes ancestral to the sample, has expected duration N generations as derived above. During a more recent epoch, when there are j genes ancestral to the sample present in the population, the hazard of coalescence between any pair is just 1/N per generation and there are $j(j - 1)/2$ ways that pairs can be formed. The total hazard is $j(j - 1)/2N$ for epoch j, so the expected duration of this epoch is $2N/j(j - 1)$ generations. Adding these and multiplying the expected length of each epoch by j because there are j lines present, the expected total branch length is

$$\sum_{j=2}^{n} j \, \frac{2N}{j(j-1)} = 2N \sum_{i=1}^{n-1} \frac{1}{i} \quad \text{generations.}$$

[1]

The expected time back to the most recent common ancestor, in contrast to total branch length, is the sum of the interval lengths rather than the sum of the branch lengths. This approaches 2N as the sample size n becomes moderately large. This is called the coalescent of the tree.
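The limit of 2N follows from a telescoping sum. As a short worked step, using the expected epoch duration $2N/j(j - 1)$ given in the text (the symbol $T_{\mathrm{MRCA}}$ is notation introduced here for the time to the most recent common ancestor):

$$E[T_{\mathrm{MRCA}}] = \sum_{j=2}^{n} \frac{2N}{j(j-1)} = 2N \sum_{j=2}^{n} \left( \frac{1}{j-1} - \frac{1}{j} \right) = 2N \left( 1 - \frac{1}{n} \right).$$

For a sample of n = 20 genes this is 1.9N generations, already within 5% of 2N, and half of the expected depth (N generations) comes from the oldest epoch alone.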

It is remarkable that the distribution in Eq. 1 describes both the distribution of when mutations occurred and of the relative frequency of mutations in the sample. The probability that a mutation occurred in epoch j is proportional to $1/(j - 1)$ as j varies from 2 to n (the term $1/i$ of Eq. 1 with $i = j - 1$), and the probability that there are k copies of a mutant in a sample of n genes is proportional to 1/k as k varies from 1 to $n - 1$.
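The 1/k rule for mutant copy number is easy to tabulate. The sketch below is a minimal illustration for a constant-size population under the infinite sites assumption; the function name is an illustrative choice.

```python
def copy_number_probabilities(n):
    """Probability that a segregating mutant has k copies, for k = 1..n-1.

    Under constant size the probabilities are proportional to 1/k.
    Entry [k - 1] of the returned list holds the probability for k copies.
    """
    weights = [1.0 / k for k in range(1, n)]
    total = sum(weights)
    return [w / total for w in weights]

probs = copy_number_probabilities(20)
# Singletons are twice as likely as doubletons, three times as likely
# as tripletons, and so on.
print(probs[0], probs[1], probs[2])
```

For a sample of 20 genes, a little over a quarter of segregating sites are expected to be singletons; an excess beyond that is the kind of signal discussed later for expanding populations.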

The expected number of mutations in the whole tree, equivalent to the total number of segregating sites in the sample, is the tree length multiplied by the mutation rate u. Although this estimator of N based on Eq. 1 has in theory better statistical properties than the estimator based on pairwise differences, it is more sensitive to violations of the assumption of constant population size in the past.
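Setting the observed number of segregating sites equal to u times the tree length of Eq. 1 gives an estimate of N. The sketch below is a minimal illustration assuming constant size and infinite sites; the function names and the example values (20 segregating sites, 10 genes, u = 1e-3 per sequence per generation) are hypothetical.

```python
def harmonic_number(m):
    """Sum of 1/i for i = 1..m."""
    return sum(1.0 / i for i in range(1, m + 1))

def expected_total_branch_length(n, N):
    """Eq. 1: expected tree length, in generations, for a sample of n genes."""
    return 2.0 * N * harmonic_number(n - 1)

def expected_segregating_sites(n, N, u):
    """Expected number of segregating sites: tree length times mutation rate."""
    return u * expected_total_branch_length(n, N)

def effective_size_from_segregating_sites(S, n, u):
    """Solve S = 2 * N * u * sum_{i=1}^{n-1} 1/i for N."""
    return S / (2.0 * u * harmonic_number(n - 1))

N_hat = effective_size_from_segregating_sites(S=20, n=10, u=1e-3)
print(N_hat)
print(expected_segregating_sites(10, N_hat, 1e-3))  # recovers S = 20
```

Because the harmonic sum grows only logarithmically with n, adding more genes to the sample lengthens the tree slowly.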

An important recent extension of the pairwise method simultaneously estimates the effective sizes of two related species and the effective size of their common ancestral population by using maximum likelihood (3).

New Effective Size Estimates. While the above methods require knowledge of the mutation rate to estimate N, a different approach is to use the time of separation between the ancestral chimpanzee and human species to calibrate a genetic estimate of human effective size using Alu insertions. Most Alu elements are short ($\approx$300 bp) pseudogenes (10). They are stable, transcriptionally inactive copies of a few active Alu elements, scattered randomly throughout the entire nuclear genome. Collectively, there are about 500,000 copies per haploid genome or 5% of the genome by mass. Some Alu elements are shared with prosimians, monkeys, and apes, whereas others are so recent that they are polymorphic in humans. We have studied insertions of the Ya5 and Yb8 subfamilies whose active elements have been present since before the separation of gorillas from the chimp-human lineage. There are several thousand Ya5 and Yb8 elements in the human nuclear genome, and they are still being inserted. The unique value for evolutionary inference of these loci is that inserted elements are never precisely deleted, so the ancestral state is always known.

Fig. 1 is a schematic of the history of a sample from some locus in our nuclear genome. Interval A is the coalescent of the sample in humans, interval B is the interval from the top of the human coalescent tree to the time of speciation of the ancestral chimp-human population, and interval C is the coalescent time in the ancestral population. Alu insertions in humans that are absent in chimpanzees have occurred somewhere along the branch leading to humans. Any insertions during intervals B and C are fixed in the sample, whereas any that occurred during interval A are polymorphic. The total path length in A is proportional to human effective size, the duration of B can be estimated from paleontology, and C can be estimated by comparing within and between species genetic diversity in chimps and humans at other loci (3).

Fig. 1. Schematic history of a nuclear locus in humans and chimps.

We found 44 fixed and 13 polymorphic insertions in a sample of 122 humans. Sherry et al. (11) give details of the relationship between the number of segregating insertions, the total tree length in interval A, and the implied effective size. We estimated the effective size of humans to be 17,500 with this method, or 9,000 if we assumed the length of interval C to be zero. Our method gives an estimate of human effective size slightly higher than the conventional value of 10,000. The difference might just reflect sampling error or it might indicate a slight reduction in size during the Pleistocene. The Alu method gives greatest weight to effective size near the top of the coalescent tree.

Another new estimate of human effective size during the Pleistocene is derived from diversity among human leukocyte antigen (HLA) alleles. The HLA system is highly polymorphic because of balancing selection that maintains a few allelic lines over very long time periods. The long persistence of alleles at HLA loci implies that the effective size of human ancestors over the latter part of the Cenozoic, i.e., over tens of millions of years, must have been on the order of 100,000 rather than 10,000 (12). Diversity of synonymous (hence neutral) substitutions in selected HLA exons is compatible with the long persistence of allelic lines, whereas that of unselected neutral regions is much lower. This difference led Takahata and Satta (13) to conclude that human ancestral effective size decreased about 1 or 2 million years ago from 100,000 to 10,000. An upper limit on the time of this population reduction, which the authors suggest may be associated with the dispersal of Homo erectus, is given by the extent of synonymous diversity within HLA lineages.

Changing Effective Size

There are almost 6 billion members of our species on earth today, but the genetic indications of our low effective size suggest that we have not been so numerous for long. Effective size estimates from genetic data refer to a kind of average of population size from the present back to the coalescent of the sample. To understand how genetic data are used to read population size history, it is necessary to look at trees of descent of genes and how population size change affects their characteristics. Because common practice in the literature about human evolution is to reconstruct a gene tree from differences among sequences and then infer history from the reconstructed tree, we include some comments and caveats about this practice.

Properties of Gene Trees. In this section we show simulated gene trees from populations that have been stationary, that have undergone expansion, and that have undergone contraction. Each of these histories can generate characteristic signatures in the structure of gene trees. In these simulations there are two populations that have always exchanged members at the rate of 0.5 gene per generation; the two populations are approximately as different from each other as two human populations from different continents. The sample is 10 sequences or genes from each population, the red and the green populations.

In practice, trees such as those we portray are reconstructed from gene differences. Each sample, that is, each tip of the tree, is a DNA sequence, and differences among sequences reflect mutations along the branches. The number of mutations along any branch is a Poisson random variable with expectation proportional to the length of the branch. The usual model, the infinite sites model, postulates that every mutation occurs at a different site. In practice, in important cases like that of human mitochondrial DNA (mtDNA), there have been multiple mutations at certain sites. These violations of the infinite sites assumption have led to difficulties reconstructing the human mtDNA tree. There is a rich literature on methods for tree reconstruction, and in the case of human mtDNA, there is disagreement about whether current reconstructions are "good enough." Here we assume that a correct tree has been reconstructed from genetic data and consider inferences that can be drawn from the reconstructed tree. The trees in each of the figures are four equally likely results of the evolutionary process in the population.

Constant Population Size. The trees shown in Fig. 2 are generated by assuming that the two populations have never changed size. The four trees in the figure are derived from exactly the same demographic scenario, yet they vary widely from one to another. Typically we would have only a single tree to analyze, for example, a tree of mtDNA sequences, so it is important to look at variability among trees in these simulated populations to assess how reliable inference from a single reconstructed intraspecific tree can be.

Fig. 2. Simulated gene trees from a pair of populations of constant size that exchange an average of one-half a gene every generation. These are four simulated loci from the same populations with the vertical axes drawn to the same time scale.

Fig. 2 suggests several cautions about the value of tree reconstructions. First, the time of the root of the tree, the coalescent, varies a lot. Even if we could infer with precision the age of the root from data, it is apparent that any single locus is not very informative about the population. The attention that has been given to the age of the human mitochondrial coalescent seems misplaced because it is only a single locus.
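The spread in root ages can be reproduced with a few lines of simulation. The sketch below is a simplified illustration, not the two-population simulation behind Fig. 2: it assumes a single constant-size population and a continuous-time approximation in which the epoch with j lineages has an exponentially distributed duration with mean 2N/j(j − 1) generations. The function names and the values n = 20, N = 10,000, and 5,000 replicates are illustrative choices.

```python
import random

def simulate_root_age(n, N, rng):
    """Draw one time to the most recent common ancestor, in generations."""
    age = 0.0
    for j in range(n, 1, -1):
        # Total coalescence hazard while j lineages remain.
        rate = j * (j - 1) / (2.0 * N)
        age += rng.expovariate(rate)
    return age

def simulate_root_ages(n, N, replicates, seed=0):
    """Simulate root ages for many independent loci."""
    rng = random.Random(seed)
    return [simulate_root_age(n, N, rng) for _ in range(replicates)]

ages = simulate_root_ages(n=20, N=10000, replicates=5000)
print(min(ages), sum(ages) / len(ages), max(ages))
```

The mean lands near 2N(1 − 1/n) = 19,000 generations, but individual loci differ from one another by large factors, which is the point made about relying on a single locus.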

Second, these simulated trees have clear structure. The bottom right tree, for example, has two deep clades. In the top right tree the primary branch on the right contains only samples from the red population, whereas the other contains members of both populations. We could conclude from this that the "origin" of the populations is the red continent from which emigrants populated the green continent. This is just the argument from the human mtDNA tree for an African origin of modern humans: several (contentious) reconstructed trees of descent of human mtDNA contained Africans on one side of the root and people from all the continents on the other (14, 15).

Third, the number of "major clades" varies from tree to tree. The top left and bottom right trees each have two major clades, the lower left tree has four, and the tree on the top right has three. Depending on which tree we had sampled, we might conclude that there were two "founding lineages," or three, or four. With more study we could reconstruct ancient migration events between the two populations.

Each of these four trees tells a detailed story, and all the stories are utterly spurious and wrong. None of these trees suggests the true population structure, two partially isolated populations that have no other dynamics nor history. The failures are not the result of our small sample size of 20 genes. The overall structure of gene trees is dominated by events at the top of the tree, in the far past. Increasing the sample of genes generally fills in detail at the bottom of the tree. We do not mean to suggest that there is no value in reconstructing intraspecific trees. In some cases, for example, genes of medical interest, the history of the gene rather than the history of the population is of interest. However, this exercise does suggest that population interpretations of single locus trees should be regarded with caution.

Population Expansion. As we follow a sample going backwards in time to the coalescent, the rate at which lineages coalesce is proportional to the inverse of the effective size. If a population is large now, but expanded rapidly from a small population in the past, then coalescent events will be relatively few since the expansion. They will be concentrated just before the expansion when the population was small. The result is a characteristic star-like gene genealogy as shown in Fig. 3. Prolonged exponential population growth generates similar trees (16). Trees like these are also produced by "selective sweeps," replacements by an advantageous new allele that is fixed by selection. In the case of population expansion and of a selective sweep, today's genes have a smaller-than-expected number of ancestors at some time in the past.

Fig. 3. Simulated gene trees from a pair of populations that expanded by a factor of 1,000. The populations exchange an average of one-half a gene every generation.

Population Contraction. Reduction in population size can lead to rapid loss of genetic diversity. Because the expected depth of a coalescent tree is 2N generations, a contraction in population size lasting this long would erase preexisting diversity in many loci, whereas others would retain two or a few variants that would differ according to the coalescence time, hence the population size, before the contraction. Fig. 4 shows trees from a population that has undergone an instantaneous hundredfold size reduction. The loci in the two right panels retain several variants from before the contraction, whereas those in the left panels lost all precontraction diversity. The visible result of a contraction should be many loci with very little diversity, like those on the left, along with others with a few very divergent gene lineages, like those on the right.

Fig. 4. Simulated gene trees from a pair of populations that have contracted by a factor of 100. The populations exchange an average of one-half a gene every generation. Because the vertical axes are drawn to the same scale, the trees in the left two panels are so shallow that they are invisible.

Just as population expansion mimics selection for a favorable new mutant that evolves rapidly to fixation, population contraction mimics balancing selection maintaining several alleles or classes of alleles over a long time. In both these cases the result is a few alleles or allele classes that coalesce in the far distant past.

Graphical Methods, Trees, and Human History

Expansion in the Pleistocene. An important paper by Felsenstein (17) showed that estimates of population size could be dramatically improved if the branching order of the underlying tree were known. Because unambiguous reconstruction of this branching order is often impossible, computer-intensive methods have been developed that examine large numbers of trees, evaluating the likelihood of each tree and the likelihood of a demographic hypothesis given the tree (18, 19). As these methods become faster and more widely available, methods like those we describe here will be relegated to screening roles. However, the simple approaches we describe are fast and easy to understand. Computer-intensive methods have the important disadvantage that one never really knows what they do; it is difficult to prove that they are working correctly.

The top panel of Fig. 5 shows a simulated tree of a population that has undergone an expansion. The middle and bottom panels show two summaries of the simulated tree that are simple to compute. The middle panel shows the frequency spectrum of mutations. The spectrum is the distribution of segregating sites according to frequency in the sample. In practice we usually do not know the ancestral state at a site, so we do not know which of two variants is the mutant. The spectrum then must be "folded" at one-half the sample size. In the present case a mutant that occurred twice in the sample of 20 would be indistinguishable from a mutant that occurred in 18 of the 20, so these two categories are combined and the range of the spectrum is from 1 to 10.
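Folding is a simple bookkeeping step: counts for k copies and for n − k copies are pooled under the smaller of the two. The sketch below is a minimal illustration; the function name and the toy counts are hypothetical.

```python
def fold_spectrum(unfolded, n):
    """Fold a site frequency spectrum at one-half the sample size.

    unfolded[k - 1] is the number of sites whose mutant occurs in k of
    the n sampled genes, for k = 1..n-1. The result has n // 2 entries;
    entry [m - 1] counts sites whose rarer variant occurs m times.
    """
    if len(unfolded) != n - 1:
        raise ValueError("expected one count for each k in 1..n-1")
    folded = [0] * (n // 2)
    for k, count in enumerate(unfolded, start=1):
        minor = min(k, n - k)       # copies of the rarer variant
        folded[minor - 1] += count
    return folded

# Toy example for a sample of 20: three sites with 2 copies of the mutant,
# four sites with 18 copies, and five sites with 10 copies.
unfolded = [0] * 19
unfolded[2 - 1] = 3
unfolded[18 - 1] = 4
unfolded[10 - 1] = 5
print(fold_spectrum(unfolded, 20))
```

In the toy example the sites with 2 and with 18 copies end up in the same class, so the second entry of the folded spectrum is 7.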

Fig. 5. Gene tree (Top), frequency spectrum (Middle) and mismatch distribution (Bottom) from a population that has undergone a population expansion. The circles on the tree represent mutations. The simulation parameters match approximately those estimated from human mtDNA.

Mutations that happened far in the past, near the top of the gene tree, can occur at high frequencies in the sample. Recent mutations, on the other hand, always exist in one or a few copies. If an expansion has occurred, so that the gene tree is star-like as it is here, many mutations occur in the long recent branches and there are more singleton sites and sites with low frequency variants than are expected in a stationary population. Fig. 6 shows a simulated tree from a stationary population, and the spectrum in the middle panel of this figure shows that there are more variants at intermediate frequencies.

Fig. 6. Gene tree (Top), frequency spectrum (Middle) and mismatch distribution (Bottom) from a population that has always been constant in size.

The bottom panels of Figs. 5 and 6 show mismatch distributions calculated from the simulated trees. These are histograms of the numbers of sequence differences among all possible pairs of sequences in the sample. Under the infinite sites assumption, every mutation on the path from one sample to another contributes a single difference to the comparison. These are not ordinary distributions because the data points are not independent, but they do provide quick visual summaries of important properties of the trees. In the case of population expansion in Fig. 5 the biggest contribution to pairwise differences is from the long terminal branches that are independent of each other, so the result is a smooth and often unimodal mismatch distribution in the bottom panel of the figure. Mismatch distributions from stationary populations are reliably ragged and often multimodal, as in the bottom panel of Fig. 6.
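A mismatch distribution can be computed directly from aligned sequences. The sketch below is a minimal illustration; the function name and the four toy sequences are hypothetical.

```python
from collections import Counter
from itertools import combinations

def mismatch_distribution(sequences):
    """Histogram of pairwise differences among aligned sequences.

    Returns a list whose entry [d] is the number of sequence pairs
    that differ at exactly d sites.
    """
    counts = Counter(
        sum(a != b for a, b in zip(s, t))
        for s, t in combinations(sequences, 2)
    )
    longest = max(counts) if counts else 0
    return [counts.get(d, 0) for d in range(longest + 1)]

toy_sample = ["AAAA", "AAAT", "AATT", "TTTT"]
print(mismatch_distribution(toy_sample))   # [0, 2, 2, 1, 1]
```

A sample of n sequences yields n(n − 1)/2 comparisons, and each sequence takes part in n − 1 of them, which is why the entries are not independent data points.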

Fig. 7 shows these distributions from a worldwide sample of 636 sequences at 411 positions of the first hypervariable segment of mtDNA (ref. 20; L.B.J., unpublished data). In the top panel, the spectrum of frequencies is collapsed into four ranges. Expected values shown are those expected if the population had always been the same size. In the human data there is a large excess of low frequency variants in accordance with the hypothesis that the human population has undergone a major expansion. The bottom panel of the figure shows the mismatch distribution: it is smooth with the distinct mode characteristic of a population expansion or a selective sweep within the last several hundred thousand years. Almost all mismatch distributions from human mtDNA have the general appearance of Fig. 7. We and others interpreted this pattern as a signature of a population expansion of our ancestors beginning in the last interglacial about 100,000 years ago (22-24).

Fig. 7. Frequency spectrum and mismatch distribution from a world sample of 636 mtDNA sequences at 411 positions of the first hypervariable segment (HVS-I). Compare this with Fig. 5. The diamonds show the number of segregating sites expected in each frequency interval in a constant size population.

However, there is the possibility that the pattern results from a selective sweep in which an advantageous new mtDNA sequence replaced all other sequences in the population. To test this we require information from other loci, and such information is only recently becoming available.

The first new evidence is from a report of sequence differences in approximately 20,000 sites along the nonrecombining part of the Y chromosome from 718 men (25). The transmission of this part of the Y is formally like that of mtDNA except that it is through males rather than through females. There are, however, two important ways in which these data must be treated differently from mtDNA sequence data. First, the segregating sites were ascertained in small numbers of Y chromosomes, 21 in one set and 53 in the other. Unfortunately, correct mismatch distributions cannot be computed from the data because the typings in the screening samples were not reported. Many of the 718 chromosomes are expected to differ at undetected sites.

Second, the gene tree was reconstructed without ambiguity and the ancestral states determined by comparisons with the same sites in African apes, so we can examine the whole frequency spectrum of the segregating sites without folding. Fig. 8 shows the observed distribution of sites among four frequency classes. The diamonds in the figure show the expected distribution under the hypothesis of constant population size. There is a large excess of low frequency sites, consistent with the hypothesis of population expansion and inconsistent with the hypothesis that the evidence for expansion in human mtDNA reflects a selective sweep.


Fig. 8. Frequency spectrum and mismatch distribution from a world sample of 718 Y chromosome sequences. There are 20 segregating sites ascertained at approximately 20,000 positions. Ascertainment was done in two samples, one with 21 chromosomes and one with 53. If a site has population frequency $x$, then the probability that it will be detected in a sample of size $n$, the ascertainment function, is $1 - (x^n + (1 - x)^n)$. Diamonds show the expected number of sites in each frequency class under the hypothesis of constant population size, computed by multiplying the ascertainment function by the distribution in Eq. 1. The excess of low frequency sites is consistent with a Pleistocene population expansion. Because the whole nonrecombining portion of the Y chromosome is a single locus, these sites are not independent, and there is no simple statistical test of the constant size hypothesis.
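
As a rough worked illustration of the ascertainment function in the caption of Fig. 8, consider a site whose population frequency is x = 0.05. This value is chosen here only for illustration and is not taken from the data; the two screening-sample sizes, 21 and 53, are those given in the caption. The symbol A(x, n) is our shorthand for the ascertainment function.

```latex
A(x, n) = 1 - \left(x^{n} + (1 - x)^{n}\right)

A(0.05, 21) = 1 - \left(0.05^{21} + 0.95^{21}\right) \approx 1 - 0.34 = 0.66

A(0.05, 53) = 1 - \left(0.05^{53} + 0.95^{53}\right) \approx 1 - 0.07 = 0.93
```

Under these assumptions, roughly a third of sites at 5% frequency would be missed by the smaller screening set, against about 7% for the larger one. This is why the constant-size expectation has to be weighted by the ascertainment function before it is compared with the observed spectrum.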

The second new evidence supporting ancient population expansion is from tandem repeat loci. These are loci in which some short motif is repeated, and the number of repeats varies from chromosome to chromosome. Mutation occurs at these loci, according to the usual model, by small gains and losses in the number of repeats. Kimmel et al. (26) show that the variance of repeat size and homozygosity change at different rates after population expansion. Comparison of these quantities at 60 tetranucleotide loci from three human continental groups showed clear evidence of population expansion among Europeans and Asians, but none among Africans. Patterns in mtDNA and in human craniometric traits (27) suggest that the ancestors of Africans expanded before the ancestors of other continental populations. Because repeat loci evolve rapidly, their findings may suggest that these loci in Africans have had time to reach their new equilibrium, erasing the trace of the expansion.
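
The comparison of repeat-size variance with homozygosity can be sketched as follows. This is a minimal illustration, not the procedure of Kimmel et al. (26): it assumes the textbook single-step stepwise mutation model at equilibrium, under which the expected variance of repeat number is θ/2 and the expected homozygosity is 1/√(1 + 2θ), with θ = 4Nμ. The function names and the sample of repeat counts are hypothetical. At equilibrium the two estimates of θ should roughly agree; a marked discordance between them is the kind of signal described above.

```python
from collections import Counter


def theta_from_variance(repeat_counts):
    """Estimate theta = 4*N*mu from the sample variance of repeat number.

    Assumes the single-step stepwise mutation model at equilibrium,
    where E[variance] = theta / 2.
    """
    n = len(repeat_counts)
    mean = sum(repeat_counts) / n
    variance = sum((r - mean) ** 2 for r in repeat_counts) / (n - 1)
    return 2.0 * variance


def theta_from_homozygosity(repeat_counts):
    """Estimate theta from homozygosity under the same model.

    Uses E[homozygosity] = 1 / sqrt(1 + 2 * theta), inverted for theta.
    Homozygosity is estimated as the chance that two copies drawn
    without replacement have the same repeat number.
    """
    n = len(repeat_counts)
    counts = Counter(repeat_counts)
    p0 = sum(c * (c - 1) for c in counts.values()) / (n * (n - 1))
    if p0 == 0:
        raise ValueError("no two copies share a repeat number")
    return (1.0 / p0 ** 2 - 1.0) / 2.0


# Hypothetical repeat counts at one locus, for illustration only.
alleles = [10, 10, 10, 11, 11, 12, 9, 10]
theta_v = theta_from_variance(alleles)
theta_h = theta_from_homozygosity(alleles)
print(theta_v, theta_h)
```

With real data the two estimates would be averaged over many loci before being compared, since a single locus is far too noisy to be informative.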

In summary, the estimates of overall human effective size of 10,000 from nuclear sequences, Alu insertions, and HLA exons; mtDNA mismatch distributions; frequency spectra from mtDNA and from the Y chromosome; and discordance between allele size variance and homozygosity at tandem repeat loci all support the hypothesis of a bottleneck in our past during which the number of our ancestors was only a few thousand breeding adults. The original formulation of the multiregional hypothesis, that there was a worldwide transformation of archaics into modern humans caused by spread of new alleles, is contradicted by all these findings.

The best available estimates of mtDNA mutation rates imply that the expansion occurred between 100,000 and 50,000 years ago, in excellent agreement with archaeological evidence of the earliest modern humans about 100,000 years ago and the "creative explosion" of upper Paleolithic type technology about 50,000 years ago (28). This could be illusory because the time depends on estimates of mtDNA rates, and these are fragile at best. A recent review (29) suggests that the expansion apparent from genetics is associated with the first complex flake tool industries several hundred thousand years ago, i.e., much earlier than the Upper Paleolithic industries associated with modern humans in Europe. The older date cannot be falsified from the genetic evidence.

Population Size Before the Bottleneck. Both mtDNA and Y chromosomes coalesce several hundred thousand years ago, so they provide no information about population size before then. The coalescent of nuclear genes should be four times as old as that of mtDNA and the Y in the absence of population size change. Unfortunately, nuclear genes undergo recombination, and recombination rapidly destroys evidence of population size in DNA sequences. The mutation rate at nuclear loci is also much lower than that of mtDNA, so long sequences are necessary to achieve resolution comparable to that of mtDNA sequences, but the longer the sequence the more likely recombination has occurred.
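
The factor of four can be sketched with the standard neutral coalescent. The assumptions here are ours, made for illustration: a constant population of N breeding adults, an equal sex ratio, and random mating. Autosomal genes are then carried as M = 2N copies, whereas mtDNA is transmitted by M = N/2 females and the Y chromosome by M = N/2 males. For a sample of n copies drawn from M, the expected time to the most recent common ancestor, in generations, is:

```latex
E[T_{\mathrm{MRCA}}] = 2M\left(1 - \frac{1}{n}\right)

\text{nuclear: } M = 2N \;\Rightarrow\; E[T_{\mathrm{MRCA}}] \approx 4N
\qquad
\text{mtDNA or Y: } M = \frac{N}{2} \;\Rightarrow\; E[T_{\mathrm{MRCA}}] \approx N

\frac{E[T_{\mathrm{MRCA}}^{\text{nuclear}}]}{E[T_{\mathrm{MRCA}}^{\text{mtDNA}}]} = \frac{4N}{N} = 4
```

With N = 10,000 and a hypothetical generation time of 25 years, this gives about 10,000 generations (250,000 years) for mtDNA and about 40,000 generations (1 million years) for a nuclear locus, so a nonrecombining nuclear sequence would in principle reach back through much more of the Pleistocene.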

If suitable nonrecombinant nuclear sequences can be found, then it will be possible to test the hourglass model by looking for very deep differences between alleles (see Fig. 4). Such patterns are conspicuously absent in humans with the exception of the HLA system, which owes its deep allelic lineages instead to balancing selection.

Harding et al. (30) describe a careful analysis of nuclear sequences from a region where recombination has not erased the coalescent history. They studied part of the $\beta$-globin gene using a computer-intensive method that examined large numbers of possible mutation histories. Approximately 10% of their sequences were discarded from the analysis because they had undergone recombination. There was no evidence of deep roots as predicted by the hourglass model; instead, they suggest that there was a constant population size of 10,000 all the way back to the root of this nuclear tree.

Under the hourglass model, nuclear loci with deep branches at the top of the gene tree should often lead to disjoint bimodal or multimodal distributions of allele size at tandem repeat loci. Disjoint distributions should occasionally appear even with constant population size. For example, the gene trees in the right two panels of Fig. 2 could generate bimodal allele size distributions because of size-change mutations accumulating along the deep top branches. The gene trees in the left two panels probably would not. Many tandem repeat loci do show such allele size distributions, but their frequency does not appear to be any greater than that expected under constant population size.

The most useful class of genetic markers for ancient population studies is currently young Alu insertions. Because these are inserted into the genome but never precisely deleted, the ancestral type is always known. They are inserted at random into the nuclear genome so that the probability of two insertions in the same place is vanishingly small. Fig. 9 shows the spectrum of frequencies of 23 human-specific Alu insertions (refs. 11 and 21; M.A.B., unpublished data) along with the predicted spectra under the long-neck and hourglass models of Pleistocene human population size. There is no suggestion of population contraction in our history from these data. Because these insertions have independent histories after they are inserted, standard statistical methods for contingency tables can be applied to them. The long-neck model cannot be rejected, whereas the extreme hourglass model can be rejected.


Fig. 9. Frequency spectrum of 23 Alu insertions in humans. The diamonds show the expected numbers of loci under constant population size as specified by the long-neck model. Circles show expected numbers of loci under the hourglass model of a population contraction. The hourglass hypothesis can be rejected by a statistical test, whereas the long-neck model cannot. Each of these was ascertained in a diploid. The probability of detecting at least one copy of an insertion in a diploid whose population frequency is $x$ is $x^2 + 2x(1 - x)$, the ascertainment function for this system. Expected values for the long-neck model were computed by multiplying the distribution in Eq. 2 by this function. Expected values for the hourglass model were computed by multiplying the ascertainment function by the uniform distribution because the distribution of the number of copies of a mutation in the top interval of the tree is uniform.
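
The calculation described in the caption of Fig. 9 can be sketched numerically. This is a rough illustration under stated assumptions, not a reproduction of the figure: Eq. 2 is not reproduced in this section, so a density proportional to 1/x (the standard constant-size result for a derived allele) is used as a stand-in for it, and the four frequency classes are hypothetical. Only the ascertainment function and the uniform hourglass density come from the caption.

```python
def alu_ascertainment(x):
    """Probability of at least one copy in a single diploid: x^2 + 2x(1 - x)."""
    return x * x + 2.0 * x * (1.0 - x)


def class_proportions(density, edges, steps=2000):
    """Expected share of ascertained loci falling in each frequency class.

    Integrates density(x) * alu_ascertainment(x) over each class with the
    midpoint rule (which never evaluates x = 0), then normalizes to sum to 1.
    """
    masses = []
    for lo, hi in zip(edges[:-1], edges[1:]):
        width = (hi - lo) / steps
        total = 0.0
        for k in range(steps):
            x = lo + (k + 0.5) * width
            total += density(x) * alu_ascertainment(x) * width
        masses.append(total)
    norm = sum(masses)
    return [m / norm for m in masses]


# Hypothetical class boundaries; the bins used in Fig. 9 are not given here.
EDGES = [0.0, 0.25, 0.5, 0.75, 1.0]

# Long-neck model: 1/x density assumed as a stand-in for Eq. 2.
long_neck = class_proportions(lambda x: 1.0 / x, EDGES)

# Hourglass model: uniform density, as stated in the caption.
hourglass = class_proportions(lambda x: 1.0, EDGES)

print("long-neck:", [round(p, 3) for p in long_neck])
print("hourglass:", [round(p, 3) for p in hourglass])
```

Under these assumptions the long-neck expectation is weighted toward low-frequency insertions and the hourglass expectation toward high-frequency ones. Multiplying either set of proportions by the number of loci gives expected counts that could be compared with the observed classes in a contingency-table test.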

Conclusions

We have avoided hedging and qualifying our findings in the interest of making this paper simple and accessible. Nevertheless, the broad picture that we paint continues to gain empirical support. Most of the familiar specimens of Homo erectus and of archaic humans known from the Pleistocene were not members of populations ancestral to us; instead, "the fate of most such populations appears to be tragic" (13). We are descended from a population that was effectively a separate species for at least the last 1 or 2 million years. Although the size of this population must have fluctuated over time, it was often reduced to the level of several thousands of adults. Such a population would have occupied an area the size of Swaziland or Rhode Island rather than a whole continent, although episodic expansions would have covered a much larger area. Archaeologists should find and identify this population.

	ACKNOWLEDGEMENTS

We are grateful for help and suggestions from Elise Eller, Marta Lahr, Renee Pennington, James O'Connell, John Relethford, Naoyuki Takahata, Stephen Wooding, and the Human Diversity Project, King's College Research Centre, Cambridge.

	FOOTNOTES

To whom reprint requests should be addressed. e-mail: harpend@ibm.net.

	ABBREVIATIONS

mtDNA, mitochondrial DNA; HLA, human leukocyte antigen.

	REFERENCES


1.	Swisher, C. C., Rink, W. J., Anton, S. C., Schwarcz, H. P., Curtis, G. H., Suprijo, A. & Widiasmora (1996) Science 274, 1870-1874.
2.	Nei, M. & Graur, D. (1984) in Evolutionary Biology, eds. Hecht, M. K., Wallace, B. & Prance, G. T. (Plenum, New York), Vol. 17, pp. 73-118.
3.	Takahata, N. & Satta, Y. (1997) Proc. Natl. Acad. Sci. USA 94, 4811-4815.
4.	Klein, R. G. (1995) J. World Prehist. 9, 167-198.
5.	Lahr, M. M. (1996) The Evolution of Modern Human Diversity (Cambridge Univ. Press, Cambridge, U.K.).
6.	Felsenstein, J. (1971) Genetics 68, 581-597.
7.	Takahata, N. (1994) Mol. Biol. Evol. 11, 803-805.
8.	Watterson, G. A. (1975) Theor. Pop. Biol. 7, 256-276.
9.	Tajima, F. (1983) Genetics 105, 437-460.
10.	Deininger, P. L. & Batzer, M. A. (1993) in Evolutionary Biology, eds. Hecht, M. K., MacIntyre, R. J. & Clegg, M. T. (Plenum, New York), Vol. 27, pp. 157-196.
11.	Sherry, S. T., Harpending, H. C., Batzer, M. A. & Stoneking, M. (1997) Genetics 147, 1977-1982.
12.	Takahata, N. (1993) Mol. Biol. Evol. 10, 2-22.
13.	Takahata, N. & Satta, Y. (1998) Immunogenetics, in press.
14.	Vigilant, L., Stoneking, M., Harpending, H., Hawkes, K. & Wilson, A. C. (1991) Science 253, 1503-1507.
15.	Cann, R. L., Stoneking, M. & Wilson, A. C. (1987) Nature (London) 325, 31-36.
16.	Hudson, R. R. & Slatkin, M. (1991) Genetics 129, 555-562.
17.	Felsenstein, J. (1992) Genet. Res. 59, 139-147.
18.	Kuhner, M. K., Yamato, J. & Felsenstein, J. (1997) in Progress in Population Genetics and Human Evolution, eds. Donnelly, P. J. & Tavaré, S. (Springer, New York), pp. 183-192.
19.	Griffiths, R. C. & Tavaré, S. (1997) in Progress in Population Genetics and Human Evolution, eds. Donnelly, P. J. & Tavaré, S. (Springer, New York), pp. 165-182.
20.	Jorde, L. B., Bamshad, M. J., Watkins, W. S., Zenger, R., Fraley, A. E., Krakowiak, P. A., Carpenter, K. D., Soodyall, H., Jenkins, T. & Rogers, A. R. (1995) Am. J. Hum. Genet. 57, 523-538.
21.	Stoneking, M., Fontius, J. J., Clifford, S. L., Soodyall, H., Arcot, S. S., Saha, N., Jenkins, T., Tahir, M. A., Deininger, P. L. & Batzer, M. A. (1997) Genome Res. 7, 1061-1071.
22.	Sherry, S., Rogers, A. R., Harpending, H. C., Soodyall, H., Jenkins, T. & Stoneking, M. (1994) Hum. Biol. 66, 761-775.
23.	Harpending, H. C., Sherry, S. T., Rogers, A. R. & Stoneking, M. (1993) Curr. Anthropol. 34, 483-496.
24.	Rogers, A. R. & Harpending, H. C. (1992) Mol. Biol. Evol. 9, 552-569.
25.	Underhill, P. A., Jin, L., Lin, A. A., Mehdi, S. Q., Jenkins, T., Vollrath, D., Davis, R. W., Cavalli-Sforza, L. L. & Oefner, P. J. (1997) Genome Res. 7, 996-1005.
26.	Kimmel, M., Chakraborty, R., King, J. P., Bamshad, M., Watkins, W. S. & Jorde, L. B. (1998) Genetics, in press.
27.	Relethford, J. H. & Harpending, H. (1994) Am. J. Phys. Anthropol. 95, 249-270.
28.	Klein, R. G. (1989) The Human Career (University of Chicago Press, Chicago).
29.	Foley, R. A. & Lahr, M. M. (1997) Camb. Arch. J. 7, 3-32.
30.	Harding, R. M., Fullerton, S. M., Griffiths, R. C., Bond, J., Cox, M. J., Schneider, J. A., Moulin, D. S. & Clegg, J. B. (1997) Am. J. Hum. Genet. 60, 722-789.

