Thursday, February 21, 2008

How many harmful mutations do you carry?

ResearchBlogging.orgAccording to a new article in Nature (full text for subscribers only), the answer depends on your ancestry: Europeans carry more potentially fitness-reducing mutations than individuals from Africa.

The authors support this claim with sequencing data from more than 10,000 genes, obtained from 20 individuals of European ancestry and 15 African-Americans. As far as I can tell the same data were used in a 2005 Nature paper by the same group to address a completely different question, which I note simply because getting two Nature papers from the same data is a rare and wondrous thing!

Here, the authors examine the patterns of genetic variation within the protein-coding regions of their 10,000 genes. This variation can be broken down into two major classes: variations that don't change the sequence of the encoded protein, and are thus unlikely to affect fitness; and variations which do alter protein sequence. The protein-changing variants were further classified into three groups, based on how likely they are to disrupt the function of the encoded protein: benign, possibly damaging, or probably damaging.

The authors find some potentially interesting differences between the European and African samples: although the total amount of variation is substantially lower in Europeans, the proportion of it that rates as potentially harmful is higher. This is consistent with the notion that modern Europeans are derived from a relatively small population that migrated out of Africa between 50 and 100 thousand years ago. This "bottleneck" resulted in a loss of genetic variation, while the subsequent expansion as Europeans colonised new and fertile lands resulted in variants that have slightly negative effects on fitness "surfing" to disproportionately high frequencies.

The differences between the populations is actually pretty small, and this interpretation of the data is controversial (as noted in a Nature news article). But I'll leave that debate for another blogger - what interests me right now are the implications of these results for future analysis of large-scale sequencing data.

The article notes that the average individual in their data was heterozygous (that is, carried one "good" copy and one "bad" copy) for more than 400 possibly or probably damaging variants, and homozygous (that is, carried two "bad" copies) for more than 90 of these variants. Bearing in mind that these data are based on around half of the genes contained in our genome, this means that when you have your entire genome sequenced (as we are all likely to, at some point in the next twenty or so years) you will find that you carry one copy of more than 800 potentially damaging variants, and two copies of almost 200.

Among these potentially damaging variants there will be massive differences in the risk of disease for you and for your children: many will have little or no effect on health even if you carry two bad copies, whereas some may be severe enough to cause serious disease even if you carry only one bad copy (like a Huntington's disease mutation).

We know from studies of inbreeding that all of the variants will add up to around two to five "lethal equivalents" - which could mean five variants that would each be 100% lethal if you happened to carry two copies, or 500 variants that each carry a 1% chance of death (with a more likely scenario being somewhere in between!). With our current understanding of molecular and cellular biology we can confidently predict the final effects on health of only a tiny fraction of these variations - and that's not even considering the substantial proportion of potentially nasty variation occurring outside protein-coding regions, for which it is currently essentially impossible to predict function. Within five to ten years, we will have the capacity to cheaply sequence your entire genome and find all of these variations, but we will be unable to predict the likely health impacts of the vast majority of them.

It's hard enough providing genetic counselling for a family carrying one serious disease mutation. How will counsellors deal with a patient carrying around 1,000 potentially deleterious mutations, most of which we don't really understand?

Added in edit: Those interested in further exploration of the implications of these data for recent human evolution should check out John Hawks. Thomas Mailund also has an excellent post discussing this paper in detail.

Lohmueller, K.E., Indap, A.R., Schmidt, S., Boyko, A.R., Hernandez, R.D., Hubisz, M.J., Sninsky, J.J., White, T.J., Sunyaev, S.R., Nielsen, R., Clark, A.G., Bustamante, C.D. (2008). Proportionally more deleterious genetic variation in European than in African populations. Nature, 451(7181), 994-997. DOI: 10.1038/nature06611

4 comments:

Mike said...

Hi Daniel,

I've been following your blog via the DNA Network, and I think you write great stuff. What would you think about being a writer on Scientific Blogging? It doesn't mean that you would move this blog, but you'd have a chance to write for a broad non-expert audience - we got 600,000 visitors in January. We're looking for more people of your caliber.

I'm a postdoc at Washington University's Center for Genome Sciences, and I write the blog Adaptive Complexity. If you're interested in more info about Scientific Blogging, send me an email at mwhite@genetics.wustl.edu. Sorry to solicit this on your blog, but I couldn't find your email on your research group's website.

Mike

germ-o-phobe said...

If heritable genes caused common illnesses in humans this might be a significant problem. But they don't.

Any common gene that leads to increased susceptability to problem X also offers increased immunity to problem Z. Environmental damage/input is always required. Mathematically it can't work any other way. If it wasn't a game of pluses and minuses these susceptability genes would get obliterated in short order.

Rare disorders are a different story. It's too bad that phenomenon like Founder Effect can keep bad mutations around longer than they would otherwise last in nature.

Daniel said...

If heritable genes caused common illnesses in humans this might be a significant problem. But they don't.

Yes, they do. Most common diseases have a substantial heritable component, and the individual genes responsible are slowly being identified. As just a single example, the WTCCC genome-wide scans identified 24 genetic variants that are associated with one or more of seven different common diseases (ranging from bipolar disease to type 2 diabetes).

Any common gene that leads to increased susceptability to problem X also offers increased immunity to problem Z. Environmental damage/input is always required. Mathematically it can't work any other way. If it wasn't a game of pluses and minuses these susceptability genes would get obliterated in short order.

Disease susceptibility genes can be maintained in the population in a number of ways. You've already noted one, which is when disease variants also provide protection from some environmental condition - for instance, metabolic disease genes may be involved in adaptation to novel environments. Other mechanisms include heterozygote advantage, when a variant provides an advantage when it is present as one copy but causes disease when present as two copies. Disease variants can also simply slip through the cracks and drift to high frequency even when they are moderately deleterious, especially during a population bottleneck followed by an expansion (as likely occurred for most groups of modern humans).

The take-home message: common disease variants certainly do exist, and have reached moderate frequency in the human population via multiple known mechanisms; and population genetics is not as simple as you seem to think it is.

germ-o-phobe said...

Disease susceptibility genes can be maintained in the population in a number of ways.

Disease susceptability genes and disease CAUSING genes are two completely different things.

Just because a gene might increase the odds that someone will catch the flu (or any common illness) doesn't mean that the flu is spread through genes. Most of the "disease" genes that end up in stories in the major media are perfectly healthy. The overwhelming majority of fitness reducing condition are the result of negative, environmental input like pathogens, pollution, poor nutrition, injury, etc.

I know you're smart, but since it's election season I'll use this line.

It's the environment stupid. 8-)