Friday, May 9, 2008

Free personal genomics!

Over at Eye on DNA, Hsien wonders about the effects of a slowing economy on the personal genomics market. Well, no matter how hard it's getting to make your mortgage repayments, you can probably still afford personal genomics if it doesn't cost you anything:

In New Jersey, meanwhile, the nonprofit Coriell Institute for Medical Research is developing a service that will test for a slate of validated genetic markers, and provide free — yes, free — information and analysis for common diseases. The institute plans to sign up 10,000 people in the next two years, and eventually enlist 100,000 people.

(From a recent piece in Wired). You can sign up here; there is a pretty extensive FAQ here. Note that you will need to physically attend an enrollment session at the Coriell Institute in New Jersey. Also, I see that Coriell is adopting the paternalistic "need to know" approach pioneered by Navigenics, and won't provide participants with any information about genetic variants that aren't "medically actionable" (e.g. incurable disease risk variants), although they will hand out information on non-disease traits like eye colour. Still, if I lived anywhere near New Jersey I'd be signing up right now rather than wasting time writing this post.

(As an aside, I wonder why Coriell is using a saliva-based method when it could be using its considerable expertise to create and store cell lines from blood - essentially generating an endless source of DNA for researchers to analyse. That seems like a missed opportunity that someone will be seriously regretting in a few years when there's no DNA left for whole-genome sequencing, or epigenome analysis, or whatever.)

If you're more ambitious, you could also sign up for (eventual) free whole-genome sequencing via the Personal Genome Project.

Subscribe to Genetic Future.

23andMe, deCODEme and Navigenics at Cold Spring Harbor

Just in case anyone has been wondering why I've been so quiet, I'm in beautiful Cold Spring Harbor this week for the Biology of Genomes meeting. Be warned that I'll have a lot more to say about this meeting once I've recovered from the combined effects of jet-lag and the punishing schedule (last night's evening session finished at 11:30pm!), most of which will include the words "next", "generation", "sequencing" and "wow".

For now, I want to download some thoughts on this afternoon's panel discussion on direct-to-consumer genetic testing between three giants of the personal genomics industry: 23andMe's Linda Avey, deCODEme's Kari Stephansson and Navigenics' Dietrich Stephan. The session was ably chaired by the former head of the public Human Genome Project, Francis Collins.

Each of the three company representatives was given around 10 minutes to push their vision for the personal genomics industry. While all three are selling essentially the same product (a read-out of between 500,000 and 1,000,000 variable positions from your genome), each did their best to differentiate their company from the competition.

Navigenics was first off the mark, and the tone was serious: Western society is headed for a healthcare crisis, with the combination of an ageing population with an increasing frequency of lifestyle diseases like obesity and diabetes leading us inexorably towards an increased load of common diseases. Within a generation, said Stephan, we need a solution, and the only solution is early prediction and intervention.

The Navigenics message has remained on target since its launch a month ago: this is a careful, serious, completely disease-focused company. Stephan talked up the quality control on the Navigenics testing service, which includes repeating any disease-related genotype calls that fail the first time around (unlike 23andMe and deCODEme). He also emphatically stated what Navigenics wasn't: the company will never offer testing of genetic predictors of non-disease traits like height or eye colour; will never offer genetic ancestry testing; and will not offer comparisons of family members (apparently out of concerns about unexpected discoveries of non-paternity). This is, of course, an explicit attempt to portray the competitors - who both offer both ancestry, trait prediction - as frivolous.

Next off the rank was Google-funded 23andMe. Avey immediately turned Stephan's accusation back on itself: rather than being frivolous, 23andMe simply "looks at genetics holistically", with information on non-disease traits and ancestry being part of the big picture that customers want ("genealogy is the second most popular hobby on the internet," she said. "You can guess what the most popular one is.")

Avey explained that her decision to found 23andMe was based on her frustrating experiences at Perlegen attempting to recruit large cohorts of patients to use for large-scale genetics, and during her talk there was a strong emphasis on 23andMe as a system for driving genetic research. Just as the internet-based communities of Web 2.0 are willing to share their information with one another and reap the benefits, 23andMe is apparently aiming to create "Research 2.0", in which patients volunteer their genetic and clinical information for researchers to work with, and then use the results of that research to inform their own lifestyle decisions. Avey cutely terms this model "23andWe".

Finally, the intimidating Viking-like figure of Kari Stephansson took to the platform. Stephansson played continuously on the research cred of deCODEme's parent company, deCODE - not without justification, given the impressive list of large-scale genetic studies performed by the company (which Stephansson rather heavy-handedly flashed up on the screen, one by one, for what seemed like forever).

In fact, these distinguished scientific credentials seemed to be pretty much the only thing Stephansson could find to distinguish deCODEme from its rivals. Otherwise, the deCODEme ethos seems to resemble the open information model of 23andMe rather than the more old-fashioned paternalistic approach of Navigenics. "Is it always laudable when people learn more about themselves?" he asked, and argued that the answer was unambiguously yes. "Our customers will benefit from knowing themselves better."

After the three company talks, there were presentations from Johns Hopkins' Kathy Hudson and the National Coalition for Health Professional Education in Genetics' Joseph McInerney. Hudson emphasised the strong appetite of consumers for genetic information, and the desperate need for empirical data regarding the responses of people to information about genetic risk variants (which I've wondered about myself). McInerney made a strong case for the complete under-preparedness of health care providers for the boom in personal genetics, and introduced a new database called GeneFacts (currently under development) which will store expert-curated information about genetic tests for both consumers and health providers.

The following discussion was lively, and nowhere near as hostile as I expected. A few highlights:

  • Stephansson was remarkably candid about the usefulness of current tests involving common variants (which all three companies rely on): in one exchange he noted that "we are marketing these tests without any claim that they will impact on people's lives"; in another, he admitted that common variants probably provide marginal utility beyond simply collecting general information on family history.

  • Kathy Hudson responded to the problem of patients being given data of very limited predictive value with a very sensible solution: "In the absence of demonstrable harm, the default should be to provide the information." In her talk, she referred to the argument that genetic tests should only be ordered through a health-care provider as "an old-fashioned model" - a clear rebuff to both Navigenics, which boasts about using an in-house doctor to authorise all of its tests, and the American College of Medical Genetics, which recently issued a statement which argues that "a knowledgeable health professional should be involved in the process of ordering and interpreting a genetic test".

  • Stephansson was sceptical of Avey's claims that 23andMe can perform useful research, given the limitations of self-reported data (I agree). Avey explained that this problem is one of the reasons why 23andMe is interested in working with companies like Google to integrate genetic data with medical records - a suggestion that resulted in some shocked muttering from the audience.

  • Eric Lander (founding director of the Broad Institute) suggested that the medical genetics community needs to play a stronger role in the field of personal genetics, possibly contributing to some sort of expert-curated wiki-style database of information about genetic associations. He indicated that this couldn't be privately funded due to conflicts of interest. McInerney suggested that the soon-to-arrive GeneFacts database could serve as a starting point; Hudson stated that there has been substantial discussion about a genetic test registry at the Genetics & Public Policy Center, and that clear support from the medical genetics community would help this move forward. It appears that DNA Perspectives may have some competition in the very near future. Either way, this is good news for genetic test consumers.

  • There was an interesting back-and-forth between Stephan and Stephansson over the issue of ancestry testing both in their talks and in the discussion: Stephan basically sees ancestry testing as a distraction from the serious business of disease, while Stephansson argued that the effect size of disease risk variants frequently varies between populations, so ancestry testing is highly relevant to calculations of disease risk. This is an empirical question that will soon be answerable (deCODE is apparently currently investing in association studies in Asian populations).

  • All three of the company representatives mentioned their interest in developing whole-genome sequencing capabilities, which is unsurprising - sequencing has always been the Holy Grail of personal genomics, with the current SNP chip technology really little more than a crude place-holder until sequencing prices drop.

I also chatted with both Stephan and Avey after the session - for what it's worth, they're both extremely personable, and there was no overt animosity between any of the three competitors (somewhat to the disappointment of the audience, I suspect).

Anyway, that's the bulk of the personal genomics component of this meeting. As I hinted at the beginning of the post, the other main message has been the rapid advance in next-generation sequencing technology and its application to human genomes, which I'll hopefully be able to post about over the next few days.


Subscribe to Genetic Future.

Thursday, May 1, 2008

Low technical error rates for personal genomics companies

Antonio Oliveira from Longa Vista has compared the results of his genome scans from both 23andMe and deCODEme. Of the 560,299 sites analysed by both companies, just 23 showed a different result between the two scans - a discrepancy rate of just 0.004%!

This fits with the low discrepancy rate reported by Ann Turner back in January. The take-home message: by all means worry about the interpretation of your personal genomics result, but it's likely that your actual genotype data are extremely accurate.

Via The Quantified Self.

Subscribe to Genetic Future.

Sunday, April 27, 2008

The human genome is old news. Next stop: the human proteome

A Nature News article describes the initial plans for an ambitious effort to begin mapping the complete human proteome: the set of all human proteins expressed in all of our cells at all points during our development and adult life.

This is a project of vastly greater magnitude and complexity than the sequencing of the human genome. Unlike the genome, which remains essentially static between cell types and over time, the proteome is tremendously dynamic, changing constantly in response to cell-cell signalling and environmental stimuli. Thus even though -with some small exceptions - every cell in your body carries the same genome, the proteome can be wildly different between different tissues and can change rapidly over time (the image on the left is the result of proteomic analysis of a single tissue, the human kidney; each spot represents one protein). In addition, the function of proteins can change depending on where they localise within the cell, and which other proteins are around for them to interact with.

The complete mapping of the human proteome would require analysing the expression, localisation and interactions of all proteins in human tissue samples from all tissues at all stages of development, and following exposure to all possible forms of environmental stimulus. That's completely impossible with current technology, so the architects of the human proteome project have drawn up a more realistic wish-list:

The plan is to tackle this with three different experimental approaches. One would use mass spectrometry to identify proteins and their quantities in tissue samples; another would generate antibodies to each protein and use these to show its location in tissues and cells; and the third would systematically identify, for each protein, which others it interacts with in protein complexes. The project would also involve a massive bioinformatics effort to ensure that the data could be pooled and accessed, and the production of shared reagents.

It's unclear exactly which tissue samples will be used for the first phase of the project, but it appears that this stage will rely heavily on pooling data from pre-existing studies. After that, the project may move onto a detailed analysis of the expression levels, cellular localisation and interaction partners of proteins encoded by genes on chromosome 21 (the smallest human chromosome); alternative suggestions include a comprehensive analysis of all of the proteins found in a specific cellular location such as the mitochondria or the cell membrane.

There are some daunting technical obstacles to overcome for this project to be successful. Given that the project will be carried out by multiple laboratories around the world, there needs to be a serious attempt at standardising the protocols used to extract and characterise proteins. The article notes that "results from the Human Plasma Proteome project and other proteomics efforts showed that different laboratories — and even the same lab — often identify very different sets of proteins from exactly the same sample".

The project will be complicated by the fact that many genes encode for multiple different proteins, differing from one another in various regions, through a process known as alternative splicing. The proposed solution to that problem is to ignore it altogether:

[...] the group plans to focus on only a single protein produced from each gene, rather than its many forms. “We got rid of all this complexity,” Bergeron says.

That may simplify the analysis, but it will also significantly reduce the power of the project. The single protein isoform selected by the project will not necessarily be the most important isoform produced by that gene (this is likely to differ substantially between different tissues). That means that the project will miss crucial information about the function of many of the proteins it analyses.

Actually, there are caveats of varying severity for nearly all of the currently available technologies for separating, identifying and characterising proteins. It's extremely difficult to develop methods that can accurately examine both low- and high-abundance proteins in a single run. Generating antibodies that reliably and specifically bind to each protein in the proteome will be a mammoth undertaking, and will be confounded by the alternative splicing issues mentioned above. High-throughput methods for detecting protein-protein interactions, while they have been used extensively (for instance in characterising the yeast protein interaction network), still suffer from a range of problems that can result in both false-positive and false-negative findings.

However, these are largely technology-driven constraints. Similar negative arguments were thrown at the human genome project, and look how that turned out! If anything, it seems likely that a proteome project of this magnitude would provide strong incentives to overcome the technical hurdles and standardisation problems that currently plague proteomics in general.

As a useful side-effect, this project (or its successors) will provide information that will help in interpreting the results of whole-genome sequencing. As I've noted before, we still know so little about our own genome that it's likely that most of us will have complete genome sequences well before we really have the tools and understanding to decipher what that sequence actually means. In order to have any chance of figuring out what effects a rare variant in an unannotated gene might have on our health we will need to call on data from many different fields of biology.

At the very least, large-scale analysis of the human proteome should allow researchers to tentatively place many of our currently anonymous genes into functional pathways. That's a step forward for personal genomics: knowing that you have a loss-of-function mutation in a gene that may be involved in cholesterol biosynthesis is a lot more useful (in terms of guiding further clinical testing) than simply knowing that you have a mutation in hypothetical gene C11orf68.



Subscribe to Genetic Future.

Wednesday, April 23, 2008

David Altshuler on personal genomics

The Boston Globe has a fairly well-balanced article on the current state of personal genomics: a field with tremendous promise that is yet to really deliver. I particularly liked these two contrasting quotes from David Altshuler of the Broad Institute:

"From a clinical point of view, [current genome scans are] just noise," he said. "No one knows how to use such information to improve health."

and

In coming years, Altshuler says, he believes genomics will be as transformative as the Internet.

The first quote is only slightly exaggerated; the second is spot-on.


Subscribe to Genetic Future.

My genes made me do it

An article in the Washington Post discusses new uses of genetic testing in the courtroom that go far beyond standard forensic DNA profiling:

[...] defense attorneys are asking judges to admit test results suggesting that their clients have a genetic predisposition for violent or impulsive behavior, adding a potential "DNA defense" to a legal system that until now has held virtually everyone accountable for their actions except the insane or mentally retarded.

Some gene tests are even being touted for their capacity to help judges predict the likelihood that a convict, if released, will break the law again -- a measure of "future dangerousness" that raises questions about how far courts can go to abort crimes that have not yet been committed.

The article correctly notes that these tests are still very much at the fringes of science - behavioural genetics is a complex field, and the current associations are generally pretty weak. However, there's little doubt that many of the traits underlying a predisposition to criminal behaviour, such as a fondness for risk-taking or susceptibility to addiction, are substantially influenced by genetic factors, and it's only a matter of time before the major genes responsible are identified and characterised.

Although genetic testing will only ever allow for a probabilistic prediction of susceptibility to criminal behaviour (unlike the tortured psychics in Minority Report), society needs to prepare itself for the consequences of these findings. For instance, do "criminal genes" excuse someone from criminality, or do they simply provide an even better reason to lock such people away for the good of society? Should expensive family monitoring and support programs be targeted towards individuals who are genetically susceptible to antisocial behaviour in the presence of abuse or neglect? Given a limited budget, should rehabilitation programs focus on criminals who lack these susceptibility genes and may thus be less inclined to re-offend?

Update: The Genetic Genealogist has a great tangentially-related post on forensic genetics.

Subscribe to Genetic Future.

Sunday, April 20, 2008

A new model for genetic privacy: you don't have any

In a perspective piece in Nature Reviews Genetics (subscription required, I think), Personal Genome Project leader George Church and colleagues advocate a revolutionary new approach to research subject privacy. Essentially, they argue that "the reality of the new genetics and genomics urges us to abandon the traditional concept of medical confidentiality". In other words, research participants must learn to accept the fact that the privacy of their genetic and health information cannot be guaranteed.

When I first heard of this concept in the context of the Personal Genome Project it struck me as pure insanity - who would volunteer for a project if there is a significant risk of your genetic and health information being accessed by (say) insurance companies? Having thought it over, though, the need for such an approach is becoming more and more clear to me. The basic argument goes something like this:

  1. Your DNA sequence (or any sufficiently large set of genetic markers, like those used in modern genome-wide association studies) is enough by itself to unambiguously identify you.

  2. Thus even "anonymous" participants in large-scale genetic studies are vulnerable to having their identity revealed - all it would take is someone to have a sample of your DNA, and access to the individual data-points from the study, and they would then have access to any health or life-style information recorded about you as part of that study.

  3. As such, there simply cannot be guarantees of anonymity given to participants in such studies, fundamentally undermining the traditional model of confidentiality.

  4. The best solution to this problem is to abandon the illusion of research subject privacy, and instead recruit participants with the explicit condition that all of the data collected about them as part of the study may in fact be revealed to the public.

The authors aren't advocating a complete dump of participant genetic and health records on a publically accessible website - although volunteers in the Personal Genome Project have the option of doing just that, should they choose to. Rather, they argue for a strategy of "maximizing data protection while informing people about its limits". In other words, doing your best to limit disclosure of individual health data, while clearly informing participants of the fact that their privacy can't be guaranteed.

It certainly is an audacious paradigm shift, and I'm having trouble predicting its consequences. For instance, will such a policy discourage people with a clear family history of genetic disease from participating in large-scale cohort studies (for insurance reasons), thus reducing the power of such studies to detect disease-associated variants? Will it create a generation gap in research participation, with conservative older people shunning studies while the children of the Facebook era - who engage in public disclosure of information with a wilfulness that seems shocking to their elders - embrace participation? I don't know, but I guess we'll all find out sooner rather than later...

Anyone interested in the Personal Genome Project (which is calling for volunteers for whole-genome sequencing, by the way) should check out their informative web-site. Misha Angrist, one of the "First Ten" participants who will have their genomes sequenced by the PGP, also has a blog that's well worth adding to your RSS reader.

Subscribe to Genetic Future.

DNA Perspectives

A while back I discussed a Nature editorial calling for a public registry of disease-gene associations. This would provide potential consumers with objective information about the scientific evidence underlying commercial gene tests, helping them to make an informed decision amidst the hype, overstated claims (and occasionally sheer lunacy) that unfortunately characterises a large swathe of the genetic testing industry at the moment.

I think a broadly-backed international genetic association registry would be a fantastic resource, whether it is built on the foundations of an existing model such as SNPedia or (more likely) assembled from scratch. However, it's unlikely that a single monolithic registry will emerge: rather, we'll probably see an array of competing databases, some with official backing, some Wikipedia-like community-based annotation projects, and more than a few set up by genetic testing companies themselves. Consumers will certainly have more access to information on genetic associations, but it will unfortunately be hosted by a plethora of organisations with different goals and target audiences.

At Eye on DNA, Hsien-Hsien Lei points to a new initiative called DNA Perspectives. DNA Perspectives is funded by DNA Direct, Hsien's employer (as Hsien is commendably scrupulous in pointing out whenever the topic arises, I should add). The aim is to develop "a collaborative site developed by a wide range of industry experts to objectively evaluate the clinical validity and utility of genetic markers as well as commercially available genetic tests" - an admirable goal.

DNA Perspectives will be based on annotation by invited experts in genetics, with all information freely available to the public, and a forum for consumers to add their comments and personal ratings. I think this model is a good one, treading the line between the semi-structured anarchy of free-for-all community resources like Wikipedia and the slow-moving, cumbersome centralised bureaucracy of many official databases (there are plenty of other viable models, of course). However, it will be interesting to see if it can overcome two potentially major obstacles.

The first is community apathy, which I think will be familiar to anyone working in an expert-curated database who has tried to recruit researchers to annotate material. Return rates tend to be low, and most experts who do visit the site will make perfunctory corrections at best. The problem is basically that most experts are busy people - writing grants and papers will always be a higher priority than annotating a database, which is a considerable effort that typically has minimal (or zero) pay-off. (I write this rather guiltily, looking back on my seriously mediocre track record of participating in such efforts.)

The second problem is the overt link to a genetic testing company. No matter how hands-off DNA Direct attempts to be, there will always be a conflict of interest when the body running a genetic association registry is simultaneously relying on sales of genetic tests to pay the rent. DNA Direct certainly appears to be one of the most evidence-based genetic testing companies out there, so it hopefully doesn't have much to hide - but nonetheless, if an expert reviewer offers a scathing critique of a test that DNA Direct offers, how will the company feel about hosting that review on its own server space? Even if the company fastidiously avoids censoring the reviews, it will always be very hard to overcome consumer perceptions of bias.

Ultimately, potential genetic testing customers will probably feel much more comfortable sourcing their information from a registry with no financial ties to the testing industry. I also suspect that expert reviewers will also be easier to attract to a database backed by major funding bodies and research institutions, who can offer both the small carrot of officially-sanctioned kudos for their efforts, and (potentially) the more effective stick of making funding partly conditional upon participation in the review effort.

Of course, at this stage no such official and comprehensive registry exists - and while I don't think it's the ultimate solution, DNA Perspectives is at least a step in the right direction at a time when consumers are desperate for guidance through the murky waters of the DTC genetic testing field. I look forward to seeing how it progresses.

Subscribe to Genetic Future.

Thursday, April 17, 2008

Watson's sequence: gloomy news for personal genomics?

It's hard to imagine how the publication of the complete genome sequence of Jim Watson - assembled with unprecedented speed and cheapness using next-generation sequencing technology - could possibly be bad news for the field of personal genomics. But in an opinion piece accompanying the publication in Nature, genome evolution guru Maynard V. Olson makes that argument:

If Watson took his sequence to a genetic counsellor, there would be little to discuss. The sequence seems to show that he is a carrier for a handful of mutations that might catch a counsellor's interest. But these mutations have no known effects on Watson himself, and would confer risk on offspring only in the highly unlikely event of a marriage between two carriers. None of these mutations is ever likely to be considered an appropriate candidate for screening in the general population — of which, for these purposes, Watson is a representative member.

Recognition of the thin clinical value of this sequence may cause some investors in the new sequencing methods to take pause, given that the major capital investments required to commercialize these technologies have been motivated more by their perceived medical potential than by research applications.

Well, our current ignorance of the functional significance of most genetic variants make a good argument for not getting your genome sequenced right now. But it's not like you need that argument - the best reason for not getting your genome sequenced now is that it's ludicrously expensive (unless you have $350,000 to burn, like Dan Stoicescu).

By the time whole-genome sequencing becomes affordable - in perhaps five years - our understanding of the functional effects of human genetic variation will be dramatically better than today. With each genome that gets sequenced that understanding will grow. And best of all, a genome sequence never becomes obsolete (unlike the SNP chips currently used by personal genomics companies like 23andMe and Navigenics, which will really start to lose their usefulness over the next year or two).

In any case, while we can't predict the functional impact of every single variant in Watson's genome, even our limited current knowledge is enough to reveal some potentially important sites. For instance, Watson carries at least 10 mutations that have previously been associated with severe diseases in humans (in most cases he only carries one copy of a mutation, where two would be required to cause disease). Given that known mutations are only a small fraction of the total sequence changes that could result in severe disease, this suggests that each of us may carry quite a large number of mutations that could potentially result in serious disease in our children, should we be unlucky enough to mate with someone carrying mutations in the same gene.

In addition, the researchers predict that almost 300 of the protein-altering variants in Watson's genome are "probably damaging" to the function of the protein. These types of variants may potentially play a role in susceptibility to disease, although we don't yet know enough to be able to pick them out with any real confidence.

Anyway, it's a start. Olson is certainly correct that we still know far too little about the function of our genome for large-scale sequencing to be used as a population screening tool, but Watson's sequence illustrates that - once our knowledge has improved - there will be plenty of potential functional information to explore in a typical genome.

Update: MassGenomics has a great break-down of the Watson data.


Subscribe to Genetic Future.

Monday, April 14, 2008

Genome-wide association studies taken to the next level

The Wellcome Trust Case-Control Consortium, the group responsible for a massive study of genome-wide associations (GWAS) in seven different common diseases published last year as well as a wide range of other projects in disease genetics, has just announced plans for a mind-bogglingly large expansion of their GWAS efforts.

The numbers are truly impressive: 120,000 participants, 25 different diseases, and a total cost of £30 million (nearly US$60 million). Patients and controls will be screened for up to 1 million genetic variants, as well as being subjected to analysis of genome-wide copy-number variation (insertions and deletions of DNA).

The diseases aren't all listed in the press release, but I've managed to get a breakdown from the Wellcome Trust:

Visceral leishmaniasis
Bacteraemia susceptibility
Human prion disease
Ankylosing spondylitis
Multiple sclerosis
Ulcerative colitis
Psoriasis
Coeliac disease
Asthma
Glaucoma
Schizophrenia
Psychosis endophenotypes
Parkinson’s disease
Partial epilepsies
Ischaemic stroke
Abdominal aortic aneurysms
Myocardial infarcation
Coronary artery disease
Extreme and early onset obesity
Response to statin treatment
Barrett’s oesophagus and oesophageal adenocarcinoma
Breast Cancer
Adult glioma
Pre-eclampsia
Endometriosis

There's something in that for nearly everyone, as well as an interesting addition that isn't a disease at all: reading and mathematics abilities in 12-year-old children enrolled in the UK Twins Early Development Study. This marks an interesting (and potentially extremely controversial) foray into the world of cognitive genetics. Watch this space - the media coverage of this aspect of the project is unlikely to be universally positive.

I haven't yet been able to find out the sample sizes for each disease, but it's clear from the total number quoted in the press release that at least some of these cohorts will be quite well-powered - assuming they use a large, shared group of controls, the average sample size is likely to be more than 4,000 patients.

I've recently spent quite a bit of my time talking down the power of genome-wide association studies. Nonetheless, a study of this magnitude - combining SNP data with copy number variation - is likely to capture a sizeable (albeit by no means complete) chunk of the genetic risk variants for many of these diseases. Having comparable data-sets from so many different diseases will also facilitate the identification of common variants that influence risk for multiple diseases, as has already been demonstrated for IL23R in several auto-immune conditions.

In addition, the WTCCC and its partners are assembling an enviable collection of well-characterised DNA samples from patients and controls that can be rapidly deployed for large-scale sequencing approaches once the cost of sequencing drops far enough.

Exciting times...


Subscribe to Genetic Future.

Sunday, April 13, 2008

Navigenics vs 23andMe: drawing the battle-lines

Well, the debut of Navigenics has certainly been a lot more interesting than I anticipated. Far from being just another genome-scan product limping along in the wake of 23andMe (like, say, SeqWright's rather depressing effort), Navigenics is brazenly attempting to re-define the entire industry in a way that suits them.

At the very least, the company is staking a solid claim over the lucrative well-paid over-30 non-geek market niche, which has been surprisingly poorly tapped by the current players. But Navigenics seems to want to go further than this: in fact, they appear to be trying to reshape the personal genomics industry as being first and foremost about the sober provision of evidence-based health information, and simultaneously position themselves as the most respectable provider of this information. If in the process they can create a perception of their competitors (particularly 23andMe) as frivolous and over-hyped, so much the better.

Over at Genetics and Health, Elaine Warburton has a long interview with Navigenics' Medical Director Michael Nierenberg. This is by no means a probing critique - in fact, it reads suspiciously like an extended advertisement for the company - but there are some interesting snippets from Nierenberg about the image Navigenics wishes to present:

Navigenics is no way a ‘recreational’ genomics company and does not wish to contemplate entering any ‘recreational’ field. It is a company focusing on the wellness and prevention aspects of health. Our service focuses on actionable entities and things of substance such as cardiac disease, not eye colour or such like. We welcome regulation and make heavy use of genetic counseling.

The sub-text is abundantly clear: we'll give you accurate information about the really important stuff like cancer and heart disease, whereas our competitors (they know who they are!) mess about with trivial information about athletic performance and ear-wax consistency.

Navigenics' well-orchestrated marketing campaign revolves around this central theme of seriousness and competence, and I'm sure the message is sinking in with their apparent target audience (well-paid, highly-educated, time-poor executive types old enough to start fretting about their long-term health); having the reliably earnest Al Gore spruik the company certainly didn't hurt. To emphasise their trustworthy seriousness, Navigenics has launched a joint study with the Mayo Clinic into the effects on patients of receiving genetic information, is partnering with Medscape to provide physician education, and proposed a set of standards for personal genomics companies (a clear attempt to re-define the industry in their own image, while simultaneously seizing the moral high ground).

Through these activities, as well as their use of CLIA-certified genotyping facilities and provision of 24-hour access to genetic counselling, the company no doubt hopes to avoid many of the criticisms thrown at other personal genomics companies.

This all seems quite admirable, on the whole. However, the Navigenics model is also deeply regressive: they are taking the currently exciting, somewhat anarchic but intrinsically empowering field of personal genomics (in which individuals are free to explore their own genetic data however they wish) and cramming it back into the tightly-regulated, paternalistic environment of the standard medical framework. Where 23andMe talks about guiding customers through their own journey of genetic discovery, Navigenics appears to be more about giving clients the information that Navigenics thinks is medically relevant, and protecting them from all the non-essential details that might overwhelm or confuse them.

Nowhere is this regressive paradigm more evident than in Navigenics' refusal reluctance to give their customers access to more than a tiny fraction of their own genotyping results. Unlike 23andMe and deCODEme, who both freely provide clients with access to their complete, raw genotyping data, Navigenics customers must sign a waiver to receive their results on an encrypted disk (presumably without an easy-to-navigate interface); Navigenics ominously warns that "without our blessing, the potential for misinformation is extremely high" (updated thanks to Hsien). Elaine puts a positive spin on this reluctance:

Imagine the confusion and furore if Navigenics were to provide its members with their full 1 million marker analysis! Navigenics’ (and others) sensible, if somewhat patriarchal approach of ‘drip feeding’ results to members as and when the research is robust enough to bring the SNP into the public domain, is one that should be applauded not derided.

In other words, customers shouldn't need to worry their pretty little heads over all these confusing As, Cs, Gs and Ts - they can just let Navigenics decide what they need to know. Ouch.

I can only assume that Navigenics' focus group research suggests that their target audience finds this attitude reassuring rather than profoundly insulting; either way, it's both patronising and unnecessary. After all, it's not like 23andMe simply punt any old genetic association out there for their customers to sift through - they carefully code the associations to indicate how reliable they are (based on a pretty reasonable set of criteria [PDF], I might add). Customers are allowed to analyse their own data for both gold-standard and lower-reliability associations, but are given information to help them decide how much weight they should place on each. In my opinion this sort of informed freedom is a far more enlightening (and vastly less insulting) model that the constrained "need-to-know" approach of Navigenics.

Anyway, it will be interesting to see how Navigenics alters the long-term tone of the personal genomics market. Perhaps the early pioneering feel of personal genomics was just a temporary aberration, and we are now seeing the beginning of a general regressive shift towards the standard medical model. More optimistically, I suspect these early battle-lines mark the beginning of a diversification of the industry, with some products targeting the individualistic and curious spirit of a younger, information-savvy generation, and others appealing to the more serious health-centered focus of individuals moving towards middle age.

Either way, it will be fascinating to watch 23andMe, Navigenics and their current and upcoming competitors struggling to define an entire industry as they battle for market share.


Subscribe to Genetic Future.

Thursday, April 10, 2008

Ready or not, personal genomics is here

A new editorial in Nature comments on the rapidly expanding field of personal genomics. The appearance of this industry has taken many observers by surprise; indeed, the authors note, "Rarely have basic discoveries morphed into a commercial product quite so swiftly."

The speed of the industry's growth has led to many calls for heavy regulation, which (I think) would be a disastrous approach for consumers. Nature agrees, and offers a positive alternative solution:

If consumers are to reap the benefits that genetic testing can offer, they need understandable information about the basis, validity and limitations of the tests. One proposed structure for providing this information is a publicly accessible registry into which test-makers would be required to upload data about their tests and the studies that back them. This information should be updated as genetic risks are changed or refined, as inevitably they will be.

There are already some similar databases that currently exist (such as the Wikipedia-like SNPedia) or are being planned (e.g. GEN2PHEN [PDF]), although none of them are yet comprehensive or rigorous enough to fulfil the needs of genetic test consumers. It would be great to see these and similar efforts promoted and funded, or perhaps even combined in a central registry that supplements slow, careful expert annotation with the faster but looser community-driven SNPedia approach. It would almost certainly be more cost-effective to build on existing projects rather than developing a new registry from scratch.

However the registry develops, Nature's point is that the solution to shonky genetic test vendors isn't just legislation (which, if too heavy-handed, will also negatively affect legitimate companies and limit consumer choice), it's also information. Providing potential customers with reliable data about the efficacy of genetic tests and allowing them to make their own decisions protects consumers without sacrificing their autonomy. This is certainly my philosophy, and the motivation behind Genetic Future - it's very reassuring to see that this sentiment is shared in the lofty reaches of the Nature editorial board.

The article finishes with pertinent advice to consumers:

In the meantime, online shoppers who buy genetic tests would do well to keep asking themselves whether the science is, indeed, ready.

Before buying any genetic test, research widely about its pros and cons, and think hard about whether the information you receive will really be worth the money you spend, or whether you'd be better to save your money until better tests are available.


Subscribe to Genetic Future.

Personal genomics: getting your money's worth

Over at Eye on DNA, Hsien-Hsien Lei has an entertaining list of the variety of personal genomics services that could be purchased for the $2,500 cost of a full Navigenics scan. There's some tough decisions in there: would I prefer two paternity tests, or sixteen genetic tests predicting my risk for baldness?

Hsien points out that at the current going rates of the sole company offering commercial whole-genome sequencing, $2,500 would buy you only 0.71% of a whole genome. That sounds small, but it's still more than 20 million base pairs - twenty times the paltry one million sites interrogated by the Navigenics chip for the same price!

Of course, the Navigenics SNPs have been carefully selected to provide as much information as possible about common genetic variation, so they're still a better purchase right now than a random 0.71% of a genome sequence. Nonetheless, this comparison provides some insight into just how cheap sequencing technology is becoming; it certainly won't be long before it's commercially competitive.

As I've emphasised in recent posts, the chip technology currently used to analyse genetic variation by researchers and personal genomics companies (23andMe, deCODEme, SeqWright and now Navigenics) will only ever capture a fraction of your total genetic risk for common disease: the fraction that consists of common small-scale variants.

In contrast, whole-genome sequencing will give you information about the types of genetic variation - such as rare variants and large-scale structural variation - that are completely invisible to current chips. Since these variants probably constitute a substantial fraction of genetic risk for common diseases, sequencing (when it becomes affordable) is likely to give you a lot more useful information than current genome scans. And best of all, since whole-genome sequencing gives you information on every variation in your genome, it won't ever become obsolete - whereas chips will be periodically replaced by new, higher-resolution models that capture a larger (but still incomplete) snapshot of your genetic variation.

In other words, while genome scans are the best affordable technology we have right now, they have profound limitations and will become rapidly outdated as researchers begin to focus on the rare variants and structural variation that contribute to variation in complex traits and common disease risk. For those who care about value for money, my suggestion is that you put your $2,500 in a bank account with a good interest rate and don't take it out until whole-genome sequencing becomes cheap enough to buy that instead.

Of course, that's advice for those who are mainly interested in health prediction. For genetic genealogists current genome scans provide some powerful information about genetic ancestry; if that's your interest, you'd probably be best off investing in a deCODEme scan, which gives you the same number of SNPs as Navigenics for 40% of the price, and (unlike Navigenics) allows you to download your complete raw data.


Subscribe to Genetic Future.

Tuesday, April 8, 2008

Some early thoughts on Navigenics

It's been a long, long wait, but Navigenics has finally officially entered the personal genomics arena. Like the Me Two (23andMe and deCODEme), Navigenics will be offering to determine the sequence at hundreds of thousands of commonly variable positions throughout your genome, which it will use to make predictions about your risk for a variety of common diseases such as heart disease and type 2 diabetes. The service will cost US$2,500, compared to US$1,000 for the two major competing offers.

The Wired blog has the facts. A few early thoughts:

  1. Navigenics would probably struggle to compete head-to-head with 23andMe, which has a much stronger public profile, a funkier website, and offers a significantly cheaper service (albeit with fewer markers); BUT

  2. It's pretty clear they're aiming at a different market niche altogether. Just compare the two websites (23andMe, Navigenics): 23andMe will appeal more to the hip young web-savvy childless yuppie who wants to know more about him/herself and build up some cool conversation topics (their website is all about "Your personal journey of genetic discovery"); Navigenics is aiming for the sober, older executive with kids who watches their weight and cholesterol and heads to the gym three times a week (website quote: "I want to be part of all the big moments in my son's life, so I'm doing everything I can to stay healthy.")

  3. Providing access to genetic counselling and promoting physician education, while certainly praise-worthy, is all part of this market positioning. Navigenics is trying to say that they're above all the hype and frivolity of 23andMe; all they care about is your future health, and they care about that in a deeply earnest, professional yet compassionate manner. (The genetic counselling video sums up the mood nicely.)

  4. Navigenics has started with a genotyping service that is CLIA-certified - unlike 23andMe, who had to change labs to a CLIA-certified facility a few weeks back (causing disruptions to their service).

  5. Navigenics offers a long-term DNA storage service, which is a clever business move. It will be that much easier to convince customers to purchase an extra DNA test in a year's time - or whole-genome sequencing in five years' time - if they don't have to go through the hassle of re-submitting a mouth swab. Never underestimate the role of convenience in shaping consumer decisions.

  6. The "relative lifetime risk" analysis offered by Navigenics seems as though it will provide more impressive-sounding numbers than the absolute risk estimates offered by 23andMe and deCODEme, but I'm not sure how statistically sound it is to extrapolate odds ratios from genetic association studies to total lifetime risk (especially given that the strength of genetic associations is known to vary with age). Would any statisticians out there care to dissect Navigenics' white paper on their methods?

  7. Finally - and this is completely a personal thing - there's absolutely no way I'd buy into a service that didn't provide me with complete and unfettered access to my raw SNP data. Both 23andMe and deCODEme offer customers the ability to download and analyse their own data; this isn't the case for Navigenics, according to the Wired article. That's a complete deal-breaker for me, but admittedly it's unlikely to have much of an impact on the chiselled, athletic executive types featured on Navigenics' front page, who (understandably) have no particular interest in the raw data but simply want to know their risk of stroke and heart disease (presumably so they can stay alive and healthy long enough to play football with their chiselled, athletic kids).

Cynicism aside, I've got to hand it to Navigenics: they've managed to neatly differentiate themselves from the competition, and they're now poised to capture a lucrative high-income section of the market that has been surprisingly poorly targeted by existing personal genomics companies.

Anyway, you'll no doubt read a lot more about this over the next day or two. Genetics and Health has an ongoing (and thus far relentlessly positive) series of posts on Navigenics. I'm particularly interested to see what Steve Murphy has to say - it seems to me that Navigenics has managed to avoid most of the problems that he's been slamming 23andMe for over the last few months.


Subscribe to Genetic Future.

Monday, April 7, 2008

Height and hypertension genes in Nature Genetics

The advance online edition of Nature Genetics is stuffed with juicy complex human genetics goodness.

Firstly, there are three massive genome-wide scans for genes involved in regulating human height, each of which analysed more than ten thousand individuals. As I've mentioned before, height appears to be one of those traits (like bipolar disease) that thumbs its nose at genome-wide association studies (GWAS). That's evidently clear from these studies, each of which - despite their unprecedented size (one of them scanned more than 25,000 individuals!) - managed to capture variants explaining less than 5% of variation in height.

I note that a few previously identified height genes, like HMGA2 and GDF5, pop up in more than one of the three studies, while a new gene (ZBTB38) appears as the top candidate in all three of the studies. However, there doesn't seem to be a huge amount of overlap in the lower-ranked genes (although I need to read the articles more carefully to be sure).

ScienceDaily puts a positive spin on the story ("Scientists are beginning to develop a clearer picture of what makes some people stand head and shoulders above the rest"), but the real story is this: despite the massive scale of these studies, they're still only capturing less than 5% of the total variance in a trait that is almost entirely (~80%) genetic. This is a powerful demonstration of the inability of current GWAS technology to access the genetic variants responsible for the vast majority of heritable variation in at least some complex traits, for reasons I've discussed recently.

Researchers interested in the genetics of common diseases are no doubt experiencing a sinking feeling as they read these studies, since there's every reason to expect that what holds true for height will also apply in at least some of these conditions. If so, the number of patients required to characterise even a trivial proportion of the total genetic risk using GWAS will be astronomical. However, there is a light at the end of the tunnel: large-scale sequencing, once it drops in price, will provide researchers with access to the rare variants and structural variation currently missed by chip-based GWAS technologies, and should help to capture a substantial proportion of the missing variation.

This leads into another Nature Genetics article, which used an interesting candidate gene resequencing strategy to detect variants linked with variation in blood pressure. Readers persistent enough to slog through yesterday's post on the genetics of bipolar disease might recall that hypertension is another disease in which the GWAS approach has yielded little success; in the comments to that post, G from Popgen ramblings notes that admixture mapping (an approach to gene identification using populations with mixed ancestry) has also failed to produce consistent signals, despite a profound difference in hypertension risk between populations.

The Nature Genetics study took a different approach, sequencing the full coding regions of three genes associated with rare, serious hypertension conditions in previous family studies in more than 3,000 individuals from the Framingham Heart Study cohort. They found a scattering of rare variants - all present in a single copy in any given individual - with either inferred or biochemically verified effects on protein function. When the individuals carrying these rare mutations were analysed as a group they showed significantly lower blood pressure than non-carriers.

This combination of targeted resequencing and functional analysis is a difficult road, but it's one that researchers will have to follow increasingly often as they attempt to characterise the rare variants that likely comprise a significant fraction of common disease risk. I'll have more to say about this in future posts.


Subscribe to Genetic Future.