Clinical Sequencing Data Sharing Is Essential



The past few decades have seen rapid advances in our knowledge of genetic diseases, which affect an estimated 25 million Americans. These advances can be quantified by measures such as the growth of dbSNP (which now contains about 90 million validated genetic variants) and the number of Mendelian disorders understood at the genetic level (over 5,000).

Some of the factors that have contributed to this progress include:

  • Big science. Ambitious, grant-supported, international efforts like the Human Genome Project, the HapMap Project, and the Cancer Genome Atlas yielded the public resources that form the foundation of modern human genetics research. Thank you, taxpayers.
  • Technology development. Revolutionary advances in genome interrogation technologies (high density SNP arrays, whole-genome sequencing, etc.) have made large-scale genetic studies feasible, both technically and financially.
  • Study participants. It’s important to remember that most (if not all) human genetics studies could not have happened without the patients and families who volunteered their samples, often with the knowledge that they’d get nothing in return.

The Unsolved Problem of Inherited Disease

Few areas have benefited as much from these advances as the study of rare genetic diseases. Exome sequencing has enabled the rapid genetic diagnosis of many patients, and the discovery of hundreds of new Mendelian disease genes. Yet even well-powered Mendelian disease studies can fail for a variety of reasons. There’s also a considerable gray area between success and failure: the implication of an unknown gene, or one that has never been associated with disease.

One particular challenge is that Mendelian diseases are rare by definition, and the variants definitively shown to cause them are rarer still. As a result, many variants detected in clinical sequencing projects end up with the label “variant of unknown significance,” or VUS. Even when given a classification, some variants are interpreted differently by different clinical laboratories.

As discussed in a report in the New England Journal of Medicine this week, another thing that has hampered our ability to discover and annotate clinically relevant genetic variation is the “silo effect,” in which research groups (both commercial and academic) maintain private databases of clinical sequencing results. A great example of this is Myriad Genetics, a company that’s probably sitting on the largest database of BRCA1/2 mutations in the world.

The problem, of course, is that not all of the clinical datasets for a given disease or gene end up in the same silo. Thus, researchers in group A might have a promising new disease gene that researchers in group B have also identified in a different kindred. If those datasets were shared, rather than kept isolated, these groups could cross-validate with one another and the research community as a whole would benefit.

Data Sharing in ClinVar

The NIH’s Clinical Genome Resource program (ClinGen) hopes to address some of these issues by developing community resources to improve our understanding of genomic variation and its use in clinical care. The cornerstone of this effort is ClinVar, a database of variants annotated with clinical data.

ClinVar Contributors

Over 300 different submitters have contributed to ClinVar thus far. Those submitters comprise research groups, clinical laboratories, locus-specific databases, and aggregate databases (like OMIM). Here’s a plot of the variants submitted for some of the major (or interesting) contributors:

[Figure: ClinVar Submitters]

The largest submitter by far is OMIM, which has contributed over 25,000 variants to ClinVar. It’s encouraging to see two of the leading genetic testing providers (GeneDx and Ambry Genetics) making substantial contributions. Among academic centers, the University of Chicago and Emory University are the clear leaders.

As of May 2015, ClinVar contained 172,055 variant submissions across 22,864 genes. More than 118,000 unique variants have clinical annotations, though 21% of those are classified as “variant of unknown significance.” Nevertheless, this rapidly growing resource illustrates the power of sharing clinical variant annotations in a centralized manner.

Discordant Clinical Annotations

Notably, 12,895 variants have clinical annotations (pathogenic, unknown, or benign) from at least two different laboratories, and 17% of the time those annotations did not agree. For example, at least 220 of the “pathogenic” variants pulled in from OMIM (the largest contributing database) are classified by clinical laboratories as either benign or of unknown significance.
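To make the idea of a discordance rate concrete, here is a minimal sketch of how one might compute it from a table of submissions. Everything in it is an assumption for illustration: the records are made-up toy data (not real ClinVar entries), the three-way labels simply mirror the pathogenic/unknown/benign categories mentioned above, and the function name `discordance_summary` is hypothetical. It is not how ClinVar or the ClinGen authors computed their figures.

```python
from collections import defaultdict

# Hypothetical (variant, submitter, classification) records; NOT real ClinVar data.
submissions = [
    ("variant_1", "lab_A", "pathogenic"),
    ("variant_1", "lab_B", "pathogenic"),
    ("variant_2", "lab_A", "pathogenic"),
    ("variant_2", "lab_C", "benign"),
    ("variant_3", "lab_B", "unknown"),
    ("variant_4", "lab_A", "benign"),
    ("variant_4", "lab_B", "benign"),
    ("variant_4", "lab_C", "unknown"),
]

def discordance_summary(records):
    """Count variants classified by >= 2 submitters and find those that disagree."""
    calls = defaultdict(dict)  # variant -> {submitter: classification}
    for variant, submitter, classification in records:
        calls[variant][submitter] = classification

    # Only variants with annotations from at least two submitters can be compared.
    multi = {v: c for v, c in calls.items() if len(c) >= 2}

    # A variant is discordant if its submitters gave more than one distinct label.
    discordant = sorted(v for v, c in multi.items() if len(set(c.values())) > 1)

    rate = len(discordant) / len(multi) if multi else 0.0
    return len(multi), discordant, rate

n_multi, discordant, rate = discordance_summary(submissions)
print(n_multi, discordant, round(rate, 2))
```

In this toy table, three variants have calls from two or more labs and two of those disagree. The same counting logic, applied to shared submissions, is what makes a figure like the 17% above possible to measure at all; it cannot be computed across private silos.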

It is clear that the guidelines for variant interpretation differ between laboratories, and need to be standardized. Even so, adopting standards and making the effort to share clinical variant findings and annotations (along with the relevant phenotype data) is critical to the success of rare disease research. ClinVar seems to be taking us in the right direction.


Rehm HL, Berg JS, Brooks LD, Bustamante CD, Evans JP, Landrum MJ, Ledbetter DH, Maglott DR, Martin CL, Nussbaum RL, Plon SE, Ramos EM, Sherry ST, Watson MS, & ClinGen (2015). ClinGen – The Clinical Genome Resource. The New England Journal of Medicine. PMID: 26014595
