Category Archives: Public data & annotations

Updating Somatic Annotation Catalogs: ICGC and COSMIC

Golden Helix continues to incorporate and update great somatic annotation catalogs for our VSClinical users. We currently have the updated version of one of the largest cancer databases from the International Cancer Genome Consortium, or ICGC. Version 28 has been improved by integrating ClinVar and CIViC clinical annotations and, as always, by increasing the number of mutations listed. The current… Read more »

Updated Annotations: The new and improved gnomAD 2.1.1

The Broad Institute team led by Dan MacArthur announced the release of gnomAD version 2.1 at last year’s ASHG conference. This new version boasted data from 125,748 exomes and 15,708 genomes, but the greater updates were the improved QC refinement and more discrete sub-population breakdowns. Although the majority of samples were counted in the previous 2.0.2 release, the additional… Read more »

New and Updated Annotations

Cody Sarrazin, December 7, 2017

Golden Helix is excited to announce a new round of novel and updated annotations, including a frequency track, a region track, and a gene track. All three of these tracks were created with the use of VarSeq and its Convert Wizard functionality. First, the expansive 1000 Genomes track (1kG) has been updated to include sub-population allele frequencies and heterozygous and… Read more »

Annotation Education Series: CNV Annotations

With the recent upgrade to VarSeq 1.4.7, users gain access to some great new features. Among the additions are new CNV annotations (Figure 1). In this final chapter of the annotation blog series, we are going to provide descriptions of the new CNV annotations and how they can be used. The types of CNV annotations vary and include frequency, clinical… Read more »

Annotation Education Series: Frequency, Functional Prediction, and Gene Annotations

In our final chapter of this variant annotation blog series, we will discuss additional annotations that provide powerful variant filtering and analysis capability. Golden Helix curates many annotations in a way that allows for simple analysis and saves the users the hassle of all this data management. Whether you are trying to capture rare variants known across multiple subpopulations in… Read more »

Annotation Education Series: Cancer Annotations

The Clinical Interpretation of Variants in Cancer, better known as CIViC, is an open-access, open-source, community-driven web resource available to all VarSeq users. Nature Genetics published an article that states, “CIViC accepts public knowledge contributions but requires that experts review these submissions”. Fundamentally, the focus behind CIViC is to make sure the variants contained in the database… Read more »

Golden Helix, Inc. – Your Annotation Curation Station

The current reduced cost and increased availability of genome sequencing have been making academics, clinicians and individuals alike excited about the possibility of increased research depth, diagnostic capability and personal curiosity. And although a freshly sequenced genome is chock-full of tasty letter snippets, the real revelation and education occur when comparing to an annotation foundation. In this post, I’ll review… Read more »

Coming Soon! The genome Aggregation Database (gnomAD)

Ever since the MacArthur Lab announced the new gnomAD browser at last year’s ASHG conference, we have had many requests from our customers to make this new variant frequency source available within both VarSeq and SVS. This new dataset includes variants obtained from 123,136 exome sequences and 15,496 whole-genome sequences. In comparison to the original ExAC dataset which contained exomes… Read more »

ExAC CNVs: The First Large Scale Public Exome CNV Variant Set

ExAC CNVs were released publicly with a recent publication, providing the full set of rare CNVs called on ~60K human exomes. While there are many public CNV databases out there, this is the first one that was derived from exome data, and thus includes both extremely rare and very small CNV events. With the recent release of Golden Helix’s CNV calling… Read more »

GWAS Example Project Updated for SVS Viewer

With the release of our updated GWAS E-book, we have recently updated the GWAS example project (SNP Genome-Wide Association Tutorial – Complete). This updated project includes more details about how spreadsheets were generated, how to generate plots and which images were used for the GWAS E-book. This information can be found in the User Notes view in the project navigator and… Read more »

Updates to dbNSFP – The Universal Remote of Annotation Sources

Probably one of our most popular public annotation sources that we curate and update is the database of Non-Synonymous Functional Predictions (dbNSFP). In its recent update, it has expanded the predictions to include FATHMM-MKL, and VarSeq now incorporates this new prediction into its voting algorithm, which now draws on 6 different discrete predictions per variant. You can update to dbNSFP 3.0 using the built-in… Read more »

Analyze Your 23andMe Genotype Files with Golden Helix

I was definitely an early adopter when it comes to personal genomics. In a recent email to its customer base announcing its one millionth customer, 23andMe revealed that I was customer #44,299. And I have been consistently impressed with the product 23andMe provides through their web interface to make your hundreds of thousands of genotyped SNPs accessible and useful. It… Read more »

The Clinical Genome Conference 2015 Highlights

This last week I had the pleasure of attending the fourth annual Clinical Genome Conference (TCGC) in Japantown, San Francisco and kicking off the conference by teaching a short course on Personal Genomics Variant Analysis and Interpretation. Some highlights of the conference from my perspective: Talking about clinical genomics is no longer a wonder-fest of individual case studies, but a… Read more »

New and Updated Annotation Tracks Now Available!

In recent months we have been updating our public annotation library to include the most recent versions of existing sources as well as new sources. Each of these annotation sources is compatible with our three major products (SVS, GenomeBrowse and VarSeq) and can be used for visualization, annotation and filtering. NHLBI ESP6500SI-V2-SSA137 Exomes Variant Frequencies 0.0.30, GHI Annotations are… Read more »

Supercentenarian Variant Annotation: Complex to Primitive

In a previous blog post, I demonstrated using VarSeq to directly analyze the whole genomes of 17 supercentenarians. Since then, I have been working with the variant set from these long-lived genomes to prepare a public data track useful for annotation and filtering. Well, we just published the track last week, and I’m excited to share some of the details… Read more »