The Genome Aggregation Database (gnomAD) is a resource developed by an international coalition of investigators, with the goal of aggregating and harmonizing both exome and genome sequencing data from a wide variety of large-scale sequencing projects (1). We have covered this annotation in-depth in other blog posts, but this resource contains over 125,000 exome sequences and around 16,000 whole genome… Read more »
We have covered a lot of ground in this Automating & Standardizing Your Workflows blog series. First, we saw how to perform secondary analysis with Sentieon to generate the necessary VCF and BAM files for tertiary analysis in Part I. The implementation of VSPipeline allowed for rapid import and project generation for a predefined cancer gene panel project template in… Read more »
In our last part of this series, we showed how to run a pre-built workflow template via VSPipeline to automatically import and filter sample variants to streamline the search for clinically relevant variants. Now, we can deep-dive into our filtered, pathogenic variants to fully understand and capture their final classification and interpretation. Filtered Germline Variants for ACMG Guidelines The VSPipeline… Read more »
VSPipeline: Automating your Tertiary Workflows The first part of this “Automating & Standardizing your NGS Workflow” blog series covered the secondary analysis steps of read alignment and variant calling with Sentieon. The next step is to transition into the tertiary analysis via utilization of our workflow automation tool, VSPipeline. VSPipeline operates as a command-line tool meant to simplify the deployment… Read more »
Smoothing Hurdles into Speed Bumps when creating Annotation Sources Although most researchers assume that getting the pile of VCF sequence files is the largest hurdle in moving towards an analysis, there still exists the looming step of normalizing the variant calls in annotation sources to make variant comparison easier. In this ever-refining field of study, VarSeq continually works to increase… Read more »
Check out some information on the upcoming release of new genome maps and annotation tracks for a few animal species.
Clinical Assessment Tracks Golden Helix provides a large catalog of annotation sources for our research and clinical clientele. Making these public data repositories available to all our users is no easy task. As Cody Sarrazin mentioned in his blog post, annotation curation is a complex data science pipeline. This process aggregates data from many disparate sources and normalizes it into… Read more »
In recent months we have been updating our public annotation library to include the most recent versions of existing sources as well as include new sources. Each of these annotation sources are compatible with our three major products (SVS, GenomeBrowse and VarSeq) and can be used for visualization, annotation and filtering. NHLBI ESP6500SI-V2-SSA137 Exomes Variant Frequencies 0.0.30, GHI Annotations are… Read more »
Since its early release in early 2012, the population frequencies from the GO Exome Sequencing Project (ESP) – from the National Heart, Lung and Blood Institute (NHLBI) have been a staple of the genomic community. With the recent release of ExAC exome variant frequencies, the ESP has been surpassed as the largest cohort of publicly available variant frequencies (by nearly… Read more »
In recent months we have been updating our public annotation library to include the most recent versions of existing sources, as well as include new sources. All of these annotation sources are compatible with our three major products, VarSeq, SVS, and GenomeBrowse, and can be used for visualization, annotation, and filtering. dbNSFP NS Functional Predictions 2.8, GHI and dbNSFP Predictions… Read more »