Category Archives: How to’s and advanced workflows

Looking Beyond the Exons: Splice Altering Variants

There are many approaches that one might use to define a variant as potentially deleterious. For example, we often see analysis workflows based on rare, non-synonymous variants, perhaps incorporating additional annotation sources that capture known or predicted consequences of coding variants. Annotations for coding regions of the genome are relatively abundant and familiar to genome scientists. We are comfortable in… Read more »

VarSeq as a Clinical NGS Platform Q&A

Our VarSeq as a Clinical Platform webcast last week highlighted some recent updates in VarSeq that support gene panel screenings and rare variant diagnostics. The webcast generated some good questions, and I wanted to share them with you. If the questions below spark new questions or need clarification, feel free to get in touch with us at info@goldenhelix.com. Question: Should… Read more »

Visually Filtering Data in GenomeBrowse

Over 650 GenomeBrowse licenses have been registered and downloaded since the beginning of 2015, and with so many people enjoying the utility of this freeware program, I wanted to showcase some advanced tips and tricks so you can get more out of GenomeBrowse! In the Controls panel, once you have clicked inside a data plot, there is a “Filter” tab. This… Read more »

Analyzing a Unique Family Structure in VarSeq 1.1.1

I am constantly on the lookout for fun or interesting datasets to analyze in SVS or VarSeq and recently came across a study looking into inherited cardiac conduction disease in an extended family (Lai et al. 2013). The researchers sequenced the exomes from five family members including three affected siblings and their unaffected mother and an unaffected child of one… Read more »

Q&A Surrounding Population-Based DNA Variant Analysis

Last month, Dr. Bryce Christensen presented Population-Based DNA Variant Analysis via webcast. The webcast reviewed the fundamentals of population-based variant analysis and demonstrated some of the tools available in SVS for analysis of both common and rare variants such as the SKAT-O method, as well as other functions for annotation, visualization, quality control and statistical analysis of DNA sequence variants. Here… Read more »

Q&A from our December Genomic Prediction webcast

Our Genomic Prediction webcast in December discussed using Bayes-C pi and Genomic Best Linear Unbiased Predictors (GBLUP) to predict phenotypic traits from genotypes in order to identify the plants or animals with the best breeding potential for desirable traits. The webcast generated a lot of good questions as our webcasts generally do. I decided to begin to share these Q&A… Read more »

SVS, Population Genetics, and 1000 Genomes Phase 3

One frequent question I hear from SVS customers is whether whole exome sequence data can be used for principal components analysis (PCA) and other applications in population genetics. The answer is, “yes, but you need to be cautious.” What does cautious mean? Let’s take a look at the 1000 Genomes project for some examples.

VarSeq: A bioinformatics Swiss Army knife

If you’ve seen the recent webinars given by Gabe Rudy and Bryce Christensen, you’ve no doubt been impressed by the capabilities of VarSeq when it comes to annotation and filtering. However, we sometimes forget that the power that enables all this complex analysis can also be used in more mundane tasks like VCF subsetting. And although these day-to-day tasks don’t… Read more »

A little known fact about Box Plots

A helpful tool that is included in SVS, but that many of our customers may not know about, is the ability to create box plots, also called box-and-whisker plots. These are effective visualizations for comparing groups of numerical data through their quartiles. I’ll take you through a couple of different cases with examples.

Top 5 Webcasts to Watch at GoldenHelix.com

Genomic research is exploding. There is a plethora of new methods and workflows for research and clinical use. While we are a software company at heart, we find ourselves in the role of educators. Our customer interactions are about informing, teaching, and consulting. A few years back, we started with regular webcasts that took this idea to the next level…. Read more »

Turning SRA Files Into Usable BAMs and VCFs

In our recent webcast, Advancing Agrigenomic Discoveries with Sequencing and GWAS Research, Greta Linse Peterson featured bovine data which she downloaded from the NCBI website. The data was downloaded in SRA format, and in order to analyze the data in SVS, the files had to be converted to BAMs and then merged into a single VCF file. Since many of… Read more »

Back to Basics: Importing/Exporting Data in Imputation Program Data Formats with SVS

In a recent blog post (Comparing BEAGLE, IMPUTE2, and Minimac Imputation Methods for Accuracy, Computation Time, and Memory Usage), Autumn Laughbaum compared three imputation programs. Data can be exported from, or imported into, SVS in the standard file formats for these and other imputation programs. The goal of this blog post will be to review the different tools available to… Read more »

Follow Along on an Analyst’s Journey to Filter Whole Genome Data to Four Candidate Variants in SVS

Last week Khanh-Nhat Tran-Viet, Manager/Research Analyst II at Duke University, presented the webcast: Insights: Identification of Candidate Variants using Exome Data in Ophthalmic Genetics. (That link has the recording if you are interested in viewing.) In it, Khanh-Nhat highlighted tools available in SVS that might be underused or were recently updated. These tools were used in his last three… Read more »

Dr. Ken Kaufman’s Webcast on Exome Sequencing Wildly Successful

Thank you to everyone who joined us yesterday for a webcast by Dr. Ken Kaufman of Cincinnati Children’s Hospital: “Identification of Candidate Functional Polymorphism Using Trio Family Whole Exome DNA Data.” Over 750 people registered for this event and 430 attended – a new Golden Helix record! If you missed the webcast (or would like to watch it again), the… Read more »

Why You Should Care About Segmental Duplications

My work in the GHI analytical services department gives me the opportunity to handle data from a variety of sources.  I have learned over time that every genotyping platform has its own personality.  Every time we get data from a new chip, I tend to learn something new about the quirks of genotyping technology.  I usually discover these quirks the… Read more »

How SVS Treats Gender in Calculating Genotype Statistics

Recently several customers have asked how SNP & Variation Suite (SVS) treats gender when calculating genotype statistics. In this blog post, I will cover SVS’ current capabilities, what we have available through Python scripts, and what is coming in the near future. We thank all of our customers who have inquired about these capabilities and have given us valuable feedback… Read more »

Analyzing PacBio Data with SNP & Variation Suite

As most in the Golden Helix community are aware, SNP & Variation Suite (SVS) can handle all sorts of data, including files from Affymetrix, Illumina, Agilent, and Complete Genomics, with its powerful data management capabilities. As announced in February, SVS is now part of the Pacific Biosciences Partner Program and has the ability to analyze PacBio files.

Creating Annotation Tracks from 1000 Genomes Phase 1 Data

If you have ever worked with NGS variant data, you may have come to realize that the first task at hand is the seemingly simple categorization of your variants into two bins: known and novel. Of course, if you’ve ever worked with NGS variant data, you may have also come to the realization that this step is more complex than… Read more »

Sequence Analysis Methods Not Just for Sequence Data

Speaking as somebody with a long history in data analysis, there are few things I find more exciting and tantalizing than new analysis methods that might apply to a problem I am trying to solve or was unable to solve in the past.  Whenever I make a breakthrough in one project, I find I want to abandon the current project… Read more »

Marker Map Manipulation Improvements in SVS 7.5

Manipulating a marker map in SVS has never been easier, thanks to expanded functionality in SVS 7.5.  Have you ever wanted to view annotation data next to marker map data?  Or expand the current marker map with spreadsheet data to create a custom map?  SVS 7.5 features two new functions that can accomplish these tasks. Adding Annotation Data to a… Read more »