Tag Archives: Big Data

Scaling is in our DNA: Making Genomics Accessible

Scalable Data

Scaling is in our DNA: Making Genomics Accessible One of the things I absolutely love about the work we do at Golden Helix is keeping up with the changes in data analysis driven by the iterative and generational leaps in technology. But one thing has always been a constant since day one: we break preconceived notions of what scale of… Read more »

Genomic Data is Big Data, But That is not the Hard Part

Big Data

There is no doubt that we have big data in the field of genomics in general and Next Generation Sequencing specifically. Illumina’s latest HiSeq X can produce 16 genomes per run, resulting in terabytes of raw data to crunch through. Yet all that crunching is not the hard part. So, what is the main obstacle to scientists being able to… Read more »

False Positives in Big Data Analytics

We had a lot to celebrate recently. Last year was the 300th anniversary of Jacob Bernoulli’s Ars Conjectandi. In this book he consolidated central ideas in probability theory, such as the very first version of the law of large numbers. It was also the 250th anniversary of  Bayes theorem named after Thomas Bayes (1701–1761), who first suggested using the theorem to update beliefs.