SuperFreq: Integrated mutation detection and clonal tracking in cancer

doi:10.1101/380097

Posted Content•DOI•

SuperFreq: Integrated mutation detection and clonal tracking in cancer

Christoffer Flensburg¹, Tobias Sargeant¹, Alicia Oshlack², Ian J. Majewski³, Ian J. Majewski¹ - Show less +1 more•Institutions (3)

Walter and Eliza Hall Institute of Medical Research¹, Royal Children's Hospital², University of Melbourne³

30 Jul 2018-bioRxiv (Cold Spring Harbor Laboratory)-pp 380097

TL;DR: SuperFreq is a cancer exome sequencing analysis pipeline that integrates identification of somatic single nucleotide variants (SNVs) and copy number alterations (CNAs) and clonal tracking for both and can be applied in many different experimental settings for the analysis of exomes and other capture libraries.

read less

Abstract: Motivation Analysing multiple tumour samples from an individual cancer patient allows insight into the way the disease evolves. Monitoring the expansion and contraction of distinct clones helps to reveal the mutations that initiate the disease and those that drive progression; therefore, the ability to identify and track clones using genomics data is of great interest. Existing approaches for clonal tracking typically require the user to combine multiple tools that are not purpose-made. Furthermore, most methods require a matched normal (non-tumour) sample, which limits the scope of application. Results We have built superFreq, a cancer exome sequencing analysis tool that calls and annotates somatic SNVs and CNAs and attributes them to clones. SuperFreq makes use of unrelated control samples and does not require matched normal samples. We demonstrate the ability of superFreq to track clones by combining real samples in known proportions to simulating a multi-sample analysis. In addition, we compared superFreq to other somatic SNV callers and CNA callers on exome sequencing data from cancer-normal pairs, including 304 participants gathered from 33 cancer types in The Cancer Genome Atlas (TCGA). SuperFreq offers a reliable platform to identify somatic mutations and to track clones. SuperFreq recalled 91% of somatic SNVs identified by a consensus of four other methods, with a median of 1 additional somatic SNV per sample that was not found by any other method. CNA calls from superFreq showed good agreement with those generated by Sequenza, or those from ASCAT generated using matched SNP arrays. Using our simulated data set for testing multi-sample clonal tracking, we found that superFreq identified 93% of clones with a cellular fraction of at least 50%, and mutations were assigned to clones with high recall and close to 100% precision. In addition, SuperFreq maintained a similar level of performance for most aspects of the analysis without a matched normal control. SuperFreq is a highly adaptable method and has already been used in multiple different projects. Availability SuperFreq is implemented in R and available on github at https://github.com/ChristofferFlensburg/superFreq.

...read moreread less

SuperFreq: Integrated mutation detection and clonal tracking in cancer

Citations

Cites methods from "SuperFreq: Integrated mutation dete..."

References

"SuperFreq: Integrated mutation dete..." refers methods in this paper

"SuperFreq: Integrated mutation dete..." refers methods in this paper

"SuperFreq: Integrated mutation dete..." refers background in this paper

Related Papers (5)