[HTML][HTML] SuperFreq: Integrated mutation detection and clonal tracking in cancer

C Flensburg, T Sargeant, A Oshlack… - PLoS computational …, 2020 - journals.plos.org
PLoS computational biology, 2020journals.plos.org
Analysing multiple cancer samples from an individual patient can provide insight into the
way the disease evolves. Monitoring the expansion and contraction of distinct clones helps
to reveal the mutations that initiate the disease and those that drive progression. Existing
approaches for clonal tracking from sequencing data typically require the user to combine
multiple tools that are not purpose-built for this task. Furthermore, most methods require a
matched normal (non-tumour) sample, which limits the scope of application. We developed …
Analysing multiple cancer samples from an individual patient can provide insight into the way the disease evolves. Monitoring the expansion and contraction of distinct clones helps to reveal the mutations that initiate the disease and those that drive progression. Existing approaches for clonal tracking from sequencing data typically require the user to combine multiple tools that are not purpose-built for this task. Furthermore, most methods require a matched normal (non-tumour) sample, which limits the scope of application. We developed SuperFreq, a cancer exome sequencing analysis pipeline that integrates identification of somatic single nucleotide variants (SNVs) and copy number alterations (CNAs) and clonal tracking for both. SuperFreq does not require a matched normal and instead relies on unrelated controls. When analysing multiple samples from a single patient, SuperFreq cross checks variant calls to improve clonal tracking, which helps to separate somatic from germline variants, and to resolve overlapping CNA calls. To demonstrate our software we analysed 304 cancer-normal exome samples across 33 cancer types in The Cancer Genome Atlas (TCGA) and evaluated the quality of the SNV and CNA calls. We simulated clonal evolution through in silico mixing of cancer and normal samples in known proportion. We found that SuperFreq identified 93% of clones with a cellular fraction of at least 50% and mutations were assigned to the correct clone with high recall and precision. In addition, SuperFreq maintained a similar level of performance for most aspects of the analysis when run without a matched normal. SuperFreq is highly versatile and can be applied in many different experimental settings for the analysis of exomes and other capture libraries. We demonstrate an application of SuperFreq to leukaemia patients with diagnosis and relapse samples.
PLOS