Basal-like Breast Cancer DNA copy number losses
identify genes involved in genomic instability,
response to therapy, and patient survival

Victor J. Weigman1,2,3,*, Hann-Hsiang Chao4,*, Andrey A. Shabalin5,8, Xiaping He2,4, Joel S. Parker2,4, Silje Nordgard9, Tatyana Grushko10, Dezheng Huo10, Chika Nwachukwu10, Andrew Nobel5,8, Vessela N. Kristensen9,11,12 Anne-Lisa Børresen-Dale9,12, Olufunmilayo I. Olopade10, Charles M. Perou2,4,6,7

SupWald Identification of dna-copy CHanges (SWITCHdna)

Supplemental Website

 

1 Program in Bioinformatics and Computational Biology,
2 Lineberger Comprehensive Cancer Center,
3 Department of Biology,
4 Department of Genetics,
5 Department of Statistics and Operations Research,
6 Carolina Center for Genome Sciences,
7 Department of Pathology and Laboratory Medicine,
8 Department of Biostatistics

University of North Carolina at Chapel Hill, Chapel Hill, NC 27599

9 Department of Genetics, Institute for Cancer Research, Oslo University Hospital, Radiumhospitalet, Norway
10 Center for Clinical Cancer Genetics and Global Health, University of Chicago Medical Center, Chicago, IL 60615

11 Department of Clinical Molecular Biology (EpiGen), Akershus University Hospital, University of Oslo, Oslo, Norway

12 Institute for Clinical Medicince, Faculty of Medicine, University of Oslo, Oslo, Norway

 

*=these authors made equal contributions

 

# = Corresponding author

 

In the paper we introduce a new method, SWITCHdna, to identify regions of Copy Number Aberrations (CNAs) based on intensity data generated from copy number platforms. The method is a modification of change point detection method used by Lickwar et al. (2009).

The SWITCHdna method performs identification of transition points followed by significance testing of defined segments. The package also includes functions for visual representation of the findings. The method is described in the paper and the instructions are provided in the manual.

In SWITCHdna, we perform estimation of transition points sequentially. First, we test for the presence of at least one transition point (versus none) across the entire chromosome. To conduct the test we calculate F-statistic for the difference of means for every candidate (possible) transition point such that the size of the region on either side of the breakpoint was larger than ±. If the maximum observed F-statistic is below a user-specified threshold, in other words the test shows no significant deviation from the null model (or no change points), we accept K=0. Otherwise, we say that the transition happened at the location of the maximum F-statistic. The testing procedure is then applied separately to the regions to the left and to the right of the transition point. Following detection of the transition points, for each segment we record its average value, number of observations in the region, and the corresponding z-score. We filter out segments with a z-score less than 3 and a average intensity measurement of 0.09 or less.

The code was tested on both Windows and Linux systems on both 32 and 64 bit versions of R and on Revolution R. Questions about the package should be sent to weigman-AT-gmail.com.

Download:

SWITTCHdna R code.
Requires R programming languge.

SWITTCHdna Manual.
Adobe .pdf format, may require Adobe Reader.

Sample input data.
Includes sample aCGH data, normalized and Log-transformed, and supplementary files.
The landscape plot presented below can be generated via analysis of the sample data using the code in
readme.txt file.

 

Visualisation of the findings.

The SWITCHdna package includes functions for visual representation of the findings. They generate plots which show the frequencies of CNAs within the groups across the genome. For example, analysis of the provided sample data produces the following landscape plot:

(click the picture for bigger version)