Basal-like Breast Cancer DNA copy number losses
identify genes involved in genomic instability,
response to therapy, and patient survival
Victor J. Weigman1,2,3,*,
Hann-Hsiang Chao4,*,
Andrey
A. Shabalin5,8, Xiaping He2,4,
Joel S. Parker2,4,
Silje
Nordgard9, Tatyana Grushko10, Dezheng Huo10,
Chika Nwachukwu10, Andrew Nobel5,8,
Vessela N. Kristensen9,11,12 Anne-Lisa Børresen-Dale9,12,
Olufunmilayo I.
Olopade10, Charles
M. Perou2,4,6,7
SupWald
Identification of dna-copy CHanges (SWITCHdna)
Supplemental Website
1 Program in Bioinformatics
and Computational Biology,
2 Lineberger Comprehensive
Cancer Center,
3 Department of Biology,
4 Department of Genetics,
5 Department of Statistics and
Operations Research,
6 Carolina Center for Genome
Sciences,
7 Department of Pathology
and Laboratory Medicine,
8 Department of Biostatistics
University of North Carolina at Chapel Hill, Chapel Hill,
NC 27599
9 Department of Genetics, Institute for Cancer Research, Oslo University Hospital,
Radiumhospitalet, Norway
10 Center for Clinical
Cancer Genetics and Global Health, University
of Chicago Medical Center, Chicago, IL 60615
11 Department of Clinical Molecular Biology
(EpiGen), Akershus University Hospital, University of Oslo, Oslo, Norway
12 Institute for Clinical Medicince, Faculty of Medicine, University of Oslo, Oslo, Norway
*=these authors made equal contributions
# = Corresponding author
In the paper we introduce a new method, SWITCHdna , to identify regions of Copy
Number Aberrations (CNAs) based on intensity data generated from copy number
platforms. The method is a modification of change point detection method used
by Lickwar
et al. (2009).
The SWITCHdna
method performs identification of transition points followed by significance
testing of defined segments. The package also includes functions for visual representation
of the findings. The method is described in the paper and the instructions
are provided in the manual.
In SWITCHdna ,
we perform estimation of transition points sequentially. First, we test for
the presence of at least one transition point (versus none) across the entire
chromosome. To conduct the test we calculate F-statistic for the difference
of means for every candidate (possible) transition point such that the size
of the region on either side of the breakpoint was larger than ±. If the maximum
observed F-statistic is below a user-specified threshold, in other words the
test shows no significant deviation from the null model (or no change
points), we accept K=0. Otherwise, we say that the transition happened at the
location of the maximum F-statistic. The testing procedure is then applied
separately to the regions to the left and to the right of the transition
point. Following detection of the transition points, for each segment we
record its average value, number of observations in the region, and the
corresponding z-score. We filter out segments with a z-score less than 3 and
a average intensity measurement of 0.09 or less.
The code was tested on both Windows and Linux
systems on both 32 and 64 bit versions of R
and on Revolution
R. Questions about the package should be sent to weigman-AT-gmail.com.
|
|
Download:
 SWITTCHdna R code.
Requires R programming languge.
 SWITTCHdna Manual.
Adobe .pdf format, may require Adobe
Reader.
Sample
input data.
Includes sample aCGH data, normalized and Log-transformed, and supplementary
files.
The landscape plot presented below can be generated via analysis of the
sample data using the code in readme.txt file.
|
Visualisation of the findings.
The SWITCHdna
package includes functions for visual representation of the findings. They
generate plots which show the frequencies of CNAs within the groups across the
genome. For example, analysis of the provided sample data produces the
following landscape plot:

(click the picture for bigger version)