Background: Genome-wide association studies have identified polymorphisms associated with breast cancer subtypes and across multiple population subgroups; however, few studies to date have applied linkage analysis to other population groups.

Methods: We performed the first genome-wide breast cancer linkage analysis in 106 African American families (comprising 179 affected and 79 unaffected members) not known to be segregating BRCA mutations to search for novel breast cancer loci. We performed regression-based model-free multipoint linkage analyses of the sibling pairs using SIBPAL, and two-level Haseman–Elston linkage analyses of affected relative pairs using RELPAL.

Results: We identified −log10P values that exceed 4 on chromosomes 3q and 12q, as well as a region near BRCA1 on chromosome 17 (−log10P values in the range of 3.0–3.2) using both sibling-based and relative-based methods; the latter observation may suggest that undetected BRCA1 mutations or other mutations nearby such as HOXB13 may be segregating in our sample.

Conclusions: In summary, these results suggest novel putative regions harboring risk alleles in African Americans that deserve further study.

Impact: We hope that our study will spur further family-based investigation into specific mechanisms for breast cancer disparities. Cancer Epidemiol Biomarkers Prev; 24(2); 442–7. ©2014 AACR.

Breast cancer is a complex phenotype that is known to display disparities according to race; in particular, African Americans are known to have higher premenopausal breast cancer incidence, and overall higher breast cancer mortality, compared with European Americans (1). Despite linkage analyses published decades ago for European American pedigrees (2–5) and numerous genome-wide association studies (GWAS) for the breast cancer phenotype (6–15) and more homogeneous breast cancer subtypes (16–21), and in different population groups (22–24), family-based linkage approaches have not been attempted, specifically in African Americans. Linkage analysis is a powerful tool for identifying the presence of novel loci for both Mendelian and complex diseases and phenotypes; several successes have been reported for colon (25), prostate (26), and lung cancers (27).

It is recognized that difficulties exist in recruitment of minorities for genetic research, and existing family-based data are limited. The Jewels in Our Genes study began in 2009 for the sole purpose of conducting linkage analysis in African American pedigrees to search for novel breast cancer loci. Using this resource, we conducted the first breast cancer genome-wide linkage analysis in African Americans.

Recruitment and study approval

Recruitment was focused primarily on sibling pairs that were both concordant and discordant for invasive breast cancer and as well as affected relative pairs (sibling, avuncular, grandparental, first cousin). In addition to recruiting women locally from Buffalo, participants were recruited via a partnership with the National Witness Project, nationally at conferences and via the Love Army of Women, and partnerships with investigators from existing epidemiologic studies (28). The Jewels in Our Genes study was approved by the University at Buffalo Health Sciences Institutional Review Board (IRB). Our collaborators also obtained IRB approvals to make further contact with previous participants, when necessary. Women who were interested and eligible were mailed out a packet containing the informed consent and HIPAA forms, study questionnaire, medical records consent, and an Oragene saliva collection kit (DNA Genotek). We obtained medical records to adjudicate date of diagnosis, invasive breast cancer diagnosis, and tumor characteristics where available.

Genotyping and quality control

Pedigrees were selected for genotyping (for the primary purpose of linkage analysis) according to relationship type and informativeness. DNA was isolated according to standard protocol from Oragene DNA kits (DNA Genotek) at Roswell Park Cancer Institute (Buffalo, New York). Normalization and analytic quality control of DNA samples and automated preparation of Affymetrix Axiom Genome-Wide AFR (Admixed population of West African and European ancestry) SNP microarrays (Affymetrix), was performed at Rutgers Environmental and Occupational Health Sciences Institute (Piscataway, New Jersey). Twenty individuals with a dish quality control (QC) < 0.82 were dropped from the analysis, as well as one sample with a dish QC of 0.828 and a call rate of 96.97%, for a total of 21 (6.4%) individuals who were excluded; this resulted in loss of 13 pedigrees. Average heterozygosity was calculated by individual for all SNPs; all values ranged from 0.199 to 0.245; therefore, no individuals were dropped from the analysis. Sex checks were all consistent with reports.

Preparation of SNPs for linkage and pedigree QC

We limited SNPs to those conforming to Hardy–Weinberg proportions (P > 0.05) with a call rate ≥99%, and as much as possible with minor allele frequency (MAF) ≥0.40. After thinning the SNPs to at least 0.33 cM apart, we added SNPs with smaller MAF, whenever possible, so that SNPs were not more than 0.67 cM apart. We thus prepared a panel of 8,261 SNPs for linkage analysis, 97% of which had MAF ≥ 0.40 (see Fig. 1). We then ran S.A.G.E. RELTEST and MARKERINFO (29) to look for relationship errors and marker inconsistencies; we corrected pedigree relationships as appropriate.

Figure 1.

Selection of SNPs for linkage analysis.

Figure 1.

Selection of SNPs for linkage analysis.

Close modal

Preparation of genetic map file

We generated a genetic map file by using the genetic map for African Americans that was built by Hinch and colleagues (30) based on about 2.1 million cross-overs in 30,000 unrelated African Americans.

Identity-by-descent estimation

We estimated founder marker allele frequencies from all the pedigree members by maximum likelihood for all the SNPs to be used for linkage using FREQ (29). Multipoint exact identity-by-descent (IBD) sharing for every pair of relatives was estimated by GENIBD (29) by calculating the likelihood of each inheritance vector at multiple markers to generate IBD distributions.

Ancestry estimation

To obtain population ancestry estimates, we selected 16,697 ancestry informative markers (AIM) that had allele frequency differences >0.5 between the HapMap samples of Utah residents with ancestry from northern and western Europe (CEU) and Yoruba (YRI); among them, 1,440 were in linkage equilibrium (r2 < 0.004). We evaluated linkage disequilibrium within a window of 10 SNPs, shifting the window five SNPs at each step. To assess what number of ancestral populations is appropriate in our sample, we selected 111 unrelated individuals from the pedigrees and ran STRUCTURE for 1, 2, 3, 4, and 5 groups (31) with the 1,440 AIMs in linkage equilibrium, and assessed where −2ln likelihood of these models plateaued; based on previous analyses of African Americans, we expected that two groups would be sufficient (i.e., European and African).

Global ancestry for each individual was estimated using SNP weights (32), which infers ancestry proportions using genome-wide SNP weights that are precomputed from external reference panels (Hapmap 3 unrelated ancestry populations). The SNP weights were used to predict principal components (PC) for the target admixed samples and from them ancestry proportions were estimated for both related and unrelated individuals. The global African ancestry proportion for each genotyped individual was estimated using the genome-wide 131,274 SNPs that are common to both our data and the SNPs used by SNP weights, and these estimates were used to adjust for ancestry in linkage analyses.

Model-free linkage analysis

We performed sibpair model-free linkage analysis using SIBPAL, both the original Haseman–Elston regression and W4 regression using the best linear unbiased predictor (BLUP) of the sibship mean (29, 33). P values <0.05 of the statistic were evaluated by up to 1,000,000 permutation replicates. Using SIBPAL, we performed linkage analysis without the inclusion of any covariates, and with age or date of birth (DOB) and global ancestry as pair-specific covariates, each taken to be the sibpair difference, the sibpair sum, or both the sibpair sum and the difference. We considered permutation −log10P values ≥4 (equivalent to LOD ≥ 3) as suggestive for linkage significance for our all of our model-free linkage analyses.

We also applied regression-based model-free two-level Haseman–Elston linkage analyses for the breast cancer phenotype using RELPAL (34, 35), which also models trait data from relative pairs as a function of marker allele sharing IBD, but allows for adjusting for covariates at the individual level. This analysis included 180 affected relative pairs. At the first level, individual level global ancestry and age or DOB were used as covariates. All individuals are used at the first level; all pairs of relatives are used at the second level linkage analysis. We estimated empirical P values with up to 1,000,000 permutations. Linkage significance was evaluated using the IBD variance, the robust sandwich variance, and the alternative variance.

Of the 127 African American families recruited to the study (n = 341 women), 281 women (197 affected, 84 unaffected) from 106 families were selected for genotyping based on informativeness. Twenty-seven pedigrees had one affected, 68 pedigrees had 2, and 10 pedigrees had 3 affected individuals.

Our analyses suggest that our data best fit two ancestry groups (Supplementary Fig. S1). The average ± SD African proportion in our sample was 0.78 ± 0.13.

When we modeled linkage without including any covariates, we obtained −log10P values of 4.37 and 3.93 for the original Haseman–Elston method (Fig. 2) and W4 with BLUP means, respectively, on chromosome 3. The chromosome 3 peak was also suggestive of significance (−log10P value = 4.08) when using W4 regression with BLUP means and including as covariates date of birth, using the sibpair sum and the difference, and the sibpair difference in global ancestry (Supplementary Fig. S2). The other peak we identified using SIBPAL was on chromosome 16. This peak was less consistent across the many models we examined and reached −log10P value of 3.99 at 83.5 cM when date of birth, the sibpair sum and difference, and the sibpair difference in global ancestry were included in the regression model as covariates (Supplementary Fig. S3). Table 1 shows the estimates of mean allele sharing for siblings at the chromosome 3 and 16 peaks that were statistically significant in SIBPAL. The most significant marker on chromosome 3q displayed excess allele sharing in concordantly affected and concordantly unaffected siblings, and less than expected allele sharing in discordant pairs, providing evidence for linkage. The chromosome 16 peak showed a pattern of less than expected allele sharing in both concordantly affected sibling pairs and discordantly affected sibling pairs, which represents conflicting evidence for linkage.

Figure 2.

Genome-wide model-free multipoint linkage by SIBPAL, P values based on up to 1,000,000 permutations (Haseman–Elston regression), without adjustment for any covariates.

Figure 2.

Genome-wide model-free multipoint linkage by SIBPAL, P values based on up to 1,000,000 permutations (Haseman–Elston regression), without adjustment for any covariates.

Close modal
Table 1.

Chromosome regions with the most statistically significant linkage: mean tests and Haseman–Elston regression (in sibling pairs only)

Concordantly affected sibling pairs (n = 27)Discordantly affected sibling pairs (n = 84)Concordantly unaffected sibling pairs (n = 19)
Chromosome location (cM)Allele sharingSEPAllele sharingSEPAllele sharingSEPH–E P valuea
3 (196) 0.587 0.060 0.079 0.415 0.030 0.001 0.589 0.055 0.061 0.00008b 
16 (83) 0.486 0.068 0.580 0.475 0.037 0.252 0.530 0.066 0.327 0.00010c 
Concordantly affected sibling pairs (n = 27)Discordantly affected sibling pairs (n = 84)Concordantly unaffected sibling pairs (n = 19)
Chromosome location (cM)Allele sharingSEPAllele sharingSEPAllele sharingSEPH–E P valuea
3 (196) 0.587 0.060 0.079 0.415 0.030 0.001 0.589 0.055 0.061 0.00008b 
16 (83) 0.486 0.068 0.580 0.475 0.037 0.252 0.530 0.066 0.327 0.00010c 

NOTE: The three mean tests are not adjusted for any covariates. All tests are one-sided as implemented in SIBPAL.

aBased on 1 million replicate permutations.

bHaseman–Elston regression with W4 regression (dependent variable is a weighted combination of sibpair mean-corrected squared trait sum and sibpair squared trait difference), and adjusting for date of birth (pairwise sum and absolute difference) and African ancestry proportion (pairwise absolute difference).

cOriginal Haseman–Elston regression (dependent variable is sibpair squared trait difference), and adjusted for date of birth (pairwise sum and absolute difference) and African ancestry proportion (pairwise absolute difference).

RELPAL estimates for a region on chromosome 12 (109.9 cM) achieved a maximum −log10P value of 4.29 (Fig. 3). The region on chromosome 12 reached 4 for IBD variance and alternative variance methods only; the robust sandwich variance estimate here reached −log10P value = 3.12.

Figure 3.

Model-free multipoint linkage by RELPAL using all affected relative pairs and the IBD variance, adjusting for date of birth and global ancestry at the individual level. P values evaluated on the basis of up to 1,000,000 permutations.

Figure 3.

Model-free multipoint linkage by RELPAL using all affected relative pairs and the IBD variance, adjusting for date of birth and global ancestry at the individual level. P values evaluated on the basis of up to 1,000,000 permutations.

Close modal

With regard to concordance of SIBPAL and RELPAL results at the chromosome 3 and 12 peaks, the strongest signal we observed on chromosome 12 using SIBPAL (at the strongest RELPAL peak) was −log10P value = 2.09. The chromosome 3 peak identified as suggestive for linkage in SIBPAL analyses did not reach our threshold for suggestive significance in RELPAL (all −log10P values < 1.47).

Interestingly, our analyses were suggestive of linkage to chromosome 17 when covariates (DOB and global ancestry) were included in the model using both sibling-based and affected relative pair-based analyses; original SIBPAL Haseman–Elston estimates ranged from 3.10 to 3.16, and RELPAL ranged from 1.38 to 3.24 (Supplementary Fig. S4).

We examined what genes are located under the peaks, considering boundaries where −log10P values were above 3.0 (approximately equivalent to a 1 LOD drop): chromosome 3 (194.6–200.7 cM), chromosome 12 (107.2–117.8 cM), and chromosome 16 (79.6–87.1 cM). Here, we converted genetic position to physical position using Ensembl release 75, which imports SNP information from release 138 of dbSNP. Using these base pair regions, we identified genes under each linkage peak from hg38 refGene via the UCSC Genome Browser. The chromosome 3q and 12q peaks do not include genes directly implicated in common cancer phenotypes to date; however, SOX2 on 3q codes for a transcription factor involved in embryonic development.

Under our chromosome 16q peak lies CDH1 that codes for E-cadherin, a calcium-dependent adhesion molecule that regulates differentiation, invasion, and metastasis. CDH1 was identified as associated with familial gastric cancer via linkage studies (36, 37). This gene is also associated with familial breast (38) and prostate cancer (39). CDH1 is recognized as a high-risk breast cancer gene due to its high penetrance and lifetime risk for mutation carriers, in the range of 40% to 50% (40). In our sample of 106 families, 10% reported a family history of gastric/stomach cancer and 3% reported a family history of esophageal cancer, suggesting that our sample is segregating a similar hereditary gastric-breast cancer syndrome.

We examined whether published breast cancer GWAS have revealed any common variants that lie under or near our linkage peaks. The Framingham breast cancer GWAS study identified a hit (rs10513754) on chromosome 3 located in the LINC00578 gene that codes for a non–protein-coding RNA; however, this SNP did not reach genome-wide significance (8). A recent breast cancer GWAS study in African American unrelateds reported association with rs13074711 on 3q26.31, nearby to TNFSF10 (24), which is near our identified 1 LOD-drop equivalent on 3q26-27. TNFSF10 codes for a cytokine that is a member of the tumor necrosis factor ligand family, and is implicated in apoptosis. Kim and colleagues reported a marginally significant association with 3q26.32 rs3806685 in a Korean study (14).

Similarly, Antoniou and colleagues reported associations of breast cancer with polymorphisms on 12q24 (rs1292011) for BRCA1 and/or BRCA2 mutation carriers (20), which is near the region we identified on 12q22-23. This potential overlap on chromosome 12, in conjunction with our suggestive findings of unidentified BRCA1 mutations in our sample, indicates that these two regions deserve further joint consideration in the family-based context. Investigators recently analyzed whether two GWAS hits on 12q (12q22: rs17356907, ref. 41; and 12q24: rs1292011, ref. 42) identified in samples of European ancestry replicated in African Americans (22). For both of these SNPs, the genetic associations originally identified in the samples of European ancestry were attenuated in the African American analyses. Collectively, these findings may suggest that rare variants, possibly in combination with other rare variants, may be driving our linkage peak.

Although we excluded African American women who were tested and positive for BRCA1 mutations, we cannot rule out that there are other BRCA1 mutations segregating in our sample. Nonetheless, this suggestive finding gives us some confidence that the other linkage signals we identified where we adjusted for covariates are less likely to be false positives.

In our sample of families, we were able to adjudicate 72% (141 of 197) of the medical records; the reason for missing clinical information was largely due to time limits for facilities in maintaining records. The mean age at diagnosis in our cases included in the linkage study is 51.6 (SD = 12.0). Among the women for whom we obtained medical records, 31% were estrogen receptor (ER) negative, 50% were progesterone receptor (PR) negative, and 72% were HER2/neu negative. Taking all three into account resulted in 16% of our sample being triple negative; this approximates the proportion of invasive breast cancers in the United States that are triple negative. Interestingly, in our sample of 106 families, only three pedigrees showed full concordance of triple-negative status. Further family-based recruitment studies along with full adjudication of tumor phenotype will allow for investigation of differences in linkage signals when taking into account tumor heterogeneity. It is important to note that linkage signals may result from causal alleles up to as far away as five megabases on each side of the peak; therefore, we cannot rule out that our signal indicates the presence of causal variants, but not under a 1-LOD drop of the peaks we identified. We also note that there are other candidates on chromosome 17 where BRCA1 is located, and this includes HOXB13 (43, 44). This chromosome 17 region deserves additional follow-up to refine our signal.

One limitation of our study is sample size. Future studies developed like ours in partnership with community groups and existing epidemiologic studies with a well-considered recruitment protocol should be conducted to increase the availability of family samples for study. This will not only allow for further refinement and perhaps the identification of novel linkage peaks, but also for replication of GWAS to determine to what degree the common GWAS-identified polymorphisms also contribute to familial breast cancer, defined here loosely, as our study is largely limited to families with two affected women.

Because our study was not able to consider genetic heterogeneity due to the limited sample size, larger family-based linkage studies in African Americans should be conducted with the application of more recent developments in linkage methods for complex disease to overcome issues related to disease heterogeneity (45, 46), locus heterogeneity, and incomplete penetrance, which can decrease power to detect linkage. Additional family-based recruitment and linkage analyses in a sample with more homogeneous subsets of families will increase the ability to detect linkage. Furthermore, because the genetic architecture of breast cancer is in itself complicated by pleiotropy, allelic heterogeneity, and the polygenic nature of cancer, novel methods are needed to take into account the complexity and architecture of breast cancer in admixed samples. One breast cancer admixture study in African Americans produced no specific regions associated with breast cancer; however, there were differences in global ancestry according to ER and PR status and stage of disease (47).

In conclusion, our study is the first to localize novel breast cancer susceptibility loci in pedigrees of African ancestry. We identified three linkage peaks at two regions on chromosomes 3 and 16 and one region on chromosome 12. The new peaks, which may harbor susceptibility loci unique to African Americans, represent loci that deserve additional scrutiny. On the basis of the overlap we report herein of our putative linkage peaks and GWAS findings, our data support the need to appreciate family-based designs and the linkage approach for complex disease mapping of both rare and common variants. Overall, these preliminary data suggest that there are novel putative regions harboring risk alleles for breast cancer in African American families that deserve further study in larger samples.

No potential conflicts of interest were disclosed.

Conception and design: H.M. Ochs-Balcom, D.O. Erwin, L. Jandorf, L. Sucheston-Campbell, R.C. Elston

Development of methodology: H.M. Ochs-Balcom, D.O. Erwin, R.C. Elston

Acquisition of data (provided animals, acquired and managed patients, provided facilities, etc.): H.M. Ochs-Balcom, D.O. Erwin, L. Jandorf

Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): H.M. Ochs-Balcom, X. Sun, Y. Chen, J. Barnholtz-Sloan, R.C. Elston

Writing, review, and/or revision of the manuscript: H.M. Ochs-Balcom, X. Sun, Y. Chen, J. Barnholtz-Sloan, D.O. Erwin, L. Jandorf, L. Sucheston-Campbell, R.C. Elston

Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): H.M. Ochs-Balcom

Study supervision: H.M. Ochs-Balcom, D.O. Erwin, R.C. Elston

The authors extend a special thanks to Veronica Meadows-Ray and her family. Mrs. Meadows-Ray, who insists that her family's breast cancer is genetic, spurred the development of the Jewels in Our Genes study. The authors thank the National Witness Project and its Steering Committee for helping them with this work, the Buffalo Witness Project, Rosa Bordonaro, Mary Crawford, Anne Weaver, Youjin Wang, Laurie Grieshober, and others who assisted the Jewels team for their tireless efforts. Others who were instrumental in study recruitment include Greg Ciupak and Drs. Christine Ambrosone and Elisa Bandera (Women's Circle of Health Study), Dr. Susan Kadlubar (Spit for the Cure study), Drs. Cheryl Thompson and Li Li (Breast Density Study), Jennie Ellison (GATE/TACT study), and Teri Deans-McFarlane (Witness Project of Harlem). Thank you to the Dr. Susan Love Research Foundation's Army of Women Program. The authors also thank all of the families who participated in their study.

This article is dedicated to Ms. Mattye Willis, who tirelessly helped to recruit families to the study and inspired her community to be champions of scientific research via her role in the National Witness Project and as a special consultant for the Jewels in Our Genes study before her passing in June 2014.

H.M. Ochs-Balcom was supported by a grant from Susan G. Komen for the Cure and the School of Public Health and Health Professions at the University at Buffalo.

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1.
Howlader
N
,
Noone
AM
,
Krapcho
M
,
Garshell
J
,
Neyman
N
,
Altekruse
SF
, et al
In
SEER Cancer Statistics Review, 1975-2010
.
National Cancer Institute
,
Bethesda, MD
; 
2013
.
2.
Easton
DF
,
Bishop
DT
,
Ford
D
,
Crockford
GP
. 
Genetic linkage analysis in familial breast and ovarian cancer: results from 214 families. The Breast Cancer Linkage Consortium
.
Am J Hum Genet
1993
;
52
:
678
701
.
3.
Goldgar
DE
,
Cannon-Albright
LA
,
Oliphant
A
,
Ward
JH
,
Linker
G
,
Swensen
J
, et al
Chromosome 17q linkage studies of 18 Utah breast cancer kindreds
.
Am J Hum Genet
1993
;
52
:
743
8
.
4.
Hall
JM
,
Lee
MK
,
Newman
B
,
Morrow
JE
,
Anderson
LA
,
Huey
B
, et al
Linkage of early-onset familial breast cancer to chromosome 17q21
.
Science
1990
;
250
:
1684
9
.
5.
Narod
SA
,
Feunteun
J
,
Lynch
HT
,
Watson
P
,
Conway
T
,
Lynch
J
, et al
Familial breast-ovarian cancer locus on chromosome 17q12-q23
.
Lancet
1991
;
338
:
82
3
.
6.
Easton
DF
,
Pooley
KA
,
Dunning
AM
,
Pharoah
PD
,
Thompson
D
,
Ballinger
DG
, et al
Genome-wide association study identifies novel breast cancer susceptibility loci
.
Nature
2007
;
447
:
1087
93
.
7.
Cox
A
,
Dunning
AM
,
Garcia-Closas
M
,
Balasubramanian
S
,
Reed
MW
,
Pooley
KA
, et al
A common coding variant in CASP8 is associated with breast cancer risk
.
Nat Genet
2007
;
39
:
352
8
.
8.
Murabito
JM
,
Rosenberg
CL
,
Finger
D
,
Kreger
BE
,
Levy
D
,
Splansky
GL
, et al
A genome-wide association study of breast and prostate cancer in the NHLBI's Framingham Heart Study
.
BMC Med Genet
2007
;
8
Suppl 1
:
S6
.
9.
Ahmed
S
,
Thomas
G
,
Ghoussaini
M
,
Healey
CS
,
Humphreys
MK
,
Platte
R
, et al
Newly discovered breast cancer susceptibility loci on 3p24 and 17q23.2
.
Nat Genet
2009
;
41
:
585
90
.
10.
Gold
B
,
Kirchhoff
T
,
Stefanov
S
,
Lautenberger
J
,
Viale
A
,
Garber
J
, et al
Genome-wide association study provides evidence for a breast cancer risk locus at 6q22.33
.
Proc Natl Acad Sci U S A
2008
;
105
:
4340
5
.
11.
Zheng
W
,
Long
J
,
Gao
YT
,
Li
C
,
Zheng
Y
,
Xiang
YB
, et al
Genome-wide association study identifies a new breast cancer susceptibility locus at 6q25.1
.
Nat Genet
2009
;
41
:
324
8
.
12.
Thomas
G
,
Jacobs
KB
,
Kraft
P
,
Yeager
M
,
Wacholder
S
,
Cox
DG
, et al
A multistage genome-wide association study in breast cancer identifies two new risk alleles at 1p11.2 and 14q24.1 (RAD51L1)
.
Nat Genet
2009
;
41
:
579
84
.
13.
Turnbull
C
,
Ahmed
S
,
Morrison
J
,
Pernet
D
,
Renwick
A
,
Maranian
M
, et al
Genome-wide association study identifies five new breast cancer susceptibility loci
.
Nat Genet
2010
;
42
:
504
7
.
14.
Kim
HC
,
Lee
JY
,
Sung
H
,
Choi
JY
,
Park
SK
,
Lee
KM
, et al
A genome-wide association study identifies a breast cancer risk variant in ERBB4 at 2q34: results from the Seoul Breast Cancer Study
.
Breast Cancer Res
2012
;
14
:
R56
.
15.
Siddiq
A
,
Couch
FJ
,
Chen
GK
,
Lindstrom
S
,
Eccles
D
,
Millikan
RC
, et al
A meta-analysis of genome-wide association studies of breast cancer identifies two novel susceptibility loci at 6q14 and 20q11
.
Hum Mol Genet
2012
;
21
:
5373
84
.
16.
Stacey
SN
,
Manolescu
A
,
Sulem
P
,
Rafnar
T
,
Gudmundsson
J
,
Gudjonsson
SA
, et al
Common variants on chromosomes 2q35 and 16q12 confer susceptibility to estrogen receptor-positive breast cancer
.
Nat Genet
2007
;
39
:
865
9
.
17.
Stacey
SN
,
Manolescu
A
,
Sulem
P
,
Thorlacius
S
,
Gudjonsson
SA
,
Jonsson
GF
, et al
Common variants on chromosome 5p12 confer susceptibility to estrogen receptor-positive breast cancer
.
Nat Genet
2008
;
40
:
703
6
.
18.
Gaudet
MM
,
Kirchhoff
T
,
Green
T
,
Vijai
J
,
Korn
JM
,
Guiducci
C
, et al
Common genetic variants and modification of penetrance of BRCA2-associated breast cancer
.
PLoS Genet
2010
;
6
:
e1001183
.
19.
Haiman
CA
,
Chen
GK
,
Vachon
CM
,
Canzian
F
,
Dunning
A
,
Millikan
RC
, et al
A common variant at the TERT-CLPTM1L locus is associated with estrogen receptor-negative breast cancer
.
Nat Genet
2011
;
43
:
1210
4
.
20.
Antoniou
AC
,
Kuchenbaecker
KB
,
Soucy
P
,
Beesley
J
,
Chen
X
,
McGuffog
L
, et al
Common variants at 12p11, 12q24, 9p21, 9q31.2 and in ZNF365 are associated with breast cancer risk for BRCA1 and/or BRCA2 mutation carriers
.
Breast Cancer Res
2012
;
14
:
R33
.
21.
Lambrechts
D
,
Truong
T
,
Justenhoven
C
,
Humphreys
MK
,
Wang
J
,
Hopper
JL
, et al
11q13 is a susceptibility locus for hormone receptor positive breast cancer
.
Hum Mutat
2012
;
33
:
1123
32
.
22.
Feng
Y
,
Stram
DO
,
Rhie
SK
,
Millikan
RC
,
Ambrosone
CB
,
John
EM
, et al
A comprehensive examination of breast cancer risk loci in African American women
.
Hum Mol Genet
2014
;
23
:
5518
26
.
23.
Long
J
,
Cai
Q
,
Sung
H
,
Shi
J
,
Zhang
B
,
Choi
JY
, et al
Genome-wide association study in east Asians identifies novel susceptibility loci for breast cancer
.
PLoS Genet
2012
;
8
:
e1002532
.
24.
Chen
F
,
Chen
GK
,
Stram
DO
,
Millikan
RC
,
Ambrosone
CB
,
John
EM
, et al
A genome-wide association study of breast cancer in women of African ancestry
.
Hum Genet
2013
;
132
:
39
48
.
25.
Wiesner
GL
,
Daley
D
,
Lewis
S
,
Ticknor
C
,
Platzer
P
,
Lutterbaugh
J
, et al
A subset of familial colorectal neoplasia kindreds linked to chromosome 9q22.2-31.2
.
Proc Natl Acad Sci U S A
2003
;
100
:
12961
5
.
26.
Goddard
KA
,
Witte
JS
,
Suarez
BK
,
Catalona
WJ
,
Olson
JM
. 
Model-free linkage analysis with covariates confirms linkage of prostate cancer to chromosomes 1 and 4
.
Am J Hum Genet
2001
;
68
:
1197
206
.
27.
Bailey-Wilson
JE
,
Amos
CI
,
Pinney
SM
,
Petersen
GM
,
de Andrade
M
,
Wiest
JS
, et al
A major lung cancer susceptibility locus maps to chromosome 6q23-25
.
Am J Hum Genet
2004
;
75
:
460
74
.
28.
Ochs-Balcom
HM
,
Jandorf
L
,
Wang
Y
,
Johnson
D
,
Meadows Ray
V
,
Willis
MJ
, et al
“It takes a village”: multilevel approaches to recruit African Americans and their families for genetic research
.
J Community Genet
2014 Aug 12
;
[Epub ahead of print]
.
29.
S.A.G.E
. 
Statistical Analysis for Genetic Epidemiology
.
version 6.3
, 
2012
.
30.
Hinch
AG
,
Tandon
A
,
Patterson
N
,
Song
Y
,
Rohland
N
,
Palmer
CD
, et al
The landscape of recombination in African Americans
.
Nature
2011
;
476
:
170
5
.
31.
Pritchard
JK
,
Stephens
M
,
Donnelly
P
. 
Inference of population structure using multilocus genotype data
.
Genetics
2000
;
155
:
945
59
.
32.
Chen
CY
,
Pollack
S
,
Hunter
DJ
,
Hirschhorn
JN
,
Kraft
P
,
Price
AL
. 
Improved ancestry inference using weights from external reference panels
.
Bioinformatics
2013
;
29
:
1399
406
.
33.
Shete
S
,
Jacobs
KB
,
Elston
RC
. 
Adding further power to the Haseman and Elston method for detecting linkage in larger sibships: weighting sums and differences
.
Hum Hered
2003
;
55
:
79
85
.
34.
Wang
T
,
Elston
RC
. 
Two-level Haseman-Elston regression for general pedigree data analysis
.
Genet Epidemiol
2005
;
29
:
12
22
.
35.
Wang
T
,
Elston
RC
. 
A quantitative linkage score for an association study following a linkage analysis
.
BMC Genet
2006
;
7
:
5
.
36.
Guilford
PJ
,
Hopkins
JB
,
Grady
WM
,
Markowitz
SD
,
Willis
J
,
Lynch
H
, et al
E-cadherin germline mutations define an inherited cancer syndrome dominated by diffuse gastric cancer
.
Hum Mutat
1999
;
14
:
249
55
.
37.
Oliveira
C
,
Pinheiro
H
,
Figueiredo
J
,
Seruca
R
,
Carneiro
F
. 
E-cadherin alterations in hereditary disorders with emphasis on hereditary diffuse gastric cancer
.
Prog Mol Biol Transl Sci
2013
;
116
:
337
59
.
38.
Schrader
KA
,
Masciari
S
,
Boyd
N
,
Wiyrick
S
,
Kaurah
P
,
Senz
J
, et al
Hereditary diffuse gastric cancer: association with lobular breast cancer
.
Fam Cancer
2008
;
7
:
73
82
.
39.
Jonsson
BA
,
Adami
HO
,
Hagglund
M
,
Bergh
A
,
Goransson
I
,
Stattin
P
, et al
-160C/A polymorphism in the E-cadherin gene promoter and risk of hereditary, familial and sporadic prostate cancer
.
Int J Cancer
2004
;
109
:
348
52
.
40.
Maxwell
KN
,
Domchek
SM
. 
Familial breast cancer risk
.
Curr Breast Cancer Rep
2013
;
5
:
170
82
.
41.
Michailidou
K
,
Hall
P
,
Gonzalez-Neira
A
,
Ghoussaini
M
,
Dennis
J
,
Milne
RL
, et al
Large-scale genotyping identifies 41 new loci associated with breast cancer risk
.
Nat Genet
2013
;
45
:
353
61
.
42.
Ghoussaini
M
,
Fletcher
O
,
Michailidou
K
,
Turnbull
C
,
Schmidt
MK
,
Dicks
E
, et al
Genome-wide association analysis identifies three new breast cancer susceptibility loci
.
Nat Genet
2012
;
44
:
312
8
.
43.
Alanee
S
,
Couch
F
,
Offit
K
. 
Association of a HOXB13 variant with breast cancer
.
N Engl J Med
2012
;
367
:
480
1
.
44.
Ewing
CM
,
Ray
AM
,
Lange
EM
,
Zuhlke
KA
,
Robbins
CM
,
Tembe
WD
, et al
Germline mutations in HOXB13 and prostate-cancer risk
.
N Engl J Med
2012
;
366
:
141
9
.
45.
Fang
S
,
Pinney
SM
,
Bailey-Wilson
JE
,
de Andrade
MA
,
Li
Y
,
Kupert
E
, et al
Ordered subset analysis identifies loci influencing lung cancer risk on chromosomes 6q and 12q
.
Cancer Epidemiol Biomarkers Prev
2010
;
19
:
3157
66
.
46.
Simpson
CL
,
Cropp
CD
,
Wahlfors
T
,
George
A
,
Jones
MS
,
Harper
U
, et al
Genetic heterogeneity in Finnish hereditary prostate cancer using ordered subset analysis
.
Eur J Hum Genet
2013
;
21
:
437
43
.
47.
Fejerman
L
,
Haiman
CA
,
Reich
D
,
Tandon
A
,
Deo
RC
,
John
EM
, et al
An admixture scan in 1,484 African American women with breast cancer
.
Cancer Epidemiol Biomarkers Prev
2009
;
18
:
3110
7
.