Abstract
The current paradigm in the development of high-grade serous ovarian carcinoma (HGSC) proposes that the majority of HGSCs arise from precursor serous tubal intraepithelial carcinoma (STIC) lesions of the fallopian tube. Here we survey genome-wide methylation in HGSC precursor lesions to identify genomic regions that exhibit high-specificity differential hypermethylation for potential use as biomarkers for detecting STIC and HGSC at stages when curative intervention likely remains feasible.
We first identified quality control criteria for performing reliable methylomic analysis of DNA-limited tubal precursor lesions with the Illumina Infinium MethylationEPIC array. We then used this platform to compare genome-wide methylation among 12 STICs with paired adjacent-normal epithelia, one p53 signature lesion and two samples of concurrent HGSC. The resulting methylomic data were analyzed by unsupervised hierarchical clustering and multidimensional analysis. Regions of high-confidence STIC-specific differential hypermethylation were identified using selective bioinformatic criteria and compared with published MethylationEPIC data from 23 HGSC tumors and 11 healthy fallopian tube mucosae.
Unsupervised analysis showed that STICs largely clustered with HGSCs, but were clearly distinct from adjacent-normal fallopian tube epithelia. Forty-two genomic regions exhibited high-confidence STIC-specific differential hypermethylation, of which 17 (40.5%) directly overlapped with HGSC-specific differentially methylated regions. Methylation at these shared loci was able to completely distinguish STIC and HGSC samples from normal and adjacent-normal specimens.
Our results suggest that most STICs are epigenetically similar to HGSCs and share regions of differential hypermethylation that warrant further evaluation for potential use as biomarkers for early detection of ovarian HGSC.
See related commentary by Ishak and De Carvalho, p. 6083
There is accumulating evidence that the majority of high-grade serous ovarian carcinomas (HGSC) likely originate as precursor, serous tubal intraepithelial carcinoma (STIC) lesions of the fallopian tube. A better understanding of these lesions is likewise expected to be critically important to achieving a deeper understanding of HGSC carcinogenesis and for the development of new methods for early detection of the disease. However, technical challenges associated with the rarity and diminutive nature of STIC lesions have made comprehensive and statistically significant molecular analyses, such as genome-wide DNA methylation, difficult to perform. In this study, we validate and employ a protocol to successfully perform methylomic analyses of 13 precursor fallopian tube lesions, 12 adjacent-normal epithelia, and 2 concurrent HGSC tumors and compare the resulting datasets to independent methylomic analyses of 23 HGSC tumors and 11 normal fallopian tube epithelia. Our results demonstrate that STIC lesions exhibit a high degree of epigenetic similarity to HGSC and enable identification of a discrete set of differentially hypermethylated regions that exhibit potential promise as sensitive and specific methylation biomarkers for identifying HGSC at early stages of carcinogenesis.
Introduction
Early detection of ovarian high-grade serous carcinoma (HGSC), the most common type of epithelial ovarian cancer, is crucial to reducing disease-associated morbidity and mortality (1, 2). This is because while radical surgery and cytotoxic chemotherapy have extended survival, overall mortality rates from the disease have not improved (3). There is currently no effective screening strategy for HGSC in clinical practice, partly due to an inadequate understanding of the early events of ovarian carcinogenesis. Recently, it has become increasingly clear that most ovarian HGSCs originate from epithelial precursor lesions of the fallopian tubes rather than from the ovary itself. This new paradigm of ovarian HGSC genesis was proposed on the basis of the initial observation of dysplastic epithelium in the fallopian tubes among women carrying BRCA1 and BRCA2 germline mutations (4, 5). Serous tubal intraepithelial carcinoma (STIC), a term currently used to denote the precursor lesion of HGSC, is characterized by a continuation of nonciliated tubal epithelial cells showing marked nuclear atypia, mitotic figures, apoptotic bodies, and loss of cellular polarization with p53-staining abnormality and increased Ki-67 labeling index (6–10).
Since its initial postulation, the tubal paradigm has been supported by a wealth of epidemiologic, clinical, pathologic, and molecular studies. It has been reported that as much as 60% or more of HGSC cases are associated with STICs (11, 12) and that STIC is more frequently detected in women at an increased risk of developing HGSC than those at an average risk (13). Transcriptomic analysis has demonstrated that HGSCs molecularly resemble fallopian tube epithelium as opposed to ovarian surface epithelium and peritoneal mesothelium (14). Molecular genetic analyses have further revealed that STICs harbor the same TP53 mutations as their concurrent HGSCs, thereby establishing a clonal relationship between the two (8). Moreover, phylogenetic studies of somatic mutations and DNA copy-number profiles indicate a complex evolutionary history involving the development of STIC and concurrent HGSC, suggesting that STIC precedes HGSC in many, if not most, cases (12, 15, 16). Many STICs associated with HGSC have shorter telomeres and amplified centrosomes (17, 18), further supporting their precursor roles in tumor progression. Incidental STICs without synchronous HGSC frequently present as multiples and also exhibit molecular alterations similar to HGSC (16).
On the basis of the number of somatic mutations acquired in STICs and clinicopathologic observation, researchers have estimated that it takes approximately 6 to 7 years for a STIC to progress into an invasive carcinoma (12, 16). This suggests that there exists a critical window of time in which STICs might be detected before they progress to advanced and often incurable stages of the disease. While molecular genetic alterations such as somatic mutations have been extensively studied as surrogate markers for detecting HGSC and other cancers in body fluids such as Pap specimens and blood, some recent studies have called into question the specificity of frequently mutated genes, such as TP53, for diagnosing the disease. For example, ultrahigh-sensitivity approaches such as duplex sequencing have shown that TP53 mutations are readily detectable in the majority of apparently heathy women (19, 20). More recently, it has also been shown that the majority of mutations detected in circulating cell-free DNA are actually derived from clonal hematopoiesis, not cancer (21). Thus, although TP53 mutations are present in essentially all STICs and HGSCs, they cannot be considered specific due to their occurrence in nominally healthy and nonmalignant tissues, including normal-appearing fallopian tube epithelium (13, 16, 22). This portends potential limitations to achieving desirable diagnostic performance with mutation-only detection schemes and indicates that there remains a need for the identification of alternative biomarkers of HGSC, particularly at early stages of the disease.
Discovery of candidate methylation biomarkers by genome-wide methylomic analyses of STIC lesions has been largely hereto precluded by technical challenges. Nonetheless, we previously reported that ovarian HGSC-specific differentially hypermethylated regions (DHMR) could be readily detected in STIC lesions using locus-specific techniques (23), thereby providing prima facie evidence that epigenetic aberrations arise early during ovarian cancer carcinogenesis and are potentially useful as biomarkers for early-stage HGSC diagnostics. In this work, we seek to validate and significantly extend these observations by performing genome-wide assessment of methylation directly on select pathology-confirmed precursor lesions associated with the development of ovarian cancer. Bioinformatic analyses of these results should have several biological and clinical implications, most notably the identification of a set of nonmutation-based biomarkers that occur prevalently at precursor stages and persist throughout the progression HGSC that might be implemented to facilitate development of early diagnostic approaches for HGSC.
Materials and Methods
Tissue samples and laser capture microdissection
This was a retrospective study with samples obtained according to protocols approved by the Institutional Review Boards (IRB) of Johns Hopkins Medical Institutions (Baltimore, MD). Formalin-fixed paraffin-embedded (FFPE) tissue blocks were selected from patients undergoing gynecologic surgeries between 2010 and 2019. The tissues selected included lesions of STIC and p53 signature from women with or without HGSC. The diagnosis was confirmed independently by experienced gynecologic pathologists according to previously published criteria (10). Normal fallopian tube epithelium away from the lesion on the same tissue block was selected as the control. In cases with concurrent HGSC involving the fimbriated ends where the STICs were located, normal epithelium from the contralateral fallopian tube was selected as the control.
FFPE tissue blocks were cut onto PEN-Membrane Slides (Carl-Zeiss, catalog no. 415909041000) for laser capture microdissection (LCM). Three additional unstained sections above and below the LCM membrane sections were reserved for immunohistologic confirmation of lesion location by hematoxylin and eosin staining, IHC for p53 (clone BP53–11, catalog no. 760–2542, Ventana Medical Systems), and Ki-67 (clone 30–7, catalog no. 790–4286, Ventana Medical Systems). Epithelial cells were microdissected using the adjacent slide prestained with p53 as a guide.
Post-LCM DNA extraction, quantification, and bisulfite conversion
Genomic DNA was extracted using the QIAmp DNA FFPE Tissue Kit (Qiagen, 56404) according to the manufacturer's instructions. Post-LCM DNA yields were then quantified by qPCR using forward primer sequence, 5′-AGG GTT TTT ATG GTT TTAGGT T-3′ and reverse primer sequence, 5′-ATC CCT TCC TTA CACC-3′, targeting a shared 82 bp locus within the consensus sequences of the L1PA1-L1PA5 LINE-1 families (24). Assays were performed using 10X Master Mix to yield a final volume of 25 μL and final working concentrations of 16.6 mmol/L (NH4)2SO4, 67 mmol/L Tris pH 8.8, 2.7 mmol/L MgCl2 10 mmol/L β-mercaptoethanol, 200 μmol/L of each dNTP (Thermo Fisher Scientific) and 0.04 U/μL of Platinum Taq polymerase (Thermo Fisher Scientific). Cycling conditions were 95°C for 5 minutes, followed by 50 cycles of (95°C for 5 seconds, 50°C for 30 seconds, and 72°C for 30 seconds). Standards for quantification were created by serial dilution of human male genomic DNA (Promega). PCR was performed in duplicate in 96-well plates using a CFX96 Touch Real-time PCR Detection System (Bio-Rad) and analyzed using the accompanying stock software, Bio-Rad CFX Manager (v3.1). DNA concentrations were determined by cycle of quantification (Cq) values with reference to a standard curve created via serial dilution of human male genomic DNA (Promega).
A total of 50 ng of DNA from each sample underwent bisulfite conversion using EZ DNA Methylation Kit (Zymo, D5001) according to the manufacturer's protocol. In brief, 5 μL of M-dilution buffer was mixed with 45 μL of sample and incubated at 37°C for 15 minutes; 100 μL of CT conversion reagent was then added to the mixture and incubated in the dark overnight. The sample was transferred into the spin column with M-binding buffer. Following the wash and spin-down procedure, 200 μL of Desulphonation buffer was added. The sample was then eluted with 10 μL of elution buffer.
TP53 Mutation analysis
The TP53 somatic mutation status in most of the microdissected samples was obtained either from the whole-exon sequencing or from TP53 amplicon sequencing data in our previous study (16). Some were obtained by Sanger sequencing targeting the TP53 DNA-binding domain (from exon 4 to exon 9). Mutations were analyzed by the Mutation Explorer software.
DNA restoration and MethylationEPIC assay
Bisulfite-treated (BST) DNA was restored to amplifiable length using the Illumina Infinium FFPE DNA Restoration Kit (WG321–1004) according to the manufacturer's instructions. Restored BST sample DNA was then analyzed using Illumina Infinium Human MethylationEPIC BeadChip Kit (WG-317–1002) by following manufacturer's manual. Briefly, 8 μL of bisulfite-converted DNA was added to a 0.8 mL 96-well storage plate (Thermo Fisher Scientific), denatured in 0.014N sodium hydroxide, neutralized and amplified with kit-provided reagents and buffer at 37°C for 20–24 hours. Samples were enzymatically fragmented by addition of 50 μL FMS fragmentation solution (Illumina) to each well. Plates were then vortexed at 1,600 rpm for 1 minute and pulse centrifuged at 280 g. The sealed plates were then incubated on a heat block at 37°C for 1 hour before precipitation of DNA by addition of 2-propanol to each sample well. Resuspended samples were denatured in a 96-well plate heat block at 95°C for 20 minutes. A total of 26 μL of each sample was loaded onto an 8-sample chip and the chips were assembled into the hybridization chamber as instructed in the manual. After incubation at 48°C for 16–20 hours, chips were briefly washed and then assembled and placed in a fluid flow-through station for primer-extension and staining procedures. Polymer-coated chips were image processed in Illumina iScan scanner.
Data analysis
DNA methylation data containing the Illumina EpicArray iDat files were processed using the functional normalization (funNorm) algorithm, as implemented in the minfi package (25). The ilm10b2.hg19 package from Bioconductor provided current hg19 annotations for the probe sequences represented on the array, which were remapped to hg38 based on the UCSC liftOver mapping (4). R studio software was used to analyze data obtained from the MethylationEPIC assay to identify DHMRs between STIC and corresponding normal-appearing tubal epithelium. To identify DHMRs that are most specific to HGSC and STIC, we incorporated the MethylationEPIC array data from our previous study (23), which include 23 HGSC and 11 normal fallopian tube mucosae derived from patients having bilateral salpingo-oophorectomy due to benign disease such as myoma or adenomyosis. Probes with detection P values greater than 0.05 were filtered out. Locus methylation was calculated as a β value, low to high ranging from 0–1, respectively. Probes with a [mean β value + 2 SDs] ≥ 0.2 in normal fallopian tube epithelium derived from women without malignancy were filtered out. To increase the confidence that the genomic region was methylated in HGSC and STIC, only genes with at least two probes located within 1500 bp of a CpG island, shore or shelf that exhibited contiguous, STIC-specific hypermethylation within a maximum interprobe distance of 400 bp were selected.
Data and code availability
All Illumina MethylationEPIC BeadArray data from this study, and our prior study (23), have been deposited to the NCBI Gene Expression Omnibus database as a SuperSeries (accession number GSE155761). An R Markdown file detailing all bioinformatic analyses is available for download at https://github.com/lesliecope/STICmanuscript.
Results
Case cohorts and study overview
The primary goal of this study was to survey the genome-wide landscape of methylation alterations that occur during the development of fallopian tube precursor lesions and to identify candidate DHMR biomarkers potentially suitable for early-stage identification of HGSC. However, the diminutive nature and need for tedious processing to unambiguously identify rare precursor lesions in FFPE fallopian tube tissues poses considerable challenges to identifying a suitable cohort of lesions in which genomic DNA of sufficient quantity and quality can be extracted to perform reliable whole-genome methylation analysis. In light of these concerns, we first performed preliminary validation experiments to identify appropriate sample requirements that would ensure reliable analysis by the MethylationEPIC BeadArray platform. Toward this end, we performed MethylationEPIC profiling using three concentrations (500 ng, 50 ng, and 10 ng per sample volume) of serially diluted DNA obtained from two separate LCM FFPE HGSOC tumor specimens. We then examined the respective Illumina quality control (QC) methylation profiling (Supplementary Fig. S1), sample clustering dendrograms (Supplementary Fig. S2) and CpG detection P values and subsequently determined 50 ng of high-purity, LCM DNA to be sufficient for consistent and reliable analysis using the MethylationEPIC platform, which is also in accord with previous studies (26).
From an extended library of 36 histopathologically confirmed FFPE specimens containing tubal precursor lesions, we identified 13 tubal lesions whose post-LCM DNA yields were sufficient to meet the predetermined sample requirements (Supplementary Fig. S3). Established histopathologic criteria were used to classify each lesion (10), of which representative cases can be seen in Supplementary Fig. S4. For each respective case, LCM was also performed, and DNA extracted, from normal-appearing, adjacent fallopian tube epithelia (normal FTE) control tissues, as well as from microdissected tumors when concomitant HGSC was present. In total, the initial sample cohort for this study comprised 27 DNA samples, including: 12 STIC lesions (including 1 “dormant STIC”), 12 adjacent-normal FTE, 1 p53 signature and 2 HGSCs from a total of 12 women. Figure 1 provides a pictorial overview of the overall sample cohort. Table 1 presents the relevant clinical information, mutational statuses (TP53 and BRCA1/2) and Ki-67 proliferation index for each of the lesions.
Case number . | Age . | Race . | Sample name . | Histopathology . | p53 pattern . | Ki-67(%) . | TP53 mutation . | Concurrent HGSOC . | Other tumors . | Germline mutations . |
---|---|---|---|---|---|---|---|---|---|---|
1 | 66 | W | N1 | NFT | No | BRCA2 mutation (+) c.26del (p.Pro9fs) | ||||
S1 | STIC | Nonsense | 70 | P191fs | ||||||
2 | 50 | W | N2 | NFT | No | MMMT | N/A | |||
S2 | STIC | Nonsense | 38 | H82fs | ||||||
3 | 79 | W | N3 | NFT | No | APST | N/A | |||
S3 | STIC | Missense | 20 | L62P | ||||||
4 | 58 | W | N4 | NFT | No | Uterine endometrioid carcinoma | N/A | |||
S4 | STIC | Nonsense | 65 | c.782+14del | ||||||
5 | NA | NA | N5 | NFT | No | N/A | ||||
S5 | STIC | N/A | N/A | N/A | ||||||
6 | 75 | W | N6 | NFT | Yes | N/A | ||||
S6 | STIC | Missense | 31 | C145F | ||||||
T6 | HGSC | C145F | ||||||||
7 | 71 | W | N7 | NFT | No | N/A | ||||
S7 | STIC | Missense | 18 | R379H c.920–2 A>C c.920–0 GC>G | ||||||
8 | 61 | W | N8 | NFT | Yes | BRCA2 mutation (+) c.1189_1190ins | ||||
S8 | STIC | Nonsense | 54 | D281N | ||||||
T8 | HGSC | D281N/N235N | ||||||||
9 | 74 | W | N9 | NFT | No | Uterine serous carcinoma | N/A | |||
S9 | STIC | Missense | 22 | R175H | ||||||
11 | W | N11 | NFT | No | ||||||
56 | P11 | p53 | Missense | 3 | A29D | N/A | ||||
S11 | STIC | Nonsense | 20 | Y88X | ||||||
12 | 49 | W | N12 | NFT | No | Breast cancer; Hodgkin lymphoma | BRCA2 mutation (+) Mutation status N/A | |||
S12 | STIC | Nonsense | 10 | C141a | ||||||
13 | 70 | W | N13 | NFT | No | N/A | ||||
L13 | Dorm. STIC | Missense | 4 | N239S |
Case number . | Age . | Race . | Sample name . | Histopathology . | p53 pattern . | Ki-67(%) . | TP53 mutation . | Concurrent HGSOC . | Other tumors . | Germline mutations . |
---|---|---|---|---|---|---|---|---|---|---|
1 | 66 | W | N1 | NFT | No | BRCA2 mutation (+) c.26del (p.Pro9fs) | ||||
S1 | STIC | Nonsense | 70 | P191fs | ||||||
2 | 50 | W | N2 | NFT | No | MMMT | N/A | |||
S2 | STIC | Nonsense | 38 | H82fs | ||||||
3 | 79 | W | N3 | NFT | No | APST | N/A | |||
S3 | STIC | Missense | 20 | L62P | ||||||
4 | 58 | W | N4 | NFT | No | Uterine endometrioid carcinoma | N/A | |||
S4 | STIC | Nonsense | 65 | c.782+14del | ||||||
5 | NA | NA | N5 | NFT | No | N/A | ||||
S5 | STIC | N/A | N/A | N/A | ||||||
6 | 75 | W | N6 | NFT | Yes | N/A | ||||
S6 | STIC | Missense | 31 | C145F | ||||||
T6 | HGSC | C145F | ||||||||
7 | 71 | W | N7 | NFT | No | N/A | ||||
S7 | STIC | Missense | 18 | R379H c.920–2 A>C c.920–0 GC>G | ||||||
8 | 61 | W | N8 | NFT | Yes | BRCA2 mutation (+) c.1189_1190ins | ||||
S8 | STIC | Nonsense | 54 | D281N | ||||||
T8 | HGSC | D281N/N235N | ||||||||
9 | 74 | W | N9 | NFT | No | Uterine serous carcinoma | N/A | |||
S9 | STIC | Missense | 22 | R175H | ||||||
11 | W | N11 | NFT | No | ||||||
56 | P11 | p53 | Missense | 3 | A29D | N/A | ||||
S11 | STIC | Nonsense | 20 | Y88X | ||||||
12 | 49 | W | N12 | NFT | No | Breast cancer; Hodgkin lymphoma | BRCA2 mutation (+) Mutation status N/A | |||
S12 | STIC | Nonsense | 10 | C141a | ||||||
13 | 70 | W | N13 | NFT | No | N/A | ||||
L13 | Dorm. STIC | Missense | 4 | N239S |
Abbreviations: APST, atypical proliferative serous tumor; Dorm. STIC, dormant STIC; MMMT, malignant mixed Müllerian tumors; N/A, not available; NFT, normal fallopian tube; p53, p53 signature; STIC, serous tubal intraepithelial carcinoma.
aPatient information not available.
Methylomic analysis of fallopian tube precursor lesions
We performed MethylationEPIC analyses on 50 ng of DNA extracted from each of the 27 FFPE tissue samples. QC of the resultant data indicated that all samples exhibited bimodal β-value distributions (Supplementary Fig. S5) as well as a similar or greater number of probes detected at statistical significant P value (e.g., P < 0.05) than in the preliminary validation samples, thereby indicating the quality of the resultant methylation data to be sufficient for downstream analyses. A heatmap of the initial, unsupervised analysis of the 1,000 most-variable CpG sites between the 12 STIC—adjacent-normal-FTE pairs showed that all tumors and the majority of lesions were readily distinguishable from adjacent-normal FTE according to sample clustering (Fig. 2A). Unsupervised multidimensional analysis (Fig. 2B) and the sample dendrogram (Fig. 2C) of the array data also demonstrated congruent clustering patterns. In addition, these data illustrate that the majority of normal FTE samples (and incidentally the p53 signature lesion) exhibit strong overlap in their respective methylomes, as compared with the STIC and HGSC samples, which are more heterogeneous in comparison. Taken as a whole, these results imply significant divergence between the genome-wide methylation patterns of precursor lesions, which is possibly reflective of the broader genomic instability known to arise during the progression of precursor lesions during ovarian cancer carcinogenesis (27). Of particular note is the direct subclustering seen in Fig. 2C between the two ovarian carcinomas (T6 and T8) and their associated STIC lesions (S6 and S8, respectively). This strongly suggests the existence of a clonal relationship between each cognate pair and provides additional anecdotal evidence supporting a fallopian tube precursor origin for the respective ovarian carcinomas.
We next employed a bioinformatic pipeline, described in our prior work (23), for identifying loci that exhibit high-confidence cancer-specific DHMRs that might be amenable for use as methylation biomarkers for identifying STIC lesions when compared with normal-appearing FTE. Using this algorithm, high specificity was prioritized by selecting only those CpG probe sites that exhibited minimal methylation [(mean β + 2 SD) < 0.2] in all tested control tissues (i.e., adjacent-normal FTE). Second, a so-called “high-confidence” criterion was employed to identify only those high-specificity loci containing a high density of CpG sites (CpG islands, shores, and shelves within 1,500 bp of the transcription start site) exhibiting contiguous STIC-specific hypermethylation within a maximum interprobe distance of 400 bp. The combination of both high-specificity and high-confidence criteria are aimed at identifying only those loci that are most likely to exhibit dense, disease-specific hypermethylation that are thereby well suited to commonly employed locus-specific methylation analysis techniques such methylation-specific PCR or targeted bisulfite sequencing. Analysis of STIC and adjacent-normal methylome data with this pipeline identified a total of 42 loci that exhibited high-confidence, STIC-specific hypermethylation (Supplementary Table S1). A heatmap demonstrated that methylation at these loci were able to not only fully discriminate all STIC from HGSC specimens, but incidentally classified even the p53 signature sample as distinct from both healthy and malignant tissues (Fig. 3).
Correlation between the STIC and HGSC methylome
We next proceeded to further compare the methylomic patterns between STIC lesions and HGSC. Toward this end, we combined the MethylationEPIC dataset from this study with MethylationEPIC datasets from HGSCs (n = 23) and normal gynecologic mucosal tissues (n = 11) derived from our previous study (23). A corresponding methylation heatmap of the unsupervised analysis of the top 1,000 most variable CpG sites of all samples in the conjoined datasets is shown in Fig. 4. There are a number of notable observations regarding the sample clustering from this analysis. First, our data demonstrate that the methylomes of HGSC and STIC lesions were clearly and reproducibly distinct from healthy and normal-appearing gynecologic tissues. Notably, the methylomes of most STICs also formed a subcluster distinct from both the adjacent-normal FTE and HGSC samples, possibly reflective of their intermediate state of malignancy. It was also notable that the p53 signature specimen clustered well within the adjacent-normal FTE specimens, indicating a largely nonmalignant epigenotype.
Shared cancer and STIC-specific hypermethylation
We next sought to determine whether there might be a convergence between HGSC-specific DHMRs with those regions exhibiting hypermethylation in the putative precursor, STIC lesions. To achieve this, we compared the 42 high-confidence STIC-specific DHMRs identified in this study with the 91 high-confidence HGSC-specific DHMRs independently identified in the previous study (23). Overall, we observed a striking overlap between the two candidate biomarker sets, with a total of 17 DHMRs in common (Fig. 5A; Table 2), accounting for a total of 40.5% of all high-confidence STIC-specific DHMRs.
Genomic locationa . | Gene . | STIC vs. Adj. norm. FTE AUC . | HGSC vs. Healthy FTM AUC . | (HGSC & STIC) vs. (Healthy FTM & Adj. norm FTE) AUC . |
---|---|---|---|---|
chr5:141208649–141208659 | PCDHB12 | 0.958 | 0.673 | 0.723 |
chr5:141407937–141407940 | PCDHGA1 … PCDHGB6 | 0.875 | 0.793 | 0.813 |
chr5:141417605–141417623 | PCDHGA1 … PCDHGB7 | 0.833 | 0.88 | 0.871 |
chr19:50667559–50667804 | SHANK1 | 0.826 | 0.745 | 0.761 |
chr3:187737583–187737946 | BCL6 | 0.826 | 0.807 | 0.81 |
chr17:60421510–60421683 | C17orf64 | 0.819 | 0.924 | 0.879 |
chr11:2271280–2271282 | ASCL2 | 0.812 | 0.789 | 0.753 |
chr11:131910988–131911000 | NTM | 0.806 | 0.855 | 0.804 |
chr17:37745219–37745526 | HNF1B | 0.778 | 0.633 | 0.737 |
chr2:153871394–153871412 | GALNT13 | 0.771 | 0.782 | 0.746 |
chr6:26183280–26183344 | HIST1H2BE | 0.771 | 0.782 | 0.765 |
chr12:38905914–38906042 | CPNE8 | 0.722 | 0.836 | 0.784 |
chr12:24903146–24903193 | BCAT1 | 0.694 | 0.676 | 0.699 |
chr6:26043976–26044177 | HIST1H2BB, HIST1H3C | 0.632 | 0.815 | 0.76 |
chr17:37743962–37744227 | HNF1B | 0.625 | 0.531 | 0.555 |
chr2:42792857–42792873 | HAAO | 0.549 | 0.829 | 0.717 |
chr6:31815145–31815220 | HSPA1A, HSPA1L | 0.507 | 0.716 | 0.651 |
Mean: | 0.753 | 0.769 | 0.755 | |
SD: | 0.118 | 0.098 | 0.077 | |
Wilcoxon P-value: | 0.679 |
Genomic locationa . | Gene . | STIC vs. Adj. norm. FTE AUC . | HGSC vs. Healthy FTM AUC . | (HGSC & STIC) vs. (Healthy FTM & Adj. norm FTE) AUC . |
---|---|---|---|---|
chr5:141208649–141208659 | PCDHB12 | 0.958 | 0.673 | 0.723 |
chr5:141407937–141407940 | PCDHGA1 … PCDHGB6 | 0.875 | 0.793 | 0.813 |
chr5:141417605–141417623 | PCDHGA1 … PCDHGB7 | 0.833 | 0.88 | 0.871 |
chr19:50667559–50667804 | SHANK1 | 0.826 | 0.745 | 0.761 |
chr3:187737583–187737946 | BCL6 | 0.826 | 0.807 | 0.81 |
chr17:60421510–60421683 | C17orf64 | 0.819 | 0.924 | 0.879 |
chr11:2271280–2271282 | ASCL2 | 0.812 | 0.789 | 0.753 |
chr11:131910988–131911000 | NTM | 0.806 | 0.855 | 0.804 |
chr17:37745219–37745526 | HNF1B | 0.778 | 0.633 | 0.737 |
chr2:153871394–153871412 | GALNT13 | 0.771 | 0.782 | 0.746 |
chr6:26183280–26183344 | HIST1H2BE | 0.771 | 0.782 | 0.765 |
chr12:38905914–38906042 | CPNE8 | 0.722 | 0.836 | 0.784 |
chr12:24903146–24903193 | BCAT1 | 0.694 | 0.676 | 0.699 |
chr6:26043976–26044177 | HIST1H2BB, HIST1H3C | 0.632 | 0.815 | 0.76 |
chr17:37743962–37744227 | HNF1B | 0.625 | 0.531 | 0.555 |
chr2:42792857–42792873 | HAAO | 0.549 | 0.829 | 0.717 |
chr6:31815145–31815220 | HSPA1A, HSPA1L | 0.507 | 0.716 | 0.651 |
Mean: | 0.753 | 0.769 | 0.755 | |
SD: | 0.118 | 0.098 | 0.077 | |
Wilcoxon P-value: | 0.679 |
aCoordinates refer to the human reference genome hg38 release (Genome Reference Consortium GRCh38, Dec 2013).
A methylation heatmap demonstrated that the 17 shared DHMRs readily distinguished malignant (STIC and HGSC) from all normal-appearing gynecologic tissues without pathologic changes (Fig. 5B). It is, however, particularly notable that HGSC could not be distinguished from STIC based on the presence and/or extent of methylation in these regions. While the inability to distinguish precursor lesions from high-grade ovarian carcinomas provides additional anecdotal support for the fallopian tube origin of HGSC, it also implies that, collectively speaking, methylation at these loci likely accrues primarily during the progression of precursor lesions prior to metastasizing to the ovary.
A plot of the average β value of the three candidate biomarker sets (HGSC-, STIC- and HGSC/STIC-specific DHMRs), as shown in Fig. 6A and Supplementary Tables S1–S3, also showed that although malignant tissues exhibited considerably higher methylation than normal-appearing fallopian tube tissues, HGSC specimens were only marginally more methylated in the majority (76%) of STIC-specific and HGSC/STIC-specific DHMRs (paired t test, P = 0.13 and 0.33, respectively). On the other hand, we observed moderately higher mean methylation at 88% of HGSC-specific DHMRs in HGSC than in STIC specimens (paired t test, P = 2E-10), indicating that hypermethylation at many of these loci either arises late or continues to accrue following dissemination to the ovary. In comparison, when all CpG-probes in islands, shores and shelves located near promotor regions were considered, no correlation between hypermethylation and disease status (STIC vs. HGSC) was observed. A similar trend was observed when the receiver operating characteristic (ROC) performance of the DHMR sets were considered (Fig. 6B; Supplementary Tables S1–S3). These data revealed that while STIC-specific DHMRs performed equally well at discriminating STIC and HGSC from corresponding healthy tissues (mean AUC 0.816 and 0.815, respectively), they actually slightly outperformed HGSC-specific DHMRs at discriminating HGSC tissues (mean HGSC-specific DHMR AUC, 0.799) and greatly outperformed HGSC-specific DHMRs in identifying STIC (mean HGSC-specific DHMR AUC, 0.702). These results further support the notion that STIC-specific hypermethylation likely accrues before, and is largely retained, following dissemination to the ovary, while some HGSC-specific differential hypermethylation arises during subsequent progression of ovarian carcinoma. Finally, it is worth noting that although STIC-specific DHMRs appear to perform better in terms of distinguishing both HGSC and STIC from healthy tissues than HGSC/STIC-specific DHMRs (mean HGSC/STIC-specific AUC, 0.769 and 0.753, respectively), the HGSC/STIC-specific DHMRs exhibit a considerably lower mean and variance in β value in healthy fallopian tube mucosae. This implies that the HGSC/STIC-specific DHMRs would likely outperform STIC-specific DHMRs in terms of clinical specificity for the detection of STIC or HGSC in samples such as blood or Pap specimens that would nominally contain DNA derived from heterogeneous populations of cells within the mucosae of the fallopian tube, endometrium and cervix, which were included as controls in our previous study (23).
Discussion
There continues to be a critical unmet need for biomarkers and diagnostic methods capable of detecting asymptomatic ovarian HGSC, particularly at early, precursor and/or incipient stages of the disease when surgical intervention is most effective. While considerable effort has been made toward this goal, particularly in the implementation of sequencing technologies with noninvasively collected samples such as blood, uterine lavage, or Pap specimens (20, 28–30), the increasing analytic sensitivity of these technologies has also begun to reveal that many cancer-free individuals and healthy tissues commonly harbor these same genetic abnormalities (19, 20). This issue is further confounded by the fact that TP53 is the only significantly mutated gene (SMG) present in most HGSC tumors (31), indicating that targeted sequencing of additional frequently mutated genes should provide only limited improvement in diagnostic performance. These issues ultimately imply that alternative biomarkers and/or multivariate approaches will likely be necessary to fully leverage advances in molecular diagnostic techniques while maintaining acceptable clinical specificity.
Biomarkers based on cancer-specific alterations in DNA methylation are a particularly attractive option for the noninvasive detection of HGSC. First, in comparison with nonsynonymous mutations, which typically occur in no more than two SMGs per HGSC tumor (31), cancer-specific aberrant DNA methylation is far more abundant, typically occurring in thousands of CpG dinucleotides within hundreds of CpG islands in the typical tumor genome (32). This redundancy is particularly useful for noninvasive diagnostic applications, as panels of methylation biomarkers can be utilized to dramatically improve clinical sensitivity in challenging samples, such as liquid biopsies, that normally contain only limited quantities of tumor DNA (33, 34). A second key advantage of DNA methylation is that it is highly cell-type specific (35). A number of key studies have likewise demonstrated that not only are DNA methylation biomarkers useful for early detection, but also for identifying corresponding tissues of origin and even subtypes of many human cancers (34, 36–39), including etiologically-diverse epithelial ovarian carcinomas (40).
This study directly builds upon our previous efforts to identify a discrete set of genomic regions exhibiting ovarian HGSC-specific differential hypermethylation through genome-wide methylomic analysis. Using locus-specific methylation analyses, our prior study also provided initial evidence that hypermethylation at the HGSC-specific loci was likely present in STIC lesions, the putative precursors of ovarian HGSC, thereby indicating that that epigenetic aberrations are likely an early event in, and possibly contribute to, ovarian cancer carcinogenesis. In this study, we sought to validate and significantly extend these observations by directly evaluating genome-wide assessment of methylation in pathology-confirmed precursor lesions associated with the development of ovarian cancer. Overall, we found that HGSCs and its precursor lesions are readily distinguishable from normal-appearing fallopian tube tissues based solely on the methylation status of a limited set of genomic loci.
In unsupervised analyses, all but 2 STIC samples clustered together with the 25 HGSC samples, which is reflective of the common origin between the two malignancies. Perhaps more surprisingly, these analyses also demonstrated that the methylomes of STIC samples could also be distinguished from HGSCs, primarily due to hypermethylation of certain loci otherwise associated with normal FTE, as well as notably less prevalent and extensive degrees of hypermethylation and hypomethylation in HGSC-specific DHMRs. This subclustering implies that while, for the most part, the methylomes of STIC lesions largely resemble that of HGSC tumors, there appears to be notable hypomethylation within certain genomic regions that occurs in HGSC, but not in STIC, samples. However, an open question remains as to whether the observed HGSC-specific hypomethylation is a result of cellular heterogeneity within the tumor samples, epigenetic reprogramming that occurs within malignant nonciliated fallopian tube epithelial cells following dissemination to the ovary or, possibly, an FFPE-related artifact (samples from the prior study were fresh frozen rather than FFPE). Nonetheless, taken at face value, our results imply that assessment of methylation might be potentially leveraged to differentiate early, premalignant disease from HGSC.
In general, HGSCs appeared to exhibit only marginally more hypermethylation at STIC-specific DHMRs than precursor STIC lesions, implying that the majority of hypermethylation at these loci likely occurs before dissemination to the ovary. This is also supported by the observation that STIC-specific DHMRs perform equally well at discriminating STIC and HGSC from healthy tissues. We also observed that microdissected adjacent-normal fallopian tube epithelium exhibited a modest, but detectable degree of HGSC-specific methylation. This observation is in general accord with the results of our previous study (23) that indicated the potential existence of STIC-associated methylation field effect or broader, genetically driven, epigenomic reprogramming (41) that may occur even prior to the development of early-stage, p53 signatures. Likewise, because the high-confidence STIC-specific DHMRs identified in this study were selected using adjacent-normal FTE as a negative control, it is probable that at least some DHMRs positively associated with the development of precursor lesions and subsequent HGSC have been excluded as a result of hypermethylation due to a methylation field effect.
p53 signature is a tubal lesion of unknown clinical significance that, while likely related to STIC lesions, is often detected in fallopian tubes in the absence of disease (16). p53 signature is defined by an abnormal p53 immunostaining pattern compatible with a missense TP53 mutation in a minute stretch of otherwise normal-appearing nonciliated tubal epithelium. p53 signatures are thought to represent an evolutionary “dead end” and do not have any known clinical significance but a few of them may precede STIC as the earliest precursor lesions. Because of their diminutive nature, the DNA obtained from p53 signatures is rarely sufficient for global methylation studies. Nevertheless, in this study, we, for the first time, were able to profile the methylome of this incipient lesion. The fact that p53 signature is epigenetically closer to normal-appearing fallopian tube epithelium than to STIC and HGSC suggests that TP53 mutations may not significantly affect the methylation profiles in epithelial cells and reassure that p53 signatures unlike STIC have yet to undergo malignant transformation. Although more p53 signatures will need to be analyzed in follow-up studies, this initial study indicates that methylation analyses might ultimately be leveraged to both detect and distinguish precursor lesions with respect to their malignant potential.
It is worth highlighting that in addition to the 17 HGSC-specific genomic loci identified in our previous study, several other identified genes associated with the STIC-specific hypermethylation have also previously been implicated by others in HGSC and associated carcinogenesis. Most notably, silencing of the genes HOXA11 (42, 43), miR-124 (44), RUNX3 (45, 46), SHANK1 (47), MN1 (48), SOCS2 (49), and OPCML (50) have all been documented to contribute to the development of HGSC. Interestingly, the methylation heatmaps of STIC-specific hypermethylation seem to indicate that in most lesions, methylation occurs in an “all or nothing” manner at various individual loci, reflecting considerable heterogeneity between the lesions, which are themselves, epigenetically clonal in nature.
The 91, 42, and 17 DHMRs identified here as highly specific to HGSC, STIC, and both specimen types, respectively, represent a relatively conservative set of candidate hypermethylation biomarkers for further validation studies. It is ultimately anticipated that the performance of mutation-based diagnostics approaches such as the PapSEEK tests (30) will likely benefit from incorporation of complementary epigenetic or protein biomarkers that can be concurrently evaluated in a single, noninvasively-collected specimen. A simple, unified test would be particularly attractive for early HGSC diagnostic or screening purposes, especially for women at increased risk of developing ovarian cancer due to germline mutations in BRCA1/2 and other DNA repair genes as well as a family history of breast and ovarian cancer.
There are a number of limitations to this study that warrant discussion. First, the relative scarcity and minute nature of fallopian tube precursor lesions make it particularly difficult to obtain a large cohort of specimens with sufficient DNA for genome-wide methylation analysis. As such, the inclusion criteria for our study reflect a best effort to achieve an ideal compromise between sufficient data quality and analysis of a suitable number of specimens to provide an adequate representation of the overall methylomic landscape of precursor lesions. Another limitation inherent to the study is that precursor lesions must undergo FFPE processing for identification and accurate classification. This ultimately implies that the quality of downstream molecular analyses, methylomic or otherwise, are necessarily subject to potential artifacts associated with FFPE sample processing. Batch effects arising from such artifacts can be at least partially addressed through incorporation of appropriate FFPE controls, such as the adjacent-normal FTE specimens used here. Similarly, the potential for false DMR discovery arising from spurious probe data can be further minimized by implementation of rigid discovery criteria, such as the “high-confidence” DHMR, which are characterized by minimal methylation in control specimens with statistically significant hypermethylation in contiguous CpG probes in STIC specimens. Regardless, batch effects may still exist when comparing data from DNA obtained from FFPE specimens to those from fresh-frozen samples, such as the HGSC and healthy fallopian tube mucosae samples of our prior study. However, this concern is somewhat mitigated by previous studies demonstrating the equivalence of EPIC datasets from DNA obtained from fresh-frozen samples and 50 ng of restored FFPE DNA (26).
This study represents an important initial survey of the methylomic landscape of fallopian tube lesions associated with high-grade serous ovarian carcinoma. Future studies are needed to validate our findings in an independent cohort of STIC and p53 signature lesions and to further determine the underlying biological mechanisms from which STIC- and/or HGSC-specific methylation arises. Validation of DHMRs specific to STIC but not in fallopian tube epithelium will provide new insight into the very early pathogenesis of ovarian HGSC. Furthermore, exploration into possible epigenetic “field” effects in normal-appearing tubal epithelium in reference to anatomic location might help to elucidate epigenetic contributions that contribute to or accelerate lesion formation and/or progression.
Disclosure of Potential Conflicts of Interest
No potential conflicts of interest were disclosed.
Authors' Contributions
T.R. Pisanic II: Conceptualization, resources, data curation, formal analysis, supervision, funding acquisition, validation, investigation, visualization, methodology, writing-original draft, writing-review and editing. Y. Wang: Data curation, formal analysis, validation, investigation, visualization, methodology, writing-original draft, writing-review and editing. H. Sun: Formal analysis, visualization. M. Considine: Formal analysis, visualization. L. Li: Data curation, methodology. T.-H. Wang: Resources, supervision, funding acquisition. T.-L. Wang: Resources, supervision, funding acquisition, project administration. I.-M. Shih: Conceptualization, resources, formal analysis, supervision, funding acquisition, writing-original draft, project administration, writing-review and editing.
Acknowledgments
This work was supported by the Honorable Tina Brozman Foundation, NIH/NCI (grant numbers P50CA228991, UO1CA200469, RO1CA215483), Teal Award, Gray Foundation, Department of Defense CDMRP (grant number W81XWH-11-2-0230), Johns Hopkins Discovery Award and the Richard W. TeLinde Endowment, Johns Hopkins University.
This work was supported by the Honorable Tina Brozman Foundation, the Department of Defense CDMRP (grant number W81XWH-11-2-0230), the NIH/NCI (grant numbers EDRN UO1CA200469, RO1CA215483), the Teal award, the Gray Foundation, Colleen's Dream Foundation, the Johns Hopkins Discovery Team Award and the Richard W. TeLinde Endowment, Johns Hopkins University.
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.