Abstract
Summary: Using a series of detailed experiments, Zhang and colleagues establish that the prostate cancer RNA chimera SLC45A3-ELK4 is generated by cis-splicing between the 2 adjacent genes and does not involve DNA rearrangements or trans-splicing. The chimera expression is induced by androgen treatment likely by overcoming the read-through block imposed by the intergenic CCCTC insulators bound by CCCTC-binding factor repressor protein. The chimeric transcript, but not wild-type ELK4, is shown to augment prostate cancer cell proliferation. Cancer Discov; 2(7); 582–5. ©2012 AACR.
Commentary on Zhang et al., p. 598.
Gene fusions represent one of the most common genomic aberrations in cancers. Characterization of the oncogenic gene fusion BCR-ABL1 in chronic myeloid leukemia culminating in the successful treatment of the disease with a molecularly targeted therapy represents a compelling “bench to bedside” paradigm (1) that has inspired the discovery and characterization of similar oncogenic “driver” fusions in other hematologic malignancies, soft tissue tumors, and rare epithelial carcinoma (2, 3). More recently, driven by the development of sensitive genomic techniques such as microarrays and high-throughput sequencing in cytogenetically complex tissues of common solid cancers such as prostate, lung, and breast, many recurrent gene fusions involving genes such as ALK, RET, BRAF, ERG, and NOTCH have been discovered that provide novel diagnostic, prognostic, and therapeutic leads that are targetable by available or experimental therapeutics and thus have implications for personalized medicine (4–7). This is driving an ongoing quest for the discovery of recurrent gene fusions across major types of cancers using the high-throughput sequencing approaches.
Chromosomal rearrangements (including recombination, insertion, deletion, inversion, tandem duplication, or amplification) are regarded as the principal mechanism of the formation of gene fusions that may involve genes on the same chromosome (intrachromosomal fusions) or different chromosomes (interchromosomal fusions). Next-generation sequencing of cancer samples has revealed that most individual tumors harbor multiple gene fusions that include potential oncogenic “drivers,” as well as random by-products of genomic aberrations, generically described as “passenger” fusions. While the discovery of gene fusions resulting from chromosomal aberrations forms the primary objective of chimera analyses, RNA sequencing is unraveling yet another category of gene fusions that typically involve adjacent genes (in the same coding orientation) which express chimeric transcripts without involving concomitant chromosomal rearrangements. These RNA chimeras, variously described as “read-throughs,” transcription-induced chimeras (TIC), or CoTIS (co-transcription of adjacent genes coupled with intergenic splicing), are widely observed in virtually all samples analyzed, including in benign samples across different tissue types. Some of these RNA chimeras, however, appear restricted to individual tissue types, and a few of these have been observed to be highly expressed in cancers that may indicate potential functional relevance with respect to cellular differentiation and disease development (8–10). One such RNA chimera involves 2 adjacent genes, 60 kb apart on Chr1q32.1, the androgen-driven, prostate-specific gene, solute carrier family 45 member 3 (SLC45A3) and the gene-encoding ETS-family oncogenic transcription factor, ELK4, frequently observed at high levels in prostate cancer (9–11).
In this issue of Cancer Discovery, Zhang and colleagues (12) describe studies pertaining to a structural and functional characterization of SLC45A3-ELK4 chimera in prostate cancers. First, analyzing the likely mechanisms underlying the expression of the SLC45A3-ELK4 chimera, 3 scenarios—chromosomal rearrangement, trans-splicing, and cis-splicing—were considered. While previous analyses of the chromosomal region between SLC45A3 and ELK4 have ruled out chromosomal deletion (10, 11), here using hybridization probes encompassing the 2 genes as well as the intergenic region, Southern blot analysis was conducted to rule out the absence of a potential balanced translocation in the fusion-positive LNCaP cells, effectively negating the possibility of a DNA-based aberration underlying this chimera. Next, to forestall the potential artifacts associated with PCR, such as false chimera generated by template switching, the authors provide direct evidence of the endogenous chimeric transcript in the index cases using RNase protection assay. This was followed by several experiments to differentiate between the 2 distinct modes of chimera formation at the RNA level, that is, cis- versus trans-splicing. An in vitro trans-splicing assay was conducted using nuclear extracts of chimera-positive LNCaP cells and green monkey kidney cells CV-1 that harbor distinct polymorphisms in its ELK4 sequence. Real-time PCR was carried out using a common forward primer located on SLC45A3 and 2 distinct human and monkey ELK4-specific reverse primers. Absence of any trans-spliced RNA product composed of human SLC45A3 and monkey ELK4 led to the conclusion that trans-splicing is not involved. As a direct evidence of cis-splicing, the RNA corresponding to the intergenic region was detected by real-time PCR.
Next, the authors examined the role of CCCTC-binding factor (CTCF) binding to the recently described 2 “insulator” sequences present in the intergenic region between SLC45A3 and ELK4 (13) in controlling the expression of the chimera (Fig. 1A). Insulator sequences typically lie within intergenic regions and prevent runaway transcription across neighboring genes, through association with CTCF (14). The role of CTCF has been described in the regulation of alternative splicing (15), as well as through association of DNA methylation at CTCF-binding sites, in the generation of alternative splice variants (16). Here, the authors show that CTCF binds to the 2 insulators and this binding is attenuated upon androgen treatment. SLC45A3 being strongly induced by androgen, it likely forces a run-off transcription overcoming CTCF insulator block. This provides the mechanism for selective expression of SLC45A3-ELK4 in prostate cancer. Furthermore, CTCF knockdown induced SLC45A3-ELK4 expression whereas it abrogated the expression of the trans-spliced chimeric transcript JAZF1-JJAZ1 (17), which further rules out the possibility of trans-splicing and provides additional support for a cis-splicing mechanism.
A, schematic representation of the genomic coordinates of SLC45A3 and ELK4 on chr1q32 (reverse strand) and the location of the 2 intergenic insulator sequences. Androgen receptor–driven transcription initiated at SLC45A3 promoter, overrides the repressive effect of CTCF-bound insulator sequences. B, architecture of potential fusion isoforms generated by cis-splicing between SLC45A3 (green) and ELK4 (orange), as revealed by transcriptome sequencing. The lighter colors indicate 5′- and 3′-untranslated regions (left). Predicted protein products resulting from the chimeric isoforms are shown in right. The SLC45A3_exon1-ELK4_exon2 (left, top) putatively encodes full-length ELK4 (right, top), and SLC45A3_exon4-ELK4_exon2 (left, bottom) encodes a fusion protein as indicated (right, bottom). C, schematic representation of known ETS family gene fusions driven by androgen-responsive promoters in prostate cancer. As indicated, TMPRSS2 and SLC45A3 are the most frequent 5′ partners. SLC45A3-ELK4 fusion represents a unique “RNA only” chimera (not represented by a DNA-based aberration) indicated by the box with a dashed outline.
A, schematic representation of the genomic coordinates of SLC45A3 and ELK4 on chr1q32 (reverse strand) and the location of the 2 intergenic insulator sequences. Androgen receptor–driven transcription initiated at SLC45A3 promoter, overrides the repressive effect of CTCF-bound insulator sequences. B, architecture of potential fusion isoforms generated by cis-splicing between SLC45A3 (green) and ELK4 (orange), as revealed by transcriptome sequencing. The lighter colors indicate 5′- and 3′-untranslated regions (left). Predicted protein products resulting from the chimeric isoforms are shown in right. The SLC45A3_exon1-ELK4_exon2 (left, top) putatively encodes full-length ELK4 (right, top), and SLC45A3_exon4-ELK4_exon2 (left, bottom) encodes a fusion protein as indicated (right, bottom). C, schematic representation of known ETS family gene fusions driven by androgen-responsive promoters in prostate cancer. As indicated, TMPRSS2 and SLC45A3 are the most frequent 5′ partners. SLC45A3-ELK4 fusion represents a unique “RNA only” chimera (not represented by a DNA-based aberration) indicated by the box with a dashed outline.
Analyzing potential phenotypic effects of the fusion transcript overexpression, knockdown of SLC45A3-ELK4 but intriguingly, not by wild-type ELK4 (or SLC45A3), resulted in reduced proliferation. Probing this further, cell-cycle analysis of LNCaP cells transfected with fusion-specific siRNA (again, not with ELK4-specific siRNA) displayed G1 arrest, even as no cell death was observed. The G1 arrest was attributed to upregulation of cyclin-dependent kinase inhibitor CDKN1A (p21) in LNCaP cells transfected with siRNA against SLC45A3-ELK4, but not with siELK4. Microarray analysis also identified upregulation of cell-cycle inhibitor p21 and S100A10, in LNCaP cells transfected with si-SLC45A3-ELK4 but not by siELK4. Surprisingly, the authors found no effect on cell invasion by SLC45A3-ELK4, unlike previous reports on other ETS family oncogenes such as ERG and ETV1, where an effect on invasion but not proliferation was observed (18, 19). The study also reports the expression of SLC45A3-ELK4 in androgen-independent prostate cancer cell line PC3 as well as non–prostate cancer types and even benign cells, as has been observed previously (9), suggesting that SLC45A3-ELK4 expression may also involve androgen receptor–independent pathways.
Analyzing clinical correlates of SLC45A3-ELK4 expression, a significant association was found with prostate cancers with Gleason score 7 or more. Notably, prostate cancer metastases were found to show the highest levels of chimera expression. If validated in larger sample cohorts, this chimeric RNA could potentially be an attractive diagnostic/prognostic marker.
With increasing depth of coverage in transcriptome sequencing, chimeric RNA sequences are routinely observed in all samples, presumably driven by strong expression of the 5′ genes. The tissue-specific expression of SLC45A3-ELK4 is certainly attributable to androgen-driven, prostate-specific expression of SLC45A3. Follow-up studies will need to identify chimera-derived protein(s) and characterize its difference(s) from wild-type protein, as 2 distinct fusion isoforms have been described (refs. 9, 10; Fig. 1B), SLC45A3_exon1-ELK4_exon2 encoding full-length open reading frame for ELK4 protein and SLC45A3_exon4-ELK4_exon2 that encodes a putative chimeric protein with amino acid stretches corresponding to both SLC45A3 and ELK4. The latter product could potentially account for the chimera-specific increase in cell proliferation in prostate cancer cells reported in the present study.
A majority of prostate cancers are characterized by the presence of recurrent gene fusions primarily involving androgen-regulated upstream genes, TMPRSS2 and SLC45A3, fused to one of the genes of the ETS family of oncogenic transcription factors (ref. 20; Fig. 1C). SLC45A3-ELK4 represents a novel class of fusions in this group generated through cis-splicing and adds to the repertoire of prostate cancer–specific chimera that may serve as disease-specific biomarkers.
Disclosure of Potential Conflicts of Interest
A.M. Chinnaiyan serves as a consultant to Gen-Probe and Ventana/Roche who are developing gene fusion–based assays for prostate cancer. No potential conflicts of interests were disclosed by the other authors.
Acknowledgments
The authors thank University of Michigan GI SPORE Career Development Award for its generous support (C. Kumar-Sinha).
Grant Support
This work is supported, in part, by the NIH R21CA155992 (C. Kumar-Sinha); U.S. NIH (R01CA132874), Early Detection Research Network grant UO1 CA111275, Prostate SPORE grant P50CA69568, and the Department of Defense Era of Hope grant BC075023 (A.M. Chinnaiyan). A.M. Chinnaiyan is supported by the Doris Duke Charitable Foundation Clinical Scientist Award and the Prostate Cancer Foundation. A.M. Chinnaiyan is an American Cancer Society Research Professor and A. Alfred Taubman Scholar.