Purpose: To identify the novel gene signatures and molecular markers of nasopharyngeal carcinoma (NPC) by integrated bioinformatics analysis of multiple gene expression profiling datasets.

Experimental Design: Seven published gene expression profiling studies and one of our unpublished works were reanalyzed to identify the common significantly dysregulated (CSD) genes in NPC. Overrepresentation analysis of cytogenetic bands, Gene Ontology (GO) categories, pathways were used to explore CSD genes functionally associated with carcinogenesis. The protein expressions of selected CSD genes were examined by immunohistochemistry on tissue microarrays, and the correlations of their expressions with clinical outcomes were evaluated.

Results: Using the criteria (genes reported deregulated in more than one study), a total of 962 genes were identified as the CSD genes in NPC. Four upregulated (BUB1B, CCND2, CENPF, and MAD2L1) and two downregulated (LTF and SLPI) genes were markedly reported in six studies. The enrichments of chromosome aberrations were 2q23, 2q31, 7p15, 12q15, 12q22, 18q11, and 18q12 in upregulated genes and 14q32 and 16q13 in downregulated genes. The activated GO categories and pathways related to proliferation, adhesion, invasion, and downregulated immune response had been functionally associated with NPC. SLPI significantly downregulated in nasopharyngeal adenocarcinoma. Furthermore, the high expression of BUB1B or CENPF was associated with poor overall survival of patients.

Conclusion: It was first clearly identified the dysregulated expression of BUB1B and SLPI in NPC tissues.

Impact: Further studies of the CSD genes as gene signatures and molecular markers of NPC might improve the understanding of the disease and identify new therapeutic targets. Cancer Epidemiol Biomarkers Prev; 21(1); 166–75. ©2011 AACR.

This article is featured in Highlights of This Issue, p. 1

Nasopharyngeal carcinoma (NPC) is a malignant epithelial head and neck carcinoma that is highly prevalent in southern China and Southeast Asia (1). Multiple molecular studies have reported that NPC is a highly aneuploid and genetically complex tumor that develops in a multistep process involving alterations of numerous genes. However, the mechanism of carcinogenesis and progression of NPC is not well understood. Moreover, the prognosis of NPC patients is related to its potent localized invasion and metastatic spread. A clearer understanding of the molecular signature and underlying mechanisms would be beneficial in providing effective biomarkers for therapeutic planning and intervention.

Gene expression profiling analysis using microarray or array technologies has emerged in the last decade as a powerful tool to search for reliable gene markers of specific cancers and to determine the molecular features of cancer progression and the clinical outcome in greater detail (2). Moreover, because the genome-wide set of dysregulated genes and proteins are known to function in regulated biological networks or pathways associated with cancers, the focus of data obtained from gene expression profiling in cancers has been directed toward analysis of these sets of genes or pathways. In the literature, several reports have described dysregulation of gene expression in NPC and have found a vast amount of genome-wide information (3–11). However, the overall molecular signature of NPC is still undefined. Focusing on dysregulated genes and pathways in a single gene expression profiling dataset is not the most efficient method for understanding the molecular signature of NPC because only a few clinical samples can be validated and studied in detail at any given time. Hence, an integrated analysis would be useful in understanding the cross-talk among the gene expression profiling datasets of NPC clinical samples from different laboratories and would maximize the biological information that can be collected from genome-wide, gene expression profiling datasets of NPC.

We hypothesized that common significant gene expression changes between different microarray studies may be identified by reclassifying and reanalyzing publicly available gene expression profiling data. To understand the underlying mechanisms of NPC carcinogenesis, and to develop new therapeutic approaches targeted to NPC-specific molecular abnormalities, gene expression data were examined by bioinformatics analysis tools that searched for chromosome aberrations, biological annotations, and pathways. Then, expression of selected dysregulated genes in NPC was examined by immunohistochemistry (IHC) on tissue microarrays (TMA), and the correlation of protein expression with clinical outcomes was evaluated.

Literature search and identification of the common significantly dysregulated genes in NPC

For data collection, a literature survey of gene expression data published in NPC was carried out. Articles were selected that satisfied the predetermined criteria: sample from human tissues and the preparation were detailed, technology for gene expression studies was defined, and detailed results of the gene expression changes were available. Studies concerning single genes or arbitrarily selected genes were discarded. Results obtained from animal models were not considered. Eight NPC dysregulated gene lists from 7 published gene expression profiling studies and one of our unpublished works were included in our analysis (Table 1). All given gene details were converted into the National Center for Biotechnology Information official gene symbol by The Database for Annotation, Visualization and Integrated Discovery (DAVID) v6.7 (12, 13). The dysregulated genes reported in 2 or more studies were selected.

Table 1.

NPC gene expression profiling studies details

AuthorYearPlatformSamplesNumber of reported dysregulated gene
Unpublished data  HG-U133.Plus.2.0a N: 4, T: 12 806 
Bose and colleagues 2009 HG-U133Ab N: 3, T: 25 1963 
Fang and colleagues 2008 CSC-GE-80c N: 1 pool (24 normal), T: 8 pools (32 tumor) 140 
Zeng and colleagues 2007 Biostar-H80sd N: 10, T: 23 503 
Shi and colleagues 2006 HG-U133A N: 6, T: 14 1089 
Sengupta and colleagues 2006 HG-U133.Plus.2.0 N: 10, T: 31 831 
Sriuranpong and colleagues 2004 Human GEM2 cDNA arraye N: 7, T: 12 485 
Xie and colleagues 2000 Atlas human cancer cDNA expression arrayf N: 1 pool (9 normal), T: 1 pool (26 tumor) 46 
AuthorYearPlatformSamplesNumber of reported dysregulated gene
Unpublished data  HG-U133.Plus.2.0a N: 4, T: 12 806 
Bose and colleagues 2009 HG-U133Ab N: 3, T: 25 1963 
Fang and colleagues 2008 CSC-GE-80c N: 1 pool (24 normal), T: 8 pools (32 tumor) 140 
Zeng and colleagues 2007 Biostar-H80sd N: 10, T: 23 503 
Shi and colleagues 2006 HG-U133A N: 6, T: 14 1089 
Sengupta and colleagues 2006 HG-U133.Plus.2.0 N: 10, T: 31 831 
Sriuranpong and colleagues 2004 Human GEM2 cDNA arraye N: 7, T: 12 485 
Xie and colleagues 2000 Atlas human cancer cDNA expression arrayf N: 1 pool (9 normal), T: 1 pool (26 tumor) 46 

N, normal nasopharyngeal tissue; T, NPC tissue.

aThe HG-U133 Plus 2.0 oligonucleotide microarray (Affymetrix) consisted of 54,675 probesets for more than 47,000 transcripts and variants, including 38,500 human genes.

bThe HG-U133A oligonucleotide microarray (Affymetrix) consisted of 22,282 human transcripts.

cThe CSC-GE-80 cDNA array (Shenzhen Chipscreen Biosciences Co Ltd.) consisted of 8,064 human genes.

dThe Biostar-H80s cDNA array (Biostar Gene Technology Co Ltd.) consisted of 8,378 full-length or segmental novel and known human genes.

eThe Human GEM2 cDNA array (Advanced Technology Center, National Cancer Institute) consisted of 9,128 cDNA clones in Human GEM2 cDNA clones (Incyte Genomics Inc.).

fThe Atlas human cancer cDNA expression array (Clontech Laboratories Inc.) consisted of 588 known genes that play crucial roles in tumor biology.

Cytogenetic band analysis

Enrichment analysis for the cytogenetic band was carried out by using WebGestalt toolkit (14, 15). For identification of the enriched cytogenetic band of genes, the whole human genome was used as a reference list. The hypergeometric statistical method test was used, and only statistically enriched terms (P < 0.05) with at least 2 genes were selected.

Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway analysis

Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis was the WebGestalt toolkit. This toolkit includes information from Gene Ontology Tree Machine software (GOTM; ref. 16). WebGestalt queries were carried out with lists of Entrez Gene IDs. For identification of enriched GO and KEGG pathway terms (i.e., GO terms with a number of associated genes significantly higher than expected), the whole human genome was used as a reference list. The hypergeometric statistical method test was used, and only statistically enriched terms (P < 0.05) with at least 2 genes were selected.

IHC

The TMA prepared from NPC and normal tissue specimens collected at the Second Xiangya Hospital (Changsha, Hunan, China) was those reported in our previous studies (11, 17). IHC was carried out using the peroxidase antiperoxidase technique after a microwave antigen retrieval procedure. Antibodies against CENPF (clone 14C10/1D8, dilution 1:200), BUB1B (clone 8G1, dilution 1:100), and SLPI (clone 31, dilution 1:50) were obtained from Abcam. Antibodies against CENPF, BUB1B, and SLPI were overlaid on TMA and incubated overnight at 4°C. After incubation with secondary antibody (Dako Envision+ system, peroxidase conjugate) at room temperature for 30 minutes and staining with DAB (Dako Envision+ system). Two pathologists independently scored the results of IHC staining. Both staining intensity and positive areas of each tissue core on the TMA were recorded. The staining intensity was scored as 0 (no staining), 1 (weak staining exhibited as light yellow), 2 (moderate staining exhibited as yellow brown), or 3 (strong staining exhibited as brown). The extension of staining was scored as 0 (0%), 1 (1%–25%), 2 (26%–50%), or 3 (51%–100%), according to the proportion of positive cells of interest. Cases with a combined staining score (intensity + extension) of 1 or greater were considered positive. For the purpose of statistical evaluation, the expression level of protein was classified that the combined staining score (intensity + extension) of 2 or more was low expression and the score between 3 and 6 was high expression.

Statistical analysis

Associations between pathologic parameters and protein expressions were evaluated using the Pearson χ2 test or Fisher's exact test. Overall survival and disease-specific survival curves were constructed by the Kaplan–Meier method, and the differences between the curves were compared by log-rank tests. Overall survival was calculated from the date of diagnosis of NPC to the date of death. Patients who died as a result of or who had the disease at the time of follow-up were included in the analysis. For disease-specific survival, data for patients who died from other causes were censored at the time of death. Univariate analysis with Cox proportional hazards model was used to determine each identified prognostic factor.

Identification of common significantly dysregulated genes in NPC from gene expression profiling datasets

A total of 3,916 unique and mapped dysregulated genes were differentially expressed in the 8 independent gene expression profiling studies on NPC (Supplementary Fig. S1). From these genes, 2,135 genes were upregulated and 1,781 were downregulated in NPC samples. Then, 962 dysregulated genes (651 upregulated and 311 downregulated), which were identified in more than 1 study were collocated and referred to as the “common significantly dysregulated genes” (CSD genes) in NPC (Supplementary Table S1). Of these CSD genes, 30 upregulated and 6 downregulated genes overlapped in 5 and more studies (Table 2). Four upregulated genes (BUB1B, CCND2, CENPF, and MAD2L1) and 2 downregulated genes (LTF and SLPI) were identified in 6 different studies.

Table 2.

CSD genes in NPC identified in 5 and more studies

Official gene symbolGene nameCytobandEntrez gene ID
Upregulated genes in NPC 
BUB1Ba Budding uninhibited by benzimidazoles 1 homolog β (yeast) 15q15 701 
CCND2a Cyclin D2 12p13 894 
CENPFa Centromere protein F, 350/400 ka (mitosin) 1q32-q41 1063 
MAD2L1a MAD2 mitotic arrest deficient-like 1 (yeast) 4q27 4085 
C12ORF5 Chromosome 12 open reading frame 5 12p13.3 57103 
CAPRIN1 Cell-cycle–associated protein 1 11p13 4076 
CCNA2 Cyclin A2 4q25-q31 890 
CCNB1 Cyclin B1 5q12 891 
CCT6A Chaperonin containing TCP1, subunit 6A (zeta 1) 7p11.2 908 
CDK1 Cell division cycle 2, G1 to S and G2 to M 10q21.1 983 
E2F3 E2F transcription factor 3 6p22 1871 
EZH2 Enhancer of zeste homolog 2 (Drosophila7q35-q36 2146 
FZD7 Frizzled homolog 7 (Drosophila2q33 8324 
GAD1 Blutamate decarboxylase 1 (brain, 67 kDa) 2q31 2571 
GJA1 Gap junction protein, α 1, 43 kDa 6q21-q23.2 2697 
KIF14 Kinesin family member 14 1q32.1 9928 
MELK Maternal embryonic leucine zipper kinase 9p13.2 9833 
NUSAP1 Nucleolar and spindle associated protein 1 15q15.1 51203 
PAICS Phosphoribosylaminoimidazole carboxylase, phosphoribosylaminoimidazole succinocarboxamide synthetase 4q12 10606 
PCNA Proliferating cell nuclear antigen 20pter-p12 5111 
PMAIP1 Phorbol-12-myristate-13-acetate-induced protein 1 18q21.32 5366 
PRC1 Protein regulator of cytokinesis 1 15q26.1 9055 
RBBP8 Retinoblastoma binding protein 8 18q11.2 5932 
RFC3 Replication factor C (activator 1) 3, 38 kDa 13q12.3-q13 5983 
SMC4 Structural maintenance of chromosomes 4 3q26.1 10051 
TMPO Thymopoietin 12q22 7112 
TNFAIP6 Tumor necrosis factor, α-induced protein 6 2q23.3 7130 
TOP2A Topoisomerase (DNA) II α 170 kDa 17q21-q22 7153 
TYMS Thymidylate synthetase 18p11.32 7298 
ZWINT ZW10 interactor 10q21-q22 11130 
Downregulated genes in NPC 
LTFa Lactotransferrin 3p21.31 4057 
SLPIa Secretory leukocyte peptidase inhibitor 20q12 6590 
CLU Clusterin 8p21-p12 1191 
DHCR24 24-Dehydrocholesterol reductase 1p33-p31.1 1718 
PIGR Polymeric immunoglobulin receptor 1q31-q41 5284 
SERPINB6 Serpin peptidase inhibitor, clade B (ovalbumin), member 6 6p25 5269 
Official gene symbolGene nameCytobandEntrez gene ID
Upregulated genes in NPC 
BUB1Ba Budding uninhibited by benzimidazoles 1 homolog β (yeast) 15q15 701 
CCND2a Cyclin D2 12p13 894 
CENPFa Centromere protein F, 350/400 ka (mitosin) 1q32-q41 1063 
MAD2L1a MAD2 mitotic arrest deficient-like 1 (yeast) 4q27 4085 
C12ORF5 Chromosome 12 open reading frame 5 12p13.3 57103 
CAPRIN1 Cell-cycle–associated protein 1 11p13 4076 
CCNA2 Cyclin A2 4q25-q31 890 
CCNB1 Cyclin B1 5q12 891 
CCT6A Chaperonin containing TCP1, subunit 6A (zeta 1) 7p11.2 908 
CDK1 Cell division cycle 2, G1 to S and G2 to M 10q21.1 983 
E2F3 E2F transcription factor 3 6p22 1871 
EZH2 Enhancer of zeste homolog 2 (Drosophila7q35-q36 2146 
FZD7 Frizzled homolog 7 (Drosophila2q33 8324 
GAD1 Blutamate decarboxylase 1 (brain, 67 kDa) 2q31 2571 
GJA1 Gap junction protein, α 1, 43 kDa 6q21-q23.2 2697 
KIF14 Kinesin family member 14 1q32.1 9928 
MELK Maternal embryonic leucine zipper kinase 9p13.2 9833 
NUSAP1 Nucleolar and spindle associated protein 1 15q15.1 51203 
PAICS Phosphoribosylaminoimidazole carboxylase, phosphoribosylaminoimidazole succinocarboxamide synthetase 4q12 10606 
PCNA Proliferating cell nuclear antigen 20pter-p12 5111 
PMAIP1 Phorbol-12-myristate-13-acetate-induced protein 1 18q21.32 5366 
PRC1 Protein regulator of cytokinesis 1 15q26.1 9055 
RBBP8 Retinoblastoma binding protein 8 18q11.2 5932 
RFC3 Replication factor C (activator 1) 3, 38 kDa 13q12.3-q13 5983 
SMC4 Structural maintenance of chromosomes 4 3q26.1 10051 
TMPO Thymopoietin 12q22 7112 
TNFAIP6 Tumor necrosis factor, α-induced protein 6 2q23.3 7130 
TOP2A Topoisomerase (DNA) II α 170 kDa 17q21-q22 7153 
TYMS Thymidylate synthetase 18p11.32 7298 
ZWINT ZW10 interactor 10q21-q22 11130 
Downregulated genes in NPC 
LTFa Lactotransferrin 3p21.31 4057 
SLPIa Secretory leukocyte peptidase inhibitor 20q12 6590 
CLU Clusterin 8p21-p12 1191 
DHCR24 24-Dehydrocholesterol reductase 1p33-p31.1 1718 
PIGR Polymeric immunoglobulin receptor 1q31-q41 5284 
SERPINB6 Serpin peptidase inhibitor, clade B (ovalbumin), member 6 6p25 5269 

aGene was identified in 6 studies.

Chromosomal distribution of CSD genes in NPC

The chromosomal distributions of CSD genes associated with NPC were analyzed in an attempt to identify the genetic hotspots and gene expression patterns of NPC. As shown in Fig. 1, CSD genes were represented as the peaks of gene expression in chromosomes 1, 2, and 7. Chromosome 1 contained 112 (11.64%) CSD genes, followed by chromosomes 2 and 7, which contained 88 (9.15%) and 67 (6.96%) CSD genes, respectively. Chromosome Y did not present any CSD genes. The upregulated CSD genes showed peaks in chromosomes 2 (71/651, 10.91%), 1 (70/651, 10.75%), and 7 (60/651, 9.22%). The downregulated CSD genes exhibited peaks of gene expression in chromosomes 1 (42/311, 13.50%), 3 (27/311, 8.68%), and 6 (21/311, 6.75%). Enrichment analysis for the cytogenetic band revealed the marked enrichments in the upregulated genes residing at 2q, 2q23, 2q31, 7p, 7p15, 12q, 12q15, 12q22, 18q, 18q11, and 18q12 and in the downregulated genes residing at 14q32 and 16q13 (Supplementary Table S2).

Figure 1.

Chromosomal distribution of CSD genes in NPC. Black bar plots represent the upregulated genes. White bar plots represent the downregulated genes.

Figure 1.

Chromosomal distribution of CSD genes in NPC. Black bar plots represent the upregulated genes. White bar plots represent the downregulated genes.

Close modal

GO analysis of CSD genes in NPC

By WebGestalt, 120, 120, and 77 GO categories were enriched in CSD genes, upregulated genes, and downregulated genes in NPC, respectively (Supplementary Table S3). The Directed Acyclic Graph (DAG) diagram visualized the close interaction of the enriched GO terms (Supplementary Fig. S2). As shown in Fig. 2, 5 main enriched biological processes of CSD genes were outlined: cell cycle and proliferation, DNA replication, development, immune process, and response to stimulus. The most significantly enriched biological processes of upregulated genes were related to the cell cycle and proliferation, DNA replication, and development. The top 5 enriched biological processes in the downregulated genes' list were related to immune system process and response to stimulus, including the humoral immune response, protein maturation by peptide bond cleavage, the activation of plasma proteins involved in acute inflammatory response, the protein maturation, and the defense response. The dominant molecular function of CSD and upregulated genes was “binding,” especially protein binding and nucleotide binding, whereas that of downregulated genes was “oxidoreductase activity.”

Figure 2.

The DAG diagram of the GO terms from the Biological Process ontology enriched in the CSD genes (A), upregulated genes (B), or downregulated genes (C) in NPC when compared with the human genome. Analysis was carried out by using WebGestalt toolkit. Only the enriched GO sets (hypergeometric; P < 0.05) containing 2 or more genes were included in the analysis. Enriched functions of genes were outlined.

Figure 2.

The DAG diagram of the GO terms from the Biological Process ontology enriched in the CSD genes (A), upregulated genes (B), or downregulated genes (C) in NPC when compared with the human genome. Analysis was carried out by using WebGestalt toolkit. Only the enriched GO sets (hypergeometric; P < 0.05) containing 2 or more genes were included in the analysis. Enriched functions of genes were outlined.

Close modal

KEGG pathway analysis of CSD genes in NPC

To identify the molecular events involving in the tumorigenesis of NPC, CSD genes were overlaid onto the KEGG pathway database using WebGestalt. This analysis identified 115, 78, and 56 KEGG pathways containing 2 or more CSD, upregulated, or downregulated genes in NPC, respectively (Supplementary Table S4). The functions of metabolic pathways have the highest significance with respect to enrichment in 110 CSD, 66 upregulated, and 44 downregulated genes (P = 1.60E-39, 6.11E-21, and 5.95e-19, respectively).

Several clusters of CSD genes associated with major cancer entities were overrepresented, which indicates a common oncogenic basis: pathways in cancer (50 genes, P = 6.00E-26), small cell lung cancer (25 genes, P = 1.83E-20), glioma (13 genes, P = 1.07E-08), colorectal cancer (13 genes, P = 2.29E-07), prostate cancer (13 genes, P = 4.39E-07), chronic myeloid leukemia (12 genes, P = 4.77E-07), melanoma (11 genes, P = 1.79E-06), pancreatic cancer (11 genes, P = 1.98E-06), bladder cancer (8 genes, P = 1.24E-05), non–small cell lung cancer (8 genes, P = 7.87E-05), renal cell carcinoma (8 genes, P = 0.0003), endometrial cancer (6 genes, P = 0.0019), basal cell carcinoma (5 genes, P = 0.0101), acute myeloid leukemia (5 genes, P = 0.0139), and thyroid cancer (4 genes, P = 0.0057).

Besides cancer pathways, KEGG pathways of CSD genes involved in classical cancer development were significantly enriched, such as cell cycle (41 genes, P = 7.34E-35), focal adhesion (27 genes, P = 5.72E-13), ECM-receptor interaction (20 genes, P = 2.02E-14), p53 (19 genes, P = 6.65E-15), MAPK (18 genes, P = 9.29E-05), apoptosis (15 genes, P = 6.44E-09), insulin signaling (15 genes, P = 1.57E-06), cell adhesion molecules (14 genes, P = 5.92E-06), Toll-like receptor (13 genes, P = 1.56E-06), Jak-STAT (12 genes, P = 0.0003), complement and coagulation cascades (11 genes, P = 1.51E-06), and the Wnt signaling pathway (10 genes, P = 0.0032). It was notable that the complement and coagulation cascades pathway was significantly enriched in 8 downregulated gene C3, C4A, CD55, CFB, CFH, CR2, F3, and PROS1 (P = 1.17E-06), and also in 3 upregulated gene C1QB, PLAU, and PLAUR.

BUB1B, CENPF, and SLPI protein expression by IHC

To validate the results of CSD genes in NPC from gene expression profiling datasets, we evaluated the expression of BUB1B, CENPF, and SLPI proteins in a wide range of NPC tissues to characterize their expressions in situ by IHC using the NPC TMA (Fig. 3). BUB1B, CENPF, and SLPI were the dysregulated genes in NPC identified in 6 microarray studies, but the details of protein expressions in NPC were still lacking. Consistent with the mRNA transcript data, there were remarkable staining differences of BUB1B and CENPF protein between NPC tissues and chronic inflammation of nasopharyngeal mucosa and normal adjacent epithelia of NPC (P < 0.001). BUB1B protein was detected in 367 of 540 (67.96%) NPC tissues, 12 of 70 (17.14%) chronic inflammation of nasopharyngeal mucosa, and 15 of 100 (15.00%) normal adjacent epithelia of NPC. The positive expression rates of CENPF protein were 69.25% (374/540) in NPC tissues, 21.42% (15/70) in chronic inflammation of nasopharyngeal mucosa, and 17.00% (17/100) in normal adjacent epithelia of NPC. IHC revealed that SLPI protein was significantly expressed in glands of normal nasopharyngeal mucosa and normal adjacent epithelia of NPC (109/109, 100%), but was underexpressed in nasopharyngeal adenocarcinoma (4/21, 19.05%, P < 0.001), which is a rare histologic type of nasopharyngeal neoplasm. There was no significant difference between nasopharyngeal squamous cell carcinoma (13/519, 2.50%) and squamous and columnar epithelia cells of normal nasopharyngeal mucosa and the normal adjacent epithelia of NPC (3/170, 1.76%, P = 0.8026).

Figure 3.

Protein expressions of BUB1B, CENPF, and SLPI in NPC TMA. BUB1B weakly expressed in chronic inflammation of nasopharyngeal mucosa (A) and strongly expressed in nonkeratinizing NPC (B). CENPF showed weak expression chronic inflammation of nasopharyngeal mucosa (C) and strong staining in nonkeratinizing NPC (D). SLPI protein significantly expressed in glands of normal nasopharyngeal mucosa (E) and downexpressed in nasopharyngeal adenocarcinoma (F). Scar bar = 100 μm.

Figure 3.

Protein expressions of BUB1B, CENPF, and SLPI in NPC TMA. BUB1B weakly expressed in chronic inflammation of nasopharyngeal mucosa (A) and strongly expressed in nonkeratinizing NPC (B). CENPF showed weak expression chronic inflammation of nasopharyngeal mucosa (C) and strong staining in nonkeratinizing NPC (D). SLPI protein significantly expressed in glands of normal nasopharyngeal mucosa (E) and downexpressed in nasopharyngeal adenocarcinoma (F). Scar bar = 100 μm.

Close modal

Prognostic value of BUB1B and CENPF in NPC

To investigate whether BUB1B and CENPF protein expression are associated with the outcome of NPC, survival curves were calculated for 103 cases of NPC patients. The clinical characteristics of the NPC patients were described in detail in Supplementary Table S5. The mean follow-up period was 46.4 ± 23.2 months with a range from 3 to 96 months. Survival curves calculated by the Kaplan–Meier method and analyzed using the log-rank test showed that the survival rates of the patients with high expression of BUB1B (P = 0.000) and CENPF (P = 0.000) were significantly decreased (Fig. 4). Using univariate analysis (Cox's proportional hazards model), the high expression of BUB1B (P = 0.000, 95% CI: 1.817–6.340) and CENPF (P = 0.000, 95% CI: 2.219–8.085) were significantly associated with the prognosis.

Figure 4.

Kaplan–Meier survival curves according to expression levels of BUB1B and CENPF in NPC.

Figure 4.

Kaplan–Meier survival curves according to expression levels of BUB1B and CENPF in NPC.

Close modal

The conceptual basis for the present study is the observation that the genomic-wide molecular signature predicts the development and progression of NPC. NPC, like other cancers, develop in a multistep process involving alterations of numerous genes (18). Recently, using different array technologies and tissue samples, several groups have reported dysregulated genes in NPC (3–11). In this study, we have investigated and integrated the global gene expression profiling of NPC from 8 different studies for the first time. Multiplatform integrative analysis of gene expression profiling datasets provides an opportunity to study in detail the molecular mechanisms regulating the development and progression of NPC. Furthermore, the identified significantly dysregulated gene in NPC represent promising biomarkers and targets for the clinical diagnosis and therapy.

Despite the substantial progress made by different investigators during recent years, the knowledge of the molecular signature of NPC and the factors that induce tumorgenesis remains limited. Using gene expression profiling and integrative criteria, 962 CSD genes (651 upregulated and 311 downregulated genes) in NPC were identified multiple studies.

Chromosome mapping of CSD genes may be prove to be a useful tool in the detection of the genetic hotspots of dysregulated gene expression patterns in NPC. Gains of 2q31 (19), 7p15 (19), 12q (19–22), and 18q (19, 21) in NPC have been previously reported. Chromosome 12 has been reported to undergo a gain of its long arm in NPC, with the common band region indicating gain at 12q13-q15, and high-copy-number increases of chromosomal materials in 12q14-q15 (21). Interestingly, on chromosomes 2q23, 7p15, 12q15, 12q22, 18q11, and 18q12, the enriched genes all belonged to the upregulated genes in NPC. Previous studies suggested that loss of heterozygosity on chromosome 14q32 was the common genetic event in NPC (23–27). The enriched genes on 14q32 were CLMN, CRIP2, AKT1, AHNAK2, ZFYVE21, IGH@, IGHG1, IGHG3, all belonged to the downregulated genes in NPC. There are no reports for gain of 2q23 and loss of 16q13 in regard to the relationship to NPC.

Functional gene annotation of GO analysis and pathway analysis provided further insight into the expression profiling of CSD genes in NPC. The functions of CSD genes were enriched in the biological processes and pathways involving classical cancer development and progression. As expected, the most significantly enriched biological processes and pathways of upregulated genes in NPC were related to cell proliferation, adhesion, and invasion, which was common in all cancers.

Notably, 40 and 89 downregulated genes were enriched in the GO terms, the immune system process and the response to stimulus, respectively. Tumor growth, invasion, and metastasis are important aspects of the tumor immune escape response. As mentioned previously, downregulation or loss of expression of immune molecules are strategies used by tumor cells to escape the immune system. Complement components play an important role in regulation of inflammation and the immune response. In this study, we found 8 downregulated genes, C3, C4A, CD55, CFB, CFH, CR2, F3, and PROS1, and 3 upregulated genes, C1QB, PLAU, and PLAUR involved in the GO term “complement activation,” or the complement and coagulation cascades pathway. These dysregulated genes suggested that the immune response was silenced in NPC. In addition, NPC is closely associated with persistent Epstein–Barr virus (EBV) infection, which has evolved a plethora of strategies to evade immune system recognition and to establish latent infection in memory B cells (28). Seven downregulated genes, CR2, C3, CLU, CD55, FOXJ1, C4A, and BCL6 were enriched in the B-cell–mediated immunity response.

It is not surprising that many of CSD genes have been reported to be associated with a variety of distinct human cancer types in the literature. Several of the dysregulated genes identified in 5 and more studies, such as MAD2L1 (29), CCND2 (30), PCNA (31), CDK1 (32), CCNB1 (33), GJA1 (34), E2F3 (35), CCNA2 (36), TYMS (37), C12ORF5 (38), TMPO (39), FZD7 (11), LTF (40), CENPF (41), and PIGR (42), have been explored in their involvement with NPC. However, the studies of the expression and molecular mechanisms of the upregulated gene BUB1B and the downregulated gene SLPI involved in NPC were still lacking.

BUB1B, a member of the BUB (budding uninhibited by benzimidazole) gene family, is one of the key molecules in the spindle assembly checkpoint. High expression of BUB1B has often been reported in several malignancies and correlated with chromosomal instability (43, 44). In this study, our results showed for the first time that BUB1B protein was upregulated in NPC compared with chronic inflammation of nasopharyngeal mucosa and normal adjacent epithelia of NPC, and that the high expression of BUB1B was associated with the prognosis of NPC patients. The data suggest that BUB1B might be a prognostic biomarker and potential therapeutic target for NPC.

Previous studies have shown that SLPI (secretory leukocyte peptidase inhibitor) is produced by the epithelial cells lining the respiratory, digestive, and reproductive tracts, and protect proepithelin from elastase cleavage in wound healing (45). SLPI inhibits cell growth through the apoptotic pathway in ovarian cancer (46) and suppresses cancer cell invasion but promotes blood-borne metastasis via an invasion-independent pathway (47). In our work, the results showed for the first time that SLPI protein was significantly expressed in the glands of normal nasopharyngeal mucosa and normal adjacent epithelia of NPC but underexpressed in nasopharyngeal adenocarcinoma. There was weak expression of SLPI in squamous and columnar epithelia cells of normal nasopharyngeal tissues and nasopharyngeal squamous cell carcinoma. Although the normal nasopharyngeal tissues used in the microarray analysis generally were the mixtures containing glands and squamous epithelium, the NPC tissues contained few or no glandular structures. The difference of glands in tissues may the main reason for promoting the downregulated expression of SLPI in the microarray datasets. The data suggest that SLPI might be a biomarker and potential therapeutic target for nasopharyngeal adenocarcinoma.

Moreover, we examined the expression of CENPF protein. CENPF, a member of the human centromeric protein family, is involved in centromere formation and kinetochore organization during mitosis (48). Cao and colleagues reported that CENPF, which were identified from their cDNA microarray data, is a valuable marker of NPC progression (41). The protein expression of CENPF is associated with poor overall survival of patients. In our study, CENPF was expressed in 374 (69.25%) of 540 NPC tissues and the high expression of CENPF was associated with the survival of NPC patients, according to the previous study (41). Thus, these data suggest that CENPF might be a biomarker for the carcinogenesis and prognosis of NPC and may have therapeutic implications.

One limitation of our study is that it only directly integrates differentially expressed genes in the datasets generated from the different array platforms. Even with this limitation, we still obtain a large number of common dysregulated genes (962 genes) in NPC. With the rapid increase of available microarray studies of NPC, the amount of gene expression profiling data generated from the different platforms will continue to grow, which will make our results of the integration analysis more useful.

In conclusion, by integrated analysis of multiple gene expression profiling datasets, the common gene signatures and molecular markers of NPC have been identified. The marked enrichments of chromosome aberrations in the upregulated genes were 2q23, 2q31, 7p15, 12q15, 12q22, 18q11, and 18q12, and in the downregulated genes were 14q32 and 16q13. The activated category related to cell proliferation, adhesion, and invasion and the downregulated immune response have been functionally related to the progression of NPC. Significant overexpression of BUB1B and CENPF in NPC and downregulation of SLPI in nasopharyngeal adenocarcinoma were identified. Furthermore, survival curves and univariate analysis revealed that the expression of BUB1B or CENPF was the prognostic indicator for survival of the patients. Further studies of the CSD genes as gene signatures and molecular markers of NPC might improve our understanding of NPC and identify new drug targets of clinical treatment.

No potential conflicts of interest were disclosed.

This work was supported by Hunan Province Natural Sciences Foundations (10JJ7003; 11JJ1013).

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1.
Chang
ET
,
Adami
HO
. 
The enigmatic epidemiology of nasopharyngeal carcinoma
.
Cancer Epidemiol Biomarkers Prev
2006
;
15
:
1765
77
.
2.
Garcia-Escudero
R
,
Paramio
JM
. 
Gene expression profiling as a tool for basic analysis and clinical application of human cancer
.
Mol Carcinog
2008
;
47
:
573
9
.
3.
Bose
S
,
Yap
LF
,
Fung
M
,
Starzcynski
J
,
Saleh
A
,
Morgan
S
, et al
The ATM tumour suppressor gene is down-regulated in EBV-associated nasopharyngeal carcinoma
.
J Pathol
2009
;
217
:
345
52
.
4.
Fang
W
,
Li
X
,
Jiang
Q
,
Liu
Z
,
Yang
H
,
Wang
S
, et al
Transcriptional patterns, biomarkers and pathways characterizing nasopharyngeal carcinoma of Southern China
.
J Transl Med
2008
;
6
:
32
.
5.
Liu
ZQ
,
Tian
YQ
,
Hu
YF
,
Li
XL
,
Ma
FR
,
Li
GY
. 
Alteration of gene expression during nasopharyngeal carcinogenesis revealed by oligonucleotide microarray after microdissection of tumor tissue and normal epithelia from nasopharynx
.
Chin Med J (Engl)
2009
;
122
:
437
43
.
6.
Sengupta
S
,
den Boon
JA
,
Chen
IH
,
Newton
MA
,
Dahl
DB
,
Chen
M
, et al
Genome-wide expression profiling reveals EBV-associated inhibition of MHC class I expression in nasopharyngeal carcinoma
.
Cancer Res
2006
;
66
:
7999
8006
.
7.
Shi
W
,
Bastianutto
C
,
Li
A
,
Perez-Ordonez
B
,
Ng
R
,
Chow
KY
, et al
Multiple dysregulated pathways in nasopharyngeal carcinoma revealed by gene expression profiling
.
Int J Cancer
2006
;
119
:
2467
75
.
8.
Sriuranpong
V
,
Mutirangura
A
,
Gillespie
JW
,
Patel
V
,
Amornphimoltham
P
,
Molinolo
AA
, et al
Global gene expression profile of nasopharyngeal carcinoma by laser capture microdissection and complementary DNA microarrays
.
Clin Cancer Res
2004
;
10
:
4944
58
.
9.
Xie
L
,
Xu
L
,
He
Z
,
Zhou
W
,
Wang
L
,
Zhang
L
, et al
Identification of differentially expressed genes in nasopharyngeal carcinoma by means of the Atlas human cancer cDNA expression array
.
J Cancer Res Clin Oncol
2000
;
126
:
400
6
.
10.
Zeng
Z
,
Zhou
Y
,
Xiong
W
,
Luo
X
,
Zhang
W
,
Li
X
, et al
Analysis of gene expression identifies candidate molecular markers in nasopharyngeal carcinoma using microdissection and cDNA microarray
.
J Cancer Res Clin Oncol
2007
;
133
:
71
81
.
11.
Zeng
ZY
,
Zhou
YH
,
Zhang
WL
,
Xiong
W
,
Fan
SQ
,
Li
XL
, et al
Gene expression profiling of nasopharyngeal carcinoma reveals the abnormally regulated Wnt signaling pathway
.
Hum Pathol
2007
;
38
:
120
33
.
12.
Dennis
G
 Jr
,
Sherman
BT
,
Hosack
DA
,
Yang
J
,
Gao
W
,
Lane
HC
, et al
DAVID: Database for Annotation, Visualization, and Integrated Discovery
.
Genome Biol
2003
;
4
:
P3
.
13.
Huang da
W
,
Sherman
BT
,
Lempicki
RA
. 
Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources
.
Nat Protoc
2009
;
4
:
44
57
.
14.
Duncan
D
,
Prodduturi
N
,
Zhang
B
. 
WebGestalt2: an updated and expanded version of the Web-based Gene Set Analysis Toolkit
.
BMC Bioinformatics
2010
;
11
(
Suppl 4
):
10
.
15.
Zhang
B
,
Kirov
S
,
Snoddy
J
. 
WebGestalt: an integrated system for exploring gene sets in various biological contexts
.
Nucleic Acids Res
2005
;
33
:
W741
8
.
16.
Zhang
B
,
Schmoyer
D
,
Kirov
S
,
Snoddy
J
. 
GO Tree Machine (GOTM): a web-based platform for interpreting sets of interesting genes using Gene Ontology hierarchies
.
BMC Bioinformatics
2004
;
5
:
16
.
17.
Ma
J
,
Zhou
J
,
Fan
S
,
Wang
L
,
Li
X
,
Yan
Q
, et al
Role of a novel EGF-like domain-containing gene NGX6 in cell adhesion modulation in nasopharyngeal carcinoma cells
.
Carcinogenesis
2005
;
26
:
281
91
.
18.
Chou
J
,
Lin
YC
,
Kim
J
,
You
L
,
Xu
Z
,
He
B
, et al
Nasopharyngeal carcinoma–review of the molecular mechanisms of tumorigenesis
.
Head Neck
2008
;
30
:
946
63
.
19.
Tao
Q
,
Chan
AT
. 
Nasopharyngeal carcinoma: molecular pathogenesis and therapeutic developments
.
Expert Rev Mol Med
2007
;
9
:
1
24
.
20.
Chen
YJ
,
Ko
JY
,
Chen
PJ
,
Shu
CH
,
Hsu
MT
,
Tsai
SF
, et al
Chromosomal aberrations in nasopharyngeal carcinoma analyzed by comparative genomic hybridization
.
Genes Chromosomes Cancer
1999
;
25
:
169
75
.
21.
Fang
Y
,
Guan
X
,
Guo
Y
,
Sham
J
,
Deng
M
,
Liang
Q
, et al
Analysis of genetic alterations in primary nasopharyngeal carcinoma by comparative genomic hybridization
.
Genes Chromosomes Cancer
2001
;
30
:
254
60
.
22.
Gebhart
E
. 
Comparative genomic hybridization (CGH): ten years of substantial progress in human solid tumor molecular cytogenetics
.
Cytogenet Genome Res
2004
;
104
:
352
8
.
23.
Cheng
Y
,
Ko
JM
,
Lung
HL
,
Lo
PH
,
Stanbridge
EJ
,
Lung
ML
. 
Monochromosome transfer provides functional evidence for growth-suppressive genes on chromosome 14 in nasopharyngeal carcinoma
.
Genes Chromosomes Cancer
2003
;
37
:
359
68
.
24.
Mutirangura
A
,
Pornthanakasem
W
,
Sriuranpong
V
,
Supiyaphun
P
,
Voravud
N
. 
Loss of heterozygosity on chromosome 14 in nasopharyngeal carcinoma
.
Int J Cancer
1998
;
78
:
153
6
.
25.
Shao
J
,
Li
Y
,
Wu
Q
,
Liang
X
,
Yu
X
,
Huang
L
, et al
High frequency loss of heterozygosity on the long arms of chromosomes 13 and 14 in nasopharyngeal carcinoma in Southern China
.
Chin Med J (Engl)
2002
;
115
:
571
5
.
26.
Shao
JY
,
Huang
XM
,
Yu
XJ
,
Huang
LX
,
Wu
QL
,
Xia
JC
, et al
Loss of heterozygosity and its correlation with clinical outcome and Epstein-Barr virus infection in nasopharyngeal carcinoma
.
Anticancer Res
2001
;
21
:
3021
9
.
27.
Shao
JY
,
Wang
HY
,
Huang
XM
,
Feng
QS
,
Huang
P
,
Feng
BJ
, et al
Genome-wide allelotype analysis of sporadic primary nasopharyngeal carcinoma from southern China
.
Int J Oncol
2000
;
17
:
1267
75
.
28.
Merlo
A
,
Turrini
R
,
Dolcetti
R
,
Martorelli
D
,
Muraro
E
,
Comoli
P
, et al
The interplay between Epstein-Barr virus and the immune system: a rationale for adoptive cell therapy of EBV-related disorders
.
Haematologica
2010
;
95
:
1769
77
.
29.
Cheung
HW
,
Jin
DY
,
Ling
MT
,
Wong
YC
,
Wang
Q
,
Tsao
SW
, et al
Mitotic arrest deficient 2 expression induces chemosensitization to a DNA-damaging agent, cisplatin, in nasopharyngeal carcinoma cells
.
Cancer Res
2005
;
65
:
1450
8
.
30.
Wang
S
,
Li
X
,
Li
ZG
,
Lu
J
,
Fang
WY
,
Ding
YQ
, et al
Gene expression profile changes and possible molecular subtypes in differentiated-type nonkeratinizing nasopharyngeal carcinoma
.
Int J Cancer
2011
;
128
:
753
62
.
31.
Wang
LF
,
Chai
CY
,
Kuo
WR
,
Tai
CF
,
Lee
KW
,
Ho
KY
. 
The prognostic value of proliferating cell nuclear antigen (PCNA) and p53 protein expression in patients with advanced nasopharyngeal carcinoma
.
Acta Otolaryngol
2006
;
126
:
769
74
.
32.
Wang
MQ
,
Liu
YN
,
Zhao
X
,
Zhang
Y
. 
[Anticancer effects of a novel indolinone compound in nasopharyngeal carcinoma cells]
.
Beijing Da Xue Xue Bao
2009
;
41
:
640
4
.
33.
Taheri-Kadkhoda
Z
,
Magnusson
B
,
Svensson
M
,
Mercke
C
,
Bjork-Eriksson
T
. 
Expression modes and clinical manifestations of latent membrane protein 1, Ki-67, cyclin-B1, and epidermal growth factor receptor in nonendemic nasopharyngeal carcinoma
.
Head Neck
2009
;
31
:
482
92
.
34.
Hattori
Y
,
Fukushima
M
,
Maitani
Y
. 
Non-viral delivery of the connexin 43 gene with histone deacetylase inhibitor to human nasopharyngeal tumor cells enhances gene expression and inhibits in vivo tumor growth
.
Int J Oncol
2007
;
30
:
1427
39
.
35.
Peng
C
,
Liu
HY
,
Zhou
M
,
Zhang
LM
,
Li
XL
,
Shen
SR
, et al
BRD7 suppresses the growth of Nasopharyngeal Carcinoma cells (HNE1) through negatively regulating beta-catenin and ERK pathways
.
Mol Cell Biochem
2007
;
303
:
141
9
.
36.
Wang
L
,
Ma
J
,
Li
J
,
Li
X
,
Zhang
Q
,
Peng
S
, et al
NGX6 gene inhibits cell proliferation and plays a negative role in EGFR pathway in nasopharyngeal carcinoma cells
.
J Cell Biochem
2005
;
95
:
64
73
.
37.
Sze
WM
,
Lee
AW
,
Yau
TK
,
Yeung
RM
,
Lau
KY
,
Leung
SK
, et al
Primary tumor volume of nasopharyngeal carcinoma: prognostic significance for local control
.
Int J Radiat Oncol Biol Phys
2004
;
59
:
21
7
.
38.
Lui
VW
,
Lau
CP
,
Cheung
CS
,
Ho
K
,
Ng
MH
,
Cheng
SH
, et al
An RNA-directed nucleoside anti-metabolite, 1-(3-C-ethynyl-beta-d-ribo-pentofuranosyl)cytosine (ECyd), elicits antitumor effect via TP53-induced Glycolysis and Apoptosis Regulator (TIGAR) downregulation
.
Biochem Pharmacol
2010
;
79
:
1772
80
.
39.
Chen
SL
,
Shi
Y
,
Jin
YL
,
Liu
Y
,
Zhao
FT
,
Zhu
LP
. 
[Differential gene expression in nasopharyngeal carcinoma cell with reduced and normal expression of 6A8 alpha-mannosidase]
.
Zhongguo Yi Xue Ke Xue Yuan Xue Bao
2005
;
27
:
305
10
.
40.
Zhou
Y
,
Zeng
Z
,
Zhang
W
,
Xiong
W
,
Wu
M
,
Tan
Y
, et al
Lactotransferrin: a candidate tumor suppressor-deficient expression in human nasopharyngeal carcinoma and inhibition of NPC cell proliferation by modulating the mitogen-activated protein kinase pathway
.
Int J Cancer
2008
;
123
:
2065
72
.
41.
Cao
JY
,
Liu
L
,
Chen
SP
,
Zhang
X
,
Mi
YJ
,
Liu
ZG
, et al
Prognostic significance and therapeutic implications of centromere protein F expression in human nasopharyngeal carcinoma
.
Mol Cancer
2010
;
9
:
237
.
42.
Hirunsatit
R
,
Kongruttanachok
N
,
Shotelersuk
K
,
Supiyaphun
P
,
Voravud
N
,
Sakuntabhai
A
, et al
Polymeric immunoglobulin receptor polymorphisms and risk of nasopharyngeal cancer
.
BMC Genet
2003
;
4
:
3
.
43.
Ando
K
,
Kakeji
Y
,
Kitao
H
,
Iimori
M
,
Zhao
Y
,
Yoshida
R
, et al
High expression of BUBR1 is one of the factors for inducing DNA aneuploidy and progression in gastric cancer
.
Cancer Sci
2010
;
101
:
639
45
.
44.
Lira
RC
,
Miranda
FA
,
Guimaraes
MC
,
Simoes
RT
,
Donadi
EA
,
Soares
CP
, et al
BUBR1 expression in benign oral lesions and squamous cell carcinomas: correlation with human papillomavirus
.
Oncol Rep
2010
;
23
:
1027
36
.
45.
Nukiwa
T
,
Suzuki
T
,
Fukuhara
T
,
Kikuchi
T
. 
Secretory leukocyte peptidase inhibitor and lung cancer
.
Cancer Sci
2008
;
99
:
849
55
.
46.
Nakamura
K
,
Takamoto
N
,
Hongo
A
,
Kodama
J
,
Abrzua
F
,
Nasu
Y
, et al
Secretory leukoprotease inhibitor inhibits cell growth through apoptotic pathway on ovarian cancer
.
Oncol Rep
2008
;
19
:
1085
91
.
47.
Sugino
T
,
Yamaguchi
T
,
Ogura
G
,
Kusakabe
T
,
Goodison
S
,
Homma
Y
, et al
The secretory leukocyte protease inhibitor (SLPI) suppresses cancer cell invasion but promotes blood-borne metastasis via an invasion-independent pathway
.
J Pathol
2007
;
212
:
152
60
.
48.
Perdoni
F
,
Bottone
MG
,
Soldani
C
,
Veneroni
P
,
Alpini
C
,
Pellicciari
C
, et al
Distribution of centromeric proteins and PARP-1 during mitosis and apoptosis
.
Ann N Y Acad Sci
2009
;
1171
:
32
7
.