Abstract
The formation of G-quadruplex structures within the nuclease hypersensitive element (NHE) III1 region of the c-myc promoter and the ability of these structures to repress c-myc transcription have been well established. However, just how these extremely stable DNA secondary structures are transformed to activate c-myc transcription is still unknown. NM23-H2/nucleoside diphosphate kinase B has been recognized as an activator of c-myc transcription via interactions with the NHE III1 region of the c-myc gene promoter. Through the use of RNA interference, we confirmed the transcriptional regulatory role of NM23-H2. In addition, we find that further purification of NM23-H2 results in loss of the previously identified DNA strand cleavage activity, but retention of its DNA binding activity. NM23-H2 binds to both single-stranded guanine- and cytosine-rich strands of the c-myc NHE III1 and, to a lesser extent, to a random single-stranded DNA template. However, it does not bind to or cleave the NHE III1 in duplex form. Significantly, potassium ions and compounds that stabilize the G-quadruplex and i-motif structures have an inhibitory effect on NM23-H2 DNA-binding activity. Mutation of Arg88 to Ala88 (R88A) reduced both DNA and nucleotide binding but had minimal effect on the NM23-H2 crystal structure. On the basis of these data and molecular modeling studies, we have proposed a stepwise trapping-out of the NHE III1 region in a single-stranded form, thus allowing single-stranded transcription factors to bind and activate c-myc transcription. Furthermore, this model provides a rationale for how the stabilization of the G-quadruplex or i-motif structures formed within the c-myc gene promoter region can inhibit NM23-H2 from activating c-myc gene expression. [Mol Cancer Ther 2009;8(5):1363–77]
Introduction
Several nuclease hypersensitive elements (NHE) have been identified, which play important roles in the regulation of c-myc transcription (1). Notably, one of these, the NHE III1, has been the focus of considerable research over the past two decades (2–4) because of its major role in controlling 75% to 95% of the total c-myc transcription (Fig. 1; refs. 5, 6). In vitro characterization of the DNA within this region revealed that it is capable of engaging in a slow equilibrium between a typical Watson-Crick base-paired double helix and both single-stranded and atypical DNA secondary structures (7). Specifically, oligonucleotides representing the purine- and pyrimidine-rich strands have been shown to adopt G-quadruplex and i-motif structures, respectively (8–13). In addition to the DNA structural transitions within the NHE III1, many human protein factors that recognize this element either in vitro or in vivo have also been documented, including hnRNP A1, A2, and B1 (14), hnRNP K (14–16), NM23-H2/nucleoside diphosphate (NDP) kinase B (3, 17), CNBP (18), NSEP-1 (19), Sp1 (20, 21), Sp3 (21, 22), CTCF (23), MAZi (24), THZif-1 (25, 26), and c-MYB (27, 28). It is interesting to note that only a small number of these proteins exclusively recognize the duplex B-DNA conformation of the NHE III1, whereas the majority of these proteins bind sequence specifically to either the purine-rich or the pyrimidine-rich strand in unwound or non-B-DNA conformations. On the basis of these and several other observations, many models have been suggested to explain the details of c-myc transcription with respect to the NHE III1 region; however, none of these models have been confirmed.
NM23-H2/NDP kinase B (from the list above; hereafter called NM23-H2) comes from a family of proteins that have been known for decades as housekeeping enzymes that catalyze the transfer of α-phosphate between nucleoside triphosphates and diphosphates (reviewed in ref. 29). The genes that encode NDP kinases have been sequenced and cloned from bacteria to humans and all have displayed significant sequence (∼44% from Escherichia coli to human) and structural homology. However, despite their similarity in sequence and monomeric structure, variations occur in the tertiary structures of NDP kinases. Most bacterial enzymes are tetrameric, whereas all known eukaryotic enzymes have been shown to form hexamers (29, 30). In general, NDP kinases have been acknowledged as a large family of highly conserved proteins that participate in nucleotide metabolism, yet the actual biological significance of their enzymatic activity has remained elusive.
Over the past decade, extensive experimental evidence has suggested that the biological activities of NDP kinases extend beyond their originally described enzymatic means as phosphotransferases, functioning in several other unrelated regulatory roles including, but not limited to, development and differentiation, proliferation, metastasis, and apoptosis. In humans, eight different NDP kinase genes, designated nm23-H1 to nm23-H8, have been sequenced and characterized (reviewed in ref. 31). The most studied and most abundant isoforms in human cells are NM23-H1 and NM23-H2, which are encoded by the genes nm23-H1 and nm23-H2, respectively. The nm23-H1 gene was cloned by Steeg et al. based on its potential as a metastasis suppressor gene (32), whereas NM23-H2 was isolated as a result of sequence homology to NM23-H1 (33). Since their discovery, a variety of metastasis model systems have shown that expression levels of NM23-H1 and, to a lesser extent, NM23-H2 inversely correlate with metastatic potential (34–38).
A potential explanation for some of the observed regulatory properties of NDP kinases was initially described when the purine-binding transcription factor, which activates transcription of the c-myc oncogene via the NHE III1 region, was unexpectedly identified as NM23-H2 (17). Subsequently, NM23-H2 was shown to possess equivalent DNA-binding properties to those of the purine-binding transcription factor and the ability to stimulate transcription from the c-myc promoter both in vitro and in cell transfection assays (6, 17, 39). Both of these activities are independent from the originally discovered phosphotransferase activity of NM23-H2, because the DNA-binding and transcriptional activation properties are retained in a catalytically inactive mutant of NM23-H2 (H118F; ref. 40). In addition, NM23-H2 has been shown to localize within the nucleus and associate with chromatin, which is in agreement with its putative role in transcriptional regulation (41, 42). However, negative results from a study by Michelotti et al. suggest that NM23-H2 is not a conventional transcription factor because it lacks a characteristic transcriptional activation domain and is unable to stimulate c-myc transcription on its own (43).
Despite the many reports, controversy still remains in terms of the molecular mechanisms behind the activation of c-myc transcription by NM23-H2. Thus far, the DNA-binding properties of NM23-H2 have been somewhat perplexing, given that disparity occurs depending on the DNA sequence, the enzyme source, and the assay employed. Indeed, in vitro and in vivo footprinting experiments have both shown sequence-specific interactions between NM23-H2 and the c-myc NHE III1 (3, 39), which suggests a defined consensus binding sequence for NM23-H2. On the other hand, NM23-H2 has also been shown to bind to both single-stranded and duplex portions of the c-myc NHE III1 (44, 45) and to bind with high affinity to single-stranded DNA in a non-sequence-specific manner (46–48). In addition, the affinity of NM23-H2 has been shown to depend on the length of the DNA substrate in that longer c-myc NHE III1–containing DNA fragments bind better than shorter fragments (45). These results indicate that the DNA recognition properties of NM23-H2 may be related to the structural conformation of DNA rather than to a specific DNA sequence. In addition, to further complicate matters, in the course of DNA-binding studies, it became evident that NM23-H2 apparently exhibits an intrinsic nuclease activity, wherein NM23-H2 reversibly cleaves duplex DNA substrates containing the c-myc NHE III1 and produces site-specific double-stranded breaks within the repeated polypurine/polypyrimidine sequence of the NHE III1 (49). This has been proposed to be an active site-directed reaction, in which a covalent protein–DNA complex is formed between Lys12, a residue located in the nucleotide-binding site of NM23-H2, and the DNA phosphodiester backbone (50). Moreover, in relation to c-myc transcription, it has been suggested that NM23-H2, through the breaking and rejoining of DNA strands, might be involved in the alteration or removal of unusual DNA conformations within the c-myc NHE III1 (45). At present, two closely related models of how genes are regulated by NM23-H2 have been presented. Both allow for DNA topological changes or the elimination of DNA secondary structures through protein–DNA interactions; however, the mechanisms governing these transformations differ, given that one involves DNA local unwinding, whereas the other entails DNA strand cleavage (45).
The current study focuses on evaluating the biochemical mechanisms in connection with the two aforementioned models of how NM23-H2 activates c-myc transcription. In addition to providing supplementary evidence for the ability of NM23-H2 to specifically activate c-myc gene expression through the use of RNA interference technologies, we also show that the addition of a size-exclusion chromatography step or a heparin affinity chromatography step to the purification of recombinant NM23-H2 protein leads to separation of the nuclease activity from the NM23-H2-containing fractions, which seemingly eliminates the DNA strand cleavage as a potential mechanism for c-myc transcriptional activation by NM23-H2. However, the single-stranded DNA-binding activity of NM23-H2 is preserved after either of these additional purification steps. Therefore, on the basis of our DNA-binding data and molecular modeling studies, we propose a new model wherein the transcriptional activation of c-myc by NM23-H2 is a result of its ability to stabilize or trap-out the c-myc NHE III1 region in a single-stranded form, subsequently allowing single-strand-specific transcription factors, such as hnRNP K and CNBP, to recognize their DNA-binding sites more efficiently. In addition, this model provides a rationale for how ligand-induced stabilization of the DNA secondary structures formed within the NHE III1 region of the c-myc gene promoter inhibits NM23-H2 from activating c-myc gene expression.
Materials and Methods
RNA Interference
Commercially available NM23-H2 small interfering RNA (siRNA) SMARTpool and siRNA SMARTpool nontargeting control were purchased from Dharmacon. Both siRNAs were transiently transfected into HeLa cells at a final concentration of 10 nmol/L using siLentFect reagent (Bio-Rad) according to the manufacturer's instructions. Cells were harvested 24 and 48 h post-transfection. Transcript and protein levels were determined by real-time reverse transcription-PCR and Western blot, respectively.
Quantitative Real-time Reverse Transcription-PCR
After siRNA transfection, total RNA was extracted from cultured cells using the Nucleospin RNA extraction kit (Clontech) according to the manufacturer's instructions. First-strand cDNA synthesis was then done with 1 μg total RNA using the iScript cDNA Synthesis Kit (Bio-Rad). Quantitative real-time PCR was done in triplicate in 16 μL reaction volumes containing iQ SYBR Green Supermix (Bio-Rad), 200 nmol/L of each primer, and 1 μL cDNA using the iCycler MyiQ real-time PCR detection system (Bio-Rad). The following primers were used: NM23-H2 forward 5′-GGGCTGAACGTGGTGAAGAC-3′ and reverse 5′-TTCAGGCTTAAACCATAGGC-3′, NM23-H1 forward 5′-GGGTCTTGTGGGAGAGATTATC-3′ and reverse 5′-GACGGTCCTTCAGGTCAACG-3′, c-myc forward 5′-GCTGCTTAGACGCTGGATT-3′ and reverse 5′-TCCTCCTCGTCGCAGTAGA-3′, and β-actin forward 5′-CTGGAACGGTGAAGGTGACA-3′ and reverse 5′-AAGGGACTTCTGTAACAACGCA-3′. For each set of primers, melting curve analysis yielded a single peak consistent with one PCR product.
Western Blot
After siRNA transfection, HeLa cells were lysed with 1% NP-40 lysis buffer. Equal amounts of protein (50 μg/lane) were resolved on a 4% to 15% SDS-polyacrylamide gel. The proteins were then transferred to nitrocellulose membranes and probed with human anti-NM23-H2 antibody (1:1,000; Seikagaku). Signal detection was carried out using a secondary antibody conjugated to horseradish peroxidase (1:3,000; Bio-Rad) and an enhanced chemiluminescence kit (Cell Signaling Technology).
Recombinant Protein and Purification
The wild-type and R88A mutant NM23-H2 expression vectors were both provided by Dr. Edith Postel's laboratory (17). Briefly, expression was in E. coli strain BL21(DE3)pLysS followed by ammonium sulfate fractionation and DEAE and hydroxyapatite column chromatography steps. An additional size-exclusion chromatography step on a HiPrep 26/60 Sephacryl S-100 high-resolution column (GE Healthcare) was added that led to separation of nuclease activity from the NM23-H2-containing fraction. The wild-type protein displayed excellent NDP kinase activity in a coupled enzyme assay involving pyruvate kinase and lactate dehydrogenase as described previously (44). The protein concentration was determined by absorbance at 280 nm using a calculated extinction coefficient of 1.3 mL/cm/mg.
Preparation and End-Labeling of Oligonucleotides
The oligonucleotides Pu47, 5′-d(GGGGCGCTTATGGGGAGGGTGGGGAGGGTGGGGAAGGTGGGGAGGAG)-3′, and Py47, 5′-d(CTCCTCCCCACCTTCCCCACCCTCCCCACCCTCCCCATAAGCGCCCC)-3′, were 5′-end-labeled using T4 polynucleotide kinase. Unincorporated radioactive nucleotides were removed using a Bio-Spin 6 chromatography column (Bio-Rad) after inactivation of the kinase by heating for 5 min at 95°C. Labeled single-stranded oligonucleotides were purified using denaturing gel electrophoresis (16% PAGE). For construction of double-stranded oligonucleotides, labeled single-stranded oligonucleotides were annealed with their complementary strand and then purified using nondenaturing gel electrophoresis (16% PAGE).
Gel Mobility Shift Assay
Labeled DNA substrates were mixed with varying amounts of recombinant NM23-H2 (as specified in the figures) in reaction buffer [12 mmol/L HEPES, 4 mmol/L Tris-HCl (pH 7.4), 1 mmol/L EDTA, 5% glycerol, 1 mmol/L DTT, 0.5 μg bovine serum albumin, 1.5 mmol/L MgCl2] for ∼20 min at 4°C in a total volume of 20 μL. Reactions containing G-quadruplex-interactive agents or KCl were incubated for 1 h at 25°C before the addition of NM23-H2. To separate the protein-DNA complexes from free DNA, the reactions were electrophoresed on 5% native polyacrylamide gels in the presence of 0.5 TBE at 4°C for ∼1 h at 150 V. All gel mobility shift assays were done three times. Representative gels are shown in the figures.
DNA Strand Breakage Assay
Reactions were prepared as described above for gel mobility shift assays using a 47-mer duplex (ds47). After incubation, the reactions were terminated by adding an equal volume of alkaline stop buffer (95% formamide, 10 mmol/L EDTA, 10 mmol/L NaOH, 0.1% xylene cyanol, 0.1% bromophenol blue). The samples were subsequently heated to 95°C for 5 min and analyzed on a 16% sequencing PAGE gel.
Surface Plasmon Resonance
Biotinylated G-strand (Pu44; 5′-CGCTTATGGGGAGGGTGGGGAGGGTGGGGAAGGTGGGGAGGAGA-biotin-3′) and C-strand (Py44; 5′-biotin-TCTCCTCCCCACCTTCCCCACCCTCCCCACCCTCCCCATAAGCG-3′) oligonucleotides were cartridge purified (Biosearch Technologies). Surface plasmon resonance (SPR) experiments were done using a BIAcore 2000 optical biosensor system (BIAcore). Streptavidin chips were docked into the instrument and preconditioned with three consecutive 1 min injections of 1 mol/L NaCl in 50 mmol/L NaOH at a flow rate of 30 μL/min. Biotinylated G-strand and C-strand oligonucleotides at a concentration of 50 μg/mL were flowed through the streptavidin chip surface for 7 min at a flow rate of 15 μL/min. Protein alone or in the presence of compounds (at the indicated concentrations) was mixed in running buffer [12 mmol/L HEPES, 4 mmol/L Tris-HCl (pH 7.4), 1 mmol/L EDTA, 1.5 mmol/L MgCl2]. The buffer solution was filtered through a 0.22 μm filter and degassed thoroughly before use.
Heparin Affinity Column Chromatography
A column was packed with heparin (Sigma) and equilibrated with buffer B [25 mmol/L Tris-HCl (pH 7.6), 50 mmol/L NaCl, 0.5 mmol/L MgCl2, 1 mmol/L EDTA, 5 mmol/L β-mercaptoethanol, 1 mmol/L DTT, 10% glycerol]. A sample of recombinant NM23-H2 in buffer B was applied to the column, which was then washed twice with 1 mL buffer B. Elution of the protein was carried out using a linear gradient of 0 to 1.0 mol/L NaCl in buffer B. Fractions (0.25 mL) were collected and analyzed by SDS-PAGE. DNA-binding activity and DNA cleavage activity were analyzed using electrophoretic mobility shift assay and the DNA strand breakage assay, respectively.
Isothermal Titration Calorimetry
Nucleotide binding constants were measured using a MicroCal VP isothermal titration calorimetry system following the manufacturer's instructions. A solution containing either 17 μmol/L NM23-H2 protein, 20 mmol/L Tris-HCl (pH 8.0), and 0.5 mmol/L EDTA or 20 mmol/L potassium phosphate buffer (pH 7.1), 75 mmol/L KCl, 5 mmol/L MgCl2, and 1 mmol/L DTT was injected with 0.3 mmol/L nucleotide (GDP or ADP) or 0.1 mmol/L dinucleotide [2′-deoxyadenylyl(3′,5′)-2′-deoxyguanosine d(AG); 7 μL/injection]. All experiments were done at 25°C. Thermodynamic constants were extracted using the Origin software package provided by MicroCal.
Crystallography
The wild-type NM23 and the R88A mutant proteins were buffered with 20 mmol/L Tris-HCl (pH 7.0) and concentrated by ultrafiltration to 17 and 10 mg/mL, respectively. Crystallization was by the hanging-drop method, in which 2 μL of the protein solution was combined with 2 μL of the precipitant solution and equilibrated through vapor diffusion over the precipitant solution. A total of nine crystal forms were obtained, three of which diffracted well and were further pursued. Crystals of the wild-type protein were obtained using a precipitant solution of 19% PEG 1500, 50 mmol/L sodium citrate (pH 6.5), 20 mmol/L MgCl2, and 5 mmol/L DTT. The complex with the dinucleotide d(AG) was obtained by soaking an NM23 crystal for 30 min in crystallization buffer supplemented to contain 10 mmol/L d(AG) and 30% PEG 1500 followed by flash-freezing in liquid nitrogen. Cocrystallization attempts with d(AG) and longer oligonucleotides were unsuccessful. The GDP complex was obtained by mixing a protein solution containing 13.6 mg/mL NM23 and 14 mmol/L GDP (sodium salt) in Tris-HCl (pH 7), with precipitant solution containing 50 mmol/L Tris-HCl (pH 7.5), 20% PEG 1500, 200 mmol/L MgCl2, and 5 mmol/L DTT, and crystallizing at 10°C. Crystals of the R88A mutant were obtained from 20% PEG 1500, 100 mmol/L sodium citrate (pH 5.6), 20 mmol/L MgCl2, and 2 mmol/L DTT. Crystals were flash-frozen after briefly soaking in crystallization buffer containing 35% PEG 1500.
Diffraction data were measured at BioCars Sector 14, Advanced Photon Source, Argonne National Laboratory and reduced with d*TREK (51). The structures were determined by difference Fourier (GDP complex) or molecular replacement using PDB entry 1NUE (52) as the starting model and the program MOLREP in the CCP4 program suite (53). Model building was with the program COOT (54) and model refinement with REFMAC as implemented in CCP4. All three structures contain a full hexamer in the asymmetric unit; noncrystallographic restraints were not applied. Final crystallographic parameters are listed in Table 1. The X-ray figures (shown later in Fig. 9) were prepared with PyMOL (DeLano Scientific).
. | d(AG) complex . | GDP complex . | R88A mutant . |
---|---|---|---|
Data collection statistics | |||
X-ray source | APS 14BM-C | APS 14ID-B | APS 14BM-C |
Detector | Quantum 4 | MARCCD | Quantum 4 |
Wavelength (Å) | 0.9000 | 1.1271 | 0.9000 |
Resolution (Å) | 1.3 | 1.7 | 1.7 |
Space group | P212121 | P212121 | P212121 |
Cell | 52.5, 118.2, 128.9 | 69.8, 86.8, 159.1 | 71.8, 104.6, 118.1 |
Z (hexamers) | 4 | 4 | 4 |
Unique reflections | 195789 | 103224 | 93076 |
Completeness (%)* | 99.3 (98.3) | 96.6 (81.4) | 99.8 (99.4) |
Multiplicity* | 8.0 (6.8) | 6.5 (4.3) | 7.2 (6.9) |
I/σ(I)* | 16.4 (2.3) | 25.4 (3.3) | 16.9 (3.4) |
Rmerge* | 0.086 (0.42) | 0.062 (0.33) | 0.093 (0.43) |
Refinement | |||
Rcryst | 0.18 | 0.20 | 0.17 |
Rfree | 0.22 | 0.24 | 0.22 |
RMSD bonds (Å) | 0.02 | 0.02 | 0.02 |
RMSD angles (°) | 1.9 | 1.7 | 1.7 |
. | d(AG) complex . | GDP complex . | R88A mutant . |
---|---|---|---|
Data collection statistics | |||
X-ray source | APS 14BM-C | APS 14ID-B | APS 14BM-C |
Detector | Quantum 4 | MARCCD | Quantum 4 |
Wavelength (Å) | 0.9000 | 1.1271 | 0.9000 |
Resolution (Å) | 1.3 | 1.7 | 1.7 |
Space group | P212121 | P212121 | P212121 |
Cell | 52.5, 118.2, 128.9 | 69.8, 86.8, 159.1 | 71.8, 104.6, 118.1 |
Z (hexamers) | 4 | 4 | 4 |
Unique reflections | 195789 | 103224 | 93076 |
Completeness (%)* | 99.3 (98.3) | 96.6 (81.4) | 99.8 (99.4) |
Multiplicity* | 8.0 (6.8) | 6.5 (4.3) | 7.2 (6.9) |
I/σ(I)* | 16.4 (2.3) | 25.4 (3.3) | 16.9 (3.4) |
Rmerge* | 0.086 (0.42) | 0.062 (0.33) | 0.093 (0.43) |
Refinement | |||
Rcryst | 0.18 | 0.20 | 0.17 |
Rfree | 0.22 | 0.24 | 0.22 |
RMSD bonds (Å) | 0.02 | 0.02 | 0.02 |
RMSD angles (°) | 1.9 | 1.7 | 1.7 |
*All data (outer shell).
Molecular Modeling
Modeling of NM23-H2 with the single-stranded DNA sequences corresponding to the guanine- and cytosine-rich strands of the c-myc NHE III1 was based on the known crystal structure of NM23-H2 bound to GDP (1NUE; ref. 52). The single-stranded DNA sequences were built using the Biopolymer module within the Insight II molecular modeling software. Electrostatic potential maps for NM23-H2 were calculated using the Delphi program and displayed using GRASP. The model for the complex between NM23-H2 and the single-stranded DNA was built using manual docking and the Discover_3 minimization program within Insight II. The complex was soaked in a TIP3P water layer of 10 Å thickness and then subjected to molecular dynamics simulations with equilibration for 40 ps and simulation for 100 ps. Trajectories were collected every 0.1 ps. Low potential energy frames were minimized using 3,000 steps of Discover_3 minimization. Minimized structures of the complex were used to analyze the interactions.
Results
Knockdown of Endogenous NM23-H2 Expression by siRNA Results in Repression of c-myc Transcription
Because several studies have previously shown that overexpression of NM23-H2 augments the levels of c-myc transcription, we decided to expand on these studies and examine whether depletion of endogenous NM23-H2 decreases such transcription reactions. To answer this question, we took advantage of RNA interference technologies, which have been proven to be an effective tool for identifying transcription factor target genes (55). First, to verify the optimal silencing conditions, we transiently transfected increasing concentrations of an NM23-H2 siRNA pool into HeLa cells and determined the minimal amount needed to achieve maximal reduction of NM23-H2. At all concentrations, >80% knockdown of NM23-H2 expression was observed, whereas no effect was detected on the expression of NM23-H1, a homologue of NM23-H2 (data not shown). Therefore, the effects of siRNA-mediated knockdown of NM23-H2 on c-myc expression were examined in HeLa cells following 24 and 48 h of treatment. Within 24 h of transfection of the NM23-H2 siRNA, the mRNA (Fig. 2A) and protein (Fig. 2B) levels of NM23-H2 were reduced to nearly undetectable levels, which were maintained for the duration of the experiment. More interestingly, knockdown of NM23-H2 resulted in a reduction of c-myc mRNA expression levels of ∼40% after 24 h (P < 0.05), which increased to ∼50% after 48 h (P < 0.001; Fig. 2A). In addition, there was no effect on either NM23-H2 or c-myc expression in the untreated cells or the cells treated with a control nontargeting siRNA pool (Fig. 2A). These results add to previous findings and strengthen the evidence for the involvement of NM23-H2 in the transcriptional activation of the c-myc gene.
Relationship between NM23-H2 and c-myc Expression in Colon Tumor Cell Lines
It has been shown that NM23-H2 mRNA levels are among the most abundant in colorectal cancers (56). Likewise, the c-myc gene is overexpressed in ∼70% of colorectal cancers. Therefore, in a straightforward attempt to link NM23-H2 to the regulation of the c-myc gene, we analyzed the expression levels of both genes in five colorectal cancer cell lines (SW480, SW620, CaCo-2, HT29, and HCT116). Total RNA from normal colon tissue was used as a control. As anticipated, there was a clear positive correlation (R2 = 0.856) between NM23-H2 and c-myc expression in the small number of cell lines analyzed (Fig. 3). Such a correlation is in line with the involvement of NM23-H2 in the activation of c-myc transcription in colorectal cancer. However, despite several efforts, the molecular basis for this relationship has remained elusive.
Nuclease Activity Does Not Associate with NM23-H2 during Size-Exclusion Chromatography or Heparin Affinity Chromatography
In general, the ambiguity surrounding the possible DNA cleaving activity of NM23-H2 may perhaps be attributable to the differing sources and the purity of the recombinant protein. In this study, recombinant NM23-H2 was expressed and purified as described previously by Postel et al. (17). Although we did observe DNA cleavage products in the presence of this recombinant NM23-H2 protein (Fig. 4, right), we were unable to detect any gel retardation signal corresponding to a protein-DNA complex in native polyacrylamide gel analysis (data not shown). These results are inconsistent with previous studies, which have shown the presence of three separate species, a protein-DNA complex, free DNA, and cleaved DNA. When the recombinant NM23-H2 was subjected to further purification by size-exclusion chromatography, we reproducibly found that the protein lost its nuclease activity (Fig. 4, left). The protein eluted at a volume consistent with a molecular mass of ∼100 kDa, the expected size for the hexameric NM23-H2 protein, making it unlikely that the contaminating activity remained associated with the protein. Despite the fact that equal amounts of the protein were loaded on the left- and right-hand gels (Fig. 4), we cannot ignore the possibility that the absence of the nuclease activity in the size-excluded preparation is a consequence of inactivation of the nuclease in this preparation relative to the conventionally purified NM23-H2.
In an attempt to further determine whether the apparent nuclease activity was associated with NM23-H2, the recombinant protein that displayed the DNA cleavage activity was applied to a heparin affinity column and eluted with a linear gradient of 0 to 1.0 mol/L NaCl. As shown in Fig. 5A, the major protein peak was eluted in fractions 8 to 12 at ∼0.3 mol/L NaCl. SDS-PAGE of the fractions across this peak revealed a single band corresponding to a polypeptide with a molecular mass of ∼17 kDa (Fig. 5B), which is in agreement with the molecular mass determined from the NM23-H2 amino acid sequence. The DNA-binding activity and DNA cleavage activity of the eluted fractions were analyzed using a gel mobility shift assay and a DNA cleavage assay, respectively. Interestingly, the Pu47 DNA-binding activity coeluted with the NM23-H2 protein peak, whereas the ds47 DNA cleavage activity was associated with the 1.0 mol/L NaCl fractions (Fig. 5A). These results suggest that the DNA cleavage activity might be a consequence of either a contaminating nuclease or an NM23-H2-interacting protein that becomes dissociated on further purification. However, the identity of this nuclease remains elusive in view of the fact that proteomic analysis of the fractions containing the nuclease activity was unable to identify any known proteins with the ability to cause DNA strand breaks. Nevertheless, these observations clearly show that the nuclease activity detected was not associated with 17 kDa NM23-H2.
DNA-Binding Activity of NM23-H2
Having shown that NM23-H2 does not display inherent nuclease activity, the next step was to examine its ability to interact with DNA. However, due to mixed reports pertaining to the DNA-binding specificity of NM23-H2 for the c-myc NHE III1 region, we found it necessary to reevaluate some of the previously described studies. Initially, to assess directly the relative affinity of NM23-H2 for single-stranded versus duplex portions of the c-myc NHE III1 region, we carried out gel mobility shift assays under essentially the same conditions as Hildebrandt et al. (47), with the exception of the addition of a nonspecific DNA competitor and the use of slightly longer DNA substrates (47 bp). As shown in the representative gels in Fig. 6A, protein-DNA complexes were formed between NM23-H2 and the single-stranded oligonucleotides corresponding to the purine-rich (Pu47) and the pyrimidine-rich (Py47) strands of the c-myc NHE III1 region as evidenced by the occurrence of gel retardation bands; however, no such complex was observed with a duplex oligonucleotide of the same sequence (ds47). We also found that NM23-H2 appeared to favor the Pu47 sequence over the complementary Py47 sequence; however, the noticeable smear of radioactivity in the gel shift employing the Py47 sequence could represent dissociation of the bound complex during electrophoresis, which is indicative of the formation of an unstable complex with Py47. NM23-H2 also bound to a random 47-mer sequence but only at about a 10-fold higher concentration than to the Pu47.
To further examine the stability of the NM23-H2-Pu47 complex, a competition experiment was carried out. Briefly, protein-DNA complexes were equilibrated for ∼30 min and then incubated an additional 30 min with unlabeled competitor oligonucleotides corresponding to the Pu47 sequence or an unrelated sequence. Figure 7B shows the results of an experiment in which the complex between Pu47 and NM23-H2 was exposed to only a 5-fold excess of unlabeled Pu47 oligonucleotide. Disruption of the complex was found under these conditions but not by the random oligonucleotide sequence up to 50-fold excess (Fig. 6B). In addition to the instability of the Py47-NM23-H2 complex, these results also suggest that NM23-H2 might only weakly bind to the guanine-rich strand of the c-myc NHE III1 given that the preformed protein-DNA complex is easily disrupted by the presence of a minor amount of specific competitor. This observation is in agreement with the previously suggested role of NM23-H2 as an accessory protein, which temporarily binds to and prepares the DNA for interactions with other specific transcription factors.
To quantitatively determine the specificity of NM23-H2 to the purine- and pyrimidine-rich stands of the NHE III1 region of the c-myc promoter, SPR was used. Two biotinylated 44-mers, which correspond to the guanine- and cytosine-rich strands of the NHE III1 region of the c-myc promoter, were immobilized on a streptavidin-coated sensor chip. Increasing concentrations of purified NM23-H2 (1–1,000 nmol/L) were injected and the interaction of the protein with the DNA was measured. Binding between NM23-H2 and the guanine-rich strand (Fig. 6C, left) and cytosine-rich strand (Fig. 6C, right) was observed at concentrations as low as 50 nmol/L as illustrated by the increase in response that remained after the completion of the initial protein injection. The intensity of the observed response increased with increasing concentrations of protein in a dose-dependent manner. By fitting the sensorgrams using a 1:1 Langmuir model, the KD for the guanine- and cytosine-rich strands were calculated as 16.23 and 17.1 nmol/L, respectively. These data strongly suggest that NM23-H2 binds equally well to the guanine- and cytosine-rich strands. The difference between the results of the gel mobility shift assay and the SPR results is most likely related to the instability of the NM23-H2-Py47 complex versus the NM23-H2-Pu47 complex (see Fig. 6A, middle). SPR allows for analysis of complex association and dissociation in “real-time,” whereas gel shift assays are a nonequilibrium measurement that reflects the maintenance of an intact complex in a complicated environment. Thus, less stable complexes (the NM23-H2-Py47) may partially dissociate during gel loading or running, in which the affinity would be underestimated.
Stabilization of G-Quadruplex and i-Motif Structures Inhibits Binding of NM23-H2 to the c-myc NHE III1
Again using SPR, the effect of the G-quadruplex/i-motif-interactive agent TMPyP4 on NM23-H2 DNA binding was studied (11, 57). TMPyP2, a positional isomer of TMPyP4 unable to interact well with the G-quadruplex or i-motif, was used as a control (11, 57).
Before the protein was coinjected with the G-quadruplex-interactive agents, the compounds were injected alone to determine if the compounds are able to interact with the DNA. The interaction of TMPyP4 with both guanine- and cytosine-rich 44-mers was observed weakly at 0.1 μmol/L and strongly at 1 and 10 μmol/L [KD, 0.49 μmol/L (G-strand) and 0.45 μmol/L (C-strand)].7
7Steven S. Carey, Song Zuohe, Emmanuelle J. Mevillet, and Laurence H. Hurley, unpublished results.
A Mutation within the Active Site of NM23-H2 Results in a Decrease in Single-Stranded DNA Binding
Prior studies have shown that various nucleotide substrates can inhibit the DNA-binding activity of NM23-H2 (46). Inversely, single-stranded oligonucleotides have also been shown to be competitive inhibitors of the NDP kinase activity of NM23-H2 (58). Therefore, it has been suggested that one of the DNA bases within a given oligonucleotide may have a similar binding mode to that of a mononucleotide substrate (59). To determine whether the DNA and the nucleotide substrates involved in the catalytic reaction of NM23-H2 share a common binding site, we used an active site mutant protein that contained an arginine-to-alanine mutation at position 88 (R88A) within the primary amino acid sequence of NM23-H2. The rationale for investigating this mutant was based on previous studies that have shown that the R88A mutation completely abolishes the NDP kinase activity, DNA-binding activity, and nuclease activity of NM23-H2 (44, 50). The crystal structure of the R88A mutant described below confirms that the loss in activity is due to the loss of a critical contact between Arg88 and the phosphate group of the nucleotide. Using isothermal titration calorimetry, we also have shown that the R88A mutant was incapable of binding GDP, whereas the wild-type NM23-H2 convincingly bound GDP, giving a dissociation constant of 5.2 ± 0.8 μmol/L (data not shown), a value nearly identical to that previously reported for NM23-H1 using a fluorescence anisotropy measurement (60). Therefore, because of the inability of the R88A mutant to interact with GDP, together with the suggestion that the mononucleotides and polynucleotides within NM23-H2 have a common binding site, we reasoned that the R88A mutant would also be defective in DNA-binding activity (50). The first clue to the low DNA-binding affinity of the R88A mutant was seen from its low retention in a heparin affinity column during the initial purification of this protein. The structure and negative charge of heparin enable it to mimic DNA in its overall binding properties; thus, if the R88A mutant and the wild-type NM23-H2 have similar DNA-binding affinity, then both should have similar retention times on a heparin column. However, whereas the wild-type protein was eluted at ∼0.3 mol/L NaCl, the R88A mutant protein was in one of the first fractions eluted from the column, which is indicative of its inability to bind to the negatively charged heparin (Fig. 8A). The low DNA-binding affinity associated with the R88A mutant was directly confirmed using a gel mobility shift assay (Fig. 8B) and SPR measurements (Fig. 8C). These results further confirm that the six corresponding active sites for the phosphotransferase reaction within NM23-H2 also serve as binding sites for single-stranded DNA.
To further examine nucleotide binding to NM23-H2 in general, and the role of Arg88 in particular, we determined three crystal structures of the protein to resolutions between 1.3 and 1.7 Å (Table 1). First, we re-determined the structure of the complex with GDP but to a higher resolution than reported previously (52). The new structure was in good agreement with the previous structure, displaying Arg88 in close association with the β-phosphate of GDP and rotation of the helix-turn-helix domain (residues 42–71) by 13° toward the binding pocket such that the GDP purine ring becomes sandwiched between Phe60 and Val112 (Fig. 9A), which move toward one another by ∼2.3 Å compared with the unliganded subunits (described below). Magnesium coordination to oxygen of the α-phosphate occurs in all six active sites and displays octahedral coordination with five water molecules and an α-phosphate oxygen (Fig. 9A). We also found that Cys145 appears to be partially modified in all six subunits, apparently displaying a mixture of reduced cysteine, oxidized cysteine disulfide linked to DTT, and possibly oxidized cysteine disulfide linked to the neighboring subunit. Subunits A–F, B–D, and C–E are aligned such that Cys145-Cys145′ disulfide bonds could occur, suggesting that the protein may be under redox regulation.
We then examined the wild-type protein after soaking in the dinucleotide d(AG), which has been reported to bind tightly to NM23-H2, based on a competition assay (58). Weak binding of the dinucleotide was found in the structure, but the density for the dinucleotide was poorly defined, with only approximate positioning indicated. Interestingly, evidence for both nucleotides was found in two active sites (monomers C and E), and two active sites displayed only a single nucleotide, modeled as guanine (monomers B and F). The remaining two active sites did not appear to have any bound nucleotide. The mobile helix-turn-helix tracked the active site contents. In those subunits containing a single nucleotide, the mobile domain moved inward, similarly to that found in the GDP complex. In those subunits containing the dinucleotide, the domain moved outward, opening up the binding pocket and allowing the two stacked purine rings to fit between Phe60 and Val112 (Fig. 9B and C). Possibly a similar motion occurs on interaction with single-stranded DNA. Because electron density for the dinucleotide appeared weak in the crystal, we directly examined binding by isothermal titration calorimetry. Dinucleotide binding was not detectable by this method, consistent with a weak (high micromolar to millimolar) interaction.
We also determined the R88A crystal structure in the absence (data not shown) and presence (Table 1) of d(AG). The R88A mutant is nearly identical to the wild-type protein, with Ala88 well resolved in both structures. The overall RMSD in carbon α positions was 0.87 Å when fitting to the GDP-containing hexamer and 0.95 Å when fitting to the dinucleotide soaked hexamer; pairwise RMSD between individual subunits ranged between 0.4 and 1.0 Å depending on the relative positions of the mobile helix-turn-helix subdomains. Water molecules fill the region previously occupied by the arginine side chain. No indication of nucleotide binding was found in the d(AG)-soaked crystal.
Molecular Modeling of Single-Stranded DNA Bound to NM23-H2
Two different models have been proposed for the binding of DNA to NM23-H2. The first describes the binding of a duplex DNA molecule at the equatorial region of NM23-H2, which was derived from mutational studies that identified three essential residues for DNA binding located on the surface of the hexamer at the 2-fold axis (Arg34, Asn69, and Lys135; ref. 61). The second describes the binding of an 11-mer single-stranded oligonucleotide with the 4 and the A helices (also known as H4 and HA helices; ref. 58), which was based on in vitro cross-linking experiments as well as clues taken from NM23-H2 crystal structures (52, 62, 63). In addition, this model suggested an active site-directed binding mode for the single-stranded DNA wherein a guanine at the 3′-end of the oligonucleotide binds to the active site (58) in a manner similar to that of the GDP in complex with NM23-H2 (52). Furthermore, it has been shown that this oligonucleotide containing a 3′-end guanine results in a 6-fold increase in binding affinity for NM23-H2 in comparison with an oligonucleotide without a guanine at the 3′-end (58). Indeed, several lines of evidence have shown that the catalytic efficiency of NM23-H2 is dependent on the nucleotide of the substrate (64). For example, at least two independent studies have shown a relative specificity with an order of G > T = A > C (46, 58, 64), with an approximate difference between G and C of 50-fold (46). Supplementary support for this specificity has also been derived from the crystal structures of NDP kinases bound to different nucleotide substrates, wherein all the nucleobases have been shown to possess similar hydrophobic interactions with specific Phe and Val residues within the active site, whereas only guanine nucleobases form a specific hydrogen bond between their N2 amino group and the δ-carboxylate of a Glu residue (52, 65, 66). The latter model is in agreement with our observed preferential stability of NM23-H2 bound to the single-stranded guanine-rich strand of DNA and our studies with the active site R88A mutant protein. Thus, on the basis of available in-house binding data, we have expanded the latter model to describe the interaction of a 27-mer sequence corresponding to the guanine-rich strand of the c-myc NHE III1 with NM23-H2.
As revealed by the calculated electrostatic potential maps for NM23-H2 shown in Fig. 10A, positive ion channels exist that link the individual active sites along the surface of each trimer. We reasoned that these paths of electropositive potential could accommodate the anionic backbone of DNA and thus allow a single-stranded DNA molecule to interact with all three active sites within a given trimer of NM23-H2. In addition, we envisage that the binding of a single base to each active site could be used to anchor the single-stranded DNA to the trimeric surface of the protein. On the basis of the preferred binding of guanine to the active sites of NM23-H2 and the optimal number of bases that could be accommodated by the electropositive channels, we have selected the three guanine residues positioned at the 3′-ends of the runs of four guanines within the guanine-rich strand of the c-myc NHE III1 region, which are equally spaced at intervals of eight bases, to interact with three active sites within a given trimer of NM23-H2 (Fig. 10D).
Initially, the single-stranded 27-mer sequence corresponding to the c-myc NHE III1 was docked in a way analogous to the model proposed by Raveh et al. (58), in which the 5′-end of the DNA molecule entered NM23-H2 by interacting with the equatorial region or trimer interface. The DNA molecule was then extended between the 4 and A helices to allow the first guanine residue to interact with the active site of a given monomer unit. Following binding to the first active site, the DNA molecule was further extended to the second and third active sites along the surface of the trimer unit by way of the electropositive channels. Finally, the 3′-end of the single-stranded DNA molecule exited the protein through interactions with the residues of the α2 helix. This resulted in an increase in gap length of 15.7 Å between the 3′-end and the 5′-end of the oligomer. This measurement represents the difference in gap length between the nearest adenines flanking the G-quadruplex at the 3′- and 5′-ends (21 Å) and the corresponding gap length after the quadruplex is unfolded on the face of the trimeric NM23-H2 (36.7 Å). The resulting model of the single-stranded guanine-rich strand docked onto a single trimer unit of NM23-H2 is shown in Fig. 10B. This model reveals that the intervening bases stack readily with the deoxyribose backbone in the positive ion channels, with a favorable eight-base interval between the guanine bases, which were docked in the active sites (Fig. 10B and C). The significance of this unfolding of the G-quadruplex in transcriptional activation is described next.
Discussion
Of all the transcriptional factors involved in c-myc activation, NM23-H2 has been the most controversial. In this study, we set out to alleviate some of this controversy by evaluating and expanding on previous studies related to the biochemical mechanisms of how NM23-H2 activates c-myc transcription through the NHE III1 regulatory element. The first step was to confirm the regulatory role of NM23-H2, because some reports have shown that NM23-H2 is capable of activating c-myc transcription via the NHE III1 region, whereas others have reported negative results. Using an RNA interference approach, we discovered that knockdown of endogenous NM23-H2 expression results in ∼50% reduction in c-myc transcription in HeLa cells (Fig. 2). The incomplete reduction in c-myc expression substantiates the previously described function of NM23-H2 as a possible accessory protein as opposed to a classic transcription factor. In addition, it is noteworthy that this level of reduction in c-myc expression has been shown to be sufficient to induce apoptosis and regression of tumors (67, 68).
It is well demonstrated that c-myc is overexpressed in the majority of colorectal and other malignancies (1, 69). However, until recently, it has been less certain how NM23-H2 is involved in colorectal cancer. More recently, there have been several studies (using tumors taken directly from patients) showing increased nm23 expression as a predictor of metastasis and of poorer survival (70–73). Specifically, a study measuring protein levels in colon tumors and colon mucosa taken directly from patients using two-dimensional gel electrophoresis and SEIDI mass spectroscopy showed that NM23-H2 was differentially expressed in colon cancer tissue versus normal colon mucosa (from the same patient; ref. 73). Correspondingly, we have also shown that NM23-H2 gene expression is elevated in several human colon cancer cell lines and that there is a direct correlation between NM23-H2 and c-myc gene expression within the same cell lines (Fig. 3). Taken together, these studies corroborate previous findings and add to the growing evidence that NM23-H2 plays a role in the activation of c-myc transcription.
Most of the controversy surrounding NM23-H2 is attributable to the variable reports concerning NM23-H2-DNA substrate specificity and strand cleavage activity. We initially envisioned that the unprecedented mechanism involving reversible covalent attachment of NM23-H2 to DNA at six specific sites within the c-myc NHE III1 could result in unfolding of the G-quadruplex and i-motif structures for subsequent capture by DNA-binding proteins involved in transcriptional activation. Unexpectedly, we discovered that this previously detected DNA cleavage activity was unassociated with NM23-H2 and could be related to a minor contaminant associated with the recombinant protein or to an accessory protein that is lost on more extensive purification or on mutation of NM23-H2 (44). Specifically, the results of this investigation showed that the peak of DNA-binding activity and that of DNA cleavage activity were not coincident during heparin affinity chromatography (Fig. 5), which provides evidence against the former proposition that NM23-H2 possessed an inherent nuclease activity. If the protein responsible for the nuclease activity is tightly associated with NM23-H2, it may function under certain physiologic conditions to cause strand breaks in DNA in this region. Selective cleavage of the NHE III1 could be an important event in apoptosis or even in transcriptional activation if the cleavage was reversible.
We next explored the DNA-binding activity of NM23-H2 as a potential regulatory mechanism. Using the additionally purified NM23-H2, we observed that NM23-H2 bound equally well to both single-stranded guanine- and cytosine-rich strands of the c-myc NHE III1 and, to a lesser extent, to a random single-stranded oligomer, but it had no affinity to the duplex form (Fig. 6A). In addition, we found that NM23-H2 binding to the guanine-rich strand could be easily reversed based on the observation that the addition of unlabeled oligomer to the preformed complex reverses the binding, whereas a random sequence has no effect (Fig. 6B). These findings suggest that NM23-H2 potentially functions in transcription through the recognition of DNA structural features within the target sequence (e.g., G-quadruplex and i-motif structures) and in subsequent remodeling and/or trapping-out of the c-myc NHE III1 region in a single-stranded or duplex form, which would allow single-strand-specific or duplex-specific transcription factors to recognize their DNA-binding sites more efficiently. Importantly, we found that TMPyP4, and to a much lesser extent TMPyP2, reduced binding of NM23-H2 to the guanine- and cytosine-rich strands of the NHE III1 (Fig. 7A and B). Taken together, these results suggest that stabilization of the G-quadruplex and i-motif structures within the NHE III1 region hinder the recognition and remodeling functions of NM23-H2 in relation to the guanine- and cytosine-rich strands.
Prior to these studies, it was clear that the NHE III1 regulatory element in the promoter region of the c-myc gene was not a static matrix but was in equilibrium between at least three different DNA structural populations: two that cause activation and one that results in repression of c-myc transcription (74). One of the transcriptionally active complexes binds to the single-stranded binding proteins CNBP (purine-rich strand) and hnRNP K (pyrimidine-rich strand), whereas the second most likely binds to duplex binding transcriptional factors, such as Sp1 (74). The third and final transcriptionally silent form is an open, repressed form involving the formation of G-quadruplex structures, and possibly equivalent i-motif structures on the complementary strand within the NHE III1, leading to a quite distinct appearance from that of the active, attenuated, or vacant promoter forms described earlier (11). At present, one major question still remains to be addressed relating to this model: how does the NHE III1 region transition between the silenced form and the transcriptionally active forms? On the basis of the data presented herein, we have provided a potential answer to this question by inserting NM23-H2 into our working model, although other mechanisms involving topoisomerase I should also be considered. In either case, these enzymes assume the role of an accessory or transacting factor, functioning in the removal of repressive DNA secondary structures and subsequently preparing the DNA within the c-myc NHE III1 region for activating factors of c-myc transcription. Because the silencer element in the c-myc promoter is extremely stable (the G-quadruplex having a melting temperature of 85°C) and is not the substrate for any known transcription factors, it is not surprising that there might be specific proteins that are required for transcriptional activation of c-myc, which must reorganize and resolve these structures.
The cartoon in Fig. 11 illustrates the proposed transition from the nonnucleosomal form (A) of the NHE III1 region in the c-myc promoter to the silenced form (B), its subsequent remodeling by NM23-H2 to the unfolded form (C), and its ultimate capture by specific single-stranded DNA-binding proteins to give a transcriptionally active form (D). Recent in vitro footprinting studies of a supercoiled plasmid containing the c-myc promoter region have revealed that the transition of the duplex DNA into secondary structures is accompanied by the local unwinding of the regions neighboring the NHE III1 region.8
8Sun D, Hurley LH. Formation of secondary structures in the c-myc promoter induced by negative superhelicity and the implications for transcriptional control. J Med Chem 2009; Doi: 10.1021=jm900055s.
In conclusion, we believe that we have provided a working model of how NM23-H2 can activate c-myc transcription by recognition of the structural and functional features of the c-myc NHE III1, resulting in subsequent remodeling and trapping of the c-myc silencer element in an unfolded form. We have also established a new proof-of-principle that the c-myc-lowering effect can be achieved through drug stabilization of the G-quadruplex and consequent interference with the interaction of NM23-H2 with the silencer element in the c-myc promoter. However, further exploration into the interactions between the c-myc NHE III1 and NM23-H2 is of interest for the development of novel pharmacologic agents.
Disclosure of Potential Conflicts of Interest
No potential conflicts of interest were disclosed.
Acknowledgments
We thank Dr. Edith Postel for generous supply of materials, including protein for preliminary studies and clones and purification protocols for wild-type and R88A mutant NM23-H2; Jacquie Brailey for help with protein purification; and David Bishop for preparing, proofreading, and editing the final version of the article and figures. Coordinates and structure factors have been deposited in the Protein Data Bank (PDB entries 3BBB, 3BBF, and 3BBC).
References
Competing Interests
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.