Tumor-derived proteins may occur in the circulation as a result of secretion, shedding from the cell surface, or cell turnover. We have applied an in-depth comprehensive proteomic strategy to plasma from intestinal tumor–bearing Apc mutant mice to identify proteins associated with tumor development. We used quantitative tandem mass spectrometry of fractionated mouse plasma to identify differentially expressed proteins in plasma from intestinal tumor–bearing Apc mutant mice relative to matched controls. Up-regulated proteins were assessed for the expression of corresponding genes in tumor tissue. A subset of proteins implicated in colorectal cancer were selected for further analysis at the tissue level using antibody microarrays, Western blotting, tumor immunohistochemistry, and novel fluorescent imaging. We identified 51 proteins that were elevated in plasma with concordant up-regulation at the RNA level in tumor tissue. The list included multiple proteins involved in colon cancer pathogenesis: cathepsin B and cathepsin D, cullin 1, Parkinson disease 7, muscle pyruvate kinase, and Ran. Of these, Parkinson disease 7, muscle pyruvate kinase, and Ran were also found to be up-regulated in human colon adenoma samples. We have identified proteins with direct relevance to colorectal carcinogenesis that are present both in plasma and in tumor tissue in intestinal tumor–bearing mice. Our results show that integrated analysis of the plasma proteome and tumor transcriptome of genetically engineered mouse models is a powerful approach for the identification of tumor-related plasma proteins.

Although our understanding of colorectal cancer has substantially improved, few circulating biomarkers have emerged that have diagnostic utility. The current gold standard is screening by visual endoscopy. Newer modalities, such as computed tomographic colography or fecal DNA testing, have not achieved widespread usage (1, 2). There remains a substantial need for noninvasive diagnostic methods.

An alternative approach to screen for colorectal cancer is through blood-based testing. Proteins detectable in serum and plasma are the basis of commonly relied upon tests to detect prostate, ovarian, and pancreatic cancer through the measurement of prostate-specific antigen, CA125, and CA19.9, respectively (35). Current colorectal cancer circulating markers, exemplified by carcinoembryonic antigen, have poor sensitivity and specificity that preclude their usage as a population-wide screening tool (6). The development of panels of protein markers may provide the necessary sensitivity and specificity for blood-based testing of colorectal cancer.

Current proteomics technologies allow systematic interrogation of complex proteomes and identification of differentially expressed proteins, whether in cells, tissues, or body fluids (7). However, biomarker discovery in humans is challenged by extensive heterogeneity at the disease and patient sample procurement levels. Prior to embarking on a large-scale effort to identify colorectal cancer–specific biomarkers from human patients, we examined the feasibility of the approach by using a genetically modified mouse model. Genetically engineered mouse models of human cancer can be interrogated at defined stages of tumor development, under homogenized breeding and environmental conditions, and with standardized blood sampling, thereby reducing biological and nonbiological heterogeneity and permitting the application of proteomics to the identification of cancer markers found in the circulation.

Colorectal cancer, whether it is sporadic or the result of cancer predisposition syndromes, is associated with a mutation in the Apc gene (8). Several mouse cell lines, each carrying a different mutation in the Apc gene, have been described (9). Most of these genetically modified mice show an intestinal tumor predisposition phenotype and develop few to many adenomas and adenocarcinomas. In this study, we have investigated a mouse model of intestinal tumorigenesis, Apc Δ580, to determine the spectrum of protein changes that occur in mouse plasma with tumor development and the extent to which these observed changes reflect the tumor tissue of origin versus inflammation and other nonspecific disease processes (10).

Animal husbandry

Heterozygous Apc Δ580 mice on the C57bl/6 (B6) background were mated with wild-type B6 mice (10). The resulting offspring were screened by PCR of tail DNA using standard methods. Heterozygous Apc Δ580 mice were used for the studies. Wild-type age-matched and sex-matched littermates were used as controls.

Immunodepletion of abundant proteins and acrylamide labeling

Plasma pools (1 mL) from 10 tumor-bearing Apc Δ580 mice (10-12 weeks) and from 10 non–tumor-bearing wild-type littermates were individually immunodepleted of the top three most abundant proteins (albumin, IgG, and transferrin) using an Ms-3 column (4.6 × 250 mm; Agilent). Briefly, columns were equilibrated with buffer A at (0.5 mL/min) for 13 min and aliquots of 75 μL of the pooled sera were injected after filtration through a 0.22-μm syringe filter. The flowthrough fractions were collected for 10 min using buffer A at a flow rate of 0.5 mL/min, combined and stored at −80°C until use. The column-bound material was recovered by elution for 8 min with buffer B at 1 mL/min. Subsequently, immunodepleted samples were concentrated using Centricon YM-3 devices (Millipore) and re-diluted in 8 mol/L of urea, 30 mmol/L of Tris (pH 8.5), and 0.5% octyl-β-d-glucopyranoside (Roche). Samples were reduced with DTT in 50 μL of 2 mol/L Tris-HCl (pH 8.5; 0.66 mg DTT/mg protein), and isotopic labeling of intact proteins in cysteine residues was done with acrylamide. Controls received the light acrylamide isotope (D0-acrylamide, >99.5% purity; Fluka), whereas cases received the heavy 2,3,3′-D3-acrylamide isotope (D3-acrylamide, >98% purity; Cambridge Isotope Laboratories). Alkylation with acrylamide was done for 1 h at room temperature by the addition of 7.1 mg of D0-acrylamide or 7.4 mg of D3-acrylamide per milligram of protein, diluted in a small volume of 2 mol/L of Tris-HCl (pH 8.5; ref. 11).

Two-dimensional protein fractionation by liquid chromatography

After isotopic labeling, the case and control pools were mixed in a 1:1 ratio, diluted to 10 mL with 20 mmol/L of Tris in 6% isopropanol, and 4 mol/L of urea (pH 8.5). The combined sample pools were then separated using a two-dimensional protein fractionation strategy (anion exchange followed by reverse phase chromatography). For the first dimension, samples were injected onto a Mono-Q 10/100 column (Amersham Biosciences). The buffer system consisted of solvent A: 20 mmol/L of Tris in 6% isopropanol and 4 mol/L of urea (pH 8.5); and solvent B: 20 mmol/L of Tris in 6% isopropanol, 4 mol/L of urea, and 1 mol/L of NaCl (pH 8.5). The separation was done at 4.0 mL/min in a gradient of 0% to 35% solvent B for 44 min; 35% to 50% solvent B for 3 min; 50% to 100% solvent B for 5 min, and 100% solvent B for an additional 5 min. A total of nine pools were collected. For the second dimension, each of the anion exchange fractions were further fractionated using a Poros R2 column (4.6 × 50 mm; Applied Biosystems) with TFA/acetonitrile as buffer system (solvent A: 95% H2O, 5% acetonitrile + 0.1% TFA; and solvent B: 90% acetonitrile, 10% H2O + 0.1% TFA) at 2.7 mL/min. The gradient used was 5% solvent A until absorbance reached baseline (desalting step) and then 5% to 50% solvent B for 18 min; 50% to 80% solvent B for 7 min, and 80% to 95% solvent B for 2 min. Sixty fractions of 900 μL were collected during the run, corresponding to a total of 540 fractions. Fractions were pooled to normalize masses, resulting in 117 discrete fractions.

Mass spectrometry

The final dimension of separation was done online with a 75-μm × 20 cm internal diameter column packed with C18 media running at a 225 nL/min flow rate provided by a Surveyor MS pump using a gradient of 5% to 60% water 0.1% formic acid, acetonitrile 0.1% formic acid over the course of 100 min with a total run time of 150 min. The samples were run on a high-resolution LTQ-FT-MS (ThermoFisher Scientific) in a top nine configuration (one MS 100K resolution full scan and nine MS/MS scans). Dynamic exclusion was set to 1 with a limit of 180 s and early expiration set to six full scans. To ascertain column performance and to observe any potential carryover that might have occurred, a 2.5-h standard mixture of five Angio mix peptides (Michrom BioResources) was run after every 12 experimental samples. The acquired mass spectral data was automatically processed by the Computational Proteomics Analysis System (12). Searches were done considering cysteine alkylation with the light form of acrylamide as a fixed modification and heavy form of acrylamide (+3.01884) as a variable modification. For the identification of proteins with a false discovery rate of <5%, liquid chromatography MS/MS spectra were subjected to tryptic searches against a database consisting of forward and reverse mouse IPI databases released in January 2006 (v.3.12) using X!Comet (13). The database search results were then analyzed by PeptideProphet (14) and ProteinProphet (15), and those proteins that resulted in a Protein Prophet error rate of ≤5% were retained.

Quantitative analysis of acrylamide isotopes

Quantitative information was extracted from acrylamide-labeled peptides using the q3 script to obtain the relative quantification for each pair of peptides identified by MS/MS that contains cysteine residues (11). All peptide ratios for a specific protein present in a particular fraction were normalized and log-averaged to obtain the local relative protein ratio.

Bioinformatics analysis

To assign biological significance to differentially labeled proteins, we accessed the Expert Protein Analysis System proteomics server to query Swiss-Prot and TrEMBL for ontological information regarding protein function, domain-family taxa, tissue specificity, posttranslational modifications, disease associations, etc. (16). We also interrogated biological text analysis databases such as MedMiner and XplorMed to determine whether these candidates were known plasma proteins, secreted or membrane proteins, or were associated with the intestinal epithelium, cancer, or colorectal cancer (17, 18). Lastly, we used the Database for Annotation, Visualization, and Integrated Discovery 2.1, FatiGO, and Ingenuity (19, 20). Proteins that were known to be associated with colorectal cancer, cancer biology, or the intestinal epithelium were prioritized for further validation. Thus, integration of relative protein abundance, gene expression data, and functional and ontologic information provided a basis for further validation.

Tissue immunohistochemistry

Paraffin-embedded tissue sections (5 μm) were deparafinized in xylene, followed by alcohol rehydration. After quenching endogenous peroxidases in 3% H2O2 in methanol, the slides were rinsed in distilled water, and an antigen retrieval step was carried out in a microwave oven for a total of 10 min in preheated citrate buffer (pH 6.0). The slides were then incubated with primary antibodies at room temperature overnight. The following primary antibodies were used: cullin-1 (Abcam), cathepsin B and cathepsin D (Santa Cruz Biotechnology), Ran (Santa Cruz Biotechnology), PKM2 (Cell Signaling Technology), and DJ-1 (Covance). After washing in PBS, slides were incubated with either anti-rabbit (Jackson Immunoresearch Laboratories) or anti-goat secondary antibody for 1 h at room temperature. The Vectastain Elite ABC kit (Vector Laboratories) was used for detection as per the instructions of the manufacturer. The slides were stained with 3,3′-diaminobenzidine and counterstained with Mayer's hematoxylin.

Fluorescence imaging

A protease activatable probe, Prosense 680 (VisEn Medical), was injected i.v. at a dose of 2 nmol/mouse 24 h prior to ex vivo imaging. The imaging probe is a long circulating graft copolymer that markedly increases its fluorescence intensity after selective enzymatic cleavage of lysine-lysine bonds, primarily by cathepsin B in vivo. Prior to imaging, the mice were sacrificed. The small bowel was removed, splayed, and washed in saline. White light imaging, immediately followed by near-IR imaging using a filter set optimized for Cy5.5, was done on a prototype OV-100 small animal fluorescence imaging system (Olympus).

Antibody microarray

Antibody microarray validation was done as previously described (21). The clusterin, cathepsin B, and cathepsin D antibodies were purchased from R&D Systems.

Western blotting

Mouse plasma samples were separated on 4% to 12% Bis-Tris 26-well gels (Criterion XT, Bio-Rad), transferred to a nitrocellulose membrane (0.45 μmol/L, Bio-Rad), and blocked overnight. For sample detection, the nitrocellulose was incubated with a rabbit anti–DJ-1 polyclonal antibody (Santa Cruz Biotechnology). A donkey anti-rabbit IgG conjugated with horseradish peroxidase (Pierce) was used as the secondary antibody. Antigen-antibody bands were visualized using the SuperSignal West Pico Chemiluminescent substrate (Pierce), followed by exposure to Kodak Biomax XAR film (Sigma-Aldrich). The film was scanned and the intensities of the bands were quantified using the image analysis software, Quantity One (version 4.4.1, Bio-Rad).

Mass spectrometry analysis of the plasma proteome in Apc Δ580 mice

The study was designed to test directly whether in-depth quantitative proteomic analysis of plasma from cases relative to matched controls yields identification of increased levels of plasma proteins derived from intestinal tumor tissue versus nonspecific protein changes related to the host response. An overview of this workflow is presented in Fig. 1. Plasma was obtained from 10 tumor-bearing Apc Δ580 mice at 10 to 12 weeks of age and 10 age-matched and sex-matched controls. Tumor status of individual mice was assessed by histopathology. These mice presented with 30 to 50 adenomas in both the small and large intestine. For quantitative analysis, differential isotopic labeling was applied to pooled plasma from tumor-bearing and control animals, respectively. The pools were combined in a 1:1 ratio, followed by extensive fractionation of intact proteins using anion exchange and reverse phase chromatography, yielding a total of 117 distinct plasma protein fractions. After tryptic digestion, individual fractions were analyzed separately by tandem mass spectrometry (MS/MS). Approximately 1,400,000 mass spectra were produced and analyzed in this study. Collectively, the fractions analyzed resulted in a primary list of 1,213 distinct proteins identified with high confidence (<5% false discovery rate, based on reverse-database searches). Of this total, 1,132 proteins were identified with two or more peptides. Interestingly, 304 proteins were increased by ≥1.5-fold in case pools compared with control pools, whereas 20 proteins were decreased by ≥1.5-fold, as may be expected with the release of tumor-derived proteins into the circulation. These proteins were broadly distributed in relation to their representation as secreted, membrane, cytoplasmic, or nuclear proteins based on gene ontology (Fig. 2A).

Fig. 1

MS/MS analysis of plasma from intestinal tumor–bearing Apc D580 mice and wild-type littermate controls. Workflow schema of multidimensional fractionation coupled to MS/MS analysis of mouse plasma.

Fig. 1

MS/MS analysis of plasma from intestinal tumor–bearing Apc D580 mice and wild-type littermate controls. Workflow schema of multidimensional fractionation coupled to MS/MS analysis of mouse plasma.

Close modal
Fig. 2

Marker candidates are tumor-derived secreted, membrane, and cytoplasmic proteins. A, gene ontology analysis of cellular localization of identified proteins. B, representative Ingenuity network analysis showing relationship to cancer, cellular growth and proliferation, cell cycle, cell death, cell to cell signaling and interaction, and tumor morphology.

Fig. 2

Marker candidates are tumor-derived secreted, membrane, and cytoplasmic proteins. A, gene ontology analysis of cellular localization of identified proteins. B, representative Ingenuity network analysis showing relationship to cancer, cellular growth and proliferation, cell cycle, cell death, cell to cell signaling and interaction, and tumor morphology.

Close modal

Concordance between the plasma proteome and tumor transcriptome

Plasma proteins whose levels were increased in the plasmas from cases versus controls may have originated either from overexpression in tumor tissue, or from a systemic host response. To identify those plasma proteins that may have originated from intestinal tumor tissue, we correlated protein levels in plasma with expression of corresponding genes at the RNA level in tumor tissue. Reliance on gene expression, although limited by potential discordance between RNA and protein levels, takes advantage of the wealth of transcriptomic data already available for various tumor types and is intended to assess the potential tumor tissue origin of candidate markers. One such transcriptome study yielded 2,545 genes that were overexpressed in adenomas and 2,576 genes that were overexpressed in adenocarcinomas (22). Based on this analysis, 51 proteins that were found to have increased levels in plasma from tumor-bearing mice also exhibited overexpression at the RNA level in tumor tissue (Table 1).

Table 1

Overexpressed plasma proteins from intestinal tumor–bearing mice that show concordance with overexpressed genes from the intestinal tumor transcriptome

ProteinIPI no.Entrez IDPlasma proteomeAdenoma transcriptomeCarcinoma transcriptome
Monocyte differentiation antigen CD14 precursor IPI00308990 12475 3.98 2.17 2.44 
Serum albumin precursor IPI00131695 11657 3.97 1.08 1.99 
Cullin-1 IPI00124047 26965 3.74 1.35 1.18 
Fatty acid-binding protein, epidermal IPI00114162 16592 3.46 1.08 1.08 
Clusterin precursor IPI00320420 12759 3.39 68.85 58.88 
Cathepsin Z IPI00125220 64138 3.38 1.08 1.22 
Cellular nucleic acid-binding protein IPI00330767 12785 3.27 1.82 2.21 
Copper transport protein ATOX1 IPI00117379 11927 3.04 1.64 1.73 
4632419I22Rik protein IPI00226497 74038 2.98 1.18 1.37 
Cathepsin B precursor IPI00113517 13030 2.88 1.25 1.71 
Transitional endoplasmic reticulum ATPase IPI00622235 269523 2.78 1.4 1.44 
Cystatin C precursor IPI00123744 13010 2.74 2.48 1.98 
Peptidyl-prolyl cis-trans isomerase A IPI00554989 268373 2.71 1.57 1.71 
Talin-1 IPI00465786 21894 2.69 1.34 1.28 
Phosphoglycerate mutase 1 IPI00457898 18648 2.69 3.36 2.56 
Collagen α-1(III) chain precursor IPI00129571 12825 2.66 1.33 1.71 
Complement factor I precursor IPI00320675 12630 2.61 49.69 45.36 
Elongation factor 1-α1 IPI00307837 13627 2.46 1.65 1.53 
Cathepsin D precursor IPI00111013 13033 2.44 2.78 2.01 
Insulin-like growth factor-binding protein 4 precursor IPI00112487 16010 2.43 1.03 1.46 
Heat shock cognate 71 kDa protein IPI00323357 15481 2.43 1.6 1.96 
PDZ and LIM domain protein 1 IPI00309768 54132 2.35 1.21 1.15 
14 kDa phosphohistidine phosphatase IPI00118757 75454 2.32 1.25 1.27 
Plasma retinol-binding protein precursor IPI00122429 19662 2.28 3.29 4.28 
Chromobox protein homologue 3 IPI00129468 12417 2.22 1.58 2.16 
Tenascin precursor IPI00420656 21923 2.2 1.01 1.86 
Isoform 1 of neuroplastin precursor IPI00123831 20320 2.14 1.34 2.21 
Coagulation factor V IPI00406603 14067 2.11 1.17 1.65 
Protein DJ-1 IPI00117264 57320 2.07 2.36 2.44 
23 kDa protein IPI00648658 12757 1.9 1.38 1.33 
Peroxiredoxin-2 IPI00117910 21672 1.87 2.55 2.58 
Isoform long of complement C3 precursor IPI00323624 12266 1.85 1.09 1.54 
Lactotransferrin precursor IPI00323235 17002 1.74 3.72 2.53 
Galectin-1 IPI00229517 16852 1.6 2.28 1.62 
Sulfated glycoprotein 1 precursor IPI00321190 19156 1.59 1.12 1.22 
Peroxiredoxin-6 IPI00555059 11758 1.59 1.56 1.64 
Insulin-like growth factor-binding protein 5 precursor IPI00114022 16011 1.59 7.45 25.41 
Transmembrane receptor Notch 1 IPI00322561 18128 1.53 1.29 1.29 
Interleukin-1 receptor type II precursor IPI00122155 16178 1.5 1.7 1.57 
Actin, β, cytoplasmic IPI00473320 11461 1.5 1.89 1.78 
Pyruvate kinase isozyme M2 IPI00407130 18746 1.49 4.63 4.62 
FK506-binding protein 1A IPI00266899 14225 1.47 2.4 2.39 
Fibronectin 1 IPI00352163 14268 1.46 2.46 2.87 
Tropomyosin α-4 chain IPI00421223 326618 1.44 4.64 5.31 
Tumor necrosis factor receptor superfamily member 1B precursor IPI00137744 21938 1.42 1.58 2.34 
GTP-binding nuclear protein Ran IPI00134621 19384 1.23 2.22 2.64 
Creatine kinase B-type IPI00136703 12709 1.23 1.85 1.68 
Ran-specific GTPase-activating protein (fragment) IPI00321978 19385 1.19 1.55 1.8 
Procollagen, type V, α2 IPI00121120 12832 1.19 1.16 1.79 
Dickkopf-related protein 3 precursor IPI00131904 50781 1.14 2.6 1.94 
Fructose-bisphosphate aldolase A IPI00221402 11674 1.07 2.05 1.69 
ProteinIPI no.Entrez IDPlasma proteomeAdenoma transcriptomeCarcinoma transcriptome
Monocyte differentiation antigen CD14 precursor IPI00308990 12475 3.98 2.17 2.44 
Serum albumin precursor IPI00131695 11657 3.97 1.08 1.99 
Cullin-1 IPI00124047 26965 3.74 1.35 1.18 
Fatty acid-binding protein, epidermal IPI00114162 16592 3.46 1.08 1.08 
Clusterin precursor IPI00320420 12759 3.39 68.85 58.88 
Cathepsin Z IPI00125220 64138 3.38 1.08 1.22 
Cellular nucleic acid-binding protein IPI00330767 12785 3.27 1.82 2.21 
Copper transport protein ATOX1 IPI00117379 11927 3.04 1.64 1.73 
4632419I22Rik protein IPI00226497 74038 2.98 1.18 1.37 
Cathepsin B precursor IPI00113517 13030 2.88 1.25 1.71 
Transitional endoplasmic reticulum ATPase IPI00622235 269523 2.78 1.4 1.44 
Cystatin C precursor IPI00123744 13010 2.74 2.48 1.98 
Peptidyl-prolyl cis-trans isomerase A IPI00554989 268373 2.71 1.57 1.71 
Talin-1 IPI00465786 21894 2.69 1.34 1.28 
Phosphoglycerate mutase 1 IPI00457898 18648 2.69 3.36 2.56 
Collagen α-1(III) chain precursor IPI00129571 12825 2.66 1.33 1.71 
Complement factor I precursor IPI00320675 12630 2.61 49.69 45.36 
Elongation factor 1-α1 IPI00307837 13627 2.46 1.65 1.53 
Cathepsin D precursor IPI00111013 13033 2.44 2.78 2.01 
Insulin-like growth factor-binding protein 4 precursor IPI00112487 16010 2.43 1.03 1.46 
Heat shock cognate 71 kDa protein IPI00323357 15481 2.43 1.6 1.96 
PDZ and LIM domain protein 1 IPI00309768 54132 2.35 1.21 1.15 
14 kDa phosphohistidine phosphatase IPI00118757 75454 2.32 1.25 1.27 
Plasma retinol-binding protein precursor IPI00122429 19662 2.28 3.29 4.28 
Chromobox protein homologue 3 IPI00129468 12417 2.22 1.58 2.16 
Tenascin precursor IPI00420656 21923 2.2 1.01 1.86 
Isoform 1 of neuroplastin precursor IPI00123831 20320 2.14 1.34 2.21 
Coagulation factor V IPI00406603 14067 2.11 1.17 1.65 
Protein DJ-1 IPI00117264 57320 2.07 2.36 2.44 
23 kDa protein IPI00648658 12757 1.9 1.38 1.33 
Peroxiredoxin-2 IPI00117910 21672 1.87 2.55 2.58 
Isoform long of complement C3 precursor IPI00323624 12266 1.85 1.09 1.54 
Lactotransferrin precursor IPI00323235 17002 1.74 3.72 2.53 
Galectin-1 IPI00229517 16852 1.6 2.28 1.62 
Sulfated glycoprotein 1 precursor IPI00321190 19156 1.59 1.12 1.22 
Peroxiredoxin-6 IPI00555059 11758 1.59 1.56 1.64 
Insulin-like growth factor-binding protein 5 precursor IPI00114022 16011 1.59 7.45 25.41 
Transmembrane receptor Notch 1 IPI00322561 18128 1.53 1.29 1.29 
Interleukin-1 receptor type II precursor IPI00122155 16178 1.5 1.7 1.57 
Actin, β, cytoplasmic IPI00473320 11461 1.5 1.89 1.78 
Pyruvate kinase isozyme M2 IPI00407130 18746 1.49 4.63 4.62 
FK506-binding protein 1A IPI00266899 14225 1.47 2.4 2.39 
Fibronectin 1 IPI00352163 14268 1.46 2.46 2.87 
Tropomyosin α-4 chain IPI00421223 326618 1.44 4.64 5.31 
Tumor necrosis factor receptor superfamily member 1B precursor IPI00137744 21938 1.42 1.58 2.34 
GTP-binding nuclear protein Ran IPI00134621 19384 1.23 2.22 2.64 
Creatine kinase B-type IPI00136703 12709 1.23 1.85 1.68 
Ran-specific GTPase-activating protein (fragment) IPI00321978 19385 1.19 1.55 1.8 
Procollagen, type V, α2 IPI00121120 12832 1.19 1.16 1.79 
Dickkopf-related protein 3 precursor IPI00131904 50781 1.14 2.6 1.94 
Fructose-bisphosphate aldolase A IPI00221402 11674 1.07 2.05 1.69 

NOTE: List of proteins with IPI protein and Entrez gene identifiers. Values for plasma proteome, adenoma transcriptome, and carcinoma transcriptome were presented as fold change in intestinal tumor–bearing versus wild-type littermate controls.

Biological significance of proteins with elevated levels in plasma and concordant increased expression of their corresponding genes in intestinal tumor tissue

To further determine the relationship of these 51 proteins to tumorigenesis, we annotated these proteins using knowledgebases such as MedMiner, XplorMed, Database for Annotation, Visualization, and Integrated Discovery 2.1, and FatiGO (1720). Ingenuity Pathway Analysis was also used to determine their cancer relevance from information in the Ingenuity Pathways Knowledge Base (Fig. 2B). The 51 proteins were found to be involved in major processes such as cell death (28 proteins), cellular growth and proliferation (25 proteins), cell to cell signaling and interactions (19 proteins), and cell cycle control (12 proteins). One protein was related to the Wnt signaling pathway associated with inactivation of the Apc gene.

Orthogonal antibody-based validation

The finding that many of the proteins that occurred at increased levels in plasma were overexpressed in the tumor transcriptome suggested that these circulating proteins were tumor-derived. To confirm this relationship, we examined protein expression in tumor tissue by immunohistochemistry. We focused our validation efforts on proteins for which antibodies with the prerequisite specificity were available. Six proteins that met these criteria were cathepsin B, cathepsin D, Cul1, Park7, Pkm2, and Ran, all of which were found to be overexpressed in tumor tissue with low to undetectable expression in normal intestinal epithelium (Fig. 3). Cathepsins B and D, Park7, and Pkm2 were present in tumor cells in a cytoplasmic distribution. Cul1 and Ran were also found in tumor cells, but in both nuclear and cytoplasmic compartments. Increased levels of Park7, Pkm2, and Ran were also observed in human colon adenoma tissue (Fig. 4).

Fig. 3

Marker candidates are overexpressed in mouse intestinal tumor. Immunohistochemistry for cathepsin B (A and B), cathepsin D (C and D), cullin-1 (E and F), pyruvate kinase M2 (G and H), DJ-1 (I and J), and Ran (K and L) of normal intestinal epithelium and intestinal tumor tissue, respectively.

Fig. 3

Marker candidates are overexpressed in mouse intestinal tumor. Immunohistochemistry for cathepsin B (A and B), cathepsin D (C and D), cullin-1 (E and F), pyruvate kinase M2 (G and H), DJ-1 (I and J), and Ran (K and L) of normal intestinal epithelium and intestinal tumor tissue, respectively.

Close modal
Fig. 4

Marker candidates are overexpressed in human colon adenomas. Immunohistochemistry for DJ-1 (A and B), pyruvate kinase M2 (C and D), and Ran (E and F) of normal intestinal epithelium and colon adenoma tissue, respectively.

Fig. 4

Marker candidates are overexpressed in human colon adenomas. Immunohistochemistry for DJ-1 (A and B), pyruvate kinase M2 (C and D), and Ran (E and F) of normal intestinal epithelium and colon adenoma tissue, respectively.

Close modal

Using available antibodies, we examined the levels of a subset of up-regulated proteins in the plasma of an independent set of 40 individual tumor-bearing mice and 40 matched controls using antibody microarrays and Western blotting. Cathepsin B, cathepsin D, clusterin, and Park7 were found to occur at increased levels in plasma from individual tumor-bearing mice (Fig. 5). The P values were highly significant, ranging from P = 0.002 for cathepsin B and Park7 to P = 6.66 × 10−7 for cathepsin D.

Fig. 5

Antibody-based validation shows the overexpression of marker candidates in plasma. Antibody microarray results for clusterin (A), cathepsin B (B), and cathepsin D (C). Western blot analysis for DJ-1 (D).

Fig. 5

Antibody-based validation shows the overexpression of marker candidates in plasma. Antibody microarray results for clusterin (A), cathepsin B (B), and cathepsin D (C). Western blot analysis for DJ-1 (D).

Close modal

Cathepsin activity–based tumor imaging

Cathepsin B and cathepsin D were two of the proteins that were identified at increased levels in plasma from tumor-bearing mice by MS and antibody microarray, and in tumor tissue by immunohistochemistry. Both of these proteins are cysteine proteases that have been implicated in cancer pathogenesis. To assess their biological activity in tumors compared with normal mucosa, we injected tumor-bearing mice with the Prosense imaging agent, a normally inert fluorophore that is activated through cleavage by cathepsins (23). Comparison of white light and near-IR imaging that detects the activated Prosense probe revealed that only the tumor and not the surrounding normal mucosa were positive, indicating that cathepsin activity occurred at the tumor site (Fig. 6).

Fig. 6

Prosense imaging shows the intestinal tumor specificity of cathepsin. Intestinal tumor–bearing mutant mice were injected with Prosense imaging reagent. Ex vivo imaging shows concordance between white light imaging (A) for tumors and false-colored near-IR Prosense imaging (B) for cathepsin.

Fig. 6

Prosense imaging shows the intestinal tumor specificity of cathepsin. Intestinal tumor–bearing mutant mice were injected with Prosense imaging reagent. Ex vivo imaging shows concordance between white light imaging (A) for tumors and false-colored near-IR Prosense imaging (B) for cathepsin.

Close modal

Genetically modified mouse models are attractive as a source of markers because of their genetic homogeneity. In a prior study of an ApcMin mouse model, we showed that plasma proteome analysis of relatively abundant proteins could distinguish between tumor-bearing mice and their wild-type counterparts (21). The proteins that provided such discrimination were largely acute phase reactants which have poor specificity for intestinal tumors. To identify intestinal tumor markers with higher specificity, we have conducted an in-depth quantitative multidimensional MS analysis of plasma proteins from tumor-bearing mutant Apc Δ580 mice (cases) and non–tumor-bearing wild-type littermates (controls).

Previous MS-based biomarker discovery studies have focused analysis on either the peripheral blood compartment or directly on the colorectal cancer tissue (2427). Whereas, the former approach ensures that the identified candidate markers are present in blood, the tumor origin of such proteins is uncertain. Conversely, the latter approach assures that the candidate markers are tumor-derived; however, it is unclear if they occur in serum or plasma. In this study, we implemented an approach in which quantitative tandem MS-based plasma proteome data was integrated with tumor tissue transcriptome analysis. To this effect, we have taken advantage of differential isotopic labeling of plasma proteins with acrylamide, coupled with extensive fractionation to achieve both quantitative analysis and high sensitivity (11). We have also integrated proteomic data with tumor transcriptomic data and biological annotation data to assess the intestinal tumor relevance of up-regulated plasma proteins.

The depletion of high-abundance plasma proteins, coupled with extensive fractionation prior to mass spectrometry, allowed us to identify >1,200 proteins in plasma, most of which were identified with two or more unique peptides with an overall accuracy of detection estimated to be 96.5% based on reverse database search. The predominance of up-regulated proteins reflects the contributions of secretion, protein shedding, and cell turnover to the plasma proteome. During the development of the tumor, some genes and their corresponding proteins were overexpressed whereas others lost their expression or were underexpressed relative to normal cells. Examination of plasma would only reveal those that are more abundantly expressed, and therefore, it is not surprising that most of the differentially expressed proteins that we detected in plasma were up-regulated.

To determine the relevance of up-regulated proteins to intestinal tumors, we integrated findings from the plasma proteome with intestinal tumor transcriptomic and biological annotation data. Such an analysis, combined with biological annotation data, was informative in highlighting the relevance of several proteins found to be up-regulated in colorectal cancer pathogenesis. Antibody microarrays and Western blot analysis of plasma provided a confirmation of MS-based findings. Intestinal tumor immunohistochemistry and Prosense imaging showed that the proteins examined were indeed expressed at the tumor site, and in some cases, were overexpressed specifically in tumor cells. Below, we discuss potential candidate markers for colorectal cancer.

Cullin 1

Progression through the cell cycle is based on precisely regulated levels of cyclins and cyclin-dependent kinases through the ubiquitin-proteasome pathway. The SCF E3 ubiquitin ligase is responsible for the attachment of ubiquitin and contains Rbx1, Cul1, and Skp1. Variable F-box proteins, such as Skp2, provide target specificity (28, 29). Skp2 has been shown to be critical for the degradation of p27, thereby allowing progression through the G1-S checkpoint (30, 31). Degradation of p27 has been shown to be important for colorectal cancer in both mouse models and in humans (32, 33). As such, circulating Cul1 might serve as a surrogate marker for increased p27 degradation, reflecting the characteristic rapid progression through the cell cycle.

Cathepsin B and cathepsin D

Cathepsins are involved in the degradation of the basement membrane and extracellular matrix, a key step in tumor invasion and metastasis. Two such proteases, cathepsin B and cathepsin D, have been shown to be associated with colorectal cancer (3437). In accordance with the ELISA data, cathepsin B and cathepsin D were determined to be of intestinal tumor origin by both immunohistochemistry and ex vivo fluorescence imaging using Prosense.

Pyruvate kinase, M2

In normal cells, pyruvate kinase is a key glycolytic enzyme that is responsible for the production of ATP through the conversion of phosphoenolpyruvate to pyruvate. However, in tumor cells, pyruvate kinase is converted from its active tetrameric form into an inactive dimeric form, resulting in a shunting of the upstream phosphometabolites towards synthesis of nucleic acids, amino acids, and phospholipids (38). In accordance with our findings above, Pkm2 has been shown to be increased in cancer including colorectal cancer and has shown potential as a tissue-based biomarker (3941). Furthermore, the switch to the Pkm2 isoform in tumors has been recently shown to be necessary for aerobic glycolysis and is responsible for their subsequent selective growth advantage (42).

Parkinson disease 7

Park7 or Dj-1 is a gene that has been implicated in autosomal recessive, early-onset Parkinsonism (43). However, Dj-1 has also been shown to have transforming activity in conjunction with Ras signaling and to be a negative regulator of Pten (44). Furthermore, its expression is increased in both breast and lung carcinoma (45). Lastly, this gene has been identified as a novel circulating tumor antigen in patients with breast cancer (46). Taken together, circulating Dj-1 might act as a surrogate marker for PI3K activation in colorectal cancer.

Ran

Ran is a GTPase that is involved in protein trafficking across the nuclear membrane, formation of the mitotic spindle, and nuclear envelope assembly (47). This may explain the elevated levels in tumor cells.

Our confirmation studies were limited based on the availability of robust antibodies. Therefore, additional proteins from the identified set may also have relevance as plasma-based markers for colorectal cancer. Our findings support the view that genetically modified mice provide useful models for biomarker discovery as recently reported for a mouse model of pancreatic cancer (48). Our results also suggest that examination of plasma from tumor-bearing individuals and comparing the profile with those of age-matched and sex-matched controls would yield interesting biomarkers that might be useful for the detection of human colorectal cancer.

No potential conflicts of interest were disclosed.

1
Davies
RJ
,
Miller
R
,
Coleman
N
. 
Colorectal cancer screening: prospects for molecular stool analysis
.
Nat Rev Cancer
2005
;
5
:
199
209
.
2
Rockey
DC
,
Barish
M
,
Brill
JV
, et al
. 
Standards for gastroenterologists for performing and interpreting diagnostic computed tomographic colonography
.
Gastroenterology
2007
;
133
:
1005
24
.
3
Lieberman
R
. 
Evidence-based medical perspectives: the evolving role of PSA for early detection, monitoring of treatment response, and as a surrogate end point of efficacy for interventions in men with different clinical risk states for the prevention and progression of prostate cancer
.
Am J Ther
2004
;
11
:
501
6
.
4
Badgwell
D
,
Bast
RC
 Jr
. 
Early detection of ovarian cancer
.
Dis Markers
2007
;
23
:
397
410
.
5
Katz
MH
,
Savides
TJ
,
Moossa
AR
,
Bouvet
M
. 
An evidence-based approach to the diagnosis and staging of pancreatic cancer
.
Pancreatology
2005
;
5
:
576
90
.
6
Duffy
MJ
,
van Dalen
A
,
Haglund
C
, et al
. 
Tumour markers in colorectal cancer: European Group on Tumour Markers (EGTM) guidelines for clinical use
.
Eur J Cancer
2007
;
43
:
1348
60
.
7
Hanash
SM
,
Pitteri
SJ
,
Faca
VM
. 
Mining the plasma proteome for cancer biomarkers
.
Nature
2008
;
452
:
571
9
.
8
Kinzler
KW
,
Vogelstein
B
. 
Lessons from hereditary colorectal cancer
.
Cell
1996
;
87
:
159
70
.
9
Taketo
MM
. 
Mouse models of gastrointestinal tumors
.
Cancer Sci
2006
;
97
:
355
61
.
10
Kuraguchi
M
,
Wang
XP
,
Bronson
RT
, et al
. 
Adenomatous polyposis coli (APC) is required for normal development of skin and thymus
.
PLoS Genet
2006
;
2
:
e146
.
11
Faca
V
,
Coram
M
,
Phanstiel
D
, et al
. 
Quantitative analysis of acrylamide labeled serum proteins by LC-MS/MS
.
J Proteome Res
2006
;
5
:
2009
18
.
12
Rauch
A
,
Bellew
M
,
Eng
J
, et al
. 
Computational Proteomics Analysis System (CPAS): an extensible, open-source analytic system for evaluating and publishing proteomic data and high throughput biological experiments
.
J Proteome Res
2006
;
5
:
112
21
.
13
MacLean
B
,
Eng
JK
,
Beavis
RC
,
McIntosh
M
. 
General framework for developing and evaluating database scoring algorithms using the TANDEM search engine
.
Bioinformatics
2006
;
22
:
2830
2
.
14
Keller
A
,
Nesvizhskii
AI
,
Kolker
E
,
Aebersold
R
. 
Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search
.
Anal Chem
2002
;
74
:
5383
92
.
15
Nesvizhskii
AI
,
Keller
A
,
Kolker
E
,
Aebersold
R
. 
A statistical model for identifying proteins by tandem mass spectrometry
.
Anal Chem
2003
;
75
:
4646
58
.
16
Gasteiger
E
,
Gattiker
A
,
Hoogland
C
,
Ivanyi
I
,
Appel
RD
,
Bairoch
A
. 
ExPASy: The proteomics server for in-depth protein knowledge and analysis
.
Nucleic Acids Res
2003
;
31
:
3784
8
.
17
Tanabe
L
,
Scherf
U
,
Smith
LH
,
Lee
JK
,
Hunter
L
,
Weinstein
JN
. 
MedMiner: an Internet text-mining tool for biomedical information, with application to gene expression profiling
.
Biotechniques
1999
;
27
:
1210
4, 6–7.
.
18
Perez-Iratxeta
C
,
Perez
AJ
,
Bork
P
,
Andrade
MA
. 
Update on XplorMed: a web server for exploring scientific literature
.
Nucleic Acids Res
2003
;
31
:
3866
8
.
19
Al-Shahrour
F
,
Diaz-Uriarte
R
,
Dopazo
J
. 
FatiGO: a web tool for finding significant associations of gene ontology terms with groups of genes
.
Bioinformatics
2004
;
20
:
578
80
.
20
Dennis
G,
 Jr.
,
Sherman
BT
,
Hosack
DA
, et al
. 
DAVID: Database for Annotation, Visualization, and Integrated Discovery
.
Genome Biol
2003
;
4
:
P3
.
21
Hung
KE
,
Kho
AT
,
Sarracino
D
, et al
. 
Mass spectrometry-based study of the plasma proteome in a mouse intestinal tumor model
.
J Proteome Res
2006
;
5
:
1866
78
.
22
Paoni
NF
,
Feldman
MW
,
Gutierrez
LS
,
Ploplis
VA
,
Castellino
FJ
. 
Transcriptional profiling of the transition from normal intestinal epithelia to adenomas and carcinomas in the APCMin/+ mouse
.
Physiol Genomics
2003
;
15
:
228
35
.
23
Funovics
MA
,
Alencar
H
,
Montet
X
,
Weissleder
R
,
Mahmood
U
. 
Simultaneous fluorescence imaging of protease expression and vascularity during murine colonoscopy for colonic lesion characterization
.
Gastrointest Endosc
2006
;
64
:
589
97
.
24
Ward
DG
,
Suggett
N
,
Cheng
Y
, et al
. 
Identification of serum biomarkers for colon cancer by proteomic analysis
.
Br J Cancer
2006
;
94
:
1898
905
.
25
Chaurand
P
,
DaGue
BB
,
Pearsall
RS
,
Threadgill
DW
,
Caprioli
RM
. 
Profiling proteins from azoxymethane-induced colon tumors at the molecular level by matrix-assisted laser desorption/ionization mass spectrometry
.
Proteomics
2001
;
1
:
1320
6
.
26
Friedman
DB
,
Hill
S
,
Keller
JW
, et al
. 
Proteome analysis of human colon cancer by two-dimensional difference gel electrophoresis and mass spectrometry
.
Proteomics
2004
;
4
:
793
811
.
27
Balasubramani
M
,
Day
BW
,
Schoen
RE
,
Getzenberg
RH
. 
Altered expression and localization of creatine kinase B, heterogeneous nuclear ribonucleoprotein F, and high mobility group box 1 protein in the nuclear matrix associated with colon cancer
.
Cancer Res
2006
;
66
:
763
9
.
28
Cardozo
T
,
Pagano
M
. 
The SCF ubiquitin ligase: insights into a molecular machine
.
Nat Rev Mol Cell Biol
2004
;
5
:
739
51
.
29
Nakayama
KI
,
Nakayama
K
. 
Regulation of the cell cycle by SCF-type ubiquitin ligases
.
Semin Cell Dev Biol
2005
;
16
:
323
33
.
30
Carrano
AC
,
Eytan
E
,
Hershko
A
,
Pagano
M
. 
SKP2 is required for ubiquitin-mediated degradation of the CDK inhibitor p27
.
Nat Cell Biol
1999
;
1
:
193
9
.
31
Nakayama
K
,
Nagahama
H
,
Minamishima
YA
, et al
. 
Targeted disruption of Skp2 results in accumulation of cyclin E and p27(Kip1), polyploidy and centrosome overduplication
.
EMBO J
2000
;
19
:
2069
81
.
32
Loda
M
,
Cukor
B
,
Tam
SW
, et al
. 
Increased proteasome-dependent degradation of the cyclin-dependent kinase inhibitor p27 in aggressive colorectal carcinomas
.
Nat Med
1997
;
3
:
231
4
.
33
Philipp-Staheli
J
,
Kim
K-H
,
Payne
SR
, et al
. 
Pathway-specific tumor suppression: reduction of p27 accelerates gastrointestinal tumorigenesis in Apc mutant mice, but not in Smad3 mutant mice
.
Cancer Cell
2002
;
1
:
355
68
.
34
Adenis
A
,
Huet
G
,
Zerimech
F
,
Hecquet
B
,
Balduyck
M
,
Peyrat
JP
. 
Cathepsin B, L, and D activities in colorectal carcinomas: relationship with clinico-pathological parameters
.
Cancer Lett
1995
;
96
:
267
75
.
35
Campo
E
,
Munoz
J
,
Miquel
R
, et al
. 
Cathepsin B expression in colorectal carcinomas correlates with tumor progression and shortened patient survival
.
Am J Pathol
1994
;
145
:
301
9
.
36
Sebzda
T
,
Saleh
Y
,
Gburek
J
, et al
. 
Cathepsin D expression in human colorectal cancer: relationship with tumour type and tissue differentiation grade
.
J Exp Ther Oncol
2005
;
5
:
145
50
.
37
Talieri
M
,
Papadopoulou
S
,
Scorilas
A
, et al
. 
Cathepsin B and cathepsin D expression in the progression of colorectal adenoma to carcinoma
.
Cancer Lett
2004
;
205
:
97
106
.
38
Mazurek
S
,
Boschek
CB
,
Hugo
F
,
Eigenbrodt
E
. 
Pyruvate kinase type M2 and its role in tumor growth and spreading
.
Semin Cancer Biol
2005
;
15
:
300
8
.
39
Eigenbrodt
E
,
Basenau
D
,
Holthusen
S
,
Mazurek
S
,
Fischer
G
. 
Quantification of tumor type M2 pyruvate kinase (Tu M2-PK) in human carcinomas
.
Anticancer Res
1997
;
17
:
3153
6
.
40
Hardt
PD
,
Mazurek
S
,
Toepler
M
, et al
. 
Faecal tumour M2 pyruvate kinase: a new, sensitive screening tool for colorectal cancer
.
Br J Cancer
2004
;
91
:
980
4
.
41
Mazurek
S
,
Grimm
H
,
Oehmke
M
,
Weisse
G
,
Teigelkamp
S
,
Eigenbrodt
E
. 
Tumor M2-PK and glutaminolytic enzymes in the metabolic shift of tumor cells
.
Anticancer Res
2000
;
20
:
5151
4
.
42
Christofk
HR
,
Vander Heiden
MG
,
Harris
MH
, et al
. 
The M2 splice isoform of pyruvate kinase is important for cancer metabolism and tumour growth
.
Nature
2008
;
452
:
230
3
.
43
van Duijn
CM
,
Dekker
MC
,
Bonifati
V
, et al
. 
Park7, a novel locus for autosomal recessive early-onset parkinsonism, on chromosome 1p36
.
Am J Hum Genet
2001
;
69
:
629
34
.
44
Nagakubo
D
,
Taira
T
,
Kitaura
H
, et al
. 
DJ-1, a novel oncogene which transforms mouse NIH3T3 cells in cooperation with ras
.
Biochem Biophys Res Commun
1997
;
231
:
509
13
.
45
Kim
RH
,
Peters
M
,
Jang
Y
, et al
. 
DJ-1, a novel regulator of the tumor suppressor PTEN
.
Cancer Cell
2005
;
7
:
263
73
.
46
Le Naour
F
,
Misek
DE
,
Krause
MC
, et al
. 
Proteomics-based identification of RS/DJ-1 as a novel circulating tumor antigen in breast cancer
.
Clin Cancer Res
2001
;
7
:
3328
35
.
47
Quimby
BB
,
Dasso
M
. 
The small GTPase Ran: interpreting the signs
.
Curr Opin Cell Biol
2003
;
15
:
338
44
.
48
Faca
VM
,
Song
KS
,
Wang
H
, et al
. 
A mouse to human search for plasma proteome changes associated with pancreatic tumor development
.
PLoS Med
2008
;
5
:
e123
.