We assessed the autoantibody repertoire of a mouse model engineered to develop breast cancer and the repertoire of autoantibodies in human plasmas collected at a preclinical time point and at the time of clinical diagnosis of breast cancer. In seeking to identify common pathways, networks, and protein families associated with the humoral response, we elucidated the dynamic nature of tumor antigens and autoantibody interactions. Lysate proteins from an immortalized cell line from a MMTV-neu mouse model and from MCF7 human breast cancers were spotted onto nitrocellulose microarrays and hybridized with mouse and human plasma samples, respectively. Immunoglobulin-based plasma immunoreactivity against glycolysis and spliceosome proteins was a predominant feature observed both in tumor-bearing mice and in prediagnostic human samples. Interestingly, autoantibody reactivity was more pronounced further away than closer to diagnosis. We provide evidence for dynamic changes in autoantibody reactivity with tumor development and progression that may depend, in part, on the extent of antigen–antibody interactions. Cancer Res; 73(5); 1502–13. ©2012 AACR.

The dynamic interactions between circulating tumor antigens and autoantibodies during breast cancer development and progression have not been well characterized. Most studies of autoantibodies in cancer have relied on samples obtained after clinical diagnosis. Analysis of preclinical samples provides an opportunity to evaluate changes in the humoral response with tumor development and progression.

Several studies have yielded circulating autoantibodies against specific antigens at the time of diagnosis (1–3). High-throughput screening of proteins for autoantibody response is facilitated by the use of microarray technology. Discovery of novel autoantigens has been made through arrays composed of recombinant proteins (4–6), tumor homogenates (7), and phage-display libraries (8, 9). Peptide arrays have also been used to identify cancer-associated autoantibodies (10). Arrays of lysate-derived proteins allow delineation of immunogenic signatures involving natural proteins that may be subject to posttranslational modification, as previously applied to the studies of lung (11–13), colon (14, 15), prostate (16), and pancreatic (17) cancers. We recently showed that global profiling of the plasma proteome allows identification of sets of proteins progressively released into circulation at a preclinical stage of breast tumor development that may be grouped into biologic pathways (18). There is similarly a need to delineate sets of proteins associated with biologic pathways and networks that elicit autoantibodies with breast tumor development. To this effect, we assessed the autoantibody repertoire of a mouse model engineered to develop breast cancer and of both preclinical human samples from a longitudinal cohort and clinical human samples obtained at the time of clinical diagnosis of breast cancer.

Plasma samples

Mouse model samples.

Plasma samples from MMTV-neu mouse model were serially collected at the University of Washington Tumor Vaccine Group, Specific Pathogen-Free Facility, and Institutional Animal Care and Use Committee protocol #2878–01, from a baseline of 8 weeks until the animals were euthanized because of excessive tumor volume. Baseline samples and 2 blood collections just before the palpable tumors were used for 23 tumor-bearing mice. Samples were collected retro-orbitally at approximately 100–200 μL of whole blood. Analyzed blood samples were collected on average 121 days and 144 days after baseline sample.

Human samples.

Prediagnostic EDTA plasma samples were collected as part of the Women's Health Initiative (WHI) observational study (Table 1). Autoantibody analysis was conducted using plasmas from 48 postmenopausal women having no history of hormone therapy use who were later diagnosed with estrogen receptor–positive (ER+)/progesterone receptor positive (PR+) breast cancer and 65 healthy controls with similar distributions of age, time of blood collection (±6 months), and hormone therapy use. Newly diagnosed plasma samples from 61 postmenopausal women diagnosed with stage I/II ER+/PR+ breast cancer and 61 matched healthy controls were also investigated (Table 1). Assays of pyruvate kinase isozyme M1/M2 (PKM2) were conducted on plasma samples from an additional 118 postmenopausal WHI participants who were later diagnosed with ER+ breast cancer and 118 healthy controls matched on age and ethnicity. These samples were not matched on hormone therapy usage.

Table 1.

Human subject characteristics

WHI samples for autoantibodyWHI samples for PKM2 ELISANewly diagnosed
CaseControlCaseControlCaseControl
Sample number  48 65 118a 120 61 61 
Average age  66 (50–78) 65.5 (50–78) 66.3 (50–79) 66.3 (50–79) 55.4 (33–77) 51.3 (33–76) 
Stage 16 (36%) — 79 (67%) — 31 (51%) — 
 II 20 (44%) — 16 (14%) — 30 (49%) — 
 III 8 (18%) — 5 (4%) — — 
 IV 1 (2%) — 1 (1%) — — 
 Unknown — 17 (14%) — — 
Avg. days to diagnosis  144.4 (21–259) — 220.9 (11–363) — — — 
WHI samples for autoantibodyWHI samples for PKM2 ELISANewly diagnosed
CaseControlCaseControlCaseControl
Sample number  48 65 118a 120 61 61 
Average age  66 (50–78) 65.5 (50–78) 66.3 (50–79) 66.3 (50–79) 55.4 (33–77) 51.3 (33–76) 
Stage 16 (36%) — 79 (67%) — 31 (51%) — 
 II 20 (44%) — 16 (14%) — 30 (49%) — 
 III 8 (18%) — 5 (4%) — — 
 IV 1 (2%) — 1 (1%) — — 
 Unknown — 17 (14%) — — 
Avg. days to diagnosis  144.4 (21–259) — 220.9 (11–363) — — — 

aThirty-six case samples were collected within 150 days before diagnosis, 82 were collected more than 150 days before diagnosis.

Protein fractionation and array construction

One hundred and fifty milligrams of protein derived from MMTV-neu and MCF7 cell lysates were each subjected to orthogonal two-dimensional high-performance liquid chromatography (2D-HPLC) fractionation in an automated system (Shimadzu Corporation; Fig. 1A; ref. 19). An excess of protein from each cell line was fractioned to ensure adequate protein content in arrayed spots and availability of protein fractions for further investigation and validation. Fractionation was based on anion exchange (SAX/10 column, 7.5 mm ID×150 mm, Column Technology Inc.) using a 40 step-elution, followed by a second dimension reversed-phase separation (RP/5D column, 4.6 mm ID×150 mm, Column Technology Inc.). A total of 2,430 fractions were collected from the 2D separation. Fr_X_Y denotes the Yth fraction from the reverse-phase liquid chromatography of the Xth fraction from the anion exchange separation. The first dimension anion-exchange chromatography mobile-phase A was 20 mmol/L Tris, pH 8.5, and mobile-phase B was 20 mmol/L Tris, 1 mol/L NaCl, and pH 8.5. The second dimension reversed-phase chromatography mobile-phase A was 95% water, 5% acetonitrile, and 0.1% trifluoroacetic acid (TFA), and mobile-phase B was 90% acetonitrile, 10% water, and 0.1% TFA.

Figure 1.

A, experimental design. B, identification of immunogenic proteins was on the basis of autoantibody reactivity to arrayed MCF7 fractions. Peaks were determined qualitatively from microarray data. C, Western blot analyses of individual fractions within a cluster with individual plasma samples were used to determine the molecular weight of reactive bands whose pattern qualitatively matched the microarray response pattern. D, mass spectrometry analysis of fractions within a cluster identifies proteins by total peptide counts that match microarray and Western blot analysis data.

Figure 1.

A, experimental design. B, identification of immunogenic proteins was on the basis of autoantibody reactivity to arrayed MCF7 fractions. Peaks were determined qualitatively from microarray data. C, Western blot analyses of individual fractions within a cluster with individual plasma samples were used to determine the molecular weight of reactive bands whose pattern qualitatively matched the microarray response pattern. D, mass spectrometry analysis of fractions within a cluster identifies proteins by total peptide counts that match microarray and Western blot analysis data.

Close modal

Three hundred microliters of each fraction was lyophilized and resuspended in 30 μL of printing buffer (250 mmol/L of Tris-HCl, pH 6.8, 0.5% SDS, 25% glycerol, 0.05% TritonX-100, and 62.5 mmol/L of dithiothreitol). A total of 1,950 MCF7 fractions or 2,808 MMTV-neu fractions, together with printing buffer as negative controls and purified human or mouse immunoglobulin G (IgG) as positive controls were printed onto nitrocellulose-coated slides using a contact printer as previously described (11, 20). Approximately 500 fractions were excluded from arraying due to low UV absorbance observed during fractionation. Plasma samples were hybridized with an individual microarray at a dilution of 1:150. Reactivity was quantified using an indirect immunofluorescence protocol, as previously described (13). Local background-subtracted median spot intensities were generated using GenePix Pro 6.1 and used for downstream statistical analysis using R 2.9.0. Spot intensities were log (base 2) transformed before statistical analysis. P-values were calculated using a student's t-test.

Western blot analysis

One hundred microliters of individual fractions was lyophilized and resuspended in 40 μL of loading buffer. Fractions were run in separate lanes of a 4% to 12% Bis-Tris Criterion XT Precast Gel. Gels were transferred to polyvinylidene difluoride membranes for 1.5 hours at 80 V. Membranes were blocked in 3% bovine serum albumin (BSA) at room temperature for 1 hour. Plasma samples were diluted 1:500 in 3% BSA and incubated with the membrane at 4°C overnight. Samples were removed and membranes were washed with 0.1% PBS with tween (PBST) 5 times for 5 minutes each. Horseradish peroxidase–labeled anti-mouse or anti-human IgG at a 1:2,000 dilution was incubated with the membrane at room temperature for 1 hour. Solutions were removed and membranes were washed with 0.1% PBST 5 times for 5 minutes each. Membranes were exposed to enhanced chemiluminescence (ECL) for 1 minute and exposed to ECL hyperfilm for varied lengths of time. Films were developed and scanned for qualitative analysis.

Mass spectrometry analysis

On the basis of the protein microarray analysis, 50 μL of each interesting fraction from the 2D-HPLC was lyophilized using a freeze-drying system (Labconco). The lyophilized protein samples were dissolved in 100 mmol/L NH4HCO3 (pH 8.5) followed by overnight in-solution digestion with trypsin at 37°C. The digestion was quenched by adding 5 μL of 1.0% formic acid solution before liquid chromatography/tandem mass spectrometry (LC/MS-MS) analysis as described previously (21). Briefly, peptides were separated by reversed-phase chromatography using a nano HPLC system (Eksigent) coupled online with a LTQ-FT mass spectrometer (Thermo Fisher Scientific, Inc.). Mass spectrometer parameters were spray voltage 2.5 kV, capillary temperature 200°C, FT resolution 100,000, FT target value 8 × 105, LTQ target value 104, 1 FT microscan with 850 ms injection time, and 1 LTQ microscan with 100 ms injection time. Mass spectra were acquired in a data-dependent mode with the m/z range of 400 to 2000. The full mass spectrum (MS scan) was acquired by the FT, and tandem mass spectrum (MS/MS scan) was acquired by the LTQ with 35% normalized collision energy. Acquisition of each full mass spectrum was followed by the acquisition of tandem mass spectrometry spectra for the 5 most intense +2 or +3 ions within a one second duty cycle. The minimum signal threshold (counts) for a precursor occurring during a mass spectrometry scan was set at 1,000 for triggering a MS/MS scan.

The acquired LC/MS-MS data were processed by the Computational Proteomics Analysis System (22–24). Briefly, LC/MS-MS data were first converted to mzXML format using ReAdW software (version 1.2) to generate the peak list for protein database searching. The X!Tandem search engine (version 2005.12.01) parameters included cysteine (Cys) alkylated with iodoacetamide (57.02146@C) as a fixed modification and methionine (Met) oxidation (15.99491@M) as a variable modification. Data were searched against the International Protein Index human protein knowledge base (version v3.57), which contained entries for 76,542 proteins. The minimum criterion for peptide matching was a Peptide Prophet Score of 0.2 or more. Peptides meeting this criterion were grouped to protein sequences using the Protein Prophet algorithm at an error rate of 5% or less to maximize protein discovery and identification. Total peptide count in each fraction was used as a measure of protein concentration within that fraction.

Identification of immunogenic proteins

Significantly elevated fractions and neighboring fractions were grouped into “fraction clusters” based on microarray reactivity (Fig. 1B). Clusters were subjected to analysis by Western blot and mass spectrometry analyses to determine the immunogenic proteins within each cluster. Western blot analysis was used to determine the molecular weight of proteins with autoantibody reactivity in plasma samples (Fig. 1C). Blots of fraction clusters were probed with plasma samples seen to be reactive from the microarray analysis. Observed bands that matched microarray reactivity data were counted as positive hits. When multiple reactive bands were observed within a single fraction cluster, multiple proteins were identified as reactive for that cluster. Mass spectrometry analysis was used to identify proteins present in each fraction cluster (Fig. 1D). Total peptide count from individual fractions was matched to microarray reactivity data to determine protein identifications. When no reactive bands were observed in Western blot analyses, protein identification was based solely on results from the mass spectrometry analysis of fractions. While most proteins had more than 1 peptide identified in each analyzed fraction, no minimum peptide count criterion was applied.

Enzyme linked immunosorbent assay

PKM2 (Schebo) measurements were conducted on prediagnostic plasma samples according to the manufacturer's suggested protocol. Absorbance was measured using a SpectraMax Plus 384 and results calculated with SoftMax Pro v4.7.1 (Molecular Devices). Sample optical density (OD) values were log2 transformed and median normalized across plates. Normalized values were further standardized such that the mean of the control samples is 0 and the standard deviation (SD) is equal to 1. P values were computed using a Mann–Whitney–Wilcoxon test.

Autoantibody profiles in neu-transgenic mouse models before the occurrence of palpable tumor

Three plasma samples, consisting of a baseline blood draw and 2 draws before palpable tumor, from 23 individual mice were hybridized in singlet with mouse breast cancer cell lysate protein arrays. Of the approximately 2,800 fractions from the MMTV-neu mouse cell line that were arrayed, 120 fractions displayed significantly elevated IgG reactivity (P < 0.05) with a case-to-control ratio of at least 1.2 in an assay of the initial plasmas collected at an average of 25 weeks of age and before palpable tumor. Analysis of plasmas from a second blood draw before palpable tumor from the same mice also yielded significant reactivity for a subset of 38 of the 120 reactive fractions. A pattern of reactivity among neighboring fractions was observed, consistent with elution of reactive proteins over sequential fractions. Reactivity profiles across reversed-phase fractions were used to select clusters around statistically significant fractions that formed a peak pattern (Fig. 1B). Twenty-seven such fraction clusters were subjected to mass spectrometry and Western blot analysis that yielded identification of 25 reactive proteins (Table 2) from the MMTV-neu model.

Table 2.

Immunogenic proteins identified in neu-transgenic mouse cancer cell lysate

Q value (FDR)
GeneDescriptionSubcellular localizationCase-to-control ratiosTime point #1aTime point #2b
Aldoa Fructose-bisphosphate aldolase A Cytoplasm 1.25 0.7687 0.0297 
Aldoc Fructose-bisphosphate aldolase C Cytoplasm 1.25 0.7687 0.0297 
Arpc2 Actin-related protein 2/3 complex subunit 2 Cytoplasm 2.69 0.7687 0.0463 
Dnajb14 DnaJ homolog subfamily B member 14 Nucleus 1.96 0.7687 0.0297 
Eno1 Alpha-enolase Cytoplasm 1.38 0.7687 0.0238 
Hist1h1b Histone H1.5 Nucleus 2.69 0.7687 0.0463 
Hist1h1d Histone H1.3 Nucleus 2.01 0.3306 0.0118 
Hist1h2ba Histone H2B type 1-A Nucleus 1.45 0.7398 0.0126 
Hist1h2bc Histone H2B Nucleus 2.01 0.3306 0.0118 
Hnrnpa2b1 Heterogeneous nuclear ribonucleoproteins A2/B1 Nucleus 1.20 0.7687 0.0463 
Krt18 Keratin, type I cytoskeletal 18 Cytoplasm 2.51 0.7687 0.0297 
Krt7 Keratin, type II cytoskeletal 7 Cytoplasm 2.51 0.7948 0.0297 
Krt8 Keratin, type II cytoskeletal 8 Cytoplasm 2.51 0.7948 0.0186 
Mdh2 Malate dehydrogenase, mitochondrial Cytoplasm 1.69 0.7398 0.0126 
Nme1 Nucleoside diphosphate kinase A Nucleus 1.38 0.7687 0.0238 
Nme2 Nucleoside diphosphate kinase B Nucleus 1.20 0.7687 0.0463 
Nsfl1c NSFL1 cofactor p47 Cytoplasm 2.39 0.7398 0.0397 
Pdia3 Protein disulfide-isomerase A3 Cytoplasm 2.37 0.7687 0.0278 
Pkm2 Pyruvate kinase isozymes M1/M2 Cytoplasm 1.72 0.7687 0.0323 
Rbm3 Putative RNA-binding protein 3 Nucleus 1.34 0.7687 0.0297 
Serbp1 Plasminogen activator inhibitor 1 RNA-binding protein Nucleus 2.22 0.3306 0.0118 
Sfrs3 Serine/arginine-rich splicing factor 3 Nucleus 4.77 0.3306 0.0118 
Sfrs7 Serine/arginine-rich splicing factor 7 Nucleus 2.01 0.7687 0.0444 
Ubfd1 Ubiquitin domain-containing protein UBFD1 Unknown 2.02 0.7687 0.0434 
Znf518b Zinc finger protein 518B Unknown 2.69 0.7687 0.0463 
Q value (FDR)
GeneDescriptionSubcellular localizationCase-to-control ratiosTime point #1aTime point #2b
Aldoa Fructose-bisphosphate aldolase A Cytoplasm 1.25 0.7687 0.0297 
Aldoc Fructose-bisphosphate aldolase C Cytoplasm 1.25 0.7687 0.0297 
Arpc2 Actin-related protein 2/3 complex subunit 2 Cytoplasm 2.69 0.7687 0.0463 
Dnajb14 DnaJ homolog subfamily B member 14 Nucleus 1.96 0.7687 0.0297 
Eno1 Alpha-enolase Cytoplasm 1.38 0.7687 0.0238 
Hist1h1b Histone H1.5 Nucleus 2.69 0.7687 0.0463 
Hist1h1d Histone H1.3 Nucleus 2.01 0.3306 0.0118 
Hist1h2ba Histone H2B type 1-A Nucleus 1.45 0.7398 0.0126 
Hist1h2bc Histone H2B Nucleus 2.01 0.3306 0.0118 
Hnrnpa2b1 Heterogeneous nuclear ribonucleoproteins A2/B1 Nucleus 1.20 0.7687 0.0463 
Krt18 Keratin, type I cytoskeletal 18 Cytoplasm 2.51 0.7687 0.0297 
Krt7 Keratin, type II cytoskeletal 7 Cytoplasm 2.51 0.7948 0.0297 
Krt8 Keratin, type II cytoskeletal 8 Cytoplasm 2.51 0.7948 0.0186 
Mdh2 Malate dehydrogenase, mitochondrial Cytoplasm 1.69 0.7398 0.0126 
Nme1 Nucleoside diphosphate kinase A Nucleus 1.38 0.7687 0.0238 
Nme2 Nucleoside diphosphate kinase B Nucleus 1.20 0.7687 0.0463 
Nsfl1c NSFL1 cofactor p47 Cytoplasm 2.39 0.7398 0.0397 
Pdia3 Protein disulfide-isomerase A3 Cytoplasm 2.37 0.7687 0.0278 
Pkm2 Pyruvate kinase isozymes M1/M2 Cytoplasm 1.72 0.7687 0.0323 
Rbm3 Putative RNA-binding protein 3 Nucleus 1.34 0.7687 0.0297 
Serbp1 Plasminogen activator inhibitor 1 RNA-binding protein Nucleus 2.22 0.3306 0.0118 
Sfrs3 Serine/arginine-rich splicing factor 3 Nucleus 4.77 0.3306 0.0118 
Sfrs7 Serine/arginine-rich splicing factor 7 Nucleus 2.01 0.7687 0.0444 
Ubfd1 Ubiquitin domain-containing protein UBFD1 Unknown 2.02 0.7687 0.0434 
Znf518b Zinc finger protein 518B Unknown 2.69 0.7687 0.0463 

NOTE: Case-to-control ratios are an average of 2 analyzed blood draws.

aq values based on all 2,808 arrayed fractions.

bq values based on 120 fractions significantly elevated (P < 0.05) in cases in time point #1.

Reactivity against anti-IgG controls printed on the arrays indicated that there was no difference in IgG amount between case and control samples. More than 90% of reactive proteins identified were annotated as intracellular with enrichment in the nuclear proteins (Fig. 2A) notably in spliceosome C complex proteins (e.g., Hnrnpa2b1, Sfrs3, and Sfrs7; ref. 25). Identified proteins were subjected to analysis of gene sets represented in the Kyoto Encyclopedia of Genes and Genomes (KEGG) biologic pathway database (26). Four of the 27 identified proteins (Aldoa, Aldoc, Eno1, and Pkm2) were identified in the glycolysis gene set (Fig. 2C) with an estimated false discovery rate (FDR = 0.0037). Interestingly some of the immunogenic proteins identified (Hist1h1d, Eno1, Hnrnpa2b1, Sfrs3, Nme2, and Krt18) were previously associated with autoimmune disease (25, 27–31) indicative of an overlapping set of antigens between autoimmune disease and the humoral response observed in tumor-bearing mice.

Figure 2.

Subcellular localization of immunogenic proteins identified in mouse (A) and human (B) samples. C, immunogenic proteins identified in the glycolysis pathway. Purple indicates autoantibodies were elevated in cases compared with controls for both prediagnostic mouse and human samples, whereas red indicates autoantibodies were elevated only in prediagnostic human cases compared with controls. No color indicates autoantibodies to these proteins were not identified.

Figure 2.

Subcellular localization of immunogenic proteins identified in mouse (A) and human (B) samples. C, immunogenic proteins identified in the glycolysis pathway. Purple indicates autoantibodies were elevated in cases compared with controls for both prediagnostic mouse and human samples, whereas red indicates autoantibodies were elevated only in prediagnostic human cases compared with controls. No color indicates autoantibodies to these proteins were not identified.

Close modal

Autoantibody signatures in human breast cancer plasmas before clinical diagnosis

Prediagnostic plasmas from 48 women with ER+/PR+ breast cancer and 65 healthy controls, all participants in the WHI cohort study, were individually hybridized with MCF7-derived protein arrays. Of the approximately 1,960 printed fractions, 285 individual fractions yielded a case-to-control ratio of 1.2 or more and P value <0.05 or less using a Student t test. Analysis of a subset of these fraction clusters yielded 90 protein identifications (Table 3). Thirty-five percent of identified proteins were of nuclear origin (Fig. 2B), concordant with the mouse model data.

Table 3.

Immunogenic proteins identified by autoantibody responses from ER+/PR+ breast cancer patients to MCF7 cancer cell lysate

GeneDescriptionSubcellular localizationCase-to-control ratioQ value (FDR)
ACAD9 Acyl-CoA dehydrogenase family, member 9 Cytoplasm 1.21 0.0468 
ACO2 Aconitase 2, mitochondrial Cytoplasm 1.21 0.0468 
ACTA1 Actin, α–1, skeletal muscle Cytoplasm 1.45 0.0468 
ACTN1 Actinin, α-1 Cytoplasm 1.45 0.0468 
ACTN4 Actinin, α-4 Cytoplasm 1.35 0.0468 
ACTR1A ARP1 actin-related protein 1 homolog A, centractin alpha (yeast) Cytoplasm 1.30 0.0468 
AHNAK AHNAK nucleoprotein Nucleus 1.31 0.0468 
AK2 Adenylate kinase 2 Cytoplasm 1.31 0.0468 
ALDH7A1 Aldehyde dehydrogenase 7 family, member A1 Cytoplasm 1.31 0.0468 
ALDOA Aldolase A, fructose-bisphosphate Cytoplasm 1.31 0.0468 
APEX1 APEX nuclease (multifunctional DNA repair enzyme) 1 Nucleus 1.31 0.0468 
C14orf156 Chromosome 14 open reading frame 156 Cytoplasm 1.36 0.0468 
C1QBP Complement component 1, q subcomponent binding protein Cytoplasm 1.46 0.0468 
CALR Calreticulin Cytoplasm 1.56 0.0468 
CSDA Cold shock domain protein A Nucleus 1.32 0.0468 
CTSD Cathepsin D Cytoplasm 1.37 0.0468 
DDTL D-dopachrome tautomerase-like Unknown 1.37 0.0468 
DDX17 DEAD (Asp-Glu-Ala-Asp) box polypeptide 17 Nucleus 1.37 0.0468 
DIABLO Diablo, IAP-binding mitochondrial protein Cytoplasm 1.24 0.0468 
DLD Dihydrolipoamide dehydrogenase Cytoplasm 1.23 0.0468 
DPY30 dpy-30 homolog (C. elegans) Nucleus 1.23 0.0468 
DSP Desmoplakin Plasma membrane 1.23 0.0468 
EFTUD2 Elongation factor Tu GTP binding domain containing 2 Nucleus 1.45 0.0468 
EIF5A Eukaryotic translation initiation factor 5A Cytoplasm 1.36 0.0468 
ENO1 Enolase 1, (alpha) Cytoplasm 1.39 0.0468 
EZR Ezrin Plasma membrane 1.23 0.0468 
FBP1 Fructose-1,6-bisphosphatase 1 Cytoplasm 1.27 0.0468 
FLNA Filamin A, alpha Cytoplasm 1.30 0.0468 
FLNB Filamin B, beta Cytoplasm 1.30 0.0468 
FUS Fused in sarcoma Nucleus 1.32 0.0468 
G3BP1 GTPase activating protein (SH3 domain) binding protein 1 Nucleus 1.39 0.0468 
GAPDH Glyceraldehyde-3-phosphate dehydrogenase Cytoplasm 1.38 0.0468 
GPI Glucose-6-phosphate isomerase Extracellular space 1.30 0.0468 
GRB2 Growth factor receptor-bound protein 2 Cytoplasm 1.35 0.0468 
GRN Granulin Extracellular space 1.59 0.0468 
GSN Gelsolin Extracellular Space 1.31 0.0468 
H1FX H1 histone family, member X Nucleus 1.30 0.0468 
HADH Hydroxyacyl-CoA dehydrogenase Cytoplasm 1.30 0.0468 
HIST1H1D Histone cluster 1, H1d Nucleus 1.30 0.0468 
HIST1H1E Histone cluster 1, H1e Nucleus 1.30 0.0468 
HNRNPA1 Heterogeneous nuclear ribonucleoprotein A1 Nucleus 1.53 0.0468 
HNRNPA2B1 Heterogeneous nuclear ribonucleoprotein A2/B1 Nucleus 1.34 0.0468 
HNRNPD Heterogeneous nuclear ribonucleoprotein D (AU-rich element RNA binding protein 1, 37kDa) Nucleus 1.44 0.0468 
HNRNPK Heterogeneous nuclear ribonucleoprotein K Nucleus 1.44 0.0468 
HSP90AA1 Heat shock protein 90kDa alpha (cytosolic), class A member 1 Cytoplasm 1.32 0.0468 
HSP90AB1 Heat shock protein 90kDa alpha (cytosolic), class B member 1 Cytoplasm 1.42 0.0468 
HSP90B1 Heat shock protein 90kDa beta (Grp94), member 1 Cytoplasm 1.29 0.0468 
HSPA5 Heat shock 70kDa protein 5 (glucose-regulated protein, 78kDa) Cytoplasm 1.29 0.0468 
HSPA8 Heat shock 70kDa protein 8 Cytoplasm 1.29 0.0468 
HSPA9 Heat shock 70kDa protein 9 (mortalin) Cytoplasm 1.29 0.0468 
HSPB1 Heat shock 27kDa protein 1 Cytoplasm 1.44 0.0468 
ISOC1 Isochorismatase domain containing 1 Cytoplasm 1.27 0.0468 
JUP Junction plakoglobin Plasma membrane 1.69 0.0468 
MCCC2 Methylcrotonoyl-CoA carboxylase 2 (beta) Cytoplasm 1.31 0.0468 
NCL Nucleolin Nucleus 1.37 0.0468 
NME2 Nonmetastatic cells 2, protein (NM23B) expressed in Nucleus 1.37 0.0468 
NOLC1 Nucleolar and coiled-body phosphoprotein 1 Nucleus 1.41 0.0468 
OSGIN1 Oxidative stress induced growth inhibitor 1 Unknown 1.36 0.0468 
P4HB Prolyl 4-hydroxylase, beta polypeptide Cytoplasm 1.36 0.0468 
PDIA4 Protein disulfide isomerase family A, member 4 Cytoplasm 1.39 0.0468 
PDXK Pyridoxal (pyridoxine, vitamin B6) kinase Cytoplasm 1.30 0.0468 
PKM2 Pyruvate kinase, muscle Cytoplasm 1.34 0.0468 
PPL Periplakin Cytoplasm 1.34 0.0468 
PRDX2 Peroxiredoxin 2 Cytoplasm 1.34 0.0468 
PSAP Prosaposin Extracellular space 1.23 0.0468 
PTBP1 Polypyrimidine tract binding protein 1 Nucleus 1.42 0.0468 
RAD23B RAD23 homolog B (S. cerevisiae) Nucleus 1.18 0.0468 
RALY RNA binding protein, autoantigenic (hnRNP-associated with lethal yellow homolog (mouse)) Nucleus 1.46 0.0468 
SAP18 Sin3A-associated protein, 18kDa Nucleus 1.38 0.0468 
SF3A1 Splicing factor 3a, subunit 1, 120kDa Nucleus 1.32 0.0468 
SFRS1 Serine/arginine-rich splicing factor 1 Nucleus 1.34 0.0468 
SFRS3 Serine/arginine-rich splicing factor 3 Nucleus 1.60 0.0468 
SFRS6 Serine/arginine-rich splicing factor 6 Nucleus 1.39 0.0468 
SPTBN1 Spectrin, beta, nonerythrocytic 1 Plasma membrane 1.35 0.0468 
SSB Sjogren syndrome antigen B (autoantigen La) Nucleus 1.35 0.0468 
STMN1 Stathmin 1 Cytoplasm 1.31 0.0468 
SUMO2 SMT3 suppressor of mif two 3 homolog 2 (S. cerevisiae) Nucleus 1.41 0.0468 
SVEP1 Sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 Cytoplasm 1.38 0.0468 
SYNCRIP Synaptotagmin binding, cytoplasmic RNA interacting protein Nucleus 1.31 0.0468 
TAF15 TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 68kDa Nucleus 1.31 0.0468 
TAGLN2 Transgelin 2 Cytoplasm 1.34 0.0468 
TPI1 Triosephosphate isomerase 1 Cytoplasm 1.29 0.0468 
TRAP1 TNF receptor-associated protein 1 Cytoplasm 1.36 0.0468 
TXN Thioredoxin Cytoplasm 1.47 0.0468 
U2AF1 U2 small nuclear RNA auxiliary factor 1 Nucleus 1.49 0.0468 
UBA1 Ubiquitin-like modifier activating enzyme 1 Cytoplasm 1.46 0.0468 
UGGT1 UDP-glucose glycoprotein glucosyltransferase 1 Cytoplasm 1.46 0.0468 
VCP Valosin containing protein Cytoplasm 1.46 0.0468 
YWHAB Tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein, beta polypeptide Cytoplasm 1.32 0.0468 
YWHAZ Tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein, zeta polypeptide Cytoplasm 1.44 0.0468 
GeneDescriptionSubcellular localizationCase-to-control ratioQ value (FDR)
ACAD9 Acyl-CoA dehydrogenase family, member 9 Cytoplasm 1.21 0.0468 
ACO2 Aconitase 2, mitochondrial Cytoplasm 1.21 0.0468 
ACTA1 Actin, α–1, skeletal muscle Cytoplasm 1.45 0.0468 
ACTN1 Actinin, α-1 Cytoplasm 1.45 0.0468 
ACTN4 Actinin, α-4 Cytoplasm 1.35 0.0468 
ACTR1A ARP1 actin-related protein 1 homolog A, centractin alpha (yeast) Cytoplasm 1.30 0.0468 
AHNAK AHNAK nucleoprotein Nucleus 1.31 0.0468 
AK2 Adenylate kinase 2 Cytoplasm 1.31 0.0468 
ALDH7A1 Aldehyde dehydrogenase 7 family, member A1 Cytoplasm 1.31 0.0468 
ALDOA Aldolase A, fructose-bisphosphate Cytoplasm 1.31 0.0468 
APEX1 APEX nuclease (multifunctional DNA repair enzyme) 1 Nucleus 1.31 0.0468 
C14orf156 Chromosome 14 open reading frame 156 Cytoplasm 1.36 0.0468 
C1QBP Complement component 1, q subcomponent binding protein Cytoplasm 1.46 0.0468 
CALR Calreticulin Cytoplasm 1.56 0.0468 
CSDA Cold shock domain protein A Nucleus 1.32 0.0468 
CTSD Cathepsin D Cytoplasm 1.37 0.0468 
DDTL D-dopachrome tautomerase-like Unknown 1.37 0.0468 
DDX17 DEAD (Asp-Glu-Ala-Asp) box polypeptide 17 Nucleus 1.37 0.0468 
DIABLO Diablo, IAP-binding mitochondrial protein Cytoplasm 1.24 0.0468 
DLD Dihydrolipoamide dehydrogenase Cytoplasm 1.23 0.0468 
DPY30 dpy-30 homolog (C. elegans) Nucleus 1.23 0.0468 
DSP Desmoplakin Plasma membrane 1.23 0.0468 
EFTUD2 Elongation factor Tu GTP binding domain containing 2 Nucleus 1.45 0.0468 
EIF5A Eukaryotic translation initiation factor 5A Cytoplasm 1.36 0.0468 
ENO1 Enolase 1, (alpha) Cytoplasm 1.39 0.0468 
EZR Ezrin Plasma membrane 1.23 0.0468 
FBP1 Fructose-1,6-bisphosphatase 1 Cytoplasm 1.27 0.0468 
FLNA Filamin A, alpha Cytoplasm 1.30 0.0468 
FLNB Filamin B, beta Cytoplasm 1.30 0.0468 
FUS Fused in sarcoma Nucleus 1.32 0.0468 
G3BP1 GTPase activating protein (SH3 domain) binding protein 1 Nucleus 1.39 0.0468 
GAPDH Glyceraldehyde-3-phosphate dehydrogenase Cytoplasm 1.38 0.0468 
GPI Glucose-6-phosphate isomerase Extracellular space 1.30 0.0468 
GRB2 Growth factor receptor-bound protein 2 Cytoplasm 1.35 0.0468 
GRN Granulin Extracellular space 1.59 0.0468 
GSN Gelsolin Extracellular Space 1.31 0.0468 
H1FX H1 histone family, member X Nucleus 1.30 0.0468 
HADH Hydroxyacyl-CoA dehydrogenase Cytoplasm 1.30 0.0468 
HIST1H1D Histone cluster 1, H1d Nucleus 1.30 0.0468 
HIST1H1E Histone cluster 1, H1e Nucleus 1.30 0.0468 
HNRNPA1 Heterogeneous nuclear ribonucleoprotein A1 Nucleus 1.53 0.0468 
HNRNPA2B1 Heterogeneous nuclear ribonucleoprotein A2/B1 Nucleus 1.34 0.0468 
HNRNPD Heterogeneous nuclear ribonucleoprotein D (AU-rich element RNA binding protein 1, 37kDa) Nucleus 1.44 0.0468 
HNRNPK Heterogeneous nuclear ribonucleoprotein K Nucleus 1.44 0.0468 
HSP90AA1 Heat shock protein 90kDa alpha (cytosolic), class A member 1 Cytoplasm 1.32 0.0468 
HSP90AB1 Heat shock protein 90kDa alpha (cytosolic), class B member 1 Cytoplasm 1.42 0.0468 
HSP90B1 Heat shock protein 90kDa beta (Grp94), member 1 Cytoplasm 1.29 0.0468 
HSPA5 Heat shock 70kDa protein 5 (glucose-regulated protein, 78kDa) Cytoplasm 1.29 0.0468 
HSPA8 Heat shock 70kDa protein 8 Cytoplasm 1.29 0.0468 
HSPA9 Heat shock 70kDa protein 9 (mortalin) Cytoplasm 1.29 0.0468 
HSPB1 Heat shock 27kDa protein 1 Cytoplasm 1.44 0.0468 
ISOC1 Isochorismatase domain containing 1 Cytoplasm 1.27 0.0468 
JUP Junction plakoglobin Plasma membrane 1.69 0.0468 
MCCC2 Methylcrotonoyl-CoA carboxylase 2 (beta) Cytoplasm 1.31 0.0468 
NCL Nucleolin Nucleus 1.37 0.0468 
NME2 Nonmetastatic cells 2, protein (NM23B) expressed in Nucleus 1.37 0.0468 
NOLC1 Nucleolar and coiled-body phosphoprotein 1 Nucleus 1.41 0.0468 
OSGIN1 Oxidative stress induced growth inhibitor 1 Unknown 1.36 0.0468 
P4HB Prolyl 4-hydroxylase, beta polypeptide Cytoplasm 1.36 0.0468 
PDIA4 Protein disulfide isomerase family A, member 4 Cytoplasm 1.39 0.0468 
PDXK Pyridoxal (pyridoxine, vitamin B6) kinase Cytoplasm 1.30 0.0468 
PKM2 Pyruvate kinase, muscle Cytoplasm 1.34 0.0468 
PPL Periplakin Cytoplasm 1.34 0.0468 
PRDX2 Peroxiredoxin 2 Cytoplasm 1.34 0.0468 
PSAP Prosaposin Extracellular space 1.23 0.0468 
PTBP1 Polypyrimidine tract binding protein 1 Nucleus 1.42 0.0468 
RAD23B RAD23 homolog B (S. cerevisiae) Nucleus 1.18 0.0468 
RALY RNA binding protein, autoantigenic (hnRNP-associated with lethal yellow homolog (mouse)) Nucleus 1.46 0.0468 
SAP18 Sin3A-associated protein, 18kDa Nucleus 1.38 0.0468 
SF3A1 Splicing factor 3a, subunit 1, 120kDa Nucleus 1.32 0.0468 
SFRS1 Serine/arginine-rich splicing factor 1 Nucleus 1.34 0.0468 
SFRS3 Serine/arginine-rich splicing factor 3 Nucleus 1.60 0.0468 
SFRS6 Serine/arginine-rich splicing factor 6 Nucleus 1.39 0.0468 
SPTBN1 Spectrin, beta, nonerythrocytic 1 Plasma membrane 1.35 0.0468 
SSB Sjogren syndrome antigen B (autoantigen La) Nucleus 1.35 0.0468 
STMN1 Stathmin 1 Cytoplasm 1.31 0.0468 
SUMO2 SMT3 suppressor of mif two 3 homolog 2 (S. cerevisiae) Nucleus 1.41 0.0468 
SVEP1 Sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 Cytoplasm 1.38 0.0468 
SYNCRIP Synaptotagmin binding, cytoplasmic RNA interacting protein Nucleus 1.31 0.0468 
TAF15 TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 68kDa Nucleus 1.31 0.0468 
TAGLN2 Transgelin 2 Cytoplasm 1.34 0.0468 
TPI1 Triosephosphate isomerase 1 Cytoplasm 1.29 0.0468 
TRAP1 TNF receptor-associated protein 1 Cytoplasm 1.36 0.0468 
TXN Thioredoxin Cytoplasm 1.47 0.0468 
U2AF1 U2 small nuclear RNA auxiliary factor 1 Nucleus 1.49 0.0468 
UBA1 Ubiquitin-like modifier activating enzyme 1 Cytoplasm 1.46 0.0468 
UGGT1 UDP-glucose glycoprotein glucosyltransferase 1 Cytoplasm 1.46 0.0468 
VCP Valosin containing protein Cytoplasm 1.46 0.0468 
YWHAB Tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein, beta polypeptide Cytoplasm 1.32 0.0468 
YWHAZ Tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein, zeta polypeptide Cytoplasm 1.44 0.0468 

Analysis of gene sets represented in the human KEGG biologic pathway database revealed significant enrichment of proteins in the glycolysis gene set (FDR = 7.5E-6) and spliceosome gene set (FDR = 0.0011). Nine proteins were identified in the glycolysis gene set (Fig. 2C): ALDH7A1, ALDOA, DLD, ENO1, FBP1, GAPDH, GPI, PKM2, and TPI. Three of these proteins, ALDOA, ENO1, and PKM2 were also identified in plasma from tumor-bearing mice. A set of 9 proteins was associated with the spliceosome by KEGG analysis: EFTUD2, HNRNPA1, HNRNPK, HSPA8, SF3A1, SFRS1, SFRS3, SFRS6, and U2AF1. Additional proteins, namely HNRNPA2B1, PTBP1, RALY, SAP18, and SYNCRIP, not included in the spliceosome signature by KEGG analysis, are known to be part of the spliceosome complex (25, 32, 33). SFRS3 and HNRNPA2B1 overlap with antigenic proteins identified in the mouse. As with the mouse model, some of the identified proteins have been associated with autoimmune diseases. Thirteen proteins (AHNAK, CALR, ENO1, GAPDH, HADH, HIST1H1D, HIST1H1E, HNRNPA1, HNRNPA2B1, HNRNPK, NCL, and SFRS3) were previously described as autoantigens in systemic lupus erythematosus (SLE; refs. 25, 27, 28, 31, 32, 34–39), whereas others (EZR, GPI, and TXN) were associated with rheumatoid arthritis (27, 31, 36) and other autoimmune diseases (40–45).

To speculate on the use as of these biologic signatures as potential biomarker panels, receiver operator characteristic (ROC) analysis was conducted on the basis of the set of proteins in the glycolysis (9 proteins) and spliceosome (14 proteins) signatures using a linear regression model based on a least squares estimation (Supplementary Table S1). The glycolysis and spliceosome signatures gave areas under the ROC curve (AUC) of 0.68 [95% confidence interval (CI): 0.59–0.78] and 0.73 (95% CI: 0.63–0.82), respectively. Combining these 2 signatures yielded an AUC of 0.77 (95% CI: 0.68–0.86), with 35% sensitivity at 95% specificity (Fig. 3A). This combination, while not statistically better than the spliceosome signature alone, shows the additive potential of autoantibody signatures.

Figure 3.

ROC analysis of proteins identified in glycolysis and spliceosome signatures for all samples (A), cases collected within 150 days before diagnosis compared with controls (B), and cases collected more than 150 days before diagnosis of breast cancer compared with controls (C). D, case-to-control ratios based on time-to-diagnosis for a subset of immunogenic proteins assayed in a cohort of newly diagnosed, postmenopausal women.

Figure 3.

ROC analysis of proteins identified in glycolysis and spliceosome signatures for all samples (A), cases collected within 150 days before diagnosis compared with controls (B), and cases collected more than 150 days before diagnosis of breast cancer compared with controls (C). D, case-to-control ratios based on time-to-diagnosis for a subset of immunogenic proteins assayed in a cohort of newly diagnosed, postmenopausal women.

Close modal

Temporal patterns of circulating protein and autoantibody levels preceding a diagnosis of breast cancer

Plasmas from cases were divided on the basis of the time of blood collection in relation to diagnosis of breast cancer. Interestingly, autoantibody reactivity among case plasmas collected further from clinical diagnosis (more than 150 days prior) was more than reactivity among more proximal case plasmas, compared with controls. Plasmas from cases collected closer to clinical diagnosis (less than 150 days) exhibited less significant elevation of autoantibody response with only 4 identified proteins found to be significantly elevated. Further support for a temporal pattern of reactivity in relation to time of diagnosis was derived from analysis of plasmas from newly diagnosed postmenopausal women, which did not exhibit significantly increased reactivity among cases relative to controls (Fig. 3D).

Using the same linear regression model previously established, ROC analysis yielded increased performance for samples collected further from diagnosis for the spliceosome signature (AUC = 0.83 further from diagnosis and 0.63 closer to diagnosis), whereas that of the glycolysis signature remained relatively constant (AUC = 0.69 and 0.70, respectively; Fig. 3B and C).

We previously showed progressively increased release of glycolysis proteins into the circulation in relation to time to diagnosis (18). We therefore examined the patterns of PKM2 levels in relation to time to diagnosis following blood collection and in relation to PKM2 autoantibodies in a separate set of samples from the WHI cohort. Circulating PKM2 levels were significantly elevated in WHI samples collected within 150 days before diagnosis compared with controls, but not in samples collected further from diagnosis (Fig. 4A). In contrast, autoantibody response to PKM2 exhibited an opposite trend, with significant elevation further from diagnosis (Fig. 4B). Seropositivity, defined as 2 SDs above the mean of the controls for that marker, ranged for autoantibodies to individual glycolysis proteins from 6.3% (ALDOA, GPI) to 14.6% (TPI1) and for spliceosome proteins from 2% (HSPA8, SFRS3) to 12.5% (HNRNPA1; Fig. 4C and D), consistent with the range of biomarker positivity reported in other autoantibody studies (46). Seropositivity for both sets of proteins, based on time of blood draw before diagnosis, ranged from approximately 2 months to 8.5 months. Multiple positive markers among glycolysis or spliceosome proteins were often observed within an individual sample, indicating a broad immune response across proteins in these pathways.

Figure 4.

A, normalized ELISA results for PKM2 in prediagnostic samples from the WHI separated by days before diagnosis of blood draw. B, normalized autoantibody response from natural protein arrays for PKM2 separated by days before diagnosis. *, P < 0.05. C, distribution of positive autoantibody responses for each of the identified glycolysis pathway proteins as a function of time before diagnosis in case samples. D, distribution of positive autoantibody responses for each of the identified spliceosome proteins as a function of time before diagnosis in case samples. In C and D, individual proteins are marked by unique symbols.

Figure 4.

A, normalized ELISA results for PKM2 in prediagnostic samples from the WHI separated by days before diagnosis of blood draw. B, normalized autoantibody response from natural protein arrays for PKM2 separated by days before diagnosis. *, P < 0.05. C, distribution of positive autoantibody responses for each of the identified glycolysis pathway proteins as a function of time before diagnosis in case samples. D, distribution of positive autoantibody responses for each of the identified spliceosome proteins as a function of time before diagnosis in case samples. In C and D, individual proteins are marked by unique symbols.

Close modal

Immune complex formation with increasing levels of antigen is one possible explanation for the observed decrease in autoantibody signal closer to diagnosis. Mass spectrometry analysis of affinity-purified immunoglobulin fractions from newly diagnosed and prediagnostic samples yielded evidence of circulating immune complexes for a number of proteins identified by microarray analysis. Of the 9 identified glycolytic enzymes, 5 (ALDOA, ENO1, GAPDH, PKM2, and TPI1) were observed as part of immune complexes. The most highly significant of these, GAPDH, exhibited an increase in the immunoglobulin bound fraction in cancer samples compared with controls in plasmas from newly diagnosed cases, but not in prediagnostic plasmas consistent with increasing amount of antigen bound to immunoglobulin with tumor development and progression (Supplementary Table S2).

We have used natural protein arrays for comprehensive profiling of autoantigens and autoantibody signatures in a mouse model of breast cancer and in ER+/PR+ breast cancer. Far more proteins were identified in human samples than from the mouse model. Humans are genetically heterogenic and diverse with many more class II alleles represented than the mouse. All mouse models are genetically inbred; therefore, one would expect a more restrictive repertoire. Despite this immunogenic difference, 2 autoantibody signatures consisting of glycolysis and splicesome proteins observed in the mouse model were also observed in human breast cancer plasmas. A strong similarity with antigens associated with SLE and other autoimmune diseases, was noted among identified antigenic proteins in our breast cancer study. We have recently provided evidence of release into circulation of proteins associated with the glycolysis pathway in ER+ breast cancer plasmas before clinical diagnosis (18). We showed increased release of glycolysis proteins with decreasing time-to-diagnosis after blood collection indicative of a positive correlation between circulating glycolysis protein levels and tumor growth. In this study, we provide evidence of stronger reactivity of circulating immunoglobulins with glycolysis proteins arrayed further from diagnosis in relation to blood collection. Analysis of circulating plasma PKM2 levels and associated autoantibodies yielded increased circulating protein levels closer to diagnosis, whereas measurable autoantibodies to PKM2 exhibited an inverse relationship, with increased levels observed further from diagnosis. Because of limited availability of ELISAs, only PKM2 was available for testing of the proteins identified in the glycolysis and spliceosome signatures. The formation of circulating immune complexes depleting plasma of free autoantibodies could explain these observed phenomena. Mass spectrometry analysis of proteins bound to immunoglobulin fractions provided evidence for an increase in immune complexes to the glycolytic enzyme GAPDH in newly diagnosed samples.

An immune response to spliceosome proteins has been associated with autoimmune disease (25). A recent study of fine-needle aspirates from breast cancers and benign lesions yielded evidence of differential expression of spliceosome assembly proteins in tumors compared with benign lesions (47). Reported responses in autoimmune diseases have been limited to Sm and nRNP proteins that are part of the spliceosome complex. In this work, multiple nRNP proteins were identified as autoantigens in prediagnostic breast cancer plasmas. HNRNPs play central roles in DNA repair, cell signaling, and regulation of gene expression at transcriptional and translational levels often through spliceosome complexes. Some splicesome proteins have been implicated in cancer through their regulation of downstream targets. Increased mRNA levels of HNRNPA2B1 were previously reported in melanoma and associated with increased levels of the protein in circulation (48). Expression of HNRNPQ has been shown to be affected by siRNA targeting of ER-α in MCF7 (49). Other HNRNPs have been observed as overexpressed in cancer plasma compared with control (50).

Another component of the spliceosome not previously reported as autoantigenic is the SFRS family of proteins, which we found to be associated with autoantibodies in breast cancer. While most reports localize the SFRS proteins to the cytoplasm and nucleus, one recent study provided evidence of release of SFRS1 into the media of pancreatic cancer cell lines (51). Cell surface localization of SFRS proteins has also been shown in lung cancer (52). Surface-localized SFRS proteins bound fucosylated oligosaccharides through the same mechanism as RNA binding. Our extensive proteomic analysis of the MCF7 breast cancer cell line revealed 10 SFRS proteins that occurred both on the cell surface and in the conditioned media, and 2 additional SFRS proteins in the media. SFRS1, SFRS3, and SFRS6 were all identified on the cell surface and in the media of MCF7.

There is prior evidence of occurrence of autoantibodies to nuclear proteins in cancer (53); however, identification of spliceosome-related autoantibodies is a novel finding in our study. A prior study using recombinant proteins did not yield splicesome autoantibodies in cancer (54). Given the known occurrence of posttranslational modifications in spliceosome proteins, it is plausible that the use of natural proteins in our study may account for the autoantibody reactivity we have observed for splicesome proteins.

Our study shows the use of microarrays of fractionated tumor cell lysate proteins for uncovering immunogenic pathways and autoantibody signatures in breast cancer plasmas. Signatures discovered in a breast cancer mouse model were recapitulated in human, indicative of a similar process of immunogenicity to breast tumor antigens in both mouse and humans. Postmenopausal women with newly diagnosed breast cancer exhibited reduced plasma immunoglobulin binding to arrayed proteins compared with prediagnostic plasmas. This was consistent with the trend observed in prediagnostic, postmenopausal women, in that autoantibody response decreased as time of blood collection before diagnosis decreased. Many recent publications support the idea of using a combination of complementary autoantibody markers for diagnosis. In our study, combining the glycolysis and spliceosome signatures yielded an AUC of 0.77 with 35% sensitivity at 95% specificity. The AUCs obtained may be optimistic, as ROC curves were based on the same data used in generating the pathways of interest. Because these signatures were chosen based on their biologic significance, their use and additivity as biomarker panels may be limited. The sensitivity of biomarkers in prediagnostic samples can also be limited depending on disease progression (55). Because of the limited availability of prediagnostic samples, validation of the described classifier was not possible in this study. Proteomic analysis showed an increase in cancer-related immune complexes to glycolytic enzymes in newly diagnosed postmenopausal patients. The concurrent increase in circulating glycolysis protein levels (18) suggests formation of these complexes may explain the observed immune suppression. With a relatively low sensitivity, but high specificity, one could envision application of this signature to determine women that may be at higher risk of developing breast cancer within a year, thus compelling them to seek screening. Another potential application is for identifying or monitoring masses observed during a routine mammogram. A prospective study involving samples collected at the time of mammography would be necessary to test this application. Blood-based biomarkers for early cancer detection would have great use, especially for women with dense breasts or chronically abnormal mammograms. However, there are data to suggest that general screening may be useful. In a study of nearly 600 reduction mammoplasties, 20% of women, ages 30 to 49 years, had evidence of a proliferative lesion, atypical hyperplasia, or carcinoma in situ in the pathologic specimen (56). These lesions all increase the risk of the development of subsequent invasive malignancy. An inexpensive blood test may assist in identifying cancers early in these patients.

N.L. Disis has an ownership interest (including patents) in the University of Washington and is a consultant/advisory board member of VentiRx, Roche, BMS, EMD Serono, and Immunovaccine. No potential conflicts of interest were disclosed by the other authors.

Conception and design: J.J. Ladd, T. Chao, J. Qiu, C.I. Li, S.M. Hanash

Development of methodology: J.J. Ladd, T. Chao, J. Qiu, M.W. McIntosh, S.M. Hanash

Acquisition of data (provided animals, acquired and managed patients, provided facilities, etc.): J.J. Ladd, T. Chao, M.M. Johnson, J. Qiu, A. Chin, J. Mao, M. Wu, C.I. Li, R. Prentice

Analysis and interpretation of data (e.g., statistical analysis, biostatistics, computational analysis): J.J. Ladd, T. Chao, M.M. Johnson, J. Qiu, R. Israel, S.J. Pitteri, L.M. Amon, M.W. McIntosh, R. Prentice, N.L. Disis, S.M. Hanash

Writing, review, and/or revision of the manuscript: J.J. Ladd, T. Chao, S.J. Pitteri, C.I. Li, N.L. Disis, S.M. Hanash

Administrative, technical, or material support (i.e., reporting or organizing data, constructing databases): T. Chao, R. Israel, N.L. Disis, S.M. Hanash

Study supervision: S.M. Hanash

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1.
Desmetz
C
,
Bascoul-Mollevi
C
,
Rochaix
P
,
Lamy
PJ
,
Kramar
A
,
Rouanet
P
, et al
Identification of a new panel of serum autoantibodies associated with the presence of in situ carcinoma of the breast in younger women
.
Clin Cancer Res
2009
;
15
:
4733
41
.
2.
Goodell
V
,
Disis
ML
. 
Human tumor cell lysates as a protein source for the detection of cancer antigen-specific humoral immunity
.
J Immunol Methods
2005
;
299
:
129
38
.
3.
Chapman
C
,
Murray
A
,
Chakrabarti
J
,
Thorpe
A
,
Woolston
C
,
Sahin
U
, et al
Autoantibodies in breast cancer: their use as an aid to early diagnosis
.
Ann Oncol
2007
;
18
:
868
73
.
4.
Anderson
KS
,
Sibani
S
,
Wallstrom
G
,
Qiu
J
,
Mendoza
EA
,
Raphael
J
, et al
Protein microarray signature of autoantibody biomarkers for the early detection of breast cancer
.
J Proteome Res
2011
;
10
:
85
96
.
5.
Massoner
P
,
Lueking
A
,
Goehler
H
,
Hopfner
A
,
Kowald
A
,
Kugler
KG
, et al
Serum-autoantibodies for discovery of prostate cancer specific biomarkers
.
Prostate
2012
;
72
:
427
36
.
6.
Babel
I
,
Barderas
R
,
Diaz-Uriarte
R
,
Martinez-Torrecuadrada
JL
,
Sanchez-Carbayo
M
,
Casal
JI
. 
Identification of tumor-associated autoantigens for the diagnosis of colorectal cancer in serum using high density protein microarrays
.
Mol Cell Proteomics
2009
;
8
:
2382
95
.
7.
Tamesa
MS
,
Kuramitsu
Y
,
Fujimoto
M
,
Maeda
N
,
Nagashima
Y
,
Tanaka
T
, et al
Detection of autoantibodies against cyclophilin A and triosephosphate isomerase in sera from breast cancer patients by proteomic analysis
.
Electrophoresis
2009
;
30
:
2168
81
.
8.
Chang
W
,
Wu
L
,
Cao
F
,
Liu
Y
,
Ma
L
,
Wang
M
, et al
Development of autoantibody signatures as biomarkers for early detection of colorectal carcinoma
.
Clin Cancer Res
2011
;
17
:
5715
24
.
9.
Babel
I
,
Barderas
R
,
Diaz-Uriarte
R
,
Moreno
V
,
Suarez
A
,
Fernandez-Acenero
MJ
, et al
Identification of MST1/STK4 and SULF1 proteins as autoantibody targets for the diagnosis of colorectal cancer by using phage microarrays
.
Mol Cell Proteomics
2011
;
10
:
M110.001784
.
10.
Wandall
HH
,
Blixt
O
,
Tarp
MA
,
Pedersen
JW
,
Bennett
EP
,
Mandel
U
, et al
Cancer biomarkers defined by autoantibody signatures to aberrant O-glycopeptide epitopes
.
Cancer Res
2010
;
70
:
1306
13
.
11.
Madoz-Gurpide
J
,
Kuick
R
,
Wang
H
,
Misek
DE
,
Hanash
SM
. 
Integral protein microarrays for the identification of lung cancer antigens in sera that induce a humoral immune response
.
Mol Cell Proteomics
2008
;
7
:
268
81
.
12.
Pereira-Faca
S
,
Qiu
J
,
Krasnoselsky
A
,
Newcomb
L
,
Hanash
S
. 
Proteomic identification of tumor antigens in lung cancer
.
Mol Cell Proteomics
2005
;
4
:
S125
S
.
13.
Qiu
J
,
Choi
G
,
Li
L
,
Wang
H
,
Pitteri
SJ
,
Pereira-Faca
SR
, et al
Occurrence of autoantibodies to annexin I, 14–3-3 theta and LAMR1 in prediagnostic lung cancer sera
.
J Clin Oncol
2008
;
26
:
5060
6
.
14.
Nam
MJ
,
Kee
MK
,
Kuick
R
,
Hanash
SM
. 
Identification of defensin alpha 6 as a potential biomarker in colon adenocarcinoma
.
J Biol Chem
2005
;
280
:
8260
5
.
15.
Nam
MJ
,
Madoz-Gurpide
J
,
Wang
H
,
Lescure
P
,
Schmalbach
CE
,
Zhao
R
, et al
Molecular profiling of the immune response in colon cancer using protein microarrays: occurrence of autoantibodies to ubiquitin C-terminal hydrolase L3
.
Proteomics
2003
;
3
:
2108
15
.
16.
Bouwman
K
,
Qiu
J
,
Zhou
HP
,
Schotanus
M
,
Mangold
LA
,
Vogt
R
, et al
Microarrays of tumor cell derived proteins uncover a distinct pattern of prostate cancer serum immunoreactivity
.
Proteomics
2003
;
3
:
2200
7
.
17.
Hong
SH
,
Misek
DE
,
Wang
H
,
Puravs
E
,
Giordano
TJ
,
Greenson
JK
, et al
An autoantibody-mediated immune response to calreticulin isoforms in pancreatic cancer
.
Cancer Res
2004
;
64
:
5504
10
.
18.
Amon
LM
,
Pitteri
SJ
,
Li
CI
,
McIntosh
MW
,
Ladd
J
,
Disis
ML
, et al
Concordant release of glycolysis proteins into the plasma preceding a diagnosis of ER +breast cancer
.
Cancer Res
2012
;
72
:
1935
42
.
19.
Wang
H
,
Hanash
S
. 
Multi-dimensional liquid phase based separations in proteomics
.
J Chromatogr B-Analyt Technol Biomed Life Sci
2003
;
787
:
11
8
.
20.
Qiu
J
,
Madoz-Gurpide
J
,
Misek
DE
,
Kuick
R
,
Brenner
DE
,
Michailidis
G
, et al
Development of natural protein microarrays for diagnosing cancer based on an antibody response to tumor antigens
.
J Proteome Res
2004
;
3
:
261
7
.
21.
Wang
H
,
Hanash
SM
. 
Increased throughput and reduced carryover of mass spectrometry-based proteomics using a high-efficiency nonsplit nanoflow parallel dual-column capillary HPLC system
.
J Proteome Res
2008
;
7
:
2743
55
.
22.
MacLean
B
,
Eng
JK
,
Beavis
RC
,
McIntosh
M
. 
General framework for developing and evaluating database scoring algorithms using the TANDEM search engine
.
Bioinformatics
2006
;
22
:
2830
2
.
23.
Nesvizhskii
AI
,
Keller
A
,
Kolker
E
,
Aebersold
R
. 
A statistical model for identifying proteins by tandem mass spectrometry
.
Analyt Chem
2003
;
75
:
4646
58
.
24.
Rauch
A
,
Bellew
M
,
Eng
J
,
Fitzgibbon
M
,
Holzman
T
,
Hussey
P
, et al
Computational Proteomics Analysis System (CPAS): an extensible, open-source analytic system for evaluating and publishing proteomic data and high throughput biological experiments
.
J Proteome Res
2006
;
5
:
112
21
.
25.
Arbuckle
MR
,
Schilling
AR
,
Harley
JB
,
James
JA
. 
A limited lupus anti-spliceosomal response targets a cross-reactive, proline rich motif
.
J Autoimmun
1998
;
11
:
431
8
.
26.
Kanehisa
M
,
Goto
S
. 
KEGG: kyoto encyclopedia of genes and genomes
.
Nucleic Acids Res
2000
;
28
:
27
30
.
27.
Caporali
R
,
Bugatti
S
,
Bruschi
E
,
Cavagna
L
,
Montecucco
C
. 
Autoantibodies to heterogeneous nuclear ribonucleoproteins
.
Autoimmunity
2005
;
38
:
25
32
.
28.
Hardin
JA
,
Thomas
JO
. 
Antibodies to histones in systemic lupus-erythematosus–localization of prominent auto-antigens on histone-H1 and histone-H2b
.
P Natl Acad Sci Biol
1983
;
80
:
7410
4
.
29.
Kuo
YB
,
Chang
CA
,
Wu
YK
,
Hsieh
MJ
,
Tsai
CH
,
Chen
KT
, et al
Identification and clinical association of anti-cytokeratin 18 autoantibody in COPD
.
Immunol Lett
2010
;
128
:
131
6
.
30.
Leidinger
P
,
Keller
A
,
Heisel
S
,
Ludwig
N
,
Rheinheimer
S
,
Klein
V
, et al
Novel autoantigens immunogenic in COPD patients
.
Resp Res
2009
;
10
:
20
.
31.
Terrier
B
,
Degand
N
,
Guilpain
P
,
Servettaz
A
,
Guillevin
L
,
Mouthon
L
. 
Alpha-enolase: a target of antibodies in infectious and autoimmune diseases
.
Autoimmun Rev
2007
;
6
:
176
82
.
32.
Hassfeld
W
,
Steiner
G
,
Studnickabenke
A
,
Skriner
K
,
Graninger
W
,
Fischer
I
, et al
Autoimmune-response to the spliceosome– an immunological link between rheumatoid-arthritis, mixed connective-tissue disease, and systemic lupus-erythematosus
.
Arthritis Rheum
1995
;
38
:
777
85
.
33.
Poole
BD
,
Schneider
RI
,
Guthridge
JM
,
Velte
CA
,
Reichlin
M
,
Harley
JB
, et al
Early targets of nuclear RNP humoral autoimmunity in human systemic lupus erythematosus
.
Arthritis Rheum
2009
;
60
:
848
59
.
34.
Eggleton
P
,
Ward
FJ
,
Johnson
S
,
Khamashta
MA
,
Hughes
GRV
,
Hajela
VA
, et al
Fine specificity of autoantibodies to calreticulin: epitope mapping and characterization
.
Clin Exp Immunol
2000
;
120
:
384
91
.
35.
Katsumata
Y
,
Kawaguchi
Y
,
Baba
S
,
Hattori
S
,
Tahara
K
,
Ito
K
, et al
Identification of three new autoantibodies associated with systemic lupus erythematosus using two proteomic approaches
.
Mol Cell Proteomics
2011
;
10
:
M110.005330
.
36.
Mandik-Nayak
L
,
Allen
PM
. 
Initiation of an autoimmune response–insights from a transgenic model of rheumatoid arthritis
.
Immunol Res
2005
;
32
:
5
13
.
37.
Minota
S
,
Jarjour
WN
,
Suzuki
N
,
Nojima
Y
,
Roubey
RAS
,
Mimura
T
, et al
Autoantibodies to nucleolin in systemic lupus-erythematosus and other diseases
.
J Immunol
1991
;
146
:
2249
52
.
38.
Skoldberg
F
,
Ronnblom
L
,
Thornemo
M
,
Lindahl
A
,
Bird
PI
,
Rorsman
F
, et al
Identification of AHNAK as a novel autoantigen in systemic lupus erythematosus
.
Biochem Biophys Res Commun
2002
;
291
:
951
8
.
39.
Takasaki
Y
,
Kaneda
K
,
Matsushita
M
,
Yamada
H
,
Nawata
M
,
Matsudaira
R
, et al
Glyceraldehyde 3-phosphate dehydrogenase is a novel autoantigen leading autoimmune responses to proliferating cell nuclear antigen multiprotein complexes in lupus patients
.
Int Immunol
2004
;
16
:
1295
304
.
40.
Fossey
SC
,
Vnencak-Jones
CL
,
Olsen
NJ
,
Sriram
S
,
Garrison
G
,
Deng
XQ
, et al
Identification of molecular biomarkers for multiple sclerosis
.
J Mol Diagn
2007
;
9
:
197
204
.
41.
Hauck
SM
,
Dietter
J
,
Kramer
RL
,
Hofmaier
F
,
Zipplies
JK
,
Amann
B
, et al
Deciphering membrane-associated molecular processes in target tissue of autoimmune uveitis by label-free quantitative mass spectrometry
.
Mol Cell Proteomics
2010
;
9
:
2292
305
.
42.
Huguet
S
,
Labas
V
,
Duclos-Vallee
JC
,
Bruneel
A
,
Vinh
J
,
Samuel
D
, et al
Heterogeneous nuclear ribonucleoprotein A2/B1 identified as an autoantigen in autoimmune hepatitis by proteome analysis
.
Proteomics
2004
;
4
:
1341
5
.
43.
Jastorff
AM
,
Haegler
K
,
Maccarrone
G
,
Holsboer
F
,
Weber
F
,
Ziemssen
T
, et al
Regulation of proteins mediating neurodegeneration in experimental autoimmune encephalomyelitis and multiple sclerosis
.
Proteomics Clin Appl
2009
;
3
:
1273
87
.
44.
Tahiri
F
,
Le Naour
F
,
Huguet
S
,
Lai-Kuen
R
,
Samuel
D
,
Johanet
C
, et al
Identification of plasma membrane autoantigens in autoimmune hepatitis type 1 using a proteomics tool
.
Hepatology
2008
;
47
:
937
48
.
45.
Xiang
Y
,
Sekine
T
,
Nakamura
H
,
Imajoh-Ohmi
S
,
Fukuda
H
,
Nishioka
K
, et al
Proteomic surveillance of autoimmunity in osteoarthritis: identification of triosephosphate isomerase as an autoantigen in patients with osteoarthritis
.
Arthritis Rheum
2003
;
48
:
S292
S
.
46.
Lu
H
,
Goodell
V
,
Disis
ML
. 
Humoral immunity directed against tumor-associated antigens as potential biomarkers for the early diagnosis of cancer
.
J Proteome Res
2008
;
7
:
1388
94
.
47.
Andre
F
,
Michiels
S
,
Dessen
P
,
Scott
V
,
Suciu
V
,
Uzan
C
, et al
Exonic expression profiling of breast cancer and benign lesions: a retrospective analysis
.
Lancet Oncol
2009
;
10
:
381
90
.
48.
Suzuki
A
,
Iizuka
A
,
Komiyama
M
,
Takikawa
M
,
Kume
A
,
Tai
S
, et al
Identification of melanoma antigens using a Serological Proteome Approach (SERPA)
.
Cancer Genomics Proteomics
2010
;
7
:
17
23
.
49.
Luqmani
YA
,
Al Azmi
A
,
Al Bader
M
,
Abraham
G
,
El Zawahri
M
. 
Modification of gene expression induced by siRNA targeting of estrogen receptor a in MCF7 human breast cancer cells
.
Int J Oncol
2009
;
34
:
231
42
.
50.
Ma
YL
,
Peng
JY
,
Zhang
P
,
Huang
L
,
Liu
WJ
,
Shen
TY
, et al
Heterogeneous nuclear ribonucleoprotein A1 is identified as a potential biomarker for colorectal cancer based on differential proteomics technology
.
J Proteome Res
2009
;
8
:
4525
35
.
51.
Schiarea
S
,
Salinas
G
,
Allavena
P
,
Scigliuolo
GM
,
Bagnati
R
,
Fanelli
R
, et al
Secretome analysis of multiple pancreatic cancer cell lines reveals perturbations of key functional networks
.
J Proteome Res
2010
;
9
:
4376
92
.
52.
Hatakeyama
S
,
Sugihara
K
,
Nakayama
J
,
Akama
TO
,
Wong
SMA
,
Kawashima
H
, et al
Identification of mRNA splicing factors as the endothelial receptor for carbohydrate-dependent lung colonization of cancer cells
.
Proc Natl Acad Sci U S A
2009
;
106
:
3095
100
.
53.
Imai
H
,
Ochs
RL
,
Kiyosawa
K
,
Furuta
S
,
Nakamura
RM
,
Tan
EM
. 
Nucleolar antigens and autoantibodies in hepatocellular-carcinoma and other malignancies
.
Am J Pathol
1992
;
140
:
859
70
.
54.
Backes
C
,
Ludwig
N
,
Leidinger
P
,
Harz
C
,
Hoffmann
J
,
Keller
A
, et al
Immunogenicity of autoantigens
.
Bmc Genomics
2011
;
12
:
340
.
55.
Lu
H
,
Ladd
J
,
Feng
Z
,
Wu
M
,
Goodell
V
,
Pitteri
SJ
, et al
Evaluation of known oncoantibodies, HER2, p53, and cyclin B1, in prediagnostic breast cancer sera
.
Cancer Prev Res (Phila)
2012
;
5
:
1036
43
.
56.
Clark
CJ
,
Whang
S
,
Paige
KT
. 
Incidence of precancerous lesions in breast reduction tissue: a pathologic review of 562 consecutive patients
.
Plast Reconstr Surg
2009
;
124
:
1033
9
.