Cell-free DNA (cfDNA) concentrations from patients with cancer are often elevated compared with those of healthy controls, but the sources of this extra cfDNA have never been determined. To address this issue, we assessed cfDNA methylation patterns in 178 patients with cancers of the colon, pancreas, lung, or ovary and 64 patients without cancer. Eighty-three of these individuals had cfDNA concentrations much greater than those generally observed in healthy subjects. The major contributor of cfDNA in all samples was leukocytes, accounting for ∼76% of cfDNA, with neutrophils predominating. This was true regardless of whether the samples were derived from patients with cancer or the total plasma cfDNA concentration. High levels of cfDNA observed in patients with cancer did not come from either neoplastic cells or surrounding normal epithelial cells from the tumor's tissue of origin. These data suggest that cancers may have a systemic effect on cell turnover or DNA clearance.

Significance:

The origin of excess cfDNA in patients with cancer is unknown. Using cfDNA methylation patterns, we determined that neither the tumor nor the surrounding normal tissue contributes this excess cfDNA—rather it comes from leukocytes. This finding suggests that cancers have a systemic impact on cell turnover or DNA clearance.

See related commentary by Thierry and Pisareva, p. 2122.

This article is featured in Selected Articles from This Issue, p. 2109

In the peripheral blood of healthy individuals, the vast majority of cell-free DNA (cfDNA) is derived from cells of hematopoietic lineage, consistent with lymphoid and myeloid cell death as the predominant source of cfDNA (1–4). The first measurements of cfDNA from patients with cancer were performed more than 50 years ago (5–8). Although these studies were performed before the development of technologies that could characterize the DNA fragments in detail, it was clear that the concentration of DNA was often elevated in patients with cancer compared with healthy controls (5, 7, 9, 10). Numerous studies since then have confirmed that cfDNA in patients with cancer is often elevated and that patients with more advanced cancer are more likely to have higher cfDNA concentrations (9, 11). Sensitive studies of mutations and chromosome copy-number changes have shown that part of the cfDNA derived from patients with cancer is derived from neoplastic cells within the tumor (9–16). However, many recent studies of cfDNA are based on characteristics that are not entirely specific to neoplastic cells. These include assays based on cfDNA fragment ends, lengths, and enrichments in specific sequences such as promoters, as well as other epigenetic features that can be used to identify not only the presence of cancer but the cell type of origin. In general, previous studies have not been able to distinguish whether the great excess of cfDNA that is observed in some patients with cancer is derived from the neoplastic cells within the tumor rather than from normal epithelial cells surrounding the tumor that may have been damaged by the cancer (Supplementary Note 1 and refs. 3, 14–44). We sought to answer this question through a combination of genetic and epigenetic analysis of cfDNA, particularly in samples with very high concentrations of cfDNA (sample and analysis schema shown in Supplementary Fig. S1).

The Amount of cfDNA in Normal Individuals and Patients with Cancer

The normal concentration of cfDNA in healthy individuals generally ranges from 1 to 10 ng/mL of plasma, and the average concentration of cfDNA in the plasma of patients with cancer is higher than in healthy individuals (7, 9–11, 33–36). As an example, the distribution of cfDNA concentrations as measured by quantitative real-time PCR among 812 healthy individuals and 1,005 patients with cancer from a recently reported study (9) is shown in Fig. 1A. The mean concentration of cfDNA in the plasma of the normal individuals was 4.3 ± 8.6 ng/mL of plasma [2.9 ± 1.6 ng/mL, median ± median absolute deviation (MAD)], whereas the mean concentration of cfDNA in the plasma of patients with stage I–III cancer was 12.6 ± 18.1 ng/mL of plasma (6.30 ± 3.5 ng/mL, median ± MAD). The concentrations varied considerably with cancer type, with lung cancers having the lowest at 5.23 ± 6.4 ng/mL of plasma (3.3 ± 1.5 ng/mL, median ± MAD) and liver the highest at 46.0 ± 35.6 ng/mL of plasma (42.3 ± 29.4 ng/mL, median ± MAD; Fig. 1B). A particularly high concentration of cfDNA in liver cancers has been previously noted (3, 37, 38). There was a significant difference in cfDNA concentration between American Joint Commission on Cancer (AJCC) 7th edition stage I–III cancers as a whole (Supplementary Fig. S2A, P < 0.01), which varied within each cancer type (Supplementary Fig. S2B). Note that none of the 1,005 patients with cancer evaluated in Fig. 1B and Supplementary Fig. S2A and S2B had distant metastatic disease at the time plasma was taken, though it is well known that patients with metastatic disease have the highest cfDNA concentrations (7, 10, 11, 38).

Figure 1.

Plasma cfDNA concentrations in previously described patients. A, Distribution of the average concentration of cfDNA in the plasma of patients with cancer as determined by qPCR shows that it is elevated compared with normal controls. Blue line = normal controls (N = 812). Red line = patients with cancer (N = 1,005). B, The concentration of cfDNA as determined by qPCR for normal controls (N = 812) and patients with breast (N = 209), colorectal (N = 388), esophageal (N = 45), liver (N = 44), lung (N = 104), ovarian (N = 54), pancreatic (N = 93), and stomach (N = 68) cancer. Data are derived from the previously published CancerSEEK study (9). ***, P < 0.001

Figure 1.

Plasma cfDNA concentrations in previously described patients. A, Distribution of the average concentration of cfDNA in the plasma of patients with cancer as determined by qPCR shows that it is elevated compared with normal controls. Blue line = normal controls (N = 812). Red line = patients with cancer (N = 1,005). B, The concentration of cfDNA as determined by qPCR for normal controls (N = 812) and patients with breast (N = 209), colorectal (N = 388), esophageal (N = 45), liver (N = 44), lung (N = 104), ovarian (N = 54), pancreatic (N = 93), and stomach (N = 68) cancer. Data are derived from the previously published CancerSEEK study (9). ***, P < 0.001

Close modal

The concentration of cfDNA in the plasma of the new cohort of normal individuals included in the present study was very similar to the cohort described in ref. 9 and Supplementary Note 2. For 64 normal individuals, the mean concentration of cfDNA was 6.0 ± 10.5 ng/mL of plasma (3.4 ± 0.8 ng/mL, median ± MAD; P = 0.15) was similar to that reported in ref. 9. For 178 patients with stage I–IV cancer evaluated in the new cohort studied here, the mean concentration of cfDNA was greater at 21.8 ± 26.5 ng/mL of plasma (P < 0.001), similar to that reported in ref. 9 at 11.8 ± 5.9, median ± MAD, reflecting the different cancer types and inclusion of patients with metastatic disease in the new cohort.

The Tissue Origin of cfDNA in Normal Individuals

In 64 normal individuals, whole-genome bisulfite sequencing data showed that the vast majority of cfDNA arises from leukocytes regardless of the total concentration of cfDNA in the plasma (Supplementary Note 3; Fig. 2, blue dots; Supplementary Fig. S3; Supplementary Table S1). Neutrophils accounted for roughly two thirds of the leukocyte cfDNA, consistent with the ∼2:1 ratio of neutrophils to lymphocytes in the circulation of healthy individuals. Interestingly, the fraction of cfDNA contributed by B cells was not consistent with the fraction of B cells in the circulation of healthy individuals. B cells are expected to account for only 10% to 15% of blood lymphocytes, whereas T cells account for most of the remainder (80%; ref. 45). But B cell– and T cell–derived DNA accounted for 16.4% ± 12.0% and 17.5% ± 4.3%, respectively, of lymphocyte cfDNA in the circulation. This difference between the ratio of B to T cells expected in the circulation and the ratio of B cell–derived DNA to T cell–derived DNA in the circulation was highly significant (P < 0.001; Supplementary Note 4). Other minor tissue contributors to cfDNA were the liver, colon, heart, brain, and lung, accounting for 5.8% ± 6.6%, 4.0% ± 7.1%, 3.5% ± 3.4%, 3.1% ± 3.3%, and 2.4% ± 3.4% of the total, respectively (Fig. 2, blue dots; Supplementary Table S1). Deconvolution using nonnegative least squares linear regression (NNLS; refs. 40–42) instead of quadratic programming (QP; ref. 39) yielded nearly identical results (Supplementary Fig. S4).

Figure 2.

Deconvolution of the plasma cfDNA methylation profile using the reference cell-type matrix derived from Sun and colleagues (3). The total methylation profile of plasma cfDNA was deconvoluted into 12 different tissue types using quadratic programming. The total leukocyte concentration was taken to be the sum of the concentrations of neutrophils, B cells, and T cells. Note that the y-axes for the different tissue types shown are not the same. Blue and red dots represent normal individuals and cancer patients, respectively. Blue and red rectangles denote correlation metrics.

Figure 2.

Deconvolution of the plasma cfDNA methylation profile using the reference cell-type matrix derived from Sun and colleagues (3). The total methylation profile of plasma cfDNA was deconvoluted into 12 different tissue types using quadratic programming. The total leukocyte concentration was taken to be the sum of the concentrations of neutrophils, B cells, and T cells. Note that the y-axes for the different tissue types shown are not the same. Blue and red dots represent normal individuals and cancer patients, respectively. Blue and red rectangles denote correlation metrics.

Close modal

Using a separate reference deconvolution matrix from Moss and colleagues (43) and QP (39) as the deconvolution algorithm, we found that leukocytes were again the predominant contributor to plasma cfDNA at 61.2% ± 23.4% (Fig. 3, blue dots; Supplementary Table S1), with neutrophils contributing the most cfDNA, followed by monocytes, natural killer (NK) cells, myeloid progenitors, B cells, and T cells. Similar to data obtained with the Sun and colleagues (3) deconvolution reference matrix, other minor contributors included colon epithelial cells at 7.2% ± 8.5% and hepatocytes at 6.7% ± 9.8%, respectively. Again, deconvolution using NNLS (40–42) instead of QP (39) provided very similar results (Supplementary Fig. S5). Contributions of overlapping cell types as determined by the Moss and colleagues (43) and Sun and colleagues (3) reference matrices were similar for total leukocytes, neutrophils, B cells, T cells, hepatocytes, colon epithelial cells, and brain (Supplementary Tables S1 and S2).

Figure 3.

Deconvolution of the plasma cfDNA methylation profile using the reference cell-type matrix derived from Moss and colleagues (43). The total methylation profile of plasma cfDNA was deconvoluted into 24 different tissue types using QP. The total leukocyte concentration was taken to be the sum of myeloid progenitors, monocytes, neutrophils, B cells, CD4 T cells, CD8 T cells, and natural killer (NK) cells. Note that the y-axes for the different tissue types shown are not the same. GI, gastrointestinal. Blue and red dots represent normal individuals and cancer patients, respectively. Blue and red rectangles denote correlation metrics.

Figure 3.

Deconvolution of the plasma cfDNA methylation profile using the reference cell-type matrix derived from Moss and colleagues (43). The total methylation profile of plasma cfDNA was deconvoluted into 24 different tissue types using QP. The total leukocyte concentration was taken to be the sum of myeloid progenitors, monocytes, neutrophils, B cells, CD4 T cells, CD8 T cells, and natural killer (NK) cells. Note that the y-axes for the different tissue types shown are not the same. GI, gastrointestinal. Blue and red dots represent normal individuals and cancer patients, respectively. Blue and red rectangles denote correlation metrics.

Close modal

Deconvolution with the Loyfer and colleagues (44) matrix and the NNLS algorithm similarly showed that leukocytes were the predominant contributor to plasma cfDNA at 54.8% ± 20.3% (Supplementary Table S2), with blood granulocytes contributing the most cfDNA, followed by megakaryocytes, blood monocytes/macrophages, keratinocytes, erythrocyte progenitors, endothelial cells, NK cells, and T cells. Analysis of our healthy cohort showed tissue contributions very similar to those reported in the healthy patients evaluated by Loyfer and colleagues (44).

For individuals with nonelevated concentrations of cfDNA, the results described above are consistent with prior studies showing that most cfDNA comes from cells of lymphoid and myeloid lineage (2–4, 27). The novel aspect of the current study is the determination of these origins in individuals with elevated cfDNA. No prior studies had evaluated the tissue of origin of these elevated cfDNA concentrations in such individuals, and we hypothesized that such individuals might have had tissue-specific damage that accounted for their extremely high cfDNA levels. However, the results did not confirm our hypothesis; there was a linear correlation between the amount of DNA contributed by leukocytes and the total cfDNA concentration at all concentrations regardless of whether the Sun and colleagues (3), Moss and colleagues (43), or Loyfer and colleagues (44) reference matrices were used for deconvolution, or whether the plasma cfDNA methylation signature was deconvoluted by QP or NNLS (Fig. 2, blue dots, R2 = 0.99, P < 0.001; Fig. 3, blue dots, R2 = 0.96, P < 0.001; Supplementary Table S1). In other words, the great majority [average 79%, interquartile range (IQR) 74%–88%] of the cfDNA in healthy plasma, even when the total cfDNA concentration was more than 10× the normal level, arose from leukocytes. Similar fractions of DNA arising from neutrophils, B cells, and T cells were discovered regardless of the total cfDNA concentrations (R2 = 0.99, 0.96, and 0.89, respectively, P < 0.001; Fig. 2, blue dots). The amount of liver and lung DNA was proportionately increased with total cfDNA concentration, in the same way as observed in healthy individuals without elevated cfDNA (Fig. 2, blue dots). Additionally, the same unexpectedly high contribution of B cells to total cfDNA was observed in normal individuals with high cfDNA concentrations as in those with low concentrations (Fig. 2, blue dots). In addition to total leukocytes, analysis using the Moss and colleagues (43) reference matrix highlighted the contribution of myeloid progenitor cells (R2 = 0.91, P < 0.001), monocytes (R2 = 0.86, P < 0.001), and neutrophils (R2 = 0.79, P < 0.001) at all concentrations of total cfDNA (Fig. 3, blue dots).

The Tissue Origins of cfDNA in Patients with Cancer

We analyzed plasma from 178 patients with colorectal (N = 18), lung (N = 31), ovarian (N = 36), or pancreatic (N = 93) cancer to determine the source of their cfDNA. As in the normal individuals described above, leukocyte lysis during sample collection or processing or other contribution from high-molecular-weight DNA was excluded in all patients with cancer (Supplementary Fig. S3; Supplementary Table S1).

In patients with cancer, the tissue source of cfDNA (Fig. 2, red dots) was markedly similar to that in normal individuals (Fig. 2, blue dots). Using the reference matrix from Sun and colleagues (3) and deconvolution via QP, we discovered that 70.5% ± 13.7% (73.6% ± 5.4%, median ± MAD) of the cfDNA in these patients was contributed by leukocytes, with an average of 11.4% ± 11.4%, 5.9% ± 9.0%, 3.6% ± 2.8%, 3.1% ± 3.0%, 2.2% ± 3.7%, and 2.2% ± 2.7% contributed by liver, colon, brain, heart, lungs, and pancreas, respectively (Supplementary Table S1). Of the leukocyte DNA, about two thirds was derived from neutrophils in patients with cancer, just as in normal individuals (Supplementary Table S1). These results are consistent with previous studies on patients with cancer without elevated cfDNA concentrations (3). Deconvolution strategies using the Moss and colleagues (43) reference matrix or using NNLS instead of QP yielded similar results (Fig. 3, red dots, compared with Fig. 2, red dots; Supplementary Figs. S4 and S5; Supplementary Table S1). Analysis using the Loyfer and colleagues (44) approach also produced similar results, with total leukocytes contributing 56.6% ± 13.3% to the total cfDNA pool, with predominant contributions by blood granulocytes, megakaryocytes, blood monocytes/macrophages, and hepatocytes, just as in healthy individuals (Supplementary Table S2).

In patients with elevated concentrations of cfDNA, we expected that a major source of the large amounts of DNA in patients would be from the neoplastic cells and the surrounding nonneoplastic epithelial cells. This expectation was not confirmed by the experiment. As with normal individuals, there was a linear correlation between the amount of DNA contributed by leukocytes and the total cfDNA concentration (Fig. 2, red dots, R2 = 0.92, P < 0.001; Fig. 3, red dots, R2 = 0.82, P < 0.001).

Interestingly, 10 patients with cancer had approximately 5 ng/mL or more of their cfDNA derived from colonic epithelium (Supplementary Table S1). These concentrations were significantly different from those derived from normal individuals regardless of the total cfDNA concentration in the normal individuals (P < 0.001). Of the 10, 80% (8/10) of these patients had colorectal cancers. We sought to understand the origin of this epithelial DNA. In theory, it could have come from the neoplastic cells themselves or the surrounding nonneoplastic colorectal epithelial cells that had been destroyed by the cancer. It is well known that cancers destroy surrounding normal organ cells during the invasive process, and these dead or dying cells could in principle contribute to cfDNA (46–48). To distinguish between these two possibilities, tumor-specific mutations and copy-number alterations (CNA) in the cfDNA were used to determine the fraction of the cfDNA contributed by the neoplastic cells themselves.

We found a linear correlation between the fraction of cfDNA derived from colon epithelial cells and the fraction of cfDNA derived from neoplastic colon epithelial cells in patients with colorectal cancer (Supplementary Fig. S6, R2 = 0.95, P < 0.001; Supplementary Note 5). The former was assessed by the whole-genome bisulfite sequencing of plasma cfDNA, whereas the latter was assessed by SafeSeqS analysis of mutations in the same plasma cfDNA samples, as described in Kinde and colleagues (49). The data in Supplementary Fig. S6 and Supplementary Table S1 show that the amount of cfDNA derived from all colonic epithelial cells (assessed by methylation) was similar to that expected from the contribution of the neoplastic colonic epithelial cells alone (as assessed by mutation; ref. 50). This conclusion was supported by copy-number analysis of the cfDNA (Supplementary Table S1).

The Tissue Origins of cfDNA following Surgery

Other conditions besides cancer have been associated with elevated cfDNA concentrations (51–53). For example, it has been shown that large increases in cfDNA occur 1 day after surgery (10, 54, 55). To investigate the source of the extra cfDNA in such patients, we obtained plasma samples approximately 24 hours after surgery from nine of the patients with pancreatic cancer included in Supplementary Table S1, all of whom had Whipple procedures for tumor resection. Prior to surgery, eight of the nine patients had total cfDNA concentrations in the normal range (Supplementary Fig. S7). Following surgery, there was a dramatic elevation of the total cfDNA, ranging from 2.3- to 18-fold (median of 8.7-fold) in these eight patients (Supplementary Fig. S7; Supplementary Table S1, P = 0.001). The only patient for whom the cfDNA concentration did not increase following surgery was the one (PANCA 1248) with elevated cfDNA (29.1 ng/mL of plasma) prior to surgery.

Bisulfite sequencing of plasma cfDNA in these eight patients revealed the following:

  • (i) The amount of cfDNA from all evaluable tissue sources increased after surgery, though the additional contribution from the lungs, brain, esophagus, small intestines, pancreas, and heart appeared to be slightly elevated (P = 0.09, 0.13, 0.08, 0.09, 0.13, and 0.10, respectively; Fig. 4; Supplementary Table S1).

  • (ii) The majority (average 57%; range, 42%–70%) of the total cfDNA after surgery was from leukocytes, and the predominant contributors to leukocyte cfDNA were neutrophils (average 73%; range, 67%–82% of the leukocyte cfDNA; Supplementary Fig. S8; Supplementary Table S1).

  • (iii) For most tissues, the proportional representation of each evaluable tissue either decreased or changed only slightly after surgery—except for the liver (Supplementary Fig. S8; Supplementary Table S1). There was a striking increase (average 46-fold, range, 6- to 144-fold) in the fraction of total cfDNA derived from hepatocytes following surgery, in marked contrast to the other tissues (Supplementary Fig. S8). The amount of the “neo-cfDNA” can be defined and calculated by subtracting the amount of cfDNA present before surgery from the amount of cfDNA present after surgery. This calculation showed that hepatocytes contributed an average of 38.2% (range, 23.9%–57.6%) and leukocytes contributed an average of 48.4% (range, 23.3%–67.1%) of the neo-cfDNA.

Figure 4.

The amount of cfDNA from evaluable tissue sources before and ∼24 hours after surgery. Each color represents a separate patient (see Supplementary Table S1). *, P < 0.05; **, P < 0.01; ***, P < 0.001.

Figure 4.

The amount of cfDNA from evaluable tissue sources before and ∼24 hours after surgery. Each color represents a separate patient (see Supplementary Table S1). *, P < 0.05; **, P < 0.01; ***, P < 0.001.

Close modal

Based on the clinical history of PANC 696 described above, as well as on previous data (3, 52), we thought it likely that much of the extra cfDNA following surgery was due to liver damage. We were able to obtain standard measurements of liver function using alanine transaminase (ALT) and aspartate aminotransferase (AST) levels in five of the eight patients whose cfDNA concentrations increased following surgery. AST and ALT levels substantially increased in all five patients (P < 0.05; Supplementary Fig. S9). Notably, in the one patient (PANCA 1248) whose total cfDNA did not increase after surgery, AST levels were already increased prior to surgery, unlike the other patients assessed (Supplementary Table S1).

The results of this study lend support to previous observations about the origins of cfDNA in healthy individuals and patients with cancer who have normal to slightly elevated concentrations of cfDNA. Additionally, the results of this study lead to several important conclusions about the origin of excess cfDNA in patients with greatly elevated cfDNA concentrations:

  • (i) In patients with colorectal, lung, ovarian, and pancreatic cancers with high concentrations of cfDNA in the present study, the increased cfDNA does not primarily come from either the neoplastic cells within the cancer or the adjacent nonneoplastic epithelial cells.

  • (ii) Instead, the increased cfDNA in these patients with cancer comes largely from leukocytes, primarily neutrophils.

  • (iii) The elevated cfDNA in patients with cancer studied is attributable to a systemic effect. It is not just neutrophils, but also B and T lymphocytes, and in some cases hepatocytes, colon epithelial cells, and lung epithelial cells, that release more DNA into the circulation when cfDNA concentrations are elevated.

  • (iv) Similarly, the elevated cfDNA that routinely occurs following surgery of patients with pancreatic cancer arises from a systemic effect, resulting in the release of cfDNA from leukocytes but in this case also from hepatocytes.

  • (v) The cfDNA contributed by leukocytes is often associated with an overrepresentation of B cells compared with T cells regardless of disease state and cfDNA concentration.

One of the major questions raised by our data is the nature of the systemic factor(s) that are responsible for increasing the contributions of cfDNA from all major tissue sources of cfDNA (56–73). One possibility is that the systemic factor(s) are one or more of the myriad of proteins and other molecules known to be secreted by neoplastic cells (74, 75), or released upon the death of cancer cells in situ (46, 75). Another possibility is that these factors come from endothelial cells within the cancers. There is convincing evidence indicating that the tumor vasculature is abnormal (76, 77) and endothelial cells are in direct contact with the systemic circulation. Inflammatory cells within the tumor could also release cytotoxic products (78). A completely different but enticing possibility is that cell turnover is normal in these patients, but clearance of cfDNA is abnormal. We hope that the results of this study will stimulate research to identify the biochemical basis for the pronounced elevation of cfDNA observed in patients with cancer and in other clinical scenarios.

Sample Collection and DNA Isolation

All individuals participating in the study provided written informed consent after approval by the institutional review board (IRB) at the patient's participating institution (including Johns Hopkins IRB00075499 and Melbourne Health Human Research Ethics Committee 2011.225), and the study complied with the Health Insurance Portability and Accountability Act and the Declaration of Helsinki. Peripheral blood was collected in K2-EDTA tubes after informed consent was obtained and prior to and/or 24 hours after patients underwent surgical resection. General demographics, surgical pathology, and AJCC stage (7th) were documented. The cohort is outlined in Supplementary Fig. S1. The healthy cohort consisted of peripheral blood samples obtained from 64 individuals of median age 48.5 years (IQR, 28–58 years) with no history of cancer. The cancer and healthy control samples were processed in an identical manner. Plasma samples from 18 patients with colorectal cancer, 31 patients with lung cancer, 36 patients with ovarian cancer, and 93 patients with pancreatic cancer were included in the study (median age 67 years; IQR, 56–74 years).

The 242 individuals included in this study were chosen from cfDNA samples collected for studies described in ref. 48 and similar ongoing studies to evaluate the use of cfDNA for the earlier detection of cancer in patients prior to surgery or any other form of therapy. All individuals for whom sufficient plasma was available for construction of libraries for whole-genome sequencing of bisulfite-treated DNA were considered. Any individual with a cfDNA concentration >15 ng/mL of plasma from this collection was chosen for analysis. Additionally, there were two different blood samples available from nine of the 21 patients with pancreatic cancer, one collected prior to surgery and the other collected approximately 24 hours after surgery, and these were chosen for analysis. Finally, normal individuals with cfDNA concentrations <15 ng/mL of plasma, as well as patients with cancer with cfDNA concentrations <15 ng/mL, were chosen randomly. DNA from each of these 242 patients (251 total plasma samples) was purified with a BioChain cfDNA Extraction Kit (BioChain, cat. #K5011610) using the manufacturer's recommended protocol. DNA from peripheral white blood cells was purified with the QIAsymphony DSP DNA Midi Kit (Qiagen, cat. #937255) as specified by the manufacturer. cfDNA was quantified using qPCR using Sso Advanced SYBR Green Supermix (Bio-Rad, cat. # 1725271) as directed by the manufacturer and using the following primers:

  • 5″-CACACAGGAAACAGCTATGACCATGGGTAACAGCTTTATCTATTGACATTATGC-3″

  • 5″-CGACGTAAAACGACGGCCAGTNNNNNNNNNNNNNNAAACTTCATGCTTCATCTAGTCAGC-3″

National Institute of Standards and Technology (NIST) human DNA quantification standard NIST SRM 2372a, diluted to 1 ng/mL, served as the reference standard. cfDNA or NIST 2372a DNA (2.5 μL) was added to 97.5 μL of 1:1,000 SYBR Green I diluted in 1× PBS. Amplification and fluorescence detection conditions were as follows: one cycle of 98°C for 120 seconds and then 30 cycles of 98°C for 10 seconds, 57°C for 120 seconds, and 72°C for 120 seconds.

Bisulfite Treatment, Library Preparation, and Sequencing

For libraries prepared using the Accel-NGS Methyl-Seq DNA Library Kit (Swift BioSciences, cat. #30024; Supplementary Table S1), the EZ DNA Methylation Kit (Zymo Research, cat. #D5001) was used to prepare DNA samples as follows. DNA was denatured in dilute M-Dilution buffer at 37°C for 15 minutes and then bisulfite-converted in the dark at 50°C for 16 hours before being placed on ice for 10 minutes (79). After a single wash with M-Wash buffer, the sample was desulphonated for 15 minutes at room temperature. The sample was washed twice in M-Wash buffer and then eluted in the Zymo Elution Buffer and stored at –20°C (49). Sequencing libraries were then prepared using the Accel-NGS Methyl-Seq DNA Library Kit, with nine PCR cycles used at the indexing stage. Samples assessed using MethylSaferSeqS (Supplementary Table S1) were prepared as described in detail by Wang and colleagues (80), in which library preparation is performed prior to bisulfite treatment (50). Tissue-specific methylation status as assessed by the two library preparation methods produced indistinguishable results (50). Each library was paired-end sequenced to 150 bp on a single lane of an Illumina HiSeq 4000 instrument. Reads passing Illumina CASAVA Chastity filters were used for subsequent analysis.

DNA Sequencing Data Analysis

Illumina adapters and bases with quality scores below 25 were trimmed from the head and tail of each read using Trimmomatic (81). To improve mapping efficiency by reducing spurious mutations introduced by end repair, 15 bp were additionally cropped from the tail of Read 1 and the head of Read 2 using Trimmomatic per Swift's recommendations. BSMAP was used to align each paired-end read to the bisulfite-converted hg19 genome, and the average methylation at each CpG was computed using BSMAP's methratio.py script (82).

Identification of Methylation Markers for Plasma cfDNA Tissue Deconvolution by QP

The average contribution of 12 tissue types (liver, lungs, colon, small intestines, pancreas, adrenal glands, esophagus, heart, brain, T cells, B cells, and neutrophils) to the total cfDNA pool was determined using 5,653 differentially methylated 500-bp regions described by Sun and colleagues (3). In brief, the approach was bioinformatically based on whole-genome bisulfite sequencing of normal DNA from the liver, lungs, esophagus, heart, pancreas, colon, small intestines, adrenal glands, brain, and T cells, which was retrieved from the Human Epigenome Atlas from the Baylor College of Medicine. The bisulfite sequencing data for B cells and neutrophils are from Hodges and colleagues (83). All CpG islands (CGI) and CpG shores on autosomes were assessed for potential inclusion into the methylation marker set. CGIs and CpG shores on sex chromosomes were not used, so as to minimize potential variations in methylation levels related to the sex-associated chromosome dosage difference in the source data. CGIs were downloaded from the University of California Santa Cruz (UCSC) database (genome.ucsc.edu; 27,048 CGIs for the human genome; ref. 84), and CpG shores were defined as 2-kb flanking windows of the CGIs (85). Then, the CGIs and CpG shores were subdivided into non­overlapping 500-bp units, and each unit was considered a potential methylation marker.

The methylation densities (i.e., the percentage of CpGs being methylated within a 500-bp unit) of all the potential marker loci were compared between the 12 tissue types. Using the methylation profiles of the 12 tissue types, two types of methylation markers were identified. Type I markers refer to any genomic loci with methylation densities that are 3 SDs below or above in one tissue compared with the mean level of the 12 tissue types. Type II markers are genomic loci that demonstrate highly variable methylation densities across the 12 tissue types. A locus is considered highly variable when (i) the methylation density of the most hypermethylated tissue is at least 20% higher than that of the most hypomethylated one, and (ii) the SD of the methylation densities across the 12 tissue types when divided by the mean methylation density (i.e., the coefficient of variation) of the group is at least 0.25. To reduce the number of potentially redundant markers, only one marker would be selected in one contiguous block of two CpG shores flanking one CGI. The genomic locations of the type I and II markers used in this study can be found in Supplementary Table S1 in Sun and colleagues (3).

Plasma cfDNA Tissue Deconvolution by QP

As described in the work by Sun and colleagues (3), the mathematical relationship between the methylation densities of the different methylation markers in plasma and the corresponding methylation markers in different tissues can be expressed as

where |${\overline {MD} }_{\rm{i}}$| represents the methylation density of the methylation biomarker i in the plasma; pk represents the proportional contribution of tissue k to the plasma; and MDik represents the methylation density of the methylation biomarker i in tissue k. The aim of the deconvolution process was to determine the proportional contribution of tissue k to the plasma, namely, pk, for each member of the panel of tissues. QP (39) was used to solve the simultaneous equations. A matrix was compiled including the panel of tissues and their corresponding methylation densities for each methylation marker on the combined list of type I and II markers (a total of 5,653 markers). The program input a range of pk values for each tissue type and determined the expected plasma DNA methylation density for each marker. The tested range of pk values should fulfill the expectation that the total contribution of all candidate tissues, namely, the liver, neutrophils, and lymphocytes for this study, to plasma DNA would be 100% and the values of all pk would be nonnegative. The program then identified the set of pk values that resulted in expected methylation densities across the markers that most closely resembled the data obtained from the plasma DNA bisulfite sequencing.

The total contribution from T cells and B cells was regarded as the contribution from the lymphocytes, and the total contribution from leukocytes was regarded as the contribution from the lymphocytes and neutrophils. To obtain absolute levels of cfDNA (ng/mL) per cell type, the resulting contribution was multiplied by the total concentration of cfDNA present in the sample.

Identification of Methylation Markers for Plasma cfDNA Tissue Deconvolution by NNLS

The average contribution of 24 tissue types (neutrophils, monocytes, CD4 T cells, CD8 T cells, B cells, NK cells, myeloid progenitors, adipocytes, cortical neurons, hepatocytes, lung cells, pancreatic acinar cells, pancreatic duct cells, vascular endothelial cells, colon epithelial cells, left atrium, bladder, breast, head and neck/larynx, kidney, prostate, thyroid, upper gastrointestinal, uterus/cervix) to the total cfDNA pool was determined using 7,890 differentially methylated CpGs, as described in Moss and colleagues (43). In brief, all DNA methylation profiles were determined either on the Illumina Infinium Human Methylation 450K or using the EPIC BeadChip arrays. DNA methylation data for white blood cells (neutrophils, monocytes, B cells, CD4+ T cells, CD8+ T cells, and NK cells, n = 6 each) were downloaded from GSE110555 (EPIC; ref. 86). Data for myeloid progenitors (n = 5) were downloaded from GSE63409 (450K; ref. 87), and data for left atrium (n = 4) were downloaded from GSE62727 (450K; ref. 88). Data for bladder (n = 19), breast (n = 98), cervix (n = 3), colon (n = 38), esophagus (n = 16), oral cavity (n = 34), kidney (n = 160), prostate (n = 50), rectum (n = 7), stomach (n = 2), thyroid (n = 56), and uterus (n = 34) were downloaded from The Cancer Genome Atlas (89). DNA methylation data for adipocytes (n = 3, 450K), hepatocytes (n = 3, 450K and EPIC), alveolar lung cells (n = 3, EPIC), neurons (n = 3, 450K and EPIC), vascular endothelial cells (n = 2, EPIC), pancreatic acinar cells (n = 3, 450K and EPIC), pancreatic duct cells (n = 3, 450K and EPIC), and colon epithelial cells (n = 3, EPIC) were generated by Moss and colleagues (43) and can be requested from the authors.

To analyze DNA methylation samples composed of admixed methylomes from various cell types, the authors approximated the plasma cfDNA methylation profile as a linear combination of the methylation profiles of cell types in the reference atlas. According to this model, the relative contributions of different cell types to plasma cfDNA can be determined using NNLS as described in refs. 40–42. To select candidate CpGs, the authors of refs. 40–42 first excluded CpGs whose variance across the entire methylation atlas was below 0.1% or was missing. They then selected the K = 100 most specific hypermethylated CpGs for each cell type, denoting the methylation matrix X, composed of N rows (CpGs) by d columns (cell types). They then divided each row (the methylation pattern of one CpG over all cell types) by its sum:

For each cell type j, they identified the top K hypermethylated CpGs with the highest Xi,j values. To identify uniquely hypomethylated CpGs, they performed a similar process for the reversed methylation matrix (1−X). Finally, for each cell type, they included both the top K hypermethylated and the top K unmethylated CpGs in the reference matrix. To this set of CpGs, they added neighboring CpGs up to 50 bp. Pairwise-specific CpGs were iteratively selected as follows: Given the current set S of CpGs, they projected the reference atlas on those coordinates and calculated the Euclidean distances between pairs of cell types. Once the closest pair of cell types was identified, they selected the CpG site where they differed the most and added it into the set S. This process was iteratively repeated, focusing on the most confusing pair of cell types in each iteration. Admixing experiments, similar to those performed in Sun and colleagues (3), were performed using buffy coat bisulfite sequencing data mixed with liver, lung, colon epithelial cell, or left atrium bisulfite sequencing data, showing excellent agreement between predicted fraction and actual fraction (as shown in Supplementary Fig. S10A–S10D).

Plasma cfDNA Tissue Deconvolution by NNLS

A custom Python script adapted from the nnls package in MATLAB and described in Moss and colleagues (43) was used to perform NNLS (40–42) to calculate the relative contribution of each cell type to a given sample. Given a matrix X of reference methylation values with N CpGs and d cell types, and a vector Y of methylation values of length N, nonnegative coefficients β were identified by solving argmin βXβY2, subject to β ≥ 0. The resulting β was adjusted to have a sum of 1, where for each βj was defined as:

To obtain absolute levels of cfDNA (ng/mL) per cell type, the resulting βj′ was multiplied by the total concentration of cfDNA present in the sample, as measured by quantitative PCR.

A similar analysis using NNLS with an expanded matrix of 39 cell types is described in Loyfer and colleagues (44).

RealSeqS

RealSeqS was used to test the plasma samples for evidence of aneuploidy and contamination with high-molecular-weight DNA derived from leukocytes that were lysed during venipuncture or blood processing (12). RealSeqS uses a single primer pair to amplify ∼750,000 loci scattered throughout the genome (12). PCR was performed in 25-μL reactions containing 7.25 μL of water, 0.125 μL of each primer, 12.5 μL of NEBNext Ultra II Q5 Master Mix (New England Biolabs, cat. #M0544S), and 5 μL of DNA. The cycling conditions were one cycle of 98°C for 120 seconds and then 15 cycles of 98°C for 10 seconds, 57°C for 120 seconds, and 72°C for 120 seconds. Each plasma DNA sample was assessed in eight independent reactions, and the amount of DNA per reaction varied from ∼0.1 to 0.25 ng. A second round of PCR was then performed to add dual indexes (barcodes) to each PCR product prior to sequencing, as described in Douville and colleagues (12). The second round of PCR was performed in 25-μL reactions containing 7.25 μL of water, 0.125 μL of each primer, 12.5 μL of NEBNext Ultra II Q5 Master Mix (New England Biolabs, cat. #M0544S), and 5 μL of DNA containing 5% of the PCR product from the first round. The cycling conditions were one cycle of 98°C for 120 seconds and then 15 cycles of 98°C for 10 seconds, 65°C for 15 seconds, and 72°C for 120 seconds. Amplification products from the second round were purified with AMPure XP beads (Beckman, cat. #a63880), as per the manufacturer's instructions, prior to sequencing. As noted above, each sample was amplified in eight independent PCRs in the first round. Each of the eight independent PCRs was then reamplified using index primers in the second PCR round. The sequencing reads from the 8 replicates were summed for the bioinformatic analysis but could also be assessed individually for quality control purposes. Massively parallel sequencing was performed on an Illumina HiSeq 4000. During the first round of PCR, degenerate bases at the 5′ end of one of the primers were used as molecular barcodes [unique identifiers (UID)] to uniquely label each DNA template molecule (13). This ensured that each DNA template molecule was counted only once, as described in Kinde and colleagues (39). In all instances for RealSeqS in this article, the term “reads” refers to uniquely identified reads (UIDs). If multiple reads had the same UID, at least 50% of the reads were required to map to the same genomic location. Reads with the same UID but with discordant genomic locations were discarded from analysis.

After massively parallel sequencing, gains or losses of each of the 39 chromosome arms covered by the assay were determined using a bespoke statistical learning method (13). A support vector machine (SVM) was used to discriminate between aneuploid and euploid samples. The SVM was trained using 2,651 aneuploid samples and 1,348 euploid plasma samples to yield a “genome-wide aneuploidy score.” Samples were scored as positive when the genome-wide aneuploidy score was >0.441.

Plasma samples were also analyzed for genomic DNA contamination using RealSeqS. RealSeqS enables the detection of genomic DNA by virtue of the differently sized amplicons generated during PCR amplification (12). Because the average size of cfDNA is ∼160 to 180 bp, almost all the ∼750,000 amplicons are present in an average cfDNA sample. However, there were 1,241 amplicons of size 200 to 500 bp, which represent contamination by genomic DNA. Coverage at these long amplicons is proportional to the background rate of genomic DNA contamination, as described in ref. 13. In samples containing >15 ng of DNA per mL of plasma in which RealSeqS data were not available, an Agilent BioAnalyzer System was used to evaluate the fraction of DNA >500 bp.

Somatic Mutations

For patients with colorectal cancer, a panel of 15 genes was designed to find mutations in DNA from primary tumors, as described in Tie and colleagues (90). This panel enabled the detection of at least one mutation in 98% of colorectal cancer samples tested (90). The mutation with the highest mutant allele frequency in the primary tumor was then used to assess plasma DNA, as described by Tie and colleagues (90). The SafeSeqS approach, using unique identifiers (UIDs, aka molecular barcodes), was then used to assess the plasma DNA for the mutation of interest (12). For patients with pancreatic cancer, plasma was directly assessed with SaferSeqS primers (9, 91) for mutations at codons 12, 13, 59, 60, and 61, as >95% of pancreatic cancers harbor a mutation at one of these positions (92).

CNAs

ichorCNA version 3.2 was downloaded on August 25, 2022, and applied to whole-genome sequencing data on bisulfite-treated cfDNA. Tumor fraction estimates were based on CNA of 500-kb intervals using default parameters. The lower limit of detection was considered to be 3% based on data from ref. 93.

Statistical Considerations

A 𝛘2 test was used to compare the number of individuals with elevated concentrations of cfDNA in eight cancer types compared with healthy persons. A one-way ANOVA was used to compare the number of individuals with elevated concentrations of cfDNA in eight cancer types by AJCC 7th edition stage. The Pearson correlation coefficient was used to determine the relationship between total cfDNA concentration and relative contribution from individual tissues, and a t statistic was used to determine statistical significance. A Student two-tailed t test was used to compare the total concentration of cfDNA before and after surgery and AST and ALT levels before and after surgery. A P value ≤0.05 was considered statistically significant. In general, the numbers reported in this article are accurate to two significant digits. Three digits are reported in some instances solely because three digits were convenient for comparing various drafts of the article during its writing and editing.

Data Availability Statement

Data on methylation (bisulfite sequencing) and CNAs in plasma DNA are deposited in the European Genome-phenome Archive (EGAS00001005400). Similarly, data on mutations in plasma are available from the European Genome-phenome Archive (EGAS00001002764 and EGAS00001002444). Commercial use remains restricted due to Johns Hopkins Medicine's legal requirements.

C. Douville reports personal fees from Exact Sciences during the conduct of the study; personal fees from Belay Diagnostics outside the submitted work; and a patent for RealSeqS licensed and with royalties paid from Exact Sciences. Y. Wang reports a patent for MethylSaferSeqS pending to Johns Hopkins, and is a consul­tant for Thrive Earlier Detection/Exact Sciences. A.H. Pearlman reports grants from the Lustgarten Foundation for Pancreatic Cancer Research, the Virginia and D.K. Ludwig Fund for Cancer Research, The Sol Goldman Center for Pancreatic Cancer Research, The Marcus Foundation, the John Templeton Foundation, and the NIH during the conduct of the study; grants from the Lustgarten Foundation for Pancreatic Cancer Research, the Virginia and D.K. Ludwig Fund for Cancer Research, The Sol Goldman Center for Pancreatic Cancer Research, The Marcus Foundation, the John Templeton Foundation, and the NIH outside the submitted work; and a patent for 15/560,241 pending, a patent for 16/614,005 pending, a patent for 63/059,638 pending, a patent for 17/783,506 pending, a patent for PCT/US2022/019980 pending, a patent for PCT/US2021/061453 pending, a patent for 63/119,753 pending, and a patent for PCT/US23/12290 pending. J.D. Cohen reports other support from Haystack Oncology and personal fees from Exact Sciences outside the submitted work. J. Tie reports personal fees from Haystack Oncology, AstraZeneca, Pierre Fabre, MSD, Bristol Myers Squibb, BeiGene, Seres Therapeutics, and Takeda outside the submitted work. C. Bettegowda reports personal fees from Haystack Oncology, Privo Technologies, Depuy-Synthes, Galectin Therapeutics, and Bionaut Labs, and other support from OrisDx and Belay Diagnostics outside the submitted work. R.H. Hruban reports personal fees from Thrive Earlier Detection during the conduct of the study. C. Tomasetti reports grants from the John Templeton Foundation during the conduct of the study; other support from Exact Sciences, Haystack Oncology, and PrognomiQ outside the submitted work; a patent for CancerSEEK with royalties paid from Exact Sciences, a patent for RealSeqS with royalties paid from Exact Sciences, a patent for BestSeqS pending, and a patent for BarcSeqS pending; and consulting for Bayer. P. Jiang reports grants from the Innovation and Technology Commission of the Hong Kong SAR Government (InnoHK Initiative) during the conduct of the study; personal fees from KingMed Future, Take2 Health, and Illumina and nonfinancial support from DRA outside the submitted work; and many patents/patent applications related to cfDNA molecules pending, issued, licensed, and with royalties paid from Grail, Illumina, DRA, Take2 Health, and Xcelom. K.C.A. Chan is a director of Take2, DRA, Insighta, and Prenetics; holds equities in Take2, DRA, Insighta, Prenetics, and Illumina; and holds patents on molecular diagnostics, and parts of the patent portfolio are licensed to Take2, DRA, Insighta, Grail, Illumina, and Labcorp and he receives royalties from these companies. Y.M.D. Lo is supported by the Innovation and Technology Fund under the InnoHK Initiative and by an endowed chair from the Li Ka Shing Foundation; holds equities in Insighta, DRA Limited, and Take2; has patents or patent applications that have been licensed to and receives royalties from Grail, Illumina, Labcorp, Xcelom, DRA Limited, Take2, and Insighta; and is a scientific cofounder of Grail. N. Papadopoulos reports grants from the NIH and the Commonwealth Foundation during the conduct of the study; personal fees and other support from Thrive Earlier Detection, an Exact Sciences Company, other support from Personal Genome Diagnostics, a Labcorp Company, Haystack Oncology, CAGE Pharma, NeoPhore, and ManaT Bio, and personal fees from Vidium outside the submitted work; a patent for MethylSaferSeqS: a method for simultaneous detection of genetic and epigenetic changes pending and a patent for bisulfite-converted duplexes for the strand-specific detection and quantification of rare mutations pending; and some of the companies named above, as well as other companies, have licensed previously described technologies related to the work described in this article from Johns Hopkins University, and N. Papadopoulos is an inventor on some of these technologies. Licenses to these technologies are or will be associated with equity or royalty payments to the inventors as well as to Johns Hopkins University. Patent applications on the work described in this article may be filed by Johns Hopkins University. The terms of all these arrangements are being managed by Johns Hopkins University in accordance with its conflict-of-interest policies. K.W. Kinzler reports grants from the Lustgarten Foundation for Pancreatic Cancer Research, the Virginia and D.K. Ludwig Fund for Cancer Research, The Sol Goldman Center for Pancreatic Cancer Research, The Marcus Foundation, the John Templeton Foundation, and the NIH during the conduct of the study; personal fees and other support from Thrive Earlier Detection, an Exact Sciences Company, and other support from Personal Genome Diagnostics, CAGE Pharma, NeoPhore, ManaT Bio, and Haystack Oncology outside the submitted work; a patent for MethylSaferSeqS: a method for simultaneous detection of genetic and epigenetic changes pending and a patent for bisulfite-converted duplexes for the strand-specific detection and quantification of rare mutations pending; and some of the companies named above, as well as other companies, have licensed previously described technologies related to the work described in this article from Johns Hopkins University, and K.W. Kinzler is an inventor on some of these technologies. Licenses to these technologies are or will be associated with equity or royalty payments to the inventors as well as to Johns Hopkins University. Patent applications on the work described in this article may be filed by Johns Hopkins University. The terms of all these arrangements are being managed by Johns Hopkins University in accordance with its conflict-of-interest policies. B. Vogelstein reports grants from the Lustgarten Foundation for Pancreatic Research, the Virginia and D.K. Ludwig Fund for Cancer Research, The Sol Goldman Center for Pancreatic Cancer Research, The Marcus Foundation, the John Templeton Foundation, and the NIH during the conduct of the study; other support from Thrive Earlier Detection, an Exact Sciences Company, Personal Genome Diagnostics, CAGE Pharma, NeoPhore, ManaT Bio, Haystack Oncology, and Catalio Capital Management outside the submitted work; a patent for MethylSaferSeqS: a method for simultaneous detection of genetic and epigenetic changes pending and a patent for bisulfite-converted duplexes for the strand-specific detection and quantification of rare mutations pending; and some of the companies named above, as well as other companies, have licensed previously described technologies related to the work described in this article from Johns Hopkins University, and B. Vogelstein is an inventor on some of these technologies. Licenses to these technologies are or will be associated with equity or royalty payments to the inventors as well as to Johns Hopkins University. Patent applications on the work described in this article may be filed by Johns Hopkins University. The terms of all these arrangements are being managed by Johns Hopkins University in accordance with its conflict-of-interest policies. No disclosures were reported by the other authors.

A.K. Mattox: Conceptualization, resources, data curation, software, formal analysis, validation, investigation, visualization, methodology, writing–original draft, writing–review and editing. C. Douville: Formal analysis, writing–review and editing. Y. Wang: Data curation, writing–review and editing. M. Popoli: Data curation, writing–review and editing. J. Ptak: Resources, writing–review and editing. N. Silliman: Resources, writing–review and editing. L. Dobbyn: Resources, writing–review and editing. J. Schaefer: Resources, writing–review and editing. S. Lu: Data curation, writing–review and editing. A.H. Pearlman: Data curation, writing–review and editing. J.D. Cohen: Writing–review and editing. J. Tie: Resources, writing–review and editing. P. Gibbs: Resources, writing–review and editing. K. Lahouel: Software, writing–review and editing. C. Bettegowda: Supervision, writing–review and editing. R.H. Hruban: Resources, writing–review and editing. C. Tomasetti: Formal analysis, writing–review and editing. P. Jiang: Formal analysis, writing–review and editing. K.C.A. Chan: Formal analysis, writing–review and editing. Y.M.D. Lo: Formal analysis, writing–review and editing. N. Papadopoulos: Formal analysis, writing–review and editing. K.W. Kinzler: Formal analysis, writing–review and editing. B. Vogelstein: Conceptualization, resources, data curation, software, formal analysis, supervision, funding acquisition, validation, investigation, visualization, methodology, writing–original draft, project administration, writing–review and editing.

The publication costs of this article were defrayed in part by the payment of publication fees. Therefore, and solely to indicate this fact, this article is hereby marked “advertisement” in accordance with 18 USC section 1734.

Note: Supplementary data for this article are available at Cancer Discovery Online (http://cancerdiscovery.aacrjournals.org/).

1.
Lui
YY
,
Chik
KW
,
Chiu
RW
,
Ho
CY
,
Lam
CW
,
Lo
YM
.
Predominant hematopoietic origin of cell-free DNA in plasma and serum after sex-mismatched bone marrow transplantation
.
Clin Chem
2002
;
48
:
421
7
.
2.
Zheng
YW
,
Chan
KC
,
Sun
H
,
Jiang
P
,
Su
X
,
Chen
EZ
, et al
.
Nonhematopoietically derived DNA is shorter than hematopoietically derived DNA in plasma: a transplantation model
.
Clin Chem
2012
;
58
:
549
58
.
3.
Sun
K
,
Jiang
P
,
Chan
KC
,
Wong
J
,
Cheng
YK
,
Liang
RH
, et al
.
Plasma DNA tissue mapping by genome-wide methylation sequencing for noninvasive prenatal, cancer, and transplantation assessments
.
Proc Natl Acad Sci U S A
2015
;
112
:
E5503
12
.
4.
Snyder
MW
,
Kircher
M
,
Hill
AJ
,
Daza
RM
,
Shendure
J
.
Cell-free DNA comprises an in vivo nucleosome footprint that informs its tissues-of-origin
.
Cell
2016
;
164
:
57
68
.
5.
Mandel
P
,
Metais
P
.
Nuclear acids in human blood plasma
.
C R Seances Soc Biol Fil
1948
;
142
:
241
3
.
6.
Steinman
CR
.
Free DNA in serum and plasma from normal adults
.
J Clin Invest
1975
;
56
:
512
5
.
7.
Leon
SA
,
Shapiro
B
,
Sklaroff
DM
,
Yaros
MJ
.
Free DNA in the serum of cancer patients and the effect of therapy
.
Cancer Res
1977
;
37
:
646
50
.
8.
Stroun
M
,
Anker
P
,
Lyautey
J
,
Lederrey
C
,
Maurice
PA
.
Isolation and characterization of DNA from the plasma of cancer patients
.
Eur J Cancer Clin Oncol
1987
;
23
:
707
12
.
9.
Cohen
JD
,
Li
L
,
Wang
Y
,
Thoburn
C
,
Afsari
B
,
Danilova
L
, et al
.
Detection and localization of surgically resectable cancers with a multi-analyte blood test
.
Science
2018
;
359
:
926
30
.
10.
Diehl
F
,
Schmidt
K
,
Choti
MA
,
Romans
K
,
Goodman
S
,
Li
M
, et al
.
Circulating mutant DNA to assess tumor dynamics
.
Nat Med
2008
;
14
:
985
90
.
11.
Bettegowda
C
,
Sausen
M
,
Leary
RJ
,
Kinde
I
,
Wang
Y
,
Agrawal
N
, et al
.
Detection of circulating tumor DNA in early- and late-stage human malignancies
.
Sci Transl Med
2014
;
6
:
224ra24
.
12.
Douville
C
,
Cohen
JD
,
Ptak
J
,
Popoli
M
,
Schaefer
J
,
Silliman
N
, et al
.
Assessing aneuploidy with repetitive element sequencing
.
Proc Natl Acad Sci U S A
2020
;
117
:
4858
63
.
13.
Douville
C
,
Springer
S
,
Kinde
I
,
Cohen
JD
,
Hruban
RH
,
Lennon
AM
, et al
.
Detection of aneuploidy in patients with cancer through amplification of long interspersed nucleotide elements (LINEs)
.
Proc Natl Acad Sci U S A
2018
;
115
:
1871
6
.
14.
Chan
KC
,
Jiang
P
,
Chan
CW
,
Sun
K
,
Wong
J
,
Hui
EP
, et al
.
Noninvasive detection of cancer-associated genome-wide hypomethylation and copy number aberrations by plasma DNA bisulfite sequencing
.
Proc Natl Acad Sci U S A
2013
;
110
:
18761
8
.
15.
Chan
KC
,
Jiang
P
,
Zheng
YW
,
Liao
GJ
,
Sun
H
,
Wong
J
, et al
.
Cancer genome scanning in plasma: detection of tumor-associated copy number aberrations, single-nucleotide variants, and tumoral heterogeneity by massively parallel sequencing
.
Clin Chem
2013
;
59
:
211
24
.
16.
Murtaza
M
,
Dawson
SJ
,
Tsui
DW
,
Gale
D
,
Forshew
T
,
Piskorz
AM
, et al
.
Non-invasive analysis of acquired resistance to cancer therapy by sequencing of plasma DNA
.
Nature
2013
;
497
:
108
12
.
17.
Mouliere
F
,
El Messaoudi
S
,
Gongora
C
,
Guedj
AS
,
Robert
B
,
Del Rio
M
, et al
.
Circulating cell-free DNA from colorectal cancer patients may reveal high KRAS or BRAF mutation load
.
Transl Oncol
2013
;
6
:
319
28
.
18.
Larson
MH
,
Pan
W
,
Kim
HJ
,
Mauntz
RE
,
Stuart
SM
,
Pimentel
M
, et al
.
A comprehensive characterization of the cell-free transcriptome reveals tissue- and subtype-specific biomarkers for cancer detection
.
Nat Commun
2021
;
12
:
2357
.
19.
Diehl
F
,
Schmidt
K
,
Durkee
KH
,
Moore
KJ
,
Goodman
SN
,
Shuber
AP
, et al
.
Analysis of mutations in DNA isolated from plasma and stool of colorectal cancer patients
.
Gastroenterology
2008
;
135
:
489
98
.
20.
Diehl
F
,
Li
M
,
Dressman
D
,
He
Y
,
Shen
D
,
Szabo
S
, et al
.
Detection and quantification of mutations in the plasma of patients with colorectal tumors
.
Proc Natl Acad Sci U S A
2005
;
102
:
16368
73
.
21.
Thierry
AR
,
El Messaoudi
S
,
Gahan
PB
,
Anker
P
,
Stroun
M
.
Origins, structures, and functions of circulating DNA in oncology
.
Cancer Metastasis Rev
2016
;
35
:
347
76
.
22.
Chabon
JJ
,
Hamilton
EG
,
Kurtz
DM
,
Esfahani
MS
,
Moding
EJ
,
Stehr
H
, et al
.
Integrating genomic features for non-invasive early lung cancer detection
.
Nature
2020
;
580
:
245
51
.
23.
Razavi
P
,
Li
BT
,
Brown
DN
,
Jung
B
,
Hubbell
E
,
Shen
R
, et al
.
High-intensity sequencing reveals the sources of plasma circulating cell-free DNA variants
.
Nat Med
2019
;
25
:
1928
37
.
24.
Mouliere
F
,
Thierry
AR
.
The importance of examining the proportion of circulating DNA originating from tumor, microenvironment and normal cells in colorectal cancer patients
.
Expert Opin Biol Ther
2012
;
12 Suppl 1
:
S209
15
.
25.
Bronkhorst
AJ
,
Ungerer
V
,
Diehl
F
,
Anker
P
,
Dor
Y
,
Fleischhacker
M
, et al
.
Towards systematic nomenclature for cell-free DNA
.
Hum Genet
2021
;
140
:
565
78
.
26.
Anker
P
,
Stroun
M
,
Maurice
PA
.
Spontaneous release of DNA by human blood lymphocytes as shown in an in vitro system
.
Cancer Res
1975
;
35
:
2375
82
.
27.
Liu
MC
,
Oxnard
GR
,
Klein
EA
,
Swanton
C
,
Seiden
MV
;
CCGA Consortium
.
Sensitive and specific multi-cancer detection and localization using methylation signatures in cell-free DNA
.
Ann Oncol
2020
;
31
:
745
59
.
28.
Nuzzo
PV
,
Berchuck
JE
,
Korthauer
K
,
Spisak
S
,
Nassar
AH
,
Abou Alaiwi
S
, et al
.
Detection of renal cell carcinoma using plasma and urine cell-free DNA methylomes
.
Nat Med
2020
;
26
:
1041
3
.
29.
Guo
S
,
Diep
D
,
Plongthongkum
N
,
Fung
HL
,
Zhang
K
,
Zhang
K
.
Identification of methylation haplotype blocks aids in deconvolution of heterogeneous tissue samples and tumor tissue-of-origin mapping from plasma DNA
.
Nat Genet
2017
;
49
:
635
42
.
30.
Guler
GD
,
Ning
Y
,
Ku
CJ
,
Phillips
T
,
McCarthy
E
,
Ellison
CK
, et al
.
Detection of early stage pancreatic cancer using 5-hydroxymethylcytosine signatures in circulating cell free DNA
.
Nat Commun
2020
;
11
:
5270
.
31.
Lo
YMD
,
Han
DSC
,
Jiang
P
,
Chiu
RWK
.
Epigenetics, fragmentomics, and topology of cell-free DNA in liquid biopsies
.
Science
2021
;
372
:
eaaw3616
.
32.
Nelson
WG
,
De Marzo
AM
,
Yegnasubramanian
S
.
Epigenetic alterations in human prostate cancers
.
Endocrinology
2009
;
150
:
3991
4002
.
33.
Mouliere
F
,
El Messaoudi
S
,
Pang
D
,
Dritschilo
A
,
Thierry
AR
.
Multi-marker analysis of circulating cell-free DNA toward personalized medicine for colorectal cancer
.
Mol Oncol
2014
;
8
:
927
41
.
34.
Mouliere
F
,
Robert
B
,
Arnau Peyrotte
E
,
Del Rio
M
,
Ychou
M
,
Molina
F
, et al
.
High fragmentation characterizes tumour-derived circulating DNA
.
PLoS One
2011
;
6
:
e23418
.
35.
Meddeb
R
,
Dache
ZAA
,
Thezenas
S
,
Otandault
A
,
Tanos
R
,
Pastor
B
, et al
.
Quantifying circulating cell-free DNA in humans
.
Sci Rep
2019
;
9
:
5220
.
36.
Newman
AM
,
Bratman
SV
,
To
J
,
Wynne
JF
,
Eclov
NC
,
Modlin
LA
, et al
.
An ultrasensitive method for quantitating circulating tumor DNA with broad patient coverage
.
Nat Med
2014
;
20
:
548
54
.
37.
Yan
L
,
Chen
Y
,
Zhou
J
,
Zhao
H
,
Zhang
H
,
Wang
G
.
Diagnostic value of circulating cell-free DNA levels for hepatocellular carcinoma
.
Int J Infect Dis
2018
;
67
:
92
7
.
38.
Banini
BA
,
Sanyal
AJ
.
The use of cell free DNA in the diagnosis of HCC
.
Hepatoma Res
2019
;
5
:
34
.
39.
Van den Meersche
K
,
Soetaert
K
,
Van Oevelen
D
.
xsample(): an R function for sampling linear inverse problems
.
J Stat Softw
2009
,
Code Snippets
30
:
1
15
.
40.
Accomando
WP
,
Wiencke
JK
,
Houseman
EA
,
Nelson
HH
,
Kelsey
KT
.
Quantitative reconstruction of leukocyte subsets using DNA methylation
.
Genome Biol
2014
;
15
:
R50
.
41.
Houseman
EA
,
Accomando
WP
,
Koestler
DC
,
Christensen
BC
,
Marsit
CJ
,
Nelson
HH
, et al
.
DNA methylation arrays as surrogate measures of cell mixture distribution
.
BMC Bioinf
2012
;
13
:
86
.
42.
Houseman
EA
,
Christensen
BC
,
Yeh
RF
,
Marsit
CJ
,
Karagas
MR
,
Wrensch
M
, et al
.
Model-based clustering of DNA methylation array data: a recursive-partitioning algorithm for high-dimensional data arising as a mixture of beta distributions
.
BMC Bioinf
2008
;
9
:
365
.
43.
Moss
J
,
Magenheim
J
,
Neiman
D
,
Zemmour
H
,
Loyfer
N
,
Korach
A
, et al
.
Comprehensive human cell-type methylation atlas reveals origins of circulating cell-free DNA in health and disease
.
Nat Commun
2018
;
9
:
5068
.
44.
Loyfer
N
,
Magenheim
J
,
Peretz
A
,
Cann
G
,
Bredno
J
,
Klochendler
A
, et al
.
A DNA methylation atlas of normal human cell types
.
Nature
2023
;
613
:
355
64
.
45.
Hoffman
R
.
Hematology: basic principles and practice
.
New York
:
Churchill Livingstone
;
2000
.
46.
Heitzer
E
,
Auinger
L
,
Speicher
MR
.
Cell-free DNA and apoptosis: how dead cells inform about the living
.
Trends Mol Med
2020
;
26
:
519
28
.
47.
Jahr
S
,
Hentze
H
,
Englisch
S
,
Hardt
D
,
Fackelmayer
FO
,
Hesch
RD
, et al
.
DNA fragments in the blood plasma of cancer patients: quantitations and evidence for their origin from apoptotic and necrotic cells
.
Cancer Res
2001
;
61
:
1659
65
.
48.
Schwarzenbach
H
,
Hoon
DS
,
Pantel
K
.
Cell-free nucleic acids as biomarkers in cancer patients
.
Nat Rev Cancer
2011
;
11
:
426
37
.
49.
Kinde
I
,
Wu
J
,
Papadopoulos
N
,
Kinzler
KW
,
Vogelstein
B
.
Detection and quantification of rare mutations with massively parallel sequencing
.
Proc Natl Acad Sci U S A
2011
;
108
:
9530
5
.
50.
Ili
C
,
Buchegger
K
,
Demond
H
,
Castillo-Fernandez
J
,
Kelsey
G
,
Zanella
L
, et al
.
Landscape of genome-wide DNA methylation of colorectal cancer metastasis
.
Cancers
2020
;
12
:
2710
.
51.
Cao
L
,
Quan
XB
,
Zeng
WJ
,
Yang
XO
,
Wang
MJ
.
Mechanism of hepatocyte apoptosis
.
J Cell Death
2016
;
9
:
19
29
.
52.
Jiang
P
,
Chan
KCA
,
Lo
YMD
.
Liver-derived cell-free nucleic acids in plasma: biology and applications in liquid biopsies
.
J Hepatol
2019
;
71
:
409
21
.
53.
Aucamp
J
,
Bronkhorst
AJ
,
Badenhorst
CPS
,
Pretorius
PJ
.
The diverse origins of circulating cell-free DNA in the human body: a critical re-evaluation of the literature
.
Biol Rev Camb Philos Soc
2018
;
93
:
1649
83
.
54.
Gogenur
M
,
Burcharth
J
,
Gogenur
I
.
The role of total cell-free DNA in predicting outcomes among trauma patients in the intensive care unit: a systematic review
.
Crit Care
2017
;
21
:
14
.
55.
Lo
YM
,
Rainer
TH
,
Chan
LY
,
Hjelm
NM
,
Cocks
RA
.
Plasma DNA as a prognostic marker in trauma patients
.
Clin Chem
2000
;
46
:
319
23
.
56.
Hu
Z
,
Chen
H
,
Long
Y
,
Li
P
,
Gu
Y
.
The main sources of circulating cell-free DNA: apoptosis, necrosis and active secretion
.
Crit Rev Oncol Hematol
2021
;
157
:
103166
.
57.
Grabuschnig
S
,
Bronkhorst
AJ
,
Holdenrieder
S
,
Rosales Rodriguez
I
,
Schliep
KP
,
Schwendenwein
D
, et al
.
Putative origins of cell-free DNA in humans: a review of active and passive nucleic acid release mechanisms
.
Int J Mol Sci
2020
;
21
:
8062
.
58.
Ungerer
V
,
Bronkhorst
AJ
,
Van den Ackerveken
P
,
Herzog
M
,
Holdenrieder
S
.
Serial profiling of cell-free DNA and nucleosome histone modifications in cell cultures
.
Sci Rep
2021
;
11
:
9460
.
59.
Pastor
B
,
Abraham
JD
,
Pisareva
E
,
Sanchez
C
,
Kudriavstev
A
,
Tanos
R
, et al
.
Association of neutrophil extracellular traps with the production of circulating DNA in patients with colorectal cancer
.
iScience
2022
;
25
:
103826
.
60.
Jiang
P
,
Sun
K
,
Peng
W
,
Cheng
SH
,
Ni
M
,
Yeung
PC
, et al
.
Plasma DNA end-motif profiling as a fragmentomic marker in cancer, pregnancy, and transplantation
.
Cancer Discov
2020
;
10
:
664
73
.
61.
Lo
YM
,
Chan
KC
,
Sun
H
,
Chen
EZ
,
Jiang
P
,
Lun
FM
, et al
.
Maternal plasma DNA sequencing reveals the genome-wide genetic and mutational profile of the fetus
.
Sci Transl Med
2010
;
2
:
61ra91
.
62.
Sin
STK
,
Jiang
P
,
Deng
J
,
Ji
L
,
Cheng
SH
,
Dutta
A
, et al
.
Identification and characterization of extrachromosomal circular DNA in maternal plasma
.
Proc Natl Acad Sci U S A
2020
;
117
:
1658
65
.
63.
Ma
ML
,
Zhang
H
,
Jiang
P
,
Sin
STK
,
Lam
WKJ
,
Cheng
SH
, et al
.
Topologic analysis of plasma mitochondrial DNA reveals the coexistence of both linear and circular molecules
.
Clin Chem
2019
;
65
:
1161
70
.
64.
Thakur
BK
,
Zhang
H
,
Becker
A
,
Matei
I
,
Huang
Y
,
Costa-Silva
B
, et al
.
Double-stranded DNA in exosomes: a novel biomarker in cancer detection
.
Cell Res
2014
;
24
:
766
9
.
65.
Tan
M
,
Xu
FF
,
Peng
JS
,
Li
DM
,
Chen
LH
,
Lv
BJ
, et al
.
Changes in the level of serum liver enzymes after laparoscopic surgery
.
World J Gastroenterol
2003
;
9
:
364
7
.
66.
Cho
YJ
,
Park
YJ
,
Min
SH
,
Ryu
HG
.
The effect of general anesthesia on aminotransferase levels in patients with elevated aminotransferase levels: a single-center 5-year retrospective study
.
Anesth Analg
2015
;
121
:
1529
33
.
67.
Sender
R
,
Milo
R
.
The distribution of cellular turnover in the human body
.
Nat Med
2021
;
27
:
45
8
.
68.
Tomasetti
C
,
Vogelstein
B
.
Cancer etiology. Variation in cancer risk among tissues can be explained by the number of stem cell divisions
.
Science
2015
;
347
:
78
81
.
69.
Bauernhofer
T
,
Kuss
I
,
Henderson
B
,
Baum
AS
,
Whiteside
TL
.
Preferential apoptosis of CD56dim natural killer cell subset in patients with cancer
.
Eur J Immunol
2003
;
33
:
119
24
.
70.
Czystowska
M
,
Gooding
W
,
Szczepanski
MJ
,
Lopez-Abaitero
A
,
Ferris
RL
,
Johnson
JT
, et al
.
The immune signature of CD8(+)CCR7(+) T cells in the peripheral circulation associates with disease recurrence in patients with HNSCC
.
Clin Cancer Res
2013
;
19
:
889
99
.
71.
Saito
T
,
Kuss
I
,
Dworacki
G
,
Gooding
W
,
Johnson
JT
,
Whiteside
TL
.
Spontaneous ex vivo apoptosis of peripheral blood mononuclear cells in patients with head and neck cancer
.
Clin Cancer Res
1999
;
5
:
1263
73
.
72.
Hoffmann
TK
,
Dworacki
G
,
Tsukihiro
T
,
Meidenbauer
N
,
Gooding
W
,
Johnson
JT
, et al
.
Spontaneous apoptosis of circulating T lymphocytes in patients with head and neck cancer and its clinical importance
.
Clin Cancer Res
2002
;
8
:
2553
62
.
73.
McCracken
JM
,
Allen
LA
.
Regulation of human neutrophil apoptosis and lifespan in health and disease
.
J Cell Death
2014
;
7
:
15
23
.
74.
Gelman
MS
,
Ye
XK
,
Stull
R
,
Suhy
D
,
Jin
L
,
Ng
D
, et al
.
Identification of cell surface and secreted proteins essential for tumor cell survival using a genetic suppressor element screen
.
Oncogene
2004
;
23
:
8158
70
.
75.
Welsh
JB
,
Sapinoso
LM
,
Kern
SG
,
Brown
DA
,
Liu
T
,
Bauskin
AR
, et al
.
Large-scale delineation of secreted protein biomarkers overexpressed in cancer tissue and serum
.
Proc Natl Acad Sci U S A
2003
;
100
:
3410
5
.
76.
Jain
RK
.
Normalization of tumor vasculature: an emerging concept in antiangiogenic therapy
.
Science
2005
;
307
:
58
62
.
77.
Kerbel
RS
.
Tumor angiogenesis
.
N Engl J Med
2008
;
358
:
2039
49
.
78.
Grivennikov
SI
,
Greten
FR
,
Karin
M
.
Immunity, inflammation, and cancer
.
Cell
2010
;
140
:
883
99
.
79.
Mattox
AK
,
Wang
Y
,
Springer
S
,
Cohen
JD
,
Yegnasubramanian
S
,
Nelson
WG
, et al
.
Bisulfite-converted duplexes for the strand-specific detection and quantification of rare mutations
.
Proc Natl Acad Sci U S A
2017
;
114
:
4733
8
.
80.
Wang
Y
,
Douville
C
,
Cohen
JD
,
Mattox
A
,
Curtis
S
,
Silliman
N
, et al
.
Detection of rare mutations, copy number alterations, and methylation in the same template DNA molecules
.
Proc Natl Acad Sci U S A
2023
;
120
:
e2220704120
.
81.
Bolger
AM
,
Lohse
M
,
Usadel
B
.
Trimmomatic: a flexible trimmer for Illumina sequence data
.
Bioinformatics
2014
;
30
:
2114
20
.
82.
Xi
Y
,
Li
W
.
BSMAP: whole genome bisulfite sequence MAPping program
.
BMC Bioinf
2009
;
10
:
232
.
83.
Hodges
E
,
Molaro
A
,
Dos Santos
CO
,
Thekkat
P
,
Song
Q
,
Uren
PJ
, et al
.
Directional DNA methylation changes and complex intermediate states accompany lineage specificity in the adult hematopoietic compartment
.
Mol Cell
2011
;
44
:
17
28
.
84.
Kent
WJ
,
Sugnet
CW
,
Furey
TS
,
Roskin
KM
,
Pringle
TH
,
Zahler
AM
, et al
.
The human genome browser at UCSC
.
Genome Res
2002
;
12
:
996
1006
.
85.
Irizarry
RA
,
Ladd-Acosta
C
,
Wen
B
,
Wu
Z
,
Montano
C
,
Onyango
P
, et al
.
The human colon cancer methylome shows similar hypo- and hypermethylation at conserved tissue-specific CpG island shores
.
Nat Genet
2009
;
41
:
178
86
.
86.
Reinius
LE
,
Acevedo
N
,
Joerink
M
,
Pershagen
G
,
Dahlen
SE
,
Greco
D
, et al
.
Differential DNA methylation in purified human blood cells: implications for cell lineage and studies on disease susceptibility
.
PLoS One
2012
;
7
:
e41361
.
87.
Jung
N
,
Dai
B
,
Gentles
AJ
,
Majeti
R
,
Feinberg
AP
.
An LSC epigenetic signature is largely mutation independent and implicates the HOXA cluster in AML pathogenesis
.
Nat Commun
2015
;
6
:
8489
.
88.
Zhou
J
,
Gao
J
,
Liu
Y
,
Gu
S
,
Zhang
X
,
An
X
, et al
.
Human atrium transcript analysis of permanent atrial fibrillation
.
Int Heart J
2014
;
55
:
71
7
.
89.
Weisenberger
DJ
.
Characterizing DNA methylation alterations from The Cancer Genome Atlas
.
J Clin Invest
2014
;
124
:
17
23
.
90.
Tie
J
,
Cohen
JD
,
Wang
Y
,
Christie
M
,
Simons
K
,
Lee
M
, et al
.
Circulating tumor DNA analyses as markers of recurrence risk and benefit of adjuvant therapy for stage III colon cancer
.
JAMA Oncol
2019
;
5
:
1710
7
.
91.
Cohen
JD
,
Douville
C
,
Dudley
JC
,
Mog
BJ
,
Popoli
M
,
Ptak
J
, et al
.
Detection of low-frequency DNA variants by targeted sequencing of the Watson and Crick strands
.
Nat Biotechnol
2021
;
39
:
1220
7
.
92.
Jones
S
,
Zhang
X
,
Parsons
DW
,
Lin
JC
,
Leary
RJ
,
Angenendt
P
, et al
.
Core signaling pathways in human pancreatic cancers revealed by global genomic analyses
.
Science
2008
;
321
:
1801
6
.
93.
Adalsteinsson
VA
,
Ha
G
,
Freeman
SS
,
Choudhury
AD
,
Stover
DG
,
Parsons
HA
, et al
.
Scalable whole-exome sequencing of cell-free DNA reveals high concordance with metastatic tumors
.
Nat Commun
2017
;
8
:
1324
.

Supplementary data