Pediatric papillary thyroid carcinoma (PPTC) is clinically distinct from adult-onset disease. Although there are higher rates of metastasis and recurrence in PPTC, prognosis remains highly favorable. Molecular characterization of PPTC has been lacking. Historically, only 40% to 50% of childhood papillary thyroid carcinoma (PTC) were known to be driven by genomic variants common to adult PTC; oncogenic drivers in the remainder were unknown. This contrasts with approximately 90% of adult PTC driven by a discrete number of variants. In this study, 52 PPTCs underwent candidate gene testing, followed in a subset by whole-exome and transcriptome sequencing. Within these samples, candidate gene testing identified variants in 31 (60%) tumors, while exome and transcriptome sequencing identified oncogenic variants in 19 of 21 (90%) remaining tumors. The latter were enriched for oncogenic fusions, with 11 nonrecurrent fusion transcripts, including two previously undescribed fusions, STRN-RET and TG-PBF. Most fusions were associated with 3′ receptor tyrosine kinase (RTK) moieties: RET, MET, ALK, and NTRK3. For advanced (distally metastatic) tumors, a driver variant was described in 91%. Gene expression analysis defined three clusters that demonstrated distinct expression of genes involved in thyroid differentiation and MAPK signaling. Among RET-CCDC6–driven tumors, gene expression in pediatric tumors was distinguishable from that in adults. Collectively, these results show that the genomic landscape of pediatric PTC is different from adult PTC. Moreover, they identify genomic drivers in 98% of PPTCs, predominantly oncogenic fusion transcripts involving RTKs, with a pronounced impact on gene expression. Notably, most advanced tumors were driven by a variant for which targeted systemic therapy exists.

Significance:

This study highlights important distinctions between the genomes and transcriptomes of pediatric and adult papillary thyroid carcinoma, with implications for understanding the biology, diagnosis, and treatment of advanced disease in children.

Papillary thyroid carcinoma (PTC) is the most common malignancy in adolescents and young adults (1, 2) and incidence has been increasing across all age groups for the past three decades (3). Pediatric PTC (PPTC), diagnosed among individuals 18 years and under, constitutes less than 2% of all PTC diagnoses; thus, the biology and molecular genetics of pediatric disease has been sparsely characterized relative to adults.

Clinical features of PTC in children differ from those in adults, recognition of which led to the development of pediatric-specific guidelines, first published in 2015 (4). Children with PTC present with more advanced disease, larger primary nodules, and more aggressive features than adults (5). Lymph node involvement and distant metastases are more common in the pediatric population (6, 7). Recurrence rates are also higher, reaching up to 40% in children versus 20% in adults (8, 9). These differences may reflect the host environment, but may also reflect distinct molecular mechanisms underlying oncogenesis and invasiveness in children (10).

In 2014, The Cancer Genome Atlas (TCGA) adopted an integrated multiplatform analysis to define the genomic landscape of 496 PTCs (11). This study demonstrated that PTC is primarily driven by mutually exclusive substitutions in BRAF or RAS (which accounted for 59.7% and 12.9%, of tumors respectively) or by oncogenic fusions, most commonly between RET and either CCDC6 (RET-PTC1) or NCOA4 (RET-PTC3), which comprised an additional 5.3%. Rarer substitutions and fusions were identified in a further 12% of PTCs. These findings affirmed the importance of aberrant signaling through the MAPK and PI3K pathways in thyroid oncogenesis. TCGA further described gene expression patterns according to (i) impact on markers of thyroid differentiation; (ii) activation of the MAPK-ERK signaling pathway; and (iii) a novel BRAF-RAS score (BRS) that designated tumors as BRAF-like or RAS-like. A fundamental shortcoming of TCGA, however, with respect to PPTC, was the inclusion of only 3 tumors from individuals less than 18 years and no individuals less than 12 years at diagnosis.

Molecular characterization of pediatric disease has been slower in arriving. Several studies have adopted PCR-based genotyping or targeted next-generation sequencing (NGS) panels to describe PPTC (12–18), summarized nicely by Bauer (19) and Paulson (20), although such analyses are limited to relatively small cohorts and finite numbers of candidate genetic variants. Nevertheless, they identified that the distribution of oncogenic variants in children differs from those in adults, with substantially lower substitution rates (specifically BRAFV600E and RAS substitutions). More recently two studies have employed targeted next-generation gene panels, identifying oncogenic drivers 69% to 77% of tumors (18, 21). Oncogenic fusions appear to exist in excess, relative to substitutions (13, 15, 21). Finally, there is a significantly greater proportion of PPTC without an identified genomic driver. We recently summarized these studies and found that of 338 tumors analyzed, only 164 (49%) were associated with a genetic alteration (22) while the remainder constituted “dark matter” tumors. This is a meaningful deviation from TCGA, wherein 89.9% of tumors was associated with a solitary oncogenic variant and only 10.1% were dark matter.

The current study sought to identify the oncogenic drivers of pediatric dark matter PTCs using whole-exome sequencing (WES) and transcriptome-wide NGS [RNA sequencing (RNA-seq)], to characterize gene expression patterns in pediatric PTC and to examine these in contrast to adult PTC.

Patients and samples

This study was approved by the Institutional Research Ethics Board and all patients provided informed written consent for participation. Thyroid nodules were snap-frozen from 45 consecutive pediatric patients at our institution. Peripheral blood lymphocytes were collected for germline DNA extraction. An additional 12 archival tumors were obtained from the Cooperative Human Tissue Network, although sample quantity or quality limited analysis to genotyping alone for all but 4 of these [3 PTC and 1 follicular thyroid carcinoma (FTC)]. Five histologically benign lesions (1 follicular adenoma, 4 follicular nodular disease) were included for gene expression analysis. DNA and RNA were extracted using the AllPrep Universal Kit (QIAGEN).

Pathology review

All surgical specimens were reported utilizing a standardized synoptic report. To ensure accurate tumor subclassification using the 2017 WHO classification of thyroid neoplasms (23), with the exception of 4 cases, all specimens were rereviewed by 2 pathologists with endocrine pathology expertise (O. Mete and R. Chami). Four samples lacked diagnostic-quality archival material or images, and thus we relied on the original pathologist's report for diagnosis. Histologic variant could not be determined for 3 PPTCs based on available images and original pathology report.

Genetic analysis: genotyping of common PTC alterations

Allele-specific quantitative PCR or quantitative RT-PCR, using Thyroid Cancer Mutation Detection Kit and the Thyroid Cancer Fusion Detection Kit (Entrogen Inc), was performed to genotype all 52 carcinoma samples for 17 of the most common genetic alterations found in adult thyroid carcinoma (alterations in BRAF, HRAS, KRAS, NRAS, RET, and PAX8 genes). DICER1 analysis was previously reported (22). The human telomerase reverse transcriptase (hTERT) promoter was generated as a single 220 bp amplicon using primers FWD-AGCGCTGCCTGAAACTCG and hTERT-REV-CACAGACGCCCAGGACCG and custom TaqMan probes for the C228T and C250T variants and 7deaza-dGTP modification as described (24). Digital PCR was performed on a QX200 droplet digital PCR System (BioRad Inc) according to manufacturer's directions. Samples were prepared with 2x ddPCR Supermix, forward and reverse primers, and partitioned into a median of approximately 14,000 droplets per well in the QX200 droplet generator. After PCR amplification, samples were read on the QX200 droplet reader using BioRad QuantaSoft software.

Genetic analysis: NGS

Exome and transcriptome sequencing were performed by The Centre for Applied Genomics (TCAG) using established protocols.

Sequencing information, raw FASTQ files and clinical characteristics for the 390 TCGA PTC samples were obtained from the TCGA data portal (https://portal.gdc.cancer.gov/). The files were downloaded and processed using the FusionValidator pipeline (described below).

WES

Exome libraries were prepared using Agilent SureSelect Human Exome Library Preparation V5 kit and the Agilent Bravo Automation System followed by paired-end sequencing on an Illumina HiSeq 2500 platform. Genomic DNA (750 ng) was fragmented to 200-bp on average using a Covaris LE220 instrument. Sheared DNA was end-repaired and the 3′ ends adenylated prior to ligation of adapters with overhang-T. Genomic library was amplified by PCR using 10 cycles and hybridized with biotinylated probes that target exonic regions; the enriched exome libraries were amplified by an additional 8 cycles of PCR. Exomic libraries were validated on a Bioanalyzer 2100 DNA High Sensitivity chip (Agilent Technologies) for size and by qPCR using the Kapa Library Quantification Illumina/ABI Prism Kit protocol (KAPA Biosystems) for quantities. Exome libraries were pooled and sequenced with the TruSeq SBS sequencing chemistry using a V4 high throughput flowcell on a HiSeq 2500 platform following Illumina's recommended protocol. Approximately 6 to 8 gigabases of raw paired end data of 126-bases were generated per exome library.

RNA-seq

Quality of total RNA samples was determined using the Agilent Bioanalyzer 2100 RNA Nano chip. Concentration was measured by Qubit RNA HS Assay on a Qubit fluorometer (Thermo Fisher). RNA library preparation was performed following the NEB NEBNext Ultra II Directional RNA Library Preparation protocol. Briefly, 400 ng of total RNA were used as the input material and enriched for poly-A mRNA, fragmented into the 200- to 300-base range for 4 minutes at 94°C and converted to double stranded cDNA, end-repaired, and adenylated at the 3′ to create an overhang A to allow for ligation of Illumina adapters with an overhang T; library fragments were amplified under the following conditions: initial denaturation at 98°C for 10 seconds, followed by 15 cycles of 98°C for 10 seconds, 60°C for 30 seconds, and 72°C for 30 seconds, and finally an extension step for 5 minutes at 72°C; at the amplification step, each sample was amplified with a different barcoded adapters to allow for multiplex sequencing. One microliter of the final RNA library was loaded on a Bioanalyzer 2100 DNA High Sensitivity chip (Agilent Technologies) to check for size; RNA libraries were quantified by qPCR using the Kapa Library Quantification Illumina/ABI Prism Kit protocol (KAPA Biosystems). Libraries were pooled in equimolar quantities and paired-end sequenced on 2 lanes of a High Throughput Run Mode flowcell with the V4 sequencing chemistry on an Illumina HiSeq 2500 platform following Illumina's recommended protocol to generate paired-end reads of 126-bases in length.

Substitution calling

WES paired reads were aligned to the human genome (hg19/GRCg37) using BWA-MEM (v.07.8). PCR duplicates were identified with Picard MarkDuplicates (v.1.108). Somatic mutations were detected using MuTect2 as part of GATK v.3.5 and Delly v.0.7.1 and annotated with ANNOVAR. Custom filters were implemented to increase specificity of detection as described (25). The mutational burden was calculated by dividing the total number of exonic variants identified by the SSM pipeline by the total capture region of an exome, which is on average 30Mb.

Gene fusions

Gene fusions were detected using an in-house pipeline, FusionValidator (25). In brief, transcripts were identified using an approach that integrates five independent fusion algorithm detection tools: Defuse 0.6.2, ChimeraScan 0.4.5, STAR Fusion 0.7.0, MapSplice 2.1.9, and FusionCatcher 0/99.4d_beta (26–30), and then performs a dynamic realignment against a large data set of approximately 1,300 nonneoplastic samples from 43 different tissues (31) to remove recurrent artifacts and read-throughs. Putative fusions were then validated in silico by transcript assembly. Finally, a machine-learning predictive model was applied to all fusions to generate a final predictive score (the result of a function based on a linear combination of predictor variables). Gene fusions were annotated and Circos plots generated using FusionHub (32). Breakpoints and validation scores are summarized in Supplementary Table S1.

Gene expression analysis

RNA-seq reads were aligned on human genome using STAR 2.4.2 and HT-Seq was used to count reads aligning on every single gene, after removing PCR duplicates and reads mapping on ribosomal RNA, miRNA, and small nucleolar RNA. Raw gene counts were then normalized using the trimmed mean of M-value (TMM) method in the EdgeR package, on genes with at least one read per million bases in at least 3 samples.

K-means clustering was performed on normalized gene expression values and two well established methods were used to identify the number of distinct clusters present across the cohort and determine the optimal number of clusters: the Elbow Method (33) and the Gap Statistic (34). For the elbow method the within-cluster sum of squares for k = 1 to k = 15 was extracted and the wss values per cluster were plotted. For the Gap statistic a max k of 10 with 50 bootstraps was used.

Differentially expressed genes in each cluster were identified using the Limma package in R38. Unsupervised hierarchical clustering with principal component analysis was used to exclude batch effects. P values were adjusted for multiple testing using the Bonferroni correction for FDR. Differentially expressed genes were identified between each cluster and the remainder of the cohort and clusters were considered statistically significant if FDR ⇐ 0.05.

To ascertain whether gene expression differed between pediatric and adult PTC, we first performed unsupervised clustering including all 37 tumors from the present cohort as well as 389 tumors from TCGA. We further performed a generalized linear model likelihood ratio test to compare within-variant expression for the most common oncogenic variants, BRAFV600E (6 pediatric vs. 233 adult tumors), CCDC6/RET (3 pediatric vs. 13 adult), NCOA4/RET (4 pediatric vs. 4 adult), and ETV6-NTRK3 (3 pediatric vs. 5 adult). The latter two analyses lacked statistical power to discriminate gene expression (with only 0 and 4 differentially expressed genes, respectively). P values for the generalized linear model test were adjusted for multiple testing using the Benjamini–Hochberg method for controlling the FDR.

Data availability

Raw exome data files are available at the European Genome-Phenome Archive, accession EGAD00001007502 and RNA-seq data EGAD00001007499. Source data for gene expression are accessible at https://doi.org/10.5683/SP2/IANCYA.

Thyroid differentiation score and ERK score calculation

Thyroid differentiation score (TDS) and ERK score were generated as previously described using Ht-Seq–derived expression scoring (35). The TDS is defined by 16 thyroid-specific genes, associated with thyroid metabolism and function and was determined as the mean of the median-centred rlog value for the 16 genes.

The ERK score was defined by a 52 gene signature derived to reflect MAPK activation levels. Fifty of the 52 ERK genes were identified in our analysis and used to determine ERK score. KIR3DL2 and GTF2A1 L were not identified and were not used in ERK score derivation in the current analysis. Both are associated with ERK upregulation. ERK score was defined as the mean of the median-centered rlog values for the 50 genes. Independently, we derived a MAPK score using a nonparametric, unsupervised method.

Statistical analysis

Analyses were performed using R software 3.5.3, RStudio 1.1.463, GraphPad Prism 7.0 or SPSS v.26. Comparisons of categorical variables were performed via χ2 or Fisher exact test. Comparisons of continuous variables were performed via independent t test. ANOVA tests were conducted when continuous variables between more than 2 groups was required, with Tukey correction used for multiple comparisons. The limit of significance for all analyses was defined as P-value < 0.05 using Bonferroni correction where applicable. Kruskal–Wallis and Wilcoxon rank–sum tests were used to assess significant difference of TDS and ERK scores where indicated.

Demographics, clinical data, and canonical genomic variants

Fifty-one patients with PPTC, 1 with FTC, and 5 with benign nodular disease (BND) were enrolled over an 8-year period. Baseline characteristics are summarized in Table 1, Supplementary Table S2 and Fig. S1. The median age at diagnosis was 13 years (range 5–17). The female:male ratio was 1.04:1. One individual had been exposed to radiation for myeloablation prior to stem-cell transplant. The classical variant of PTC was most common, comprising 33 of 51 PTCs (65%).

Table 1.

Baseline characteristics of study cohort.

Total0–10 yrs10.1–14 yrs14.1–18 yrsPc
Age (yrs): mean ± SD 12.8 ± 2.9 8.5 ± 1.5 12.3 ± 1.1 15.6 ± 0.9 <0.0001 
% Female 56% 36% 63% 59% 0.375 
Size (cm): mean ± SD 2.9 ± 1.6 3.5 ± 1.8 3.2 ± 1.6 2.5 ± 1.4 0.126 
Histologic variant 
 Benign nodule 0.356 
 Classical variant PTC 33 12 14  
 Follicular variant PTC 2b  
 DSV PTC 3a  
 Solid variant PTC  
 Other or undetermined variant PTC  
 FTC  
Nodal metastases 
 N0/Nx 15 10 0.012 
 N1a 10  
 N1b 27 10 11  
Distant metastases      
 M0 28 11 16 0.001 
 M1 11  
 Mx 13  
AI 
 AI absent 10 0.016 
 AI present 29 15  
Total0–10 yrs10.1–14 yrs14.1–18 yrsPc
Age (yrs): mean ± SD 12.8 ± 2.9 8.5 ± 1.5 12.3 ± 1.1 15.6 ± 0.9 <0.0001 
% Female 56% 36% 63% 59% 0.375 
Size (cm): mean ± SD 2.9 ± 1.6 3.5 ± 1.8 3.2 ± 1.6 2.5 ± 1.4 0.126 
Histologic variant 
 Benign nodule 0.356 
 Classical variant PTC 33 12 14  
 Follicular variant PTC 2b  
 DSV PTC 3a  
 Solid variant PTC  
 Other or undetermined variant PTC  
 FTC  
Nodal metastases 
 N0/Nx 15 10 0.012 
 N1a 10  
 N1b 27 10 11  
Distant metastases      
 M0 28 11 16 0.001 
 M1 11  
 Mx 13  
AI 
 AI absent 10 0.016 
 AI present 29 15  

Abbreviations: AI, angioinvasion; DSV, diffuse sclerosing variant; yrs, years.

aOne tumor was most consistent with the DSV, but did not fulfill complete criteria to meet this diagnosis.

bOne patient with poorly differentiated thyroid carcinoma arising from a follicular-variant PTC is classified here with follicular variant.

cOne-way ANOVA for continuous variables and Fisher exact test for categorical variables.

Figure 1.

Histopathologic features by age. A, X, mean; horizontal lines denote median; boxes denote interquartile range; whiskers-95%ile. B, N0, no lymph node (LN) metastases; N1a, LN metastases present in the central neck; N1b, LN metastases in the lateral neck; NX, LN metastases could not be evaluated; M0, no distant metastases; M1, distant metastases present; MX, distant metastases could not be determined. P values reflect one-way ANOVA for size and Fisher exact test for categorical variables.

Figure 1.

Histopathologic features by age. A, X, mean; horizontal lines denote median; boxes denote interquartile range; whiskers-95%ile. B, N0, no lymph node (LN) metastases; N1a, LN metastases present in the central neck; N1b, LN metastases in the lateral neck; NX, LN metastases could not be evaluated; M0, no distant metastases; M1, distant metastases present; MX, distant metastases could not be determined. P values reflect one-way ANOVA for size and Fisher exact test for categorical variables.

Close modal

We segregated the cohort into 3 age categories: predominantly prepubertal (0–10 years), peri-pubertal (10.1–14 years), and predominantly pubertal (14.1–18 years). Younger patients were more likely to have locoregional lymph node metastases (P = 0.012), distally metastatic disease (P = 0.001), and angioinvasion (P = 0.0016). There were no differences in tumor size (P = 0.126) among the three age categories (Fig. 1, Table 1).

Targeted genotyping of the 17 most common thyroid cancer–associated genetic alterations identified variants in 31 of 51 (61%) of malignancies. Substitutions in BRAF, NRAS, and gene fusions (RET and PAX8) were identified in 11 (22%), 1 (2%), 18 (35%), and 1 (2%) tumors respectively. The remaining 20 tumors (39%) comprised the so called “dark matter” samples and were evaluated using WES and RNA-seq.

In addition to the 20 dark-matter samples, an additional 5 BND and 12 tumors with known variants (6 BRAFV600E, 1 NRASQ61R, and 5 RET fusion samples) were included for the purposes of gene expression analysis and to serve as positive controls. In total, a cohort of 37 tumors, enriched for tumors without common genetic variants, underwent exome and transcriptome sequencing.

The mutational landscape of these 37 tumors is summarized in Figs. 2AE. Substitutions accounted for 3 (15%) driver-unknown tumors (Fig. 2B) and gene fusions were identified in 15 (75%; Fig. 2C), and are further described below. There were no pathogenic variants identified in 5 BND, 1 PTC, and 1 FTC (Figs. 2D and E). Altogether, pathogenic substitutions and gene fusions were identified in 30 of 31 (97%) of these next-generation sequenced PPTCs and 49 of 50 (98%) of the overall cohort of PPTCs (excluding the solitary FTC).

Figure 2.

Integrated genomic analysis and patient/tumor characteristics. Each column reflects an individual patient/tumor. A, Demographic and histopathologic characteristics. B, Substitutions (black). C, Oncogenic fusion transcripts (dark gray). D, No genomic variant identified after qPCR and NGS analysis (light gray). E, BND (hatched). F, Distribution of variant class by age in the current cohort vs. adult PTC driver distribution from TCGA. G, Circos plot demonstrating all oncogenic fusions within this series.

Figure 2.

Integrated genomic analysis and patient/tumor characteristics. Each column reflects an individual patient/tumor. A, Demographic and histopathologic characteristics. B, Substitutions (black). C, Oncogenic fusion transcripts (dark gray). D, No genomic variant identified after qPCR and NGS analysis (light gray). E, BND (hatched). F, Distribution of variant class by age in the current cohort vs. adult PTC driver distribution from TCGA. G, Circos plot demonstrating all oncogenic fusions within this series.

Close modal

Somatic substitutions are infrequent drivers of PPTC

WES identified somatic variants in DICER1 in two samples, as previously described (22). Additionally, 1 patient carried a GNAS (c.636G>T:p.Q212H) substitution, which has been previously reported (36), albeit not in PTC. All six of the control BRAFV600E and one NRASQ61R substitution were confirmed by WES. No other candidate driver substitutions or insertion–deletions (indels) were identified in the remaining samples. Thus, WES identified 3 tumors with substitutions that were not identified by the initial genotyping. The mean mutational burden was estimated at 0.5 mutation/Mb.

With one exception, substitutions were mutually exclusive, both with each other and with oncogenic fusions. The aforementioned GNASQ212H substitution was identified in a tumor derived from a child with a pathogenic germline PTEN substitution (c.710delA).

Although canonical substitutions in the hTERT promoter (C228T and C250T) have been associated with poor prognosis in adults, particularly in association with BRAFV600E (37), these hTERT promoter substitutions appear to be rare in children (38, 39). Consistent with prior studies, none of the tumors in this cohort bore either of the canonical hTERT promoter substitutions.

Transcriptome analysis demonstrates abundant and diverse oncogenic fusions

Oncogenic fusion transcripts were identified in 20 of 31 (65%) PPTCs evaluated by RNA-seq and no benign lesions (Fig. 2C, F, and G and Table 2). In addition to the known RET-CCDC6 and RET-NCOA4 transcripts, RNA-seq identified several pathogenic oncogenic fusion transcripts infrequently or not previously reported in adult PTC (AKAP13-RET, TRIM27-RET, TNIP1-RET, ETV6-NTRK3, SQSTM1-NTRK3, TFG-MET, TRIM24-MET, EML4-ALK, and TG-THADA) and two novel oncogenic fusions (STRN-RET and TG-PBF). There were no BRAF fusions. Two of the dark-matter samples expressed RET-CCDC6 and RET-NCOA4 and reflect ‘false negatives’ from the initial qPCR genotyping.

Table 2.

Demographic and histologic features characterized according to oncogenic driver class for all tumors in this cohort.

SubstitutionFusionBNDPc
N 16 34  
Age (yrs): mean ± SD 13.8 ± 2.5 12.0 ± 2.9 14.6 ±2.0 0.033 
Size (cm): mean ± SD 2.9 ± 1.3 3.1 ± 1.8 2.2 ± 0.8 0.486 
% Femalea 44% 53% 100.0% 0.138 
Histologic varianta 
 Benign nodule — — 0.875 
 Classical variant PTC 11 21 —  
 Follicular variant PTC —  
 DSV PTC 1b —  
 Solid variant PTC —  
 Other or undetermined variant PTC —  
 FTC —  
Nodal metastasesa 
 N0/Nx NA 0.001 
 N1a NA  
 N1b 24 NA  
Distant metastasesa 
 M0 11 16 NA 0.338 
 M1 NA  
 Mx 10 NA  
AIa 
 AI absent NA 0.116 
 AI present 21 NA  
SubstitutionFusionBNDPc
N 16 34  
Age (yrs): mean ± SD 13.8 ± 2.5 12.0 ± 2.9 14.6 ±2.0 0.033 
Size (cm): mean ± SD 2.9 ± 1.3 3.1 ± 1.8 2.2 ± 0.8 0.486 
% Femalea 44% 53% 100.0% 0.138 
Histologic varianta 
 Benign nodule — — 0.875 
 Classical variant PTC 11 21 —  
 Follicular variant PTC —  
 DSV PTC 1b —  
 Solid variant PTC —  
 Other or undetermined variant PTC —  
 FTC —  
Nodal metastasesa 
 N0/Nx NA 0.001 
 N1a NA  
 N1b 24 NA  
Distant metastasesa 
 M0 11 16 NA 0.338 
 M1 NA  
 Mx 10 NA  
AIa 
 AI absent NA 0.116 
 AI present 21 NA  

Note: Two tumors where pathogenic variant was not identified were excluded from this analysis.

Abbreviations: DSV, diffuse sclerosing variant; NA, not applicable.

aBenign nodules were excluded from descriptive analysis of sex, histologic variant, nodal, and distant metastasis and AI.

bOne tumor appeared most consistent with the DSV, but did not fulfill complete criteria to meet this diagnosis.

cOne-way ANOVA for continuous variables and Fisher exact test for categorical variables.

Primary data from the 2014 TCGA project were reanalyzed using an in-house bioinformatic pipeline (25). This pipeline accurately identified all oncogenic fusions reported in the initial TCGA analysis, as well as two additional presumed oncogenic fusion transcripts (TPM3-NTRK1 and RAF1-AGGF1) among 47 tumors that had previously been categorized by TCGA as dark matter.

Phenotypes differ based on oncogenic variant class

We evaluated the relationship between oncogenic driver and patient/tumor characteristics. Clinical features were first analyzed according to variant class (fusion vs. substitution). The solitary PPTC without oncogenic variant and 1 FTC were excluded from this analysis. Data are summarized in Table 2. The mean age was 14.0 ± 2.5, 12.0 ± 2.9, and 14.6 ± 2.0 years for patients with tumors driven by fusions, substitutions and for benign lesions, respectively (P = 0.033).

Fusion-driven tumors were more likely to be associated with locoregional lymph node metastases (P = 0.001), while variant class was not significantly associated with differences in the rates of distant metastasis (P = 0.338) or angioinvasion (P = 0.116).

Gene expression analysis defines three distinct tumor clusters and highlights the functional biology of oncogenic variants

Unsupervised hierarchical clustering analysis of the 1,000 most variably expressed genes identified 3 distinct gene expression clusters, shown in Fig. 3A and B. The presence of 3 unique clusters was confirmed independently by k-means clustering analysis using the Elbow and Gap Statistic methods (Supplementary Fig. S1A and S1B, respectively). These 3 clusters correspond broadly to (i) non-BRAFV600E substitution-driven tumors; (ii) BRAFV600E and RET-NCOA4 driven tumors; and (iii) RET-CCDC6, and other fusion-driven tumors (Table 3 and Supplementary Table S3). In contrast to variant class, gene expression cluster was strongly associated with histologic variant (P < 0.001). There were no between-cluster differences in size (P = 0.973), distal metastases (P = 0.333), or angioinvasion (P = 0.834).

Figure 3.

A, Heat map displaying gene expression of the 1,000 most variably expressed genes demonstrating segregation into three distinct clusters as confirmed by elbow and gap statistic methods (Supplementary Fig. S1). B, Multidimensional scaling (MDS) plot visualizing principal component analysis in two dimensions. Tumors cluster closest to other tumors with gene expression most similar to themselves. The difference between samples is calculated as the leading fold change, defined as the root-mean-square of the log2–fold change (FC) between pairs of samples, labeled according to oncogenic variant. A total of 19,714 genes passed filtering and normalization and were used to generate this plot.

Figure 3.

A, Heat map displaying gene expression of the 1,000 most variably expressed genes demonstrating segregation into three distinct clusters as confirmed by elbow and gap statistic methods (Supplementary Fig. S1). B, Multidimensional scaling (MDS) plot visualizing principal component analysis in two dimensions. Tumors cluster closest to other tumors with gene expression most similar to themselves. The difference between samples is calculated as the leading fold change, defined as the root-mean-square of the log2–fold change (FC) between pairs of samples, labeled according to oncogenic variant. A total of 19,714 genes passed filtering and normalization and were used to generate this plot.

Close modal
Table 3.

Demographic and histologic features characterized according to the 3 gene expression clusters as illustrated in Fig. 3A.

Cluster 1Cluster 2Cluster 3Pc
N 11 17  
Age (yrs): mean ± SD 14.2 ± 2.5 12.3 ± 3.4 12.9 ± 2.5 0.224 
Size (cm): mean ± SD 3.1 ± 1.8 3.0 ± 1.8 3.1 ± 1.2 0.973 
% Female 81.8% 47.1% 44.0% 0.134 
Histologic variant 
 Benign nodule <0.001 
 Classical variant PTC 16  
 Follicular variant PTC  
 DSV PTCa  
 Solid variant PTC  
 Other or undetermined variant PTC  
 FTC  
Nodal metastasesb 
 N0/Nx 0.005 
 N1a  
 N1b 10  
Distant metastasesb 
 M0 0.333 
 M1  
 Mx  
AIb 
 AI absent 0.834 
 AI present 11  
Cluster 1Cluster 2Cluster 3Pc
N 11 17  
Age (yrs): mean ± SD 14.2 ± 2.5 12.3 ± 3.4 12.9 ± 2.5 0.224 
Size (cm): mean ± SD 3.1 ± 1.8 3.0 ± 1.8 3.1 ± 1.2 0.973 
% Female 81.8% 47.1% 44.0% 0.134 
Histologic variant 
 Benign nodule <0.001 
 Classical variant PTC 16  
 Follicular variant PTC  
 DSV PTCa  
 Solid variant PTC  
 Other or undetermined variant PTC  
 FTC  
Nodal metastasesb 
 N0/Nx 0.005 
 N1a  
 N1b 10  
Distant metastasesb 
 M0 0.333 
 M1  
 Mx  
AIb 
 AI absent 0.834 
 AI present 11  

Note: Only tumors evaluated by RNA-seq are included.

Abbreviation: DSV, diffuse sclerosing variant.

a

One tumor was most consistent with the DSV, but did not fulfill complete criteria to meet this diagnosis.

bBenign nodules were excluded from descriptive analysis of nodal and distant metastasis and AI.

cOne-way ANOVA for continuous variables and Fisher exact test for categorical variables.

To ascertain whether gene expression in pediatric tumors differed markedly from adult PTC, we applied the same clustering analysis to the entire set of pediatric tumors in addition to 389 PTCs from TCGA (Fig. 4A). Globally, pediatric tumors did not segregate from the adult tumors based on gene expression. When we compared adult and pediatric tumors driven by the same variant, however, CCDC6-RET–driven pediatric tumors showed separable gene expression clustering (Fig. 4B), while BRAFV600E-driven tumors did not (Fig. 4C).

Figure 4.

Clustering of PPTC cohort with 389 adult PTC samples. A, Unsupervised clustering of the 500 most variably expressed genes across all 37 PPTC samples and 389 adult PTC samples from the TCGA project. Expression values are displayed as Z scores after the values of each gene were scaled across samples. B, Analysis of RET-CCDC6 translocated tumors demonstrated 1,192 differentially expressed genes with distinct clustering of three pediatric samples (cyan) from 13 adult samples (navy). C, Subanalysis of BRAFV600E-expressing tumors with 1,730 differentially expressed genes in 6 pediatric samples (cyan) vs. 233 adult (navy) pediatric samples do not cluster separately from adults.

Figure 4.

Clustering of PPTC cohort with 389 adult PTC samples. A, Unsupervised clustering of the 500 most variably expressed genes across all 37 PPTC samples and 389 adult PTC samples from the TCGA project. Expression values are displayed as Z scores after the values of each gene were scaled across samples. B, Analysis of RET-CCDC6 translocated tumors demonstrated 1,192 differentially expressed genes with distinct clustering of three pediatric samples (cyan) from 13 adult samples (navy). C, Subanalysis of BRAFV600E-expressing tumors with 1,730 differentially expressed genes in 6 pediatric samples (cyan) vs. 233 adult (navy) pediatric samples do not cluster separately from adults.

Close modal

We then assessed the impact of genomic variants on the expression of genes related to thyroid differentiation and function. Utilizing a 16-gene TDS (11), tumors expressing the BRAFV600E substitution demonstrated the lowest expression of genes associated with the differentiated state, while BND, FTC, and those expressing TG-PBF and PTEN substitution showed highest expression of these genes (Fig. 5A and B). Tumors in Cluster 2 differed significantly from those in Cluster 3 with respect to expression of DUOX2, PAX8, SLC5A5 (sodium-iodide cotransporter), and thyroid hormone receptor alpha (THRA). Pairwise comparisons of gene expression amongst all genes comprising the TDS is shown in Supplementary Fig. S2.

Figure 5.

A and B, TDS demonstrating mean expression of 16 genes associated with thyrocyte differentiation displayed according to oncogenic variant (A) or gene expression cluster (B). Expression of individual genes comprising the TDS is shown in Supplementary Fig. S3. C and D, ERK score demonstrating expression of genes associated with increasing levels of signaling through the RAS–RAF–MAPK–ERK pathway displayed according to oncogenic variant (C) or gene expression cluster (D). Box plots were generated using Tukey method. Median values are represented by horizontal lines; box heights describe the interquartile ranges; outliers are represented by dots; Kruskal–Wallis test conducted across all samples. UNK, unknown variant.

Figure 5.

A and B, TDS demonstrating mean expression of 16 genes associated with thyrocyte differentiation displayed according to oncogenic variant (A) or gene expression cluster (B). Expression of individual genes comprising the TDS is shown in Supplementary Fig. S3. C and D, ERK score demonstrating expression of genes associated with increasing levels of signaling through the RAS–RAF–MAPK–ERK pathway displayed according to oncogenic variant (C) or gene expression cluster (D). Box plots were generated using Tukey method. Median values are represented by horizontal lines; box heights describe the interquartile ranges; outliers are represented by dots; Kruskal–Wallis test conducted across all samples. UNK, unknown variant.

Close modal

Signaling through the BRAF–RAS–MAPK–ERK pathway was also assessed, as previously described (11), and demonstrated a reciprocal pattern with respect to the TDS, inasmuch as less differentiated tumors demonstrated higher RAS–MAPK–ERK signaling (Fig. 5C). Overall, Cluster 2 and 3 did not differ significantly here, although both were significantly different when compared with Cluster 1 (Fig. 5D).

As there was only a single RAS-associated tumor in this cohort, we were unable to independently derive a BRS. Nonetheless, gauging expression of 67 of 71 genes from the TCGA BRS, allowed segregation into BRAF-like and RAS-like tumors. Tumors associated with Cluster 1 (RAS, PTEN/GNAS, DICER1) demonstrated RAS-like profiles, while (with one exception), Cluster 2 and 3 tumors are enriched in tumors with a BRAF-like profile (Supplementary Table S4).

Finally, we applied Gene Ontology, Gene Set Enrichment Analysis (GSEA), and Gene Set Variation Analysis (Supplementary Fig. S3) using the Kyoto Encyclopedia of Genes and Genomes pathway database (40). The 20 most significantly enriched pathways in each cluster (FDR q-value < 0.01, P-value < 0.0001) were identified by GSEA and are summarized in Supplementary Table S3. Pathways enriched in Clusters 2 and 3 showed significant overlap, including upregulation of pathways related to cell adhesion, extracellular matrix, cytokine–cytokine receptor interaction, MAPK signaling, and p53 signaling, while these were largely downregulated in Cluster 1.

This is the first study to apply unbiased whole-exome and whole-transcriptome analysis to an exclusively pediatric cohort of thyroid tumors. In so doing, it defines a genomic landscape distinct from that in adults, dominated by diverse oncogenic fusions with a substantially smaller proportion of BRAFV600E-driven tumors.

Oncogenic drivers were identified in 98% of tumors, with the majority of the previous unknowns driven by rare oncogenic-fusion transcripts. These results complement the landmark TCGA project wherein only 3 tumors were derived from individuals under 18 years of age; and to other pediatric series that adopted targeted genotyping approaches for the most common variants in adult PTC and demonstrated that approximately 50% of pediatric tumors (41, 42) lacked identified oncogenic variants.

We found that 12 of 32 (38%) PPTCs were driven by fusions that are not among the common adult PTC drivers. Nevertheless, 10 of 12 fusions incorporated an RTK fused to the 3′end of the transcript, a finding with meaningful therapeutic implications, as discussed below.

Given the diversity of fusion transcripts in PPTCs, we propose that the unbiased nature of full transcriptome RNA-seq establishes it as the de facto gold standard for genomic analysis of pediatric thyroid tumors. With decreasing costs and widespread availability, we propose this as an appropriate ‘universal’ test. Moreover, RNA-seq analysis affords the opportunity to evaluate variations in gene expression associated with oncogenic variants, lending biological context to genomic alterations, while targeted RNA analysis methodologies (whether qPCR or NGS-based) are primarily geared towards detection of known fusions, rather than novel transcripts and expression analysis.

Diverse oncogenic fusions dominate the landscape of PPTC

As with prior targeted analyses, we identified a predominance of oncogenic fusion transcripts underlying PPTC, suggesting that pediatric tumors arise primarily from double-stranded DNA breaks, rather than errors in replication, lending insight into the primary oncogenic events and/or failed repair mechanisms that drive these tumors. In addition to the canonical RET-CCDC6, RET-NCOA4, and ETV6-NTRK3, several rare fusion transcripts, uncommon to PTC (although identified in other malignancies) were identified. These include AKAP13-RET (11), TRIM27-RET (43), TNIP1-RET (44), SQSTM1-NTRK3 (45), TFG-MET (46), TRIM24-MET (47), EML4-ALK (48), and TG-THADA (11). In addition, the current study identified 2 novel oncogenic fusion transcripts, STRN-RET and TG-PBF (PTTG1IP).

STRN is located on 2p22.2 and has been reported in the context of PTC when fused to ALK, although it has not been previously reported in association with RET. STRN-ALK can transform cells in vitro and results in tumor formation in mice (49, 50). The STRN-RET fusion retains both the coiled-coil domain of STRN and the tyrosine-kinase domain of RET. As such, we suggest that this novel fusion likely retains the potential to drive tumorigenesis.

TG-PBF, has not been described in human tumors, although overexpression of wild-type PBF, has been associated with tumor induction in xenograft models (51) and a transgenic mouse expressing PBF under the influence of the thyroglobulin promoter demonstrates a pattern of follicular epithelial proliferation and radio-iodine resistance (52). PBF was previously identified as a cancer driver across 34 tumor types (53) and PBF overexpression in human thyroid carcinoma is associated with impaired DNA-damage response and poorer patient outcomes (54). In the solid-variant PPTC described herein, PBF was among the 0.5% most highly expressed transcripts. Thus, although lacking the classic features of an oncogenic (RTK-associated) fusion transcript, the TG-PBF fusion transcript, driving high expression of PBF, may indeed represent a novel oncogenic variant.

Oncogenic variants in well-differentiated thyroid carcinoma are almost invariably mutually exclusive (11, 55). We identified a solitary patient with a germline PTEN variant and a somatic GNAS variant. To our knowledge, alterations in these 2 genes have not previously been described together in PTC, however, Kirschner and colleagues previously demonstrated FTCs developing in mice doubly deficient in PTEN and PRKAR1a (which lies downstream of GNAS activation; refs. 56, 57), emphasizing the impact of convergent activation of the PTEN-mTOR and GNAS-PKA signaling pathways in follicular cell transformation.

FTC is rare in children and this cohort included only a single case. We did not identify a genomic variant in this tumor, although the BRS estimate defined this as a RAS-like tumor, and other series have described enrichment for pathogenic RAS substitutions among FTC (55).

Three gene expression clusters distinguish tumors by TDS and ERK signaling

Gene expression analysis identified 3 distinct clusters. Interestingly, carcinomas driven by DICER1, PTEN/GNAS, and NRAS demonstrated expression profiles more similar to BND than to tumors driven by BRAFV600E or by oncogenic fusions (Fig. 3) and were enriched in immunoglobulin gene expression, as has been recently described (58). Additionally, we were intrigued to find that RET-CCDC6 and RET-NCOA4 fell into separate gene expression clusters. To our knowledge, this is the first report of major differences in gene expression between these two common RET-fusions in humans, and may explain differences in tumor behavior as has been described previously (59). Classical variant histology constituted 0% of Cluster 1 tumors, 100% of Cluster 2 tumors, and 56% of Cluster 3 tumors. Thus, among molecular tests, gene expression appears a better discriminant of lesional architecture than oncogenic variant class (Tables 2 and 3; Supplementary Fig. S4A and S4B).

Cluster 2 tumors (including those driven by BRAFV600E) demonstrate the lowest expression of markers of differentiation [including genes necessary for iodine transport (SLC5A5) and organification (TPO, SLC26A4)] and highest levels of ERK signaling. This may have meaningful therapeutic ramifications with respect to ability of tumors (and metastases) to concentrate radioactive iodine (RAI), as well as response to RAS–RAF–MEK–ERK inhibition, although clinical validation of these associations is certainly warranted. Conversely, Cluster 1 tumors are functionally well differentiated and are rarely associated with metastatic disease.

In light of the clinical differences between pediatric and adult PTC, we anticipated that distinct gene-expression patterns would distinguish tumors between these the pediatric and adult populations. Although we did not find this to be the case overall (Fig. 4A), when focusing exclusively on tumors driven by CCDC6-RET (Fig. 4B), gene expression in children was separable from that in adults. This was not true among BRAFV600E-driven tumors (Fig. 4C). While these studies were underpowered to explore these differences in depth, we note that children with CCDC6-RET-driven tumors were younger than those with BRAFV600E-driven tumors (mean age 10.1 vs. 14.5 years). Clinical differences between younger children and adolescents have been described, as shown in Fig. 1 and prior studies (60, 61), and this finding would support the hypothesis that tumors arising in young (pre-/peri-pubertal) children constitute a biologically and genomically unique category, although further study is clearly necessary. These hypothesis-generating analyses were limited by small sample numbers and merit confirmation with larger sample sizes to establish generalizability.

Tumor sequencing identifies targets for systemic therapy

The prognosis for children with PTC is highly favorable. Nonetheless, 15% to 25% of children present with distally metastatic disease, and of these, only 0% to 16% achieve structural or biochemical remission with current therapies (7, 62–64). In this series, the mean age of individuals with distally metastatic (pulmonary) disease was 10.4 years with 8 of 11 M1 tumors driven by oncogenic fusions. Although only a small proportion of children develop progressive, RAI refractory and/or symptomatic disease, therapeutic implications for such patients, based on the current study are significant. In addition, 58% of children in this study (and 73% with M1 disease) had tumors driven by RET, NTRK, ALK, or MET fusions, all of which are targetable with effective systemic therapies: selpercatinib and pralsetinib (RET), larotrectinib and entrectinib (NTRK) and crizotinib (ALK and MET), while 11 of 52 (21%) carried the BRAFV600E substitution (dabrafenib and vemurafenib). Overall 79% of tumors (and 91% of distally metastatic tumors) were driven by variants with currently available systemic therapies. Thus, we propose that comprehensive molecular analysis be considered for all children with distally metastatic tumors that are either progressive and/or radio-iodine refractory.

There were several limitations to this study. First, given the rarity of pediatric thyroid carcinoma, sample availability was limiting, particularly among the youngest patients. Nonetheless, by prospectively ascertaining high-quality frozen samples, we were able to generate robust genomic data, whereas retrospective analysis of archival samples often limits the quality of data that can be generated using genome-wide approaches. As has been noted in other series, BRAFV600E-driven tumors are infrequent in children and RAS-driven tumors even more so (41, 42, 65). This precluded the independent derivation of a BRAF-RAS score to characterize tumors in the current study. Many of the fusions identified herein have been previously described and associated with thyroid or other malignancies. For novel fusions, notwithstanding biologic plausibility, functional validation is necessary prior to formally ascribing oncogenic potential. Finally, the majority of patients included in this cohort underwent surgery within the past 5 years and many transitioned care upon reaching adulthood, thus we are unable to associate intermediate or long-term outcomes, including recurrence, with underlying genotype. This, however, remains an important area for future investigation, as theoretically, tumor genotype could aid in prognostication and potentially in treatment planning.

Further association of the genomic drivers of PPTC with salient clinical outcomes including radio-iodine responsiveness and achievement of a state of undetectable disease, is an important next step in this process. Such determinations, will require broader sequencing of pediatric tumors and longitudinal studies relating genotype and gene expression to these outcomes.

This study provides an exome- and transcriptome-wide analysis of genomic alterations in childhood PTC. It highlights the genomic distinctions from adult PTC and raises the possibility that these distinctions explain clinical differences that have long been reported between the two. These results provide meaningful insights into the underlying biology of disease and into treatment options for children with progressive and distally metastatic disease.

J.D. Wasserman reports personal fees from Bayer Inc. outside the submitted work. No disclosures were reported by the other authors.

A. Stosic: Formal analysis, investigation, visualization, methodology, writing–original draft, writing–review and editing. F. Fuligni: Software, formal analysis, supervision, writing–review and editing. N.D. Anderson: Formal analysis, investigation, methodology. S. Davidson: Formal analysis, supervision, methodology, writing–review and editing. R. de Borja: Software, formal analysis, investigation, methodology. M. Acker: Resources, project administration. V. Forte: Resources, writing–review and editing. P. Campisi: Resources, writing–review and editing. E.J. Propst: Resources, writing–review and editing. N.E. Wolter: Resources, writing–review and editing. R. Chami: Formal analysis, investigation, writing–review and editing. O. Mete: Formal analysis, investigation, writing–review and editing. D. Malkin: Conceptualization, resources, writing–review and editing. A. Shlien: Conceptualization, resources, software, formal analysis, supervision, funding acquisition, visualization, methodology, writing–review and editing. J.D. Wasserman: Conceptualization, resources, data curation, formal analysis, supervision, funding acquisition, writing–original draft, writing–review and editing.

This work was funded by the C17 Research Network (J.D. Wasserman) and SickKids Foundation (J.D. Wasserman). A. Stosic was the recipient of a Restracomp Graduate Scholarship. A. Shlien is partially supported by an Early Researcher Award from the Ontario Ministry of Research and Innovation, the Canada Research Chair in Childhood Cancer Genomics, and the Robert J. Arceci Innovation Award from the St. Baldrick's Foundation. We are grateful to Sholeh Ghayoori for assistance with the initial stages of this investigation. Anita Villani provided critical review of this manuscript, for which we are also grateful. Tissue samples were provided by the Cooperative Human Tissue Network, which is funded by the National Cancer Institute as well as the Children's Oncology Group (study ARAR15B1). Other investigators may have received specimens from the same subjects. The hTERT promoter assay was developed and validated by Tara Paton and Nunio de Rocha Oliveira Nunes.

The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

1.
Araque
DVP
,
Bleyer
A
,
Brito
JP
. 
Thyroid cancer in adolescents and young adults
.
Future Oncol
2017
;
13
:
1253
61
.
2.
Canadian Cancer Society's Steering Committee on Cancer Statistics
.
Canadian Cancer Statistics 2012
.
Ontario (Canada)
:
Canadian Cancer Society
; 
2012
.
3.
Dermody
S
,
Walls
A
,
Harley
EH
 Jr
. 
Pediatric thyroid cancer: An update from the SEER database 2007–2012
.
Int J Pediatr Otorhinolaryngol
2016
;
89
:
121
6
.
4.
Francis
GL
,
Waguespack
SG
,
Bauer
AJ
,
Angelos
P
,
Benvenga
S
,
Cerutti
JM
, et al
Management guidelines for children with thyroid nodules and differentiated thyroid cancer
.
Thyroid
2015
;
25
:
716
59
.
5.
Hay
ID
,
Johnson
TR
,
Kaggal
S
,
Reinalda
MS
,
Iniguez-Ariza
NM
,
Grant
CS
, et al
Papillary thyroid carcinoma (PTC) in children and adults: comparison of initial presentation and long-term postoperative outcome in 4432 patients consecutively treated at the mayo clinic during eight decades (1936–2015)
.
World J Surg
2018
;
42
:
329
42
.
6.
Hogan
AR
,
Zhuge
Y
,
Perez
EA
,
Koniaris
LG
,
Lew
JI
,
Sola
JE
. 
Pediatric thyroid carcinoma: incidence and outcomes in 1753 patients
.
J Surg Res
2009
;
156
:
167
72
.
7.
Alzahrani
AS
,
Alswailem
M
,
Moria
Y
,
Almutairi
R
,
Alotaibi
M
,
Murugan
AK
, et al
Lung metastasis in pediatric thyroid cancer: radiological pattern, molecular genetics, response to therapy, and outcome
.
J Clin Endocrinol Metab
2019
;
104
:
103
10
.
8.
Rubinstein
JC
,
Herrick-Reynolds
K
,
Dinauer
C
,
Morotti
R
,
Solomon
D
,
Callender
GG
, et al
Recurrence and complications in pediatric and adolescent papillary thyroid cancer in a high-volume practice
.
J Surg Res
2020
;
249
:
58
66
.
9.
Demidchik
YE
,
Demidchik
EP
,
Reiners
C
,
Biko
J
,
Mine
M
,
Saenko
VA
, et al
Comprehensive clinical assessment of 740 cases of surgically treated thyroid cancer in children of Belarus
.
Ann Surg
2006
;
243
:
525
32
.
10.
Kumagai
A
,
Namba
H
,
Saenko
VA
,
Ashizawa
K
,
Ohtsuru
A
,
Ito
M
, et al
Low frequency of BRAF T1796A mutations in childhood thyroid carcinomas
.
The Journal of Clinical Endocrinol Metabol
2004
;
89
:
4280
4
.
11.
Cancer Genome Atlas Research Network
. 
Integrated genomic characterization of papillary thyroid carcinoma
.
Cell
2014
;
159
:
676
90
.
12.
Picarsic
JL
,
Buryk
MA
,
Ozolek
J
,
Ranganathan
S
,
Monaco
SE
,
Simons
JP
, et al
Molecular characterization of sporadic pediatric thyroid carcinoma with the DNA/RNA ThyroSeq v2 next-generation sequencing assay
.
Pediatr Dev Pathol
2016
;
19
:
115
22
.
13.
Prasad
ML
,
Vyas
M
,
Horne
MJ
,
Virk
RK
,
Morotti
R
,
Liu
Z
, et al
NTRK fusion oncogenes in pediatric papillary thyroid carcinoma in northeast United States
.
Cancer
2016
;
122
:
1097
107
.
14.
Ricarte-Filho
JC
,
Li
S
,
Garcia-Rendueles
ME
,
Montero-Conde
C
,
Voza
F
,
Knauf
JA
, et al
Identification of kinase fusion oncogenes in post-Chernobyl radiation-induced thyroid cancers
.
J Clin Invest
2013
;
123
:
4935
44
.
15.
Cordioli
MI
,
Moraes
L
,
Bastos
AU
,
Besson
P
,
Alves
MT
,
Delcelo
R
, et al
Fusion oncogenes are the main genetic events found in sporadic papillary thyroid carcinomas from children
.
Thyroid
2017
;
27
:
182
8
.
16.
Nikita
ME
,
Jiang
W
,
Cheng
S-M
,
Hantash
FM
,
McPhaul
MJ
,
Newbury
RO
, et al
Mutational analysis in pediatric thyroid cancer and correlations with age, ethnicity, and clinical presentation
.
Thyroid
2016
;
26
:
227
34
.
17.
Sassolas
G
,
Hafdi-Nejjari
Z
,
Ferraro
A
,
Decaussin-Petrucci
M
,
Rousset
B
,
Borson-Chazot
F
, et al
Oncogenic alterations in papillary thyroid cancers of young patients
.
Thyroid
2012
;
22
:
17
26
.
18.
Alzahrani
AS
,
Alswailem
M
,
Alswailem
AA
,
Al-Hindi
H
,
Goljan
E
,
Alsudairy
N
, et al
Genetic alterations in pediatric thyroid cancer using a comprehensive childhood cancer gene panel
.
J Clin Endocrinol Metab
2020
;
105
:
dgaa389
.
19.
Bauer
AJ
. 
Molecular genetics of thyroid cancer in children and adolescents
.
Endocrinol Metab Clin North Am
2017
;
46
:
389
403
.
20.
Paulson
VA
,
Rudzinski
ER
,
Hawkins
DS
. 
Thyroid cancer in the pediatric population
.
Genes (Basel)
2019
;
10
:
723
.
21.
Pekova
B
,
Sykorova
V
,
Dvorakova
S
,
Vaclavikova
E
,
Moravcova
J
,
Katra
R
, et al
RET, NTRK, ALK, BRAF and MET fusions in a large cohort of pediatric papillary thyroid carcinomas
.
Thyroid
2020
;
30
:
1771
80
.
22.
Wasserman
JD
,
Sabbaghian
N
,
Fahiminiya
S
,
Chami
R
,
Mete
O
,
Acker
M
, et al
DICER1 mutations are frequent in adolescent-onset papillary thyroid carcinoma
.
J Clin Endocrinol Metab
2018
;
103
:
2009
15
.
23.
Osamura
R
,
Grossman
A
,
Korbonits
M
,
Kovacs
K
,
Lopes
M
,
Matsuno
A
, et al
WHO Classification of Tumours of Endocrine Organs
.
Lyon, France
:
IARC
; 
2017
.
24.
Colebatch
AJ
,
Witkowski
T
,
Waring
PM
,
McArthur
GA
,
Wong
SQ
,
Dobrovic
A
. 
Optimizing amplification of the GC-Rich TERT promoter region using 7-Deaza-dGTP for droplet digital PCR quantification of TERT promoter mutations
.
Clin Chem
2018
;
64
:
745
7
.
25.
Anderson
ND
,
de Borja
R
,
Young
MD
,
Fuligni
F
,
Rosic
A
,
Roberts
ND
, et al
Rearrangement bursts generate canonical gene fusions in bone and soft tissue tumors
.
Science
2018
;
361
:
eaam8419
.
26.
Haas
BJ
,
Dobin
A
,
Li
B
,
Stransky
N
,
Pochet
N
,
Regev
A
. 
Accuracy assessment of fusion transcript detection via read-mapping and de novo fusion transcript assembly-based methods
.
Genome Biol
2019
;
20
:
213
.
27.
Iyer
MK
,
Chinnaiyan
AM
,
Maher
CA
. 
ChimeraScan: a tool for identifying chimeric transcription in sequencing data
.
Bioinformatics
2011
;
27
:
2903
4
.
28.
McPherson
A
,
Hormozdiari
F
,
Zayed
A
,
Giuliany
R
,
Ha
G
,
Sun
MG
, et al
deFuse: an algorithm for gene fusion discovery in tumor RNA-Seq data
.
PLoS Comput Biol
2011
;
7
:
e1001138
.
29.
Nicorici
D
,
Satalan
M
,
Edgren
H
,
Kangaspeska
S
,
Murumagi
A
,
Kallioniemi
O
, et al
FusionCatcher-a tool for finding somatic fusion genes in paired-end RNA-sequencing data
.
BioRxiv
2014
:
011650
.
30.
Wang
K
,
Singh
D
,
Zeng
Z
,
Coleman
SJ
,
Huang
Y
,
Savich
GL
, et al
MapSplice: accurate mapping of RNA-seq reads for splice junction discovery
.
Nucleic Acids Res
2010
;
38
:
e178
.
31.
GTEx Consortium
. 
The genotype-tissue expression (GTEx) project
.
Nat Genet
2013
;
45
:
580
5
.
32.
Panigrahi
P
,
Jere
A
,
Anamika
K
. 
FusionHub: A unified web platform for annotation and visualization of gene fusion events in human cancer
.
PLoS One
2018
;
13
:
e0196588
.
33.
Madhulatha
TS
. 
An overview on clustering methods
.
IOSR Journal of Engineering
2012
;
2
:
719
25
. arXiv:1205.1117.
34.
Yan
M
,
Ye
K
. 
Determining the number of clusters using the weighted gap statistic
.
Biometrics
2007
;
63
:
1031
7
.
35.
Anders
S
,
Pyl
PT
,
Huber
W
. 
HTSeq–a Python framework to work with high-throughput sequencing data
.
Bioinformatics
2015
;
31
:
166
9
.
36.
Williamson
EA
,
Ince
PG
,
Harrison
D
,
Kendall-Taylor
P
,
Harris
PE
. 
G-protein mutations in human pituitary adrenocorticotrophic hormone-secreting adenomas
.
Eur J Clin Invest
1995
;
25
:
128
31
.
37.
Yang
J
,
Gong
Y
,
Yan
S
,
Chen
H
,
Qin
S
,
Gong
R
. 
Association between TERT promoter mutations and clinical behaviors in differentiated thyroid carcinoma: a systematic review and meta-analysis
.
Endocrine
2020
;
67
:
44
57
.
38.
Onder
S
,
Sari
SO
,
Yegen
G
,
Sormaz
IC
,
Yilmaz
I
,
Poyrazoglu
S
, et al
Classic architecture with multicentricity and local recurrence, and absence of TERT promoter mutations are correlates of BRAF (V600E) harboring pediatric papillary thyroid carcinomas
.
Endocr Pathol
2016
;
27
:
153
61
.
39.
Oishi
N
,
Kondo
T
,
Nakazawa
T
,
Mochizuki
K
,
Inoue
T
,
Kasai
K
, et al
Frequent BRAF (V600E) and absence of TERT promoter mutations characterize sporadic pediatric papillary thyroid carcinomas in japan
.
Endocr Pathol
2017
;
28
:
103
11
.
40.
Subramanian
A
,
Tamayo
P
,
Mootha
VK
,
Mukherjee
S
,
Ebert
BL
,
Gillette
MA
, et al
Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles
.
Proc Natl Acad Sci U S A
2005
;
102
:
15545
50
.
41.
Mostoufi-Moab
S
,
Labourier
E
,
Sullivan
L
,
LiVolsi
V
,
Li
Y
,
Xiao
R
, et al
Molecular testing for oncogenic gene alterations in pediatric thyroid lesions
.
Thyroid
2018
;
28
:
60
7
.
42.
Galuppini
F
,
Vianello
F
,
Censi
S
,
Barollo
S
,
Bertazza
L
,
Carducci
S
, et al
Differentiated thyroid carcinoma in pediatric age: Genetic and clinical scenario
.
Front Endocrinol (Lausanne)
2019
;
10
:
552
.
43.
Saenko
V
,
Rogounovitch
T
,
Shimizu-Yoshida
Y
,
Abrosimov
A
,
Lushnikov
E
,
Roumiantsev
P
, et al
Novel tumorigenic rearrangement, Delta rfp/ret, in a papillary thyroid carcinoma from externally irradiated patient
.
Mutat Res
2003
;
527
:
81
90
.
44.
Pietrantonio
F
,
Di Nicolantonio
F
,
Schrock
AB
,
Lee
J
,
Morano
F
,
Fuca
G
, et al
RET fusions in a small subset of advanced colorectal cancers at risk of being neglected
.
Ann Oncol
2018
;
29
:
1394
401
.
45.
Iyama
K
,
Matsuse
M
,
Mitsutake
N
,
Rogounovitch
T
,
Saenko
V
,
Suzuki
K
, et al
Identification of Three novel fusion oncogenes, SQSTM1/NTRK3, AFAP1L2/RET, and PPFIBP2/RET, in thyroid cancers of young patients in fukushima
.
Thyroid
2017
;
27
:
811
8
.
46.
Flucke
U
,
van Noesel
MM
,
Wijnen
M
,
Zhang
L
,
Chen
CL
,
Sung
YS
, et al
TFG-MET fusion in an infantile spindle cell sarcoma with neural features
.
Genes Chromosomes Cancer
2017
;
56
:
663
7
.
47.
Hiemenz
MC
,
Skrypek
MM
,
Cotter
JA
,
Biegel
JA
. 
Novel TRIM24-MET fusion in a neonatal brain tumor
.
JCO Precis Oncol
2019
:
1
6
.
48.
Panebianco
F
,
Nikitski
AV
,
Nikiforova
MN
,
Kaya
C
,
Yip
L
,
Condello
V
, et al
Characterization of thyroid cancer driven by known and novel ALK fusions
.
Endocr Relat Cancer
2019
;
26
:
803
14
.
49.
Bastos
AU
,
de Jesus
AC
,
Cerutti
JM
. 
ETV6-NTRK3 and STRN-ALK kinase fusions are recurrent events in papillary thyroid cancer of adult population
.
Eur J Endocrinol
2018
;
178
:
83
91
.
50.
Nikitski
AV
,
Rominski
SL
,
Wankhede
M
,
Kelly
LM
,
Panebianco
F
,
Barila
G
, et al
Mouse model of poorly differentiated thyroid carcinoma driven by STRN-ALK fusion
.
Am J Pathol
2018
;
188
:
2653
61
.
51.
Stratford
AL
,
Boelaert
K
,
Tannahill
LA
,
Kim
DS
,
Warfield
A
,
Eggo
MC
, et al
Pituitary tumor transforming gene binding factor: a novel transforming gene in thyroid tumorigenesis
.
J Clin Endocrinol Metab
2005
;
90
:
4341
9
.
52.
Read
ML
,
Lewy
GD
,
Fong
JC
,
Sharma
N
,
Seed
RI
,
Smith
VE
, et al
Proto-oncogene PBF/PTTG1IP regulates thyroid cell growth and represses radioiodide treatment
.
Cancer Res
2011
;
71
:
6153
64
.
53.
Melloni
GE
,
Ogier
AG
,
de Pretis
S
,
Mazzarella
L
,
Pelizzola
M
,
Pelicci
PG
, et al
DOTS-Finder: a comprehensive tool for assessing driver genes in cancer genomes
.
Genome Med
2014
;
6
:
44
.
54.
Read
ML
,
Fong
JC
,
Modasia
B
,
Fletcher
A
,
Imruetaicharoenchoke
W
,
Thompson
RJ
, et al
Elevated PTTG and PBF predicts poor patient outcome and modulates DNA damage response genes in thyroid cancer
.
Oncogene
2017
;
36
:
5296
308
.
55.
Yoo
SK
,
Lee
S
,
Kim
SJ
,
Jee
HG
,
Kim
BA
,
Cho
H
, et al
Comprehensive analysis of the transcriptional and mutational landscape of follicular and papillary thyroid cancers
.
PLos Genet
2016
;
12
:
e1006239
.
56.
Pringle
DR
,
Vasko
VV
,
Yu
L
,
Manchanda
PK
,
Lee
AA
,
Zhang
X
, et al
Follicular thyroid cancers demonstrate dual activation of PKA and mTOR as modeled by thyroid-specific deletion of Prkar1a and Pten in mice
.
J Clin Endocrinol Metab
2014
;
99
:
E804
12
.
57.
Kari
S
,
Vasko
VV
,
Priya
S
,
Kirschner
LS
. 
PKA activates AMPK through LKB1 signaling in follicular thyroid cancer
.
Front Endocrinol (Lausanne)
2019
;
10
:
769
.
58.
Cui
M
,
Huang
J
,
Zhang
S
,
Liu
Q
,
Liao
Q
,
Qiu
X
. 
Immunoglobulin expression in cancer cells and its critical roles in tumorigenesis
.
Front Immunol
2021
;
12
:
613530
.
59.
Arighi
E
,
Borrello
MG
,
Sariola
H
. 
RET tyrosine kinase signaling in development and cancer
.
Cytokine Growth Factor Rev
2005
;
16
:
441
67
.
60.
Hampson
S
,
Stephens
D
,
Wasserman
JD
. 
Young age is associated with increased rates of residual and recurrent paediatric differentiated thyroid carcinoma
.
Clin Endocrinol (Oxf)
2018
;
89
:
212
8
.
61.
Lazar
L
,
Lebenthal
Y
,
Steinmetz
A
,
Yackobovitch-Gavan
M
,
Phillip
M
. 
Differentiated thyroid carcinoma in pediatric patients: comparison of presentation and course between pre-pubertal children and adolescents
.
J Pediatr
2009
;
154
:
708
14
.
62.
Chesover
AD
,
Vali
R
,
Hemmati
SH
,
Wasserman
JD
. 
Lung metastasis in children with differentiated thyroid cancer: Factors associated with diagnosis and outcomes of therapy
.
Thyroid
2020
;
31
:
50
60
.
63.
Zhang
XY
,
Song
HJ
,
Qiu
ZL
,
Shen
CT
,
Chen
XY
,
Sun
ZK
, et al
Pulmonary metastases in children and adolescents with papillary thyroid cancer in China: prognostic factors and outcomes from treatment with (131)I
.
Endocrine
2018
;
62
:
149
58
.
64.
Nies
M
,
Vassilopoulou-Sellin
R
,
Bassett
RL
,
Yedururi
S
,
Zafereo
ME
,
Cabanillas
ME
, et al
Distant metastases from childhood differentiated thyroid carcinoma: clinical course and mutational landscape
.
J Clin Endocrinol Metab
2021
;
106
:
e1683
e97
.
65.
Gertz
RJ
,
Nikiforov
Y
,
Rehrauer
W
,
McDaniel
L
,
Lloyd
RV
. 
Mutation in BRAF and other members of the MAPK pathway in papillary thyroid carcinoma in the pediatric population
.
Arch Pathol Lab Med
2016
;
140
:
134
9
.