The genomics of desmoplastic small round cell tumor reveals the deregulation of genes related to DNA damage response, epithelial–mesenchymal transition, and immune response
Cancer Communications volume 38, Article number: 70 (2018)
Desmoplastic small round cell tumor (DSRCT) is a rare, aggressive, and poorly investigated simple sarcoma with a low frequency of genetic deregulation other than an Ewing sarcoma RNA binding protein 1 (EWSR1)-Wilm’s tumor suppressor (WT1) translocation. We used whole-exome sequencing to interrogate six consecutive pre-treated DSRCTs whose gene expression was previously investigated.
DNA libraries were prepared from formalin-fixed, paraffin-embedded archival tissue specimens following the Agilent SureSelectXT2 target enrichment protocol and sequenced on Illumina NextSeq 500. Raw sequence data were aligned to the reference genome with Burrows–Wheeler Aligner algorithm. Somatic mutations and copy number alterations (CNAs) were identified using MuTect2 and EXCAVATOR2, respectively. Biological functions associated with altered genes were investigated through Ingenuity Pathway Analysis (IPA) software.
A total of 137 unique somatic mutations were identified: 133 mutated genes were case-specific, and 2 were mutated in two cases but in different positions. Among the 135 mutated genes, 27% were related to two biological categories: DNA damage-response (DDR) network that was also identified through IPA and mesenchymal–epithelial reverse transition (MErT)/epithelial–mesenchymal transition (EMT) already demonstrated to be relevant in DSRCT. The mutated genes in the DDR network were involved in various steps of transcription and particularly affected pre-mRNA. Half of these genes encoded RNA-binding proteins or DNA/RNA-binding proteins, which were recently recognized as a new class of DDR players. CNAs in genes/gene families, involved in MErT/EMT and DDR, were recurrent across patients and mostly segregated in the MErT/EMT category. In addition, recurrent gains of regions in chromosome 1 involving many MErT/EMT gene families and loss of one arm or the entire chromosome 6 affecting relevant immune-regulatory genes were recorded.
The emerging picture is an extreme inter-tumor heterogeneity, characterized by the concurrent deregulation of the DDR and MErT/EMT dynamic and plastic programs that could favour genomic instability and explain the refractory DSRCT profile.
Desmoplastic small round cell tumor (DSRCT) is a very rare sarcoma arising from the abdominal/pelvic peritoneum that mainly occurs in young male adults and is included in simple sarcomas in the molecular sarcoma classification . DSRCT is characterized by an ominous outcome as current aggressive polychemotherapies do not lead to a significant improvement in response and overall survival ; it is therefore included in the “most wanted” list of sarcomas requiring new agents .
The hallmark of DSRCT is the specific t(1;22)(p13;q12) translocation that leads to the fusion of the Ewing sarcoma RNA binding protein 1 (EWSR1) and Wilm’s tumor suppressor (WT1) genes. Most of other reported molecular alterations occur in single cases. The MET proto-oncogene, receptor tyrosine kinase (c-MET) N375S mutation (the most frequently encountered in lung carcinoma) and two mutations in phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha (PI3KCA) have been described in 2 of 10 pre-treated DSRCTs interrogated using a single-gene polymerase chain reaction (PCR)-based assay . Evidence indicating the involvement of epigenetic regulators has been reported in two cases: the first, investigated using immunohistochemistry (IHC), showed the complete loss of SWI/SNF-related, matrix-associated, actin-dependent regulator of chromatin, subfamily b, member 1 (SMARCB1/INI) protein , and the other, investigated using targeted next-generation sequencing (NGS), showed a mutation in AT-rich interaction domain 1A (ARID1A) gene corresponding to a variant of unknown significance . More recently, whole-exome sequencing (WES) analysis of a thoroughly investigated, pre-treated case revealed the presence of 15 acquired somatic mutations , 7 of which were regulated by the same lymphoid enhancer binding factor 1 (LEF1) transcription factor which, in addition to being involved in the Wnt/beta-catenin signalling pathway, is a facilitator of the epithelial–mesenchymal transition (EMT) . Finally, amplification of aurora kinase B (AURKB) and MCL1, BCL2 family apoptosis regulator 1 (MCL1) has been reported in one case of DSRCT interrogated using targeted NGS , and gains in chromosome 5 and 18 and loss at 13q have been detected in one case interrogated using WES and comparative genomic hybridization . In the present study, we used WES to investigate six cases of DSRCT known to carry the specific EWSR1-WT1 translocation and was previously investigated using gene expression profiling complemented by immunophenotyping, microRNA (miRNA) in situ hybridisation (ISH), and a cancer stem cell array analysis .
Materials and methods
Patients and study design
Seven consecutive cases of primary DSRCT that were surgically removed after multi-drug chemotherapy between 2000 and 2016 were analyzed. The presence of the EWSR1-WT1 translocation was detected using fluorescent in situ hybridisation coupled with immunolabeling restricted to WT-C19 . The clinical data (including follow-up data when available) were obtained from the patients’ records and were previously described . The study was approved by the Independent Ethics Committee of the Fondazione IRCCS Istituto Nazionale dei Tumori di Milano. All patients gave their written consent to donating the tissue remaining after diagnostic procedures. One of the seven cases was excluded from WES analysis due to insufficient DNA quantity.
DNA library preparation
The WES analysis was made using materials dissected from ten 7-μm methylene blue-stained sections of non-necrotic tissue representative of tumoral areas, which were paired with the corresponding adjacent normal tissue in formalin-fixed, paraffin-embedded (FFPE) archival tissue specimens. DNA was extracted using a GeneRead DNA FFPE kit (Qiagen, Germantown, MD, USA) and quantified using a Qubit Fluorometer (ThermoFisher, Waltham, MA, USA). Shearing of 500 ng of DNA was carried out in 50 μl of 1× TE buffer using a Covaris M220 equipped with microTUBE AFA fibre tubes and SonoLab 188.8.131.52 software (Covaris Inc., Woburn, MA, USA). The resulting size distribution, which was about 160 bp in all cases, was checked using a TapeStation 4200 (Agilent, Santa Clara, CA, USA). The libraries were prepared using SureSelectXT2 Reagent kit (Agilent) in accordance with the manufacturer’s instructions and included the end-repair of fragmented DNA, A-tailing, adapter ligation and amplification, with purifications being carried out between each step using Agencourt AMPure XP beads (Agilent). The yield of the constructed libraries was estimated using a Qubit dsDNA HS kit (Qiagen). Exomes were captured using SureSelectXT2 Human All Exon probes (Agilent). Hybridisation of the pooled libraries with the capture probes, the removal of any non-hybridised library molecules, and PCR amplifications were carried out in accordance with Agilent SureSelectXT2 instructions. The captured libraries were then sequenced on a NextSeq 500 sequencer (Illumina, San Diego, CA, USA), with sample dilution, flow cell loading, and sequencing being carried in accordance with Illumina specifications.
Sequencing data analysis
The workflow of the analyses is reported in Additional file 1: Figure S1. Raw unmapped reads of the tumor and normal sample were aligned to the human reference genome (hg19 build) using the Burrows–Wheeler Aligner (BWA)-enrichment application v2.1.0 of Illumina BaseSpace . After alignment, we removed, from sorted and indexed compressed binary version of Sequence Alignment Map (BAM) files, unmapped reads with samtools v1.3.1  and duplicate reads with Picard software v1.79 (http://broadinstitute.github.io/picard/). We then used Genome Analysis Toolkit (GATK) v3.7  to perform left alignment of small insertions and deletions (indels), indel realignment, and base quality score recalibration. Concordance and cross sample contamination was assessed using the computational method Conpair which detects cross sample contamination among tumor-normal pairs at levels as low as 0.1%, even in presence of copy number changes . Somatic variant calling was performed on tumor-normal pairs using MuTect2 v3.7 . The criteria to remove possible false positive mutations induced by FFPE or oxidative DNA damage [15, 16] are reported in the legend of Additional file 1: Figure S1. Oncotator  was used to annotate point mutations and indels with functional data and functional consequences. A mutation was defined deleterious if at least one algorithm among MutationAssessor , MutationTaster , and SIFT  assigned a deleterious effect on the function of a protein affected by somatic mutation.
Somatic CNAs from WES data were identified using EXCAVATOR2 . The mutational signatures designated in Catalogue Of Somatic Mutations In Cancer (COSMIC; https://cancer.sanger.ac.uk/cosmic) were analyzed on the basis of the SNVs and their sequence context, considering the immediately flanking 5′ and 3′ nucleotides using the R package deconstructSigs . This analysis was performed aggregating mutations from all six cases.
Functional pathway and manual curation analyses
Mutations were analyzed for presence in COSMIC and in the Cancer Census collection of genes which contain mutations that have been causally implicated in cancer (downloaded on 18 February 2018 from http://cancer.sanger.ac.uk/census).
Molecular and cellular function analyses of the mutated genes and of the genes present in the aberrant chromosomal regions were carried out using Ingenuity® Pathway Analysis (IPA®, Qiagen; Bioinformatics, Redwood City, CA, USA; http://www.qiagen.com/ingenuity). Manual curation of selected mutated/deregulated genes was done exploiting previous knowledge and the following websites: http://www.ncbi.nlm.nih.gov/pubmed; http://cancer.sanger.ac.uk/cosmic; and https://cancergenome.nih.gov.
A right-tailed Fisher’s exact test, corrected for multiple testing by the Benjamini–Hochberg false discovery rate (FDR), was used to calculate P values for IPA over-representation analysis. An FDR < 0.05 was considered to select significantly enriched pathways. Pearson’s correlation coefficient was used to compute correlation between the total number of mutations and chromosome length. The statistical significance of mutational signatures was calculated according to the R package deconstructSigs . Mutational signatures with > 10% weight (> 0.1) were considered to have substantial contribution to the overall mutational landscape.
Quality control of sequence data
We performed WES in six cases of DSRCT that, on the basis of previously published results of transcriptome profiling, microRNA (miRNA) in situ hybridisation (ISH), and IHC assays, were divided into three groups that recapitulated the traits of mesenchymal–epithelial reverse transition (MErT), hybrid/partial EMT, and EMT . After alignment of raw reads, we evaluated the quality of data in terms of coverage and sample purity. Analysis with Conpair showed that each tumor-normal pair was correctly matched: a sample concordance > 99% and negligible levels of cross-sample contamination among tumor-normal pairs were observed (Additional file 1: Figure S2a, b). The mean coverage, after removal of duplicate reads, ranged from 109.4× to 123.9× for tumor samples and from 52.3× to 71.7× for normal samples (Additional file 1: Figure S2c). We also estimated the percentage of target bases covered at least 50× that ranged from 71.4% to 88.8% for tumor samples and from 46.0% to 71.9% for normal samples (Additional file 1: Figure S2d).
After variant calling and filtering to remove possible artefacts and false positive variants (see criteria in Additional file 1: Figure S1), we identified 137 somatic mutations affecting 135 genes (listed in Additional file 2: Table S1).
Mutation spectrum across the cases
Each case was characterized by a sizeable number of mutations: from 8 to 33 mutations per case (mean, 23) (Fig. 1a). The number of mutations for each case according to chromosomal location indicated a non-preferential pattern of distribution (Fig. 1a) and a positive correlation between the total number of mutations/chromosome and chromosome length (Pearson’s r = 0.67). A consistent proportion of mutations were categorized as missense or intronic, followed by silent mutations (Fig. 1b).
Figure 2 shows the inter-patient heterogeneity of mutated genes across the six DSRCT cases. Among them, 135 were case-specific and only two genes, mucin 19 (MUC19) and glucuronidase, beta pseudogene (GUSBP1), were found mutated in two cases but in different positions.
Comparison between the mutational profile and COSMIC mutational signatures
Before surgery, all patients had received 3–6 cycles of multi-drug chemotherapy including one or two alkylating agents . To evaluate whether the identified mutations may have been due to the dominant effect of chemotherapy, we tested 30 mutational signatures from COSMIC in the aggregated list of our 137 somatic mutations. The comparison between the observed (Fig. 3a) versus the reconstructed (Fig. 3b) mutational profiles with an irrelevant error (Fig. 3c) indicated that the DSRCT profile essentially derived from the contribution of three mutational signatures denominated in COSMC as 1, 3 and 29 (respective weights: 0.36, 0.26, and 0.10); the mutational signature associated with treatment with alkylating agents (signature 11, weight = 0) did not contributed to mutational profile.
Mutated genes, associated pathways, and manual curation
Functional analysis by using IPA indicated that the 135 genes affected by damaging mutations in DSRCTs caused enrichment in the following biological pathways: DNA damage response (DDR) of kidney cell lines, DNA damage, delay in repair of DNA, repair of DNA, DNA damage checkpoint, and DDR of epithelial cell lines (all P < 0.001) (Fig. 4a).
Manual curation of the list of mutated genes was based on the recent evidence that genes encoding RNA-binding proteins (RBPs) and DNA/RNA-binding proteins (DRBPs) as well as other genes related to RNA machinery are strictly connected to DDR  and on our previous data . According to this section, we found that 27% of the 135 mutated genes belonged to the DDR network (Fig. 4b) or to MErT/EMT process (Fig. 4c).
Categorization in these two biological processes showed that each case harboured mutation in at least one gene for each category. Details regarding each gene hereafter described, such as full name, a summary from NCBI Entrez Gene database (http://www.ncbi.nlm.nih.gov/gene) and PubMed (http://www.ncbi.nlm.nih.gov/pubmed) and their presence in COSMIC and in the Cancer Census are reported in Additional file 2: Table S2.
Mutations in the DDR network
The DDR network was affected by 26 mutated genes that can be grouped in three subsets: core genes, genes encoding RBPs, and other genes related to RNA machinery.
A damaging missense mutation in ATR serine/threonine kinase (ATR), one of the two core genes of DDR (the other is serine/threonine kinase, ATM), was found in DSRCT4 of the MErT group. The function abrogation of this gene could allow tumors to escape checkpoint control and could lead to uncontrolled replication (see also references in Additional file 2: Table S2).
Tumor protein p53 (TP53), even not a canonical core gene of DDR, being a master regulator of transcription and clearly associated with DDR, was included in the subset of core genes. The mutation here found in DSRCT5 of the hybrid EMT group was a deleterious missense mutation that was localized in the canonical hotspot disabling the encoded protein. The allele frequency of this mutation was low (12.9%, Additional file 2: Table S1), but this value may be considered in agreement with the strong p53 nuclear immunostaining with a patchwork pattern in half of the tumor cells (data not shown).
Genes encoding RBPs
All six cases had at least one mutation affecting RBP genes. Among the 12 identified mutations, 2 were intronic, 1 was an in-frame deletion, and 6 were missense (Fig. 4b), being mutations in BCL2-associated transcription factor 1 (BCLAF1) and cleavage and polyadenylation factor subunit (PCF11) genes predicted to be damaging.
Beside BCLAF1 and PCF11, three other mutated genes [CWC22 spliceosome-associated protein homolog (CWC22), debranching RNA lariats 1 (DBR1), and adenosine deaminase domain containing 1 (ADAD1)] are involved in pre-mRNA production, defective DNA repair, miRNA and snoRNA regulation with prevalence in the early step of spliceosome editing. In particular, ADAD1 function has not yet been deciphered but, on the basis of the hypothesis that the adenosine deaminase acting on RNA (ADAR) enzyme family controls RNA editing, an alteration in one of its members may affect a wide range of RNA processing activities [24, 25]. The function of HIV-1 Tat specific factor 1 (HTATSF) gene is also largely unknown, but its mutation has been reported to parallel the decreased expression of many genes .
Zinc finger proteins (ZNFs) belong to a number of different structural families. Three out of ZNF mutated genes (ZNF254, ZNF600, and ZNF225) were annotated among the RBP subset because they belong to the class of Cys2-His2 (C2H2) ZNFs and thus are expected to act consistently .
Finally, two additional genes, U2 small nuclear RNA auxiliary factor (U2AF1) and RNA binding motif protein 45 (RBM45), were found mutated at an intron site, and such mutations might have a negative impact on translational efficiency, as reported [28, 29].
Other genes related to RNA machinery
This group included nine genes with a missense mutation, four of which with damaging effect, and two genes with a damaging nonsense mutation (Fig. 4b).
The genes affected by a damaging mutation were ribonucleotide reductase regulatory subunit M2 (RRM2), whose deregulation was reported to impair a key step in DNA synthesis; ARID1A, whose wild type-encoded protein acts as an epigenetic regulator; eukaryotic translation initiation factor 4 gamma 1 (EIF4G1), whose deregulation could negatively affect the correct mRNA circularization and/or translation activity downstream of serine/threonine-protein kinase mTOR; ZNF708, that, like RBPs, is reported to act as interaction module with DNA, RNA, proteins, and other molecules and whose deregulation could affect gene transcription, translation, and mRNA trafficking .
The remaining two genes [ubiquitin specific peptidase 9 X-linked (USP9X) and WW and C2 domain containing 1 (WWC1)], harbouring damaging mutations, are described in the following subsection since they are also involved in MErT/EMT.
Mutations in MErT/EMT genes
Eight mutated genes are reported to be directly involved in MErT/EMT. Four genes [transgelin (TAGLN1), ubiquitin specific peptidase 9, X chromosome (USP9X), WW and C2 domain containing 1 (WWC1), and transducing beta like 1 X-linked receptor 1 (TBL1XR1)], harbouring missense damaging mutations, were found in the MErT group, strongly supporting the hypothesis that such mutations induced changes consistent with an epithelial-like phenotype. In particular, the two mutated genes (USP9X and WWC1) present in both biological categories were found in the DSRCT2 case, paradigm of MErT being, according to IHC, the most enriched in epithelial-related molecules .
Two damaging mutations were also found in the actin like 8 (ACTL8) and glutamate metabotropic receptor 7 (GRM7) genes, with each one recorded in one of the cases of the Hybrid EMT group. Interestingly, aberrant expression of ACTL8, a cancer/testis antigen gene, is reported to associate with stem cell-like enrichment and an EMT signature , both of which are characteristics of DSRCT. As GRM7, it has been proposed that its silencing may provide a further mechanism that regulates MErT/EMT by inhibiting TGFbeta/SNAIL via AMPK activation.
A non-damaging missense mutation was detected in dehydrodolichyl diphosphate synthase subunit (NUS1) gene in a case belonging to the EMT group. High expression of this gene is reported to associate with EMT, and its silencing with MErT. Furthermore, missense non-damaging mutations were observed in two genes, deasmoglein2 (DSG2) and calcium-responsive transcription factor (CARF), whose involvement in MErT is indirectly suggested: for DSG2, by its belonging to a cadherin cell adhesion molecule superfamily; for CARF, by its contribution to WNT signalling activation.
Regarding the non-damaging splice-site mutation in TYRO3 protein tyrosine kinase (TYRO3), the aberrant SNAIL-mediated expression of TYRO3 is reported to be associated with EMT.
CNA landscape of DSRCT
The pattern of somatic CNAs showed several regions amplified or lost in a case-specific fashion in addition to large amplifications of the long arm of chromosome 1 recurring in more than 50% of cases and complete or partial loss of chromosome 6 present in 50% of cases (Fig. 5a). All CNA events, as details of category, chromosome location, gene in the region and cytoband, are reported in Additional file 2: Table S3.
CNAs in DDR and MErT/EMT genes
Focusing on gain and loss of copy number (Additional file 2: Table S4), we found that a sizable number of CNA-affected genes were recurrent across patients and affected both DDR and MErt/EMT; details for each of the genes cited below are reported in Additional file 2: Table S2.
Among the DDR category, the following genes displayed gain/amplification (Fig. 5b, left part): TATA box-binding protein-associated factor (TAF7), amplified in two cases, and reported to be involved in very early steps of transcription; high mobility group nucleosome-binding domain 1 (HMGN1) and bromodomain and WD repeat domain containing 1 (BRWD1), playing a role in chromatin structure and remodelling. A homozygous deletion of ataxin 2 (ATXN2), codifying for a RBP, was observed; it has been reported that silencing of ATXN2 gene, which is known to affect neurodegenerative diseases, led to disturbance in RNA transcription.
Overall CNAs mostly segregated in the MErT/EMT category (Fig. 5b, right part). In particular, in DSRCT2 we found high amplification of many genes associated to squamous or terminal squamous differentiation, late cornified envelope (LCE, 18 genes) gene family, small proline-rich protein family (SPRR, 11 genes), and cysteine rich C-terminal 1 (CRCT1), to sulfur hair keratin (keratin-associated protein, KRTAP, 33 genes) family, and brain-specific cadherin-like adhesion molecules (protocadherin, PCDH, 54 genes). Many of these genes showed recurrent gains (three copies) in DSRCT5 and 6, and genes of the PCDH family were also present in DSRCT6.
ETS proto-oncogene 2, transcription factor (ETS2) amplification was present in DSRCT4 of the MErT group. It has been reported, at preclinical level, that ETS2 is a specific master factor, able to promote hybrid EMT by directly binding (and thus preventing) miRNA-200 transcription.
FOXQ1 and FOXF2 genes on chromosome 6 were present in homozygous deletion in DSRTC5 of the hybrid EMT group and in heterozygous deletion in DSRTC2, 3, and 7.
It has been demonstrated that FOXQ1 is a critical mediator of EMT, and, in our cases, the null (DSRCT5) or attenuated (DSRCT2, 3, and 7) profiles dictated by the defective status of the gene were in line with both the gain/amplification of the epithelial-related genes found here (DSRCT2, 5, and 7) and the previously reported expression of E-cadherin/miRNA-200 module in DSRCT2 and 5 .
In vitro FOXF2 loss acts as an EMT-suppressing transcription factor whose deficiency induces EMT through twist family bHLH transcription factor 1 (TWIST) up-regulation, a finding consistent with zinc finger E-box binding homeobox 1 (ZEB1) expression observed in DSRCT3, 5, and 7 .
Imbalances of chromosomes 1 and 6
A recurrent gain of the long arm of chromosome 1 was identified in four cases (DSRCT2, 3, 5, and 7) (Fig. 6a), loss of the entire chromosome 6 in two cases (DSRCT3 and 7), and loss restricted to the long arm of chromosome 6 in one case (DSRCT5) (Fig. 7a and Additional file 1: Figure S3). Given the high frequency of these CNAs across the six cases, we hypothesized that these regions could contain genes relevant to DSRCT biology, and we performed a functional analysis through IPA.
Regarding chromosome 1, 410 genes were commonly amplified in the four cases (Additional file 2: Table S5). Remarkably, the majority of amplified genes was found to be associated with “Cell movement” (77 amplified genes) and “Cell migration” (68 amplified genes, also present in Cell movement) (both P < 0.001) (Fig. 6b and Additional file 2: Table S6). Among these genes, laminin subunit gamma 2 (LAMC2), paired related homeobox 1 (PRRX1), and regulator of G protein signaling 4 (RGS4) are reported to be greatly involved in the dynamic and reversible MErT/EMT process.
Among the 526 genes mapping on chromosome 6q and lost in half of the cases (Additional file 2: Table S7), the category “cellular assembly and organization, DNA replication, recombination, and repair” most significantly enriched in lost genes was involved in the DDR network with 21 gene members of the histone H1 family and related to formation of nucleosomes (P < 0.001) (Fig. 7b).
Finally, since the short arm of chromosome 6 carries the major histocompatibility complex (Additional file 2: Table S7) which encodes HLA class I antigens, mandatory for tumor cells to be recognized by cytotoxic T cells, the two monosomic cases (DSRCT3 and 7) are expected to show characteristics of immunoescape . Further strength to a deficient immune response in these two cases is given by the loss of the following genes located on short arm of chromosome 6: proteasome subunit beta 8 (PSMB8) and proteasome subunit beta 9 (PSMB9), transporter 1, ATP-binding cassette subfamily B member (TAP1) transporter 2, ATP-binding cassette subfamily B member (TAP2), and TAP-binding protein (TAPBP) associated with antigen presentation; and interferon regulatory factor 4 (IRF4, also named MUM1) recently demonstrated to be fundamental in generation of Type 1 T helper (Th1) response .
It has been reported that DSRCT carries a driven translocation, EWSR1-WT1 , and is a simple sarcoma with a low frequency of genetic deregulations [4,5,6,7]. Our WES and CNA analyses showed that a sizable number of somatic mutations, CNAs, and chromosome imbalances affected genes involved in DDR network, MErT/EMT, and immune response.
Molecular classification of sarcomas splits these tumors into two broad categories: simple sarcomas with near diploid karyotype and simple genetic alterations including translocation or specific activating mutations; complex sarcomas with complex/unbalanced karyotypes . Even if both sarcoma categories show a low somatic mutation burden , this is particularly true for simple sarcomas (to which DSRCT belongs) which are thought to be governed by a gene translocation (the EWSR1-WT1 translocation in the case of DSRCT). Consistently, in a comprehensive and integrated genomic characterization of adult sarcomas , the 10 cases of synovial sarcoma, the only analyzed type of simple sarcomas, displayed few CNAs and very few mutations with any recurrent one. Interestingly, synovial sarcoma shares many features with DSRCT being characterized by a similar gamut of morphologic/phenotypic changes, namely shifting from epithelial to spindle cell/sarcomatous patterns. Furthermore, a study on malignant pleural mesothelioma, a tumor expressing a refractory profile and carrying EMT features, reported a low mutational burden clustering in DNA repair genes .
Although therapy-induced alterations could be expected in our DSRCT cases, being all samples taken after at least two cycles of chemotherapy, the comparison of our WES mutational profile with COSMIC cancer signatures pointed to absence of mutation profiles associated with treatment. Of note, Signature 1 is the result of an endogenous mutational process and is a common signature in most human cancer types, whereas Signature 3 indicates defective homology-directed double-strand DNA break repair and is associated with germline and somatic BRCA1 and BRCA2 mutations in breast, pancreatic, and ovarian cancers . Both COSMIC Signatures 1 and 3 were present in sarcomas . Signature 29 has been found in oral cancer samples from individuals with a tobacco chewing habit, but there is evidence that it might be associated with nucleotide excision repair defects.
Overall, considering our WES and CNAs data, we focused our analyses on the DDR network and MErT/EMT pathways, hypothesizing that the co-occurrence of a tumor with refractory characteristics and deregulation of these specific network/pathways may contribute to the distinctive traits of DSRCT.
Given the biological relevance of genes belonging to the DDR network or associated with DDR, including RBPs , it has been suggested that their mutations should be added to the list of cancer hallmarks  as the eleventh item under the name of “epigenetic and RNA deregulation” . RBPs play a pivotal role in maintaining genome integrity  and, acting as a hub, interact with proteins, coding and non-coding RNAs to create a network that regulates crucial steps in the process from the site of DNA damage to pre-mRNAs encoding DDR proteins . As a result of this interplay, RBP alterations are drivers in oncogenesis and lead to a wide range of cellular dysfunctions despite of their low expression levels .
After pathway analysis and manual curation of the entire list of identified mutated genes, we observed that 26 of the 135 genes belonged directly/indirectly to the DDR network. Altogether, the observed DDR network-associated mutations affected genes that play a role in various steps of transcription, more frequently the early steps of RNA biogenesis, but also in termination phase, translation, and miRNA production in addition to genes involved in chromatin remodelling. Noteworthy, the two genes involved in the DSRCT driver translocation are components of the DDR network, being the first an RNA binding protein and the second a zinc finger transcription factor.
Manual curation of the entire list of identified mutated genes also indicated that 10 of them belonged to the MErT/EMT pathway in keeping with our data concerning gene expression on the same DSRCT specimens . In fact, we previously suggested that ZEB1 and miRNA34 (in the form of a miRNA/protein chimera) may drive the shift either towards EMT or MErT according to the prevalence of expression of ZEB1 over that of miRNA34 or the reverse, respectively . literature data showed that an increase in mesenchymal switching may be achieved by means of the ZEB1-induced repression of the RBP epithelial splicing regulatory protein 1 (ESRP1) , but a decrease by means of miRNA34-induced repression of the RBP Musashi1 (Ms1) . Thus, the present data may support the observation that RBPs could be key players in miRNA processing .
The CNAs’ analysis focused to the DDR network and the MErT/EMT pathway showed that, unlike mutations, most CAN events were recurrent and occurred in genes involved in the MErT/EMT process. Cumulatively, the DSRCT CNA profile points to a deregulation of RNA transport/translation and disturbance of chromatin remodeling, coupled with the amplification of the recently described master factor of hybrid EMT (ETS2), along with attenuation of one gene whose deficiency favors EMT (FOXQ1) and one gene whose deficiency favors MErT (FOXF2), in keeping with the ability of the MErT/EMT process to induce or revert EMT; the identification of an hotspot on the chromosome 21, where in a small area were present the amplified ETS2, which promotes EMT, and HMGN1 and BRWD1 are involved in epigenetic regulation, a critical component of the MErT/EMT process.
Finally, the identification of loss of short arm of the chromosome 6 in 2 of the six cases, in line with their strong MErT/EMT traits , complemented and reinforced the suggestion that DSRCT has a “non-inflammatory tumor type” profile and could be included into the “non-inflammed EMT/stem like” category .
Limitations of the present study that should be considered rest essentially in (1) the low number of analyzed cases and (2) the already commented plasticity and time-dependent modulation of the profiles of the three DSRCT groups . Furthermore, we are aware that our focusing only to the DDR network and MErT/EMT pathways may result in a potential underestimation of other altered network/pathways. Further analyses are ongoing in our research group and the public availability of the entire list of mutated genes and CNA losses/gains could enable other researchers to better decipher the entire landscape of DSRCT genomic alterations. Thus, we think that what it remains mandatory is to (1) confirm our new findings in large series of preferably untreated DSRCT cases and (2) functionally prove that the identified genes harbour evidence for a DDR and/or MErT/EMT involvement.
The data described in this paper seem to reveal previously unknown characteristics of DSRCT and, if confirmed, may be useful in elucidating the complex pathogenesis and poor responsiveness to treatment of this type of simple sarcomas as well as in identifying potentially actionable alterations that may allow better tuning of current treatments for this ominous pathology.
In this study, we showed that, in addition to the EWSR1-WT1 translocation (involving two DDR network proteins), another distinguishing trait of DSRCT could be an aberrant/defective DDR. Furthermore, we reconfirmed the relevance of the MErT/EMT process which, together with the deficient DDR, could lead to the tumor extreme heterogeneity, promote genomic instability, and eventually give rise to drug resistance. Finally, the chromosome 6 loss seems to reconfirm the “immunologically ignorant” signature of this rare and ominous tumor.
copy number alteration
DNA/RNA binding protein
DNA damage response RNA binding proteins
desmoplastic small round cell tumor
epithelial mesenchymal transition
Genome Analysis Toolkit
human leukocyte antigen
Ingenuity Pathway Analysis
in situ hybridisation
mesenchymal epithelial reverse transition
polymerase chain reaction
RNA binding protein
Jain S, Xu R, Prieto VG, Lee P. Molecular classification of soft tissue sarcomas and its clinical applications. Int J Clin Exp Pathol. 2010;3:416–29.
Wong HH, Hatcher HM, Benson C, Al-Muderis O, Horan G, Fisher C, Earl HM, Judson I. Desmoplastic small round cell tumor: characteristics and prognostic factors of 41 patients and review of the literature. Clin Sarcoma Res. 2013;3:14.
Janeway KA, Maki RG. New strategies in sarcoma therapy: linking biology and novel agents. Clin Cancer Res. 2012;18:5837–44.
Jiang Y, Subbiah V, Janku F, Ludwig JA, Naing A, Benjamin RS, Brown RE, Anderson P, Kurzrock R. Novel secondary somatic mutations in Ewing’s sarcoma and desmoplastic small round cell tumors. PLoS ONE. 2014;9:e93676.
Soon GS, Petersson F. Beware of immunohistochemistry-report of a cytokeratin-, desmin- and INI-1-negative pelvic desmoplastic small round cell tumor in a 51 year old woman. Int J Clin Exp Pathol. 2015;8:973–82.
Silva JG, Corrales-Medina FF, Maher OM, Tannir N, Huh WW, Rytting ME, Subbiah V. Clinical next generation sequencing of pediatric-type malignancies in adult patients identifies novel somatic aberrations. Oncoscience. 2015;2:187–92.
Ferreira EN, Barros BD, de Souza JE, Almeida RV, Torrezan GT, Garcia S, Krepischi AC, Mello CA, Cunha IW, Pinto CA, et al. A genomic case study of desmoplastic small round cell tumor: comprehensive analysis reveals insights into potential therapeutic targets and development of a monitoring tool for a rare and aggressive disease. Hum Genom. 2016;10:36.
Santiago L, Daniels G, Wang D, Deng FM, Lee P. Wnt signaling pathway protein LEF1 in cancer, as a biomarker for prognosis and a target for treatment. Am J Cancer Res. 2017;7:1389–406.
Negri T, Brich S, Bozzi F, Volpi CV, Gualeni AV, Stacchiotti S, De Cecco L, Canevari S, Gloghini A, Pilotti S. New transcriptional-based insights into the pathogenesis of desmoplastic small round cell tumors (DSRCTs). Oncotarget. 2017;8:32492–504.
Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009;25:1754–60.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, 1000 Genome Project Data Processing Subgroup. The sequence alignment/map (SAM) format and SAMtools. Bioinformatics. 2009;25:2078–9.
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–303.
Bergmann EA, Chen B-J, Arora K, Vacic V, Zody MC. Conpair: concordance and contamination estimator for matched tumor–normal pairs. Bioinformatics. 2016;32:3196–8.
Cibulskis K, Lawrence MS, Carter SL, Sivachenko A, Jaffe D, Sougnez C, Gabriel S, Meyerson M, Lander ES, Getz G. Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nat Biotechnol. 2013;31:213–9.
Do Hongdo, Dobrovic Alexander. Sequence artifacts in DNA from formalin-fixed tissues: causes and strategies for minimization. Clin Chem. 2015;61:64–71.
Costello M, Pugh TJ, Fennell TJ, Stewart C, Lichtenstein L, Meldrim JC, Fostel JL, Friedrich DC, Perrin D, Dionne D. Discovery and characterization of artifactual mutations in deep coverage targeted capture sequencing data due to oxidative DNA damage during sample preparation. Nucleic Acids Res. 2013;41:e67.
Ramos AH, Lichtenstein L, Gupta M, Lawrence MS, Pugh TJ, Saksena G, Meyerson M, Getz G. Oncotator: cancer variant annotation tool. Hum Mutat. 2015;6:E2423–9.
Reva B, Antipin Y, Sander C. Predicting the functional impact of protein mutations: application to cancer genomics. Nucleic Acids Res. 2011;39:e118.
Schwarz JM, Cooper DN, Schuelke M, Seelow D. MutationTaster2: mutation prediction for the deep-sequencing age. Nat Methods. 2014;11:361–2.
Kumar P, Henikoff S, Ng PC. Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat Protoc. 2009;4:1073–81.
D’Aurizio R, Pippucci T, Tattini L, Giusti B, Pellegrini M, Magi A. Enhanced copy number variants detection from whole-exome sequencing data using EXCAVATOR2. Nucleic Acids Res. 2016;44:e154.
Rosenthal R, McGranahan N, Herrero J, Taylor BS, Swanton C. DeconstructSigs: delineating mutational processes in single tumors distinguishes DNA repair deficiencies and patterns of carcinoma evolution. Genome Biol. 2016;17:31.
Wickramasinghe VO, Venkitaraman AR. RNA processing and genome stability: cause and consequence. Mol Cell. 2016;61(4):496–505.
Glisovic T, Bachorik JL, Yong J, Dreyfuss G. RNA-binding proteins and post-transcriptional gene regulation. FEBS Lett. 2008;582:1977–86.
Permuth JB, Reid B, Earp M, Chen YA, Monteiro AN, Chen Z, AOCS Study Group, Chenevix-Trench G, Fasching PA, Beckmann MW. Inherited variants affecting RNA editing may contribute to ovarian cancer susceptibility: results from a large-scale collaboration. Oncotarget. 2016;7:72381–94.
Miller HB, Robinson TJ, Gordân R, Hartemink AJ, Garcia-Blanco MA. Identification of Tat-SF1 cellular targets by exon array analysis reveals dual roles in transcription and splicing. RNA. 2011;17:665–74.
Traci M, Hall TanaKa. Multiple modes of RNA recognition by zinc finger proteins. Curr Opin Struct Biol. 2005;15:357–73.
Pereira B, Billaud M, Almeida R. RNA-binding proteins in cancer: old players and new actors. Trends Cancer. 2017;3:506–28.
Li Y, Collins M, Geiser R, Bakkar N, Riascos D, Bowser R. RBM45 homo-oligomerization mediates association with ALS-linked proteins and stress granules. Sci Rep. 2015;5:14262.
Lunde BM, Moore C, Varani G. RNA-binding proteins: modular design for efficient function. Nat Rev Mol Cell Biol. 2007;8:479–90.
Yang P, Huo Z, Liao H, Zhou Q. Cancer/testis antigens trigger epithelial–mesenchymal transition and genesis of cancer stem-like cells. Curr Pharm Des. 2015;21:1292–300.
Vokes NI, Van Allen EM. Tumor evolution: a problem of histocompatibility. Cell. 2017;171:1252–3.
Mahnke J, Schumacher V, Ahrens S, Kading N, Feldhoff LM, Huber M, Rupp J, Raczkowski E, Mittrucker HW. Interferon regulatory factor 4 controls TH1 cell effector function and metabolism. Sci Rep. 2016;6:36621.
Kang HJ, Park JH, Chen W, Kang SI, Moroz K, Ladanyi M, Lee SB. EWS-WT1 oncoprotein activates neuronal reprogramming factor ASCL1 and promotes neural differentiation. Cancer Res. 2014;74:4526–35.
Taylor BS, Barretina J, Maki RG, Antonescu CR, Singer S, Ladanyi M. Advances in sarcoma genomics and new therapeutic targets. Nat Rev Cancer. 2011;11(8):541–57.
Cancer Genome Atlas Research Network. Comprehensive and integrated genomic characterization of adult soft tissue sarcomas. Cell. 2017;171(950–965):e28.
Carbone M, Gaudino G, Yang H. Recent insights emerging from malignant mesothelioma genome sequencing. J Thorac Oncol. 2015;10:409–11.
Sarras H, Alizadeh Azami S, McPherson JP. In search of a function for BCLAF1. Sci World J. 2010;10:1450–61.
Dutertre M, Vagner S. DNA-damage response RNA-binding proteins (DDRBPs): perspectives from a new class of proteins and their RNA targets. J Mol Biol. 2017;429:3139–45.
Hanahan D, Weinberg RA. Hallmarks of cancer: the next generation. Cell. 2011;144:646–74.
Imielinski M, Berger AH, Hammerman PS, Hernandez B, Pugh TJ, Hodis E, Cho J, Suh J, Capelletti M, Sivachenko A, et al. Mapping the hallmarks of lung adenocarcinoma with massively parallel sequencing. Cell. 2012;150:1107–20.
Nishida K, Kuwano Y, Nishikawa T, Masuda K, Rokutan K. RNA binding proteins and genome integrity. Int J Mol Sci. 2017;18:1341.
Dutertre M, Lambert S, Carreira A, Amor-Guéret M, Vagner S. DNA damage: RNA-binding proteins protect from near and far. Trends Biochem Sci. 2014;39:141–9.
Larsen JE, Nathan V, Osborne JK, Farrow RK, Deb D, Sullivan JP, Dospoy PD, Augustyn A, Hight SK, Sato M, et al. ZEB1 drives epithelial-to-mesenchymal transition in lung cancer. J Clin Invest. 2016;126:3219–35.
Vo DT, Qiao M, Smith AD, Burns SC, Brenner AJ, Penalva LO. The oncogenic RNA-binding protein Musashi1 is regulated by tumor suppressor miRNAs. RNA Biol. 2011;8:817–23.
Ratnadiwakara M, Mohenska M, Änkö ML. Splicing factors as regulators of miRNA biogenesis-links to human disease. Semin Cell Dev Biol. 2017;17:30124–6.
Zou W, Wolchok JD, Chen L. PD-L1 (B7-H1) and PD-1 pathway blockade for cancer therapy: mechanisms, response biomarkers, and combination. Sci Transl Med. 2016;8:328rv4.
Design of research studies: SC, SP. Study supervision: SC, MS, SP. Patient enrollment and clinical diagnoses: SS. Conducting experiments and acquiring data: LDC, DP, GD, SB. Data analysis and interpretation: AD, MD, SC, MS, SP. Manuscript writing and revision: AD, LDC, MD, GD, SB, MS, SC, SP. All authors read and approved the final manuscript.
This work was supported in part through the computational resources and staff expertise provided by the Platform of Integrated Biology at the Department of Applied Research and Technological Development.
The authors declare that they have no competing interests.
Availability of data and materials
All SNVs and CNAs identified in this study and supporting the conclusions of the article are included in Additional file 2: Table S1 for patient mutations (provided as nucleotide and amino acid changes) and Additional file 2: Table S3 for CNA. The raw WES sequencing data are not publicly available since they contain information that could compromise research participant privacy.
Consent for publication
Ethics approval and consent to participate
The study was approved by the Independent Ethics Committee of the Fondazione IRCCS Istituto Nazionale dei Tumori di Milano (INT-MI).
This work was partially supported by Associazione Italiana Ricerca Cancro (IG2013 14102 to Silvana Pilotti).
Workflow of data processing and bioinformatics pipeline for whole-exome sequencing (WES) and copy number alteration (CNA) analyses. Raw sequence data obtained with Illumina NextSeq500 (Illumina, San Diego, CA, USA) were aligned to the reference genome (hg 19) with Burrows-Wheeler Aligner (BWA) algorithm. Somatic mutations and CNAs were identified using MuTect2 (https://software.broadinstitute.org/gatk/documentation/tooldocs/3.8-0/org_broadinstitute_gatk_tools_walkers_cancer_m2_MuTect2.php) and EXCAVATOR2, respectively. We excluded variants matching at least one of the following criteria: 1) if a variant is supported by 1 or more reads in matched normal sample; 2) if read depth of variant position is < 50, or the variant is supported by less than 10 reads; 3) C>T / G>A variants with a frequency less than 0.1 (possible FFPE artifacts); 4) C>A / G>T variants with a frequency less than 0.1 (possible artifactual mutations due to oxidative DNA damage during sample preparation); 5) if indels are supported only by forward or reverse reads. Biological functions associated with altered genes were investigated through Ingenuity Pathway Analysis (IPA®, Qiagen; Bioinformatics, Redwood City, CA, USA; https://www.qiagen.com/ingenuity). Figure S2. Quality control of sequence data analyzed with Conpair. a) each tumor-normal pair was correctly matched with a sample concordance above 99%; b) levels of cross sample contamination among tumor-normal pairs are negligible. c) after removal of duplicates reads, the mean coverage for tumor samples ranged from 109× to 124×, while for normal samples from 52× to 72×. d) The percentage of target bases covering at least 50× ranged from 71.4% to 88.8% for tumor samples and from 46.0% to 71.9% for normal samples. Figure S3. Copy number profiles for chromosome 6. Profiles of log2 ratio values estimated by EXCAVATOR2 for chromosome 6 of each DSRCT case. Segmented values are represented by the red line.
Table S1. List of somatic mutations identified in each patient. Table S2. List, information and literature supply for genes mutated or copy number altered described in the main text as belonging to DDR or MErT/EMT categories. Table S3. List of somatic copy number aberrations identified by EXCAVATOR2. Table S4. List of gains with at least two copies and losses with homozygous deletions. Table S5. List of recurrent amplified genes on chromosome 1. Table S6. List of recurrently amplified genes of chromosome 1, belonging to the two significant biological functions identified by Ingenuity Pathway Analysis (IPA®, Qiagen; Bioinformatics, Redwood City, CA, USA; http://www.qiagen.com/ingenuity). For the entire name of the genes, reported as gene ID, see Table S5. Table S7. List of recurrent deleted genes on chromosome 6.
About this article
Cite this article
Devecchi, A., De Cecco, L., Dugo, M. et al. The genomics of desmoplastic small round cell tumor reveals the deregulation of genes related to DNA damage response, epithelial–mesenchymal transition, and immune response. Cancer Commun 38, 70 (2018). https://doi.org/10.1186/s40880-018-0339-3