Skip to main content

Whole exome sequencing and single nucleotide polymorphism array analyses to identify germline alterations in genes associated with testosterone metabolism in a patient with androgen insensitivity syndrome and early-onset colorectal cancer



Androgen insensitivity syndrome (AIS), a disorder of sexual development in 46, XY individuals, is caused by loss-of-function mutations in the androgen receptor (AR) gene. A variety of tumors have been reported in association with AIS, but no cases with colorectal cancer (CRC) have been described.

Case presentation

Here, we present a male patient with AIS who developed multiple early-onset CRCs and his pedigree. His first cousin was diagnosed with AIS and harbored the same AR gene mutation, but with no signs of CRC. The difference in clinical management for the two patients was that testosterone treatment was given to the proband for a much longer time compared with the cousin. The CRC family history was negative, and no germline mutations in well-known CRC-related genes were identified. A single nucleotide polymorphism array revealed a microduplication on chromosome 22q11.22 that encompassed a microRNA potentially related to CRC pathogenesis. In the proband, whole exome sequencing identified a polymorphism in an oncogene and 13 rare loss-of-function variants, of which two were in CRC-related genes and four were in genes associated with other human cancers.


By pathway analysis, all inherited germline genetic events were connected in a unique network whose alteration in the proband, together with continuous testosterone stimulation, may have played a role in CRC pathogenesis.


Tissue resistance to androgens (i.e., testosterone and its 5α-reduced product dihydrotestosterone) leads to androgen insensitivity syndrome (AIS). AIS is the most well-known specific cause of the 46,XY disorder of sex development (DSD), but estimates of its prevalence vary widely, depending on the molecular assessment method used for diagnosis [13]. Pathogenesis of this disorder is essentially the result of mutations in the androgen receptor (AR) gene, but as clearly stated by Gottlieb et al. [4], “… solely identifying the genotype of an AIS individual may no longer be sufficient to draw conclusions about the effect of an AR mutation on the persons’ phenotype…”. The AR gene, located on chromosome Xq11–q12, contains eight exons and seven introns spanning ~90 kb of DNA. It encodes for the androgen receptor, which is a transcription factor and a member of the steroid hormone receptor family within the nuclear receptor superfamily [2]. AR is the only gene linked with AIS. More than 500 different AR mutations, distributed throughout the gene sequence, have been reported [3]. Furthermore, there is variable phenotypic expression related to specific mutations, with identical mutations linked to different phenotypes [3]. The clinical manifestations of AIS vary from the complete form (CAIS), where testosterone has no effect on sexual development and the genitals are entirely female, to milder degrees of undervirilization (the partial form; PAIS) or the mild form (MAIS) in which males have gynecomastia and/or infertility. The lifetime risk of development of neoplasms in patients with AIS, even though not defined accurately due to the small size of the analyzed series, was reported to be 0.8% for CAIS and 5.5%–15% for PAIS, with tumors originating from testicular germ cells, testicular stromal cells, or other mesenchymal cells [58].

To date, no cases of AIS with a documented history of colorectal cancer (CRC) have been described. In this paper, we describe the clinical and molecular analyses of a proband affected by PAIS who developed multiple CRCs. The patient presented with primary CRC at an early age (27 years). With the aim of identifying the constitutional genetic events that could have played a role in the early onset of CRC in the proband, we used a combination of two advanced high-throughput technologies: (i) high-resolution single nucleotide polymorphism (SNP) array analysis to investigate copy number variation (CNV) and uniparental disomy (UPD) events and (ii) whole exome sequencing (WES) analysis to investigate single nucleotide variants (SNVs) and small insertions and deletions (INDELs). A set of inherited germline genetic variations that may be responsible for CRC pathogenesis were identified in this patient.

Case report

Herein, we report a 46,XY male patient (III.2 of the family presents in Fig. 1) who, at birth (1983), presented with undescended testes, severe hypospadias, and a micropenis and who received a clinical and molecular diagnosis of PAIS. At 6 months of age, adrenal hormone dosage results were normal, which excluded adrenal disease. Testosterone enanthate (250 mg every 2 weeks through intramuscular injection) was provided regularly from age 3 to 27 years. At 10 years of age, adrenocorticotropic hormone and beta human chorionic gonadotropin tests excluded a deficit in 5α-reductase. At 13 years of age, gynecomastia was recorded. The proband developed CRC at 27 years old and underwent a left hemicolectomy; at postoperative pathologic examination, the tumor was diagnosed as a G2, stage pT1N0 rectosigmoid adenocarcinoma. When he was 30 years old, a metachronous CRC was diagnosed and treated with segmental resection; at postoperative pathologic examination, the tumor was diagnosed as a G2, stage pT3N0 adenocarcinoma. The tumor relapsed locally 4 months later and was treated with radiotherapy, chemotherapy (capecitabine), and surgical proctectomy. At present, the disease is in progression.

Fig. 1
figure 1

Pedigree of the family. Squares indicate males, and circles represent females. Slashed symbols indicate deceased individuals. Unfilled symbols indicate unaffected individuals. Black dotted circles indicate unaffected individuals carrying the androgen receptor (AR) gene mutation. Half-black shaded symbols represent individuals carrying the AR gene mutation who are affected by partial androgen insensitivity syndrome (PAIS). The blue arrow indicates the proband. The half-blue shaded symbol indicates that the proband presented with colorectal cancer (CRC). y.o. indicates the age at the time of the analysis. The proband developed primary CRC at 27 years old, metachronous CRC at 30 years old, and recurrent CRC at 31 years old

The family history of CRC was negative. The proband was tested for germline mutations in the genes responsible for major CRC syndromes using Sanger sequencing [911], but no pathogenic mutations were identified (Additional file 1: Table S1). Immunohistochemical characterization of mismatch repair proteins as well as microsatellite instability analysis were performed as reported [1214] on the primary CRC tissue. The normal staining pattern for all proteins is nuclear, and a case is considered positive only in the presence of nuclear staining of neoplastic cells. A case is considered defective for the expression of one protein only when there is a complete absence of nuclear staining of neoplastic cells in the presence of an unquestionable internal positive control represented by normal epithelial cells, stromal cells, muscle cells, or lymphocytes. The proband’s CRC tissue revealed no alterations in mismatch repair proteins.

The proband’s cousin (III.3) was also diagnosed with PAIS. He underwent gynecomastia, hypospadia, and cryptorchidism corrective surgeries, and testosterone enanthate (250 mg every 2 weeks through intramuscular injection) was provided in an irregular regimen from age 15 to 20 years. At present, the cousin is healthy.

Molecular analysis of the AR gene through Sanger sequencing [15] of DNA from both III.2 and III.3 revealed a missense mutation in AR exon 7 (NM_000044: c.G2522A; NP_000035.2: p.R841H). By segregation analysis, the AR mutation was found to be inherited from their healthy mothers (II.2, II.3), who were born from consanguineous parents. Furthermore, this variant was present in the DNA of the proband’s sister (III.1).

Molecular analyses

Genomic DNA was extracted from whole blood specimens of the proband and his relatives using a QIAamp DNA Blood Maxi Kit (Qiagen, Hilden, Germany) and from secondary CRC and matched normal colon tissue of the proband using a QIAamp DNA mini Kit (Qiagen). Genomic DNA from whole blood samples from family subjects (II.1, II.2, II.3, III.2, and III.3) and from CRC tissue of the proband was subjected to high-resolution genome-wide SNP array analysis using a Cytoscan HD array (Affymetrix, Inc., Santa Clara, CA, USA) and WES using the SOLiD 5500xl WF platform (Life Technologies, Gaithersburg, MD, USA). WES was also performed on genomic DNA isolated from the normal colon mucosa of the proband. Two predictive tools, i.e., sorting intolerant from tolerant (SIFT) [16] and polymorphism phenotyping version 2 (PolyPhen-2) [17], were used to predict the effect of coding non-synonymous variants on protein function. To ensure the selection of genes associated with the currently best-characterized hereditary CRC syndromes, a review of the CRC panel-based clinical genetic tests listed in the GeneTest database was performed using “colorectal cancer” and “colon cancer” as key words. The final list of genes was then used to query in the PubMed and Online Mendelian Inheritance in Man (OMIM) databases to verify the association between the genes and a CRC predisposition. Genes that were found to be significantly somatically mutated in CRC patients and reported in the TumorPortal website ( were included. A final list of 57 genes was used for analysis of the WES data (Additional file 2: Table S2; Additional file 3: Tables S3).

A network investigation of the identified mutated genes and genes harbored in the aberrant chromosomal regions was performed using Ingenuity® Pathway Analysis (IPA®, QIAGEN Redwood City,; Version 23814503—23 March 2015).

Identification of genetic events involved in PAIS

To identify the potential involvement of further genomic imbalances in the etiology of PAIS, we first evaluated the CNVs identified in the two affected cousins. Shared chromosomal regions identified as being deleted or duplicated in both cousins were carefully evaluated to verify their potential role in the pathogenesis of AIS. This analysis did not reveal the presence of CNVs that might have a role in the manifestation of the PAIS phenotype. Then, we scanned the exome data of the affected cousins using a list of 37 genes known to be responsible for DSD (Additional file 4: Table S4). This analysis confirmed the presence of the missense c.2522G > A mutation of the AR gene and excluded the occurrence of other causative genetic events being responsible for PAIS in both cousins (Additional file 5: Table S5).

Identification of CNVs involved in the early onset of CRC

High-density SNP array (blood) analysis of the proband and his relatives was performed to investigate CNVs and UPD. Overall, in the proband, we identified 14 CNVs (Table 1; Additional file 6: Table S6): 4 (two losses and two gains) in gene desert chromosomal regions and 10 (two losses and eight gains) in chromosomal regions involving genes. All identified CNVs were also detected in the SNP array data of the CRC tissue. By comparison with the Database of Genomic Variants, 10 of the identified CNVs were found, with a frequency of <1% in healthy human control individuals from worldwide populations. Among the CNVs, four involved cancer-related genes, and one was a microRNA gene. Specifically, the identified copy number gains in the chromosomal regions 1p36.22, involving the PRY domain-containing SOCS box protein 1 (SPSB1) gene; 14q32.33, involving the immunoglobulin heavy constant gamma 1 (IGHG1) gene; Xp221.33 or Yp11.32, encompassing the cytokine receptor-like factor 2 (CRLF2) gene; and the copy number loss of chromosomal region 10q21.3, involving the intronic region of the catenin α 3 (CTNNA3) gene, have been previously identified in human cancers other than colon cancer. A microduplication of chromosome 22q11.22, encompassing MIR650 gene, previously related to CRC, was also detected.

Table 1 List of chromosomal rearrangements detected in the proband through single nucleotide polymorphism array analysis

No UPD events spanning whole chromosomes or chromosomal segments were detected in the proband. The segregation analysis of the 14 CNVs revealed that only one was de novo, whereas four were identified in both parents, three were inherited paternally, and five were inherited maternally.

Identification of gene alterations involved in the early onset of CRC

WES generated an average of 456 × 106 reads per sample, mapped to the reference genome with an average of 64.6% at a mean depth of 179-fold exon coverage (Table 2). The average depth coverage above 150, obtained for the proband, father, and cousin, far exceeded the normal standards applied for the identification of germline variants, making the results very robust. Furthermore, in the proband, the germline variants were cross validated in different tissues. Finally, the genotype of each proband variant could be further confirmed from the exome data of his parents.

Table 2 Metrics of whole-exome sequencing (WES) analysis of the specimens from the proband and his relatives entering into the present study

To identify variants associated with CRC in the proband, we carried out WES on DNA from blood and normal colon mucosa samples, and the variants shared between the two sources were analyzed using a candidate gene approach and a whole-exome approach (Fig. 2). The final set of prioritized variants was evaluated using the exome data from the relatives of the proband.

Fig. 2
figure 2

Analysis of the workflow applied to prioritize variants found in the whole-exome sequencing (WES) data from the proband. Two different sources of the proband’s DNA (blood and normal colon tissues) were analyzed with WES. A quality filter was first applied to minimize false positive variants, permitting us to obtain 19,673 and 18,696 variants in DNA in the blood and normal colon tissues from the proband, respectively. Since massively parallel sequencing can generate artifacts, a total of 11,737 variants shared between the two sources of DNA were used to apply a twofold strategy to identify variants associated with CRC. The first approach (“candidate gene approach”) selected 26 variants from the 57 CRC susceptibility genes; the second approach (“whole-exome approach”) selected 13 truncating variants with a minor allele frequency (MAF) <0.01. IGV the Integrative Genomic Viewer software

Candidate gene approach

By searching variants in the 57 prior candidate CRC-related genes without filtering criteria, we identified 27 variants present in the normal genomic DNA of the proband. After visually examining the variants using Integrative Genomic Viewer (IGV) software, one variant was excluded, thus reducing the final list to 26 variants (13 heterozygous and 13 homozygous) harbored in 17 CRC-related genes (Table 3; Additional file 6: Table S6).

Table 3 List of variants identified in the WES data of the proband (blood and normal tissue) through the “Candidate gene approach”

Analysis using the 1000 genomes database revealed that 24 of the 26 variants had a minor allele frequency (MAF) >5%, indicating that these SNVs could be considered common variants, whereas the remaining two identified in the polymerase (DNA directed), epsilon, catalytic subunit (POLE) (chr12:133233977G > A, synonymous mutation) and protein tyrosine phosphatase, receptor type, J (PTPRJ) (rs11382316, intronic mutation) genes represented rare variants.

We focused our analysis on variants that produced an altered protein product and found that five of the 26 variants were non-synonymous. Among these, only the rs1058808 polymorphism of the erb-b2 receptor tyrosine kinase 2 (ERBB2) gene was predicted to be deleterious by both SIFT (damaging, tolerance index score ≤0.05, low confidence prediction) and PolyPhen-2 (possibly damaging, tolerance index score in the 0.453–0.956 range) (Additional file 7: Table S7).

A search for the identified variants in the MAF file reporting somatic variations of the 233 samples from the Cancer Genome Atlas (TCGA) project indicated that they were not described in the CRC data set. The Human Gene Mutation Database (HGMD) [18], Leiden Open Variation Database (LOVD) [19], and Human Genome Variation Society (HGVS) and ClinVar databases [20], which were used to assess the pathogenicity of these variants, indicated no evident pathogenicity associated with CRC.

The variants identified in the proband were then evaluated in the WES data of his relatives. Among the 13 heterozygous variants, six were inherited paternally, two were inherited maternally, and three were present in both parents; for the remaining two variants, due to low coverage of the targeted analyzed regions, the inheritance was not assessable. Among the 13 homozygous variants, eight were inherited from both parents. Two of the homozygous variants included one allele that was transmitted from the father, while presumably the SNV harbored in the other allele occurred de novo. The inheritance of three of the homozygous variants was not assessable. A majority of the identified variants (22 of 26) was also detected in the proband’s cousin.

Whole-exome approach

We focused our search on variants functionally annotated and predicted to truncate the protein products, i.e., frameshift insertions and deletions and non-sense mutations. Variants were excluded if they were present in the 1000 Genomes Project and dbSNP138 databases at only MAF >0.01. Among 11,737 variants shared by the two sources of normal DNA of the proband, 18 truncating variants were identified, but the manual revision of these through IGV software subsequently excluded five of them. The remaining 13 variants were in a heterozygous state (Table 4) and were essentially confirmed by the WES data of the proband’s relatives, i.e., four SNVs were inherited maternally and three were inherited paternally, whereas the presence of the other three in the aunt and cousin suggested possible inheritance from the proband’s mother. One rare variant in the HYDIN gene was identified in a heterozygous state in all members of the family.

Table 4 List of truncating mutations identified in the proband and his relatives

Network and pathway analysis

To test putative enrichment for canonical pathways, disease, biological functions, and molecular networks, the genes residing in CNVs and those harboring truncating mutations were analyzed using QIAGEN’s Ingenuity® Pathway Analysis.

Significant gene association was observed only when the gene list was reduced to AR (mutated and causative of PAIS in the proband), MIR650 (already related to CRC pathogenesis and present in a rearranged chromosomal region), ERBB2 (determined to be functionally impaired by the candidate gene approach), and the 13 genes harboring truncating variants (identified by the whole-exome approach). In fact, 14 (87.5%) of these 16 genes were found to be connected to each other in a unique relevant network called “Gene expression, cellular growth and proliferation, tissue development” (IPA score 46) (Fig. 3). When a testosterone metabolite (dihydrotestosterone) was added to the network, indirect interactions with genes of the network, including ERBB2, were found in addition to the expected direct interaction with AR. Table 5 summarizes the main characteristics of the altered genes related to this network and their genetic association with the analyzed samples.

Fig. 3
figure 3

Network of genes and molecules inferred from Ingenuity® Pathway Analysis (IPA). The 13 genes exhibiting truncating variants were combined with the erb-b2 receptor tyrosine kinase 2 (ERBB2) gene harboring the rs1058808 single nucleotide polymorphism (SNP), which was predicted to be deleterious by the sorting intolerant from tolerant (SIFT) and polymorphism phenotyping version 2 (PolyPhen-2) tools, the microRNA-650 (MIR650) gene (a colorectal cancer-related gene), and the androgen receptor (AR) gene harboring the pathogenic rs9332969 SNP to test the putative enrichment of the canonical pathways, disease and biological functions, and molecular networks. When the short list of altered genes was examined using the IPA software, a single pathway (top functions: gene expression, cellular growth and proliferation, tissue development) with a high score (46, P < 0.001) was identified. Genes highlighted in blue were identified to be altered in III.2 (proband) and III.3 (cousin). Genes highlighted in red were found to be altered only in subject III.2. The dihydrotestosterone molecule is highlighted in green

Table 5 Summary of the characteristics of genes connected in the “Gene expression, cellular growth and proliferation, tissue development” network


This study provided a deep genomic analysis using SNP array and WES data of the constitutive DNA of a patient harboring the missense c.2225G > A (p.R841H or rs9332969) mutation of the AR gene. It has been reported that the R841H protein variation influences its binding to and interactions with androgens [21]; however, this substitution does not result in complete AR functional disruption [22] and gives rise to variable expression of the AIS phenotype [2, 23], suggesting that alterations in other genes may modulate and affect the final phenotype. Our SNP array and WES analysis of both affected cousins in this study did not reveal the presence of additional causative genetic events responsible for the PAIS phenotype, thus strongly supporting the pathogenic role of the p.R841H variant in PAIS.

Our proband patient developed multiple CRCs at an early age, suggesting a hallmark of inherited cancer susceptibility along with familial aggregation [24, 25]. Moreover, it has been recently reported that genetic instability could explain the early onset of CRC [18]. Germline alterations in the major CRC-associated genes adenomatous polyposis coli (APC), mutY homolog (MUTYH), mutL homolog 1 (MLH1), mutS homolog 2 (MSH2), and mutS homolog 6 (MSH6) were previously excluded. In this study, we employed two advanced high-throughput technologies, namely SNP array and WES, to investigate the involvement of other genetic susceptibility mechanisms responsible for early onset of CRC.

Among the 14 CNVs identified by the SNP array, four involved genes (SPSB1, CTNNA3, IGHG1, and CRFL2) have been implicated in tumorigenesis, however, other than in colon cancer, thus only one rearrangement, a duplication of the chromosomal region 22q11.22, may be of interest and relevant to this study. The amplified chromosomal region included MIR650, which has among its targets the promoter region of the NDRG family member 2 (NDRG2) tumor suppressor gene where it has been reported to be able to repress NDRG2 gene expression at the transcriptional level in CRC [26].

Among the 26 variants identified by WES following the candidate gene approach, a literature search revealed that the APC, caspase 8, apoptosis-related cysteine peptidase (CASP8), ERBB2, MLH1, and PTPRJ gene variants have been previously investigated to verify their roles in cancer predisposition (see all the references in Additional file 7: Table S7). Even if no clear pathogenic mutations responsible for CRC insurgence were identified, it is worth noting that rs1058808 in the ERBB2 gene, present in the heterozygous state, is harbored in the regulatory C-terminal domain of the ErbB2 protein. This protein represents a hub of the protein–protein interaction network associated with the ErbB2 receptor, which plays a crucial role in signaling cascades regulating cell growth, differentiation, and migration [27]. Furthermore, previous studies investigating the role of the rs1058808 polymorphism have suggested that it may represent a functional polymorphism inducing the overexpression of the ErbB2 protein [28, 29]. At present, in a series of 104 sporadic CRC patients, three mutations in the ERBB2 kinase domain have been found [30]; however, the mutation identified in the proband patient was located in a different region. Finally, our WES approach identified 13 genes harboring loss-of-function variants in the heterozygous state in the proband.

The analysis of the prioritized genes harboring the loss-of-function variants combined with the ERBB2, AR, and MIR650 genes through IPA software revealed that 14 of the 16 genes were connected in a unique network, called the “Gene expression, cellular growth and proliferation, tissue development” network. The core molecular hub in this network is represented by seven genes (MIR650, alcohol dehydrogenase, iron containing 1 [ADHFE1], charged multivesicular body protein 2A [CHMP2B], DENN domain containing 2D [DENND2D], kalirin RhoGEF kinase [KALRN], meprin A subunit alpha [MEP1A], and transglutaminase 4 [TGM4]) connected with AR and ERBB2, revealing an altered cross-talk between these two signaling pathways. From the literature, four of these six genes (CHMP2B, DENND2D, KALRN, and TGM4) have been previously identified as being involved in oncogenesis, while 3 (MIR650, ADHFE1, and MEP1A) may be related to CRC development and differentiation (see all the references in Additional file 8: Table S8). Considering the genes connected in the network but that were not altered in sequence, the presence of the interleukin-6 (IL-6) gene and its relation to MEP1A are also noteworthy. Indeed, IL-6 has been shown to be increased in Meprin KO mice [31], and IL-6 has been found to be up-regulated in colon cancer tissues and significantly associated with an increased risk of relapse [32]. Moreover, it should be noted that, among these 14 connected genes, five were not altered in the cousin. Testosterone, indirectly connected with TGM4 and ERBB2, was provided continuously for 24 years to the proband, including developmental years, whereas only 5 years of discontinuous treatment was provided to the cousin after age 15 years.

In physiologic conditions and in the presence of an active AR, the activity of testosterone and dihydrotestosterone hormones can be modulated by (i) a classic genomic model mediated by the cytosolic AR protein and (ii) a non-genomic model mediated by the functional membrane AR [33]. Since membrane AR activation via testosterone induces cytosolic signaling cascades involving molecules belonging to the ERBB2 pathway, it is tempting to hypothesize the possible existence of cross-talk between testosterone, membrane AR, and ErbB2 protein [3438]. Considering the role of testosterone in CRC development, the literature indicates that the risk of CRC shows sexual dimorphism, with a higher incidence and earlier onset in men [39]. Whether this disparity depends on protective factors in women, tumor-promoting factors in males, or both, is unknown. A recent epidemiologic study indicated that, in men with functional AR protein, higher levels of testosterone and sex hormone-binding globulin were associated with a decreased risk for developing CRC, but lower androgenicity may increase the risk of developing CRC in males [40, 41]. A tumor-promoting effect of testosterone, likely not dependent on AR activity, has been reported via the elegant use of the APCPirc/+ (Pirc) rat model, suggesting that sex disparity in the development of colonic adenomas depends directly on the effect of testosterone, rather than on a protective effect of female hormones [42]. Testosterone has been shown to stimulate the growth of lung and colon cancer cells in vitro, a mechanism that can be halted with anti-androgens [43, 44]. A population-based study that aimed to verify the association between plasma testosterone and overall cancer risk revealed that increased levels of plasma testosterone were associated with an increased risk of early death after cancer, but not with risk of cancer incidence [45]. Putting all these factors together, these observations suggest a potential effect of testosterone on cancer development and prognosis.

Thus, we speculate that testosterone treatment may have played a role in the initiation of cancer in the colon tissue of the proband, probably exerting a direct interaction with the poorly or non-functional cytosolic AR protein or an indirect action through the membrane AR protein. This could have led to the activation of a cytosolic signaling cascade involving the ErbB2 protein, which may have a functional effect in the patient.


The etiology of CRC is complex and multifactorial, involving an interplay between multiple genetic and environmental factors. Our in-depth genetic analysis of the proband and his family supports the hypothesis that the identified genetic alterations, in a multi-hit model of the inheritance of common and rare variants in multiple genes, might have contributed to the early onset of CRC in combination with environmental factors, such as a potential iatrogenic effect due to a continuous high-dose treatment with testosterone during developmental years. To confirm the relevance of the identified network of genes and molecules in CRC predisposition, further AIS families and preclinical model systems should be analyzed. Moreover, these findings highlight the importance of these types of studies to develop a better understanding of the molecular basis of a disease and, consequently, for better clinical management of patients.


All experiments were approved by the Ethical Committee of Fondazione IRCCS Istituto Nazionale dei Tumori (Milan, Italy). A written informed consent was obtained from all subjects for publication of individual patient data.


  1. Jääskeläinen J. Molecular biology of androgen insensitivity. Mol Cell Endocrinol. 2012;352(1–2):4–12.

    Article  PubMed  Google Scholar 

  2. Audi L, Fernández-Cancio M, Carrascosa A, Andaluz P, Torán N, Piró C, et al. Novel (60%) and recurrent (40%) androgen receptor gene mutations in a series of 59 patients with a 46,XY disorder of sex development. J Clin Endocrinol Metab. 2010;95:1876–88.

    Article  CAS  PubMed  Google Scholar 

  3. Ahmed SF, Cheng A, Dovey L, Hawkins JR, Martin H, Rowland J, et al. Phenotypic features, androgen receptor binding, and mutational analysis in 278 clinical cases reported as androgen insensitivity syndrome. J Clin Endocrinol Metab. 2000;85:658–65.

    CAS  PubMed  Google Scholar 

  4. Gottlieb B, Beitel LK, Nadarajah A, Paliouras M, Trifiro M. The androgen receptor gene mutations database: 2012 update. Hum Mutat. 2012;33:887–94.

    Article  CAS  PubMed  Google Scholar 

  5. Cools M, Drop SLS, Wolffenbuttel KP, Oosterhuis JW, Looijenga LH. Germ cell tumours in the intersex gonad: old paths, new directions, moving frontiers. Endocr Rev. 2006;27(5):468–84.

    Article  CAS  PubMed  Google Scholar 

  6. Fallat ME, Donahoe PK. Intersex genetic anomalies with malignant potential. Curr Opin Pediatr. 2006;18(3):305–11.

    Article  PubMed  Google Scholar 

  7. Handa N, Nagasaki A, Tsunoda M, Ohgami H, Kawanami T, Sueishi K, et al. Yolk sac tumor in a case of testicular feminization syndrome. J Pediatr Surg. 1995;30(9):1366–7 (discussion 1367–8).

    Article  CAS  PubMed  Google Scholar 

  8. Rutgers JL, Scully RE. The androgen insensitivity syndrome (testicular feminization): a clinicopathologic study of 43 cases. Int J Gynecol Pathol. 1991;10:126–44.

    Article  CAS  PubMed  Google Scholar 

  9. Evaluation of Genomic Applications in Practice and Prevention (EGAPP) Working Group. Recommendations from the EGAPP Working Group: genetic testing strategies in newly diagnosed individuals with colorectal cancer aimed at reducing morbidity and mortality from Lynch syndrome in relatives. Genet Med. 2009;11(1):35–41.

    Article  Google Scholar 

  10. Syngal S, Brand RE, Church JM, Giardiello FM, Hampel HL, Burt RW, American College of Gastroenterology. ACG clinical guideline: genetic testing and management of hereditary gastrointestinal cancer syndromes. Am J Gastroenterol. 2015;110(2):223–62.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Ruijs MW, Verhoef S, Rookus MA, Pruntel R, van der Hout AH, Hogervorst FB, et al. TP53 germline mutation testing in 180 families suspected of Li-Fraumeni syndrome: mutation detection rate and relative frequency of cancers in different familial phenotypes. J Med Genet. 2010;47(6):421–8.

    Article  CAS  PubMed  Google Scholar 

  12. Frattini M, Signoroni S, Pilotti S, Bertario L, Benvenuti S, Zanon C, et al. Phosphatase protein homologue to tensin expression and phosphatidylinositol-3 phosphate kinase mutations in colorectal cancer. Cancer Res. 2005;65(23):11227.

    Article  CAS  PubMed  Google Scholar 

  13. Zlobec I, Kovac M, Erzberger P, Molinari F, Bihl MP, Rufle A, et al. Combined analysis of specific KRAS mutation, BRAF and microsatellite instability identifies prognostic subgroups of sporadic and hereditary colorectal cancer. Int J Cancer. 2010;127(11):2569–75.

    Article  CAS  PubMed  Google Scholar 

  14. Pietrantonio F, Perrone F, de Braud F, Castano A, Maggi C, Bossi I, et al. Activity of temozolomide in patients with advanced chemorefractory colorectal cancer and MGMT promoter methylation. Ann Oncol. 2014;25(2):404–8.

    Article  CAS  PubMed  Google Scholar 

  15. Boehmer AL, Brinkmann O, Brüggenwirth H, van Assendelft C, Otten BJ, Verleun-Mooijman MC, et al. Genotype versus phenotype in families with androgen insensitivity syndrome. J Clin Endocrinol Metab. 2001;86(9):4151–60.

    Article  CAS  PubMed  Google Scholar 

  16. Ng PC, Henikoff S. SIFT: predicting amino acid changes that affect protein function. Nucleic Acids Res. 2003;31(13):3812–4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, et al. A method and server for predicting damaging missense mutations. Nat Methods. 2010;7(4):248–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  18. Stenson PD, Mort M, Ball EV, Shaw K, Phillips A, Cooper DN. The human gene mutation database: building a comprehensive mutation repository for clinical and molecular genetics, diagnostic testing and personalized genomic medicine. Hum Genet. 2014;133(1):1–9.

    Article  CAS  PubMed  Google Scholar 

  19. Fokkema IF, Taschner PE, Schaafsma GC, Celli J, Laros JF, den Dunnen JT. LOVD v.2.0: the next generation in gene variant databases. Hum Mutat. 2011;32(5):557–63.

    Article  CAS  PubMed  Google Scholar 

  20. Landrum MJ, Lee JM, Riley GR, Jang W, Rubinstein WS, Church DM, et al. ClinVar: public archive of relationships among sequence variation and human phenotype. Nucleic Acids Res. 2014;42(Database issue):D980–5.

    Article  CAS  PubMed  Google Scholar 

  21. Lundberg Giwercman Y, Nikoshkov A, Lindsten K, Byström B, Pousette A, Chibalin AV, et al. Functional characterisation of mutations in the ligand-binding domain of the androgen receptor gene in patients with androgen insensitivity syndrome. Hum Genet. 1998;103(4):529–31.

    Article  CAS  PubMed  Google Scholar 

  22. Shojaei A, Behjati F, Ebrahimzadeh-Vesal R, Razzaghy-Azar M, Derakhshandeh-Peykar P, Izadi P, et al. Mutation analysis of androgen receptor gene: multiple uses for a single test. Gene. 2014;552(2):234–8.

    Article  CAS  PubMed  Google Scholar 

  23. Wang X, Wang XR, Liu MG, Wang Q, Liu JY. Genetic analysis of a family with 46,XY “female” associated with infertility. Yi Chuan Xue Bao. 2006;33(1):19–25.

    PubMed  Google Scholar 

  24. Giráldez MD, Balaguer F, Bujanda L, Cuatrecasas M, Muñoz J, Alonso-Espinaco V, et al. MSH6 and MUTYH deficiency is a frequent event in early-onset colorectal cancer. Clin Cancer Res. 2010;16(22):5402–13.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Tanskanen T, Gylfe AE, Katainen R, Taipale M, Renkonen-Sinisalo L, Järvinen H, et al. Systematic search for rare variants in Finnish early-onset colorectal cancer patients. Cancer Genet. 2015;208(1–2):35–40.

    Article  CAS  PubMed  Google Scholar 

  26. Feng L, Xie Y, Zhang H, Wu Y. Down-regulation of NDRG2 gene expression in human colorectal cancer involves promoter methylation and microRNA-650. Biochem Biophys Res Commun. 2011;406(4):534–8.

    Article  CAS  PubMed  Google Scholar 

  27. Bornet O, Nouailler M, Feracci M, Sebban-Kreuzer C, Byrne D, Halimi H, et al. Identification of a Src kinase SH3 binding site in the C-terminal domain of the human ErbB2 receptor tyrosine kinase. FEBS Lett. 2014;588(12):2031–6.

    Article  CAS  PubMed  Google Scholar 

  28. Han W, Kang D, Lee JE, Park IA, Choi JY, Lee KM, et al. A haplotype analysis of HER-2 gene polymorphisms: association with breast cancer risk, HER-2 protein expression in the tumor, and disease recurrence in Korea. Clin Cancer Res. 2005;11(13):4775–8.

    Article  CAS  PubMed  Google Scholar 

  29. Su Y, Jiang Y, Sun S, Yin H, Shan M, Tao W, et al. Effects of HER2 genetic polymorphisms on its protein expression in breast cancer. Cancer Epidemiol. 2015;39(6):1123–7.

    Article  PubMed  Google Scholar 

  30. Lee JW, Soung YH, Seo SH, Kim SY, Park CH, Wang YP, et al. Somatic mutations of ERBB2 kinase domain in gastric, colorectal, and breast carcinomas. Clin Cancer Res. 2006;12(1):57–61.

    Article  CAS  PubMed  Google Scholar 

  31. Banerjee S, Jin G, Bradley SG, Matters GL, Gailey RD, Crisman JM, et al. Balance of meprin A and B in mice affects the progression of experimental inflammatory bowel disease. Am J Physiol Gastrointest Liver Physiol. 2011;300(2):G273–82.

    Article  CAS  PubMed  Google Scholar 

  32. Olsen J, Kirkeby LT, Olsen J, Eiholm S, Jess P, Gögenur I, et al. High interleukin-6 mRNA expression is a predictor of relapse in colon cancer. Anticancer Res. 2015;35(4):2235–40.

    CAS  PubMed  Google Scholar 

  33. Kirli HZ, Tesikova M, Saatcioglu F. Androgen receptor (AR). In: Choi S, editor. Encyclopedia of signaling molecules. Berlin: Springer; 2012. p. 100–6.

    Google Scholar 

  34. Gu S, Papadopoulou N, Gehring EM, Nasir O, Dimas K, Bhavsar SK, et al. Functional membrane androgen receptors in colon tumors trigger pro-apoptotic responses in vitro and reduce drastically tumor incidence in vivo. Mol Cancer. 2009;8:114.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Gu S, Papadopoulou N, Nasir O, Föller M, Alevizopoulos K, Lang F, et al. Activation of membrane androgen receptors in colon cancer inhibits the prosurvival signals Akt/bad in vitro and in vivo and blocks migration via vinculin/actin signaling. Mol Med. 2011;17(1–2):48–58.

    CAS  PubMed  Google Scholar 

  36. Gu S, Kounenidakis M, Schmidt EM, Deshpande D, Alkahtani S, Alarifi S, et al. Rapid activation of FAK/mTOR/p70S6K/PAK1-signaling controls the early testosterone-induced actin reorganization in colon cancer cells. Cell Signal. 2013;25(1):66–73.

    Article  CAS  PubMed  Google Scholar 

  37. Alkahtani S. Testosterone induced apoptosis in colon cancer cells is regulated by PI3 K/Rac1 signaling. Asian J Androl. 2013;15(6):831–4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Gu S, Honisch S, Kounenidakis M, Alkahtani S, Alarifi S, Alevizopoulos K, et al. Membrane androgen receptor down-regulates c-src-activity and beta-catenin transcription and triggers GSK-3beta-phosphorylation in colon tumor cells. Cell Physiol Biochem. 2014;34(4):1402–12.

    Article  CAS  PubMed  Google Scholar 

  39. Siegel RL, Miller KD, Jemal A. Cancer statistics, 2015. CA Cancer J Clin. 2015;65(1):5–29.

    Article  PubMed  Google Scholar 

  40. Lin JH, Zhang SM, Rexrode KM, Manson JE, Chan AT, Wu K, et al. Association between sex hormones and colorectal cancer risk in men and women. Clin Gastroenterol Hepatol. 2013;11(4):419–24.

    Article  CAS  PubMed  Google Scholar 

  41. Slattery ML, Sweeney C, Murtaugh M, Ma KN, Wolff RK, Potter JD, et al. Associations between ERalpha, ERbeta, and AR genotypes and colon and rectal cancer. Cancer Epidemiol Biomarkers Prev. 2005;14(12):2936–42.

    Article  CAS  PubMed  Google Scholar 

  42. Amos-Landgraf JM, Heijmans J, Wielenga MC, Dunkin E, Krentz KJ, Clipson L, et al. Sex disparity in colonic adenomagenesis involves promotion by male hormones, not protection by female hormones. Proc Natl Acad Sci USA. 2014;111(46):16514–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Tutton PJ, Barkla DH. The influence of androgens, anti-androgens, and castration on cell proliferation in the jejunal and colonic crypt epithelia, and in dimethylhydrazine-induced adenocarcinoma of rat colon. Virchows Arch B Cell Pathol Incl Mol Pathol. 1982;38(3):351–5.

    CAS  PubMed  Google Scholar 

  44. Maasberg M, Rotsch M, Jaques G, Enderle-Schmidt U, Weehle R, Havemann K. Androgen receptors, androgen-dependent proliferation, and 5 alpha-reductase activity of small-cell lung cancer cell lines. Int J Cancer. 1989;43(4):685–91.

    Article  CAS  PubMed  Google Scholar 

  45. Orsted DD, Nordestgaard BG, Bojesen SE. Plasma testosterone in the general population, cancer prognosis and cancer risk: a prospective cohort study. Ann Oncol. 2014;25(3):712–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Authors’ contributions

All authors participated in the manuscript revision. VD, LDC, SS, LB and MV participated in the conception and design of the study; MM performed the histopathological revision; and IHC, OP and MC performed the SNP array experiments and related bioinformatics analysis; VD, LDC and PD performed the WES experiments; VD, OP and MC performed the interpretation of the SNP array data; VD and AD carried out the bioinformatics analyses and interpretation of the WES data; GV critically revised the WES data and manuscript; MAP critically revised the manuscript and provided funding to the project; VD, LDC, SS and SC coordinated and finalized the manuscript drafting and revision. All authors read and approved the final manuscript.


We would like to thank the family for participating in this research and allowing us to share the findings. We also thank Affymetrix (Affymetrix Inc. Santa Barbara, USA) for providing the arrays for this study.

This work was supported in part by funds obtained through an Italian law that allows taxpayers to allocate 0.5 percent share of their income tax contribution to a research institution of their choice.

Competing interests

The authors declare that they have no competing interests.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Silvana Canevari.

Additional information

Stefano Signoroni and Loris De Cecco contributed equally to this work

Additional files

Additional file 1: Table S1. Variants of tumor genes detected by Sanger sequencing in the proband.

Additional file 2: Table S2. List of genes associated with inherited predisposition to colorectal cancer (CRC).

Additional file 3: Table S3. List of genes significantly mutated in 233 colorectal tumors (TCGA project).

Additional file 4: Table S4. List of disorder of sex development (DSD) genes used to filter exome sequencing variants.


Additional file 5: Table S5. List of single nucleotide variants (SNVs) and small insertions and deletions (INDELs) detected in WES data of III.2 and III.3 subjects.


Additional file 6: Table S6. Chromosomal rearrangements of cancer function-related genes detected in the proband through single nucleotide polymorphism.


Additional file 7: Table S7. List of variants identified in whole-genome sequencing (WES) data of the proband (blood sample and normal colon tissue) through the candidate gene approach.

Additional file 8: Table S8. List of truncating mutations identified in the proband and his relatives.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Disciglio, V., Devecchi, A., Palumbo, O. et al. Whole exome sequencing and single nucleotide polymorphism array analyses to identify germline alterations in genes associated with testosterone metabolism in a patient with androgen insensitivity syndrome and early-onset colorectal cancer. Chin J Cancer 35, 51 (2016).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: