Advertisement
Research ArticleGeneticsOncology Free access | 10.1172/JCI122095
1Department of Pathology and Simmons Comprehensive Cancer Center, University of Texas Southwestern (UTSW) Medical Center, Dallas, Texas, USA.
2Department of Hematopathology, Division of Pathology and Laboratory Medicine, University of Texas MD Anderson Cancer Center, Houston, Texas, USA.
3Lyda Hill Department of Bioinformatics, UTSW Medical Center, Dallas, Texas, USA.
Address correspondence to: Diego H. Castrillon, UT Southwestern Department of Pathology, 6000 Harry Hines Boulevard, Dallas Texas 75390-9072, USA. Phone: 214.648.4032; Email: diego.castrillon@utsouthwestern.edu.
Authorship note: HDL and IC contributed equally to this work.
Find articles by Li, H. in: JCI | PubMed | Google Scholar |
1Department of Pathology and Simmons Comprehensive Cancer Center, University of Texas Southwestern (UTSW) Medical Center, Dallas, Texas, USA.
2Department of Hematopathology, Division of Pathology and Laboratory Medicine, University of Texas MD Anderson Cancer Center, Houston, Texas, USA.
3Lyda Hill Department of Bioinformatics, UTSW Medical Center, Dallas, Texas, USA.
Address correspondence to: Diego H. Castrillon, UT Southwestern Department of Pathology, 6000 Harry Hines Boulevard, Dallas Texas 75390-9072, USA. Phone: 214.648.4032; Email: diego.castrillon@utsouthwestern.edu.
Authorship note: HDL and IC contributed equally to this work.
Find articles by Cuevas, I. in: JCI | PubMed | Google Scholar
1Department of Pathology and Simmons Comprehensive Cancer Center, University of Texas Southwestern (UTSW) Medical Center, Dallas, Texas, USA.
2Department of Hematopathology, Division of Pathology and Laboratory Medicine, University of Texas MD Anderson Cancer Center, Houston, Texas, USA.
3Lyda Hill Department of Bioinformatics, UTSW Medical Center, Dallas, Texas, USA.
Address correspondence to: Diego H. Castrillon, UT Southwestern Department of Pathology, 6000 Harry Hines Boulevard, Dallas Texas 75390-9072, USA. Phone: 214.648.4032; Email: diego.castrillon@utsouthwestern.edu.
Authorship note: HDL and IC contributed equally to this work.
Find articles by Zhang, M. in: JCI | PubMed | Google Scholar |
1Department of Pathology and Simmons Comprehensive Cancer Center, University of Texas Southwestern (UTSW) Medical Center, Dallas, Texas, USA.
2Department of Hematopathology, Division of Pathology and Laboratory Medicine, University of Texas MD Anderson Cancer Center, Houston, Texas, USA.
3Lyda Hill Department of Bioinformatics, UTSW Medical Center, Dallas, Texas, USA.
Address correspondence to: Diego H. Castrillon, UT Southwestern Department of Pathology, 6000 Harry Hines Boulevard, Dallas Texas 75390-9072, USA. Phone: 214.648.4032; Email: diego.castrillon@utsouthwestern.edu.
Authorship note: HDL and IC contributed equally to this work.
Find articles by Lu, C. in: JCI | PubMed | Google Scholar
1Department of Pathology and Simmons Comprehensive Cancer Center, University of Texas Southwestern (UTSW) Medical Center, Dallas, Texas, USA.
2Department of Hematopathology, Division of Pathology and Laboratory Medicine, University of Texas MD Anderson Cancer Center, Houston, Texas, USA.
3Lyda Hill Department of Bioinformatics, UTSW Medical Center, Dallas, Texas, USA.
Address correspondence to: Diego H. Castrillon, UT Southwestern Department of Pathology, 6000 Harry Hines Boulevard, Dallas Texas 75390-9072, USA. Phone: 214.648.4032; Email: diego.castrillon@utsouthwestern.edu.
Authorship note: HDL and IC contributed equally to this work.
Find articles by Alam, M. in: JCI | PubMed | Google Scholar
1Department of Pathology and Simmons Comprehensive Cancer Center, University of Texas Southwestern (UTSW) Medical Center, Dallas, Texas, USA.
2Department of Hematopathology, Division of Pathology and Laboratory Medicine, University of Texas MD Anderson Cancer Center, Houston, Texas, USA.
3Lyda Hill Department of Bioinformatics, UTSW Medical Center, Dallas, Texas, USA.
Address correspondence to: Diego H. Castrillon, UT Southwestern Department of Pathology, 6000 Harry Hines Boulevard, Dallas Texas 75390-9072, USA. Phone: 214.648.4032; Email: diego.castrillon@utsouthwestern.edu.
Authorship note: HDL and IC contributed equally to this work.
Find articles by Fu, Y. in: JCI | PubMed | Google Scholar
1Department of Pathology and Simmons Comprehensive Cancer Center, University of Texas Southwestern (UTSW) Medical Center, Dallas, Texas, USA.
2Department of Hematopathology, Division of Pathology and Laboratory Medicine, University of Texas MD Anderson Cancer Center, Houston, Texas, USA.
3Lyda Hill Department of Bioinformatics, UTSW Medical Center, Dallas, Texas, USA.
Address correspondence to: Diego H. Castrillon, UT Southwestern Department of Pathology, 6000 Harry Hines Boulevard, Dallas Texas 75390-9072, USA. Phone: 214.648.4032; Email: diego.castrillon@utsouthwestern.edu.
Authorship note: HDL and IC contributed equally to this work.
Find articles by You, M. in: JCI | PubMed | Google Scholar
1Department of Pathology and Simmons Comprehensive Cancer Center, University of Texas Southwestern (UTSW) Medical Center, Dallas, Texas, USA.
2Department of Hematopathology, Division of Pathology and Laboratory Medicine, University of Texas MD Anderson Cancer Center, Houston, Texas, USA.
3Lyda Hill Department of Bioinformatics, UTSW Medical Center, Dallas, Texas, USA.
Address correspondence to: Diego H. Castrillon, UT Southwestern Department of Pathology, 6000 Harry Hines Boulevard, Dallas Texas 75390-9072, USA. Phone: 214.648.4032; Email: diego.castrillon@utsouthwestern.edu.
Authorship note: HDL and IC contributed equally to this work.
Find articles by Akbay, E. in: JCI | PubMed | Google Scholar
1Department of Pathology and Simmons Comprehensive Cancer Center, University of Texas Southwestern (UTSW) Medical Center, Dallas, Texas, USA.
2Department of Hematopathology, Division of Pathology and Laboratory Medicine, University of Texas MD Anderson Cancer Center, Houston, Texas, USA.
3Lyda Hill Department of Bioinformatics, UTSW Medical Center, Dallas, Texas, USA.
Address correspondence to: Diego H. Castrillon, UT Southwestern Department of Pathology, 6000 Harry Hines Boulevard, Dallas Texas 75390-9072, USA. Phone: 214.648.4032; Email: diego.castrillon@utsouthwestern.edu.
Authorship note: HDL and IC contributed equally to this work.
Find articles by Zhang, H. in: JCI | PubMed | Google Scholar |
1Department of Pathology and Simmons Comprehensive Cancer Center, University of Texas Southwestern (UTSW) Medical Center, Dallas, Texas, USA.
2Department of Hematopathology, Division of Pathology and Laboratory Medicine, University of Texas MD Anderson Cancer Center, Houston, Texas, USA.
3Lyda Hill Department of Bioinformatics, UTSW Medical Center, Dallas, Texas, USA.
Address correspondence to: Diego H. Castrillon, UT Southwestern Department of Pathology, 6000 Harry Hines Boulevard, Dallas Texas 75390-9072, USA. Phone: 214.648.4032; Email: diego.castrillon@utsouthwestern.edu.
Authorship note: HDL and IC contributed equally to this work.
Find articles by Castrillon, D. in: JCI | PubMed | Google Scholar |
Authorship note: HDL and IC contributed equally to this work.
Published August 20, 2018 - More info
Over 40 years ago, Loeb and colleagues proposed that errors in DNA replication produce a mutator phenotype that is involved in generating the multiple mutations required for tumor development. In this issue of the JCI, Li, Castrillon, and colleagues describe a mouse model containing a single base change in the gene encoding replicative DNA polymerase ε (POLE) that mimics the “ultramutator” phenotype recently reported in many human tumors. Their seminal accomplishment validates Loeb’s hypothesis and the use of mutational signatures to understand the origins and potentially the treatment of human tumors, and it offers an exciting opportunity to further explore the mechanisms responsible for normal DNA replication fidelity and their perturbations.
Thomas A. Kunkel
Mutations underlie all cancers, and their identification and study are the foundation of cancer biology. We describe what we believe to be a novel approach to mutagenesis and cancer studies based on the DNA polymerase ε (POLE) ultramutator phenotype recently described in human cancers, in which a single amino acid substitution (most commonly P286R) in the proofreading domain results in error-prone DNA replication. We engineered a conditional PoleP286R allele in mice. PoleP286R/+ embryonic fibroblasts exhibited a striking mutator phenotype and immortalized more efficiently. PoleP286R/+ mice were born at Mendelian ratios but rapidly developed lethal cancers of diverse lineages, yielding the most cancer-prone monoallelic model described to date, to our knowledge. Comprehensive whole-genome sequencing analyses showed that the cancers were driven by high base substitution rates in the range of human cancers, overcoming a major limitation of previous murine cancer models. These data establish polymerase-mediated ultramutagenesis as an efficient in vivo approach for the generation of diverse animal cancer models that recapitulate the high mutational loads inherent to human cancers.
Eukaryotic DNA replication is astoundingly accurate, with an error rate of only 1 per 109 to 1010 bases, or approximately 1 mutation per cell division. Replication proceeds at DNA forks, where leading-strand and discontinuous lagging-strand synthesis places different demands on the polymerases responsible for DNA synthesis. The DNA polymerases δ (Pol δ) and ε (Pol ε), which replicate the bulk of the nuclear genome, act on opposite DNA strands at replication forks. Pol δ is the principal lagging-strand replicase, while Pol ε replicates the leading strand (1). Pol ε is a heterotetramer composed of 3 accessory subunits and a single catalytic subunit encoded by the POLE locus (2–4).
The remarkable fidelity of DNA replication by Pol δ and Pol ε is determined by 3 mechanisms acting in series. First, nucleotide incorporation into the nascent strand is highly accurate. If an incorrect nucleotide enters the nucleotide-binding pocket of the polymerase, the nucleotide leaves the pocket. The intrinsic nucleotide fidelity of the replicative DNA polymerases is approximately 1 error per 104 to 105 nucleotides (5, 6). Second, Pol δ and Pol ε are unique among mammalian nuclear DNA polymerases, in that they contain 3′ exonuclease (“proofreading”) domains that remove most misincorporated nucleotides (3, 7, 8). This endogenous proofreading activity lowers the error rate to approximately 1 error in 106 to 107 nucleotides (4, 9). Third, errors escaping proofreading are usually repaired via postreplicative mismatch repair (MMR) pathways, whereby base-base mismatches are recognized, excised, and repaired (10). The remarkably low intrinsic DNA replication error rate underscores the relentless selective pressure maintaining high overall DNA replication fidelity. And yet the residual error is a fundamental driver of human cancer, with strong selection for rare mutations that promote genome instability and accelerate the acquisition of mutations by diverse mechanisms, i.e., the mutator phenotype (11, 12).
Although mutation is the basis of all cancer, mutation incidences vary by orders of magnitude among individual cancers. Most carcinomas have base substitution rates in the range of 1 to 100 per million bases (Mb). MMR-deficient carcinomas are at the high end of this range (≥10/Mb), termed hypermutation. Cancers associated with chronic mutagen exposure, such as lung cancer and melanoma, also have high base substitution rates (13). Much more recently, cancers with even higher base substitution rates of 100/Mb or higher (ultramutation) have been attributed to somatically acquired POLE missense mutations leading to single amino acid substitution in the proofreading domain (e.g., P286R or V411L) (14). POLE-driven ultramutation represents a recently described mutator phenotype that is common among endometrial and colorectal cancers but also occurs in diverse carcinomas, sarcomas, and hematopoietic malignancies. Some POLE-driven cancers have base substitution rates in the hypermutated range (10–100 Mb), while some MMR-deficient cancers have base substitution rates of greater than 100/Mb. Thus, there is overlap with respect to base substitution rates among MMR-deficient and POLE-driven human cancers. Analogous exonuclease domain mutations in POLD1 (the catalytic subunit of Pol δ) occur less commonly and in fewer cancer types than do those in POLE (15).
The biochemical impact of the recurrent POLE amino acid substitutions such as P286R is not well understood, but the result is misincorporation of nucleotides during DNA replication, with an error rate much higher than would result from mere inactivation of the exonuclease domain (16–18). The frequency of POLE amino acid substitutions among ultramutated human cancers correlates with the strength of the resulting mutator phenotype; P286R, the most common substitution, results in the strongest mutator phenotype. The POLEP286R mutation behaves in a genetically dominant manner in cancers, as the WT allele is retained (16). This genetic dominance reflects the fact that POLEP286R is a neomorphic mutation that does not result merely in gain or loss of function but produces a new function not intrinsic to the normal allele (active misincorporation of nucleotides) (19, 20). The overall replication error rate of POLEP286R/+ cells is thus driven by the presumptive 50% of Pol ε holoenzymes harboring P286R.
Genetically engineered mice are essential tools for the study of cancer. Typically in such models, 1–3 mutations that occur in a human cancer are recapitulated via conditional or other genetic approaches. While such models have proven remarkably effective for studies of diverse aspects of oncogenic signaling pathways to carcinogenesis, there has also been growing concern that genetically engineered mouse models fail to recapitulate essential aspects of human tumor biology. Perhaps foremost among these is the fact that mouse models of cancer — e.g., Egfr-, Kras-, or Myc-driven models of lung cancer — have dramatically lower average mutational frequencies than do human lung carcinomas. The same limitations are likely to be encountered with cancer models based on newer genome-editing methods (21), pointing to the need for different approaches to optimize mouse models with respect to mutational load, which defines many aspects of tumor biology, clinical behavior, and treatment response (22–24).
In this study, we recapitulate POLEP286R-driven ultramutation in mice with a conditional knockin allele to permit control of the initiation of mutagenesis. Activation of the allele was efficient and resulted in striking phenotypes in both embryonic fibroblasts and live mice. The incidence and spectrum of malignancies point to PoleP286R as the most potent oncogenic driver mutation described to date, to our knowledge. Whole-genome sequencing (WGS) showed that the cancers harbored base substitutions in the range of up to 100/Mb and in the range of human carcinomas, including hypermutant and ultramutant carcinomas (25). We believe our approach represents a new, efficient, and widely applicable route to the creation of mouse models that recapitulate the mutational loads inherent to human cancer.
Generation of a conditional LSL-PoleP286R–knockin allele. The human POLE and mouse Pole genes both contain 49 exons encoding proteins of 262 kDa (Figure 1A). Proline 286 is encoded by exon 9, and the 10 amino acids flanking this residue are perfectly conserved in mice and humans (Figure 1B). An 8.0-kb mouse Pole genomic fragment spanning exon 9 was cloned; the c.857C>G point mutation was introduced by site-directed mutagenesis, and a Lox-Stop-Lox (LSL) cassette (26) was inserted into a native intronic XhoI site to generate the targeting construct. Following targeting of embryonic stem (ES) cells and the births of chimeric animals, germline transmission was achieved and a LSL-PoleP286R/+ mouse line was established. The LSL cassette prevents the expression of active protein; Cre-mediated recombination excises the LSL cassette, resulting in PoleP286R expression (Figure 1C). LSL-PoleP286R/+ heterozygous mice were healthy and fertile. We developed a multiplex genotyping protocol to distinguish between the +, LSL-PoleP286R, and PoleP286R alleles (Supplemental Figure 1A; supplemental material available online with this article; https://doi.org/10.1172/JCI122095DS1). cDNA sequencing with intron-spanning primers confirmed equal expression of the mutant and WT alleles at the RNA level, as well as absent expression from the LSL-PoleP286R allele (Supplemental Figure 1B).
Generation of the LSL-PoleP286R conditional knockin allele. (A) POLE protein domain structure and position of P286R. The C-terminal half of the protein is not essential for polymerase activity but serves as a protein-protein interaction domain. pol, polymerase domain; exo, exonuclease domain. (B) Human and mouse gene information and sequence context for the c.857C>G p.pro286arg (P286R) mutation. c.857C>G p.pro286arg is equivalent in humans and mice (i.e., the position of the base and amino acid are the same in human and mouse cDNA and polypeptide reference sequences). (C) Schematic showing a portion of the mouse Pole locus used for generating the targeting construct. The XhoI site was used for insertion of the LSL cassette.
PoleP286R/+ mouse embryonic fibroblasts immortalize more efficiently. Mouse embryonic fibroblasts (MEFs) are useful for the characterization of cell growth and immortalization phenotypes (27). To study how PoleP286R influences these fundamental cell-growth properties, LSL-PoleP286R/+ heterozygous males were bred with heterozygous Ddx4-Cre/+ females. Ddx4-Cre is expressed only in germ cells and induces global recombination of floxed loci within oocytes, without mosaicism in 100% of the progeny, even in progeny that do not inherit Ddx4-Cre (maternal effect) (28). PoleP286R/+ and sibling Pole+/+ (hereafter referred to as +/+) embryos were harvested at E13.5 and used to establish MEF lines via 3T3 assays, in which 3 × 105 cells were serially passaged and their growth kinetics (i.e., cell doublings) measured every 3 days. The culture of murine fibroblasts induces p16INK4a and p19ARF, causing cell-cycle arrest in most cells within a few passages. The immortalization of WT MEFs requires serial passage and stochastic mutational events leading to p53 or p19ARF loss, among others (27). Initially, PoleP286R/+ and +/+ MEFs proliferated at similar rates, but the PoleP286R/+ MEFs bypassed tissue culture–induced senescence earlier (Figure 2A). This could also be clearly seen for a subset of 6 MEF lines obtained from the same litter (Figure 2B). By passage 20 (P20), PoleP286R/+ MEFs had undergone an average of 26.2 doublings versus 15.6 for +/+ MEFs (Figure 2A, P = 0.005, unpaired Student’s t test). Also, all PoleP286R/+ MEFs were successfully immortalized (10 of 10), whereas this was not the case for +/+ MEFs (7 of 10) (Figure 2A). The earlier, more frequent, and consistent immortalization of PoleP286R/+ MEFs was suggestive of an increased incidence of immortalizing mutations (see below). Of note, we found that markers of DNA damage were not elevated, indicating that the underlying process did not trigger DNA breaks or a general DNA damage response (Figure 2C).
Efficient immortalization and absence of DNA damage responses in PoleP286R/+ MEFs. (A) 3T3 assays of 10 PoleP286R/+ and 10 control +/+ MEF lines. (B) 3T3 assays of a select subset of MEF lines (embryos from a single litter) performed up to P52. (C) Western blot analysis of DNA damage markers and p16INK4a performed at P15, showing no constitutional abnormalities in PoleP286R/+ MEFs. Each lane corresponds to one of the MEF lines shown in B. Exposure durations are shown for each panel; long exposure durations were used for some markers to permit the visualization of bands and lack of differences between experimental and control samples. p-, phosphorylated.
We then performed a simple forward genetic screen to further explore these inferences. MEFs from P20 were subjected to 6-thioguanine (6-TG) selection. Within cells, 6-TG is converted by hypoxanthine-guanine phosphoribosyltransferase (HPRT) into cytotoxic metabolites. The HPRT gene is X-linked, and treatment with 6-TG selects for rare cells with inactivating HPRT mutations. Following standard determinations of plating efficiencies, male MEF lines were subjected to 6-TG selection. Consistent with early passage, the +/+ MEFs failed to give rise to resistant colonies, whereas 6-TG–resistant colonies arose consistently among PoleP286R/+ MEFs (106 per plate, n = 5 replicates per line) (Supplemental Figure 2A). Cloning and sequencing of HPRT cDNAs from the resistant colonies revealed a number of missense mutations and also some deletions (Supplemental Figure 2B). This experiment (together with the data below) demonstrates that PoleP286R leads to an increase in the overall mutation rate, with functional repercussions.
Polymerase-mediated ultramutagenesis in mice elicits malignant cancers of diverse lineages. To attempt to study the impact of PoleP286R across all cell lineages in vivo, cohorts of PoleP286R/+ and +/+ sibling controls were generated by breeding LSL-PoleP286R/+ males with Ddx4-Cre females. PoleP286R/+ heterozygous mice were born at the expected Mendelian ratios, and the animals appeared healthy, demonstrating that expression of PoleP286R in all cells of a living organism does not interfere with essential physiologic functions and is well tolerated, at least for a period of time. To identify age-related phenotypes, PoleP286R/+ mice (n = 50) were permitted to age naturally. While there were no deaths in the control cohort (n = 22) past 350 days, PoleP286R mice exhibited striking age-dependent mortality. The first deaths occurred at only 72 days of age, and the median age at death was 138 days, with all animals succumbing by 293 days (P < 0.00001 +/+ vs. P286R/+, Figure 3A).
PoleP286R/+ mice rapidly succumb to diverse cancers. (A) Survival analysis of 50 PoleP286R/+, 22 sibling control (+/+) mice, and 2 PoleP286R LSL-PoleP286R mice. P values were determined by log-rank test. (B–G) Examples of malignant neoplasms. (B) Large thymic T cell lymphoma expanding the chest cavity, with infiltration into the lungs (lg) and below the heart (h). (C) Angiosarcoma of the hind leg, with hemorrhage. (D) Angiosarcoma of the shoulder (different mouse from that shown in C). (E) Histiocytic “sarcoma” involving 1 ovary. These tumors are not true sarcomas, but rather malignant hematopoietic neoplasms with histiocytic differentiation. The tumor was disseminated and present in other tissues. (F) Aggressive SCC of the snout. (G) Adenocarcinoma of the lung. Scale bars: 1 cm.
The animals died of cancers (Figure 3, B–G). The cancer-prone phenotype was remarkable in that at least 1 malignant neoplasm was identified in all but 1 PoleP286R/+ mouse, and most had multiple distinct malignancies of diverse lineages, even within the same organ. The most common malignancies were lung adenocarcinomas and aggressive CD3+CD19– thymus-based T cell lymphomas that typically infiltrated the lungs and were widely disseminated, with some animals in the leukemic phase and with bone marrow infiltration (Figures 3 and 4). Other malignancies included diverse sarcomas, most commonly angiosarcomas, and diverse carcinomas including neuroendocrine, mammary gland, uterine, colon, and squamous cell carcinomas (SCCs), among others (Figures 3 and 4, and Supplemental Table 1A). The vast majority of tumors were highly invasive and obviously malignant, but some tumors appeared benign (e.g., hemangiomas; see Figure 5E for 1 example and Supplemental Table 1A). Some mice also exhibited dysplastic (in situ) epithelial precancers, e.g., in the colon and mammary gland (Figure 5B), but these were not systematically analyzed. In total, we identified 97 tumors in 47 necropsied mice (average of 2.1 tumors/mouse), with mice showing as many as 8 distinct malignant neoplasms. These results show that (a) PoleP286R is active across cell lineages, (b) PoleP286R acts in a genetically dominant manner in mice, (c) PoleP286R alone efficiently drives the formation of cancers of diverse epithelial, hematopoietic, and mesenchymal lineages, with the result that (d) PoleP286R/+ is among the most tumor-prone animal models described to date, to our knowledge — which is all the more remarkable for being based on a monoallelic point mutation leading to a single heterozygous amino acid substitution.
Characterization of thymus-based high-grade T cell lymphomas in PoleP286R/+ mice. (A) Low-magnification scan of H&E-stained slide showing large thymus-based T cell lymphoma, with infiltration between the lobes of the lung and pericardial and lung parenchymal infiltration (arrowheads). (B) Lymphomas had high-grade features, numerous mitoses (red arrowhead), and apoptotic bodies (black arrowhead). (C) IHC of paraffin-embedded, formalin-fixed tissue shows that tumor cells were CD3+CD19–. Admixed are rare CD19+ B lymphocytes. (D–F) Different mouse tissues illustrating an aggressive T cell lymphoma phenotype. (D) Extensive infiltration into pulmonary parenchyma in a characteristic perivascular pattern. (E) Infiltration within myocardium. (F) Infiltration within kidney. (G) Femur showing extensive infiltration. (H) Femur (bone marrow) from a +/+ mouse showing normal trilineage maturation for comparison. (I) CBCs of 9 mice with obvious thymic lymphomas at necropsy, also shown is 1 reference (+/+) sample. Asterisk indicates mice in the blast phase; their neutrophil counts were only mildly elevated, showing that leukocytosis was not due to sepsis/neutrophilia. Scale bars: 5 mm (A) and 50 μm (B–H).
Characterization of other lethal malignancies observed in PoleP286R/+ mice. (A) Lung adenocarcinoma with extensive infiltration (inset) and invasion through pleura. (B) Mammary gland carcinoma, with focus of ductal carcinoma in situ (DCIS) and invasive carcinoma (left). (C) Skin with invasive SCC, ranging from well differentiated to poorly differentiated. (D) Pancreatic adenocarcinoma; area shown is well differentiated. (E) Invasive endometrioid adenocarcinoma (left); inset shows low-magnification scan (actual size) of H&E-stained slide demonstrating endometrial adenocarcinoma in the left uterine horn and a benign hemangioma (ha) on the right horn. The endometrioid adenocarcinoma invaded through myometrium and uterine serosa into adjacent structures such as oviducts. (F) Colonic adenocarcinoma. (G) Histiocytic “sarcoma” in the liver (of hematopoietic origin/histiocytic differentiation). This tumor was widely disseminated and replaced more than 50% of the liver. (H) Angiosarcoma of the colon; entrapped colonic gland is visible in the upper right-hand corner. (I) Osteosarcoma with mineralized osteoid matrix. IHC stains are shown as insets in A, C, and G for selected lineage-specific markers confirming histotypes (TTF-1, ER, p63, and F4/80; ×50 magnification for all).Scale bars: 200 μm (A and C), 50 μm (B and D–I).
The presence of the LSL cassette renders the LSL-PoleP286R allele functionally null with respect to Pole function. PoleP286R LSL-PoleP286R mice would thus be genetically hemizygous and express only PoleP286R protein. To determine whether such mice might be viable, we conducted crosses between PoleP286R/+ and LSL-PoleP286R/+ mice. Of 164 live-born progeny, only 2 were PoleP286R LSL-PoleP286R versus the expected 41 (P < 0.00001), indicating embryonic lethality. Thus, PoleP286R hemizygosity resulted in a more severe phenotype than did heterozygosity, although rare escaper animals survived. The hemizygous live-born animals were minute and rapidly succumbed (both at 66 days of age) to aggressive T cell lymphomas, with each mouse also harboring multiple independent malignancies (P < 0.00001, P286R/LSL vs. P286R/+; Figure 3A and Supplemental Table 1B). Consistent with prior studies in yeast, these results show that the P286R amino acid substitution does not block DNA replication per se, but imply that the correspondingly higher levels of mutation associated with hemizygosity are incompatible with normal organismal development (16).
WGS of 20 samples reveals a high frequency of base substitutions in the range of human cancers. First, we performed WGS at an average depth of 21.7× for the 6 MEF lines shown in Figure 2B; representative quality control metrics are shown in Supplemental Figure 3. SNPs present in the parental mouse strains, which were independently sequenced (n = 2 samples per the 2 parental [FVB/n and 129/SvEvTac] strains), as well as known mouse SNPs were filtered out (29). To facilitate the accurate identification of de novo mutations in the setting of very high anticipated clonal variation and low allelic frequencies for the large number of variants, DNA was obtained from colonies cloned from single flow-sorted MEF cells. Such single-cell cloning of MEFs is possible only following immortalization, which occurred by P15 in the +/+ and PoleP286R/+ MEF lines. At P15, PoleP286R/+ MEF clones consistently showed high overall base substitution rates relative to those of WT controls (Figure 6A). To estimate the mutation rate per cell division, the experiment was repeated with P30 PoleP286R/+ clones from the same MEF lines. The P30 clones consistently showed higher base substitution rates. The PoleP286R/+ mutation rate was then “clocked” by calculating the mutation rate per cell division as determined by the 3T3 assays, which measure the number of cell doublings at each passage (Figure 2). This analysis yielded an estimate of 1.6 nucleotide substitutions per Mb per cell division, at least 3 orders of magnitude higher than the intrinsic DNA replication error rate (Figure 6B).
WGS analyses of 20 samples to determine genome-wide base substitution rates expressed as mutations per Mb. (A) WGS analysis of 9 MEF clones (same lines as shown in Figure 2B). DNA was prepared from single cell–derived clones isolated at P30 for the three +/+ MEF lines and at P15 and P30 for the three PoleP286R/+ MEF lines. Clones obtained from the same MEF lines at P15 and P30 are shown in the same color. (B) Data in A were used to calculate the number of mutations per cell division between P15 and P30. (C) Overall base substitution rates for 9 MEF lines shown in A and B, together with 11 primary tumors from PoleP286R/+ and PoleP286R/LSL mice. The 2 carcinomas from PoleP286R/LSL mice correspond to 1 lung adenocarcinoma and 1 cutaneous SCC. The blue and red lines indicate the average mutation rates in human lung and ultramutated cancers, respectively (13, 15).
We also conducted WGS on 7 PoleP286R/+ primary cancers, including 3 lung adenocarcinomas, 3 T cell lymphomas, 1 cutaneous SCC, and 4 cancers from the 2 PoleP286R LSL-PoleP286R (hemizygous) mice. Despite the presence of multiple tumors in some mice, the tumors could be inferred to be clonal, because they arose within otherwise histologically normal tissues, demonstrating that only very rare cells became neoplastic. Neoplastic transformation in this model can be anticipated to require unique and statistically improbable combinations of mutations in oncogenes and/or tumor suppressors. Thus, although clonal variation is expected to continue unabated, each tumor should contain a large number of mutations identifiable by WGS, as with human POLE–driven cancers, which are also clonal (15). WGS revealed a very high frequency of mutations, in the range of 10 to 100/Mb (Figure 6C). Notably, this is in the range for human malignancies such as lung cancers (11, 15). In contrast, lung carcinomas from diverse genetically engineered mouse models (Trp53/Rb1-, Egfr-, Myc-, and Kras-driven) of adenocarcinoma have consistently shown exceedingly low mutation burdens (~50 times lower than human lung cancers) (22–24). Thus, to our knowledge, PoleP286R/+ is the first genetically engineered mouse model to recapitulate the high mutation burden and therefore the clonal variation and heterogeneity intrinsic to human cancer.
The most common mutations were of the missense type, with significant numbers of nonsense and splicing mutations. Indels were correspondingly rare (Figure 7), and PoleP286R did not produce detectable large-scale rearrangements in most tumors (Supplemental Figure 4). We then tabulated base substitutions in the context of flanking bases to determine whether there was a recurring PoleP286R-related mutational signature with respect to trinucleotide context (Supplemental Figure 5). PoleP286R/+ cancers and MEFs showed distinct and consistent signatures across samples, with a high incidence of C>A and C>T substitutions and correspondingly rarer C>G and T>A substitutions. Of note, C>A substitutions occurred most frequently in a TCT flanking base context, as in POLE-mutant human cancers, although our data show a greater dependence on the 3′ base (i.e., a higher incidence of substitutions in NCT than in TCN trinucleotides). We also noted that T>G substitutions occurred with a bias toward a TTT context in some samples, as previously observed for POLE-driven human cancers, but with a bias against this context in other samples; the biological basis of these differences is uncertain. In contrast, the +/+ control MEF clones did not have this signature (Supplemental Figure 5). Thus, we document a mutational signature in murine PoleP286R/+ MEFs and tumors of diverse lineages that resembles signature 10, which was previously described in POLE exonuclease domain–mutant tumors (30–32).
WGS-derived mutation types in PoleP286R/+ (heterozygous) MEFs and tumors and (hemizygous) PoleP286R/LSL tumors. (A) Deletions and insertions were rare, consistent with a preponderance of single base substitutions. Somatic mutations located within coding sequences were selected. Mutations were assigned to different categories using the SnpEff tool with GENECODE M16 as a reference. The x axis represents the number of somatic mutations in each sample. (B) Distributions of allele frequencies for representative samples. Mutations were classified into 3 categories: missense, synonymous, and nonsense.
This signature should skew toward specific codon substitution patterns. Indeed, as previously reported for human POLEP286R tumors, arginine was the most frequently substituted amino acid, and replacements by nonsense codons (most frequently from glutamate or arginine) were among the most frequent codon substitutions (Supplemental Table 2 and ref. 30). However, as stated above, missense mutations were the most common subtype of mutation in PoleP286R tumors (Figure 7). These findings reveal that PoleP286R is particularly efficient at generating missense and nonsense mutations capable of inactivating tumor suppressor loci, although in some cases, nonsense mutations can also turn on protein activity (i.e., by removing regulatory C-terminal domains). Inspection of mutations in the WGS data of cancer driver genes selected on the basis of their relevance to the observed tumors (Notch1, Kras, Trp53, and Pten) illustrated this point. For example, all Notch1 mutations occurred in the T cell lymphomas, consistent with the presence of frequent Notch1 mutations in human T cell acute lymphoblastic leukemia. One of the mutations was a truncating c.7395C>A p.Tyr2465→STOP mutation, a well-known hot spot for Notch1 hyperactivating mutations that remove the negative regulatory C-terminal PEST domain (33). Truncating mutations were also identified in Pten and Trp53 (Supplemental Table 3). Canonical Kras activating mutations c.182A>G p.Gln61Arg and c.436G>A p.Ala146Thr were also observed in 2 of the cancers, further emphasizing that PoleP286R drives carcinogenesis through gain-of-function mutations in oncogenes as well as inactivating loss-of-function mutations in tumor suppressors (34).
This study establishes polymerase-mediated ultramutagenesis as a genetic approach to the generation and study of cancers in animal models. The anticipated deleterious effect of PoleP286R on DNA replication fidelity called for a conditional strategy. We engineered the conditional knockin allele LSL-PoleP286R, in which expression of PoleP286R is blocked by the LSL cassette (26). Cre-mediated recombination efficiently deleted the cassette, converting the LSL-PoleP286R allele to PoleP286R, resulting in PoleP286R expression at the same level as the WT allele. The LSL-PoleP286R allele is functionally null with respect to Pole function; LSL-PoleP286R/+ mice thus only had 1 functional Pole allele, but were nonetheless healthy and fertile. This result is consistent with the rare human hereditary condition known as FILS syndrome (facial dysmorphism, immunodeficiency, livedo, and short stature) described in a consanguineous kindred. Affected individuals carry a homozygous nucleotide substitution that strongly affects POLE splicing, leading to a 90% decrease in overall POLE protein levels. However, heterozygous carriers are asymptomatic, demonstrating that POLE protein levels approximately 50% below normal do not have a discernable physiologic impact (35).
A mouse Pole–knockin allele (Polee) has been described previously, in which the conserved and functionally essential D272 and E274 residues in the 3′ exonuclease (proofreading) domain were replaced with alanines (36). These substitutions preserve normal polymerase activity, but selectively inactivate the proofreading function. Heterozygous Polee/+ mice did not exhibit any phenotypes or decreased longevity, demonstrating that heterozygous loss of Pole proofreading activity has minor functional effects, if any. In contrast, homozygous Polee/e mice had a median survival of 16 months and progressively succumbed to cancers beginning at 9 months of age. The most frequent tumors were intestinal adenomas/adenocarcinomas (45% incidence), followed by histiocytic sarcomas (36%), lymphomas (24%), and lung adenocarcinomas (12%) (36). Although primary tumors were not analyzed, mutation rates in Polee/e MEF lines were elevated, consistent with a general mutator phenotype due to defective proofreading. Some interesting and notable distinctions can be drawn between PoleP286R and Polee. First, PoleP286R is a genetically dominant neomorphic allele (19, 20), whereas Polee is recessive (loss of function). This simplifies the breeding schema considerably, as only a single allele is needed for strong phenotypic expression. Second, the much greater severity of the heterozygous PoleP286R/+ phenotype versus the homozygous Polee/e phenotype (median survival of 5 months vs. 16 months) is in keeping with prior studies in yeast showing that PoleP286R causes a much stronger mutator phenotype than does mere inactivation of proofreading activity (16, 18), as well as prior analyses of human tumors, in which PoleP286R mutations were heterozygous with retention of the WT allele (15, 16, 30).
Another notable feature of the cancers in PoleP286R/+ mice was their aggressive clinical features and histology. For example, the T cell lymphomas were large, and many were widely disseminated and in the leukemic phase with bone marrow infiltration, a markedly aggressive phenotype for a mouse model of T cell lymphoma (37, 38). Also, few benign tumors were identified, with most tumors exhibiting striking tissue invasion and thus representing obvious malignancies. We anticipate that the LSL-PoleP286R allele could be used to generate a wide range of murine cancers through the use of tissue-specific Cre drivers or through other approaches such as the use of Adeno-Cre to recapitulate Pole ultramutator phenotypes (i.e., in cancers in which this phenomenon is frequent) or the high mutational loads that are common in many human cancers. It will be of interest to determine whether conditional expression of PoleP286R can induce malignant transformation of any cell lineage in the mouse, although the observed exceptionally rapid onset of a wide spectrum of malignancies encompassing diverse cell lineages (mesenchymal, epithelial, hematopoietic), together with the potent mutator phenotype documented in MEFs, suggests that this will be the case. It is also noteworthy that such mice can be efficiently and rapidly generated in a single generation (i.e., by breeding LSL-PoleP286R/+ with any Cre-driver line). It should also be possible to combine LSL-PoleP286R with other alleles that drive malignant transformation to efficiently create high base substitution rates in any genetically engineered mouse model of cancer.
WGS revealed high base substitution rates, in line with those described for most human carcinomas and with a similar mutational signature previously described for POLE-driven human tumors (30), consistent with strong sequence and functional conservation among the human and mouse polymerases. We observed similar mutational signatures among MEF clones and tumors, arguing that the signatures are not dependent on selective pressures unique to tumor progression in vivo. It is also not clear why some samples (including both MEFs and tumors) showed a T>G bias in a TTT context, while others did not, although such variability has been documented in human tumors (15). While a specific mutational signature associated with POLE proofreading domain mutations (signature 10) has been described (29–32), it should be noted that the impact of other biological factors (such as chemotherapy or radiation therapy, among many other variables) on signature 10 remains incompletely defined. The role of MMR deficiency in PoleP286R/+ mouse tumors remains to be formally determined. Although the rapid kinetics of mutation in the MEFs and in vivo argues against the necessity of superimposed MMR deficiency, several studies have found evidence for synergism between MMR deficiency and POLE proofreading domain mutations, as could be expected, given that the MMR pathway repairs most DNA replication errors (15, 39–42).
One interesting open question is the impact of this signature on the specific range of mutations that serve as effective cancer drivers in diverse lineages. On one hand, as we and others have shown, PoleP286R is particularly effective at generating stop codons, implying that inactivation of tumor suppressors should be frequent in such tumors (30). Indeed, we identified nonsense mutations in Notch1, Trp53, and Pten. At the same time, diverse amino acid substitutions were also dramatically increased, creating a rich assortment of codon substitutions with potential oncogenic impact. This is illustrated by the 2 KRAS activating mutations identified in this study, one in a lung adenocarcinoma (c.182A>G; Q61R) and the other in a T cell lymphoma (c.436G>A; A146T). Both are known oncogenic KRAS mutations previously described in human cancer, although Q61R is much more common in human cancers and in carcinogen-induced mouse lung tumors (24). However, while A146T occurs in a favored AGC/GCT context, Q61R occurs in a strongly disfavored CAA/TTG context, demonstrating that both favored and unfavored substitutions can occur in PoleP286R-driven cancers.
One limitation of classic genetically engineered mouse models of cancer — and one to which increasing attention has been drawn — is their surprisingly low overall mutation frequencies, which are far below what is observed in the vast majority of human cancers (22, 23). These properties make most such models particularly useful for certain studies, such as those of the functional or signaling consequences of specific tumor suppressor or oncogene mutations, but greatly limit their utility for others, such as studies of tumor heterogeneity and clonal variation or the impact of mutational load/base substitution rates on tumor behavior and response to therapy. The latter has become particularly relevant with the advent of immune checkpoint therapies, as there is growing evidence that treatment success correlates with a high incidence of somatic mutations irrespective of tumor type (43). That knockout/knockin mouse models of cancer have not proven particularly useful for testing immune checkpoint therapies probably relates to mutational burdens too low to recapitulate human tumor immunology or trigger robust immune responses following immune checkpoint therapy (44, 45). In this context, PoleP286R could be used to “humanize” a wide range of cancer models with respect to mutational load, opening new avenues of investigation. PoleP286R-mediated ultramutagenesis may also prove useful for a variety of studies such as forward genetic screens for diverse phenotypes such as drug resistance, either in vivo or in vitro, as illustrated by the 6-TG studies.
In summary, we present an approach to the generation of mouse models of cancer based on the POLE ultramutator phenotype recently described in human cancers. Mutations accumulate initially in a biologically silent manner, without promoting cell death or triggering DNA damage responses. Mice expressing heterozygous PoleP286R mutations died of multiple aggressive malignancies of diverse lineages, resulting in one of the most cancer-prone animal models described to date. WGS of the tumors revealed very high mutation frequencies, in the range of 10 to 100/Mb described for human malignancies such as lung cancers. This Pole-based mouse model is the first genetic model to our knowledge to recapitulate the very high mutational burdens and therefore the clonal variation and heterogeneity intrinsic to human cancer. Our data demonstrate that this approach will enable a wide range of genetic approaches in the study of cancer or other biological processes.
Generation and validation of the PoleP286R(L) (LSL-P286R) conditional knockin allele. First, a 8.0-kb genomic fragment including exons 3–14 was generated by PCR from 129/SvEvTac DNA using the primers L3 (5′-GGTGTCGACGATGAGTGATAGAGCAGAGGAAGGCACG-3′) and R3 (5′-GGCGGCCGCGTCTCCAGAATCATTCCACCTCAAGCATC-3′). SalI and NotI sites (underlined) were added to the 5′ ends of the L and R primers. PCR was performed with Phusion High Fidelity Taq (New England BioLabs) under the following conditions: initial denaturation at 98°C for 3 minutes followed by 24 cycles of denaturation at 98°C for 1 minute, annealing at 65°C for 30 seconds, and amplification at 72°C for 6 minutes. Following addition of 3′ A overhangs to the gel-purified PCR product by incubating with ATP and Taq polymerase at 37°C for 30 minutes, the 8.0-kb fragment was cloned into pCR2.1TOPO (Invitrogen, Thermo Fisher Scientific). The 8.0-kb insert was Sanger sequenced to confirm the absence of PCR mutations. Second, the CCT→CGT mutation resulting in the P286R amino acid substitution (exon 9, underlined, see also Figure 1A) was generated by in vitro mutagenesis with the oligonucleotide 5′-CCAAACTGCCTCTCAAATTCCGTGATGCTGAGACCGATCAG-3′ using the QuickChange II XL Site-Directed Mutagenesis Kit (Agilent Technologies). Site-specific mutagenesis and absence of other undesired mutations were confirmed by Sanger sequencing of all exons. Third, the Pole genomic fragment containing the mutation was gel purified following digestion with SalI and Not and then subcloned into the pKOII targeting construct (contains a pGKNeo positive selection cassette flanked by frt sites and a diphtheria toxin negative selection cassette) digested with XhoI and NotI (46). This procedure removes the pGKNeo cassette from pKOII, which is not needed, as this functionality is provided by LSL (see below); however, the pKOII diphtheria toxin negative selection marker is retained. Fourth, the LSL cassette (5.8 kb) was gel purified following XhoI digestion of plasmid LSL TOPO (Addgene no. 11584) and inserted into the naturally occurring XhoI site within intron 7–8 by conventional cloning methods including phosphatase treatment of the vector fragment (Figure 1C). A clone with the desired orientation was identified by Sanger sequencing. In addition to the tetrameric tandem array of SV40 polyadenylation sequences (the “stopper”) and additional sequences including a strong splice acceptor site to prevent splicing around the stopper, the LSL cassette contains a puromycin selection marker (26). The insertion of the LSL cassette created targeting arms 1 (3.9 kb) and 2 (4.1 kb) from the original 8.0-kb cloned Pole genomic fragment. The targeting construct was sequenced in its entirety to confirm the desired structure; also, the functionality of the LoxP sites (and deletion of the intervening sequences) was confirmed by Sanger sequencing following transformation into the Cre-expressing E. coli strain 294-Cre (47).
Following NotI linearization and column purification of 300 μg of the targeting construct (18.0 kb), DNA was electroporated into SM-1 ES cells (derived from the 129/SvEvTac inbred mouse strain), and subsequent “plus/minus” selection (puromycin/diphtheria toxin) was conducted by standard methods (48). Screening for homologous recombination was then performed by PCR followed by Sanger sequencing to confirm the presence of the Pole point mutation. Two clones were selected for blastocyst injection. Chimeric mice were bred with 129/SvEvTac mice, and germline transmission and confirmation of allele function of both lines (e.g., Cre-mediated recombination at the LoxP sites) was performed as described in the text. One LSL-PoleP286R line was selected and had been maintained on a pure 129/SvEvTac genetic background in the heterozygous state by breeding with 129/SvEvTac mice (Taconic Biosciences).
Mouse husbandry, survival analysis, and clinical assays. Mice were housed in a pathogen-free animal facility in microisolator cages and fed standard chow ad libitum. Survival analyses were conducted on experimental and control animals selected at the time of weaning. Complete blood counts (CBCs) were performed on 50 μl blood collected into K3-EDTA tubes (Sarstadt), with gentle mixing of the sample by gently inverting the tube multiple times immediately after collection. The mouse CBC analyses were performed on a ProCyte Dx Hematology Analyzer (IDEXX).
Genotyping and reverse transcription PCR. Tail DNA was extracted using standard protocols (49). MEF DNA was extracted using the QiaAmp DNA Mini Kit (catalog 51306, QIAGEN). Multiplex genotyping was performed with 10 μM of the primers 220 (5′-ACATTCGCCTCTCCATTGAC-3′), 222, (5′-ATCATCTCGTCGGGTGATTT-3′) and 5F (5′-GAATTCCGCAAGCTAGCCAC-3′) and GoTaq polymerase in 1× GoTaq buffer in 20 μl of total reaction volumes. The PCR conditions were as follows: initial denaturation step of 95°C for 3 minutes, followed by 35 cycles of 95°C for 30 seconds, 65°C for 30 seconds, and 72°C for 45 seconds, followed by 72°C for 10 minutes. The PCR products were as follows: WT (220 + 222) 413 bp, P286R (220 + 222) 531 bp, and LSL (5F + 222) 512 bp.
For reverse transcription PCR (RT-PCR), RNA extraction was performed with the RNeasy Mini Kit (QIAGEN). The High-Capacity cDNA Reverse Transcription Kit (Thermo Fisher Scientific) was used to reverse transcribe 1 μg RNA. PCR was performed on 20 ng cDNA, 0.2 mM each dNTP, and 2 mM MgCl2, and with an intron-spanning primer pair (forward: 5′-GTGGACATGCGGGAGTATGA-3′; reverse: 5′-TTCTGGCTTAGGGGTGAACTC-3′) and an initial denaturation step of 95°C for 2 minutes, followed by 35 cycles of 95°C for 15 seconds, 60°C for 15 seconds, and 72°C for 30 seconds for 35 cycles, followed by 72°C for 3 minutes. PCR products were purified with the QIAquick PCR Purification Kit (QIAGEN) and then Sanger sequenced.
MEF generation and 6-TG selection. MEF lines were prepared using standard methods (27). Timed matings were established, with the morning of the vaginal plug being dated as E0.5. Embryos were harvested at E13.5, and genotyping of mutant and control lines was performed on the unattached cells 1 day after initial plating. 3T3 assays were performed as described previously (27). One MEF line per genotype was used for HPRT mutation rate analysis. For 6-TG selection, 1 × 106 cells were plated per 10-cm dish in DMEM and 10% FBS with 60 μM 6-TG (MilliporeSigma, A4882) (50). Cells were subjected to 6-TG selection for 4 weeks, with fresh media containing 6-TG changed every week. Cells (5 × 102) per 10-cm dish were plated to determine plating efficiency. Colonies were stained with 1% methylene blue and counted. Randomly selected colonies were isolated and expanded for HPRT RT-PCR. HPRT RT-PCR was performed on 20 ng cDNA, with 0.2 mM each dNTP and 2 mM MgCl2 and an intron-spanning primer pair (forward: 5′-ATGCCGACCCGCAGTCC-3′; reverse: 5′-TTAGGCTTTGTATTTGGCTTTTCCA-3′) and an initial denaturation step of 95°C for 2 minutes, followed by 35 cycles of 95°C for 15 seconds, 60°C for 15 seconds, and 72°C for 60 seconds, followed by 72°C for 3 minutes. PCR products were purified with the QIAquick PCR Purification Kit and Sanger sequenced. Graphical representation of data was generated with MultAlin (51).
MEF protein lysates were prepared with RIPA buffer (catalog 89900, Pierce, Thermo Fisher Scientific) with a 1:100 protease inhibitor cocktail (MilliporeSigma, catalog P8340) and 0.5 mM DTT (Invitrogen, Thermo Fisher Scientific, catalog P2325). Lysates were loaded onto NuPAGE 4%–12% Bis-Tris Protein Gels (Invitrogen, Thermo Fisher Scientific, catalog NP0336BOX) and run at 150 V for 2 hours. Proteins were wet transferred onto PVDF membranes (MilliporeSigma, catalog PVH00010). Antibodies used for Western blots were from the DNA Damage Antibody Sampler Kit (Cell Signaling Technology, catalog 9947, all at 1:1,000 dilution), p16INK4a (Abcam, catalog ab189034, 1:500), and GAPDH (Cell Signaling Technology, catalog 2118, 1:10,000). A Bio-Rad ChemiDoc system was used for Western gel chemiluminescence detection for the indicated exposure durations (Figure 2C).
Tissue processing and IHC. Fixation, sectioning, antigen retrieval, blocking, and secondary detection for primary antibody dilutions in 2% BSA were performed as previously described (52). The following antibodies were used for IHC: CD3 (Abcam, catalog ab5690 rabbit polyclonal Ab [pAb], 1:2,000 dilution), CD19 (Cell Signaling Technology, catalog 90176 rabbit mAb, 1:1,600 dilution), ERα (Santa Cruz Biotechnology, catalog sc-532 rabbit pAb, 1:1,000 dilution), p63 (GeneTex, catalog GTX102425 rabbit pAb, 1:1,000 dilution), F4/80 (Abcam, catalog ab6640, rat mAb, 1:100 dilution), and TTF1 (Abcam, catalog ab76013, rabbit mAb, 1:250 dilution) (53).
DNA and library preparation. DNA from all cell and tissue samples used for high-throughput sequencing were extracted with the QiaAmp DNA Mini Kit (QIAGEN, catalog 51306). Concentration was measured with the Qubit fluorometer (Invitrogen, Thermo Fisher Scientific). Sample integrity and purification were confirmed by agarose gel electrophoresis. For the preparation of libraries, 1.5 μg DNA was fragmented with a Covaris ultrasonicator and then analyzed by gel electrophoresis to evaluate the fragmented DNA. The fragmented DNA was combined with End Repair Mix (Thermo Fisher Scientific) and incubated at 20°C for 30 minutes. The end-repaired DNA was purified with the QIAquick PCR Purification Kit (QIAGEN), followed by addition of A-Tailing Mix (New England BioLabs), and incubated at 3°C for 30 minutes. This was combined with the purified Adenylate 3′ ends DNA, adapter and ligation mix, and the ligation reaction was incubated at 20°C for 15 minutes. Adapter-ligated DNA was selected by electrophoresis on a 2% agarose gel to recover the target fragments. The DNA was gel purified with the QIAquick Gel Extraction Kit (QIAGEN). Several rounds of PCR amplification with PCR Primer Cocktail (Illumina) and PCR Master Mix (Illumina) were performed to enrich the adapter-ligated DNA fragments. Then, the PCR products were selected by electrophoresis on a 2% agarose gel to recover the target fragments, followed by gel purification with the QIAquick Gel Extraction Kit. The final library was analyzed in 2 ways: (a) determination of the average molecule length using the Agilent 2100 Bioanalyzer (Agilent DNA 1000 Reagents), and (b) real-time PCR quantification of the library by TaqMan assay (Thermo Fisher Scientific). The qualified libraries were loaded onto the HiSeq Xten Sequencer (Illumina) for paired-end sequencing, with read lengths of 100 to 150 bp.
Variant calling. Reads were mapped to the mouse reference genome (GRCm38) using BWA 0.7.17 (54). The duplicated reads were then marked using Picard (Broad Institute), and the base quality score recalibration was applied using GATK 4.0 (55). SNP and INDEL discovery across all samples was performed using SAMtools (56). A mutation was considered to exist in a sample only if the alternative allele frequency was greater than 10%. A mutation was considered somatic when it was not a known mouse variant from the Mouse Genomes Project (57), and the mutation was identified in only 1 sample and not in any other samples. All mutations were annotated using the SnpEff tool (58) on the basis of GENCODE M16 annotation (59).
Depth analyses. Average depth in each window (1 Mbp) was estimated using SAMtools. Raw depth was normalized by dividing the median depth across the genome, followed by log2 transformation.
Data availability. Sequence data supporting the findings of this study have been deposited in the NCBI’s Sequence Read Archive (SRA) database (SRA SRP150647).
Statistics. Data are presented as the mean ± SEM unless otherwise indicated. To determine P values, a 2-tailed Student’s t test or Fisher’s exact test was performed unless otherwise indicated. A P value of less than 0.05 was considered statistically significant. For survival curves, Kaplan-Meier analysis was used, with statistical comparison among curves performed using the log-rank test. Routine statistical analyses were conducted with either GraphPad Prism, version 7 (GraphPad Software) or Microsoft Excel. No statistical method was used to predetermine sample sizes. The experiments were not randomized, and the investigators were not blinded to the treatment allocation during experiments or outcome assessments.
Study approval. The animal studies were approved by the IACUC of the UTSW Medical Center.
IC, HDL, and DHC conceived the study. IC and DHC designed and generated the allele and mice. HDL and IC performed the experiments. MZ, YXF, CL, EAA, and MMA assisted with mouse studies. DHC and MJY conducted the immunohistochemical and histologic characterization of tumors. HZ designed and conducted the computational and data analyses. DHC, HDL, and IC wrote the manuscript, with input from all authors.
We thank Bradley Preston (University of Washington) for helpful discussions and Jim Malter (UTSW) and Ram Mani (UTSW) for comments on the manuscript. We gratefully acknowledge the excellent support from Robin Nguyen and Bob Hammer (UTSW Transgenic Core Facility) and Eduardo Bustamante (UTSW Animal Resource Center). This work was supported by the Vernie A. Stembridge MD fund and NIH, National Cancer Institute (NCI) grants R01HD086093 and R01CA211339 (to DHC) and R01CA164346 and R01CA200703 (to MJY). The UTSW Lyda Hill Department of Bioinformatics Core Facility and HZ were funded by the Cancer Prevention and Research Institute of Texas (RP150596).
Address correspondence to: Diego H. Castrillon, UT Southwestern Department of Pathology, 6000 Harry Hines Boulevard, Dallas Texas 75390-9072, USA. Phone: 214.648.4032; Email: diego.castrillon@utsouthwestern.edu.
Conflict of interest: The authors have declared that no conflict of interest exists.
Reference information: J Clin Invest. 2018;128(9):4179–4191.https://doi.org/10.1172/JCI122095.
See the related Commentary at A simple but profound mutation in mouse DNA polymerase ε drives tumorigenesis.