Proceedings of the National Academy of Sciences, 2015
Nonhomologous end joining (NHEJ) repairs chromosome breaks and must remain effective in the face ... more Nonhomologous end joining (NHEJ) repairs chromosome breaks and must remain effective in the face of extensive diversity in broken end structures. We show here that this flexibility is often reliant on the ability to direct DNA synthesis across strand breaks, and that polymerase (Pol) μ and Pol λ are the only mammalian DNA polymerases that have this activity. By systematically varying substrate in cells, we show each polymerase is uniquely proficient in different contexts. The templating nucleotide is also selected differently, with Pol μ using the unpaired base adjacent to the downstream 5' phosphate even when there are available template sites further upstream of this position; this makes Pol μ more flexible but also less accurate than Pol λ. Loss of either polymerase alone consequently has clear and distinguishable effects on the fidelity of repair, but end remodeling by cellular nucleases and the remaining polymerase helps mitigate the effects on overall repair efficiency. Accordingly, when cells are deficient in both polymerases there is synergistic impact on NHEJ efficiency, both in terms of repair of defined substrates and cellular resistance to ionizing radiation. Pol μ and Pol λ thus provide distinct solutions to a problem for DNA synthesis that is unique to this pathway and play a key role in conferring on NHEJ the flexibility required for accurate and efficient repair.
Elucidation of mutagenic processes shaping cancer genomes is a fundamental problem whose solution... more Elucidation of mutagenic processes shaping cancer genomes is a fundamental problem whose solution promises insights into new treatment, diagnostic and prevention strategies. Single-strand DNA-specific APOBEC cytidine deaminase(s) are major source(s) of mutation in several cancer types. Previous indirect evidence implicated APOBEC3B as the more likely major mutator deaminase, whereas the role of APOBEC3A is not established. Using yeast models enabling the controlled generation of long single-strand genomic DNA substrates, we show that the mutation signatures of APOBEC3A and APOBEC3B are statistically distinguishable. We then apply three complementary approaches to identify cancer samples with mutation signatures resembling either APOBEC. Strikingly, APOBEC3A-like samples have over tenfold more APOBEC-signature mutations than APOBEC3B-like samples. We propose that APOBEC3A-mediated mutagenesis is much more frequent because APOBEC3A itself is highly proficient at generating DNA breaks,...
A perfluorocarbon nanodroplet formulation is shown to be an effective cavitation enhancement agen... more A perfluorocarbon nanodroplet formulation is shown to be an effective cavitation enhancement agent, enabling rapid and consistent fragmentation of genomic DNA in a standard ultrasonic water bath. This nanodroplet-enhanced method produces genomic DNA libraries and next-generation sequencing results indistinguishable from DNA samples fragmented in dedicated commercial acoustic sonication equipment, and with higher throughput. This technique thus enables widespread access to fast bench-top genomic DNA fragmentation.
The bioethanol production system used in Brazil is based on the alcoholic fermentation of sucrose... more The bioethanol production system used in Brazil is based on the alcoholic fermentation of sucrose derived from sugarcane feedstock by highly adapted strains of the yeast Saccharomyces cerevisiae. PE-2 and CAT-1 are the most productive and widely adopted S. cerevisiae strains used by distilleries in Brazil. Due to no sterile condition at industrial scale, the process carries a variety of bacterial contaminants that are regularly related to yeast-bacteria co-aggregation phenotype, decreasing bioethanol yield. In this study we investigate the molecular physiology of the main S. cerevisiae commercial strain (PE-2) used on Brazilian bioethanol process under two distinct conditions: typical fermentation and flocculated (co-aggregated) fermentation. We collected samples on 13 time-points (6 time-points of typical fermentation and 7 time-points of flocculated fermentation) on sugarcane mills. Transcriptional machinery of PE-2 was assessed by high throughput sequencing-based methods (RNA-seq...
Responsible for the Irish potato famine of 1845–49, the oomycete pathogen Phytophthora infestans ... more Responsible for the Irish potato famine of 1845–49, the oomycete pathogen Phytophthora infestans caused persistent, devastating outbreaks of potato late blight across Europe in the 19th century. Despite continued interest in the history and spread of the pathogen, the genome of the famine-era strain remains entirely unknown. Here we characterize temporal genomic changes in introduced P. infestans. We shotgun sequence five 19th-century European strains from archival herbarium samples—including the oldest known European specimen, collected in 1845 from the first reported source of introduction. We then compare their genomes to those of extant isolates. We report multiple distinct genotypes in historical Europe and a suite of infection-related genes different from modern strains. At virulence-related loci, several now-ubiquitous genotypes were absent from the historical gene pool. At least one of these genotypes encodes a virulent phenotype in modern strains, which helps explain the 20th century’s episodic replacements of European P. infestans lineages.
In fungi, unisexual reproduction, where sexual development is initiated without the presence of t... more In fungi, unisexual reproduction, where sexual development is initiated without the presence of two compatible mating type alleles, has been observed in several species that can also undergo traditional bisexual reproduction, including the important human fungal pathogens Cryptococcus neoformans and Candida albicans. While unisexual reproduction has been well characterized qualitatively, detailed quantifications are still lacking for aspects of this process, such as the frequency of recombination during unisexual reproduction, and how this compares with bisexual reproduction. Here, we analyzed meiotic recombination during α-α unisexual and a-α bisexual reproduction of C. neoformans. We found that meiotic recombination operates in a similar fashion during both modes of sexual reproduction. Specifically, we observed that in α-α unisexual reproduction, the numbers of crossovers along the chromosomes during meiosis, recombination frequencies at specific chromosomal regions, as well as m...
Oxidative DNA damage is likely to be involved in the etiology of cancer and is thought to acceler... more Oxidative DNA damage is likely to be involved in the etiology of cancer and is thought to accelerate tumorigenesis via increased mutation rates. However, the majority of malignant cells acquire a specific type of genomic instability characterized by large-scale genomic rearrangements, referred to as chromosomal instability (CIN). The molecular mechanisms underlying CIN are not entirely understood. We utilized Saccharomyces cerevisiae as a model system to delineate the relationship between genotoxic stress and CIN. It was found that elevated levels of chronic, unrepaired oxidative DNA damage caused chromosomal aberrations at remarkably high frequencies under both selective and nonselective growth conditions. In this system, exceeding the cellular capacity to appropriately manage oxidative DNA damage resulted in a "gain-of-CIN" phenotype and led to profound karyotypic instability. These results illustrate a novel mechanism for genome destabilization that is likely to be rele...
In the yeast Saccharomyces cerevisiae, certain genomic regions have very high levels of meiotic r... more In the yeast Saccharomyces cerevisiae, certain genomic regions have very high levels of meiotic recombination (hot spots). The hot spot activity associated with the HIS4 gene requires the Bas1p transcription factor. To determine whether this relationship between transcription factor binding and hot spot activity is general, we used DNA microarrays to map all genomic Bas1p binding sites and to map the frequency of meiosis-specific double-strand DNA breaks (as an estimate of the recombination activity) of all genes in both wild-type and bas1 strains. We identified sites of Bas1p-DNA interactions upstream of 71 genes, many of which are involved in histidine and purine biosynthesis. Our analysis of recombination activity in wild-type and bas1 strains showed that the recombination activities of some genes with Bas1p binding sites were dependent on Bas1p (as observed for HIS4), whereas the activities of other genes with Bas1p binding sites were unaffected or were repressed by Bas1p. These...
Ongoing Cryptococcus gattii outbreaks in the Western United States and Canada illustrate the impa... more Ongoing Cryptococcus gattii outbreaks in the Western United States and Canada illustrate the impact of environmental reservoirs and both clonal and recombining propagation in driving emergence and expansion of microbial pathogens. C. gattii comprises four distinct molecular types: VGI, VGII, VGIII, and VGIV, with no evidence of nuclear genetic exchange, indicating these represent distinct species. C. gattii VGII isolates are causing the Pacific Northwest outbreak, whereas VGIII isolates frequently infect HIV/AIDS patients in Southern California. VGI, VGII, and VGIII have been isolated from patients and animals in the Western US, suggesting these molecular types occur in the environment. However, only two environmental isolates of C. gattii have ever been reported from California: CBS7750 (VGII) and WM161 (VGIII). The incongruence of frequent clinical presence and uncommon environmental isolation suggests an unknown C. gattii reservoir in California. Here we report frequent isolation...
BackgroundThe bioethanol production system used in Brazil is based on the fermentation of sucrose... more BackgroundThe bioethanol production system used in Brazil is based on the fermentation of sucrose from sugarcane feedstock by highly adapted strains of the yeast Saccharomyces cerevisiae. Bacterial contaminants present in the distillery environment often produce yeast-bacteria cellular co-aggregation particles that resemble yeast-yeast cell adhesion (flocculation). The formation of such particles is undesirable because it slows the fermentation kinetics and reduces the overall bioethanol yield.ResultsIn this study, we investigated the molecular physiology of one of the main S. cerevisiae strains used in Brazilian bioethanol production, PE-2, under two contrasting conditions: typical fermentation, when most yeast cells are in suspension, and co-aggregated fermentation. The transcriptional profile of PE-2 was assessed by RNA-seq during industrial scale fed-batch fermentation. Comparative analysis between the two conditions revealed transcriptional profiles that were differentiated pri...
Recombination between repeated DNA sequences can have drastic consequences on the integrity of th... more Recombination between repeated DNA sequences can have drastic consequences on the integrity of the genome. Repeated sequences are abundant in most eukaryotes, yet the mechanism that prevents recombination between them is currently unknown. Ty elements, the main family of dispersed repeats in Saccharomyces cerevisiae, exhibit low levels of exchange. Other regions in the genome have relatively high rates of meiotic recombination (hotspots). We show that a Ty element adjacent to the HIS4 recombination hotspot substantially reduces its activity, eliminating local DSB formation. We demonstrate that the Ty has a closed (nuclease-insensitive) chromatin configuration that is also imposed on the flanking DNA sequences. The compact chromatin structure is determined by sequences at the N terminus of the Ty. Increased binding of the Rap1 protein to the hotspot restores both open chromatin conformation and DSB formation. The chromatin configuration of Ty elements precludes initiation of recombination, thus preventing potentially lethal exchanges between repeated sequences.
Mutational heterogeneity must be taken into account when reconstructing evolutionary histories, c... more Mutational heterogeneity must be taken into account when reconstructing evolutionary histories, calibrating molecular clocks, and predicting links between genes and disease. Selective pressures and various DNA transactions have been invoked to explain the heterogeneous distribution of genetic variation between species, within populations, and in tissue-specific tumors. To examine relationships between such heterogeneity and variations in leading- and lagging-strand replication fidelity and mismatch repair, we accumulated 40,000 spontaneous mutations in eight diploid yeast strains in the absence of selective pressure. We found that replicase error rates vary by fork direction, coding state, nucleosome proximity, and sequence context. Further, error rates and DNA mismatch repair efficiency both vary by mismatch type, responsible polymerase, replication time, and replication origin proximity. Mutation patterns implicate replication infidelity as one driver of variation in somatic and germline evolution, suggest mechanisms of mutual modulation of genome stability and composition, and predict future observations in specific cancers.
Proceedings of the National Academy of Sciences, 2015
Nonhomologous end joining (NHEJ) repairs chromosome breaks and must remain effective in the face ... more Nonhomologous end joining (NHEJ) repairs chromosome breaks and must remain effective in the face of extensive diversity in broken end structures. We show here that this flexibility is often reliant on the ability to direct DNA synthesis across strand breaks, and that polymerase (Pol) μ and Pol λ are the only mammalian DNA polymerases that have this activity. By systematically varying substrate in cells, we show each polymerase is uniquely proficient in different contexts. The templating nucleotide is also selected differently, with Pol μ using the unpaired base adjacent to the downstream 5' phosphate even when there are available template sites further upstream of this position; this makes Pol μ more flexible but also less accurate than Pol λ. Loss of either polymerase alone consequently has clear and distinguishable effects on the fidelity of repair, but end remodeling by cellular nucleases and the remaining polymerase helps mitigate the effects on overall repair efficiency. Accordingly, when cells are deficient in both polymerases there is synergistic impact on NHEJ efficiency, both in terms of repair of defined substrates and cellular resistance to ionizing radiation. Pol μ and Pol λ thus provide distinct solutions to a problem for DNA synthesis that is unique to this pathway and play a key role in conferring on NHEJ the flexibility required for accurate and efficient repair.
Elucidation of mutagenic processes shaping cancer genomes is a fundamental problem whose solution... more Elucidation of mutagenic processes shaping cancer genomes is a fundamental problem whose solution promises insights into new treatment, diagnostic and prevention strategies. Single-strand DNA-specific APOBEC cytidine deaminase(s) are major source(s) of mutation in several cancer types. Previous indirect evidence implicated APOBEC3B as the more likely major mutator deaminase, whereas the role of APOBEC3A is not established. Using yeast models enabling the controlled generation of long single-strand genomic DNA substrates, we show that the mutation signatures of APOBEC3A and APOBEC3B are statistically distinguishable. We then apply three complementary approaches to identify cancer samples with mutation signatures resembling either APOBEC. Strikingly, APOBEC3A-like samples have over tenfold more APOBEC-signature mutations than APOBEC3B-like samples. We propose that APOBEC3A-mediated mutagenesis is much more frequent because APOBEC3A itself is highly proficient at generating DNA breaks,...
A perfluorocarbon nanodroplet formulation is shown to be an effective cavitation enhancement agen... more A perfluorocarbon nanodroplet formulation is shown to be an effective cavitation enhancement agent, enabling rapid and consistent fragmentation of genomic DNA in a standard ultrasonic water bath. This nanodroplet-enhanced method produces genomic DNA libraries and next-generation sequencing results indistinguishable from DNA samples fragmented in dedicated commercial acoustic sonication equipment, and with higher throughput. This technique thus enables widespread access to fast bench-top genomic DNA fragmentation.
The bioethanol production system used in Brazil is based on the alcoholic fermentation of sucrose... more The bioethanol production system used in Brazil is based on the alcoholic fermentation of sucrose derived from sugarcane feedstock by highly adapted strains of the yeast Saccharomyces cerevisiae. PE-2 and CAT-1 are the most productive and widely adopted S. cerevisiae strains used by distilleries in Brazil. Due to no sterile condition at industrial scale, the process carries a variety of bacterial contaminants that are regularly related to yeast-bacteria co-aggregation phenotype, decreasing bioethanol yield. In this study we investigate the molecular physiology of the main S. cerevisiae commercial strain (PE-2) used on Brazilian bioethanol process under two distinct conditions: typical fermentation and flocculated (co-aggregated) fermentation. We collected samples on 13 time-points (6 time-points of typical fermentation and 7 time-points of flocculated fermentation) on sugarcane mills. Transcriptional machinery of PE-2 was assessed by high throughput sequencing-based methods (RNA-seq...
Responsible for the Irish potato famine of 1845–49, the oomycete pathogen Phytophthora infestans ... more Responsible for the Irish potato famine of 1845–49, the oomycete pathogen Phytophthora infestans caused persistent, devastating outbreaks of potato late blight across Europe in the 19th century. Despite continued interest in the history and spread of the pathogen, the genome of the famine-era strain remains entirely unknown. Here we characterize temporal genomic changes in introduced P. infestans. We shotgun sequence five 19th-century European strains from archival herbarium samples—including the oldest known European specimen, collected in 1845 from the first reported source of introduction. We then compare their genomes to those of extant isolates. We report multiple distinct genotypes in historical Europe and a suite of infection-related genes different from modern strains. At virulence-related loci, several now-ubiquitous genotypes were absent from the historical gene pool. At least one of these genotypes encodes a virulent phenotype in modern strains, which helps explain the 20th century’s episodic replacements of European P. infestans lineages.
In fungi, unisexual reproduction, where sexual development is initiated without the presence of t... more In fungi, unisexual reproduction, where sexual development is initiated without the presence of two compatible mating type alleles, has been observed in several species that can also undergo traditional bisexual reproduction, including the important human fungal pathogens Cryptococcus neoformans and Candida albicans. While unisexual reproduction has been well characterized qualitatively, detailed quantifications are still lacking for aspects of this process, such as the frequency of recombination during unisexual reproduction, and how this compares with bisexual reproduction. Here, we analyzed meiotic recombination during α-α unisexual and a-α bisexual reproduction of C. neoformans. We found that meiotic recombination operates in a similar fashion during both modes of sexual reproduction. Specifically, we observed that in α-α unisexual reproduction, the numbers of crossovers along the chromosomes during meiosis, recombination frequencies at specific chromosomal regions, as well as m...
Oxidative DNA damage is likely to be involved in the etiology of cancer and is thought to acceler... more Oxidative DNA damage is likely to be involved in the etiology of cancer and is thought to accelerate tumorigenesis via increased mutation rates. However, the majority of malignant cells acquire a specific type of genomic instability characterized by large-scale genomic rearrangements, referred to as chromosomal instability (CIN). The molecular mechanisms underlying CIN are not entirely understood. We utilized Saccharomyces cerevisiae as a model system to delineate the relationship between genotoxic stress and CIN. It was found that elevated levels of chronic, unrepaired oxidative DNA damage caused chromosomal aberrations at remarkably high frequencies under both selective and nonselective growth conditions. In this system, exceeding the cellular capacity to appropriately manage oxidative DNA damage resulted in a "gain-of-CIN" phenotype and led to profound karyotypic instability. These results illustrate a novel mechanism for genome destabilization that is likely to be rele...
In the yeast Saccharomyces cerevisiae, certain genomic regions have very high levels of meiotic r... more In the yeast Saccharomyces cerevisiae, certain genomic regions have very high levels of meiotic recombination (hot spots). The hot spot activity associated with the HIS4 gene requires the Bas1p transcription factor. To determine whether this relationship between transcription factor binding and hot spot activity is general, we used DNA microarrays to map all genomic Bas1p binding sites and to map the frequency of meiosis-specific double-strand DNA breaks (as an estimate of the recombination activity) of all genes in both wild-type and bas1 strains. We identified sites of Bas1p-DNA interactions upstream of 71 genes, many of which are involved in histidine and purine biosynthesis. Our analysis of recombination activity in wild-type and bas1 strains showed that the recombination activities of some genes with Bas1p binding sites were dependent on Bas1p (as observed for HIS4), whereas the activities of other genes with Bas1p binding sites were unaffected or were repressed by Bas1p. These...
Ongoing Cryptococcus gattii outbreaks in the Western United States and Canada illustrate the impa... more Ongoing Cryptococcus gattii outbreaks in the Western United States and Canada illustrate the impact of environmental reservoirs and both clonal and recombining propagation in driving emergence and expansion of microbial pathogens. C. gattii comprises four distinct molecular types: VGI, VGII, VGIII, and VGIV, with no evidence of nuclear genetic exchange, indicating these represent distinct species. C. gattii VGII isolates are causing the Pacific Northwest outbreak, whereas VGIII isolates frequently infect HIV/AIDS patients in Southern California. VGI, VGII, and VGIII have been isolated from patients and animals in the Western US, suggesting these molecular types occur in the environment. However, only two environmental isolates of C. gattii have ever been reported from California: CBS7750 (VGII) and WM161 (VGIII). The incongruence of frequent clinical presence and uncommon environmental isolation suggests an unknown C. gattii reservoir in California. Here we report frequent isolation...
BackgroundThe bioethanol production system used in Brazil is based on the fermentation of sucrose... more BackgroundThe bioethanol production system used in Brazil is based on the fermentation of sucrose from sugarcane feedstock by highly adapted strains of the yeast Saccharomyces cerevisiae. Bacterial contaminants present in the distillery environment often produce yeast-bacteria cellular co-aggregation particles that resemble yeast-yeast cell adhesion (flocculation). The formation of such particles is undesirable because it slows the fermentation kinetics and reduces the overall bioethanol yield.ResultsIn this study, we investigated the molecular physiology of one of the main S. cerevisiae strains used in Brazilian bioethanol production, PE-2, under two contrasting conditions: typical fermentation, when most yeast cells are in suspension, and co-aggregated fermentation. The transcriptional profile of PE-2 was assessed by RNA-seq during industrial scale fed-batch fermentation. Comparative analysis between the two conditions revealed transcriptional profiles that were differentiated pri...
Recombination between repeated DNA sequences can have drastic consequences on the integrity of th... more Recombination between repeated DNA sequences can have drastic consequences on the integrity of the genome. Repeated sequences are abundant in most eukaryotes, yet the mechanism that prevents recombination between them is currently unknown. Ty elements, the main family of dispersed repeats in Saccharomyces cerevisiae, exhibit low levels of exchange. Other regions in the genome have relatively high rates of meiotic recombination (hotspots). We show that a Ty element adjacent to the HIS4 recombination hotspot substantially reduces its activity, eliminating local DSB formation. We demonstrate that the Ty has a closed (nuclease-insensitive) chromatin configuration that is also imposed on the flanking DNA sequences. The compact chromatin structure is determined by sequences at the N terminus of the Ty. Increased binding of the Rap1 protein to the hotspot restores both open chromatin conformation and DSB formation. The chromatin configuration of Ty elements precludes initiation of recombination, thus preventing potentially lethal exchanges between repeated sequences.
Mutational heterogeneity must be taken into account when reconstructing evolutionary histories, c... more Mutational heterogeneity must be taken into account when reconstructing evolutionary histories, calibrating molecular clocks, and predicting links between genes and disease. Selective pressures and various DNA transactions have been invoked to explain the heterogeneous distribution of genetic variation between species, within populations, and in tissue-specific tumors. To examine relationships between such heterogeneity and variations in leading- and lagging-strand replication fidelity and mismatch repair, we accumulated 40,000 spontaneous mutations in eight diploid yeast strains in the absence of selective pressure. We found that replicase error rates vary by fork direction, coding state, nucleosome proximity, and sequence context. Further, error rates and DNA mismatch repair efficiency both vary by mismatch type, responsible polymerase, replication time, and replication origin proximity. Mutation patterns implicate replication infidelity as one driver of variation in somatic and germline evolution, suggest mechanisms of mutual modulation of genome stability and composition, and predict future observations in specific cancers.
Uploads