34702 items (28917 unread) in 19 feeds
The readers of this journal may well be aware of Professor Crow’s scientific achievements and his role as the editor of Perspectives. In addition, for many thousands of students at the University of Wisconsin over many generations, James F. Crow was one of the most memorable teachers at both the undergraduate and graduate levels. What is less known is his major role in public service where he served as chair of many important committees for the National Academy of Sciences, the National Institutes of Health, the National Institutes of Justice as well as various international programs. In all of these efforts, Professor Crow has left a lasting impact.
The process of evolution at a given site in the genome can be influenced by the action of selection at other sites, especially when these are closely linked to it. Such selection reduces the effective population size experienced by the site in question (the Hill–Robertson effect), reducing the level of variability and the efficacy of selection. In particular, deleterious variants are continually being produced by mutation and then eliminated by selection at sites throughout the genome. The resulting reduction in variability at linked neutral or nearly neutral sites can be predicted from the theory of background selection, which assumes that deleterious mutations have such large effects that their behavior in the population is effectively deterministic. More weakly selected mutations can accumulate by Muller’s ratchet after a shutdown of recombination, as in an evolving Y chromosome. Many functionally significant sites are probably so weakly selected that Hill–Robertson interference undermines the effective strength of selection upon them, when recombination is rare or absent. This leads to large departures from deterministic equilibrium and smaller effects on linked neutral sites than under background selection or Muller’s ratchet. Evidence is discussed that is consistent with the action of these processes in shaping genome-wide patterns of variation and evolution.
Filamentous growth is a nutrient-regulated growth response that occurs in many fungal species. In pathogens, filamentous growth is critical for host–cell attachment, invasion into tissues, and virulence. The budding yeast Saccharomyces cerevisiae undergoes filamentous growth, which provides a genetically tractable system to study the molecular basis of the response. Filamentous growth is regulated by evolutionarily conserved signaling pathways. One of these pathways is a mitogen activated protein kinase (MAPK) pathway. A remarkable feature of the filamentous growth MAPK pathway is that it is composed of factors that also function in other pathways. An intriguing challenge therefore has been to understand how pathways that share components establish and maintain their identity. Other canonical signaling pathways—rat sarcoma/protein kinase A (RAS/PKA), sucrose nonfermentable (SNF), and target of rapamycin (TOR)—also regulate filamentous growth, which raises the question of how signals from multiple pathways become integrated into a coordinated response. Together, these pathways regulate cell differentiation to the filamentous type, which is characterized by changes in cell adhesion, cell polarity, and cell shape. How these changes are accomplished is also discussed. High-throughput genomics approaches have recently uncovered new connections to filamentous growth regulation. These connections suggest that filamentous growth is a more complex and globally regulated behavior than is currently appreciated, which may help to pave the way for future investigations into this eukaryotic cell differentiation behavior.
Studies of the processes leading to the construction of a bud and its separation from the mother cell in Saccharomyces cerevisiae have provided foundational paradigms for the mechanisms of polarity establishment, cytoskeletal organization, and cytokinesis. Here we review our current understanding of how these morphogenetic events occur and how they are controlled by the cell-cycle-regulatory cyclin-CDK system. In addition, defects in morphogenesis provide signals that feed back on the cyclin-CDK system, and we review what is known regarding regulation of cell-cycle progression in response to such defects, primarily acting through the kinase Swe1p. The bidirectional communication between morphogenesis and the cell cycle is crucial for successful proliferation, and its study has illuminated many elegant and often unexpected regulatory mechanisms. Despite considerable progress, however, many of the most puzzling mysteries in this field remain to be resolved.
Salmonella enterica serovar Typhimurium uses the Salmonella pathogenicity island 1 (SPI1) type III secretion system to induce inflammatory diarrhea and bacterial uptake into intestinal epithelial cells. The expression of hilA, encoding the transcriptional activator of the SPI1 structural genes, is directly controlled by three AraC-like regulators, HilD, HilC, and RtsA, each of which can activate the hilD, hilC, rtsA, and hilA genes, forming a complex feed-forward regulatory loop. A large number of factors and environmental signals have been implicated in SPI1 regulation. We have developed a series of genetic tests that allows us to determine where these factors feed into the SPI1 regulatory circuit. Using this approach, we have grouped 21 of the known SPI1 regulators and environmental signals into distinct classes on the basis of observed regulatory patterns, anchored by those few systems where the mechanism of regulation is best understood. Many of these factors are shown to work post-transcriptionally at the level of HilD, while others act at the hilA promoter or affect all SPI1 promoters. Analysis of the published transcriptomic data reveals apparent coregulation of the SPI1 and flagellar genes in various conditions. However, we show that in most cases, the factors that affect both systems control SPI1 independently of the flagellar protein FliZ, despite its role as an important SPI1 regulator and coordinator of the two systems. These results provide a comprehensive model for SPI1 regulation that serves as a framework for future molecular analyses of this complex regulatory network.
In eukaryotes, the post-translational addition of methyl groups to histone H3 lysine 4 (H3K4) plays key roles in maintenance and establishment of appropriate gene expression patterns and chromatin states. We report here that an essential locus within chromosome 3L centric heterochromatin encodes the previously uncharacterized Drosophila melanogaster ortholog (dSet1, CG40351) of the Set1 H3K4 histone methyltransferase (HMT). Our results suggest that dSet1 acts as a "global" or general H3K4 di- and trimethyl HMT in Drosophila. Levels of H3K4 di- and trimethylation are significantly reduced in dSet1 mutants during late larval and post-larval stages, but not in animals carrying mutations in genes encoding other well-characterized H3K4 HMTs such as trr, trx, and ash1. The latter results suggest that Trr, Trx, and Ash1 may play more specific roles in regulating key cellular targets and pathways and/or act as global H3K4 HMTs earlier in development. In yeast and mammalian cells, the HMT activity of Set1 proteins is mediated through an evolutionarily conserved protein complex known as Complex of Proteins Associated with Set1 (COMPASS). We present biochemical evidence that dSet1 interacts with members of a putative Drosophila COMPASS complex and genetic evidence that these members are functionally required for H3K4 methylation. Taken together, our results suggest that dSet1 is responsible for the bulk of H3K4 di- and trimethylation throughout Drosophila development, thus providing a model system for better understanding the requirements for and functions of these modifications in metazoans.
Adherence to host cells is an important step in the pathogenicity of the opportunistic fungal pathogen Candida glabrata. This adherence is mediated by some members of the large family of cell wall proteins encoded by the EPA (Epithelial Adhesin) genes present in the C. glabrata genome. The majority of the EPA genes are localized close to different telomeres in C. glabrata, resulting in a negative regulation of transcription of these genes through chromatin-based subtelomeric silencing. In vitro, adherence to epithelial cells is mainly mediated by Epa1, the only member of the EPA family that is expressed in vitro. EPA1 forms a cluster with EPA2 and EPA3 at the subtelomeric region of telomere E-R. EPA2 and EPA3 are subject to silencing that propagates from this telomere in a process that depends on the Sir2, -3, -4, and Rif1 proteins, but surprisingly not on the yKu70 and yKu80 proteins. Here we describe that the yKu70/yKu80-independent silencing of telomere E-R is due to the presence of a cis-acting protosilencer (Sil2126) located between EPA3 and the telomere. This element can silence a reporter gene when placed 31.9 kb away from this telomere, but not when it is removed from the telomere context, or when it is placed near other telomeres, or inverted with respect to the reporter. Importantly, we show that the cis-acting Sil2126 element is required for the yKu70/80-independent silencing of this telomere, underscoring the importance of cis-elements for repressive chromatin formation and spreading on some telomeres in C. glabrata.
Genetic nonself recognition systems such as vegetative incompatibility operate in many filamentous fungi to regulate hyphal fusion between genetically dissimilar individuals and to restrict the spread of virulence-attenuating mycoviruses that have potential for biological control of pathogenic fungi. We report here the use of a comparative genomics approach to identify seven candidate polymorphic genes associated with four vegetative incompatibility (vic) loci of the chestnut blight fungus Cryphonectria parasitica. Disruption of candidate alleles in one of two strains that were heteroallelic at vic2, vic6, or vic7 resulted in enhanced virus transmission, but did not prevent barrage formation associated with mycelial incompatibility. Detailed characterization of the vic6 locus revealed the involvement of nonallelic interactions between two tightly linked genes in barrage formation, heterokaryon formation, and asymmetric, gene-specific influences on virus transmission. The combined results establish molecular identities of genes associated with four C. parasitica vic loci and provide insights into how these recognition factors interact to trigger incompatibility and restrict virus transmission.
Sperm from different species have evolved distinctive motility structures, including tubulin-based flagella in mammals and major sperm protein (MSP)-based pseudopods in nematodes. Despite such divergence, we show that sperm-specific PP1 phosphatases, which are required for male fertility in mouse, function in multiple processes in the development and motility of Caenorhabditis elegans amoeboid sperm. We used live-imaging analysis to show the PP1 phosphatases GSP-3 and GSP-4 (GSP-3/4) are required to partition chromosomes during sperm meiosis. Postmeiosis, tracking fluorescently labeled sperm revealed that both male and hermaphrodite sperm lacking GSP-3/4 are immotile. Genetic and in vitro activation assays show lack of GSP-3/4 causes defects in pseudopod development and the rate of pseudopodial treadmilling. Further, GSP-3/4 are required for the localization dynamics of MSP. GSP-3/4 shift localization in concert with MSP from fibrous bodies that sequester MSP at the base of the pseudopod, where directed MSP disassembly facilitates pseudopod contraction. Consistent with a role for GSP-3/4 as a spatial regulator of MSP disassembly, MSP is mislocalized in sperm lacking GSP-3/4. Although a requirement for PP1 phosphatases in nematode and mammalian sperm suggests evolutionary conservation, we show PP1s have independently evolved sperm-specific paralogs in separate lineages. Thus PP1 phosphatases are highly adaptable and employed across a broad range of sexually reproducing species to regulate male fertility.
High-throughput genotyping and sequencing technologies can generate dense sets of genetic markers for large numbers of individuals. For most species, these data will contain many markers in linkage disequilibrium (LD). To utilize such data for population structure inference, we investigate the use of haplotypes constructed by combining the alleles at single-nucleotide polymorphisms (SNPs). We introduce a statistic derived from information theory, the gain of informativeness for assignment (GIA), which quantifies the additional information for assigning individuals to populations using haplotype data compared to using individual loci separately. Using a two-loci–two-allele model, we demonstrate that combining markers in linkage equilibrium into haplotypes always leads to nonpositive GIA, suggesting that combining the two markers is not advantageous for ancestry inference. However, for loci in LD, GIA is often positive, suggesting that assignment can be improved by combining markers into haplotypes. Using GIA as a criterion for combining markers into haplotypes, we demonstrate for simulated data a significant improvement of assigning individuals to candidate populations. For the many cases that we investigate, incorrect assignment was reduced between 26% and 97% using haplotype data. For empirical data from French and German individuals, the incorrectly assigned individuals can, for example, be decreased by 73% using haplotypes. Our results can be useful for challenging population structure and assignment problems, in particular for studies where large-scale population–genomic data are available.
Measuring fitness with precision is a key issue in evolutionary biology, particularly in studying mutations of small effects. It is usually thought that sampling error and drift prevent precise measurement of very small fitness effects. We circumvented these limits by using a new combined approach to measuring and analyzing fitness. We estimated the mutational fitness effect (MFE) of three independent mini-Tn10 transposon insertion mutations by conducting competition experiments in large populations of Escherichia coli under controlled laboratory conditions. Using flow cytometry to assess genotype frequencies from very large samples alleviated the problem of sampling error, while the effect of drift was controlled by using large populations and massive replication of fitness measures. Furthermore, with a set of four competition experiments between ancestral and mutant genotypes, we were able to decompose fitness measures into four estimated parameters that account for fitness effects of our fluorescent marker (α), the mutation (β), epistasis between the mutation and the marker (), and departure from transitivity (). Our method allowed us to estimate mean selection coefficients to a precision of 2 x 10–4. We also found small, but significant, epistatic interactions between the allelic effects of mutations and markers and confirmed that fitness effects were transitive in most cases. Unexpectedly, we also detected variation in measures of s that were significantly bigger than expected due to drift alone, indicating the existence of cryptic variation, even in fully controlled experiments. Overall our results indicate that selection coefficients are best understood as being distributed, representing a limit on the precision with which selection can be measured, even under controlled laboratory conditions.
Estimates of the coalescent effective population size Ne can be poorly correlated with the true population size. The relationship between Ne and the population size is sensitive to the way in which birth and death rates vary over time. The problem of inference is exacerbated when the mechanisms underlying population dynamics are complex and depend on many parameters. In instances where nonparametric estimators of Ne such as the skyline struggle to reproduce the correct demographic history, model-based estimators that can draw on prior information about population size and growth rates may be more efficient. A coalescent model is developed for a large class of populations such that the demographic history is described by a deterministic nonlinear dynamical system of arbitrary dimension. This class of demographic model differs from those typically used in population genetics. Birth and death rates are not fixed, and no assumptions are made regarding the fraction of the population sampled. Furthermore, the population may be structured in such a way that gene copies reproduce both within and across demes. For this large class of models, it is shown how to derive the rate of coalescence, as well as the likelihood of a gene genealogy with heterochronous sampling and labeled taxa, and how to simulate a coalescent tree conditional on a complex demographic history. This theoretical framework encapsulates many of the models used by ecologists and epidemiologists and should facilitate the integration of population genetics with the study of mathematical population dynamics.
Spatially varying selection on a given polymorphism is expected to produce a localized peak in the between-population component of nucleotide diversity, and theory suggests that the chromosomal extent of elevated differentiation may be enhanced in cases where tandemly linked genes contribute to fitness variation. An intriguing example is provided by the tandemly duplicated β-globin genes of deer mice (Peromyscus maniculatus), which contribute to adaptive differentiation in blood–oxygen affinity between high- and low-altitude populations. Remarkably, the two β-globin genes segregate the same pair of functionally distinct alleles due to a history of interparalog gene conversion and alleles of the same functional type are in perfect coupling-phase linkage disequilibrium (LD). Here we report a multilocus analysis of nucleotide polymorphism and LD in highland and lowland mice with different genetic backgrounds at the β-globin genes. The analysis of haplotype structure revealed a paradoxical pattern whereby perfect LD between the two β-globin paralogs (which are separated by 16.2 kb) is maintained in spite of the fact that LD within both paralogs decays to background levels over physical distances of less than 1 kb. The survey of nucleotide polymorphism revealed that elevated levels of altitudinal differentiation at each of the β-globin genes drop away quite rapidly in the external flanking regions (upstream of the 5' paralog and downstream of the 3' paralog), but the level of differentiation remains unexpectedly high across the intergenic region. Observed patterns of diversity and haplotype structure are difficult to reconcile with expectations of a two-locus selection model with multiplicative fitness.
To model deviations from selectively neutral genetic variation caused by different forms of selection, it is necessary to first understand patterns of neutral variation. Best understood is neutral genetic variation at a single locus. But, as is well known, additional insights can be gained by investigating multiple loci. The resulting patterns reflect the degree of association (linkage) between loci and provide information about the underlying multilocus gene genealogies. The statistical properties of two-locus gene genealogies have been intensively studied for populations of constant size, as well as for simple demographic histories such as exponential population growth and single bottlenecks. By contrast, the combined effect of recombination and sustained demographic fluctuations is poorly understood. Addressing this issue, we study a two-locus Wright–Fisher model of a population subject to recurrent bottlenecks. We derive coalescent approximations for the covariance of the times to the most recent common ancestor at two loci in samples of two chromosomes. This covariance reflects the degree of association and thus linkage disequilibrium between these loci. We find, first, that an effective population-size approximation describes the numerically observed association between two loci provided that recombination occurs either much faster or much more slowly than the population-size fluctuations. Second, when recombination occurs frequently between but rarely within bottlenecks, we observe that the association of gene histories becomes independent of physical distance over a certain range of distances. Third, we show that in this case, a commonly used measure of linkage disequilibrium, $${\sigma }_{\hbox{ d }}^{2}$$ (closely related to $${\widehat{r}}^{2}$$), fails to capture the long-range association between two loci. The reason is that constituent terms, each reflecting the long-range association, cancel. Fourth, we analyze a limiting case in which the long-range association can be described in terms of a Xi coalescent allowing for simultaneous multiple mergers of ancestral lines.
Bayesian hierarchical shrinkage methods have been widely used for quantitative trait locus mapping. From the computational perspective, the application of the Markov chain Monte Carlo (MCMC) method is not optimal for high-dimensional problems such as the ones arising in epistatic analysis. Maximum a posteriori (MAP) estimation can be a faster alternative, but it usually produces only point estimates without providing any measures of uncertainty (i.e., interval estimates). The variational Bayes method, stemming from the mean field theory in theoretical physics, is regarded as a compromise between MAP and MCMC estimation, which can be efficiently computed and produces the uncertainty measures of the estimates. Furthermore, variational Bayes methods can be regarded as the extension of traditional expectation-maximization (EM) algorithms and can be applied to a broader class of Bayesian models. Thus, the use of variational Bayes algorithms based on three hierarchical shrinkage models including Bayesian adaptive shrinkage, Bayesian LASSO, and extended Bayesian LASSO is proposed here. These methods performed generally well and were found to be highly competitive with their MCMC counterparts in our example analyses. The use of posterior credible intervals and permutation tests are considered for decision making between quantitative trait loci (QTL) and non-QTL. The performance of the presented models is also compared with R/qtlbim and R/BhGLM packages, using a previously studied simulated public epistatic data set.
Transcription factors and their binding sites have been proposed as primary targets of evolutionary adaptation because changes to single transcription factors can lead to far-reaching changes in gene expression patterns. Nevertheless, there is very little concrete evidence for such evolutionary changes. Industrial wine yeast strains, of the species Saccharomyces cerevisiae, are a geno- and phenotypically diverse group of organisms that have adapted to the ecological niches of industrial winemaking environments and have been selected to produce specific styles of wine. Variation in transcriptional regulation among wine yeast strains may be responsible for many of the observed differences and specific adaptations to different fermentative conditions in the context of commercial winemaking. We analyzed gene expression profiles of wine yeast strains to assess the impact of transcription factor expression on metabolic networks. The data provide new insights into the molecular basis of variations in gene expression in industrial strains and their consequent effects on metabolic networks important to wine fermentation. We show that the metabolic phenotype of a strain can be shifted in a relatively predictable manner by changing expression levels of individual transcription factors, opening opportunities to modify transcription networks to achieve desirable outcomes.
The environment can have a decisive influence on the structure of the genome, changing it in a certain direction. Therefore, the genomic distribution of environmentally sensitive transposable elements may vary measurably across a species area. In the present research, we aimed to detect and evaluate the level of LTR retrotransposon intraspecific variability in Aegilops speltoides (2n = 2x = 14), a wild cross-pollinated relative of cultivated wheat. The interretrotransposon amplified polymorphism (IRAP) protocol was applied to detect and evaluate the level of retrotransposon intraspecific variability in Ae. speltoides and closely related species. IRAP analysis revealed significant diversity in TE distribution. Various genotypes from the 13 explored populations significantly differ with respect to the patterns of the four explored LTR retrotransposons (WIS2, Wilma, Daniela, and Fatima). This diversity points to a constant ongoing process of LTR retrotransposon fraction restructuring in populations of Ae. speltoides throughout the species’ range and within single populations in time. Maximum changes were recorded in genotypes from small stressed populations. Principal component analysis showed that the dynamics of the Fatima element significantly differ from those of WIS2, Wilma, and Daniela. In terms of relationships between Sitopsis species, IRAP analysis revealed a grouping with Ae. sharonensis and Ae. longissima forming a separate unit, Ae. speltoides appearing as a dispersed group, and Ae. bicornis being in an intermediate position. IRAP display data revealed dynamic changes in LTR retrotransposon fractions in the genome of Ae. speltoides. The process is permanent and population specific, ultimately leading to the separation of small stressed populations from the main group.
A strategy to reduce computational demands of genome-wide association studies fitting a mixed model is presented. Improvements are achieved by utilizing a large proportion of calculations that remain constant across the multiple analyses for individual markers involved, with estimates obtained without inverting large matrices.
Drosophila polyhomeotic (ph) is one of the important polycomb group genes that is linked to human cancer. In the mosaic eye imaginal discs, while phdel, a null allele, causes only non-autonomous overgrowth, ph505, a hypomorphic allele, causes both autonomous and non-autonomous overgrowth. These allele-specific phenotypes stem from the different sensitivities of ph mutant cells to the Upd homologs that they secrete.
Studies of the processes leading to the construction of a bud and its separation from the mother cell in Saccharomyces cerevisiae have provided foundational paradigms for the mechanisms of polarity establishment, cytoskeletal organization, and cytokinesis. Here we review our current understanding of how these morphogenetic events occur and how they are controlled by the cell-cycle-regulatory cyclin-CDK system. In addition, defects in morphogenesis provide signals that feed back on the cyclin-CDK system, and we review what is known regarding regulation of cell-cycle progression in response to such defects, primarily acting through the kinase Swe1p. The bidirectional communication between morphogenesis and the cell cycle is crucial for successful proliferation, and its study has illuminated many elegant and often unexpected regulatory mechanisms. Despite considerable progress, however, many of the most puzzling mysteries in this field remain to be resolved.
To honor James F. Crow on the occasion of his 95th birthday, GENETICS has commissioned a series of Perspectives and Reviews. For GENETICS to publish the honorifics is fitting, as from their birth Crow and GENETICS have been paired. Crow was scheduled to be born in January 1916, the same month that the first issue of GENETICS was scheduled to appear, and in the many years that Crow has made major contributions to the conceptual foundations of modern genetics, GENETICS has chronicled his and other major advances in the field. The commissioned Perspectives and Reviews summarize and celebrate Professor Crow’s contributions as a research scientist, administrator, colleague, community supporter, international leader, teacher, and mentor. In science, Professor Crow was the international leader of his generation in the application of genetics to populations of organisms and in uncovering the role of genetics in health and disease. In education, he was a superb undergraduate teacher whose inspiration changed the career paths of many students. His teaching skills are legendary, his lectures urbane and witty, rigorous and clear. He was also an extraordinary mentor to numerous graduate students and postdoctoral fellows, many of whom went on to establish successful careers of their own. In public service, Professor Crow served in key administrative positions at the University of Wisconsin, participated as a member of numerous national and international committees, and served as president of both the Genetics Society of America and the American Society for Human Genetics. This Perspective examines Professor Crow as teacher and mentor through the eyes and experiences of one student who was enrolled in his genetics course as an undergraduate and who later studied with him as a graduate student.
Increasing the college graduation rates of underrepresented minority students in science disciplines is essential to attain a diverse workforce for the 21st century. The Research Internship and Science Education (RISE) program attempts to motivate and prepare students from the Atlanta Public School system, where underrepresented minority (URM) students comprise a majority of the population, for biomedical science careers by offering the opportunity to participate in an original research project. Students work in a research laboratory from the summer of their sophomore year until graduation, mentored by undergraduate and graduate students and postdoctoral fellows (postdocs). In addition, they receive instruction in college-level biology, scholastic assessment test (SAT) preparation classes, and help with the college application process. During the last 4 yr, RISE students have succeeded in the identification and characterization of a series of proteins involved in the regulation of nuclear organization and transcription. All but 1 of 39 RISE students have continued on to 4-year college undergraduate studies and 61% of those students are currently enrolled in science-related majors. These results suggest that the use of research-based experiences at the high school level may contribute to the increased recruitment of underrepresented students into science-related careers.
The yeast cell wall is a strong, but elastic, structure that is essential not only for the maintenance of cell shape and integrity, but also for progression through the cell cycle. During growth and morphogenesis, and in response to environmental challenges, the cell wall is remodeled in a highly regulated and polarized manner, a process that is principally under the control of the cell wall integrity (CWI) signaling pathway. This pathway transmits wall stress signals from the cell surface to the Rho1 GTPase, which mobilizes a physiologic response through a variety of effectors. Activation of CWI signaling regulates the production of various carbohydrate polymers of the cell wall, as well as their polarized delivery to the site of cell wall remodeling. This review article centers on CWI signaling in Saccharomyces cerevisiae through the cell cycle and in response to cell wall stress. The interface of this signaling pathway with other pathways that contribute to the maintenance of cell wall integrity is also discussed.
TOR (Target Of Rapamycin) is a highly conserved protein kinase that is important in both fundamental and clinical biology. In fundamental biology, TOR is a nutrient-sensitive, central controller of cell growth and aging. In clinical biology, TOR is implicated in many diseases and is the target of the drug rapamycin used in three different therapeutic areas. The yeast Saccharomyces cerevisiae has played a prominent role in both the discovery of TOR and the elucidation of its function. Here we review the TOR signaling network in S. cerevisiae.
Finding the genes underlying complex traits is difficult. We show that new sequencing technology combined with traditional genetic techniques can efficiently identify genetic regions underlying a complex and quantitative behavioral trait. As a proof of concept we used phenotype-based introgression to backcross loci that control innate food preference in Drosophila simulans into the genomic background of D. sechellia, which expresses the opposite preference. We successfully mapped D. simulans introgression regions in a small mapping population (30 flies) with whole-genome resequencing using light coverage (~1x). We found six loci contributing to D. simulans food preference, one of which overlaps a previously discovered allele. This approach is applicable to many systems, does not rely on laborious marker development or genotyping, does not require existing high quality reference genomes, and needs only small mapping populations. Because introgression is used, researchers can scale mapping population size, replication, and number of backcross generations to their needs. Finally, in contrast to more widely used mapping techniques like F2 bulk-segregant analysis, our method produces near-isogenic lines that can be kept and reused indefinitely.
We have used a novel method to activate the DNA damage S-phase checkpoint response in Saccharomyces cerevisiae to slow lagging-strand DNA replication by exposing cells expressing a drug-sensitive DNA polymerase (L612M-DNA pol ) to the inhibitory drug phosphonoacetic acid (PAA). PAA-treated pol3-L612M cells arrest as large-budded cells with a single nucleus in the bud neck. This arrest requires all of the components of the S-phase DNA damage checkpoint: Mec1, Rad9, the DNA damage clamp Ddc1-Rad17-Mec3, and the Rad24-dependent clamp loader, but does not depend on Mrc1, which acts as the signaling adapter for the replication checkpoint. In addition to the above components, a fully functional mismatch repair system, including Exo1, is required to activate the S-phase damage checkpoint and for cells to survive drug exposure. We propose that mismatch repair activity produces persisting single-stranded DNA gaps in PAA-treated pol3-L612M cells that are required to increase DNA damage above the threshold needed for checkpoint activation. Our studies have important implications for understanding how cells avoid inappropriate checkpoint activation because of normal discontinuities in lagging-strand replication and identify a role for mismatch repair in checkpoint activation that is needed to maintain genome integrity.
One of the least understood aspects of homologous recombination is the process by which the ends of a double-strand break (DSB) search the entire genome for homologous templates that can be used to repair the break. We took advantage of the natural competition between the alternative donors HML and HMR employed during HO endonuclease-induced switching of the budding yeast MAT locus. The strong mating-type-dependent bias in the choice of the donors is enforced by the recombination enhancer (RE), which lies 17 kb proximal to HML. We investigated factors that improve the use of the disfavored donor. We show that the normal heterochromatic state of the donors does not impair donor usage, as donor choice is not affected by removing this epigenetic silencing. In contrast, increasing the length of homology shared by the disfavored donor increases its use. This result shows that donor choice is not irrevocable and implies that there are several encounters between the DSB ends and even the favored donor before recombination is accomplished. The increase by adding more homology is not linear; these results can be explained by a thermodynamic model that determines the energy cost of using one donor over the other. An important inference from this analysis is that when HML is favored as the donor, RE causes a reduction in its effective genomic distance from MAT from 200 kb to ~20 kb, which we hypothesize occurs after the DSB is created, by epigenetic chromatin modifications around MAT.
RNA polymerase (pol) II establishes many protein–protein interactions with transcriptional regulators to coordinate different steps of transcription. Although some of these interactions have been well described, little is known about the existence of RNA pol II regions involved in contact with transcriptional regulators. We hypothesize that conserved regions on the surface of RNA pol II contact transcriptional regulators. We identified such an RNA pol II conserved region that includes the majority of the "foot" domain and identified interactions of this region with Mvp1, a protein required for sorting proteins to the vacuole, and Spo14, a phospholipase D. Deletion of MVP1 and SPO14 affects the transcription of their target genes and increases phosphorylation of Ser5 in the carboxy-terminal domain (CTD). Genetic, phenotypic, and functional analyses point to a role for these proteins in transcriptional initiation and/or early elongation, consistent with their genetic interactions with CEG1, a guanylyltransferase subunit of the Saccharomyces cerevisiae capping enzyme.
Mutations at the APM1 and APM2 loci in the green alga Chlamydomonas reinhardtii confer resistance to phosphorothioamidate and dinitroaniline herbicides. Genetic interactions between apm1 and apm2 mutations suggest an interaction between the gene products. We identified the APM1 and APM2 genes using a map-based cloning strategy. Genomic DNA fragments containing only the DNJ1 gene encoding a type I Hsp40 protein rescue apm1 mutant phenotypes, conferring sensitivity to the herbicides and rescuing a temperature-sensitive growth defect. Lesions at five apm1 alleles include missense mutations and nucleotide insertions and deletions that result in altered proteins or very low levels of gene expression. The HSP70A gene, encoding a cytosolic Hsp70 protein known to interact with Hsp40 proteins, maps near the APM2 locus. Missense mutations found in three apm2 alleles predict altered Hsp70 proteins. Genomic fragments containing the HSP70A gene rescue apm2 mutant phenotypes. The results suggest that a client of the Hsp70–Hsp40 chaperone complex may function to increase microtubule dynamics in Chlamydomonas cells. Failure of the chaperone system to recognize or fold the client protein(s) results in increased microtubule stability and resistance to the microtubule-destabilizing effect of the herbicides. The lack of redundancy of genes encoding cytosolic Hsp70 and Hsp40 type I proteins in Chlamydomonas makes it a uniquely valuable system for genetic analysis of the function of the Hsp70 chaperone complex.
In aerobic organisms, protection from oxidative damage involves the combined action of enzymatic and nonproteinaceous cellular factors that collectively remove harmful reactive oxygen species. One class of nonproteinaceous antioxidants includes small molecule complexes of manganese (Mn) that can scavenge superoxide anion radicals and provide a backup for superoxide dismutase enzymes. Such Mn antioxidants have been identified in diverse organisms; however, nothing regarding their physiology in the context of cellular adaptation to stress was known. Using a molecular genetic approach in Bakers’ yeast, Saccharomyces cerevisiae, we report that the Mn antioxidants can fall under control of the same pathways used for nutrient sensing and stress responses. Specifically, a serine/threonine PAS-kinase, Rim15p, that is known to integrate phosphate, nitrogen, and carbon sensing, can also control Mn antioxidant activity in yeast. Rim15p is negatively regulated by the phosphate-sensing kinase complex Pho80p/Pho85p and by the nitrogen-sensing Akt/S6 kinase homolog, Sch9p. We observed that loss of either of these upstream kinase sensors dramatically inhibited the potency of Mn as an antioxidant. Downstream of Rim15p are transcription factors Gis1p and the redundant Msn2/Msn4p pair that typically respond to nutrient and stress signals. Both transcription factors were found to modulate the potency of the Mn antioxidant but in opposing fashions: loss of Gis1p was seen to enhance Mn antioxidant activity whereas loss of Msn2/4p greatly suppressed it. Our observed roles for nutrient and stress response kinases and transcription factors in regulating the Mn antioxidant underscore its physiological importance in aerobic fitness.
Maize (Zea mays) has a large class of seed mutants with opaque or nonvitreous endosperms that could improve the nutritional quality of our food supply. The phenotype of some of them appears to be linked to the improper formation of protein bodies (PBs) where zein storage proteins are deposited. Although a number of genes affecting endosperm vitreousness have been isolated, it has been difficult to clone opaque7 (o7), mainly because of its low penetrance in many genetic backgrounds. The o7-reference (o7-ref) mutant arose spontaneously in a W22 inbred, but is poorly expressed in other lines. We report here the isolation of o7 with a combination of map-based cloning and transposon tagging. We first identified an o7 candidate gene by map-based cloning. The putative o7-ref allele has a 12-bp in-frame deletion of codons 350–353 in a 528-codon-long acyl-CoA synthetase-like gene (ACS). We then confirmed this candidate gene by generating another mutant allele from a transposon-tagging experiment using the Activator/Dissociation (Ac/Ds) system in a W22 background. The second allele, isolated from ~1 million gametes, presented a 2-kb Ds insertion that resembles the single Ds component of double-Ds, McClintock’s original Dissociation element, at codon 496 of the ACS gene. PBs exhibited striking membrane invaginations in the o7-ref allele and a severe number reduction in the Ds-insertion mutant, respectively. We propose a model in which the ACS enzyme plays a key role in membrane biogenesis, by taking part in protein acylation, and that altered PBs render the seed nonvitreous.
In maize, a series of seed mutants with starchy endosperm could increase the lysine content by decreased amount of zeins, the main storage proteins in endosperm. Cloning and characterization of these mutants could reveal regulatory mechanisms for zeins accumulation in maize endosperm. Opaque7 (o7) is a classic maize starchy endosperm mutant with large effects on zeins accumulation and high lysine content. In this study, the O7 gene was cloned by map-based cloning and confirmed by transgenic functional complementation and RNAi. The o7-ref allele has a 12-bp in-frame deletion. The four-amino-acid deletion caused low accumulation of o7 protein in vivo. The O7 gene encodes an acyl-activating enzyme with high similarity to AAE3. The opaque phenotype of the o7 mutant was produced by the reduction of protein body size and number caused by a decrease in the α-zeins concentrations. Analysis of amino acids and metabolites suggested that the O7 gene might affect amino acid biosynthesis by affecting α-ketoglutaric acid and oxaloacetic acid. Transgenic rice seeds containing RNAi constructs targeting the rice ortholog of maize O7 also produced lower amounts of seed proteins and displayed an opaque endosperm phenotype, indicating a conserved biological function of O7 in cereal crops. The cloning of O7 revealed a novel regulatory mechanism for storage protein synthesis and highlighted an effective target for the genetic manipulation of storage protein contents in cereal seeds.
The PHR (Pam/Highwire/RPM-1) proteins are evolutionarily conserved ubiquitin ligases that regulate axon guidance and synapse formation in Caenorhabditis elegans, Drosophila, zebrafish, and mice. In C. elegans, RPM-1 (Regulator of Presynaptic Morphology-1) functions in synapse formation, axon guidance, axon termination, and postsynaptic GLR-1 trafficking. Acting as an E3 ubiquitin ligase, RPM-1 negatively regulates a MAP kinase pathway that includes: dlk-1, mkk-4, and the p38 MAPK, pmk-3. Here we provide evidence that ppm-1, a serine/threonine phosphatase homologous to human PP2Cα(PPM1A) and PP2Cβ(PPM1B) acts as a second negative regulatory mechanism to control the dlk-1 pathway. We show that ppm-1 functions through its phosphatase activity in a parallel genetic pathway with glo-4 and fsn-1 to regulate both synapse formation in the GABAergic motorneurons and axon termination in the mechanosensory neurons. Our transgenic analysis shows that ppm-1 acts downstream of rpm-1 to negatively regulate the DLK-1 pathway, with PPM-1 most likely acting at the level of pmk-3. Our study provides insight into the negative regulatory mechanisms that control the dlk-1 pathway in neurons and demonstrates a new role for the PP2C/PPM phosphatases as regulators of neuronal development.
To maintain tissue homeostasis, some organs are able to replace dying cells with additional proliferation of surviving cells. Such proliferation can be localized (e.g., a regeneration blastema) or diffuse (compensatory growth). The relationship between such growth and the growth that occurs during development has not been characterized in detail. Drosophila melanogaster larval imaginal discs can recover from extensive damage, producing normally sized adult organs. Here we describe a system using genetic mosaics to screen for recessive mutations that impair compensatory growth. By generating clones of cells that carry a temperature-sensitive cell-lethal mutation, we conditionally ablate patches of tissue in the imaginal disc and assess the ability of the surviving sister clones to replace the lost tissue. We have used this system together with a modified whole-genome resequencing (WGS) strategy to identify several mutations that selectively compromise compensatory growth. We find specific alleles of bunched (bun) and Ribonucleoside diphosphate reductase large subunit (RnrL) reduce compensatory growth in the imaginal disc. Other genes identified in the screen, including two alleles of Topoisomerase 3-alpha (Top3α), while also required for developmental growth, appear to have an enhanced requirement during compensatory growth. Compensatory growth occurs at a higher rate than normal growth and may therefore have features in common with some types of overgrowth. Indeed, the RnrL allele identified compromises both these types of altered growth and mammalian ribonucleotide reductase and topoisomerases are targets of anticancer drugs. Finally, the approach we describe is applicable to the study of compensatory growth in diverse tissues in Drosophila.
Many animals sense environmental gases such as carbon dioxide and oxygen using specialized populations of gas-sensing neurons. The proper development and function of these neurons is critical for survival, as the inability to respond to changes in ambient carbon dioxide and oxygen levels can result in reduced neural activity and ultimately death. Despite the importance of gas-sensing neurons for survival, little is known about the developmental programs that underlie their formation. Here we identify the ETS-family transcription factor ETS-5 as critical for the normal differentiation of the carbon dioxide-sensing BAG neurons in Caenorhabditis elegans. Whereas wild-type animals show acute behavioral avoidance of carbon dioxide, ets-5 mutant animals do not respond to carbon dioxide. The ets-5 gene is expressed in BAG neurons and is required for the normal expression of the BAG neuron gene battery. ets-5 may also autoregulate its expression in BAG neurons. ets-5 is not required for BAG neuron formation, indicating that it is specifically involved in BAG neuron differentiation and the maintenance of BAG neuron cell fate. Our results demonstrate a novel role for ETS genes in the development and function of gas-detecting sensory neurons.
Mating behavior of animals is regulated by the sensory stimuli provided by the other sex. Sexually receptive females emit mating signals that can be inhibited by male ejaculate. The genetic mechanisms controlling the release of mating signals and encoding behavioral responses remain enigmatic. Here we present evidence of a Caenorhabditis elegans hermaphrodite-derived cue that stimulates male mating-response behavior and is dynamically regulated by her reproductive status. Wild-type males preferentially mated with older hermaphrodites. Increased sex appeal of older hermaphrodites was potent enough to stimulate robust response from mating-deficient pkd-2 and lov-1 polycystin mutant males. This enhanced response of pkd-2 males toward older hermaphrodites was independent of short-chain ascaroside pheromones, but was contingent on the absence of active sperm in the hermaphrodites. The improved pkd-2 male response toward spermless hermaphrodites was blocked by prior insemination or by genetic ablation of the ceh-18-dependent sperm-sensing pathway of the hermaphrodite somatic gonad. Our work suggests an interaction between sperm and the soma that has a negative but reversible effect on a hermaphrodite-derived mating cue that regulates male mating response, a phenomenon to date attributed to gonochoristic species only.
Genetic selection is a major force shaping life on earth. In classical genetic theory, response to selection is the product of the strength of selection and the additive genetic variance in a trait. The additive genetic variance reflects a population’s intrinsic potential to respond to selection. The ordinary additive genetic variance, however, ignores the social organization of life. With social interactions among individuals, individual trait values may depend on genes in others, a phenomenon known as indirect genetic effects. Models accounting for indirect genetic effects, however, lack a general definition of heritable variation. Here I propose a general definition of the heritable variation that determines the potential of a population to respond to selection. This generalizes the concept of heritable variance to any inheritance model and level of organization. The result shows that heritable variance determining potential response to selection is the variance among individuals in the heritable quantity that determines the population mean trait value, rather than the usual additive genetic component of phenotypic variance. It follows, therefore, that heritable variance may exceed phenotypic variance among individuals, which is impossible in classical theory. This work also provides a measure of the utilization of heritable variation for response to selection and integrates two well-known models of maternal genetic effects. The result shows that relatedness between the focal individual and the individuals affecting its fitness is a key determinant of the utilization of heritable variance for response to selection.
In nonrecombining genomes, genetic linkage can be an important evolutionary force. Linkage generates interference interactions, by which simultaneously occurring mutations affect each other’s chance of fixation. Here, we develop a comprehensive model of adaptive evolution in linked genomes, which integrates interference interactions between multiple beneficial and deleterious mutations into a unified framework. By an approximate analytical solution, we predict the fixation rates of these mutations, as well as the probabilities of beneficial and deleterious alleles at fixed genomic sites. We find that interference interactions generate a regime of emergent neutrality: all genomic sites with selection coefficients smaller in magnitude than a characteristic threshold have nearly random fixed alleles, and both beneficial and deleterious mutations at these sites have nearly neutral fixation rates. We show that this dynamic limits not only the speed of adaptation, but also a population’s degree of adaptation in its current environment. We apply the model to different scenarios: stationary adaptation in a time-dependent environment and approach to equilibrium in a fixed environment. In both cases, the analytical predictions are in good agreement with numerical simulations. Our results suggest that interference can severely compromise biological functions in an adapting population, which sets viability limits on adaptive evolution under linkage.
Genome-wide association studies have thus far failed to explain the observed heritability of complex human diseases. This is referred to as the "missing heritability" problem. However, these analyses have usually neglected to consider a role for epigenetic variation, which has been associated with many human diseases. We extend models of epigenetic inheritance to investigate whether environment-sensitive epigenetic modifications of DNA might explain observed patterns of familial aggregation. We find that variation in epigenetic state and environmental state can result in highly heritable phenotypes through a combination of epigenetic and environmental inheritance. These two inheritance processes together can produce familial covariances significantly higher than those predicted by models of purely epigenetic inheritance and similar to those expected from genetic effects. The results suggest that epigenetic variation, inherited both directly and through shared environmental effects, may make a key contribution to the missing heritability.
The effective population size (Ne) is one of the most fundamental parameters in population genetics. It is thought to vary across the genome as a consequence of differences in the rate of recombination and the density of selected sites due to the processes of genetic hitchhiking and background selection. Although it is known that there is intragenomic variation in the effective population size in some species, it is not known whether this is widespread or how much variation in the effective population size there is. Here, we test whether the effective population size varies across the genome, between protein-coding genes, in 10 eukaryotic species by considering whether there is significant variation in neutral diversity, taking into account differences in the mutation rate between loci by using the divergence between species. In most species we find significant evidence of variation. We investigate whether the variation in Ne is correlated to recombination rate and the density of selected sites in four species, for which these data are available. We find that Ne is positively correlated to recombination rate in one species, Drosophila melanogaster, and negatively correlated to a measure of the density of selected sites in two others, humans and Arabidopsis thaliana. However, much of the variation remains unexplained. We use a hierarchical Bayesian analysis to quantify the amount of variation in the effective population size and show that it is quite modest in all species—most genes have an Ne that is within a few fold of all other genes. Nonetheless we show that this modest variation in Ne is sufficient to cause significant differences in the efficiency of natural selection across the genome, by demonstrating that the ratio of the number of nonsynonymous to synonymous polymorphisms is significantly correlated to synonymous diversity and estimates of Ne, even taking into account the obvious nonindependence between these measures.
The rate of microsatellite mutation is dependent upon both the allele length and the repeat motif, but the exact nature of this relationship is still unknown. We analyzed data on the inheritance of human Y-chromosomal microsatellites in father–son duos, taken from 24 published reports and comprising 15,285 directly observable meioses. At the six microsatellites analyzed (DYS19, DYS389I, DYS390, DYS391, DYS392, and DYS393), a total of 162 mutations were observed. For each locus, we employed a maximum-likelihood approach to evaluate one of several single-step mutation models on the basis of the data. For five of the six loci considered, a novel logistic mutation model was found to provide the best fit according to Akaike’s information criterion. This implies that the mutation probability at the loci increases (nonlinearly) with allele length at a rate that differs between upward and downward mutations. For DYS392, the best fit was provided by a linear model in which upward and downward mutation probabilities increase equally with allele length. This is the first study to empirically compare different microsatellite mutation models in a locus-specific fashion.
Admixed populations have been used for inferring migrations, detecting natural selection, and finding disease genes. These applications often use a simple statistical model of admixture rather than a modeling perspective that incorporates a more realistic history of the admixture process. Here, we develop a general model of admixture that mechanistically accounts for complex historical admixture processes. We consider two source populations contributing to the ancestry of a hybrid population, potentially with variable contributions across generations. For a random individual in the hybrid population at a given point in time, we study the fraction of genetic admixture originating from a specific one of the source populations by computing its moments as functions of time and of introgression parameters. We show that very different admixture processes can produce identical mean admixture proportions, but that such processes produce different values for the variance of the admixture proportion. When introgression parameters from each source population are constant over time, the long-term limit of the expectation of the admixture proportion depends only on the ratio of the introgression parameters. The variance of admixture decreases quickly over time after the source populations stop contributing to the hybrid population, but remains substantial when the contributions are ongoing. Our approach will facilitate the understanding of admixture mechanisms, illustrating how the moments of the distribution of admixture proportions can be informative about the historical admixture processes contributing to the genetic diversity of hybrid populations.
The distribution of fitness effects (DFE) of new mutations is of fundamental importance in evolutionary genetics. Recently, methods have been developed for inferring the DFE that use information from the allele frequency distributions of putatively neutral and selected nucleotide polymorphic variants in a population sample. Here, we extend an existing maximum-likelihood method that estimates the DFE under the assumption that mutational effects are unconditionally deleterious, by including a fraction of positively selected mutations. We allow one or more classes of positive selection coefficients in the model and estimate both the fraction of mutations that are advantageous and the strength of selection acting on them. We show by simulations that the method is capable of recovering the parameters of the DFE under a range of conditions. We apply the method to two data sets on multiple protein-coding genes from African populations of Drosophila melanogaster. We use a probabilistic reconstruction of the ancestral states of the polymorphic sites to distinguish between derived and ancestral states at polymorphic nucleotide sites. In both data sets, we see a significant improvement in the fit when a category of positively selected amino acid mutations is included, but no further improvement if additional categories are added. We estimate that between 1% and 2% of new nonsynonymous mutations in D. melanogaster are positively selected, with a scaled selection coefficient representing the product of the effective population size, Ne, and the strength of selection on heterozygous carriers of ~2.5.
Variation in rates of molecular evolution has been attributed to numerous, interrelated causes, including metabolic rate, body size, and generation time. Speculation concerning the influence of metabolic rate on rates of evolution often invokes the putative mutagenic effects of oxidative stress. To isolate the effects of oxidative stress on the germline from the effects of metabolic rate, generation time, and other factors, we allowed mutations to accumulate under relaxed selection for 125 generations in two strains of the nematode Caenorhabditis elegans, the canonical wild-type strain (N2) and a mutant strain with elevated steady-state oxidative stress (mev-1). Contrary to our expectation, the mutational decline in fitness did not differ between N2 and mev-1. This result suggests that the mutagenic effects of oxidative stress in C. elegans are minor relative to the effects of other types of mutations, such as errors during DNA replication. However, mev-1 MA lines did go extinct more frequently than wild-type lines; some possible explanations for the difference in extinction rate are discussed.
High-throughput genomics allows genome-wide quantification of gene expression levels in tissues and cell types and, when combined with sequence variation data, permits the identification of genetic control points of expression (expression QTL or eQTL). Clusters of eQTL influenced by single genetic polymorphisms can inform on hotspots of regulation of pathways and networks, although very few hotspots have been robustly detected, replicated, or experimentally verified. Here we present a novel modeling strategy to estimate the propensity of a genetic marker to influence several expression traits at the same time, based on a hierarchical formulation of related regressions. We implement this hierarchical regression model in a Bayesian framework using a stochastic search algorithm, HESS, that efficiently probes sparse subsets of genetic markers in a high-dimensional data matrix to identify hotspots and to pinpoint the individual genetic effects (eQTL). Simulating complex regulatory scenarios, we demonstrate that our method outperforms current state-of-the-art approaches, in particular when the number of transcripts is large. We also illustrate the applicability of HESS to diverse real-case data sets, in mouse and human genetic settings, and show that it provides new insights into regulatory hotspots that were not detected by conventional methods. The results suggest that the combination of our modeling strategy and algorithmic implementation provides significant advantages for the identification of functional eQTL hotspots, revealing key regulators underlying pathways.
The frequency and character of interactions among genes influencing complex traits remain unknown. Our ignorance is most acute for segregating variation within natural populations, the epistasis most relevant for quantitative trait evolution. Here, we report a comprehensive survey of interactions among a defined set of flower-size QTL: loci polymorphic within a single natural population of yellow monkeyflower (Mimulus guttatus). We find that epistasis is typical. Observed phenotypes routinely differ from those predicted on the basis of direct allelic affects in the isogenic background, although the direction of deviations is highly variable. Across QTL pairs, there are significantly positive and negative interactions for every trait. Across traits, specific locus pairs routinely exhibit both positive and negative interactions. There was a tendency for negative epistasis to accompany positive direct effects and vice versa for the trait of corolla width, which may be due, at least in part, to the fact that QTL were identified from their direct effects on this trait.
Inbreeding depression and genetic load have been widely observed, but their genetic basis and effects on fitness during the life cycle remain poorly understood, especially for marine animals with high fecundity and high, early mortality (type-III survivorship). A high load of recessive mutations was previously inferred for the Pacific oyster Crassostrea gigas, from massive distortions of zygotic, marker segregation ratios in F2 families. However, the number, genomic location, and stage-specific onset of mutations affecting viability have not been thoroughly investigated. Here, we again report massive distortions of microsatellite-marker segregation ratios in two F2 hybrid families, but we now locate the causative deleterious mutations, using a quantitative trait locus (QTL) interval-mapping model, and we characterize their mode of gene action. We find 14–15 viability QTL (vQTL) in the two families. Genotypic frequencies at vQTL generally suggest selection against recessive or partially recessive alleles, supporting the dominance theory of inbreeding depression. No epistasis was detected among vQTL, so unlinked vQTL presumably have independent effects on survival. For the first time, we track segregation ratios of vQTL-linked markers through the life cycle, to determine their stage-specific expression. Almost all vQTL are absent in the earliest life stages examined, confirming zygotic viability selection; vQTL are predominantly expressed before the juvenile stage (90%), mostly at metamorphosis (50%). We estimate that, altogether, selection on vQTL caused 96% mortality in these families, accounting for nearly all of the actual mortality. Thus, genetic load causes substantial mortality in inbred Pacific oysters, particularly during metamorphosis, a critical developmental transition warranting further investigation.
Individuals with full or partial Trisomy 21 (Ts21) present with clinical features collectively referred to as Down syndrome (DS), although DS phenotypes vary in incidence and severity between individuals. Differing genetic and phenotypic content in individuals with DS as well as mouse models of DS facilitate the understanding of the correlation between specific genes and phenotypes associated with Ts21. The Ts1Rhr mouse model is trisomic for 33 genes (the "Down syndrome critical region" or DSCR) hypothesized to be responsible for many clinical DS features, including craniofacial dysmorphology with a small mandible. Experiments with Ts1Rhr mice showed that the DSCR was not sufficient to cause all DS phenotypes by identifying uncharacteristic craniofacial abnormalities not found in individuals with DS or other DS mouse models. We hypothesized that the origins of the larger, dysmorphic mandible observed in adult Ts1Rhr mice develop from larger embryonic craniofacial precursors. Because of phenotypic variability seen in subsequent studies with Ts1Rhr mice, we also hypothesized that genetic background differences would alter Ts1Rhr developmental phenotypes. Using Ts1Rhr offspring from two genetic backgrounds, we found differences in mandibular precursor volume as well as total embryonic volume and postnatal body size of Ts1Rhr and nontrisomic littermates. Additionally, we observed increased relative expression of Dyrk1a and differential expression of Ets2 on the basis of the genetic background in the Ts1Rhr mandibular precursor. Our results suggest that trisomic gene content and allelic differences in trisomic or nontrisomic genes influence variability in gene expression and developmental phenotypes associated with DS.
In genome-wide association studies hundreds of thousands of loci are scanned in thousands of cases and controls, with the goal of identifying genomic loci underpinning disease. This is a challenging statistical problem requiring strong evidence. Only a small proportion of the heritability of common diseases has so far been explained. This "dark matter of the genome" is a subject of much discussion. It is critical to have experimental design criteria that ensure that associations between genomic loci and phenotypes are robustly detected. To ensure associations are robustly detected we require good power (e.g., 0.8) and sufficiently strong evidence [i.e., a high Bayes factor (e.g., 106, meaning the data are 1 million times more likely if the association is real than if there is no association)] to overcome the low prior odds for any given marker in a genome scan to be associated with a causal locus. Power calculations are given for determining the sample sizes necessary to detect effects with the required power and Bayes factor for biallelic markers in linkage disequilibrium with causal loci in additive, dominant, and recessive genetic models. Significantly stronger evidence and larger sample sizes are required than indicated by traditional hypothesis tests and power calculations. Many reported putative effects are not robustly detected and many effects including some large moderately low-frequency effects may remain undetected. These results may explain the dark matter in the genome. The power calculations have been implemented in R and will be available in the R package ldDesign.
Seed dormancy has been associated with red grain color in cereal crops for a century. The association was linked to qSD7-1/qPC7, a cluster of quantitative trait loci for seed dormancy/pericarp color in weedy red rice. This research delimited qSD7-1/qPC7 to the Os07g11020 or Rc locus encoding a basic helix-loop-helix family transcription factor by intragenic recombinants and provided unambiguous evidence that the association arises from pleiotropy. The pleiotropic gene expressed in early developing seeds promoted expression of key genes for biosynthesis of abscisic acid (ABA), resulting in an increase in accumulation of the dormancy-inducing hormone; activated a conserved network of eight genes for flavonoid biosynthesis to produce the pigments in the lower epidermal cells of the pericarp tissue; and enhanced seed weight. Thus, the pleiotropic locus most likely controls the dormancy and pigment traits by regulating ABA and flavonoid biosynthetic pathways, respectively. The dormancy effect could be eliminated by a heat treatment, but could not be completely overcome by gibberellic acid or physical removal of the seed maternal tissues. The dormancy-enhancing alleles differentiated into two groups basically associated with tropical and temperate ecotypes of weedy rice. Of the pleiotropic effects, seed dormancy could contribute most to the weed adaptation. Pleiotropy prevents the use of the dormancy gene to improve resistance of white pericarp cultivars against pre-harvest sprouting through conventional breeding approaches.
Hotspots regulate the position and frequency of Spo11 (Rec12)-initiated meiotic recombination, but paradoxically they are suicidal and are somehow resurrected elsewhere in the genome. After the DNA sequence-dependent activation of hotspots was discovered in fission yeast, nearly two decades elapsed before the key realizations that (A) DNA site-dependent regulation is broadly conserved and (B) individual eukaryotes have multiple different DNA sequence motifs that activate hotspots. From our perspective, such findings provide a conceptually straightforward solution to the hotspot paradox and can explain other, seemingly complex features of meiotic recombination. We describe how a small number of single-base-pair substitutions can generate hotspots de novo and dramatically alter their distribution in the genome. This model also shows how equilibrium rate kinetics could maintain the presence of hotspots over evolutionary timescales, without strong selective pressures invoked previously, and explains why hotspots localize preferentially to intergenic regions and introns. The model is robust enough to account for all hotspots of humans and chimpanzees repositioned since their divergence from the latest common ancestor.
In this essay, we revisit the status of yeast as a model system for biology. We first summarize important contributions of yeast to eukaryotic biology that we anticipated in 1988 in our first article on the subject. We then describe transformative developments that we did not anticipate, most of which followed the publication of the complete genomic sequence of Saccharomyces cerevisiae in 1996. In the intervening 23 years it appears to us that yeast has graduated from a position as the premier model for eukaryotic cell biology to become the pioneer organism that has facilitated the establishment of the entirely new fields of study called "functional genomics" and "systems biology." These new fields look beyond the functions of individual genes and proteins, focusing on how these interact and work together to determine the properties of living cells and organisms.
Here we review recent advances in understanding the regulation of mRNA synthesis in Saccharomyces cerevisiae. Many fundamental gene regulatory mechanisms have been conserved in all eukaryotes, and budding yeast has been at the forefront in the discovery and dissection of these conserved mechanisms. Topics covered include upstream activation sequence and promoter structure, transcription factor classification, and examples of regulated transcription factor activity. We also examine advances in understanding the RNA polymerase II transcription machinery, conserved coactivator complexes, transcription activation domains, and the cooperation of these factors in gene regulatory mechanisms.
In response to nitrogen starvation in the presence of a poor carbon source, diploid cells of the yeast Saccharomyces cerevisiae undergo meiosis and package the haploid nuclei produced in meiosis into spores. The formation of spores requires an unusual cell division event in which daughter cells are formed within the cytoplasm of the mother cell. This process involves the de novo generation of two different cellular structures: novel membrane compartments within the cell cytoplasm that give rise to the spore plasma membrane and an extensive spore wall that protects the spore from environmental insults. This article summarizes what is known about the molecular mechanisms controlling spore assembly with particular attention to how constitutive cellular functions are modified to create novel behaviors during this developmental process. Key regulatory points on the sporulation pathway are also discussed as well as the possible role of sporulation in the natural ecology of S. cerevisiae.
Forward genetic screens provide a powerful approach for inferring gene function on the basis of the phenotypes associated with mutated genes. However, determining the causal mutation by traditional mapping and candidate gene sequencing is often the rate-limiting step, especially when analyzing many mutants. We report two genomic approaches for more rapidly determining the identity of the affected genes in Caenorhabditis elegans mutants. First, we report our use of restriction site-associated DNA (RAD) polymorphism markers for rapidly mapping mutations after chemical mutagenesis and mutant isolation. Second, we describe our use of genomic interval pull-down sequencing (GIPS) to selectively capture and sequence megabase-sized portions of a mutant genome. Together, these two methods provide a rapid and cost-effective approach for positional cloning of C. elegans mutant loci, and are also applicable to other genetic model systems.
In many organisms, homolog pairing and synapsis at meiotic prophase depend on interactions between chromosomes and the nuclear membrane. Male Drosophila lack synapsis, but nonetheless, their chromosomes closely associate with the nuclear periphery at prophase I. To explore the functional significance of this association, we characterize mutations in nuclear blebber (nbl), a gene required for both spermatocyte nuclear shape and meiotic chromosome transmission. We demonstrate that nbl corresponds to dtopors, the Drosophila homolog of the mammalian dual ubiquitin/small ubiquitin-related modifier (SUMO) ligase Topors. We show that mutations in dtopors cause abnormalities in lamin localizations, centriole separation, and prophase I chromatin condensation and also cause anaphase I bridges that likely result from unresolved homolog connections. Bridge formation does not require mod(mdg4) in meiosis, suggesting that bridges do not result from misregulation of the male homolog conjunction complex. At the ultrastructural level, we observe disruption of nuclear shape, an uneven perinuclear space, and excess membranous structures. We show that dTopors localizes to the nuclear lamina at prophase, and also transiently to intranuclear foci. As a role of dtopors at gypsy insulator has been reported, we also asked whether these new alleles affected expression of the gypsy-induced mutation ct6 and found that it was unaltered in dtopors homozygotes. Our results indicate that dTopors is required for germline nuclear structure and meiotic chromosome segregation, but in contrast, is not necessary for gypsy insulator function. We suggest that dtopors plays a structural role in spermatocyte lamina that is critical for multiple aspects of meiotic chromosome transmission.
Sexual reproduction is a fundamental developmental process that allows for genetic diversity through the control of zygote formation, recombination, and gametogenesis. The correct regulation of these events is paramount. Sexual reproduction in filamentous fungi, including mating strategy (self-fertilization/homothallism or outcrossing/heterothallism), is determined by the expression of mating type genes at mat loci. Aspergillus nidulans matA encodes a critical regulator that is a fungal ortholog of the hSRY/SOX9 HMG box proteins. In contrast to well-studied outcrossing systems, the molecular basis of homothallism and role of mating type genes during a self-fertile sexual cycle remain largely unknown. In this study the genetic model organism, A. nidulans, has been used to investigate the regulation and molecular functions of the matA mating type gene in a homothallic system. Our data demonstrate that complex regulatory mechanisms underlie functional matA expression during self-fertilization and sexual reproduction in A. nidulans. matA expression is suppressed in vegetative hyphae and is progressively derepressed during the sexual cycle. Elevated levels of matA transcript are required for differentiation of fruiting bodies, karyogamy, meiosis, and efficient formation of meiotic progeny. matA expression is driven from both initiator (Inr) and novel promoter elements that are tightly developmentally regulated by position-dependent and position-independent mechanisms. Deletion of an upstream silencing element, matA SE, results in derepressed expression from wild-type (wt) promoter elements and activation of an additional promoter. These studies provide novel insights into the molecular basis of homothallism in fungi and genetic regulation of sexual reproduction in eukaryotes.
It is well known that the Dpp signal transducer Mad is activated by phosphorylation at its carboxy-terminus. The role of phosphorylation on other regions of Mad is not as well understood. Here we report that the phosphorylation of Mad in the linker region by the Wg antagonist Zw3 (homolog of vertebrate Gsk3-β) regulates the development of sensory organs in the anterior–dorsal quadrant of the wing. Proneural expression of Mad-RNA interference (RNAi) or a Mad transgene with its Zw3/Gsk3-β phosphorylation sites mutated (MGM) generated wings with ectopic sensilla and chemosensory bristle duplications. Studies with pMad-Gsk (an antibody specific to Zw3/Gsk3-β-phosphorylated Mad) in larval wing disks revealed that this phosphorylation event is Wg dependent (via an unconventional mechanism), is restricted to anterior–dorsal sensory organ precursors (SOP) expressing Senseless (Sens), and is always co-expressed with the mitotic marker phospho-histone3. Quantitative analysis in both Mad-RNAi and MGM larval wing disks revealed a significant increase in the number of Sens SOP. We conclude that the phosphorylation of Mad by Zw3 functions to prevent the self-renewal of Sens SOP, perhaps facilitating their differentiation via asymmetric division. The conservation of Zw3/Gsk3-β phosphorylation sites in vertebrate homologs of Mad (Smads) suggests that this pathway, the first transforming growth factor β-independent role for any Smad protein, may be widely utilized for regulating mitosis during development.
Regulation of transcription can be a complex process in which many cis- and trans-interactions determine the final pattern of expression. Among these interactions are trans-interactions mediated by the pairing of homologous chromosomes. These trans-effects are wide ranging, affecting gene regulation in many species and creating complex possibilities in gene regulation. Here we describe a novel case of trans-interaction between alleles of the Malic enzyme (Men) locus in Drosophila melanogaster that results in allele-specific, non-additive gene expression. Using both empirical biochemical and predictive bioinformatic approaches, we show that the regulatory elements of one allele are capable of interacting in trans with, and modifying the expression of, the second allele. Furthermore, we show that nonlocal factors—different genetic backgrounds—are capable of significant interactions with individual Men alleles, suggesting that these trans-effects can be modified by both locally and distantly acting elements. In sum, these results emphasize the complexity of gene regulation and the need to understand both small- and large-scale interactions as more complete models of the role of trans-interactions in gene regulation are developed.
Properly coordinated defense signaling networks are critical for the fitness of plants. One hub of the defense networks is centered on salicylic acid (SA), which plays a key role in activating disease resistance in plants. However, while a number of genes are known to affect SA-mediated defense, relatively little is known about how these gene interact genetically with each other. Here we exploited the unique defense-sensitized Arabidopsis mutant accelerated cell death (acd) 6-1 to dissect functional relationships among key components in the SA hub. We show that while enhanced disease susceptibility (eds) 1-2 and phytoalexin deficient (pad) 4-1 suppressed acd6-1–conferred small size, cell death, and defense phenotypes, a combination of these two mutations did not incur additive suppression. This suggests that EDS1 and PAD4 act in the same signaling pathway. To further evaluate genetic interactions among SA regulators, we constructed 10 pairwise crosses in the acd6-1 background among mutants defective in: SA INDUCTION-DEFICIENT 2 for SA biosynthesis; AGD2-LIKE DEFENSE 1, EDS5, and PAD4 for SA accumulation; and NONEXPRESSOR OF PR GENES 1 for SA signaling. Systematic analysis of the triple mutants based on their suppression of acd6-1–conferred phenotypes revealed complex and interactive genetic relationships among the tested SA genes. Our results suggest a more comprehensive view of the gene networks governing SA function and provide a framework for further interrogation of the important roles of SA and possibly other signaling molecules in regulating plant disease resistance.
The Minute syndrome in Drosophila melanogaster is characterized by delayed development, poor fertility, and short slender bristles. Many Minute loci correspond to disruptions of genes for cytoplasmic ribosomal proteins, and therefore the phenotype has been attributed to alterations in translational processes. Although protein translation is crucial for all cells in an organism, it is unclear why Minute mutations cause effects in specific tissues. To determine whether the heart is sensitive to haplo-insufficiency of genes encoding ribosomal proteins, we measured heart function of Minute mutants using optical coherence tomography. We found that cardiomyopathy is associated with the Minute syndrome caused by haplo-insufficiency of genes encoding cytoplasmic ribosomal proteins. While mutations of genes encoding non-Minute cytoplasmic ribosomal proteins are homozygous lethal, heterozygous deficiencies spanning these non-Minute genes did not cause a change in cardiac function. Deficiencies of genes for non-Minute mitochondrial ribosomal proteins also did not show abnormal cardiac function, with the exception of a heterozygous disruption of mRpS33. We demonstrate that cardiomyopathy is a common trait of the Minute syndrome caused by haplo-insufficiency of genes encoding cytoplasmic ribosomal proteins. In contrast, most cases of heterozygous deficiencies of genes encoding non-Minute ribosomal proteins have normal heart function in adult Drosophila.
The steroid hormone 20-hydroxyecdysone (20E) regulates gene transcription through the heterodimeric nuclear receptor composed of ecdysone receptor (EcR) and Ultraspiracle (USP). The EcR gene encodes three protein isoforms—A, B1, and B2—with variant N-terminal domains that mediate tissue and developmental stage-specific responses to 20E. Ariadne-1a is a conserved member of the RING finger family of ubiquitin ligases first identified in Drosophila melanogaster. Loss-of-function mutations at key cysteines in either of the two RING finger motifs, as well as general overexpression of this enzyme, cause lethality in pupae, which suggests a requirement in metamorphosis. Here, we show that Ariadne-1a binds specifically the isoform A of EcR and ubiquitylates it. Co-immunoprecipitation experiments indicate that the full sequence of EcRA is required for this binding. Protein levels of EcRA and USP change in opposite directions when those of ARI-1a are genetically altered. This is an isoform-specific, E3-dependent regulatory mechanism for a steroid nuclear receptor. Further, qRT-PCR experiments show that the ARI-1a levels lead to the transcriptional regulation of Eip78C, Eip74EF, Eip75B, and Br-C, as well as that of EcR and usp genes. Thus, the activity of this enzyme results in the regulation of dimerizing receptors at the protein and gene transcription levels. This fine-tuned orchestration by a conserved ubiquitin ligase is required during insect metamorphosis and, likely, in other steroid hormone-controlled processes across species.
The acquisition and maintenance of shape is critical for the normal function of most cells. Here we investigate the morphology of the pharyngeal glands of Caenorhabditis elegans. These unicellular glands have long cellular processes that extend discrete lengths through the pharyngeal musculature and terminate at ducts connected to the pharyngeal lumen. From a genetic screen we identified several mutants that affect pharyngeal gland morphology. The most severe such mutant is an allele of sma-1, which encodes a β-spectrin required for embryonic elongation, including elongation of the pharynx. In sma-1 mutants, gland projections form normally but become increasingly abnormal over time, acquiring additional branches, outgrowths, and swelling, suggestive of hypertrophy. Rather than acting in pharyngeal glands, sma-1 functions in the surrounding musculature, suggesting that pharyngeal muscles play a critical role in maintenance of gland morphology by restricting their growth, and analysis of other mutants known to affect pharyngeal muscles supports this hypothesis. We suggest that gland morphology is maintained by a balance of forces from the muscles and the glands.
Gradients of acetylcholine can stimulate growth cone turning when applied to neurons grown in culture, and it has been suggested that acetylcholine could act as a guidance cue. However, the role acetylcholine plays in directing axon migrations in vivo is not clear. Here, we show that acetylcholine positively regulates signaling pathways that mediate axon responses to guidance cues in Caenorhabditis elegans. Mutations that disrupt acetylcholine synthesis, transportation, and secretion affect circumferential axon guidance of the AVM neuron and in these mutants exogenously supplied acetylcholine improves AVM circumferential axon guidance. These effects are not observed for the circumferential guidance of the DD and VD motor neuron axons, which are neighbors of the AVM axon. Circumferential guidance is directed by the UNC-6 (netrin) and SLT-1 (slit) extracellular cues, and exogenously supplied acetylcholine can improve AVM axon guidance in mutants when either UNC-6– or SLT-1–induced signaling is disrupted, but not when both signaling pathways are perturbed. Not in any of the mutants does exogenously supplied acetylcholine improve DD and VD axon guidance. The ability of acetylcholine to enhance AVM axon guidance only in the presence of either UNC-6 or SLT-1 indicates that acetylcholine potentiates UNC-6 and SLT-1 guidance activity, rather than acting itself as a guidance cue. Together, our results show that for specific neurons acetylcholine plays an important role in vivo as a modulator of axon responses to guidance cues.
Cytoplasmic polyadenylation element binding (CPEB) proteins bind mRNAs to regulate their localization and translation. While the first CPEBs discovered were germline specific, subsequent studies indicate that CPEBs also function in many somatic tissues including the nervous system. Drosophila has two CPEB family members. One of these, orb, plays a key role in the establishment of polarity axes in the developing egg and early embryo, but has no known somatic functions or expression outside of the germline. Here we characterize the other Drosophila CPEB, orb2. Unlike orb, orb2 mRNA and protein are found throughout development in many different somatic tissues. While orb2 mRNA and protein of maternal origin are distributed uniformly in early embryos, this pattern changes as development proceeds and by midembryogenesis the highest levels are found in the CNS and PNS. In the embryonic CNS, Orb2 appears to be concentrated in cell bodies and mostly absent from the longitudinal and commissural axon tracts. In contrast, in the adult brain, the protein is seen in axonal and dendritic terminals. Lethal effects are observed for both RNAi knockdowns and orb2 mutant alleles while surviving adults display locomotion and behavioral defects. We also show that orb2 funtions in asymmetric division of stem cells and precursor cells during the development of the embryonic nervous system and mesoderm.
Deleterious mutations tend to be recessive. Several theories, notably those of Fisher (based on selection) and Wright (based on metabolism), have been put forward to explain this pattern. Despite a long-lasting debate, the matter remains unresolved. This debate has focused on the average dominance of mutations. However, we also know very little about the distribution of dominance coefficients among mutations, and about its variation across environments. In this article we present a new approach to predicting this distribution. Our approach is based on a phenotypic fitness landscape model. First, we show that under a very broad range of conditions (and environments), the average dominance of mutation of small effects should be approximately one-quarter as long as adaptation of organisms to their environment can be well described by stabilizing selection on an arbitrary set of phenotypic traits. Second, the theory allows predicting the whole distribution of dominance coefficients among mutants. Because it provides quantitative rather than qualitative predictions, this theory can be directly compared to data. We found that its prediction on mean dominance (average dominance close to 0.25) agreed well with the data, based on a meta-analysis of dominance data for mildly deleterious mutations. However, a simple landscape model does not account for the dominance of mutations of large effects and we provide possible extension of the theory for this class of mutations. Because dominance is a central parameter for evolutionary theory, and because these predictions are quantitative, they set the stage for a wide range of applications and further empirical tests.
Adaptations are constructed through the sequential substitution of beneficial mutations by natural selection. However, the rarity of beneficial mutations has precluded efforts to describe even their most basic properties. Do beneficial mutations typically confer small or large fitness gains? Are their fitness effects environment specific, or are they broadly beneficial across a range of environments? To answer these questions, we used two subsets (n = 18 and n = 63) of a large library of mutants carrying antibiotic resistance mutations in the bacterium Pseudomonas fluorescens whose fitness, along with the antibiotic sensitive ancestor, was assayed across 95 novel environments differing in the carbon source available for growth. We explore patterns of genotype-by-environment (GxE) interactions and ecological specialization among the 18 mutants initially found superior to the sensitive ancestor in one environment. We find that GxE is remarkably similar between the two sets of mutants and that beneficial mutants are not typically associated with large costs of adaptation. Fitness effects among beneficial mutants depart from a strict exponential distribution: they assume a variety of shapes that are often roughly L shaped but always right truncated. Distributions of (beneficial) fitness effects predicted by a landscape model assuming multiple traits underlying fitness and a single optimum often provide a good description of the empirical distributions in our data. Simulations of data sets containing a mixture of single and double mutants under this landscape show that inferences about the distribution of fitness effects of beneficial mutants is quite robust to contamination by second-site mutations.
How genomic diversity within bacterial populations originates and is maintained in the presence of frequent recombination is a central problem in understanding bacterial evolution. Natural populations of Borrelia burgdorferi, the bacterial agent of Lyme disease, consist of diverse genomic groups co-infecting single individual vertebrate hosts and tick vectors. To understand mechanisms of sympatric genome differentiation in B. burgdorferi, we sequenced and compared 23 genomes representing major genomic groups in North America and Europe. Linkage analysis of >13,500 single-nucleotide polymorphisms revealed pervasive horizontal DNA exchanges. Although three times more frequent than point mutation, recombination is localized and weakly affects genome-wide linkage disequilibrium. We show by computer simulations that, while enhancing population fitness, recombination constrains neutral and adaptive divergence among sympatric genomes through periodic selective sweeps. In contrast, simulations of frequency-dependent selection with recombination produced the observed pattern of a large number of sympatric genomic groups associated with major sequence variations at the selected locus. We conclude that negative frequency-dependent selection targeting a small number of surface-antigen loci (ospC in particular) sufficiently explains the maintenance of sympatric genome diversity in B. burgdorferi without adaptive divergence. We suggest that pervasive recombination makes it less likely for local B. burgdorferi genomic groups to achieve host specialization. B. burgdorferi genomic groups in the northeastern United States are thus best viewed as constituting a single bacterial species, whose generalist nature is a key to its rapid spread and human virulence.
Mounting evidence points to differences in gene regulation as a major source of phenotypic variation. MicroRNA-mediated post-transcriptional regulation has emerged recently as a key factor controlling gene activity during development. MicroRNA genes are abundant in genomes, acting as managers of gene expression by directing translational repression. Thus, understanding the role of microRNA sequence variation within populations is essential for fully dissecting the origin and maintenance of phenotypic diversity in nature. In this study, we investigate allelic variation at microRNA loci in the nematode Caenorhabditis briggsae, a close relative of C. elegans. Phylogeographic structure in C. briggsae partitions most strains from around the globe into a "temperate" or a "tropical" clade, with a few strains having divergent, geographically restricted genotypes. Remarkably, strains that follow this latitudinal dichotomy also differ in temperature-associated fitness. With this phylogeographic pattern in mind, we examined polymorphisms in 18 miRNAs in a global sample of C. briggsae isolates and tested whether newly isolated strains conform to this phylogeography. Surprisingly, nucleotide diversity is relatively high in this class of gene that generally experiences strong purifying selection. In particular, we find that miRNAs in C. briggsae are substantially more polymorphic than in Arabidopsis thaliana, despite similar background levels of neutral site diversity between the two species. We find that some mutations suggest functional divergence on the basis of requirements for target site recognition and computational prediction of the effects of the polymorphisms on RNA folding. These findings demonstrate the potential for miRNA polymorphisms to contribute to phenotypic variation within a species. Sequences were deposited in GenBank under accession nos. JN251323–JN251744.
Analysis of genomic data requires an efficient way to calculate likelihoods across very large numbers of loci. We describe a general method for finding the distribution of genealogies: we allow migration between demes, splitting of demes [as in the isolation-with-migration (IM) model], and recombination between linked loci. These processes are described by a set of linear recursions for the generating function of branch lengths. Under the infinite-sites model, the probability of any configuration of mutations can be found by differentiating this generating function. Such calculations are feasible for small numbers of sampled genomes: as an example, we show how the generating function can be derived explicitly for three genes under the two-deme IM model. This derivation is done automatically, using Mathematica. Given data from a large number of unlinked and nonrecombining blocks of sequence, these results can be used to find maximum-likelihood estimates of model parameters by tabulating the probabilities of all relevant mutational configurations and then multiplying across loci. The feasibility of the method is demonstrated by applying it to simulated data and to a data set previously analyzed by Wang and Hey (2010) consisting of 26,141 loci sampled from Drosophila simulans and D. melanogaster. Our results suggest that such likelihood calculations are scalable to genomic data as long as the numbers of sampled individuals and mutations per sequence block are small.
In many biological scenarios, from the development of drug resistance in pathogens to the progression of healthy cells toward cancer, quantifying the selection acting on observed mutations is a central question. One difficulty in answering this question is the complexity of the background upon which mutations can arise, with multiple potential interactions between genetic loci. We here present a method for discerning selection from a population history that accounts for interference between mutations. Given sequences sampled from multiple time points in the history of a population, we infer selection at each locus by maximizing a likelihood function derived from a multilocus evolution model. We apply the method to the question of distinguishing between loci where new mutations are under positive selection (drivers) and loci that emit neutral mutations (passengers) in a Wright–Fisher model of evolution. Relative to an otherwise equivalent method in which the genetic background of mutations was ignored, our method inferred selection coefficients more accurately for both driver mutations evolving under clonal interference and passenger mutations reaching fixation in the population through genetic drift or hitchhiking. In a population history recorded by 750 sets of sequences of 100 individuals taken at intervals of 100 generations, a set of 50 loci were divided into drivers and passengers with a mean accuracy of >0.95 across a range of numbers of driver loci. The potential application of our model, either in full or in part, to a range of biological systems, is discussed.
Understanding the genetic basis of reproductive isolation between recently diverged species is a central problem in evolutionary genetics. Here, I present analyses of the genetic architecture underlying hybrid male sterility and segregation distortion between the Bogota and USA subspecies of Drosophila pseudoobscura. Previously, a single gene, Overdrive (Ovd), was shown to be necessary but not sufficient for both male sterility and segregation distortion in F1 hybrids between these subspecies, requiring several interacting partner loci for full manifestation of hybrid phenomena. I map these partner loci separately on the Bogota X chromosome and USA autosomes using a combination of different mapping strategies. I find that hybrid sterility involves a single hybrid incompatibility of at least seven interacting partner genes that includes three large-effect loci. Segregation distortion involves three loci on the Bogota X chromosome and one locus on the autosomes. The genetic bases of hybrid sterility and segregation distortion are at least partially—but not completely—overlapping. My results lay the foundation for fine-mapping experiments to identify the complete set of genes that interact with Overdrive. While individual genes that cause hybrid sterility or inviability have been identified in a few cases, my analysis provides a comprehensive look at the genetic architecture of all components of a hybrid incompatibility underlying F1 hybrid sterility. Such an analysis would likely be unfeasible for most species pairs due to their divergence time and emphasizes the importance of young species pairs such as the D. pseudoobscura subspecies studied here.
Epistatic interactions are widespread, and many of these interactions involve combinations of alleles at different loci that are deleterious when present in the same individual. The average genetic environment of sex-linked genes differs from that of autosomal genes, suggesting that the population genetics of interacting X-linked and autosomal alleles may be complex. Using both analytical theory and computer simulations, we analyzed the evolutionary trajectories and mutation–selection balance conditions for X–autosome synthetic lethals and steriles. Allele frequencies follow a set of fundamental trajectories, and incompatible alleles are able to segregate at much higher frequencies than single-locus expectations. Equilibria exist, and they can involve fixation of either autosomal or X-linked alleles. The exact equilibrium depends on whether synthetic alleles are dominant or recessive and whether fitness effects are seen in males, females, or both sexes. When single-locus fitness effects and synthetic incompatibilities are both present, population dynamics depend on the dominance of alleles and historical contingency (i.e., whether X-linked or autosomal mutations occur first). Recessive synthetic lethality can result in high-frequency X-linked alleles, and dominant synthetic lethality can result in high-frequency autosomal alleles. Many X–autosome incompatibilities in natural populations may be cryptic, appearing to be single-locus effects because one locus is fixed. We also discuss the implications of these findings with respect to standing genetic variation and the origins of Haldane’s rule.
We consider an asexual population under strong selection–weak mutation conditions evolving on rugged fitness landscapes with many local fitness peaks. Unlike the previous studies in which the initial fitness of the population is assumed to be high, here we start the adaptation process with a low fitness corresponding to a population in a stressful novel environment. For generic fitness distributions, using an analytic argument we find that the average number of steps to a local optimum varies logarithmically with the genotype sequence length and increases as the correlations among genotypic fitnesses increase. When the fitnesses are exponentially or uniformly distributed, using an evolution equation for the distribution of population fitness, we analytically calculate the fitness distribution of fixed beneficial mutations and the walk length distribution.
A fundamental problem of asexual adaptation is that beneficial substitutions are not efficiently accumulated in large populations: Beneficial mutations often go extinct because they compete with one another in going to fixation. It has been argued that such clonal interference may have led to the evolution of sex and recombination in well-mixed populations. Here, we study clonal interference, and mechanisms of its mitigation, in an evolutionary model of spatially structured populations with uniform selection pressure. Clonal interference is much more prevalent with spatial structure than without, due to the slow wave-like spread of beneficial mutations through space. We find that the adaptation speed of asexuals saturates when the linear habitat size exceeds a characteristic interference length, which becomes shorter with smaller migration and larger mutation rate. The limiting speed is proportional to μ1/2 and μ1/3 in linear and planar habitats, respectively, where the mutational supply μ is the product of mutation rate and local population density. This scaling and the existence of a speed limit should be amenable to experimental tests as they fall far below predicted adaptation speeds for well-mixed populations (that scale as the logarithm of population size). Finally, we show that not only recombination, but also long-range migration is a highly efficient mechanism of relaxing clonal competition in structured populations. Our conservative estimates of the interference length predict prevalent clonal interference in microbial colonies and biofilms, so clonal competition should be a strong driver of both genetic and spatial mixing in those contexts.
The recent progress in sequencing technologies makes possible large-scale medical sequencing efforts to assess the importance of rare variants in complex diseases. The results of such efforts depend heavily on the use of efficient study designs and analytical methods. We introduce here a unified framework for association testing of rare variants in family-based designs or designs based on unselected affected individuals. This framework allows us to quantify the enrichment in rare disease variants in families containing multiple affected individuals and to investigate the optimal design of studies aiming to identify rare disease variants in complex traits. We show that for many complex diseases with small values for the overall sibling recurrence risk ratio, such as Alzheimer’s disease and most cancers, sequencing affected individuals with a positive family history of the disease can be extremely advantageous for identifying rare disease variants. In contrast, for complex diseases with large values of the sibling recurrence risk ratio, sequencing unselected affected individuals may be preferable.
Mothers are often the most important determinant of traits expressed by their offspring. These "maternal effects" (MEs) are especially crucial in early development, but can also persist into adulthood. They have been shown to play a role in a diversity of evolutionary and ecological processes, especially when genetically based. Although the importance of MEs is becoming widely appreciated, we know little about their underlying genetic basis. We address the dearth of genetic data by providing a simple approach, using combined genotype information from parents and offspring, to identify "maternal genetic effects" (MGEs) contributing to natural variation in complex traits. Combined with experimental cross-fostering, our approach also allows for the separation of pre- and postnatal MGEs, providing rare insights into prenatal effects. Applying this approach to an experimental mouse population, we identified 13 ME loci affecting body weight, most of which (12/13) exhibited prenatal effects, and nearly half (6/13) exhibiting postnatal effects. MGEs contributed more to variation in body weight than the direct effects of the offsprings’ own genotypes until mice reached adulthood, but continued to represent a major component of variation through adulthood. Prenatal effects always contributed more variation than postnatal effects, especially for those effects that persisted into adulthood. These results suggest that MGEs may be an important component of genetic architecture that is generally overlooked in studies focused on direct mapping from genotype to phenotype. Our approach can be used in both experimental and natural populations, providing a widely practicable means of expanding our understanding of MGEs.
In intraspecific crosses between cultivated rice (Oryza sativa) subspecies indica and japonica, the hybrid male sterility gene S24 causes the selective abortion of male gametes carrying the japonica allele (S24-j) via an allelic interaction in the heterozygous hybrids. In this study, we first examined whether male sterility is due solely to the single locus S24. An analysis of near-isogenic lines (NIL-F1) showed different phenotypes for S24 in different genetic backgrounds. The S24 heterozygote with the japonica genetic background showed male semisterility, but no sterility was found in heterozygotes with the indica background. This result indicates that S24 is regulated epistatically. A QTL analysis of a BC2F1 population revealed a novel sterility locus that interacts with S24 and is found on rice chromosome 2. The locus was named Epistatic Factor for S24 (EFS). Further genetic analyses revealed that S24 causes male sterility when in combination with the homozygous japonica EFS allele (efs-j). The results suggest that efs-j is a recessive sporophytic allele, while the indica allele (EFS-i) can dominantly counteract the pollen sterility caused by S24 heterozygosity. In summary, our results demonstrate that an additional epistatic locus is an essential element in the hybrid sterility caused by allelic interaction at a single locus in rice. This finding provides a significant contribution to our understanding of the complex molecular mechanisms underlying hybrid sterility and microsporogenesis.
There is increasing evidence that epigenetic marks such as DNA methylation contribute to phenotypic variation by regulating gene transcription, developmental plasticity, and interactions with the environment. However, relatively little is known about the relationship between the stability and distribution of DNA methylation within chromosomes and the ability to detect trait loci. Plant genomes have a distinct range of target sites and more extensive DNA methylation than animals. We analyzed the stability and distribution of epialleles within the complex genome of the oilseed crop plant Brassica napus. For methylation sensitive AFLP (MSAP) and retrotransposon (RT) epimarkers, we found a high degree of stability, with 90% of mapped markers retaining their allelic pattern in contrasting environments and developmental stages. Moreover, for two distinct parental lines 97% of epialleles were transmitted through five meioses and segregated in a mapping population. For the first time we have established the genetic position for 17 of the 19 centromeres within this amphidiploid species. Epiloci and genetic loci were distributed within distinct clusters, indicating differential detection of recombination events. This enabled us to identify additional significant QTL associated with seven important agronomic traits in the centromeric regions of five linkage groups.
Allopolyploidy is an important process during plant evolution that results in the reunion of two divergent genomes into a common nucleus. Many of the immediate as well as longer-term genomic and epigenetic responses to polyploidy have become appreciated. To investigate the modifications of gene expression at the proteome level caused by allopolyploid formation, we conducted a comparative analysis of cotton seed proteomes from the allopolyploid Gossypium hirsutum (AD genome) and its model A-genome and D-genome diploid progenitors. An unexpectedly high level of divergence among the three proteomes was found, with about one-third of all protein forms being genome specific. Comparative analysis showed that there is a higher degree of proteomic similarity between the allopolyploid and its D-genome donor than its A-genome donor, reflecting a biased accumulation of seed proteins in the allopolyploid. Protein identification and genetic characterization of high-abundance proteins revealed that two classes of seed storage proteins, vicilins and legumins, compose the major component of cotton seed proteomes. Analyses further indicate differential regulation or modification of homoeologous gene products, as well as novel patterns in the polyploid proteome that may result from the interaction between homoeologous gene products. Our findings demonstrate that genomic merger and doubling have consequences that extend beyond the transcriptome into the realm of the proteome and that unequal expression of proteins from diploid parental genomes may occur in allopolyploids.
Gene expression in Anopheles gambiae shows a deficiency of testis-expressed genes on the X chromosome associated with an excessive movement of retrogene duplication. We suggest that the degeneration of sex chromosomes in this monandrous species is likely the result of pressures from X inactivation, dosage compensation, and sexual antagonism.
We tested whether functionally important sites in bacterial, yeast, and animal promoters are more conserved than their neighbors. We found that substitutions are predominantly seen in less important sites and that those that occurred tended to have less impact on gene expression than possible alternatives. These results suggest that purifying selection operates on promoter sequences.
The structure of the meiosis-specific synaptonemal complex, which is perhaps the central visible characteristic of meiotic prophase, has been a matter of intense interest for decades. Although a general picture of the interactions between the transverse filament proteins that create this structure has emerged from studies in a variety of organisms, a recent analysis of synaptonemal complex structure in Caenorhabditis elegans by Schild-Prüfert et al. (2011) has provided the clearest picture of the structure of the architecture of a synaptonemal complex to date. Although the transverse filaments of the worm synaptonemal complex are assembled differently then those observed in yeast, mammalian, and Drosophila synaptonemal complexes, a comparison of the four assemblies shows that achieving the overall basic structure of the synaptonemal complex is far more crucial than conserving the structures of the individual transverse filaments.
Four different SYP proteins (SYP-1, SYP-2, SYP-3, and SYP-4) have been proposed to form the central region of the synaptonemal complex (SC) thereby bridging the axes of paired meiotic chromosomes in Caenorhabditis elegans. Their interdependent localization suggests that they may interact within the SC. Our studies reveal for the first time how these SYP proteins are organized in the central region of the SC. Yeast two-hybrid and co-immunoprecipitation studies show that SYP-1 is the only SYP protein that is capable of homotypic interactions, and is able to interact with both SYP-2 and SYP-3 directly, whereas SYP-2 and SYP-3 do not seem to interact with each other. Specifically, the coiled-coil domain of SYP-1 is required both for its homotypic interactions and its interaction with the C-terminal domain of SYP-2. Meanwhile, SYP-3 interacts with the C-terminal end of SYP-1 via its N-terminal domain. Immunoelectron microscopy analysis provides insight into the orientation of these proteins within the SC. While the C-terminal domain of SYP-3 localizes in close proximity to the chromosome axes, the N-terminal domains of both SYP-1 and SYP-4, as well as the C-terminal domain of SYP-2, are located in the middle of the SC. Taking into account the different sizes of these proteins, their interaction abilities, and their orientation within the SC, we propose a model of how the SYP proteins link the homologous axes to provide the conserved structure and width of the SC in C. elegans.
The budding yeast Saccharomyces cerevisiae has proven to be a rich source of information about the mechanisms and regulation of homologous recombination during meiosis. A common technique for studying this process involves microdissecting the four products (ascospores) of a single meiosis and analyzing the configuration of genetic markers in the spores that are viable. Although this type of analysis is powerful, it can be laborious and time-consuming to characterize the large numbers of meioses needed to generate statistically robust data sets. Moreover, the reliance on viable (euploid) spores has the potential to introduce selection bias, especially when analyzing mutants with elevated frequencies of meiotic chromosome missegregation. To overcome these limitations, we developed a versatile, portable set of reporter constructs that drive fluorescent protein expression specifically in only those spores that inherit the reporter. These spore-autonomous fluorescence constructs allow direct visualization of inheritance patterns in intact tetrads, eliminating the need for microdissection and permitting meiotic segregation patterns to be ascertained even in aneuploid spores. As proof of principle, we demonstrate how different arrangements of reporters can be used to quantify crossover frequency, crossover interference, gene conversion, crossover/noncrossover ratios, and chromosome missegregation.
In mammals, X-chromosome inactivation (XCI) equalizes X-linked gene expression between XY males and XX females and is controlled by a specialized region known as the X-inactivation center (Xic). The Xic harbors two chromatin interaction domains, one centered around the noncoding Xist gene and the other around the antisense Tsix counterpart. Previous work demonstrated the existence of a chromatin transitional zone between the two domains. Here, we investigate the region and discover a conserved element, RS14, that presents a strong binding site for Ctcf protein. RS14 possesses an insulatory function suggestive of a boundary element and is crucial for cell differentiation and growth. Knocking out RS14 results in compromised Xist induction and aberrant XCI in female cells. These data demonstrate that a junction element between Tsix and Xist contributes to the initiation of XCI.
Metazoan genomes encode an abundant collection of mRNA-like, long noncoding (lnc)RNAs. Although lncRNAs greatly expand the transcriptional repertoire, we have a limited understanding of how these RNAs contribute to developmental regulation. Here, we investigate the function of the Drosophila lncRNA called yellow-achaete intergenic RNA (yar). Comparative sequence analyses show that the yar gene is conserved in Drosophila species representing 40–60 million years of evolution, with one of the conserved sequence motifs encompassing the yar promoter. Further, the timing of yar expression in Drosophila virilis parallels that in D. melanogaster, suggesting that transcriptional regulation of yar is conserved. The function of yar was defined by generating null alleles. Flies lacking yar RNAs are viable and show no overt morphological defects, consistent with maintained transcriptional regulation of the adjacent yellow (y) and achaete (ac) genes. The location of yar within a neural gene cluster led to the investigation of effects of yar in behavioral assays. These studies demonstrated that loss of yar alters sleep regulation in the context of a normal circadian rhythm. Nighttime sleep was reduced and fragmented, with yar mutants displaying diminished sleep rebound following sleep deprivation. Importantly, these defects were rescued by a yar transgene. These data provide the first example of a lncRNA gene involved in Drosophila sleep regulation. We find that yar is a cytoplasmic lncRNA, suggesting that yar may regulate sleep by affecting stabilization or translational regulation of mRNAs. Such functions of lncRNAs may extend to vertebrates, as lncRNAs are abundant in neural tissues.
The 5'-UTR serves as the loading dock for ribosomes during translation initiation and is the key site for translation regulation. Many genes in the yeast Saccharomyces cerevisiae contain poly(A) tracts in their 5'-UTRs. We studied these pre-AUG poly(A) tracts in a set of 3274 recently identified 5'-UTRs in the yeast to characterize their effect on in vivo protein abundance, ribosomal density, and protein synthesis rate in the yeast. The protein abundance and the protein synthesis rate increase with the length of the poly(A), but exhibit a dramatic decrease when the poly(A) length is ≥12. The ribosomal density also reaches the lowest level when the poly(A) length is ≥12. This supports the hypothesis that a pre-AUG poly(A) tract can bind to translation initiation factors to enhance translation initiation, but a long (≥12) pre-AUG poly(A) tract will bind to Pab1p, whose binding size is 12 consecutive A residues in yeast, resulting in repression of translation. The hypothesis explains why a long pre-AUG poly(A) leads to more efficient translation initiation than a short one when PABP is absent, and why pre-AUG poly(A) is short in the early genes but long in the late genes of vaccinia virus.
In Saccharomyces cerevisiae, the TEA transcription factor Tec1 controls several developmental programs in response to nutrients and pheromones. Tec1 is targeted by the pheromone-responsive Fus3/Kss1 mitogen-activated protein kinase (MAPK) cascade, which destabilizes the transcription factor to ensure efficient mating of sexual partner cells. The regulation of Tec1 by signaling pathways that control cell division and development in response to nutrients, however, is not known. Here, we show that Tec1 protein stability is under control of the nutrient-sensitive target of rapamycin complex 1 (TORC1) signaling pathway via the Tip41-Tap42-Sit4 branch. We further show that degradation of Tec1 upon inhibition of TORC1 by rapamycin does not involve polyubiquitylation and appears to be proteasome independent. However, rapamycin-induced Tec1 degradation depends on the HECT ubiquitin ligase Rsp5, which physically interacts with Tec1 via conserved PxY motives. We further demonstrate that rapamycin and mating pheromone control Tec1 protein stability through distinct mechanisms by targeting different domains of the transcription factor. Finally, we show that Tec1 is a positive regulator of yeast chronological lifespan (CLS), a known TORC1-regulated process. Our findings indicate that in yeast, Tec1 links TORC1 and MAPK signaling pathways to coordinate control of cellular development in response to different stimuli.