A global genetic interaction network maps a wiring diagram of cellular function


The experiments and analyses described in this study were based on a representative subset of the complete SGA dataset, unless otherwise noted. The subset includes ~19 million double mutants covering more than 80% of all double mutants tested, and consists of ~460,000 negative and ~275,000 positive genetic interactions, identified at an intermediate confidence threshold on the genetic interaction score that was previously defined (1). The nonessential x nonessential (NxN), essential x essential (ExE) and essential x nonessential (ExN) genetic networks based on this subset of data are described in detail below and are provided in various formats as Supplementary Data Files (Data Files S1-S2). We note that Data Files S1-S2 contain raw interaction data corresponding to all tested gene pairs. This data should be filtered for specific applications. We suggest three different thresholds [lenient (P < 0.05), intermediate (P < 0.05 and |ε| > 0.08), and stringent confidence (P < 0.05 and 0.16 < ε < -0.12)] that strike different balances between false negatives and false positives as described in our previous study (1). The quality estimates of data produced at each of these thresholds was completed as described below and is provided in fig. S2. Pearson Correlation Coefficient (PCC) matrices used to generate the essential, nonessential and global similarity networks (Fig. 1-2) are also provided (Data File S3). The dataset can also be browsed interactively at http://thecellmap.org/.

The complete genetic interaction dataset based on analysis of ~23 million double mutants, which mapped ~550,000 negative and ~350,000 positive genetic interactions and covers ~90% of all yeast genes as either array and/or query mutants can be downloaded from http://thecellmap.org/costanzo2016/.

Download the complete Supplementary Online Material package. (500MB)

  • It appears you don't have a PDF plugin for this browser. No biggie... you can still download the document on the link above.

  • Table S1 and S2. Characterization of high- and low-degree genes

    Genes on the DMA and TSA with the highest 20% (Table S1) and lowest 20% (Table S2) negative GI degrees, based on the intermediate threshold, were tested for associations with gene features. In cases where multiple alleles of one query or array gene were screened (about one third of the genes represented on the TSA) one allele was selected randomly before calculating degree. Part 1 of each table (S1-1 and S2-1) shows results of two-tailed hypergeometric tests of enrichment and depletion of the presence of binary gene features the gene set. All P-values are doubled to correct for performing two tests for each feature. Only the lower of the two P-values is shown in the table and, if significant (P < 0.05), the “Test result” column indicates which test was used. Part 2 of each table (S1-2 and S2-2) shows results of Wilcoxon rank-sum tests on continuous and ordinal gene features, comparing genes in the high- and low-20% set to all other genes. The “Test result” column indicates how the median degree of the focus gene set compares to other genes if the P-value is significant; if the medians are equal and the P-value significant, the mean degrees were compared instead and the result column says “(mean)”. “N/A” indicates that < 60% of data is present for genes on the array and the test was not done. Uncorrected P-values are listed for all features. Given that analysis of different features required using different statistical tests and some features are not expected to be independent of each other, no multiple hypotheses correction procedures were used. We do note that 31 gene features were tested.


  • Table S3. Confirmation of genetic suppression interactions

    Gene pairs corresponding to a subset of 302 positive genetic interactions involving a proteasome component (100 interactions) or gene pairs not involving a proteasome gene (202 interactions) were selected and confirmed by spot dilution growth assays on solid agar medium. Positive interactions involving the proteasome exhibited SGA scores ranging (ε) from 0.09 to 0.36 whereas SGA scores (ε) for non-proteasome gene pairs ranged from 0.19 to 0.44. Despite a lower score magnitude, 32% of positive interactions involving the proteasome were classified as genetic suppression compared to only 24% of positive interactions not involving the proteasome. Based on these results, we estimated that the proteasome participates in 1168 genetic suppression interactions compared to 544 suppression interactions involving non-proteasome genes in the ExE network.


    Table S3_suppression.pdf
  • Fig. S1. Comparison of genetic interaction trends for DAmP and TS alleles

    (A) Scatter plot comparing negative genetic interaction (GI) degree of DAmP and TS alleles of the same gene. (B) Scatter plot comparing positive GI degree of DAmP and TS alleles for the same gene. The Spearman correlation is noted on each plot and illustrates that TS and DAmP alleles for the same gene do not tend to show similar numbers of negative and positive genetic interactions. (C) Distribution of negative GI degree of all TS and DAmP queries against the essential genes on the TSA. (D) Distribution of positive GI degree of all TS and DAmP queries against the essential genes on the TSA. In (C) and (D) the header shows the P-value derived from a rank-sum test. TS alleles have higher positive and negative genetic interaction degree than DAmP mutants. (E) Single mutant fitness (SMF) distribution of all TS and DAmP query alleles screened against the TSA. The header shows the P-value derived from a rank-sum test. TS alleles exhibit greater fitness defects than DAmP alleles. (F) Negative query strain GI degree of TS and DAmP alleles binned by SMF. (G) Positive query strain GI degree of TS and DAmP alleles binned by SMF. All GI degrees were averaged across 100 randomizations where a single array allele per gene was randomly selected.


    S01.pdf
  • Fig. S2. Reproducibility analysis

    (A) Average precision and recall of genetic interactions (GI) derived from screening both essential and nonessential gene queries against the essential TS array (TSA) and nonessential deletion mutant array (DMA) at different SGA score cutoffs. These estimates are based on a subset of 40 screens (14 queries against the TSA and 26 queries against the DMA) that were each replicated 5 times and true interactions were assumed to be interactions that were found in two or more of the replicates. Error bars indicate the standard deviation. In general, negative and positive interactions derived from screens against the same mutant array exhibit similar levels of technical reproducibility. Positive and negative GIs involving essential genes tend to be more reproducible than nonessential gene interactions. (B) Fraction of GIs that were identified at least 1, 2, 3, 4, or 5 times out of 5 screens. The average fraction of GIs for queries with at least 5 replicates is shown for actual called interactions (blue lines for negative GIs; yellow lines for positive GIs) and randomly selected gene pairs (grey lines). These results are based on an intermediate genetic interaction score cutoff (genetic interaction score, |ε| ≥ 0.08, P < 0.05). (C) Fraction of interacting genes that are co-annotated to the same GO biological process term. GIs were grouped by the number of times they were found in the 5 replicates for a given query. The average across queries with at least 5 replicates is shown. The dashed line indicates the background co-annotation rate for non-interacting pairs and (*) indicates statistically significant differences from background (P < 0.05). These results are based on an intermediate genetic interaction score cutoff (genetic interaction score, |ε| ≥ 0.08, P < 0.05). These results show that highly reproducible negative interactions (blue) are more likely to connect gene pairs that are annotated to the same GO biological process terms for both essential and non-essential genes. The relationship between functional relationships and reproducibility of positive interactions was weaker for non-essential genes and a similar trend between functional relationship and interaction reproducibility was not observed for essential genes. See Methods Section entitled, “Estimating reproducibility of genetic interactions”.


    S02.pdf
  • Fig. S3. Genetic interaction network coverage

    Circles represent all unique pairs of genes, unique essential gene pairs (ExE), unique essential-nonessential gene pairs (ExN) and unique nonessential gene pairs (NxN) encoded in the S. cerevisiae genome. The fraction (%) of gene pairs tested for genetic interactions in the complete SGA dataset, ExE, ExN and NxN networks described in this study (http://thecellmap.org/costanzo2016/) are shown in blue, and the fraction of tested pairs relative to all annotated genes is indicated, whereas the fraction of all tested pairs relative to the number of gene pairs that could be tested by SGA is indicated in brackets. The fraction of gene pairs which have not yet been tested is shown in white and the fraction of gene pairs which could not be tested is shown in grey. The fraction of “Untestable” gene pairs includes mutant strains that are incompatible with SGA technology (e.g. sterile mutants, histidine, arginine and lysine auxotrophs etc.), mutant strains that do not survive SGA selection steps due to extreme fitness defects, and genes that are not represented in our array and/or query mutant collections.


    S03.pdf
  • Fig. S4. Functional evaluation of genetic interaction profiles

    Genes with varying degree of genetic interaction profile similarity were evaluated for overlap with either Gene Ontology biological process co-annotations or protein-protein interactions using precision-recall analysis as described in (62). For all plots, gene pairs were sorted by Pearson correlation coefficients reflecting similarity in their genetic interaction profiles. The dashed lines show the background rate of co-annotation for the relevant set of gene pairs. Precision-recall analysis was completed separately for genetic interaction profiles derived from the essential, nonessential and the combined genetic interaction profile similarity networks. (A) Gene pairs evaluated against a GO biological process standard. In general, a genetic interaction profile similarity network based on both nonessential and essential gene profiles (global similarity network) identified functionally related genes pairs with the greatest amount of accuracy. (B) Gene pairs evaluated against a protein-protein interaction standard. See Methods Section entitled, “Comparison of SGA genetic interactions with other genomic datasets and precision-recall analysis”.


    S04.pdf
  • Fig. S5. Gene pairs in the same essential complex exhibit more similar genetic interaction profiles than nonessential co-complexed genes

    Shown are the distributions of Pearson correlation coefficients (PCC) measuring genetic interaction (GI) profile similarity for all pairs of essential genes (purple triangles) and all pairs of nonessential genes (orange triangles). Also shown are PCCs for GI profiles corresponding to pairs of genes belonging to the same protein complex (purple circles for essential genes, orange circles for nonessential genes). Distributions were normalized (total area of 1) and smoothed over a 3-bin window. This analysis illustrates that GI profiles associated with essential genes encoding members of the same complex tend to share greater similarity to each other compared to GI profiles of nonessential genes encoding members of the same complex. This indicates that essential GIs, in general, tend to be more coherent than nonessential GIs.

    S05.pdf
  • Fig. S6. Predicting gene function from essential versus nonessential genetic interaction profile similarity networks

    We predicted Gene Ontology annotations for a set of GO biological process terms defined in (62). (A) Performance when predicting function for nonessential genes. Terms for which nonessential gene predictions were more successful when based on interactions with essential genes fall above the diagonal while nonessential gene predictions that were more accurate when based on interactions with other nonessential genes fall below the diagonal. Biological process terms that clearly fall in either area of the graph are labeled. Prediction performance was summarized for each GO term by computing the precision at 25% recall. The X-axis represents performance when only nonessential query mutant profiles were used to calculate array-array gene pair similarities. The Y-axis plots performance when only interactions from essential query mutants are used. (B) Performance when predicting function for essential genes. Terms for which predictions were more successful when based on interactions with other essential genes fall above the diagonal. Terms for which predictions were more successful when based on interactions with nonessential genes fall below the diagonal. (C) Complete precision-recall curves for the indicated terms are shown. The performance of all essential, an equal subset of nonessential queries, the complete set of nonessential queries, and all queries are shown. Predictions were made 50 times with different random allele samplings and error bars represent the standard error on precision over 50 iterations. For the complete dataset, all data (i.e. all alleles) were used, thus only a single set of results is shown. (D) Cumulative performance across all included GO terms of the K-nearest neighbor classifier is summarized for the various query mutant subsets used to calculate array similarities. The left panel shows results for nonessential genes while the right shows results when predicting function for essential genes. In general, we found that essential gene interaction profiles provided higher accuracy gene function predictions across a diverse set of bioprocesses when compared to nonessential gene interaction profiles. However, either the essential or nonessential similarity networks uniquely predicted certain functions. For example, interactions with nonessential genes were highly predictive of vacuolar transport and peroxisome functions, whereas interactions with essential genes were more informative for predicting chromosome segregation and RNA splicing related functions. Optimal functional prediction performance was achieved by using the global similarity network (black lines on graph). See Methods Section entitled, “Predicting gene function from essential versus nonessential genetic interaction profile similarity networks”.


    S06.pdf
  • Fig. S7. A genetic profile similarity-derived functional hierarchy

    (A) A gene hierarchy based on a subset of high-confidence genetic interaction profiles corresponding to ~1000 genes, including both essential and nonessential genes that a shared highly similar genetic interaction profile (PCC > 0.6) with at least one other gene (see section above, “A genetic profile similarity-derived functional hierarchy” for details). Clusters identified at different levels of the hierarchy represent functional relationships of different specificity as indicated by the colored lines on the dendogram. The colored heatmaps are colored to match the lines indicating their level of specificity on the hierarchy and the alternating shades of each color reflect different clusters (e.g. 10 clusters for the cell compartment level of the hierarchy). The grey-scale heatmap summarizes enrichment observed for the functional standards described in (B) and the Z-scores are calculated from histograms shown in (B). (B) To assess the concordance of hierarchical clusters with established functional data, we performed a hypergeometric enrichment test for each cluster identified at the same hierarchical level, using several functional standards including a protein localization standard based on automated image analysis of the yeast GFP collection (17, 119), GO biological process annotations (14), a protein complex standard (Data File S12) and a pathway standard, KEGG (120). We then measured the tendency of clusters with one or more annotation(s) in common to merge together at higher hierarchical levels, which correspond to weaker shared profile similarity. Finally, we randomized the parent assignments of each cluster and counted the number of sibling pairs sharing an annotation after each randomization. This randomization allows the distribution of clusters, their membership, their sizes, and the number of sibling pairs under each parent to remain fixed, while randomizing only the hierarchical relationships of the clusters themselves. Put differently, we simply shuffled the parent assignments across the children clusters. In any random instance, an existing sibling relationship can be preserved (with low probability) and each parent has the same number of children as it did before the shuffling, though their identities (along with their annotations) will have changed. We repeated the randomization process 1000 times to derive empirical distributions of expected annotation overlap with a randomized hierarchy (shown in gray). The observed number of siblings enriched for at least one common term derived from the real hierarchy is indicated with the black arrow along with the corresponding P-value determined empirically from the proportion of random iterations where the test statistic meets or exceed the observed value. Each column represents a different functional annotation standard and rows correspond to results of the test at varying depths in the hierarchy. Rows 1, 2, and 3, show scores for the children of hierarchy levels corresponding to cell compartment (PCC=0.05), bioprocesses (PCC=0.2), and protein pathways/complexes (PCC=0.4) thresholds, respectively. This analysis illustrates that the hierarchical structure suggested by genetic interaction profiles is supported by all tested functional standards at the deepest levels of the hierarchy, including the most specific standards (protein complexes and pathways). Larger clusters formed at intermediate levels of the hierarchy are most strongly supported by GO biological processes. Finally, the structure at the highest levels of the hierarchy (weaker but significant genetic interaction profile similarity) was supported by a protein localization standard corresponding to coherent cellular compartments. See Methods Section entitled, “A genetic profile similarity-derived functional hierarchy”


    S07.pdf
  • Fig. S8. IPA1 encodes a novel regulator of mRNA cleavage and polyadenylation

    (A) Processing extracts were prepared from wild type (WT) as well as ipa1-5001, cft2-1 and pcf11-2 TS mutants grown at 30°C or grown at 30°C and then shifted to 37oC for 2 hours where indicated. Assays were conducted for 20 min at 30°C. For the coupled cleavage-polyadenylation assays, extracts were incubated with ATP and 32P-labeled, full-length GAL7-1 RNA. For cleavage only assays, the reactions were performed as described above except that 3’-dATP was used instead of ATP. For poly(A) addition assays, the reactions were performed as above except that precleaved GAL7-9 RNA, ending at the poly(A) site, was used as a precursor. RNA products were resolved on a denaturing 5% polyacrylamide gel and visualized with a phosphorimager. The positions of substrate and products are depicted on the side of the images, and the lane marked “precursor” indicates the unreacted substrate for each reaction. (B) Distribution of identified poly(A) reads with respect to annotated transcription termination sites (69). The cumulative distribution is shown in the bottom panel. Significant differences between mutants and BY4741 were computed using the Wilcoxon rank-sum test for the 200 bp surrounding annotated transcription termination site. These results show that strains expressing mutant a mutant TS allele of IPA1 exhibit mRNA processing defects both in vitro and in vivo. See Methods section entitled, “IPA1 experimental validation”.


    S08.pdf
  • Fig. S9. The MTC pathway is important for amino acid permease localization

    (A) Triple mutant (TM) genetic interaction analysis of MTC pathway genes. Individual MTC pathway deletion mutants were mated to double mutant strains carrying mutations in redundant components of the aromatic amino acid biosynthesis pathway. Diploid strains (genotype indicated) were sporulated and 15-20 tetrads from each cross were dissected. Representative tetrads from each cross are shown below the table. The growth of haploid progeny carrying all three selectable markers is summarized (TM phenotype and comments column). (B) Representative fluorescence micrographs highlighting yeast cells with wild-type and aberrant Bap2-GFP localization in may24 and mtc2 mutants. Bap2-GFP and FM4-64 vacuolar membrane staining, as well as an image of the fluorescent signal overlay are shown. See Methods section entitled, “MTC pathway experimental validation”.


    S09.pdf
  • Fig. S10. Negative and positive genetic interaction degree distributions

    Relative genetic interaction (GI) degrees of all strains on the TSA and DMA were calculated by counting interactions that met the intermediate thresholds and dividing by the total number of query mutants screened against the relevant array. (A) Distributions of negative GI degree for TS array (TSA) mutants. (B) Distributions of positive GI degree for TS array (TSA) mutants. (C) Distributions of negative GI degree for deletion array (DMA) mutants. (D) Distributions of positive GI degree for deletion array (DMA) mutants.


    S10.pdf
  • Fig. S11 & S12. Correlation analysis of genetic interaction degree

    Negative (fig. S11) and positive (fig. S12) genetic interaction (GI) degrees were calculated for all strains on the DMA and TSA using the intermediate genetic interaction score threshold (genetic interaction score, |ε| ≥ 0.08, P < 0.05) and counting any interacting query strain. Wilcoxon rank-sum tests compared the GI degree in paired gene sets defined by absence and presence of each binary feature tested. (A) For uncorrected P-values meeting a P<0.05 threshold, the “Test result” column describes the degree of the set of genes for which the listed binary feature is true (compared to the set for which the feature is false). Tests were not performed, indicated by “N/A”, if data were present for fewer than 50 strains and strains with missing data were excluded from the tests. (B) Pearson’s correlation was used to measure associations between GI degree and features that are continuous or count-based. Error bars reflect the 95% confidence interval on the correlation coefficient. For all correlation analyses, only strains corresponding to essential genes on the TSA were included. Given that analysis of different features required using different statistical tests and some features are not expected to be independent of each other, no multiple hypotheses correction procedures were used. We do note that 29 gene features were tested. These analyses identify a set of physiological and evolutionary features associated with the frequency of negative and positive GIs for each gene, which can be exploited to predict genes that serve as highly connected genetic network hubs and thus general genetic modifiers in other organisms. The features examined in this analysis are described in the Methods section entitled “Genetic interaction degree and frequency analysis”. The same analysis was repeated based on negative and positive GI degree associated with nonessential and essential query mutant strains. The results were consistent with those shown here and the data are provided as Data File S10.


  • Fig. S13. Genetic interaction frequency across molecular functions

    Genetic interaction (GI) frequencies were evaluated for genes annotated to specific GO Molecular Function term, excluding any term with a size less than five genes. The GI frequency associated with each GO term was determined using pairs of array gene mutants annotated to that GO term and was calculated as the ratio of negative (blue) or positive (yellow) GIs to the total number of array gene pairs screened. (A) The GI frequency among nonessential genes annotated to the specified GO term. (B) The GI frequency among essential genes annotated to the specified GO term. One random allele was selected in instances where multiple mutant alleles were available for the same gene. The dotted lines represent background frequency of negative (blue) and positive (yellow) GIs in the nonessential and essential genetic networks, which were calculated by adding all interactions and screened pairs that were counted for individual GO terms, then dividing the sums. Gene assignments to GO Molecular Function terms were obtained from the S. cerevisiae-specific GO slim terms, which were downloaded from http://geneontology.org/page/go-slim-and-subset-guide in Jan 2013. Only interactions that met the intermediate threshold (genetic interaction score, |ε| > 0.08, P < 0.05) were considered. See Methods section entitled “Genetic interaction degree and frequency analysis”.


  • Fig. S14. Genetic interaction frequency across PFAM domains

    Genetic interaction (GI) frequencies were evaluated for genes encoding proteins consisting of a common PFAM-annotated domain were measured, excluding any domain that appeared in fewer than five proteins. (A) The GI frequency among nonessential genes annotated to the specified PFAM domain. (B) The GI frequency among essential genes annotated to the specified PFAM domain. The GI frequency associated with each PFAM domain determined using pairs of array gene mutants annotated to a specific PFAM domain and was calculated as the ratio of negative (blue) and positive (yellow) GIs to the total number of array gene pairs screened. One random allele was selected in instances where multiple mutant alleles were available for the same gene. The dotted lines represent background frequency of negative (blue) and positive (yellow) GIs in the nonessential and essential genetic networks, which were calculated by adding all interactions and screened pairs that were counted for individual GO terms, then dividing the sums. Gene assignments to PFAM groups were downloaded from http://pfam.xfam.org/proteome?taxId=559292 in May 2014. Only interactions that met the intermediate threshold (genetic interaction score, |ε| > 0.08, P < 0.05) were considered. See Methods section entitled “Genetic interaction degree and frequency analysis”.


  • Fig. S15. Functional evaluation of negative and positive interactions in the ExE, NxN and ExN genetic networks

    Genetic interactions (GIs) were evaluated for overlap with either Gene Ontology biological process co-annotations or protein-protein interactions using precision-recall analysis as described in (62). For all plots, negative and positive GIs were thresholded at the intermediate cutoff (genetic interaction score, |ε| ≥ 0.08, P < 0.05) and sorted by |ε|. The dashed line indicates the background rate of co-annotation for the relevant gene pairs. Precision-recall analysis was completed separately for positive (yellow lines) and negative (blue lines) GIs on different sets of gene pairs and different standards. (A) Essential-nonessential (ExN) gene pairs evaluated against a GO biological process standard. (B) Nonessential-nonessential (NxN) gene pairs evaluated against a protein-protein interaction standard. (C) Essential-essential (ExE) gene pairs evaluated against a protein-protein interaction standard. (D) Essential-nonessential gene pairs (ExN) evaluated against a protein-protein interaction standard. See Methods Section entitled “Comparison of SGA genetic interactions with other genomic datasets and precision-recall analysis”.


    S15.pdf
  • Fig. S16. Essential gene negative interactions are as informative as genetic interaction profile similarity for predicting functional relationships

    Genetic interactions (GIs) and GI profile similarities were evaluated for overlap with either Gene Ontology biological process co-annotations or protein-protein interactions using precision-recall analysis as described in (62). For all analysis of direct GIs, negative GIs were thresholded at the intermediate cutoff (genetic interaction score, |ε| ≥ 0.08, P < 0.05) and sorted by |ε|. Pearson correlation coefficients (PCCs) were calculated for array-array mutant pairs (matrix columns) and only a single allele of each gene (selected at random) was used. (A) Comparison of functional information associated with direct negative GIs (blue line) versus profile similarity (Pearson, orange line) for essential genes evaluated against a GO biological process co-annotation standard. (B) Comparison of functional information associated with direct negative GIs (blue line) versus GI profile similarity (Pearson, orange line) for essential genes evaluated against a protein-protein interaction standard. (C) Comparison of functional information associated with direct negative GIs (blue line) versus profile similarity (Pearson, orange line) for nonessential genes evaluated against a GO biological process co-annotation standard. (D) Comparison of functional information associated with direct negative GIs (blue line) versus GI profile similarity (Pearson, orange line) for nonessential genes evaluated against protein-protein interaction standard. See Methods Section entitled “Comparison of SGA genetic interactions with other genomic datasets and precision-recall analysis”.


    S16.pdf
  • Fig. S17. Positive interactions in the essential gene network (ExE) are shared among mutant alleles of the same gene and are coherent among members of the same protein complex

    (A) Jaccard similarity coefficient between different alleles of the same gene (left) and between alleles of different genes (right). The median value across different alleles is shown for each gene. This analysis shows that alleles of the same gene tend to share more positive genetic interactions (GIs) in common than random pairs of alleles of different essential genes. (B) The same analysis as shown in (A), but restricted to profile similarity between query mutants belonging to genes comprising “Nuclear-related” functional clusters defined in cluster in Fig. 5D. Profile similarity in this analysis was based on positive GIs with genes found in the “Cytosolic/Vesicle traffic-related” functional clusters also defined in Fig. 5D. (C) The reciprocal analysis of (B), where profile similarity of query mutants in the “cytosolic” functional clusters was evaluated based on positive GIs with genes in the “nuclear-related” functional cluster defined in Fig. 5D. P-values were calculated with a Wilcoxon rank-sum test. (D) Precision-recall curve for predicting protein complex membership based on similarity of positive GI profiles alone. Pairs of query mutants were sorted based on their Jaccard similarity coefficient calculated using binarized profiles of positive GIs. (E) The same analysis as in (D) but query mutants were restricted to those in the “Nuclear-related” functional cluster and GI profiles were based only on positive GIs involving genes belonging to the “Cytosolic/Vesicle traffic-related” functional cluster defined in fig. 5D. (F) The reciprocal analysis where query mutants were restricted to those in the “Cytosolic/Vesicle traffic-related” functional cluster and the GI profiles were based only on positive GIs involving genes in the “Nuclear-related” functional cluster defined in Fig. 5D. The dashed grey line shows the background precision expected from randomly selected gene pairs. A list of genes and alleles used in this analysis is provided in Data File S6. See Methods section entitled “Evaluating functional coherence of positive interactions”.


    S17.pdf
  • Fig. S18. Chemical genomic response of a ynl181w DAmP mutant across 92 diverse compounds compared to the average DAmP mutant strain response

    A pooled DAmP allele collection was screened against 92 diverse compounds. (A) Compared to the average score per compound (filled circles), a ynl181W DAmP allele mutant (open circle) exhibited a wide range of responses to different compounds from strong resistance to strong sensitivity. For example, the ynl181w mutant strain was resistant to cycloheximide (translation inhibitor) and microazole (ergosterol biosynthesis inhibitor) but highly sensitive to amphotericin (binds ergosterol) and micafungin (inhibits glucan synthesis). A list of the compounds screened and the corresponding Chemical Genetic Interaction (CGI) scores are provided in Data File S17. (B) Growth of a ynl181w-DAmP allele strain in liquid culture in the presence of the indicated compounds relative to an isogenic wild type strain grown in the same condition. The ynl181w mutant strain was sensitive to amphotericin B (binds ergosterol) and caspofungin (inhibits glucan synthesis) as predicted from the pooled experiment. The DAmP mutant showed greater resistance to poacic acid (binds glucan) and tunicamycin (inhibits glycosylation). Error bars indicate the standard error based on three biological replicates. See Methods section entitled “YNL181W chemical genetic screens”.


    S18.pdf
  • Fig. S19. Genetic and chemical-genetic interactions and profile similarity distinguishes between the 19S and 20S proteasome subunits and members of the same protein complex show similar patterns of genetic interactions in yeast cells.

    (A) Negative genetic interaction (GI) frequency in the essential (ExE) network for genes encoding members of the 19S and 20S proteasome subunits and negative GI frequency between 19S and 20S proteasome subunit genes. (B) The boxplot shows query gene profile similarity (PCC) between the query gene profiles of (i) the 19S proteasome subunit; (ii) the 20S proteasome subunit; (iii) the 19S and 20S proteasome subunits; (iv) the 19S subunit and all query genes not annotated to the proteasome; and (v) the 20S subunit and all query genes not annotated to the proteasome. Significance was assessed with Wilcoxon rank-sum tests. Results shown correspond to a representative randomization where a single query and allele were selected per gene for genes that have more than one allele. (C) Heterozygous deletion of yeast genes encoding 20S proteasome subunits showed increased sensitivity to the proteasome inhibitor, Boretzomib, compared to strains heterozygous for deletion of 19S proteasome genes. Yeast chemical-genetic interactions for Bortezomib were obtained from (30). Frequency distribution of the heterozygous chemical-genetic interaction profile scores (MADL) for Bortezomib at 200 nM for genes annotated to the 20S proteasome core particle (purple), 19S proteasome regulatory particle (orange) and all other heterozygous mutants (black). Heterozygous mutants in the 20S proteasome core particle show significantly more sensitivity to Bortezomib than the background (Wilcoxon rank-sum P-value < 10-10). Conversely, 19S proteasome regulatory particle gene mutants show significantly more resistance to Bortezomib than the background (Wilcoxon rank-sum P-value = 0.003).


    S19.pdf
  • Fig. S20. 26S proteasome knockdown sensitizes Drosophila S2 cells to Bortezomib.

    (A) Cell fitness (ATP content in cell population) upon knockdown of components of the 19S (blue) or 20S (purple) subunit of the 26S proteasome without Bortezomib treatment as a fraction of the non-targeting RNAi reagent (targeting Rluc) effect (dashed line). Each dot represents the median of 6 measurements. Both subunits were not separated by the knockdown effects of their components (P = 0.976, Wilcoxon rank sum test, two-sided). (B) Dose-response curves to Bortezomib upon knockdown of quality-filtered 26S proteasome components. The effects of each knockdown to Bortezomib were normalized to the control (0 nM) treatment. Box plots represent 36 measurements of the non-targeting RNAi reagent at each Bortezomib concentration. For each knockdown of a 26S proteasome component, the lines connect the median of 6 measurements per knockdown and Bortezomib concentration. Dashed lines connect the median absolute deviation of the 6 measurements per knockdown and Bortezomib concentration. Purple lines represent 20S, blue lines 19S components. (C) Distinct sensitivities of 19S and 20S subunit components upon Bortezomib treatment. Boxplots represent the median values (of 6 measurements) for all components of the 20S (purple) or 19S (blue) subunit of the 26S proteasome. Members of the 19S and 20S subunits were separated from the control RNAi measurement at Bortezomib concentrations from 2 nM to 8 nM (p < 0.05, Wilcoxon rank sum test, two-sided), and members of those two proteasome subunits were separated from each other between 2 nM and 16 nM (p < 0.05, Wilcoxon rank sum test, two-sided). See Methods section entitled “RNAi and cell fitness measurements in Drosophila cell culture”.


    S20.pdf
  • Fig. 21. Features associated with genes exhibiting negative interactions with the proteasome

    Fold enrichment in genes containing at least 1, 2, or 3 structural domains among the 10% of genes that show the highest number of negative (blue) and positive (yellow) interactions with proteasome encoding genes. A domain was only considered once in cases where a gene encoded multiple repeats of the same domains. The dotted line reflects the expected fraction of genes in the top 10% interactors with the proteasome and “*” marks significant differences from the background fraction. See Methods section entitled “Protein features associated with proteasome interacting genes”.


    S21.pdf
  • Fig. S22. Protein complexes enriched for positive interactions in the ExE network.

    (A) Distribution of protein complexes enriched (yellow) or depleted (grey) for positive genetic interactions (GIs) in the nonessential (NxN) network (see above, Methods section entitled “Analysis of protein complexes enriched for positive interactions” for more details and Data file S15). Protein complexes that met the chosen threshold (N_fold_pos > 1.5, Data File S15) were evaluated, using Fisher's exact test, and found to be enriched for proteostasis-related functions (118). (B) Distribution of the positive:negative interaction ratio for protein complexes enriched for positive interactions in (A). Genes belonging to complexes biased for positive interactions in the nonessential genetic network (N_fold_pos > 1X and posGI_bias_with_N > 1.5X, Data File S15) did not show significant functional enrichment for genes with cell cycle phenotypes. (C) Genes belonging to complexes biased for positive interactions in the ExE network are enriched for negative genetic interactions with cell cycle checkpoint-related genes (www.yeastgenome.org). P-values shown are based on a one-sided Fisher’s test.


    S22.pdf
  • Fig. S23. Genes and complexes involved in protein folding share positive interactions in common with the proteasome

    We applied an intermediate threshold on the genetic interaction score (genetic interaction score, |ε| ≥ 0.08, P < 0.05) and computed Jaccard similarity coefficients between TSA-derived positive genetic interaction (GI) profiles for query strains corresponding to members of the CCT, prefoldin and proteasome complexes. (A) The boxplot shows the Jaccard similarity coefficient of specific query mutants with the proteasome complex. The left box (Within) shows similarity for genes in the proteasome complex; the center box (CCT & prefoldin) shows similarity between genes of the CCT and prefoldin complexes with genes of the proteasome complex; the right box (Other) shows similarity of all other query mutants in the global network with genes in the proteasome complex. Similarity of query mutants for the same gene against the proteasome complex was averaged. (B) Similar to (A), but Jaccard similarities were calculated with respect to the CCT complex. (C) Similar to (A), but Jaccard similarities were calculated with respect to the prefoldin complex. (A-C) suggest that members of the CCT, the proteasome and the prefoldin complexes share positive GIs in common. (D) Precision-recall curve for predicting protein folding-related genes (118) using query mutant positive GI profile similarity (Jaccard) to the proteasome complex. This analysis shows that genes involved in protein folding related functions (118) tend to share positive GIs in common with the proteasome. Query genes annotated to the proteasome, CCT, and prefoldin complexes were excluded from this analysis. See Methods section entitled “Evaluating functional coherence of positive interactions”.


    S23.pdf
  • Fig. S24. Proteasome genes are positive interaction hubs in the essential TS genetic network and RNA decay genes are positive interaction hubs in the DAmP genetic interaction network

    (A) Enrichment analysis for positive and negative genetic interactions (GIs) with the proteasome among DAmP and TS query mutants. Fold enrichment was calculated as the frequency of GIs between DAmP or TS query mutant alleles with the proteasome divided by the frequency of GIs between DAmP and TS query mutant allele interactions with all mutants on the TS array (TSA). GIs between different members of the proteasome complex were excluded. GI frequencies were averaged across 100 randomizations where a single query and array allele per gene were randomly selected. The dashed line indicates the expected frequency of interactions. Fold enrichment and significance of the enrichment with respect to background are shown on top of each bar. (B) Similar to (A), but GI frequency was calculated against mRNA decay-annotated essential TS array genes.


    S24.pdf
  • Data File S1. Raw genetic interaction datasets: Pair-wise interaction format

    This folder contains complete SGA genetic interaction data for the following:

    • A list of all query and array mutant strains represented in the genetic interaction network along with their corresponding fitness estimates
    • Nonessential x Nonessential network (NxN)
    • Essential x Essential network (ExE)
    • Essential x Nonessential network (ExN)
    • Genetic interactions involving DAmP alleles of essential genes
    • Genetic interactions for HSP90 and corresponding controls

    The interaction datasets are provided in a tab-delimited format with 11 columns:

    • Query ORF
    • Query gene name
    • Array ORF
    • Array gene name
    • Array Type (DMA or TSA) and Temperature (26°C or 30°C)
    • Genetic interaction score (ε)
    • P-value
    • Query single mutant fitness (SMF)
    • Array SMF
    • Double mutant fitness
    • Double mutant fitness standard deviation

    Data File S1. Raw genetic interaction datasets: Pair-wise interaction format.zip - 430MB
  • Data File S2. Raw genetic interaction datasets: Matrix format

    This folder contains complete SGA genetic interaction data matrix for the following:

    • Nonessential x Nonessential network (NxN)
    • Essential x Essential network (ExE)
    • Combined essential and nonessential network (ExN)

    Data File S2. Raw genetic interaction datasets: Matrix format.zip - 23MB
  • Data File S3. Genetic interaction profile similarity matrices

    Matrix files containing genetic interaction profile similarity values (as measured by Pearson correlation) for every pair of mutant strains in the dataset. Similarity values were computed for essential (ExE), non-essential (NxN) and the global similarity network derived from a combined set of all genetic interactions (ExE, NxN, ExN) as described above (see "Constructing genetic interaction profile similarity networks"). Each matrix contains 2 sets of row and column headers, providing a unique allele name for every mutant strain (row & column header #1) as well as a systematic ORF name (row & column header #2).


    Data File S3. Genetic interaction profile similarity matrices.zip - 65MB
  • Data File S4. GO bioprocess functions predicted by the nonessential and essential similarity networks using a K-nearest neighbor approach.

    This file reports the performance of gene function prediction for non-essential or essential genes based on genetic interaction profiles. For both classes of genes (either nonessential or essential), the performance of a KNN classifier is reported as the Precision at 25% Recall based on interactions derived from TS queries (PR_TSQ) or nonessential deletion queries (PR25_SN). Although analyses were performed using complete genetic interaction profiles (e.g. negative and positive genetic interactions), similar prediction performance was obtained using genetic interaction profiles based on negative interactions alone.


    Data File S4. GO bioprocess functions predicted by the nonessential and essential similarity networks using a K-nearest neighbor approach.
  • Data File S5. SAFE analysis: Gene cluster identity and functional enrichment

    This file lists the results from SAFE analysis of the global genetic profile similarity network (Fig. 1 and Fig. 2). Functional terms enriched within specific network clusters associated with GO biological processes (14) and/or protein complexes (Data File S12). A list of genes comprising each bioprocess-enriched cluster shown on the global similarity network is also provided. Functional terms enriched within specific network clusters associated with cell compartments (17, 119) are all shown on Fig. 2B.


    Data File S5. SAFE analysis_Gene cluster identity and functional enrichments.xlsx
  • Data File S6. Genetic profile similarity-based hierarchy analysis

    The first tab (“Gene to hierarchy cluster mapping”) lists the clusters identified at each level of the genetic interaction-based hierarchy and the deletion and TS allele array mutants assigned to each cluster. Examples of clusters described in the main text are highlighted. The subsequent 9 tabs indicate enrichment of clusters resolved at the specified profile similarity range for specific cell compartments (Cyclops_enrich), biological processes (GO BP_enrich), protein complexes (complex_enrich) and KEGG pathways (KEGG_enrich). The final tab in the file indicates the clusters used to map the functional distribution of negative and positive interactions shown in Fig. 5D.


    Data File S6. Genetic profile similarity-based hierarchy analysis.xlsx
  • Data File S7. Pleiotropic gene analysis

    This file lists nonessential and essential query genes associated with high confidence pleiotropy scores based on their genetic interactions derived from the TSA (Essential derived pleiotropy) and DMA (Nonessential derived pleiotropy). The file also contains a second list of nonessential and essential query genes that participate in many genetic interactions but exhibited low pleiotropy scores indicating that these genes are more functionally specific.


    Data File S7. Pleiotropic gene analysis.xlsx
  • Data File S8. Mass spectrometric evidence for Ipa1 interactions

    This file lists proteins identified with high confidence as specific physical interactors with strains expressing Ipa1-GFP from its endogenous locus or Ipa1-HA from a galactose-inducible plasmid.


    Data File S8_Mass spectrometric evidence for Ipa1 interactions.xlsx
  • Data File S9. High and low interaction degree genes

    This file lists the negative and positive interaction degree associated with every nonessential deletion (sn#), essential TS (tsq#), and DAmP (damp#) query mutant strain screened against the DMA (“query degree X DMA” tab) and/or TSA (“query degree X TSA” tab). A subset of strains were found to carry a second, spontaneous suppressor mutation that affected fitness of the query mutant strain. Strains carrying a suppressor mutation mapped through SGA analysis are indicated (“-supp”). Query mutants comprising the 20% highest and lowest degree groups of strains are indicated. Furthermore, a “Co-batch signal” rank is provided for every query (see “Co-batch filtering of query mutant strains”). Low ranks correspond to evidence for lingering batch effects. Another column, “ Gene with correlated GI profiles that are co-annotated with the query gene (%)", provides the percent of correlated gene pairs that are co-annotated to the particular query. A low negative interaction degree (e.g. 20% lowest negative interaction degree) coupled with a low co-batch rank (e.g. < ~0.2) and a low fraction of correlated pairs that share a similar functional annotation with a given query strain (e.g. < ~0.15) may be indicative of a low confidence screen. However, these criteria should be considered as loose indicators and not definitive metrics of screen quality and thus, should not be used as strict filters on the global interaction dataset. Another list (“Queries removed - batch effects” tab) indicates ~300 query strains that exhibited severe systematic batch effects and thus were removed from the indicated data set. Finally, two additional tabs provide the negative and positive interaction degree associated with every nonessential (“nonessential array degree” tab) and essential (“essential array degree” tab) array mutant, respectively.


    Data File S9_High and low interaction degree genes.xls
  • Data File S10. Correlation analysis of query strain GI degree

    As a complement to analysis of array strains (fig. S11-S12), GI degrees were calculated for query strains by counting negative interactions (tab 1, interactions with DMA strains; tab 2, interactions with TSA strains) and by counting positive interactions (tab 3, interactions with DMA strains; tab 4, interactions with TSA strains). Essential and nonessential queries were analyzed separately and results are labeled by grouped column headers. Wilcoxon rank-sum tests compared the GI degree in paired gene sets defined by absence and presence of each binary feature tested (top table). If the P-value is significant (< 0.05), the “Test result” column describes the degree of the set of genes for which the listed binary feature is true (compared to the set for which the feature is false). Tests were not performed, indicated by “N/A”, if data were present for fewer than 50 strains; strains with missing data were excluded from the tests. Pearson’s correlation (column labeled “r”) was used to measure associations between GI degree and features that are continuous or counts (bottom table). Uncorrected P-values are shown. The features examined in this analysis are described above (see Methods section entitled, “ Genetic interaction degree and frequency analysis”). Given that analysis of different features required using different statistical tests and some features are not expected to be independent of each other, no multiple hypotheses correction procedures were used. We do note that 31 gene features were tested.


    Data File S10_Correlation analysis of query strain GI degree.xlsx
  • Data File S11. Nonessential and essential GI hub functional enrichment analysis

    This file lists GO biological process, molecular function and cellular component terms that are enriched among of 10% of array strains with the most negative and 10% of array strains with the most positive interactions identified in the ExE network. Enrichments are also included for the 5% of array strains with the most negative interactions and 5% of array strains with the most positive interactions in the NxN genetic interaction network.


    Data File S11_Nonessential and essential GI hub functional enrichment analysis.xlsx
  • Data File S12. Protein complex standard

    This file provides a list of protein complexes compiled from two sources: Baryshnikova 2010 (5) and Benschop 2010 (117).


    Data File S12_Protein complex standard.xlsx
  • Data File S13. Genetic interaction enrichment among protein complexes.

    This file lists all possible pairs of protein complexes tested in the ExE, NxN and ExN networks. Enrichment for negative and positive interactions between genes in the same complex or between genes in different complexes is indicated. In addition, enrichment for genetic interaction in general (Combined interaction enrichment) regardless of the type is also indicated. Finally, Interaction Sign Bias indicates the distribution of interactions between genes within the same complex or between genes in different complexes. The interaction sign bias is computed as the mean over all interactions for within a given complex or between a pair of complexes. For example, an interaction sign bias of -1 indicates that all interactions identified between a set of genes encoding complexes members are negative, whereas a score of 1 indicates that only positive interactions were identified between a particular set of protein complex encoding genes. Rows highlighted in blue indicated complex-complex pairs enriched for negative interactions where greater than 75% of all interactions detected were negative. Rows highlighted in yellow indicated complex-complex pairs enriched for negative interactions where greater than 75% of all interactions detected were positive. The analysis shown in Fig. 7 is based on subset of complexes composed of 75% essential genes (i.e. considered essential complexes) or 75% nonessential genes (i.e. considered nonessential complexes). The complexes used for this analysis and their enrichment results are listed in the tabs labeled, “_filtered”. The tabs named “all” list within and between complex enrichment for all protein complexes without prior filtering of complexes composed of less than 75% essential or 75% nonessential genes.


    Data File S13_Genetic interaction enrichment among protein complexes.xlsx
  • Data File S14. D. melanogaster S2 cell fitness

    This file provides D. melanogaster S2 cell fitness upon RNAi-mediated 26S proteasome depletion and Bortezomib treatment.


    Data File S14_D. melanogaster S2 cell fitness.xlsx
  • Data File S15. Protein complex interaction enrichment bias

    This file indicates fold enrichment and biases in positive vs. negative interaction frequency for protein complexes and is described in detail above (see “Analysis of protein complexes exhibiting a positive interaction enrichment bias”). Rows highlighted in yellow indicate protein complexes that show > 1.5X enrichment for positive interactions (“E_fold_pos”) stronger enrichment for positive versus negative interactions when screened against the essential TSA. The file consists of the following columns:

    1. Protein complex name
    2. Number of complex member-encoding query genes screened against the DMA (“queries_vs_DMA”).
    3. Number of complex member-encoding query genes screened against the TSA (“queries_vs_TSA”).
    4. Nonessential-negative GI fold enrichment (“N_fold_neg”): negative interaction fold enrichment for a complex of interest with nonessential genes not in the complex.
    5. Essential-negative GI fold enrichment (“E_fold_neg”): negative interaction fold enrichment for a complex of interest with essential genes not in the complex.
    6. Nonessential-positive GI fold enrichment (“N_fold_pos”): positive interaction fold enrichment for a complex of interest with nonessential genes not in the complex.
    7. Essential-positive GI fold enrichment (“E_fold_pos”): positive interaction fold enrichment for a complex of interest with essential genes not in the complex. Complexes with a positive GI enrichment > 1.5X are highlighted in yellow. These values were used to generate Fig. 8C.
    8. Positive GI bias with essential genes (“posGI_bias_with_E”): the relative positive:negative enrichment ratio of essential to nonessential genes for the complex of interest (calculated as [D/E]/[F/G]). Complexes with a positive GI enrichment > 1 and a positive GI bias > 1.5 are highlighted in yellow. These values were used to generate Fig. 8D.
    9. Positive GI bias with nonessential genes (posGI_bias_with_N”): the relative positive:negative enrichment ratio of nonessential to essential genes for the complex of interest (calculated as [F/G]/[D/E]).


    Data File S15_Protein complex interaction enrichment and bias.xlsx
  • Data File S16. Genetic suppression analysis

    This file includes raw data from spot dilution growth assays to identify positive interactions that can be classified as genetic suppression. The suppression score is based on visual assessment of double mutant strain growth relative to a wild type and single mutant control strains. The score reflects strength of suppression with a score of 4 indicative of a strong suppression interaction where double mutant growth exceeded growth of the sickest single mutant and a score of 0 indicates failure to confirm a suppression interaction.


    Data File S16_Genetic suppression analysis.xlsx
  • Data file S17. YNL181W chemical genetics data

    Relative growth of a YNL181W-DAmP strain (CG score) measured in the presence of 92 different compounds.


    Data File S17_YNL181W chemical genetics data.xlsx
  1. M. Costanzo et al., The genetic landscape of a cell. Science 327, 425 (2010).
  2. J. L. t. Hartman, B. Garvik, L. Hartwell, Principles for the buffering of genetic variation. Science 291, 1001 (2001).
  3. O. Zuk, E. Hechter, S. R. Sunyaev, E. S. Lander, The mystery of missing heritability: Genetic interactions create phantom heritability. Proc Natl Acad Sci U S A 109, 1193 (2012).
  4. J. S. Bloom et al., Genetic interactions contribute less than additive effects to quantitative trait variation in yeast. Nat Commun 6, 8712 (2015).
  5. A. Baryshnikova et al., Quantitative analysis of fitness and genetic interactions in yeast on a genome scale. Nat Methods 7, 1017 (2010).
  6. T. Roemer, C. Boone, Systems-level antimicrobial drug and drug synergy discovery. Nat Chem Biol 9, 222 (2013).
  7. H. E. Bryant et al., Specific killing of BRCA2-deficient tumours with inhibitors of poly(ADP-ribose) polymerase. Nature 434, 913 (2005).
  8. Materials and Methods are available as supporting online material.
  9. E. A. Winzeler et al., Functional characterization of the S. cerevisiae genome by gene deletion and parallel analysis. Science 285, 901 (1999).
  10. G. Giaever et al., Functional profiling of the Saccharomyces cerevisiae genome. Nature 418, 387 (2002).
  11. M. Kofoed et al., An Updated Collection of Sequence Barcoded Temperature-Sensitive Alleles of Yeast Essential Genes. G3 (Bethesda) 5, 1879 (2015).
  12. Z. Li et al., Systematic exploration of essential yeast gene function with temperature-sensitive mutants. Nature biotechnology 29, 361 (2011).
  13. M. Schuldiner et al., Exploration of the function and organization of the yeast early secretory pathway through an epistatic miniarray profile. Cell 123, 507 (2005).
  14. M. Ashburner et al., Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet 25, 25 (2000).
  15. A. Baryshnikova, Systematic Functional Annotation and Visualization of Biological Networks. Cell Syst, (2016).
  16. J. Dutkowski et al., A gene ontology inferred from molecular networks. Nat Biotechnol 31, 38 (2013).
  17. J. L. Koh et al., CYCLoPs: A Comprehensive Database Constructed from Automated Analysis of Protein Abundance and Subcellular Localization Patterns in Saccharomyces cerevisiae. G3 (Bethesda) 5, 1223 (2015).
  18. T. A. Sangster, S. Lindquist, C. Queitsch, Under cover: causes, effects and implications of Hsp90-mediated genetic capacitance. BioEssays : news and reviews in molecular, cellular and developmental biology 26, 348 (2004).
  19. S. Chan, E. A. Choi, Y. Shi, Pre-mRNA 3'-end processing complex assembly and function. Wiley interdisciplinary reviews. RNA 2, 321 (2011).
  20. J. M. Gordon et al., Reconstitution of CF IA from overexpressed subunits reveals stoichiometry and provides insights into molecular topology. Biochemistry 50, 10203 (2011).
  21. F. Abe, H. Minegishi, Global screening of genes essential for growth in high-pressure and cold environments: searching for basic adaptive strategies using a yeast deletion library. Genetics 178, 851 (2008).
  22. S. G. Addinall et al., A genomewide suppressor and enhancer analysis of cdc13-1 reveals varied cellular processes influencing telomere capping in Saccharomyces cerevisiae. Genetics 180, 2251 (2008).
  23. A. Greenall et al., A genome wide analysis of the response to uncapped telomeres in budding yeast reveals a novel role for the NAD+ biosynthetic gene BNA2 in chromosome end protection. Genome Biol 9, R146 (2008).
  24. S. Mnaimneh et al., Exploration of essential gene functions via titratable promoter alleles. Cell 118, 31 (2004).
  25. E. N. Koch et al., Conserved rules govern genetic interaction degree across species. Genome biology 13, R57 (2012).
  26. S. Lahiri et al., A conserved endoplasmic reticulum membrane protein complex (EMC) facilitates phospholipid transfer from the ER to mitochondria. PLoS Biol 12, e1001969 (2014).
  27. R. Sanchez, A. Sali, Large-scale protein structure modeling of the Saccharomyces cerevisiae genome. Proc Natl Acad Sci U S A 95, 13597 (1998).
  28. D. Segre, A. Deluna, G. M. Church, R. Kishony, Modular epistasis in yeast metabolism. Nat Genet 37, 77 (2005).
  29. S. Bandyopadhyay, R. Kelley, N. J. Krogan, T. Ideker, Functional maps of protein complexes from quantitative genetic interaction data. PLoS Comput Biol 4, e1000065 (2008).
  30. D. Hoepfner et al., High-resolution chemical dissection of a model eukaryote reveals targets, pathways and gene functions. Microbiol Res 169, 107 (2014).
  31. S. P. Bell, The origin recognition complex: from simple origins to complex functions. Genes Dev 16, 659 (2002).
  32. M. Lei, B. K. Tye, Initiating DNA synthesis: from recruiting to activating the MCM complex. J Cell Sci 114, 1447 (2001).
  33. S. A. MacNeill, Structure and function of the GINS complex, a key component of the eukaryotic replisome. Biochem J 425, 489 (2010).
  34. J. M. Peters, The anaphase promoting complex/cyclosome: a machine designed to destroy. Nat Rev Mol Cell Biol 7, 644 (2006).
  35. F. Khosrow-Khavar et al., The yeast ubr1 ubiquitin ligase participates in a prominent pathway that targets cytosolic thermosensitive mutants for degradation. G3 (Bethesda) 2, 619 (2012).
  36. R. Li, A. W. Murray, Feedback control of mitosis in budding yeast. Cell 66, 519 (1991).
  37. S. Bandyopadhyay et al., Rewiring of genetic networks in response to DNA damage. Science 330, 1385 (2010).
  38. C. A. Hutchison, 3rd et al., Design and synthesis of a minimal bacterial genome. Science 351, aad6253 (2016).
  39. R. D. Dowell et al., Genotype to phenotype: a complex problem. Science 328, 469 (2010).
  40. B. VanderSluis et al., Genetic interactions reveal the evolutionary trajectories of duplicate genes. Molecular Systems Biology 6, 429 (2010).
  41. D. Altshuler, M. J. Daly, E. S. Lander, Genetic mapping in human disease. Science 322, 881 (2008).
  42. B. Lehner, C. Crombie, J. Tischler, A. Fortunato, A. G. Fraser, Systematic mapping of genetic interactions in Caenorhabditis elegans identifies common modifiers of diverse signaling pathways. Nat Genet 38, 896 (2006).
  43. A. B. Byrne et al., A global analysis of genetic interactions in Caenorhabditis elegans. J Biol 6, 8 (2007).
  44. B. Fischer et al., A map of directional genetic interactions in a metazoan cell. Elife 4, (2015).
  45. R. Blekhman et al., Host genetic variation impacts microbiome composition across human body sites. Genome Biol 16, 191 (2015).
  46. C. M. Buchovecky et al., A suppressor screen in Mecp2 mutant mice implicates cholesterol metabolism in Rett syndrome. Nature genetics 45, 1013 (2013).
  47. K. M. Keeling, X. Xue, G. Gunn, D. M. Bedwell, Therapeutics based on stop codon readthrough. Annu Rev Genomics Hum Genet 15, 371 (2014).
  48. P. Tsvetkov et al., Compromising the 19S proteasome complex protects cells from reduced flux through the proteasome. Elife 4, (2015).
  49. S. J. Dixon et al., Significant conservation of synthetic lethal genetic interaction networks between distantly related eukaryotes. Proc Natl Acad Sci U S A 105, 16653 (2008).
  50. A. Roguev et al., Conservation and rewiring of functional modules revealed by an epistasis map in fission yeast. Science 322, 405 (2008).
  51. R. Deshpande et al., A comparative genomic approach for identifying synthetic lethal interactions in human cancer. Cancer research 73, 6128 (2013).
  52. K. J. McManus, I. J. Barrett, Y. Nouhi, P. Hieter, Specific synthetic lethal killing of RAD54B-deficient human colorectal cancer cells by FEN1 silencing. Proc Natl Acad Sci U S A 106, 3276 (2009).
  53. D. M. van Pel et al., An evolutionarily conserved synthetic lethal interaction network identifies FEN1 as a broad-spectrum target for anticancer therapeutic development. PLoS Genet 9, e1003254 (2013).
  54. L. H. Hartwell, P. Szankasi, C. J. Roberts, A. W. Murray, S. H. Friend, Integrating genetic approaches into the discovery of anticancer drugs. Science 278, 1064 (1997).
  55. T. Hart et al., High-Resolution CRISPR Screens Reveal Fitness Genes and Genotype-Specific Cancer Liabilities. Cell 163, 1515 (2015).
  56. T. Wang et al., Identification and characterization of essential genes in the human genome. Science 350, 1096 (2015).
  57. V. A. Blomen et al., Gene essentiality and synthetic lethality in haploid human cells. Science 350, 1092 (2015).
  58. S. Ben-Aroya et al., Toward a comprehensive temperature-sensitive mutant repository of the essential genes of Saccharomyces cerevisiae. Mol Cell 30, 248 (2008).
  59. E. Kuzmin et al., Synthetic genetic array analysis for global mapping of genetic networks in yeast. Methods Mol Biol 1205, 143 (2014).
  60. R. Deshpande, B. Vandersluis, C. L. Myers, Comparison of profile similarity measures for genetic interaction networks. PLoS One 8, e68664 (2013).
  61. P. Shannon et al., Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 13, 2498 (2003).
  62. C. L. Myers, D. R. Barrett, M. A. Hibbs, C. Huttenhower, O. G. Troyanskaya, Finding function: evaluation methods for functional genomic data. BMC Genomics 7, 187 (2006).
  63. J. Bellay et al., Putting genetic interactions in context through a global modular decomposition. Genome Research 21, 1375 (2011).
  64. R. Zhao et al., Navigating the chaperone network: an integrative map of physical and genetic interactions mediated by the hsp90 chaperone. Cell 120, 715 (2005).
  65. A. Subramanian et al., Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 102, 15545 (2005).
  66. J. Zhao, M. Kessler, S. Helmling, J. P. O'Connor, C. Moore, Pta1, a component of yeast CF II, is required for both cleavage and poly(A) addition of mRNA precursor. Mol Cell Biol 19, 7733 (1999).
  67. S. Wilkening et al., An efficient method for genome-wide polyadenylation site mapping and RNA quantification. Nucleic Acids Res 41, e65 (2013).
  68. T. D. Wu, S. Nacu, Fast and SNP-tolerant detection of complex variants and splicing in short reads. Bioinformatics 26, 873 (2010).
  69. Z. Xu et al., Bidirectional promoters generate pervasive transcription in yeast. Nature 457, 1033 (2009).
  70. M. I. Love, W. Huber, S. Anders, Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol 15, 550 (2014).
  71. W. K. Huh et al., Global analysis of protein localization in budding yeast. Nature 425, 686 (2003).
  72. M. Taipale et al., A quantitative chaperone interaction network reveals the architecture of cellular protein homeostasis pathways. Cell 158, 434 (2014).
  73. D. Kessner, M. Chambers, R. Burke, D. Agus, P. Mallick, ProteoWizard: open source software for rapid proteomics tools development. Bioinformatics 24, 2534 (2008).
  74. D. N. Perkins, D. J. Pappin, D. M. Creasy, J. S. Cottrell, Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis 20, 3551 (1999).
  75. J. K. Eng, T. A. Jahan, M. R. Hoopmann, Comet: an open-source MS/MS sequence database search tool. Proteomics 13, 22 (2013).
  76. D. Shteynberg et al., iProphet: multi-level integrative analysis of shotgun proteomic data improves peptide and protein identification rates and error estimates. Mol Cell Proteomics 10, M111 007690 (2011).
  77. G. Teo et al., SAINTexpress: improvements and additional features in Significance Analysis of INTeractome software. J Proteomics 100, 37 (2014).
  78. D. Mellacheruvu et al., The CRAPome: a contaminant repository for affinity purification-mass spectrometry data. Nat Methods 10, 730 (2013).
  79. D. M. Gelperin et al., Biochemical and genetic analysis of the yeast proteome with a movable ORF collection. Genes and Development 19, 2816 (2005).
  80. T. Srikumar, M. C. Lewicki, B. Raught, A global S. cerevisiae small ubiquitin-related modifier (SUMO) system interactome. Mol Syst Biol 9, 668 (2013).
  81. R. Craig, R. C. Beavis, TANDEM: matching proteins with tandem mass spectra. Bioinformatics 20, 1466 (2004).
  82. T. A. Vida, S. D. Emr, A new vital stain for visualizing vacuolar membrane dynamics and endocytosis in yeast. J Cell Biol 128, 779 (1995).
  83. D. A. Saenz, M. S. Chianelli, C. A. Stella, L-Phenylalanine Transport in Saccharomyces cerevisiae: Participation of GAP1, BAP2, and AGP1. J Amino Acids 2014, 283962 (2014).
  84. P. A. Gibney, C. Lu, A. A. Caudy, D. C. Hess, D. Botstein, Yeast metabolic and signaling genes are required for heat-shock survival and have little overlap with the heat-induced genes. Proc Natl Acad Sci U S A 110, E4393 (2013).
  85. A. P. Rosebrock, A. A. Caudy, "Metabolomic Analysis of Budding Yeast". In Budding Yeast: A Laboratory Manual. B. Andrews, C. Boone, T. Davis, S. Fields, eds. (Cold Spring Harbor Press, 2016), pp. 599-620.
  86. L. C. Wan et al., Reconstitution and characterization of eukaryotic N6-threonylcarbamoylation of tRNA using a minimal enzyme system. Nucleic Acids Res 41, 6332 (2013).
  87. G. Ostlund et al., InParanoid 7: new algorithms and tools for eukaryotic orthology analysis. Nucleic Acids Res 38, D196 (2010).
  88. I. Wapinski, A. Pfeffer, N. Friedman, A. Regev, Natural history and evolutionary principles of gene duplication in fungi. Nature 449, 54 (2007).
  89. P. Rice, I. Longden, A. Bleasby, EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet 16, 276 (2000).
  90. M. E. Hillenmeyer et al., The chemical genomic portrait of yeast: uncovering a phenotype for all genes. Science 320, 362 (2008).
  91. C. Huttenhower, M. Hibbs, C. Myers, O. G. Troyanskaya, A scalable method for integration and functional analysis of multiple microarray datasets. Bioinformatics 22, 2890 (2006).
  92. G. Liti et al., Population genomics of domestic and wild yeasts. Nature 458, 337 (2009).
  93. R. Jelier, J. I. Semple, R. Garcia-Verdugo, B. Lehner, Predicting phenotypic variation in yeast from individual genome sequences. Nat Genet 43, 1270 (2011).
  94. Z. Yang, PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol 24, 1586 (2007).
  95. R. C. Edgar, MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 5, 113 (2004).
  96. F. Holstege et al., Dissecting the Regulatory Circuitry of a Eukaryotic Genome. Cell 95, 717 (1998).
  97. A. P. Gasch et al., Genomic expression programs in the response of yeast cells to environmental changes. Mol Biol Cell 11, 4241 (2000).
  98. R. B. Brem, J. D. Storey, J. Whittle, L. Kruglyak, Genetic interactions between polymorphisms that affect gene expression in yeast. Nature 436, 701 (2005).
  99. D. A. Skelly et al., Integrative phenomics reveals insight into the structure of phenotypic diversity in budding yeast. Genome Res 23, 1496 (2013).
  100. K. P. Byrne, K. H. Wolfe, The Yeast Gene Order Browser: combining curated homology and syntenic context reveals gene fate in polyploid species. Genome Research 15, 1456 (2005).
  101. Z. Gu, A. Cavalcanti, F. C. Chen, P. Bouman, W. H. Li, Extent of gene duplication in the genomes of Drosophila, nematode, and yeast. Mol Biol Evol 19, 256 (2002).
  102. B. Rost, Twilight zone of protein sequence alignments. Protein Eng 12, 85 (1999).
  103. S. F. Levy, M. L. Siegal, Network hubs buffer environmental variation in Saccharomyces cerevisiae. PLoS Biol 6, e264 (2008).
  104. A. C. Gavin et al., Proteome survey reveals modularity of the yeast cell machinery. Nature 440, 631 (2006).
  105. N. J. Krogan et al., Global landscape of protein complexes in the yeast Saccharomyces cerevisiae. Nature 440, 637 (2006).
  106. H. Yu et al., High-quality binary protein interaction map of the yeast interactome network. Science 322, 104 (2008).
  107. J. R. Newman et al., Single-cell proteomic analysis of S. cerevisiae reveals the architecture of biological noise. Nature 441, 840 (2006).
  108. J. J. Ward, L. J. McGuffin, K. Bryson, B. F. Buxton, D. T. Jones, The DISOPRED server for the prediction of protein disorder. Bioinformatics 20, 2138 (2004).
  109. C. Stark et al., The BioGRID Interaction Database: 2011 update. Nucleic Acids Res 39, D698 (2011).
  110. K. Tarassov et al., An in vivo map of the yeast protein interactome. Science 320, 1465 (2008).
  111. M. Babu et al., Interaction landscape of membrane-protein complexes in Saccharomyces cerevisiae. Nature 489, 585 (2012).
  112. J. S. Piotrowski et al., Plant-derived antifungal agent poacic acid targets beta-1,3-glucan. Proc Natl Acad Sci U S A 112, E1490 (2015).
  113. N. Kato, S. Takahashi, T. Nogawa, T. Saito, H. Osada, Construction of a microbial natural product library for chemical biology studies. Curr Opin Chem Biol 16, 101 (2012).
  114. J. S. Piotrowski et al., Chemical genomic profiling via barcode sequencing to predict compound mode of action. Methods Mol Biol 1263, 299 (2015).
  115. S. Y. Fung et al., Unbiased screening of marine sponge extracts for anti-inflammatory agents combined with chemical genomics identifies girolline as an inhibitor of protein synthesis. ACS Chem Biol 9, 247 (2014).
  116. Y. Bukhman et al., Modeling Microbial growth with GCAT. Bioenergy Research, 8, 1022-1030 (2015).
  117. J. J. Benschop et al., A consensus of core protein complex compositions for Saccharomyces cerevisiae. Mol Cell 38, 916 (2010).
  118. D. Finley, H. D. Ulrich, T. Sommer, P. Kaiser, The ubiquitin-proteasome system of Saccharomyces cerevisiae. Genetics 192, 319 (2012).
  119. Y. T. Chong et al., Yeast Proteome Dynamics from Single Cell Imaging and Automated Analysis. Cell 161, 1413 (2015).
  120. M. Kanehisa et al., Data, information, knowledge and principle: back to metabolism in KEGG. Nucleic Acids Res 42, D199 (2014).