Nonlinear expression patterns and multiple shifts in gene network interactions underlie robust phenotypic change in Drosophila melanogaster selected for night sleep duration

Caetano Souto-Maior; Yazmin L. Serrano Negron; Susan T. Harbison

doi:10.1371/journal.pcbi.1011389

Abstract

All but the simplest phenotypes are believed to result from interactions between two or more genes forming complex networks of gene regulation. Sleep is a complex trait known to depend on the system of feedback loops of the circadian clock, and on many other genes; however, the main components regulating the phenotype and how they interact remain an unsolved puzzle. Genomic and transcriptomic data may well provide part of the answer, but a full account requires a suitable quantitative framework. Here we conducted an artificial selection experiment for sleep duration with RNA-seq data acquired each generation. The phenotypic results are robust across replicates and previous experiments, and the transcription data provides a high-resolution, time-course data set for the evolution of sleep-related gene expression. In addition to a Hierarchical Generalized Linear Model analysis of differential expression that accounts for experimental replicates we develop a flexible Gaussian Process model that estimates interactions between genes. 145 gene pairs are found to have interactions that are different from controls. Our method appears to be not only more specific than standard correlation metrics but also more sensitive, finding correlations not significant by other methods. Statistical predictions were compared to experimental data from public databases on gene interactions. Mutations of candidate genes implicated by our results affected night sleep, and gene expression profiles largely met predicted gene-gene interactions.

Author summary

Understanding the molecular bases of phenotypes remains a challenge of complex trait biology. We used a combination of selective breeding, RNA-Seq, and Gaussian Process modeling to determine whether de novo gene expression networks could be derived for sleep duration in Drosophila. We bred flies with long and short sleep times, and sequenced RNA from the flies at each generation of selection. Using a hierarchical Bayesian Generalized Linear Model, we identified genes with altered expression across generation in the selected populations. Gene expression trajectories were largely non-linear across time, however, so we developed a Gaussian Process method to more accurately model the data. The Gaussian Process provides an adaptable framework that adjusts to the complexity of gene expression patterns we observed, eliminating the need to specify or assume a specific polynomial model. The Gaussian Process also enabled us to compute covariances among pairs of genes, elucidating gene expression networks for sleep duration. Follow-up mutational analyses validated the candidate genes’ effects on sleep duration and transcriptional analyses of the mutations largely confirmed gene expression network predictions. The Gaussian Process framework is broadly applicable to gene expression data collected across time.

Figures

Citation: Souto-Maior C, Serrano Negron YL, Harbison ST (2023) Nonlinear expression patterns and multiple shifts in gene network interactions underlie robust phenotypic change in Drosophila melanogaster selected for night sleep duration. PLoS Comput Biol 19(8): e1011389. https://doi.org/10.1371/journal.pcbi.1011389

Editor: Attila Csikász-Nagy, Pázmány Péter Catholic University: Pazmany Peter Katolikus Egyetem, HUNGARY

Received: February 15, 2023; Accepted: July 25, 2023; Published: August 10, 2023

This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.

Data Availability: Night sleep phenotypes per selection scheme/sex/generation/population replicate are listed in S15 Table. All scripts used for the model have been deposited in Git Hub https://github.com/caesoma/Multiple-shifts-in-gene-network-interactions-shape-phenotypes-of-Drosophila-melanogaster. All RNA-Seq data from this study are available from the National Center for Biotechnology Information (NCBI) Gene Expression Omnibus (GEO) under the accession number GSE202600.

Funding: This research was supported by the Intramural Research Program of the National Institutes of Health, the National Heart Lung and Blood Institute, ZIA HL006146 to STH. CSM, YLSN, and STH received salary support from the NIH funding listed above. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors declare that they have no conflict of interest.

Introduction

Despite the plethora of modern and increasingly refined molecular biology assays—from DNA to metabolites and beyond—systematically uncovering the molecular bases of phenotypes remains one of the thorniest challenges in biology. “Omics” approaches allow whole genome, transcriptome, proteome, and other “omes” to be generated and candidate genes to be fished out of these high dimensional data, but understanding how these biomolecules interact even in the simplest pathways requires painstaking follow-on experimentation, construction of databases, and an immense collective effort to make connections from disjointed assays into a coherent model. Despite the large amount of studies and data generated for many systems, a full understanding of underlying processes has not yet been achieved; this is clear indication that better methods are needed to obtain the understanding of biological processes from data. For complex traits the task is even more difficult. Sleep is a complex phenotype the evolution of which remains a classic mystery in biology. Although sleep and sleep-like behavior is conserved among species, its main purpose is not completely understood, and hypotheses for its purpose span functions like conservation of resources [1–3], pruning of synapses and memory formation [4–7], and management of metabolite and waste products [8, 9]. It is plausible that sleep is a manifestation of multiple functions, and that it involves the activity of many genes to regulate a complex higher-level function; indeed many genes have been implicated in sleep [10–20]. Assuming anything but the simplest possible model would therefore require a description that accounts for this complexity in the interactions of genes and gene products.

Artificial selection plus sequencing/resequencing is a powerful approach for identifying heritable variation in phenotypes and their underlying molecular bases [21], typically assaying DNA or RNA expression in the initial and evolved populations and comparing them to controls [22, 23]. Coupling selection with gene expression identified candidate genes for diurnal preference [24], olfactory behavior [25, 26], food consumption [27], mating behavior [28], resistance to parasitism [29], environmental stressors [30, 31], ethanol tolerance [32], and aggressive behavior [33]. Caveats of that method include often not having molecular data on the intermediate generations, and relying on traditional statistical methods to assess the significance of polymorphic variants. In the case of gene expression, RNA levels are often modeled for each gene individually using linear models, without further consideration of the processes involved or interactions between genes. Inferring interaction between genes (as opposed to individual changes) requires observations of how the genes covary in time. Correlation [34, 35] or information theory-based methods (and others, reviewed in [36–38]) could be applied to estimate the relationship between the genes when that information is present, but neither is time course data usually available, nor are these methods standard in artificial selection experiments.

Recent work applies Gaussian Process models [39, 40] to data sampled over time in order to evaluate dynamic parameters. Where gene and protein expression dynamics can be modeled with differential equations, Gaussian Process methods estimate parameters for non-linear systems [41], elucidate the spatial-temporal dynamics of developmental morphogen gradients [42], model signaling and gene regulatory networks [43, 44] infer latent transcription factor activity [45], and find transcription factor targets [46]. Other applications of Gaussian Processes account for missing or irregularly sampled gene expression data [47], model spatial interactions between cells [48] and generate clusters [47, 49]. Gaussian Processes can also infer relationships among multiple disparate data types [50, 51] to explore, for example latent processes underlying spatial-temporal relationships among brain structure, brain activity metrics, and behavior [52], or relationships among multi-modal spatially- and temporally-varying data [53]. Gaussian Processes thus provide a flexible framework for the estimation of latent relationships.

In this work we have artificially selected Drosophila melanogaster for increased or decreased night sleep duration and sequenced the mRNA of the flies from each generation of selection. The selection procedure produced both long- and short-sleeping fly populations significantly deviant from unselected controls. The RNA sequence data, which consisted of expression levels as a function of time (measured in generations), was analyzed using a Multi-Channel Gaussian Process [50, 51] where each gene is described by one of these “channels”, and their relationships are estimated by an underlying covariance structure in the model. We describe the expression of 85 genes that had significant changes in the artificial selection long or short schemes along generation common to both males and females. We used this model to infer the magnitude of all 3,570 possible pairwise interactions between all possible pairs of genes. Results from this analysis and comparison to unselected controls suggest that multiple shifts in interactions underlie the increase and decrease of night sleep duration, with 145 interactions not being observed in the controls. Further experiments revealed candidate genes that impact night sleep and confirm these interactions.

Materials and methods

Construction of outbred population

We constructed an outbred population of flies using ten lines from the Drosophila Genetic Reference Panel (DGRP) [54, 55] with extreme night sleep phenotypes [11]. Five lines had the shortest average night sleep for both males and females combined in the population: DGRP_38, DGRP_310, DGRP_365, DGRP_808, and DGRP_832. The other five lines had the longest average night sleep in the population: DGRP_235, DGRP_313, DGRP_335, DGRP_338, and DGRP_379. The ten lines were crossed in a full diallel design, resulting in 100 crosses. Two virgin females and two males from the F1 of each cross were randomly assigned into 20 bottles, with 10 males and 10 females placed in each bottle. At each subsequent generation, 20 virgin females and 20 males from each bottle were randomly mixed across bottles to propagate the next generation. The census population size was 800 for each generation of random mating. This mating scheme was continued for 21 generations, resulting in the Sleep Advanced Intercross Population, or SAIP [10, 56]. The SAIP was maintained by pooling the flies from each bottle together, then randomly assigning 20 males and 20 females to each bottle each generation.

Artificial selection procedure for night sleep

At generation 47 of the SAIP, we began the artificial selection procedure, which we defined as generation 0. We seeded six bottles with 25 males and 25 females mixed from all bottles of the outbred population. Two replicate bottles were designated for the short-sleeping protocol (S1 and S2), two for the long-sleeping protocol (L1 and L2), and two for a control (unselected) protocol (C1 and C2). Each generation, 100 virgin males and 100 virgin females were collected from each of the six population bottles. Virgins were maintained at 20 individuals to a same-sex vial for four days to control for the potential effects of social exposure on sleep [57]. Flies were placed into Trikinetics (Waltham, MA) sleep monitors, and sleep and activity were recorded continuously for four days. We used an in-house C# program (R. Sean Barnes, personal communication) to calculate sleep duration, bout number, and average bout length during the night and day, as well as waking activity. We also calculated sleep latency, defined as the number of minutes prior to the first sleep bout after the incubator lights turn off. In addition, we computed the coefficient of environmental variation (CV_E) for each sleep trait as the product of the standard deviation in each replicate population (σ) divided by the mean (μ) × 100 [58].

All sleep traits including night sleep duration were averaged over the four-day period. For the short (long)-sleeping populations, we chose the 25 males and 25 females in each replicate population having the lowest (highest) average night sleep as parents for the next generation. Any flies found dead were discarded, and the next shortest (longest)-sleeping fly was used in order to ensure that 25 females and 25 males were used as parents. For the control populations, we chose 25 males and 25 females at random to start the next generation. Flies were not mixed across replicate populations. We repeated this procedure for 13 generations.

Quantitative genetic analyses of selected and correlated phenotypic responses

We analyzed the differences in night sleep among selection populations as well as other potentially correlated sleep traits using a mixed analysis of variance (ANOVA) model: where Y is the phenotype; μ is the overall phenotypic mean; Sel, Sex, and Gen are the fixed effects of selection scheme (short- or long-sleeper), sex, and generation, respectively; Rep is the random effect of replicate population; and ε is the error term. The CV_E traits were assessed using the same model with the replicate terms removed. A statistically significant Sel term indicates a response of the trait to selection for night sleep; a significant Sel × Sex term indicates a sex-specific response to selection. We repeated the analysis for sexes separately using the reduced model where the terms are as defined above. We also analyzed the response to selection in each generation separately using the reduced model and the reduced model for each sex separately per generation. Finally, we analyzed the change in sleep parameters over generations in the control populations using the model where each factor is as defined above. We estimated realized heritability (h²) using the breeder’s equation: where ΣR and ΣS are the cumulative selection response and differential, respectively [59]. The selection response is computed as the difference between the offspring mean night sleep and the mean night sleep of the parental generation. The selection differential is the difference between the mean night sleep of the selected parents and the mean night sleep of the parental generation.

RNA extraction and sequencing

As described above, sleep was monitored in 100 virgin males and 100 virgin females each generation. Twenty-five flies of either sex were used as parents for the next generation, leaving 75 flies of each sex in each selection and control population. Four pools of 10 flies of each sex were chosen at random from these 75 flies and frozen for RNA extraction at 12:00 pm (i.e., ZT6). This timepoint was arbitrarily chosen and is during the fly’s active period. RNA was extracted from two of these pools; the remaining two pools were kept as back-up samples and used if needed. Samples were collected for the initial generation (0), and all subsequent generations. RNA was extracted using Qiazol (Qiagen, Hilden, Germany), followed by phenol-chloroform extraction, isopropanol precipitation, and DNase digestion (Qiagen, Hilden, Germany). Qiagen RNeasy MinElute Cleanup kits (Qiagen, Hilden, Germany) were used to purify RNA according to the manufacturer’s instructions. With the exception of generation 1, which had RNA that was degraded, RNA from all other generations was sequenced. This produced 312 RNA samples (6 populations × 13 generations × 2 sexes × 2 replicate RNA samples).

Poly-A selected stranded mRNA libraries were constructed from 1 μg total RNA using the Illumina TruSeq Stranded mRNA Sample Prep Kits (Illumina, San Diego, CA) according to manufacturer’s instructions with the following exception: PCR amplification was performed for 10 cycles rather than 15 in order to minimize the risk of over-amplification. Unique barcode adapters were applied to each library. Libraries were pooled for sequencing. The pooled libraries were sequenced on multiple lanes of an Illumina HiSeq2500 using version 4 chemistry to achieve a minimum of 38 million 126 base read pairs. The sequences were processed using RTA version 1.18.64 and CASAVA 1.8.2.

RNA alignment of reads

Sequences were assessed for standard quality parameters using fastqc (0.11.4) (Babraham Institute, Cambridge, UK). Reads were aligned to the FB2015_04 Release 6.07 reference annotation of the Drosophila melanogaster genome using STAR [60]. Default parameters were used except that the minimum intron size was specified as 2, and the maximum intron size was specified as 268,107, consistent with the largest intron size in the D. melanogaster genome. STAR outputs aligned sequence to a SAM file format, which contains the code ‘NH’ [60]. An NH of 1 indicates a uniquely mapped read, while NH > 1 indicates that the read did not map uniquely. HTSeq was used to count only the uniquely mapped reads (NH = 1) [61].

Principal Component Analysis (PCA)

It was expected from previous studies of gene expression that there would be large differences in gene expression due to sex [62–72]. We performed Principal Component Analysis to assess those differences (S1 Fig). The principal components of the normalized RNA-seq count normalized matrix were computed, with each gene being treated as a different variable, and each sample a different observation. Samples were projected in the planes of the ten first components, and clustering according to the experimental labels was inspected visually.

Gene normalization and filtering

The combined genic and intergenic counts were normalized by the expression of a pseudo-reference sample computed from the geometric mean of all samples, using the method described by Love et al. [73]. Filtering was performed by computing the 95^th percentile of the distribution of normalized, base 2 logarithm, levels in the intergenic regions for males and females and using those values as cut-off level for the genic regions—i.e. any genes that did not have expression above this level for at least one sample were removed from further analyses [74]. The (linear scale) cutoff expression value for males was 48.6, and for females 102.

Generalized Linear Model analysis of expression data

Analysis of differential expression between selection schemes was initially performed for each gene independently. Given the separation of the expression levels by sex seen in the PCA analysis, analyses were conducted separately for the subsets of male or female flies. We implemented a generalized linear model (GLM) with a hierarchical structure to account for non-independent, replicate-specific parameters. The description is similar to a generalized linear mixed model (GLMM), but uses a Bayesian formulation to specify the hyper-priors and is fully described below. Normalization factors for the RNA levels was performed using the scheme described by Love et al. [73]. A negative binomial likelihood was used and parameterized with the mean (given by the prediction of the linear model) and dispersion parameters; the number of samples (156 for each sex) allowed estimation of the latter together with model coefficients, dispensing with the need of other schemes applied when the number of samples is small, commonly implemented in some packages.

Bayesian inference was used and parameter priors were exploited to treat replicate effects in a hierarchical formulation [75]. Specifically, for each replicate-dependent parameter (say β_short,rep), two parameters were specified at the top-level (μ_short and σ_short), given (hyper-)priors, and estimated from the data together with all other parameters. Below that, both replicate-specific model parameters (β_short,1 and β_short,2) are given the same gaussian prior using top-level parameters (e.g. ) for that coefficient in replicate 1 as well as replicate 2). Under this formulation the full model for the expression of a gene j is given by logμ_j ∝ sel_rep + gen + sel × gen_rep, where a relationship between each set of replicate-dependent parameters is enforced hierarchically through their higher level common parameters and hyperpriors. Explicitly, we have: where X is the design matrix, with binary 0/1 variables indicating parameters that apply to specific treatments (e.g. the entries multiplying β₁,β₂, are present for all, that β_short,1, is present for short sleepers from replicate 1, etc.) except for parameters dependent on the gen variable which takes the value of the generation (e.g. 0 through 13 for the entries multiplying the β_gen parameter in all treatments, and for those multiplying β_short×gen,1 for short sleepers from replicate 1, etc.). Table 1 lists all parameters, their descriptions, design matrix values associated to them, and priors.

Download:

Table 1. Parameter names, description, design values, and priors for Bayesian inference.

https://doi.org/10.1371/journal.pcbi.1011389.t001

Maximum a posteriori probability (MAP) estimates and confidence intervals were obtained using the Stan package [76]. Significance was calculated using a likelihood ratio test comparing the point estimates from the full model to a reduced model not including the interaction terms (i.e. logμ_j,rep = sel_rep + gen). Model p-values were corrected for multiple testing using the Benjamini-Hochberg method [77], with significance defined at the 0.001 level, consistent with the lower threshold applied in other artificial selection studies [28, 32, 33].

Calculation of non-parametric correlations between genes

The correlation coefficients (ρ) between any two pairs of genes can be computed directly from the data. Pearson correlation assumes the relationship between the two variables is linear, while Spearman correlation is rank-based and therefore accommodates non-linear relationships, although it still assumes the relationship is monotonically increasing or decreasing. We therefore computed Spearman correlations between genes that were found to be significant for both males and females in the GLM analysis—one correlation coefficient was obtained for the data subset from each sex-selection combination. The significance of each correlation coefficient is tested using the null hypothesis that ρ = 0. Because the main interest is the interaction between genes in the selected populations that are different from controls we compare the coefficients by computing and comparing the confidence intervals for ρ_sel (where sel can be “short” or “long”) and ρ_control using the normal approximation to arctanh(ρ) [78]. We note that this is not exactly equivalent to the significance testing of the null hypothesis that ρ_sel = ρ_control [79] (which relies on computing the confidence interval for ρ_sel − ρ_control using the same method), since it overestimates the total variance (i.e., one would find fewer significant instances). Nevertheless, the approach is valid and is more broadly applicable, in that it can be computed when a joint distribution with the two variables cannot be obtained—we use the term “significant” for either kind of difference, but explicitly state which one is used.

Gaussian Process regression

Gaussian Processes (GP) are an alternative function-space formulation to the well-known weight-space linear models of the form y = f(x) + ε; their use dates back to the 19^th century and they have been covered extensively in the statistical and information theory literature [80], becoming popular in machine learning applications [39, 81], and more recently implemented in less technical contexts like the life sciences [40]. We give a brief overview of their usefulness, motivate their use in this work, and point to the references above for formal description of the method.

The weight-space linear model expresses the observations in terms of explicit linear coefficients (or weights) of the independent variable, x, possibly with further basis function expansions (e.g. square, x², or higher order polynomials, xⁿ), for instance y = β₀ + β₁x + β₂x² + ε, (where ε is normally distributed noise). Gaussian Processes describe the basis functions implicitly instead, with ; that is, a set y of N observations is distributed according to a multivariate normal distribution with mean given by the vector μ (of size N) and covariance between the values of x given by the matrix K (with dimension N × N). The entries of this matrix in row i, column j are defined by some covariance function such that k_ij = cov(x_i, x_j)—if the covariance function is linear in the values of x, for instance, the prediction for y is a straight line similar to y = β₀ + β₁x. Formulating the model in terms of function-space enables the use of flexible sets of basis functions; this approach of only implicitly describing a basis function, thus avoiding specification of a potentially large basis is called the “kernel trick”. Functions like the commonly used squared exponential kernel can be shown to be equivalent to an infinite number of basis functions [39], and therefore cannot be incorporated in the explicit terms of the weight-space formulation.

While Gaussian Processes are a classic formulation in statistics, the recent surge in machine learning applications has popularized its use in the natural sciences. They have been used to analyze gene expression by using their flexible output in combination with ordinary differential equations [41, 43, 44, 46], with clustering approaches [49], within other regression models [82], or modeling spatial covariance [48]. In the context of our experimental design Gaussian Process Regression could be used as a flexible alternative to GLMs, with each selection scheme having a different mean function μ_sel and a squared exponential covariance function where x takes the values of the generations in our experiment. The exponentiated term gives the correlation c(x, x′) between a pair of time points, with parameter ℓ modulating the correlation level given a distance r = x − x′, and being the signal variance of the data. Under this model, unlike with the GLM analysis, the change in RNA-seq counts is a function not of slope coefficients but of the signal variance . It is worth noting that the signal variance is a scalar constant for all terms in the covariance matrix, so it can also be written as , where C is analogous to K but with correlations instead of covariances, a notation that will be useful shortly.

Multi-channel Gaussian Processes

Despite the extensive use of Gaussian Processes, most applications in the life sciences have been restricted to single-channel GPs; that is, models that only describe one set of observations at a time (here the expression time series for a single gene). These models—in this aspect not unlike GLMs—describe expression of genes independently, i.e. they implicitly assume genes do not interact in any way. Gaussian Processes can however be extended to include covariance between two or more sets of observations, a formulation that seems to be underexploited in the biological literature (but see [53] and [52]). The different dependent variables y_i are sometimes called channels or tasks, and the resulting model is called a multi-task or multi-channel Gaussian Process. The details of the specification of this model can be found in [51] and [50], which we summarize below. For an array of two genes only, for instance, instead of describing each vector y₁ and y₂ separately as multivariate gaussians of dimension N₁ and N₂, respectively, the concatenated vector [y₁ y₂]^T with N₁ + N₂ observations can be modeled as a single multivariate gaussian with a covariance matrix of K dimensions (N₁ + N₂) × (N₁ + N₂), or . The diagonal blocks of the covariance matrix with dimensions N₁ × N₁ and N₂ × N₂ are the same as above, and the off-diagonal blocks of dimensions N₂ × N₁ and N₁ × N₂ specify the correlations between the two points ij from channels 1 and 2 [50].

Finally, the signal variance for each of those blocks need to be specified, and the final matrix is given by [51], and the mean of the multivariate gaussian is specified by a concatenated vector μ = [μ₁ μ₂]^T. The number of parameters is reduced by recognizing that the covariance matrix is symmetric so in this example , where we also dropped the subscript f. For this model, the variation in the RNA levels of say gene 1 is a function not only of , but also of . Therefore, fitting the data with this model infers interaction between genes from scratch without any external information not contained in the array of RNA-seq counts.

The model can be extended to any number of genes, although computational requirements for performing the necessary matrix operations on K also grow with its size and may be limiting—the computational and mathematical limitations of this approach are discussed in S1 Appendix.

Bayesian MCMC inference of Gaussian Processes

Analogously to GLM models, we maintain the negative binomial likelihood for the Gaussian Process inference, but unlike the transition between linear models and their generalized versions, the incorporation of non-gaussian likelihoods is not as straightforward, and requires methods to approximate the underlying latent Gaussian Process model, leading to what is sometimes referred to as Gaussian Process Classification [39]. Because of the Bayesian inference implemented for this model we chose to infer the latent function via Markov Chain Monte Carlo sampling as these variables can be estimated jointly with the other parameters and have priors that by design are standard gaussian, and therefore are straightforward to specify. Table 2 gives the description of all parameters in the Multi-Channel Gaussian Process model and their priors.

Download:

Table 2. Parameter names, description, and priors for Gaussian Process Bayesian inference.

https://doi.org/10.1371/journal.pcbi.1011389.t002

The number of covariance parameters in a multi-channel Gaussian Process model with M channels is (M² − M)/2, and the total number of parameters scales roughly as as the number of channels becomes large. For 100 genes, for instance, that would result in about 5,000 covariances. Due to the statistical challenge of exploring a parameter space with a dimension of several thousand, as well as the computational demand of factorizing a large matrix at each MCMC step, the estimation of the signal covariance parameters between genes was not performed jointly. Instead, each pair of genes was fitted separately, with a single-channel Gaussian Process being first used to estimate the signal variance and bandwidth parameters for each gene and this estimate being used as a prior for the (pairwise) joint inference. This procedure effectively breaks down a Gaussian Process inference of any size into several smaller inference problems requiring factorization of a matrix of size 2N, with a total number of parameters of the order of N, which are computationally much more manageable and can be run in parallel. Because the covariance parameters depend only on the relationship between two variables (here, genes), separate estimation does not affect inference of the parameters; in fact, it removes the constraint of positive-definiteness on the matrix of covariances of all genes (which instead applies to the matrix of two genes only, see S1 Appendix.

Eight parallel chains were run for each estimation with 40 thousand samples each; half were excluded as warm-up and 1 out of every 40 was kept for further calculations. Convergence was assessed using the metric and observing the number of effective samples (ESS) [75]. The annotated model implemented in the Stan probabilistic language is made available at https://github.com/caesoma/Multiple-shifts-in-gene-network-interactions-shape-phenotypes-of-Drosophila-melanogaster. Because inference was done separately for each selection scheme, differences between them were assessed by comparing the posterior distribution of the parameters of interest.

Confirmatory experiments

We tested Minos insertions putatively disrupting five genes from the significant Gaussian Process correlation for their effect on sleep, along with their two background controls. We assayed phenotypes using the same procedure outlined above. Twenty-four flies per sex per line were assayed, and the experiment was replicated twice. Sleep was analyzed using the following ANOVA model: where Sex and Rep are as previously defined above and Genotype refers to the Minos insertion line or control. We conducted RNA-Seq on the Minos insertion lines and their controls. We collected 10 flies per sex/line at the conclusion of sleep monitoring for RNA extraction. RNA was extracted as detailed above, with the following exception: ERCC spike-ins (ThermoFisher Scientific, Waltham, MA) were added to the RNA after the extraction procedure. A total of 28 samples (7 lines × 2 sexes × 2 replicates) were collected and processed. We then sequenced these samples and processed them as detailed above. Note that we discarded one sample due to failed quality control during library preparation. We compared normalized gene expression in the mutants to their respective controls using a Kruskal-Wallis non-parametric test. Expression ratios were computed between knock down genotypes and their wild-type controls (w¹¹¹⁸ or y¹w^67c23) for individual genes predicted by the Gaussian Processes to be significantly correlated with the knocked down genes in the relevant sex-selection scheme combination (S1 Table), henceforth candidate genes. Expression ratios were also computed for sets of 1000 genes chosen at random, each set matching the genetic backgrounds (knockdown and controls) as well as the sex (henceforth random sets); a distribution of expression ratios was generated for each random set.

Results

Phenotypic response to artificial selection

The selection procedure for night sleep was very effective. Long-sleeper and short-sleeper populations had significant differences in night sleep across all generations (P_Sel = 0.0003; S2 Table); in fact, night sleep was different for the two selection schemes for each generation considered separately except for generations 0 and 1 (S3 Table). Both males and females responded equally to the selection procedure. Fig 1A shows the phenotypic response to 13 generations of selection for night sleep. At generation 13, the long-sleeper populations averaged 642.2 ± 3.83 and 667.8 ± 2.97 minutes of night sleep for Replicate 1 and Replicate 2, respectively. The short-sleeper populations averaged 104.3 ± 6.71 and 156.2 ± 8.76 minutes of night sleep for Replicate 1 and Replicate 2, respectively. The average difference between the long- and short-sleeper lines was 537.9 minutes for Replicate 1, and 511.6 minutes for Replicate 2. In contrast, the two control populations did not have differences in their night sleep after 13 generations of random mating (P_Gen = 0.7083; S4 Table). In the initial generation, night sleep was 519.6 ± 10.57 minutes in the Replicate 1 control and 567.9 ± 7.63 minutes in the Replicate 2 control. At generation 13, night sleep was 563.4 ± 7.62 and 542.3 ± 7.91 in Replicates 1 and 2, respectively, a difference of only 43.8 and 25.6 minutes. These negligible changes in night sleep in the control population suggest that little inbreeding depression occurred over the course of the experiment [59]. Selection was asymmetric, with a greater phenotypic response in the direction of reduced night sleep. Note also that night sleep is bounded from 0 to 720 minutes, and the initial generation had 515.39 minutes of night sleep on average across all populations, a fairly long night sleep phenotype. This high initial sleep may explain why the response to selection for short night sleep was more effective. Night sleep is sexually dimorphic [11, 83, 84]; yet both males and females responded to the selection protocol equally (P_Sel×Sex = 0.9492; S2 Table). Thus, we constructed a set of selection populations with nearly 9 hours difference in night sleep.

Download:

Fig 1. Response to artificial selection for night sleep.

(A) Mean and (B) coefficient of environmental variation of night sleep. Plot and regression lines of cumulated selection differential (ΣS) against cumulated selection response (ΣR) for (C) long- and (D) short-sleeping populations, and against cumulated differential ΣD for (E) controls. Light green, Replicate 1 long-sleeper population; Dark green, Replicate 2 long-sleeper population; Orange, Replicate 1 short-sleeper population; Red, Replicate 2 short-sleeper population; Gray, Replicate 1 control population; Black, Replicate 2 control population.

https://doi.org/10.1371/journal.pcbi.1011389.g001

In an artificial selection experiment, some amount of inbreeding will necessarily take place. Only a subset of the animals are selected each generation as parents; thus phenotypic variance is expected to decrease as selection proceeds [59]. However, this is not the case for all artificial selection experiments [59]. We calculated the coefficient of environmental variation (CV_E) [58] and evaluated its trajectory across time in order to determine whether the populations were becoming more or less variable over time. As Fig 1B shows, night sleep CV_E increased over time in the short sleepers, and decreased over time in the long sleepers (P < 0.0001; S5 Table). The increase in CV_E in short sleepers was largely due to a decrease in the population mean as the standard deviation also decreased over time, indicating that the phenotypic variance decreased (S2 Fig). Likewise, the standard deviation decreased in the long sleepers over time, even as the mean night sleep increased, indicating decreased variability in these populations as well. These changes in CV_E mimic previous observations in populations artificially selected for sleep [10]. Regressions of the cumulated response on the cumulated selection differential were used to estimate heritability (h²). Long-sleeper population h² (±SE of the coefficient of regression) were estimated as 0.145 ± 0.021 and 0.141 ± 0.014 (all P < 0.0001) for Replicates 1 and 2, respectively (Fig 1C); short-sleeper population h² were 0.169 ± 0.013 and 0.183 ± 0.019 (all P < 0.0001) for Replicates 1 and 2 (Fig 1D). In contrast, estimated regression coefficients for the control population were non-significant and with high standard errors associated to the regression estimates: 0.405 ± 0.695 (P = 0.57) and −0.078 ± 0.487 (P = 0.88) for Replicates 1 and 2, respectively (Fig 1E).

Correlated response of other sleep traits to selection for night sleep

Traits that are genetically correlated with night sleep might also respond to selection for long or short night sleep [59]. Indeed, some sleep and activity traits have been previously shown to be phenotypically and genetically correlated [11, 83, 84]. We examined the other sleep and activity traits for evidence of a correlated response to selection. Night and day average bout length (P = 0.0008 and P = 0.0391, respectively) and sleep latency (P = 0.0023) exhibited a correlated response to selection for night sleep across generations 0–13, while night and day bout number, day sleep, and waking activity did not (S2 Fig; S2 Table). In the case of day average bout length, the correlated response was sex-specific to males (P = 0.0140) (S2 Table). Significant correlated responses for night and day average bout length and sleep latency did not occur in all generations (S3 Table). Night average bout length responded to selection for night sleep in most generations, while day average bout length responded in only four of the last six generations. Sleep latency responded to selection after the second generation. In addition, we observed significant differences between the long-sleeping and short-sleeping populations for the CV_E of all sleep traits except waking activity CV_E (S2 Fig; S5 Table). However, the pattern of the CV_E for each trait appeared to be more random across time. These correlated responses concur with previous observations we made in selected populations originating from the same outbred population for night sleep and night average bout length, and night sleep and sleep latency [10]. However, unlike the previous study, we did not see a correlated response between night sleep and day sleep, and night sleep and day bout number [10]. The lack of correlated response reflects the relatively low genetic correlation these two traits have with night sleep [11, 84].

Phenotypes in flies used for RNA-Seq

Every generation, we harvested RNA from flies chosen at random from the 200 measured for sleep in each selection population, with the exception of the flies chosen as parents for the next generation. We extracted RNA from two replicates of 10 flies each per sex and selection population. Since these flies amount to only 20% of the flies measured for sleep each generation, their sleep may or may not be representative of the group as a whole. We therefore correlated the mean night sleep for each generation in the flies harvested for RNA with the mean night sleep of all flies measured to determine how similar night sleep was to the total in the group (S3 Fig). The correlations were very high for the selected populations: long-sleeper flies harvested for RNA were very well correlated with the total measured in each population [r² = 0.99 and 0.96 (all P < 0.0001) for Replicate 1 and 2 respectively], as were short-sleepers [r² = 0.99 for Replicate 1 and 0.97 for Replicate 2 (all P < 0.0001)]. The control populations, which did not undergo selection, were somewhat less well correlated. Replicate 1 of the control population had an r² of 0.75 (P = 0.0001) and Replicate 2 had an r² of 0.85 (P < 0.0001). The lower correlations observed in control flies indicate that they were less inbred than the selected populations. Thus, the flies harvested for RNA are very good representatives of each population as a whole.

Hierarchical Generalized Linear Model analysis reveals that selection for night sleep impacts gene expression

For each gene, the linear model analysis produced posterior distributions for the parameters as well as log-likelihood values for the full and reduced models. Point estimates (MAP) are shown in S6 Table for females and S7 Table for males. For the male flies 11,778 genes passed the filtering for low expression, of which 405 were found to have a significant selection scheme effect over the generations of artificial selection (i.e., significant likelihood ratio test for the sel × gen term). Thus, the expression level shift given by the slope of the generalized linear model is different from controls and attributable to selection for long and/or short sleep. For the females 820 genes out of 9,370 with detectable expression were found to be significant. Genes with opposite trends in the short and long selection schemes were compared using the group-level parameter μ_short×gen and μ_long×gen (i.e. the effect that best explains both replicates): 384 genes in females (S8 Table) and 204 genes in the males (S9 Table showed opposite trends by that criterion. Between males and females, 85 genes were common to both sexes. Known functions of these 85 genes from the DAVID gene ontology database are presented in S10 Table. We used these 85 genes in subsequent analyses; see below. Fig 2 shows the fit for one gene.

Download:

Fig 2. Fit of Hierarchical Generalized Linear Model to gene CG1304 for flies selected for short sleep, unselected controls, and selected for long sleep.

The solid lines show the expected value of full model, dashed lines for reduced model, and shaded regions show the 95% credibility interval. Replicate 1 data points are shown in dark gray, Replicate 2 in light gray.

https://doi.org/10.1371/journal.pcbi.1011389.g002

Pairwise Spearman correlation is non-specific and significant for a large fraction of genes

We computed Spearman correlations for all pairwise combinations of the 85 genes common between sexes (S11 Table). Correlations computed using the Spearman method were found to be significant at 95% confidence for 2,999 of the 3,570 possible pairs. The confidence intervals for the correlation coefficients showed no overlap with controls for either short sleepers, long sleepers, or both populations in 1,348 of 3,570 pairs. Thus, a simple correlational analysis identifies a minimum of 38% of the possible interactions among genes as relevant.

Gaussian Process model analysis uncovers nonlinear trends and specifically identifies covariance in expression between genes

As noted above, a simple correlational analysis suggested that large numbers of genes are potentially interacting to alter sleep. Because direct computation of linear model-based correlations cannot account for non-linear effects or spurious confounding trends we fit Gaussian Process models that can account for temporal variation in multiple genes even in the absence of actual interactions between them. The 85 significant genes overlapping between males and females potentially have 3,570 pairwise interactions. To that end, the parameter of interest in the Gaussian Process model is the signal covariance between each pair of genes. This covariance is a measure of the degree of their interaction. We applied the Gaussian Process model for each of the 3,570 pairs for each selection scheme (long, short, and control). As an example, the model fit for one pair of genes from the female gene expression data is shown in Fig 3.

Download:

Fig 3. Fit of Gaussian Process model to pair of genes LysC and CG1304, for female flies selected for short sleep, unselected controls, and selected for long sleep.

The solid lines show the expected value, while the shaded regions show the 95% credibility interval. Replicate 1 data points are shown in dark gray, Replicate 2 in light gray). The expectation for correlations (ρ_sel) is shown for each selection scheme. An asterisk indicates significant difference from controls in selection scheme, as opposed to non-significance (n.s.).

https://doi.org/10.1371/journal.pcbi.1011389.g003

Convergence for all three runs was on the order of , and close to the 4,000 samples expected for each run; therefore, the wide confidence intervals are likely a product of the large dispersion in the data itself. Correlation between gene expression patterns of the two genes is computed by dividing the signal covariance by the square root of the signal variance of each gene—e.g. —that is, similar to computing a correlation coefficient from variances and covariances, but taken as the expectation over the posterior distribution obtained from MCMC.

Fig 3 illustrates the nonlinear trajectories of gene expression that cannot be detected by the GLM model. The two trajectories exhibited high signal covariance between the expression of the two genes in the long sleepers (ρ_l = 0.89) that was significantly different from controls; however, intermediate covariance in the short sleepers (ρ_s = 0.53) did overlap with that of controls, and therefore was not significantly different.

S4 Fig part A shows a pair where interactions in both short and long selection schemes are different from controls, S4 Fig part B shows another pair of genes where neither scheme is different from controls. This illustrates a range of possibilities, including a case where Spearman correlations are significant but GP correlations are not (the opposite also occurs). Parts C and D of S4 Fig fit each gene individually, and the fit does not change substantially between single to multiple channel models.

The 85 single-channel fits were good despite varying levels of dispersion and occasional outliers, indicating no issues with the Gaussian Processes’ ability to fit the temporal patterns of any one gene. For the two-channel inference, upwards of 90% of the chains initially converged under the criterion that ; because the inference method is stochastic it is expected that by chance some chains may not converge and/or mix well with their replicates. Chains that initially failed were rerun up to two times. After three runs over 99% of the chains converged; the reasons for lack of convergence of the remaining were not investigated further. Fig 4 shows six heat maps (one for each sex and selection scheme combination) with the correlations for all pairs of genes calculated as described in the previous figure, summarizing the inferred interactions. Of the 3,570 correlations, 1,612 were greater than 0.5 and 98 greater than 0.9.

Download:

Fig 4. Signal variances and covariances normalized to range [-1,1] for females and males in each of the selection schemes: Short, control, and long.

Each off-diagonal square is the expected value of the interaction between two of 85 genes, for a total of 3,570 pairs.

https://doi.org/10.1371/journal.pcbi.1011389.g004

In addition to computing expected values, the posterior distributions were used to compare the signal covariances between selection schemes and set a cutoff. Distributions of the parameter for each sex-selection scheme were assembled from the parallel MCMC runs; 145 gene pairs in the selected populations are found to be different from controls (i.e. do not overlap with them at 95% credibility for either short, long or both populations). Out of the 145, twelve gene pairs were common between males and females selected for long night sleep and one pair to males and females selected for short sleep; one gene pair was common to females in both selection schemes, and three pairs were common to males. S11 Table shows the expected values of signal covariances normalized by the variances for all two-way interactions side by side with the Spearman correlations. S12 Table shows the subset of significant Gaussian Processes correlations.

We constructed a network for each sex/selection scheme combination based on the magnitude of the correlation between genes. The network for males selected for long sleep having significant gene interactions is shown in Fig 5 (S5 Fig shows the networks for the remaining three sex-selection scheme combinations). S13 Table lists the number of connections (degrees) that each gene has with the others in the network. The average number of connections for long-sleeper males was 2.6; the other three networks had average degrees of 2.0 or less (2.0 for long-sleeper females and short-sleeper males; 1.75 for short-sleeper females).

Download:

Fig 5. Gene interaction network in males selected for long sleep.

Edges represent signal covariances whose posterior distributions do not overlap with that of controls at 95% credibility. Colors and line thickness indicate indicate the strength and the direction of the correlation. Thin gray lines show all 145 interactions significant for at least one of the four sex-selection scheme combinations.

https://doi.org/10.1371/journal.pcbi.1011389.g005

For comparison, looking at significant (ρ_sel ≠ 0) Spearman correlations keeps almost three thousand interactions (i.e. excludes just a bit more than a tenth of the genes), and comparing the distributions ρ_sel versus ρ_control—similar to how the Gaussian Processes are compared—still has over thirteen hundred. Therefore, computing correlations between genes using covariance estimates from the Gaussian Processes appears to increase specificity over direct correlations. Furthermore, the Gaussian Processes appear to be more sensitive in finding 68 gene pairs that are not found to be significant by the first Spearman approach and 18 not found by the second.

Finally, we examined known interactions between the 85 genes and any other genes using the Drosophila Interaction Database, DroID [85]. We found 2,830 interactions; 8 of these were one of the 3,570 between the 85 genes, but none of them overlapped with the 145 gene pairs found to be different from controls. The gene interactions we observed may therefore be unique to sleep.

Mutational analyses confirms role of candidate genes and interacting gene expression networks in sleep

We tested five genes for differences in sleep as compared to their isogenic control: CG12560, CG13793, Cytochrome P450 6a16 (Cyp6a16), highwire (hiw), and Jonah 65Aii (Jon65Aii) (S1 Table). All of the Minos insertions altered night sleep (Fig 6). Night sleep increased from 50—115 minutes beyond the w¹¹¹⁸ control line (all P-values < 0.0125, the Bonferroni-corrected P-value; S1 Table). Flies having a Minos insertion in Jon65Aii slept 66 minutes less than their corresponding y¹w^67c23control (P < 0.0001). All Minos insertions had the same directional effect on night sleep for both males and females, but only the CG12560 and Jon65Aii insertions had statistically significant effects on night sleep on each sex separately (S1 Table). Thus, all genes affected night sleep duration, but CG12560 and Jon65Aii had the greatest effect on both sexes.

Download:

Fig 6. Night sleep in Minos mutations.

The figure compares night sleep duration in each mutant with that of its respective control. Blue circles indicate Minos insertions with w¹¹¹⁸ control strain; red triangles indicate Minos insertions with y¹w^67c23 control strain. **** or ^####, P < 0.0001; ***, P < 0.001, three-way ANOVA. All P-values are less than the Bonferroni-corrected P-value of 0.0125.

https://doi.org/10.1371/journal.pcbi.1011389.g006

Gene expression decreased significantly in the CG12560 and Jon65Aii Minos insertions relative to their controls (S6 Fig). The remaining Minos insertion lines had some changes in gene expression relative to the control, however, the changes were not formally significant. Potential reasons for the lack of a significant change in gene expression in the remaining lines include: the position of the insertion within the targeted gene, which has variable effects on its expression; the relatively low statistical power of the experiment; confining our observation to a single timepoint during the day; or pooling whole flies, which might obscure gene expression changes occurring at a single-tissue level.

Our baseline expectation was that expression levels between knockdown and control lines should not be affected for most genes. For candidate genes, we hypothesized that the ratio of gene expression between the Minos insertion line and its respective control would differ considerably from 1.0; conversely, the ratio of gene expression between Minos insertion line and control would be approximately 1.0 for unrelated genes—that expectation is confirmed by the distributions consistently centered around unity (median = 0.995). We plotted the ratios of candidate genes against the distributions of the matching random sets (Fig 7). Our supposition was largely realized for CG12560 and Jon65Aii, the two genes having significant knockdown in gene expression (S7 Fig and S14 Table).

Download:

Fig 7. Comparison of ratios of gene expression between genes with significant Gaussian Process correlations and unrelated genes for CG12560 and Jon65Aii mutants.

Purple lines show the ratio of mutant gene expression to control for genes with significant Gaussian Process correlations. The distribution of gene ratios for 1, 000 unrelated genes is plotted in the background. Genes having the most extreme ratios are indicated; see S14 Table for the calculations. (A) CG12560 females; (B) CG12560 males; (C) Jon65Aii females; (D) Jon65Aii males.

https://doi.org/10.1371/journal.pcbi.1011389.g007

Discussion

We have shown that robust, reproducible phenotypic changes in Drosophila melanogaster sleep are associated with hundreds (405 in males, 820 in females) of individual shifts in gene expression—and as a consequence hundreds of thousands of potential combinations [ and ]. Nevertheless, unique interactions important to the phenotypes are a comparatively small number (145 out of possible combinations of the 85 genes common to males and females). We have also shown that these interactions cannot be found with linear model analyses or conventional correlation calculations only, but are specifically identified using a combination of an informative experimental design with densely-sampled time points to generate a large scale data set, and a nonparametric, nonlinear model-based approach that explicitly accounts for covariance in gene expression.

The genes we identify herein overlap and extend previous work. Of the 1,140 genes implicated in the generalized linear model, 151 (13.2%) overlapped with previous candidate gene, random mutagenesis, gene expression, and genome-wide association studies of sleep and circadian behavior in flies [10, 11, 84, 86–99]. Notably, previous studies identified the genes CG17574, cry, dro, mip120, Mtk, NPFR1, pdgy, PGRP-LC, Shal, and vari as affecting sleep duration [84, 88–91, 93, 97, 99]. Two genes, ringer and mip120, overlapped with our previous study of DNA sequence variation in flies selected for long and short sleep [10]. In that study we identified a polymorphism in an intron of ringer that changed in allele frequency with selection, with increases in the population frequency of the ‘G’ allele with increasing sleep, while the frequency of the ‘A’ allele increased with decreasing sleep. When the selective breeding procedure was relaxed, the frequency of the ‘G’ allele increased in short-sleeping populations, paralleling an increase in sleep [100]. One possibility is that this polymorphism contributes to the changes in gene expression in ringer that we observed in the present study. Of the 85 genes common to both sexes that we used in the gene interaction networks, 11 (13%) appear in other studies of sleep: CG10444, CG2003, CG5142, CG6785, CG9114, CG9676, CR42646, hiw, NPFR1, Tie, and wb [11, 89, 92, 95]. Thus, our study corroborates genes known to affect sleep, and identifies new candidate genes for sleep as well.

Interestingly, our Gene Ontology analysis identified nine genes from the 85-gene network with predicted Serine endopeptidase/peptidase/hydrolase activity: CG1304, CG10472, CG14990, CG32523, CG9676, grass, Jon65Ai, Jon65Aii, and Jon99Fii. All of these genes are expressed in neurons and epithelial cells, and all genes are expressed at the adult stage [101]. Serine proteases are a large group of proteins (257 in Drosophila) that perform a variety of functions [102]. Their predicted enzymatic activity suggests a putative role in proteolysis. This is an intriguing observation given pioneering work in mammals which suggested a role for sleep in exchanging interstitial fluid and metabolites between the brain and cerebral spinal fluid [8]. Recent work demonstrated that a similar function is conserved in flies via vesicular trafficking through the fly blood-brain barrier [103]. It would be interesting to determine whether these genes function in this process.

We observed changes in night sleep duration for all Minos insertions tested, and the effects were more prominent in females. Sex-specific effects of mutations on sleep are common in flies [84, 104–108], and sex-biased effects are often noted in females [109–113]. Remarkably, we noted gene expression relationships among genes with predicted significant Gaussian Process correlations in the Minos insertion lines despite the fact that the sleep was neither extremely long or short in mutants or controls, and the genetic background of these lines (w¹¹¹⁸ or y¹w^67c23) is completely different from the outbred Sleep Advanced Intercross Population that we used for artificial selection.

Here we extracted RNA from flies at a single circadian timepoint, ZT6. However, gene expression is known to cycle in fly heads and bodies, begging the question of whether the genes we identified at a single timepoint are subject to cycling over the 24-hour day. We therefore compared our list of genes that were significantly associated with selection scheme over generation (405 genes for males; 820 genes for females; 85 genes overlapping both sexes) with genes known to have cycling expression [114–120]. We found that 47 of the 405 genes identified for males cycle (11.6%), 170 of the 820 genes identified for females cycle (20.7%), and 13 of the 85 genes overlapping between males and females cycle (15.3%). Thus, most of the genes we identified are not known to cycle over the 24-hour day.

That complex traits can be mostly explained by additive effects of individual genes (and their expression) is a common and sometimes useful assumption. While it underpins preliminary analyses that allow whole-transcriptome data to be understood, it eliminates the ability to infer interactions between them from the data and stops short from identifying relevant processes. Complex traits involve multiple genes, and the actual interactions giving rise to phenotypes are likely to be highly nonlinear [121]. These nonlinearities are not a mathematical construct, but a biological reality arising from chemical kinetics. Favoring approaches that account for these features will not only increase statistical power, but understanding of actual biological mechanisms beyond simple network representations of gene expression [122].

In most correlation and information-theory based methods the dimension (e.g. time or space) across which samples covary is only implicit [36]; the only possible conclusion from a significant correlation between two sets of observations is that one may have an effect on the other—i.e. the data alone does not allow the distinction between actual interactions and spurious correlation. Bioinformatic pipelines that have correlation as their starting point—in addition to carrying over its limitations—are not straightforwardly comparable to our approach (see S1 Appendix). In the context of Gaussian Processes, correlation between all pairs of data points—including within the same time series, i.e. autocorrelation—is explicit in time (or other dimension), so similar trends do not necessarily imply covariance between the sets of observations. Therefore, on the one hand GPs are a nonparametric method that requires no more biological knowledge than that for computing a linear correlation; on the other hand, while not an explicit description of dynamic biological processes, it is also a model-based approach that can be used within more mechanistic formalisms like differential equations [41, 43], or potentially be used to formulate specific hypotheses and build mechanistic models.

Although somewhat self-evident, it is important to highlight the fact that to describe correlations along time, multiple time points are needed—put another way, the use of a nonlinear model requires enough resolution in the data that the trajectory can be identified. To that end, a single high-resolution, large data set with a specific design, like the one generated in this work, will be more useful than several small data sets, for instance with only initial and final time points and allowing only two-sample linear comparison. Gene expression measured at the terminal generation of selection and compared among selected and control groups does identify candidate genes [24, 25, 28, 29, 31–33], but the relationship between pairs of genes is lost. Some studies evaluated gene expression during the last 2–3 generations of selection [27, 30]; however, the additional sampling was used to confirm consistency rather than change across time. Our approach of sampling over time enabled us to derive interactions between genes and demonstrated that unique gene expression network profiles develop in long sleepers as compared to short sleepers.

When employing methods of increasing complexity or sophistication there is always the question of how relevant the inference is or, in other words, how “real” are the parameters or processes in the model. This pursuit of simplicity may favor the use of methods based on linear models as more palpable approaches and less prone to arbitrary assumptions about how the parameters are put together; however, it is important to realize that linear coefficients are no more real than those of any other model. On the contrary, biological processes are not restricted by our ability to comprehend them. Therefore, what may seem as an Occam’s Razor-like simplicity will probably hinder accurate description of nature. Systems-level understanding of complex biology requires not only more and more detailed data, but better descriptions of the processes and methodology that captures higher-order phenomena. Equivalently, experimental validation of these phenomena will be more technically challenging to accomplish. Despite the additional difficulties, it must be recognized that methods that cannot possibly match the complexity of nature are doomed to scratch all over the surface without realizing a deeper understanding. The Gaussian Processes we apply herein have broad applications to other experimental designs, such as gene expression measured at varying time intervals over the circadian day, or time-based sampling of gene expression responses to drug administration.

Supporting information

S1 Fig. Principal Component Analysis (PCA).

PCA on matrix of normalized expression data shows complete separation of sexes along the first component, which explains 65% of the variance in the data.

https://doi.org/10.1371/journal.pcbi.1011389.s001

(PDF)

S2 Fig. Correlated response to selection for long/short night sleep and associated coefficient of environmental variation.

A, day bout number; B, day bout number coefficient of environmental variation (CV_E); C, day sleep; D, day sleep CV_E; E, night bout number; F, night bout number CV_E; G, night sleep; H, night sleep CV_E; I, waking activity; J, waking activity CV_E; K, sleep latency; L, sleep latency CV_E; M, day average bout length; N, day average bout length CV_E; O, night average bout length; P, night average bout length CV_E; Q, night sleep standard deviation. Light green, Replicate 1 long-sleeper population; Dark green, Replicate 2 long-sleeper population; Orange, Replicate 1 short-sleeper population; Red, Replicate 2 short-sleeper population; Gray, Replicate 1 control population; Black, Replicate 2 control population. CV_E, coefficient of environmental variation.

https://doi.org/10.1371/journal.pcbi.1011389.s002

(PDF)

S3 Fig. Correlation of night sleep between flies harvested for RNA and all flies in the population.

A, long-sleeping Replicate 1; B, long-sleeping Replicate 2; C, short-sleeping Replicate 1; D, short-sleeping Replicate 2; E, control Replicate 1; F, control Replicate 2.

https://doi.org/10.1371/journal.pcbi.1011389.s003

(PDF)

S4 Fig. Gaussian Process model fits to selected genes.

A, fit of Gaussian Process model to pair of genes haf and CG1304; B, fit of Gaussian Process model to pair of genes CR43242 and CG1304; C, fit of single-channel Gaussian Process model to CG1304 gene; D, fit of single-channel Gaussian Process model to LysC gene.

https://doi.org/10.1371/journal.pcbi.1011389.s004

(PDF)

S5 Fig. Gene interaction networks.

A, Males selected for short sleep; B, Females selected for long sleep; C, Females selected for short sleep.

https://doi.org/10.1371/journal.pcbi.1011389.s005

(PDF)

S6 Fig. Gene expression in Minos mutants.

For each candidate gene, the gene expression in the Minos mutant and corresponding control are plotted. * or ^#P < 0.05 by Kruskal-Wallis test. A, CG12560; B, Jon65Aii; C, CG13793; D, Cyp6a16; E, hiw.

https://doi.org/10.1371/journal.pcbi.1011389.s006

(PDF)

S7 Fig. Comparison of ratios of gene expression between genes with significant Gaussian Process correlations and unrelated genes for CG13793, Cyp6a16, and hiw Minos mutants.

A, CG13793 females; B, CG13793 males; C, Cyp6a16 males; D, hiw females; E, hiw males.

https://doi.org/10.1371/journal.pcbi.1011389.s007

(PDF)

S1 Appendix. Notes on multichannel Gaussian Processes.

https://doi.org/10.1371/journal.pcbi.1011389.s008

(PDF)

S1 Table. Effects of Minos insertions on sleep.

For each gene the table lists the Flybase ID, Bloomington Drosophila Stock Center (BDSC) number, Minos genotype, and isogenic control line. For each sleep trait, the number of flies tested and mean sleep phenotype is given for sexes combined and females and males separately. P-values are listed for each term in the ANOVA model for sexes combined and for males and females separately. Significance is indicated by bold P-values.

https://doi.org/10.1371/journal.pcbi.1011389.s009

(XLSX)

S2 Table. Quantitative genetics of the response to selection for long or short night sleep and related sleep parameters.

For each trait, the ANOVA analysis results are presented. Source indicates each factor in the model. gen, generation; rep, replicate; sel, selection scheme; d.f., degrees of freedom; M.S., Type III mean squares; F, F ratio statistic; P, P–value.

https://doi.org/10.1371/journal.pcbi.1011389.s010

(XLSX)

S3 Table. Quantitative genetics of the response to selection for long or short night sleep per generation.

For each sleep trait, the ANOVA analysis results are presented for each generation. Source indicates each factor in the model. rep, replicate; sel, selection scheme; d.f., degrees of freedom; M.S., Type III mean squares; F, F ratio statistic; P, P-value.

https://doi.org/10.1371/journal.pcbi.1011389.s011

(XLSX)

S4 Table. Quantitative genetics of control populations.

For each sleep trait, the ANOVA analysis results are presented. Source indicates each factor in the model. gen, generation; rep, replicate; d.f., degrees of freedom; MS, Type III mean squares; F, F ratio statistic; P, P-value.

https://doi.org/10.1371/journal.pcbi.1011389.s012

(XLSX)

S5 Table. Correlated response of sleep trait coefficient of environmental variance (CV_E) to selection for long or short night sleep duration.

For each sleep trait listed, the ANOVA results are presented. Source indicates each factor in the model. gen, generation; sel, selection scheme; d.f., degrees of freedom; M.S., Type III mean squares; F, F ratio statistic; P, P-value.

https://doi.org/10.1371/journal.pcbi.1011389.s013

(XLSX)

S6 Table. GLM analysis results for females.

GLM analysis results for each gene in females are shown as a row; the Maximum a Posteriori (MAP) parameter estimates and log-likelihoods are shown as well as p-values computed from the likelihood ratio test. Significance statistics corrected for multiple testing are also included, as well as the normalized counts for all samples.

https://doi.org/10.1371/journal.pcbi.1011389.s014

(XLSX)

S7 Table. GLM analysis results for males.

GLM analysis results for each gene in males are shown as a row; the Maximum a Posteriori (MAP) parameter estimates and log-likelihoods are shown as well as p-values computed from the likelihood ratio test. Significance statistics corrected for multiple testing are also included, as well as the normalized counts for all samples.

https://doi.org/10.1371/journal.pcbi.1011389.s015

(XLSX)

S8 Table. Genes with opposite slopes for the short and long interaction terms of generation in females.

Columns have the same meaning as those in S6 Table.

https://doi.org/10.1371/journal.pcbi.1011389.s016

(XLSX)

S9 Table. Genes with opposite slopes for the short and long interaction terms of generation in males.

Columns have the same meaning as those in S7 Table.

https://doi.org/10.1371/journal.pcbi.1011389.s017

(XLSX)

S10 Table. Gene Ontology (GO) analysis results for 85 significant genes common to males and females.

The table lists GO classification (Biological Process (BP), Molecular Function (MF), or Cellular Component (CC)); the GO term description; the number of genes associated with each GO term and their percentage relative to the total number of genes with that GO term in D. melanogaster; the enrichment P value, and the Benjamini-adjusted P value.

https://doi.org/10.1371/journal.pcbi.1011389.s018

(XLSX)

S11 Table. Correlations obtained from normalizing Gaussian Process signal covariances (GP correlation) and from Spearman Correlation for each of the six sex and selection scheme combinations.

https://doi.org/10.1371/journal.pcbi.1011389.s019

(XLSX)

S12 Table. Expected values for the correlations obtained from normalizing Gaussian Process signal covariances (GP correlation) that do not overlap with controls for each of the six sex and selection scheme combinations.

The value is missing if there is an overlap with controls in that condition.

https://doi.org/10.1371/journal.pcbi.1011389.s020

(XLSX)

S13 Table. Degree for each gene in the GP network.

For each sex and selection scheme, the table lists the number of genes connected to the gene in the network. NA, not applicable.

https://doi.org/10.1371/journal.pcbi.1011389.s021

(XLSX)

S14 Table. Gene expression ratios calculated from Minos insertion line and control RNA-Seq data.

For each target gene the table lists the corresponding control line, the Flybase ID and gene symbol of the gene predicted to interact with the target gene, the normalized expression of the interacting gene for the Minos target gene line and isogenic control line, and the ratio of normalized expression (Minos/control).

https://doi.org/10.1371/journal.pcbi.1011389.s022

(XLSX)

S15 Table. Night sleep phenotypes.

For each selection scheme, sex, generation, and population replicate, the number of flies, mean night sleep, and standard deviation (SD) of night sleep are listed.

https://doi.org/10.1371/journal.pcbi.1011389.s023

(XLSX)

Acknowledgments

We thank the members of the NISC Consortium for sequence data and helpful discussions. This work used the computational resources of the National Institutes of Health High-Performance Computing Biowulf cluster (http://hpc.nih.gov). We thank N. Redekar and N. Gulzar from the NIAID Collaborative Bioinformatics Resource (NCBR) for assistance with bioinformatic processing.

References

1. Berger RJ, Phillips NH. Energy conservation and sleep. Behav Brain Res. 1995;69:65–73. pmid:7546319
- View Article
- PubMed/NCBI
- Google Scholar
2. Scharf MT, Naidoo N, Zimmerman JE, Pack AI. The energy hypothesis of sleep revisited. Progress in Neurobiology. 2008;86(3):264–280. pmid:18809461
- View Article
- PubMed/NCBI
- Google Scholar
3. Schmidt MH. The energy allocation function of sleep: A unifying theory of sleep, torpor, and continuous wakefulness. Neuroscience & Biobehavioral Reviews. 2014;47:122–153. pmid:25117535
- View Article
- PubMed/NCBI
- Google Scholar
4. Krueger JM, Obál F. A neuronal group theory of sleep function. Journal of Sleep Research. 1993;2(2):63–69. pmid:10607073
- View Article
- PubMed/NCBI
- Google Scholar
5. Tononi G, Cirelli C. Sleep and the Price of Plasticity: From Synaptic and Cellular Homeostasis to Memory Consolidation and Integration. Neuron. 2014;81(1):12–34. pmid:24411729
- View Article
- PubMed/NCBI
- Google Scholar
6. Joiner WJ. Unraveling the Evolutionary Determinants of Sleep. Current Biology. 2016;26(20):R1073–R1087. pmid:27780049
- View Article
- PubMed/NCBI
- Google Scholar
7. Ly S, Pack AI, Naidoo N. The neurobiological basis of sleep: Insights from Drosophila. Neuroscience and Biobehavioral Reviews. 2018;87:67–86. pmid:29391183
- View Article
- PubMed/NCBI
- Google Scholar
8. Xie L, Kang H, Xu Q, Chen MJ, Liao Y, Thiyagarajan M, et al. Sleep drives metabolite clearance from the adult brain. Science. 2013;342(6156):373–377. pmid:24136970
- View Article
- PubMed/NCBI
- Google Scholar
9. Hill VM, O’Connor RM, Shirasu-Hiza M. Tired and stressed: Examining the need for sleep. European Journal of Neuroscience. 2020;51(1):494–508. pmid:30295966
- View Article
- PubMed/NCBI
- Google Scholar
10. Harbison ST, Serrano Negron YL, Hansen NF, Lobell AS. Selection for long and short sleep duration in Drosophila melanogaster reveals the complex genetic network underlying natural variation in sleep. PLOS Genetics. 2017;13(12):e1007098. pmid:29240764
- View Article
- PubMed/NCBI
- Google Scholar
11. Harbison ST, McCoy LJ, Mackay TFC. Genome-wide association study of sleep in Drosophila melanogaster. BMC Genomics. 2013;14(1):281. pmid:23617951
- View Article
- PubMed/NCBI
- Google Scholar
12. Laing EE, Möller-Levet CS, Dijk DJ, Archer SN. Identifying and validating blood mRNA biomarkers for acute and chronic insufficient sleep in humans: A machine learning approach. Sleep. 2019;42(1):1–18. pmid:30247731
- View Article
- PubMed/NCBI
- Google Scholar
13. Dashti HS, Jones SE, Wood AR, Lane JM, van Hees VT, Wang H, et al. Genome-wide association study identifies genetic loci for self-reported habitual sleep duration supported by accelerometer-derived estimates. Nature Communications. 2019;10(1). pmid:30846698
- View Article
- PubMed/NCBI
- Google Scholar
14. Jones SE, Tyrrell J, Wood AR, Beaumont RN, Ruth KS, Tuke MA, et al. Genome-Wide Association Analyses in 128,266 Individuals Identifies New Morningness and Sleep Duration Loci. PLOS Genetics. 2016;12(8):e1006125. pmid:27494321
- View Article
- PubMed/NCBI
- Google Scholar
15. Jansen PR, Watanabe K, Stringer S, Skene N, Bryois J, Hammerschlag AR, et al. Genome-wide analysis of insomnia in 1,331,010 individuals identifies new risk loci and functional pathways. Nature Genetics. 2019;51(3):394–403. pmid:30804565
- View Article
- PubMed/NCBI
- Google Scholar
16. Lane JM, Jones SE, Dashti HS, Wood AR, Aragam KG, van Hees VT, et al. Biological and clinical insights from genetics of insomnia symptoms. Nature Genetics. 2019;51(3):387–393. pmid:30804566
- View Article
- PubMed/NCBI
- Google Scholar
17. Hammerschlag AR, Stringer S, de Leeuw CA, Sniekers S, Taskesen E, Watanabe K, et al. Genome-wide association analysis of insomnia complaints identifies risk genes and genetic overlap with psychiatric and metabolic traits. Nature Genetics. 2017;49(11):1584–1592. pmid:28604731
- View Article
- PubMed/NCBI
- Google Scholar
18. Diessler S, Jan M, Emmenegger Y, Guex N, Middleton B, Skene DJ, et al. A systems genetics resource and analysis of sleep regulation in the mouse. PLoS Biology. 2018;16(8). pmid:30091978
- View Article
- PubMed/NCBI
- Google Scholar
19. Joshi SS, Sethi M, Striz M, Cole N, Denegre JM, Ryan J, et al. Noninvasive sleep monitoring in large-scale screening of knock-out mice reveals novel sleep-related genes. bioRxiv. 2019; 517680.
- View Article
- Google Scholar
20. Boyle EA, Li YI, Pritchard JK. An Expanded View of Complex Traits: From Polygenic to Omnigenic. Cell. 2017;169(7):1177–1186. pmid:28622505
- View Article
- PubMed/NCBI
- Google Scholar
21. Schlötterer C, Kofler R, Versace E, Tobler R, Franssen SU. Combining experimental evolution with next-generation sequencing: a powerful tool to study adaptation from standing genetic variation. Heredity. 2015;114(5):431–440. pmid:25269380
- View Article
- PubMed/NCBI
- Google Scholar
22. Faria VG, Martins NE, Paulo T, Teixeira L, Sucena É, Magalhães S. Evolution of Drosophila resistance against different pathogens and infection routes entails no detectable maintenance costs. Evolution. 2015;69(11):2799–2809. pmid:26496003
- View Article
- PubMed/NCBI
- Google Scholar
23. Faria VG, Martins NE, Magalhães S, Paulo TF, Nolte V, Schlötterer C, et al. Drosophila Adaptation to Viral Infection through Defensive Symbiont Evolution. PLoS Genetics. 2016;12(9):1–18. pmid:27684942
- View Article
- PubMed/NCBI
- Google Scholar
24. Pegoraro M, Flavell LMM, Menegazzi P, Colombi P, Dao P, Helfrich-Forster C, et al. The genetic basis of diurnal preference in Drosophila melanogaster. BMC Genomics. 2020;21(1). pmid:32862827
- View Article
- PubMed/NCBI
- Google Scholar
25. Brown EB, Patterson C, Pancoast R, Rollmann SM. Artificial selection for odor-guided behavior in Drosophila reveals changes in food consumption. BMC Genomics. 2017;18(1):1–13. pmid:29132294
- View Article
- PubMed/NCBI
- Google Scholar
26. Brown EB, Layne JE, Elchert AR, Rollmann SM. Behavioral and transcriptional response to selection for olfactory behavior in Drosophila. G3: Genes, Genomes, Genetics. 2020;10(4):1283–1296. pmid:32024668
- View Article
- PubMed/NCBI
- Google Scholar
27. Garlapow ME, Everett LJ, Zhou S, Gearhart AW, Fay KA, Huang W, et al. Genetic and Genomic Response to Selection for Food Consumption in Drosophila melanogaster. Behavior Genetics. 2017;47(2):227–243. pmid:27704301
- View Article
- PubMed/NCBI
- Google Scholar
28. Mackay TFC, Heinsohn SL, Lyman RF, Moehring AJ, Morgan TJ, Rollmann SM. Genetics and genomics of Drosophila mating behavior. Proceedings of the National Academy of Sciences. 2005;102(Supplement 1):6622–6629. pmid:15851659
- View Article
- PubMed/NCBI
- Google Scholar
29. Wertheim B, Kraaijeveld AR, Hopkins MG, Walther Boer M, Godfray HCJ. Functional genomics of the evolution of increased resistance to parasitism in Drosophila. Molecular Ecology. 2011;20(5):932–949. pmid:21062384
- View Article
- PubMed/NCBI
- Google Scholar
30. Telonis-Scott M, Hallas R, McKechnie SW, Wee CW, Hoffmann AA. Selection for cold resistance alters gene transcript levels in Drosophila melanogaster. Journal of Insect Physiology. 2009;55(6):549–555. pmid:19232407
- View Article
- PubMed/NCBI
- Google Scholar
31. Sørensen JG, Nielsen MM, Loeschcke V. Gene expression profile analysis of Drosophila melanogaster selected for resistance to environmental stressors. Journal of Evolutionary Biology. 2007;20(4):1624–1636. pmid:17584255
- View Article
- PubMed/NCBI
- Google Scholar
32. Morozova TV, Anholt RRH, Mackay TFC. Phenotypic and transcriptional response to selection for alcohol sensitivity in Drosophila melanogaster. Genome Biology. 2007;8(10):1–15. pmid:17973985
- View Article
- PubMed/NCBI
- Google Scholar
33. Edwards AC, Rollmann SM, Morgan TJ, Mackay TFC. Quantitative Genomics of Aggressive Behavior in Drosophila melanogaster. PLoS Genetics. 2006;2(9):1386–1395. pmid:17044737
- View Article
- PubMed/NCBI
- Google Scholar
34. Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9:559. pmid:19114008
- View Article
- PubMed/NCBI
- Google Scholar
35. Stone EA, Ayroles JF. Modulated Modularity Clustering as an Exploratory tool for functional genomic inference. PLoS Genetics. 2009;5(5):e1000479. pmid:19424432
- View Article
- PubMed/NCBI
- Google Scholar
36. Emmert-Streib F, Glazko GV, Altay G, dM Simoes R. Statistical inference and reverse engineering of gene regulatory networks from observational expression data. Frontiers in Genetics. 2012;3(FEB):1–15. pmid:22408642
- View Article
- PubMed/NCBI
- Google Scholar
37. Villaverde AF, Banga JR. Reverse engineering and identification in systems biology: strategies, perspectives and challenges. Journal of The Royal Society Interface. 2014;11(91):20130505. pmid:24307566
- View Article
- PubMed/NCBI
- Google Scholar
38. Liu ZP. Reverse Engineering of Genome-wide Gene Regulatory Networks from Gene Expression Data. Current Genomics. 2015;16(1):3–22. pmid:25937810
- View Article
- PubMed/NCBI
- Google Scholar
39. Rasmussen CE, Williams CKI. Gaussian Processes for Machine Learning. The MIT Press; 2006.
40. Schulz E, Speekenbrink M, Krause A. A tutorial on Gaussian process regression: Modelling, exploring, and exploiting functions. Journal of Mathematical Psychology. 2018.
- View Article
- Google Scholar
41. Yang S, Wong SWK, Kou SC. Inference of dynamic systems from noisy and sparse data via manifold-constrained Gaussian Processes. Proc Natl Acad Sci USA. 2021;118(15):e2020397118. pmid:33837150
- View Article
- PubMed/NCBI
- Google Scholar
42. Liu W, Niranjan N. Gaussian process modelling for bicoid mRNA regulation in spatio-temporal Bicoid profile. Bioinformatics. 2012;28(3):366–372. pmid:22130592
- View Article
- PubMed/NCBI
- Google Scholar
43. Äijö T, Granberg K, Lähdesmäki H. Sorad: a systems biology approach to predict and modulate dynamic signaling pathway response from phosphoproteome time-course measurements. Bioinformatics. 2013;29(10):1283–1291. pmid:23505293
- View Article
- PubMed/NCBI
- Google Scholar
44. Aalto A, Viitasaari L, Ilmonen P, Mombaerts L, Gonçalves J. Gene regulatory network inference from sparsely sampled noisy data. Nature Communications. 2020;11(1). pmid:32661225
- View Article
- PubMed/NCBI
- Google Scholar
45. Gao P, Honkela A, Rattray M, Lawrence ND. Gaussian process modelling of latent chemical species: applications to inferring transcription factor activities. Bioinformatics. 2008;24:i70–i75. pmid:18689843
- View Article
- PubMed/NCBI
- Google Scholar
46. Honkela A, Girardot C, Gustafson EH, Liu YH, Furlong EEM, Lawrence ND, et al. Model-based method for transcription factor target identification with limited data. Proceedings of the National Academy of Sciences. 2010;107(17):7793–7798. pmid:20385836
- View Article
- PubMed/NCBI
- Google Scholar
47. Hensman J, Lawrence ND, Rattray M. Hierarchical Bayesian modelling of gene expression time series across irregularly sampled replicates and clusters. BMC Bioinformatics. 2013;14:252–264. pmid:23962281
- View Article
- PubMed/NCBI
- Google Scholar
48. Arnol D, Schapiro D, Bodenmiller B, Saez-Rodriguez J, Stegle O. Modeling Cell-Cell Interactions from Spatial Molecular Data with Spatial Variance Component Analysis. Cell Reports. 2019;29(1):202–211.e6. pmid:31577949
- View Article
- PubMed/NCBI
- Google Scholar
49. McDowell IC, Manandhar D, Vockley CM, Schmid AK, Reddy TE, Engelhardt BE. Clustering gene expression time series data using an infinite Gaussian process mixture model. PLoS Computational Biology. 2018;14(1):1–27. pmid:29337990
- View Article
- PubMed/NCBI
- Google Scholar
50. Melkumyan A, Ramos F. Multi-kernel Gaussian processes. In: IJCAI International Joint Conference on Artificial Intelligence; 2011. p. 1408–1413.
51. Bonilla EV, Chai KMA, Williams CKI. Multi-task Gaussian Process prediction. In: Advances in Neural Information Processing Systems 20. NIPS Foundation; 2008. p. 153–160.
52. Bahg G, Evans DG, Galdo M, Turner BM. Gaussian process linking functions for mind, brain, and behavior. Proceedings of the National Academy of Sciences of the United States of America. 2020;117(47):29398–29406. pmid:33229563
- View Article
- PubMed/NCBI
- Google Scholar
53. Velten B, Braunger JM, Argelaguet R, Arnol D, Wirbel J, Bredikhin D, et al. Identifying temporal and spatial patterns of variation from multi- modal data using MEFISTO. Nature Methods. 2022;19:179–186. pmid:35027765
- View Article
- PubMed/NCBI
- Google Scholar
54. Mackay TFC, Richards S, Stone EA, Barbadilla A, Ayroles JF, Zhu D, et al. The Drosophila melanogaster Genetic Reference Panel. Nature. 2012;482(7384):173–178. pmid:22318601
- View Article
- PubMed/NCBI
- Google Scholar
55. Huang W, Massouras A, Inoue Y, Peiffer J, Ramia M, Tarone AM, et al. Natural variation in genome architecture among 205 Drosophila melanogaster Genetic Reference Panel lines. Genome Research. 2014;24(7):1193–1208. pmid:24714809
- View Article
- PubMed/NCBI
- Google Scholar
56. Serrano Negron YL, Hansen NF, Harbison ST. The Sleep Inbred Panel, a Collection of Inbred Drosophila melanogaster with Extreme Long and Short Sleep Duration. G3: Genes—Genomes—Genetics. 2018;8(9):2865–2873. pmid:29991508
- View Article
- PubMed/NCBI
- Google Scholar
57. Ganguly-Fitzgerald I, Donlea J, Shaw PJ. Waking Experience Affects Sleep Need in Drosophila. Science. 2006;313(5794):1775–1781. pmid:16990546
- View Article
- PubMed/NCBI
- Google Scholar
58. Mackay TF, Lyman RF. Drosophila bristles and the nature of quantitative genetic variation. Philosophical Transactions of the Royal Society B: Biological Sciences. 2005;360(1459):1513–1527. pmid:16108138
- View Article
- PubMed/NCBI
- Google Scholar
59. Falconer DS, Mackay TFC. Introduction to Quantitative Genetics, Fourth Edition. Addison Wesley Longman Limited; 1996.
60. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: Ultrafast universal RNA-seq aligner. Bioinformatics. 2013. pmid:23104886
- View Article
- PubMed/NCBI
- Google Scholar
61. Anders S, Pyl PT, Huber W. HTSeq-A Python framework to work with high-throughput sequencing data. Bioinformatics. 2015. pmid:25260700
- View Article
- PubMed/NCBI
- Google Scholar
62. Jin W, Riley RM, Wolfinger RD, White KP, Passador-Gurgel G, Gibson G. The contributions of sex, genotype and age to transcriptional variance in Drosophila melanogaster. Nature Genetics. 2001;29(4):389–395. pmid:11726925
- View Article
- PubMed/NCBI
- Google Scholar
63. Arbeitman MN, Furlong EEM, Imam F, Johnson E, Null BH, Baker BS, et al. Gene Expression During the Life Cycle of Drosophila melanogaster. Science. 2002;297(5590):2270–2275. pmid:12351791
- View Article
- PubMed/NCBI
- Google Scholar
64. Parisi M, Nuttall R, Naiman D, Bouffard G, Malley J, Andrews J, et al. Paucity of Genes on the Drosophila X Chromosome Showing Male-Biased Expression. Science. 2003;299(5607):697–700. pmid:12511656
- View Article
- PubMed/NCBI
- Google Scholar
65. Ranz JM, Castillo-Davis CI, Meiklejohn CD, Hartl DL. Sex-dependent gene expression and evolution of the Drosophila transcriptome. Science. 2003;300(5626):1742–1745. pmid:12805547
- View Article
- PubMed/NCBI
- Google Scholar
66. Harbison ST, Chang S, Kamdar KP, Mackay TFC. Quantitative genomics of starvation stress resistance in Drosophila. Genome biology. 2005;6:R36. pmid:15833123
- View Article
- PubMed/NCBI
- Google Scholar
67. Wayne ML, Telonis-Scott M, Bono LM, Harshman L, Kopp A, Nuzhdin SV, et al. Simpler mode of inheritance of transcriptional variation in male Drosophila melanogaster. Proceedings of the National Academy of Sciences. 2007;104(47):18577–18582. pmid:18003923
- View Article
- PubMed/NCBI
- Google Scholar
68. Zhang Y, Sturgill D, Parisi M, Kumar S, Oliver B. Constraint and turnover in sex-biased gene expression in the genus Drosophila. Nature. 2007;450(7167):233–237. pmid:17994089
- View Article
- PubMed/NCBI
- Google Scholar
69. Ayroles JF, Carbone MA, Stone EA, Jordan KW, Lyman RF, Magwire MM, et al. Systems genetics of complex traits in Drosophila melanogaster. Nature Genetics. 2009;41(3):299–307. pmid:19234471
- View Article
- PubMed/NCBI
- Google Scholar
70. Huylmans AK, Parsch J. Population- and Sex-Biased Gene Expression in the Excretion Organs of Drosophila melanogaster. G3: Genes—Genomes—Genetics. 2014;4(12):2307–2315. pmid:25246242
- View Article
- PubMed/NCBI
- Google Scholar
71. Huang W, Carbone MA, Magwire MM, Peiffer JA, Lyman RF, Stone EA, et al. Genetic basis of transcriptome diversity in Drosophila melanogaster. Proceedings of the National Academy of Sciences. 2015;112(44):E6010–E6019. pmid:26483487
- View Article
- PubMed/NCBI
- Google Scholar
72. Lin Y, Chen ZX, Oliver B, Harbison ST. Microenvironmental gene expression plasticity among individual drosophila melanogaster. G3: Genes, Genomes, Genetics. 2016;6(12):4197–4210. pmid:27770026
- View Article
- PubMed/NCBI
- Google Scholar
73. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology. 2014;15(12):550. pmid:25516281
- View Article
- PubMed/NCBI
- Google Scholar
74. Zhang Y, Malone JH, Powell SK, Periwal V, Spana E, MacAlpine DM, et al. Expression in Aneuploid Drosophila S2 Cells. PLoS Biology. 2010;8(2):e1000320. pmid:20186269
- View Article
- PubMed/NCBI
- Google Scholar
75. Gelman A, Carlin JB, Stern HS, Dunson DB, Vehtari A, Rubin DB. Bayesian data analysis, third edition. CRC Press; 2013.
76. Carpenter B, Gelman A, Hoffman MD, Lee D, Goodrich B, Betancourt M, et al. Stan: A Probabilistic Programming Language. Journal of Statistical Software. 2017;76(1). pmid:36568334
- View Article
- PubMed/NCBI
- Google Scholar
77. Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society: Series B (Methodological). 1995;57(1):289–300.
- View Article
- Google Scholar
78. Ruscio J. Constructing confidence intervals for Spearman’s rank correlation with ordinal data: A simulation study comparing analytic and bootstrap methods. Journal of Modern Applied Statistical Methods. 2008;7(2):416–434.
- View Article
- Google Scholar
79. Austin PC, Hux JE. A brief note on overlapping confidence intervals. Journal of Vascular Surgery. 2002;36(1):194–195. pmid:12096281
- View Article
- PubMed/NCBI
- Google Scholar
80. MacKay DJ. Information theory, inference and learning algorithms. Cambridge university press; 2003.
81. Bishop CM. Pattern recognition and machine learning. Springer; 2006.
82. Kontio JAJ, Sillanpää MJ. Scalable Nonparametric Prescreening Method for Searching Higher-Order Genetic Interactions Underlying Quantitative Traits. Genetics. 2019;213(4):1209–1224. pmid:31585953
- View Article
- PubMed/NCBI
- Google Scholar
83. Harbison ST, Sehgal A. Quantitative Genetic Analysis of Sleep in Drosophila melanogaster. Genetics. 2008;178(4):2341–2360. pmid:18430954
- View Article
- PubMed/NCBI
- Google Scholar
84. Harbison ST, Carbone MA, Ayroles JF, Stone EA, Lyman RF, Mackay TFC. Co-regulated transcriptional networks contribute to natural genetic variation in Drosophila sleep. Nature Genetics. 2009. pmid:19234472
- View Article
- PubMed/NCBI
- Google Scholar
85. Murali T, Pacifico S, Yu J, Guest S, Roberts GG, Finley RL. DroID 2011: a comprehensive, integrated resource for protein, transcription factor, RNA and gene interactions for Drosophila. Nucleic Acids Research. 2011;39(suppl_1):D736–D743. pmid:21036869
- View Article
- PubMed/NCBI
- Google Scholar
86. Harbison ST, Kumar S, Huang W, McCoy LJ, Smith KR, Mackay TFC. Genome-Wide Association Study of Circadian Behavior in Drosophila melanogaster. Behavior Genetics. 2019;49(1):60–82. pmid:30341464
- View Article
- PubMed/NCBI
- Google Scholar
87. Thimgan MS, Suzuki Y, Seugnet L, Gottschalk L, Shaw PJ. The Perilipin homologue, Lipid Storage Droplet 2, regulates sleep homeostasis and prevents learning impairments following sleep loss. PLoS Biology. 2010;8. pmid:20824166
- View Article
- PubMed/NCBI
- Google Scholar
88. Thimgan MS, Kress N, Lisse J, Fiebelman C, Hilderbrand T. The acyl-CoA synthetase, pudgy, promotes sleep and is required for the homeostatic response to sleep deprivation. Frontiers in Endocrinology. 2018;9. pmid:30186232
- View Article
- PubMed/NCBI
- Google Scholar
89. He C, Yang Y, Zhang M, Price JL, Zhao Z. Regulation of sleep by Neuropeptide-Y-like system in Drosophila melanogaster. PLoS ONE. 2013;8. pmid:24040211
- View Article
- PubMed/NCBI
- Google Scholar
90. Mallon EB, Alghamdi A, Holdbrook RTK, Rosato E. Immune stimulation reduces sleep and memory ability in Drosophila melanogaster. PeerJ. 2014. pmid:24949247
- View Article
- PubMed/NCBI
- Google Scholar
91. Dissel S, Seugnet L, Thimgan MS, Silverman N, Angadi V, Thacher PV, et al. Differential activation of immune factors in neurons and glia contribute to individual differences in resilience/vulnerability to sleep disruption. Brain, Behavior, and Immunity. 2015;47:75–85. pmid:25451614
- View Article
- PubMed/NCBI
- Google Scholar
92. Seugnet L, Dissel S, Thimgan M, Cao L, Shaw PJ. Identification of genes that maintain behavioral and structural plasticity during sleep loss. Frontiers in Neural Circuits. 2017;11. pmid:29109678
- View Article
- PubMed/NCBI
- Google Scholar
93. Feng G, Zhang J, Li M, Shao L, Yang L, Song Q, et al. Control of sleep onset by Shal/Kv4 channels in Drosophila circadian neurons. The Journal of Neuroscience. 2018;38.
- View Article
- Google Scholar
94. Shalaby NA, Pinzon JH, Narayanan AS, Jin EJ, Ritz MP, Dove RJ, et al. JmjC domain proteins modulate circadian behaviors and sleep in Drosophila. Scientific Reports. 2018;8. pmid:29339751
- View Article
- PubMed/NCBI
- Google Scholar
95. Wu KJ, Kumar S, Serrano Negron YL, Harbison ST. Genotype influences day-to-day variability in sleep in Drosophila melanogaster. Sleep. 2018;41(2). pmid:29228366
- View Article
- PubMed/NCBI
- Google Scholar
96. Roessingh S, Rosing M, Marunova M, Ogueta M, George R, Lamaze A, et al. Temperature synchronization of the Drosophila circadian clock protein PERIOD is controlled by the TRPA channel PYREXIA. Communications Biology. 2019;2. pmid:31286063
- View Article
- PubMed/NCBI
- Google Scholar
97. Khoury S, Wang QP, Parisien M, Gris P, Bortsov AV, Linnstaedt SD, et al. Multi-ethnic GWAS and meta-analysis of sleep quality identify MPP6 as a novel gene that functions in sleep center neurons. Sleep. 2020;44.
- View Article
- Google Scholar
98. Lee J, Lim C, Han TH, Andreani T, Moye M, Curran J, et al. The E3 ubiquitin ligase adapter Tango10 links the core circadian clock to neuropeptide and behavioral rhythms. The Journal of Neuroscience. 2021;118.
- View Article
- Google Scholar
99. Pegoraro M, Sayegh Rezek E, Fishman B, Tauber E. Nucleotide variation in Drosophila cryptochrome is linked to circadian clock function: An association analysis. Frontiers in Physiology. 2022;13. pmid:35250608
- View Article
- PubMed/NCBI
- Google Scholar
100. Souto-Maior C, Serrano Negron YL, Harbison ST. Natural selection on sleep duration in Drosophila melanogaster. Scientific Reports. 2020;10. pmid:33244154
- View Article
- PubMed/NCBI
- Google Scholar
101. Li H, Janssens J, Waegeneer MD, Kolluru SS, Davie K, Gardeux V, et al. Fly Cell Atlas: A single-nucleus transcriptomic atlas of the adult fruit fly. Science. 2022;375(6584):eabk2432. pmid:35239393
- View Article
- PubMed/NCBI
- Google Scholar
102. Cao X, Jiang H. Building a platform for predicting functions of serine protease-related proteins in Drosophila melanogaster and other insects. Insect Biochemistry and Molecular Biology. 2018;103:53–69. pmid:30367934
- View Article
- PubMed/NCBI
- Google Scholar
103. Artiushin G, Zhang SL, Tricoire H, Sehgal A. Endocytosis at the Drosophila blood–brain barrier as a function for sleep. eLife. 2018;7:e43326. pmid:30475209
- View Article
- PubMed/NCBI
- Google Scholar
104. Williams JA, Sathyanarayanan S, Hendricks JC, Sehgal A. Interaction between sleep and the immune response in Drosophila: a role for the NFκB relish. Sleep. 2007;30(4):389–400. pmid:17520783
- View Article
- PubMed/NCBI
- Google Scholar
105. De Luca M, Klimentidis YC, Casazza K, Moses Chambers M, Cho R, Harbison ST, et al. A conserved role for syndecan family members in the regulation of whole-body energy metabolism. PloS one. 2010;5(6):e11286. pmid:20585652
- View Article
- PubMed/NCBI
- Google Scholar
106. Afonso DJ, Liu D, Machado DR, Pan H, Jepson JE, Rogulja D, et al. TARANIS functions with cyclin A and Cdk1 in a novel arousal center to control sleep in Drosophila. Current Biology. 2015;25(13):1717–1726. pmid:26096977
- View Article
- PubMed/NCBI
- Google Scholar
107. Glover Z, Hodges MD, Dravecz N, Cameron J, Askwith H, Shirras A, et al. Loss of angiotensin-converting enzyme-related (ACER) peptidase disrupts behavioural and metabolic responses to diet in Drosophila melanogaster. Journal of Experimental Biology. 2019;222(8):jeb194332. pmid:30940674
- View Article
- PubMed/NCBI
- Google Scholar
108. Álvarez-Rendón JP, Riesgo-Escovar JR. Circadian and rhythmic-related behavioral co-morbidities of the diabetic state in Drosophila melanogaster. General and Comparative Endocrinology. 2020;295:113477. pmid:32240709
- View Article
- PubMed/NCBI
- Google Scholar
109. Bushey D, Huber R, Tononi G, Cirelli C. Drosophila Hyperkinetic mutants have reduced sleep and impaired memory. Journal of Neuroscience. 2007;27(20):5384–5393. pmid:17507560
- View Article
- PubMed/NCBI
- Google Scholar
110. Bushey D, Tononi G, Cirelli C. The Drosophila fragile X mental retardation gene regulates sleep need. Journal of Neuroscience. 2009;29(7):1948–1961. pmid:19228950
- View Article
- PubMed/NCBI
- Google Scholar
111. Luo W, Chen WF, Yue Z, Chen D, Sowcik M, Sehgal A, et al. Old flies have a robust central oscillator but weaker behavioral rhythms that can be improved by genetic and environmental manipulations. Aging cell. 2012;11(3):428–438. pmid:22268765
- View Article
- PubMed/NCBI
- Google Scholar
112. Juneau BA, Stonemetz JM, Toma RF, Possidente DR, Heins RC, Vecsey CG. Optogenetic activation of short neuropeptide F (sNPF) neurons induces sleep in Drosophila melanogaster. Physiology & behavior. 2019;206:143–156. pmid:30935941
- View Article
- PubMed/NCBI
- Google Scholar
113. Huang H, Possidente DR, Vecsey CG. Optogenetic activation of SIFamide (SIFa) neurons induces a complex sleep-promoting effect in the fruit fly Drosophila melanogaster. Physiology & Behavior. 2021;239:113507. pmid:34175361
- View Article
- PubMed/NCBI
- Google Scholar
114. McDonald MJ, Rosbash M. Microarray Analysis and Organization of Circadian Gene Expression in Drosophila. Cell. 2001;107(5):567–578. pmid:11733057
- View Article
- PubMed/NCBI
- Google Scholar
115. Ceriani MF, Hogenesch JB, Yanovsky M, Panda S, Straume M, Kay SA. Genome-Wide Expression Analysis in DrosophilaReveals Genes Controlling Circadian Behavior. Journal of Neuroscience. 2002;22(21):9305–9319. pmid:12417656
- View Article
- PubMed/NCBI
- Google Scholar
116. Lin Y, Han M, Shimada B, Taghert PH. Influence of the period-dependent circadian clock on diurnal, circadian, and aperiodic gene expression in Drosophila melanogaster. Proc Natl Acad Sci USA. 2002;99(14):9562–9567. pmid:12089325
- View Article
- PubMed/NCBI
- Google Scholar
117. Ueda HR, Matsumoto A, Kawamura M, Iino M, Tanimura T, Hashimoto S. Genome-wide Transcriptional Orchestration of Circadian Rhythms in Drosophila. Journal of Biological Chemistry. 2002;277(16):14048–14052. pmid:11854264
- View Article
- PubMed/NCBI
- Google Scholar
118. Hughes ME, Grant GR, Paquin C, Qian J, Nitabach MN. Deep sequencing the circadian and diurnal transcriptome of Drosophila brain. Genome Research. 2012;22(7):1266–1281. pmid:22472103
- View Article
- PubMed/NCBI
- Google Scholar
119. Rodriquez J, Tang CHA, Khodor YL, Vodala S, Menet JS, Rosbash M. Nascent-Seq analysis of Drosophila cycling gene expression. Proc Natl Acad Sci USA. 2013;110(4):275–284.
- View Article
- Google Scholar
120. Kumar S, Tunc I, Tansey TR, Pirooznia M, Harbison ST. Identification of Genes Contributing to a Long Circadian Period in Drosophila Melanogaster. Journal of Biological Rhythms. 2021;36(3):239–253. pmid:33274675
- View Article
- PubMed/NCBI
- Google Scholar
121. Mackay TFC. Epistasis and quantitative traits: Using model organisms to study gene-gene interactions. Nature Reviews Genetics. 2014;15(1):22–33. pmid:24296533
- View Article
- PubMed/NCBI
- Google Scholar
122. DiFrisco J, Jaeger J. Genetic Causation in Complex Regulatory Systems: An Integrative Dynamic Perspective. BioEssays. 2020;42(6):1900226. pmid:32449193
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Berger RJ, Phillips NH. Energy conservation and sleep. Behav Brain Res. 1995;69:65–73. pmid:7546319
View Article
PubMed/NCBI
Google Scholar

[2] View Article

[3] PubMed/NCBI

[4] Google Scholar

[ref2] 2. Scharf MT, Naidoo N, Zimmerman JE, Pack AI. The energy hypothesis of sleep revisited. Progress in Neurobiology. 2008;86(3):264–280. pmid:18809461
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref3] 3. Schmidt MH. The energy allocation function of sleep: A unifying theory of sleep, torpor, and continuous wakefulness. Neuroscience & Biobehavioral Reviews. 2014;47:122–153. pmid:25117535
View Article
PubMed/NCBI
Google Scholar

[10] View Article

[11] PubMed/NCBI

[12] Google Scholar

[ref4] 4. Krueger JM, Obál F. A neuronal group theory of sleep function. Journal of Sleep Research. 1993;2(2):63–69. pmid:10607073
View Article
PubMed/NCBI
Google Scholar

[14] View Article

[15] PubMed/NCBI

[16] Google Scholar

[ref5] 5. Tononi G, Cirelli C. Sleep and the Price of Plasticity: From Synaptic and Cellular Homeostasis to Memory Consolidation and Integration. Neuron. 2014;81(1):12–34. pmid:24411729
View Article
PubMed/NCBI
Google Scholar

[18] View Article

[19] PubMed/NCBI

[20] Google Scholar

[ref6] 6. Joiner WJ. Unraveling the Evolutionary Determinants of Sleep. Current Biology. 2016;26(20):R1073–R1087. pmid:27780049
View Article
PubMed/NCBI
Google Scholar

[22] View Article

[23] PubMed/NCBI

[24] Google Scholar

[ref7] 7. Ly S, Pack AI, Naidoo N. The neurobiological basis of sleep: Insights from Drosophila. Neuroscience and Biobehavioral Reviews. 2018;87:67–86. pmid:29391183
View Article
PubMed/NCBI
Google Scholar

[26] View Article

[27] PubMed/NCBI

[28] Google Scholar

[ref8] 8. Xie L, Kang H, Xu Q, Chen MJ, Liao Y, Thiyagarajan M, et al. Sleep drives metabolite clearance from the adult brain. Science. 2013;342(6156):373–377. pmid:24136970
View Article
PubMed/NCBI
Google Scholar

[30] View Article

[31] PubMed/NCBI

[32] Google Scholar

[ref9] 9. Hill VM, O’Connor RM, Shirasu-Hiza M. Tired and stressed: Examining the need for sleep. European Journal of Neuroscience. 2020;51(1):494–508. pmid:30295966
View Article
PubMed/NCBI
Google Scholar

[34] View Article

[35] PubMed/NCBI

[36] Google Scholar

[ref10] 10. Harbison ST, Serrano Negron YL, Hansen NF, Lobell AS. Selection for long and short sleep duration in Drosophila melanogaster reveals the complex genetic network underlying natural variation in sleep. PLOS Genetics. 2017;13(12):e1007098. pmid:29240764
View Article
PubMed/NCBI
Google Scholar

[38] View Article

[39] PubMed/NCBI

[40] Google Scholar

[ref11] 11. Harbison ST, McCoy LJ, Mackay TFC. Genome-wide association study of sleep in Drosophila melanogaster. BMC Genomics. 2013;14(1):281. pmid:23617951
View Article
PubMed/NCBI
Google Scholar

[42] View Article

[43] PubMed/NCBI

[44] Google Scholar

[ref12] 12. Laing EE, Möller-Levet CS, Dijk DJ, Archer SN. Identifying and validating blood mRNA biomarkers for acute and chronic insufficient sleep in humans: A machine learning approach. Sleep. 2019;42(1):1–18. pmid:30247731
View Article
PubMed/NCBI
Google Scholar

[46] View Article

[47] PubMed/NCBI

[48] Google Scholar

[ref13] 13. Dashti HS, Jones SE, Wood AR, Lane JM, van Hees VT, Wang H, et al. Genome-wide association study identifies genetic loci for self-reported habitual sleep duration supported by accelerometer-derived estimates. Nature Communications. 2019;10(1). pmid:30846698
View Article
PubMed/NCBI
Google Scholar

[50] View Article

[51] PubMed/NCBI

[52] Google Scholar

[ref14] 14. Jones SE, Tyrrell J, Wood AR, Beaumont RN, Ruth KS, Tuke MA, et al. Genome-Wide Association Analyses in 128,266 Individuals Identifies New Morningness and Sleep Duration Loci. PLOS Genetics. 2016;12(8):e1006125. pmid:27494321
View Article
PubMed/NCBI
Google Scholar

[54] View Article

[55] PubMed/NCBI

[56] Google Scholar

[ref15] 15. Jansen PR, Watanabe K, Stringer S, Skene N, Bryois J, Hammerschlag AR, et al. Genome-wide analysis of insomnia in 1,331,010 individuals identifies new risk loci and functional pathways. Nature Genetics. 2019;51(3):394–403. pmid:30804565
View Article
PubMed/NCBI
Google Scholar

[58] View Article

[59] PubMed/NCBI

[60] Google Scholar

[ref16] 16. Lane JM, Jones SE, Dashti HS, Wood AR, Aragam KG, van Hees VT, et al. Biological and clinical insights from genetics of insomnia symptoms. Nature Genetics. 2019;51(3):387–393. pmid:30804566
View Article
PubMed/NCBI
Google Scholar

[62] View Article

[63] PubMed/NCBI

[64] Google Scholar

[ref17] 17. Hammerschlag AR, Stringer S, de Leeuw CA, Sniekers S, Taskesen E, Watanabe K, et al. Genome-wide association analysis of insomnia complaints identifies risk genes and genetic overlap with psychiatric and metabolic traits. Nature Genetics. 2017;49(11):1584–1592. pmid:28604731
View Article
PubMed/NCBI
Google Scholar

[66] View Article

[67] PubMed/NCBI

[68] Google Scholar

[ref18] 18. Diessler S, Jan M, Emmenegger Y, Guex N, Middleton B, Skene DJ, et al. A systems genetics resource and analysis of sleep regulation in the mouse. PLoS Biology. 2018;16(8). pmid:30091978
View Article
PubMed/NCBI
Google Scholar

[70] View Article

[71] PubMed/NCBI

[72] Google Scholar

[ref19] 19. Joshi SS, Sethi M, Striz M, Cole N, Denegre JM, Ryan J, et al. Noninvasive sleep monitoring in large-scale screening of knock-out mice reveals novel sleep-related genes. bioRxiv. 2019; 517680.
View Article
Google Scholar

[74] View Article

[75] Google Scholar

[ref20] 20. Boyle EA, Li YI, Pritchard JK. An Expanded View of Complex Traits: From Polygenic to Omnigenic. Cell. 2017;169(7):1177–1186. pmid:28622505
View Article
PubMed/NCBI
Google Scholar

[77] View Article

[78] PubMed/NCBI

[79] Google Scholar

[ref21] 21. Schlötterer C, Kofler R, Versace E, Tobler R, Franssen SU. Combining experimental evolution with next-generation sequencing: a powerful tool to study adaptation from standing genetic variation. Heredity. 2015;114(5):431–440. pmid:25269380
View Article
PubMed/NCBI
Google Scholar

[81] View Article

[82] PubMed/NCBI

[83] Google Scholar

[ref22] 22. Faria VG, Martins NE, Paulo T, Teixeira L, Sucena É, Magalhães S. Evolution of Drosophila resistance against different pathogens and infection routes entails no detectable maintenance costs. Evolution. 2015;69(11):2799–2809. pmid:26496003
View Article
PubMed/NCBI
Google Scholar

[85] View Article

[86] PubMed/NCBI

[87] Google Scholar

[ref23] 23. Faria VG, Martins NE, Magalhães S, Paulo TF, Nolte V, Schlötterer C, et al. Drosophila Adaptation to Viral Infection through Defensive Symbiont Evolution. PLoS Genetics. 2016;12(9):1–18. pmid:27684942
View Article
PubMed/NCBI
Google Scholar

[89] View Article

[90] PubMed/NCBI

[91] Google Scholar

[ref24] 24. Pegoraro M, Flavell LMM, Menegazzi P, Colombi P, Dao P, Helfrich-Forster C, et al. The genetic basis of diurnal preference in Drosophila melanogaster. BMC Genomics. 2020;21(1). pmid:32862827
View Article
PubMed/NCBI
Google Scholar

[93] View Article

[94] PubMed/NCBI

[95] Google Scholar

[ref25] 25. Brown EB, Patterson C, Pancoast R, Rollmann SM. Artificial selection for odor-guided behavior in Drosophila reveals changes in food consumption. BMC Genomics. 2017;18(1):1–13. pmid:29132294
View Article
PubMed/NCBI
Google Scholar

[97] View Article

[98] PubMed/NCBI

[99] Google Scholar

[ref26] 26. Brown EB, Layne JE, Elchert AR, Rollmann SM. Behavioral and transcriptional response to selection for olfactory behavior in Drosophila. G3: Genes, Genomes, Genetics. 2020;10(4):1283–1296. pmid:32024668
View Article
PubMed/NCBI
Google Scholar

[101] View Article

[102] PubMed/NCBI

[103] Google Scholar

[ref27] 27. Garlapow ME, Everett LJ, Zhou S, Gearhart AW, Fay KA, Huang W, et al. Genetic and Genomic Response to Selection for Food Consumption in Drosophila melanogaster. Behavior Genetics. 2017;47(2):227–243. pmid:27704301
View Article
PubMed/NCBI
Google Scholar

[105] View Article

[106] PubMed/NCBI

[107] Google Scholar

[ref28] 28. Mackay TFC, Heinsohn SL, Lyman RF, Moehring AJ, Morgan TJ, Rollmann SM. Genetics and genomics of Drosophila mating behavior. Proceedings of the National Academy of Sciences. 2005;102(Supplement 1):6622–6629. pmid:15851659
View Article
PubMed/NCBI
Google Scholar

[109] View Article

[110] PubMed/NCBI

[111] Google Scholar

[ref29] 29. Wertheim B, Kraaijeveld AR, Hopkins MG, Walther Boer M, Godfray HCJ. Functional genomics of the evolution of increased resistance to parasitism in Drosophila. Molecular Ecology. 2011;20(5):932–949. pmid:21062384
View Article
PubMed/NCBI
Google Scholar

[113] View Article

[114] PubMed/NCBI

[115] Google Scholar

[ref30] 30. Telonis-Scott M, Hallas R, McKechnie SW, Wee CW, Hoffmann AA. Selection for cold resistance alters gene transcript levels in Drosophila melanogaster. Journal of Insect Physiology. 2009;55(6):549–555. pmid:19232407
View Article
PubMed/NCBI
Google Scholar

[117] View Article

[118] PubMed/NCBI

[119] Google Scholar

[ref31] 31. Sørensen JG, Nielsen MM, Loeschcke V. Gene expression profile analysis of Drosophila melanogaster selected for resistance to environmental stressors. Journal of Evolutionary Biology. 2007;20(4):1624–1636. pmid:17584255
View Article
PubMed/NCBI
Google Scholar

[121] View Article

[122] PubMed/NCBI

[123] Google Scholar

[ref32] 32. Morozova TV, Anholt RRH, Mackay TFC. Phenotypic and transcriptional response to selection for alcohol sensitivity in Drosophila melanogaster. Genome Biology. 2007;8(10):1–15. pmid:17973985
View Article
PubMed/NCBI
Google Scholar

[125] View Article

[126] PubMed/NCBI

[127] Google Scholar

[ref33] 33. Edwards AC, Rollmann SM, Morgan TJ, Mackay TFC. Quantitative Genomics of Aggressive Behavior in Drosophila melanogaster. PLoS Genetics. 2006;2(9):1386–1395. pmid:17044737
View Article
PubMed/NCBI
Google Scholar

[129] View Article

[130] PubMed/NCBI

[131] Google Scholar

[ref34] 34. Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9:559. pmid:19114008
View Article
PubMed/NCBI
Google Scholar

[133] View Article

[134] PubMed/NCBI

[135] Google Scholar

[ref35] 35. Stone EA, Ayroles JF. Modulated Modularity Clustering as an Exploratory tool for functional genomic inference. PLoS Genetics. 2009;5(5):e1000479. pmid:19424432
View Article
PubMed/NCBI
Google Scholar

[137] View Article

[138] PubMed/NCBI

[139] Google Scholar

[ref36] 36. Emmert-Streib F, Glazko GV, Altay G, dM Simoes R. Statistical inference and reverse engineering of gene regulatory networks from observational expression data. Frontiers in Genetics. 2012;3(FEB):1–15. pmid:22408642
View Article
PubMed/NCBI
Google Scholar

[141] View Article

[142] PubMed/NCBI

[143] Google Scholar

[ref37] 37. Villaverde AF, Banga JR. Reverse engineering and identification in systems biology: strategies, perspectives and challenges. Journal of The Royal Society Interface. 2014;11(91):20130505. pmid:24307566
View Article
PubMed/NCBI
Google Scholar

[145] View Article

[146] PubMed/NCBI

[147] Google Scholar

[ref38] 38. Liu ZP. Reverse Engineering of Genome-wide Gene Regulatory Networks from Gene Expression Data. Current Genomics. 2015;16(1):3–22. pmid:25937810
View Article
PubMed/NCBI
Google Scholar

[149] View Article

[150] PubMed/NCBI

[151] Google Scholar

[ref39] 39. Rasmussen CE, Williams CKI. Gaussian Processes for Machine Learning. The MIT Press; 2006.

[ref40] 40. Schulz E, Speekenbrink M, Krause A. A tutorial on Gaussian process regression: Modelling, exploring, and exploiting functions. Journal of Mathematical Psychology. 2018.
View Article
Google Scholar

[154] View Article

[155] Google Scholar

[ref41] 41. Yang S, Wong SWK, Kou SC. Inference of dynamic systems from noisy and sparse data via manifold-constrained Gaussian Processes. Proc Natl Acad Sci USA. 2021;118(15):e2020397118. pmid:33837150
View Article
PubMed/NCBI
Google Scholar

[157] View Article

[158] PubMed/NCBI

[159] Google Scholar

[ref42] 42. Liu W, Niranjan N. Gaussian process modelling for bicoid mRNA regulation in spatio-temporal Bicoid profile. Bioinformatics. 2012;28(3):366–372. pmid:22130592
View Article
PubMed/NCBI
Google Scholar

[161] View Article

[162] PubMed/NCBI

[163] Google Scholar

[ref43] 43. Äijö T, Granberg K, Lähdesmäki H. Sorad: a systems biology approach to predict and modulate dynamic signaling pathway response from phosphoproteome time-course measurements. Bioinformatics. 2013;29(10):1283–1291. pmid:23505293
View Article
PubMed/NCBI
Google Scholar

[165] View Article

[166] PubMed/NCBI

[167] Google Scholar

[ref44] 44. Aalto A, Viitasaari L, Ilmonen P, Mombaerts L, Gonçalves J. Gene regulatory network inference from sparsely sampled noisy data. Nature Communications. 2020;11(1). pmid:32661225
View Article
PubMed/NCBI
Google Scholar

[169] View Article

[170] PubMed/NCBI

[171] Google Scholar

[ref45] 45. Gao P, Honkela A, Rattray M, Lawrence ND. Gaussian process modelling of latent chemical species: applications to inferring transcription factor activities. Bioinformatics. 2008;24:i70–i75. pmid:18689843
View Article
PubMed/NCBI
Google Scholar

[173] View Article

[174] PubMed/NCBI

[175] Google Scholar

[ref46] 46. Honkela A, Girardot C, Gustafson EH, Liu YH, Furlong EEM, Lawrence ND, et al. Model-based method for transcription factor target identification with limited data. Proceedings of the National Academy of Sciences. 2010;107(17):7793–7798. pmid:20385836
View Article
PubMed/NCBI
Google Scholar

[177] View Article

[178] PubMed/NCBI

[179] Google Scholar

[ref47] 47. Hensman J, Lawrence ND, Rattray M. Hierarchical Bayesian modelling of gene expression time series across irregularly sampled replicates and clusters. BMC Bioinformatics. 2013;14:252–264. pmid:23962281
View Article
PubMed/NCBI
Google Scholar

[181] View Article

[182] PubMed/NCBI

[183] Google Scholar

[ref48] 48. Arnol D, Schapiro D, Bodenmiller B, Saez-Rodriguez J, Stegle O. Modeling Cell-Cell Interactions from Spatial Molecular Data with Spatial Variance Component Analysis. Cell Reports. 2019;29(1):202–211.e6. pmid:31577949
View Article
PubMed/NCBI
Google Scholar

[185] View Article

[186] PubMed/NCBI

[187] Google Scholar

[ref49] 49. McDowell IC, Manandhar D, Vockley CM, Schmid AK, Reddy TE, Engelhardt BE. Clustering gene expression time series data using an infinite Gaussian process mixture model. PLoS Computational Biology. 2018;14(1):1–27. pmid:29337990
View Article
PubMed/NCBI
Google Scholar

[189] View Article

[190] PubMed/NCBI

[191] Google Scholar

[ref50] 50. Melkumyan A, Ramos F. Multi-kernel Gaussian processes. In: IJCAI International Joint Conference on Artificial Intelligence; 2011. p. 1408–1413.

[ref51] 51. Bonilla EV, Chai KMA, Williams CKI. Multi-task Gaussian Process prediction. In: Advances in Neural Information Processing Systems 20. NIPS Foundation; 2008. p. 153–160.

[ref52] 52. Bahg G, Evans DG, Galdo M, Turner BM. Gaussian process linking functions for mind, brain, and behavior. Proceedings of the National Academy of Sciences of the United States of America. 2020;117(47):29398–29406. pmid:33229563
View Article
PubMed/NCBI
Google Scholar

[195] View Article

[196] PubMed/NCBI

[197] Google Scholar

[ref53] 53. Velten B, Braunger JM, Argelaguet R, Arnol D, Wirbel J, Bredikhin D, et al. Identifying temporal and spatial patterns of variation from multi- modal data using MEFISTO. Nature Methods. 2022;19:179–186. pmid:35027765
View Article
PubMed/NCBI
Google Scholar

[199] View Article

[200] PubMed/NCBI

[201] Google Scholar

[ref54] 54. Mackay TFC, Richards S, Stone EA, Barbadilla A, Ayroles JF, Zhu D, et al. The Drosophila melanogaster Genetic Reference Panel. Nature. 2012;482(7384):173–178. pmid:22318601
View Article
PubMed/NCBI
Google Scholar

[203] View Article

[204] PubMed/NCBI

[205] Google Scholar

[ref55] 55. Huang W, Massouras A, Inoue Y, Peiffer J, Ramia M, Tarone AM, et al. Natural variation in genome architecture among 205 Drosophila melanogaster Genetic Reference Panel lines. Genome Research. 2014;24(7):1193–1208. pmid:24714809
View Article
PubMed/NCBI
Google Scholar

[207] View Article

[208] PubMed/NCBI

[209] Google Scholar

[ref56] 56. Serrano Negron YL, Hansen NF, Harbison ST. The Sleep Inbred Panel, a Collection of Inbred Drosophila melanogaster with Extreme Long and Short Sleep Duration. G3: Genes—Genomes—Genetics. 2018;8(9):2865–2873. pmid:29991508
View Article
PubMed/NCBI
Google Scholar

[211] View Article

[212] PubMed/NCBI

[213] Google Scholar

[ref57] 57. Ganguly-Fitzgerald I, Donlea J, Shaw PJ. Waking Experience Affects Sleep Need in Drosophila. Science. 2006;313(5794):1775–1781. pmid:16990546
View Article
PubMed/NCBI
Google Scholar

[215] View Article

[216] PubMed/NCBI

[217] Google Scholar

[ref58] 58. Mackay TF, Lyman RF. Drosophila bristles and the nature of quantitative genetic variation. Philosophical Transactions of the Royal Society B: Biological Sciences. 2005;360(1459):1513–1527. pmid:16108138
View Article
PubMed/NCBI
Google Scholar

[219] View Article

[220] PubMed/NCBI

[221] Google Scholar

[ref59] 59. Falconer DS, Mackay TFC. Introduction to Quantitative Genetics, Fourth Edition. Addison Wesley Longman Limited; 1996.

[ref60] 60. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: Ultrafast universal RNA-seq aligner. Bioinformatics. 2013. pmid:23104886
View Article
PubMed/NCBI
Google Scholar

[224] View Article

[225] PubMed/NCBI

[226] Google Scholar

[ref61] 61. Anders S, Pyl PT, Huber W. HTSeq-A Python framework to work with high-throughput sequencing data. Bioinformatics. 2015. pmid:25260700
View Article
PubMed/NCBI
Google Scholar

[228] View Article

[229] PubMed/NCBI

[230] Google Scholar

[ref62] 62. Jin W, Riley RM, Wolfinger RD, White KP, Passador-Gurgel G, Gibson G. The contributions of sex, genotype and age to transcriptional variance in Drosophila melanogaster. Nature Genetics. 2001;29(4):389–395. pmid:11726925
View Article
PubMed/NCBI
Google Scholar

[232] View Article

[233] PubMed/NCBI

[234] Google Scholar

[ref63] 63. Arbeitman MN, Furlong EEM, Imam F, Johnson E, Null BH, Baker BS, et al. Gene Expression During the Life Cycle of Drosophila melanogaster. Science. 2002;297(5590):2270–2275. pmid:12351791
View Article
PubMed/NCBI
Google Scholar

[236] View Article

[237] PubMed/NCBI

[238] Google Scholar

[ref64] 64. Parisi M, Nuttall R, Naiman D, Bouffard G, Malley J, Andrews J, et al. Paucity of Genes on the Drosophila X Chromosome Showing Male-Biased Expression. Science. 2003;299(5607):697–700. pmid:12511656
View Article
PubMed/NCBI
Google Scholar

[240] View Article

[241] PubMed/NCBI

[242] Google Scholar

[ref65] 65. Ranz JM, Castillo-Davis CI, Meiklejohn CD, Hartl DL. Sex-dependent gene expression and evolution of the Drosophila transcriptome. Science. 2003;300(5626):1742–1745. pmid:12805547
View Article
PubMed/NCBI
Google Scholar

[244] View Article

[245] PubMed/NCBI

[246] Google Scholar

[ref66] 66. Harbison ST, Chang S, Kamdar KP, Mackay TFC. Quantitative genomics of starvation stress resistance in Drosophila. Genome biology. 2005;6:R36. pmid:15833123
View Article
PubMed/NCBI
Google Scholar

[248] View Article

[249] PubMed/NCBI

[250] Google Scholar

[ref67] 67. Wayne ML, Telonis-Scott M, Bono LM, Harshman L, Kopp A, Nuzhdin SV, et al. Simpler mode of inheritance of transcriptional variation in male Drosophila melanogaster. Proceedings of the National Academy of Sciences. 2007;104(47):18577–18582. pmid:18003923
View Article
PubMed/NCBI
Google Scholar

[252] View Article

[253] PubMed/NCBI

[254] Google Scholar

[ref68] 68. Zhang Y, Sturgill D, Parisi M, Kumar S, Oliver B. Constraint and turnover in sex-biased gene expression in the genus Drosophila. Nature. 2007;450(7167):233–237. pmid:17994089
View Article
PubMed/NCBI
Google Scholar

[256] View Article

[257] PubMed/NCBI

[258] Google Scholar

[ref69] 69. Ayroles JF, Carbone MA, Stone EA, Jordan KW, Lyman RF, Magwire MM, et al. Systems genetics of complex traits in Drosophila melanogaster. Nature Genetics. 2009;41(3):299–307. pmid:19234471
View Article
PubMed/NCBI
Google Scholar

[260] View Article

[261] PubMed/NCBI

[262] Google Scholar

[ref70] 70. Huylmans AK, Parsch J. Population- and Sex-Biased Gene Expression in the Excretion Organs of Drosophila melanogaster. G3: Genes—Genomes—Genetics. 2014;4(12):2307–2315. pmid:25246242
View Article
PubMed/NCBI
Google Scholar

[264] View Article

[265] PubMed/NCBI

[266] Google Scholar

[ref71] 71. Huang W, Carbone MA, Magwire MM, Peiffer JA, Lyman RF, Stone EA, et al. Genetic basis of transcriptome diversity in Drosophila melanogaster. Proceedings of the National Academy of Sciences. 2015;112(44):E6010–E6019. pmid:26483487
View Article
PubMed/NCBI
Google Scholar

[268] View Article

[269] PubMed/NCBI

[270] Google Scholar

[ref72] 72. Lin Y, Chen ZX, Oliver B, Harbison ST. Microenvironmental gene expression plasticity among individual drosophila melanogaster. G3: Genes, Genomes, Genetics. 2016;6(12):4197–4210. pmid:27770026
View Article
PubMed/NCBI
Google Scholar

[272] View Article

[273] PubMed/NCBI

[274] Google Scholar

[ref73] 73. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biology. 2014;15(12):550. pmid:25516281
View Article
PubMed/NCBI
Google Scholar

[276] View Article

[277] PubMed/NCBI

[278] Google Scholar

[ref74] 74. Zhang Y, Malone JH, Powell SK, Periwal V, Spana E, MacAlpine DM, et al. Expression in Aneuploid Drosophila S2 Cells. PLoS Biology. 2010;8(2):e1000320. pmid:20186269
View Article
PubMed/NCBI
Google Scholar

[280] View Article

[281] PubMed/NCBI

[282] Google Scholar

[ref75] 75. Gelman A, Carlin JB, Stern HS, Dunson DB, Vehtari A, Rubin DB. Bayesian data analysis, third edition. CRC Press; 2013.

[ref76] 76. Carpenter B, Gelman A, Hoffman MD, Lee D, Goodrich B, Betancourt M, et al. Stan: A Probabilistic Programming Language. Journal of Statistical Software. 2017;76(1). pmid:36568334
View Article
PubMed/NCBI
Google Scholar

[285] View Article

[286] PubMed/NCBI

[287] Google Scholar

[ref77] 77. Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society: Series B (Methodological). 1995;57(1):289–300.
View Article
Google Scholar

[289] View Article

[290] Google Scholar

[ref78] 78. Ruscio J. Constructing confidence intervals for Spearman’s rank correlation with ordinal data: A simulation study comparing analytic and bootstrap methods. Journal of Modern Applied Statistical Methods. 2008;7(2):416–434.
View Article
Google Scholar

[292] View Article

[293] Google Scholar

[ref79] 79. Austin PC, Hux JE. A brief note on overlapping confidence intervals. Journal of Vascular Surgery. 2002;36(1):194–195. pmid:12096281
View Article
PubMed/NCBI
Google Scholar

[295] View Article

[296] PubMed/NCBI

[297] Google Scholar

[ref80] 80. MacKay DJ. Information theory, inference and learning algorithms. Cambridge university press; 2003.

[ref81] 81. Bishop CM. Pattern recognition and machine learning. Springer; 2006.

[ref82] 82. Kontio JAJ, Sillanpää MJ. Scalable Nonparametric Prescreening Method for Searching Higher-Order Genetic Interactions Underlying Quantitative Traits. Genetics. 2019;213(4):1209–1224. pmid:31585953
View Article
PubMed/NCBI
Google Scholar

[301] View Article

[302] PubMed/NCBI

[303] Google Scholar

[ref83] 83. Harbison ST, Sehgal A. Quantitative Genetic Analysis of Sleep in Drosophila melanogaster. Genetics. 2008;178(4):2341–2360. pmid:18430954
View Article
PubMed/NCBI
Google Scholar

[305] View Article

[306] PubMed/NCBI

[307] Google Scholar

[ref84] 84. Harbison ST, Carbone MA, Ayroles JF, Stone EA, Lyman RF, Mackay TFC. Co-regulated transcriptional networks contribute to natural genetic variation in Drosophila sleep. Nature Genetics. 2009. pmid:19234472
View Article
PubMed/NCBI
Google Scholar

[309] View Article

[310] PubMed/NCBI

[311] Google Scholar

[ref85] 85. Murali T, Pacifico S, Yu J, Guest S, Roberts GG, Finley RL. DroID 2011: a comprehensive, integrated resource for protein, transcription factor, RNA and gene interactions for Drosophila. Nucleic Acids Research. 2011;39(suppl_1):D736–D743. pmid:21036869
View Article
PubMed/NCBI
Google Scholar

[313] View Article

[314] PubMed/NCBI

[315] Google Scholar

[ref86] 86. Harbison ST, Kumar S, Huang W, McCoy LJ, Smith KR, Mackay TFC. Genome-Wide Association Study of Circadian Behavior in Drosophila melanogaster. Behavior Genetics. 2019;49(1):60–82. pmid:30341464
View Article
PubMed/NCBI
Google Scholar

[317] View Article

[318] PubMed/NCBI

[319] Google Scholar

[ref87] 87. Thimgan MS, Suzuki Y, Seugnet L, Gottschalk L, Shaw PJ. The Perilipin homologue, Lipid Storage Droplet 2, regulates sleep homeostasis and prevents learning impairments following sleep loss. PLoS Biology. 2010;8. pmid:20824166
View Article
PubMed/NCBI
Google Scholar

[321] View Article

[322] PubMed/NCBI

[323] Google Scholar

[ref88] 88. Thimgan MS, Kress N, Lisse J, Fiebelman C, Hilderbrand T. The acyl-CoA synthetase, pudgy, promotes sleep and is required for the homeostatic response to sleep deprivation. Frontiers in Endocrinology. 2018;9. pmid:30186232
View Article
PubMed/NCBI
Google Scholar

[325] View Article

[326] PubMed/NCBI

[327] Google Scholar

[ref89] 89. He C, Yang Y, Zhang M, Price JL, Zhao Z. Regulation of sleep by Neuropeptide-Y-like system in Drosophila melanogaster. PLoS ONE. 2013;8. pmid:24040211
View Article
PubMed/NCBI
Google Scholar

[329] View Article

[330] PubMed/NCBI

[331] Google Scholar

[ref90] 90. Mallon EB, Alghamdi A, Holdbrook RTK, Rosato E. Immune stimulation reduces sleep and memory ability in Drosophila melanogaster. PeerJ. 2014. pmid:24949247
View Article
PubMed/NCBI
Google Scholar

[333] View Article

[334] PubMed/NCBI

[335] Google Scholar

[ref91] 91. Dissel S, Seugnet L, Thimgan MS, Silverman N, Angadi V, Thacher PV, et al. Differential activation of immune factors in neurons and glia contribute to individual differences in resilience/vulnerability to sleep disruption. Brain, Behavior, and Immunity. 2015;47:75–85. pmid:25451614
View Article
PubMed/NCBI
Google Scholar

[337] View Article

[338] PubMed/NCBI

[339] Google Scholar

[ref92] 92. Seugnet L, Dissel S, Thimgan M, Cao L, Shaw PJ. Identification of genes that maintain behavioral and structural plasticity during sleep loss. Frontiers in Neural Circuits. 2017;11. pmid:29109678
View Article
PubMed/NCBI
Google Scholar

[341] View Article

[342] PubMed/NCBI

[343] Google Scholar

[ref93] 93. Feng G, Zhang J, Li M, Shao L, Yang L, Song Q, et al. Control of sleep onset by Shal/Kv4 channels in Drosophila circadian neurons. The Journal of Neuroscience. 2018;38.
View Article
Google Scholar

[345] View Article

[346] Google Scholar

[ref94] 94. Shalaby NA, Pinzon JH, Narayanan AS, Jin EJ, Ritz MP, Dove RJ, et al. JmjC domain proteins modulate circadian behaviors and sleep in Drosophila. Scientific Reports. 2018;8. pmid:29339751
View Article
PubMed/NCBI
Google Scholar

[348] View Article

[349] PubMed/NCBI

[350] Google Scholar

[ref95] 95. Wu KJ, Kumar S, Serrano Negron YL, Harbison ST. Genotype influences day-to-day variability in sleep in Drosophila melanogaster. Sleep. 2018;41(2). pmid:29228366
View Article
PubMed/NCBI
Google Scholar

[352] View Article

[353] PubMed/NCBI

[354] Google Scholar

[ref96] 96. Roessingh S, Rosing M, Marunova M, Ogueta M, George R, Lamaze A, et al. Temperature synchronization of the Drosophila circadian clock protein PERIOD is controlled by the TRPA channel PYREXIA. Communications Biology. 2019;2. pmid:31286063
View Article
PubMed/NCBI
Google Scholar

[356] View Article

[357] PubMed/NCBI

[358] Google Scholar

[ref97] 97. Khoury S, Wang QP, Parisien M, Gris P, Bortsov AV, Linnstaedt SD, et al. Multi-ethnic GWAS and meta-analysis of sleep quality identify MPP6 as a novel gene that functions in sleep center neurons. Sleep. 2020;44.
View Article
Google Scholar

[360] View Article

[361] Google Scholar

[ref98] 98. Lee J, Lim C, Han TH, Andreani T, Moye M, Curran J, et al. The E3 ubiquitin ligase adapter Tango10 links the core circadian clock to neuropeptide and behavioral rhythms. The Journal of Neuroscience. 2021;118.
View Article
Google Scholar

[363] View Article

[364] Google Scholar

[ref99] 99. Pegoraro M, Sayegh Rezek E, Fishman B, Tauber E. Nucleotide variation in Drosophila cryptochrome is linked to circadian clock function: An association analysis. Frontiers in Physiology. 2022;13. pmid:35250608
View Article
PubMed/NCBI
Google Scholar

[366] View Article

[367] PubMed/NCBI

[368] Google Scholar

[ref100] 100. Souto-Maior C, Serrano Negron YL, Harbison ST. Natural selection on sleep duration in Drosophila melanogaster. Scientific Reports. 2020;10. pmid:33244154
View Article
PubMed/NCBI
Google Scholar

[370] View Article

[371] PubMed/NCBI

[372] Google Scholar

[ref101] 101. Li H, Janssens J, Waegeneer MD, Kolluru SS, Davie K, Gardeux V, et al. Fly Cell Atlas: A single-nucleus transcriptomic atlas of the adult fruit fly. Science. 2022;375(6584):eabk2432. pmid:35239393
View Article
PubMed/NCBI
Google Scholar

[374] View Article

[375] PubMed/NCBI

[376] Google Scholar

[ref102] 102. Cao X, Jiang H. Building a platform for predicting functions of serine protease-related proteins in Drosophila melanogaster and other insects. Insect Biochemistry and Molecular Biology. 2018;103:53–69. pmid:30367934
View Article
PubMed/NCBI
Google Scholar

[378] View Article

[379] PubMed/NCBI

[380] Google Scholar

[ref103] 103. Artiushin G, Zhang SL, Tricoire H, Sehgal A. Endocytosis at the Drosophila blood–brain barrier as a function for sleep. eLife. 2018;7:e43326. pmid:30475209
View Article
PubMed/NCBI
Google Scholar

[382] View Article

[383] PubMed/NCBI

[384] Google Scholar

[ref104] 104. Williams JA, Sathyanarayanan S, Hendricks JC, Sehgal A. Interaction between sleep and the immune response in Drosophila: a role for the NFκB relish. Sleep. 2007;30(4):389–400. pmid:17520783
View Article
PubMed/NCBI
Google Scholar

[386] View Article

[387] PubMed/NCBI

[388] Google Scholar

[ref105] 105. De Luca M, Klimentidis YC, Casazza K, Moses Chambers M, Cho R, Harbison ST, et al. A conserved role for syndecan family members in the regulation of whole-body energy metabolism. PloS one. 2010;5(6):e11286. pmid:20585652
View Article
PubMed/NCBI
Google Scholar

[390] View Article

[391] PubMed/NCBI

[392] Google Scholar

[ref106] 106. Afonso DJ, Liu D, Machado DR, Pan H, Jepson JE, Rogulja D, et al. TARANIS functions with cyclin A and Cdk1 in a novel arousal center to control sleep in Drosophila. Current Biology. 2015;25(13):1717–1726. pmid:26096977
View Article
PubMed/NCBI
Google Scholar

[394] View Article

[395] PubMed/NCBI

[396] Google Scholar

[ref107] 107. Glover Z, Hodges MD, Dravecz N, Cameron J, Askwith H, Shirras A, et al. Loss of angiotensin-converting enzyme-related (ACER) peptidase disrupts behavioural and metabolic responses to diet in Drosophila melanogaster. Journal of Experimental Biology. 2019;222(8):jeb194332. pmid:30940674
View Article
PubMed/NCBI
Google Scholar

[398] View Article

[399] PubMed/NCBI

[400] Google Scholar

[ref108] 108. Álvarez-Rendón JP, Riesgo-Escovar JR. Circadian and rhythmic-related behavioral co-morbidities of the diabetic state in Drosophila melanogaster. General and Comparative Endocrinology. 2020;295:113477. pmid:32240709
View Article
PubMed/NCBI
Google Scholar

[402] View Article

[403] PubMed/NCBI

[404] Google Scholar

[ref109] 109. Bushey D, Huber R, Tononi G, Cirelli C. Drosophila Hyperkinetic mutants have reduced sleep and impaired memory. Journal of Neuroscience. 2007;27(20):5384–5393. pmid:17507560
View Article
PubMed/NCBI
Google Scholar

[406] View Article

[407] PubMed/NCBI

[408] Google Scholar

[ref110] 110. Bushey D, Tononi G, Cirelli C. The Drosophila fragile X mental retardation gene regulates sleep need. Journal of Neuroscience. 2009;29(7):1948–1961. pmid:19228950
View Article
PubMed/NCBI
Google Scholar

[410] View Article

[411] PubMed/NCBI

[412] Google Scholar

[ref111] 111. Luo W, Chen WF, Yue Z, Chen D, Sowcik M, Sehgal A, et al. Old flies have a robust central oscillator but weaker behavioral rhythms that can be improved by genetic and environmental manipulations. Aging cell. 2012;11(3):428–438. pmid:22268765
View Article
PubMed/NCBI
Google Scholar

[414] View Article

[415] PubMed/NCBI

[416] Google Scholar

[ref112] 112. Juneau BA, Stonemetz JM, Toma RF, Possidente DR, Heins RC, Vecsey CG. Optogenetic activation of short neuropeptide F (sNPF) neurons induces sleep in Drosophila melanogaster. Physiology & behavior. 2019;206:143–156. pmid:30935941
View Article
PubMed/NCBI
Google Scholar

[418] View Article

[419] PubMed/NCBI

[420] Google Scholar

[ref113] 113. Huang H, Possidente DR, Vecsey CG. Optogenetic activation of SIFamide (SIFa) neurons induces a complex sleep-promoting effect in the fruit fly Drosophila melanogaster. Physiology & Behavior. 2021;239:113507. pmid:34175361
View Article
PubMed/NCBI
Google Scholar

[422] View Article

[423] PubMed/NCBI

[424] Google Scholar

[ref114] 114. McDonald MJ, Rosbash M. Microarray Analysis and Organization of Circadian Gene Expression in Drosophila. Cell. 2001;107(5):567–578. pmid:11733057
View Article
PubMed/NCBI
Google Scholar

[426] View Article

[427] PubMed/NCBI

[428] Google Scholar

[ref115] 115. Ceriani MF, Hogenesch JB, Yanovsky M, Panda S, Straume M, Kay SA. Genome-Wide Expression Analysis in DrosophilaReveals Genes Controlling Circadian Behavior. Journal of Neuroscience. 2002;22(21):9305–9319. pmid:12417656
View Article
PubMed/NCBI
Google Scholar

[430] View Article

[431] PubMed/NCBI

[432] Google Scholar

[ref116] 116. Lin Y, Han M, Shimada B, Taghert PH. Influence of the period-dependent circadian clock on diurnal, circadian, and aperiodic gene expression in Drosophila melanogaster. Proc Natl Acad Sci USA. 2002;99(14):9562–9567. pmid:12089325
View Article
PubMed/NCBI
Google Scholar

[434] View Article

[435] PubMed/NCBI

[436] Google Scholar

[ref117] 117. Ueda HR, Matsumoto A, Kawamura M, Iino M, Tanimura T, Hashimoto S. Genome-wide Transcriptional Orchestration of Circadian Rhythms in Drosophila. Journal of Biological Chemistry. 2002;277(16):14048–14052. pmid:11854264
View Article
PubMed/NCBI
Google Scholar

[438] View Article

[439] PubMed/NCBI

[440] Google Scholar

[ref118] 118. Hughes ME, Grant GR, Paquin C, Qian J, Nitabach MN. Deep sequencing the circadian and diurnal transcriptome of Drosophila brain. Genome Research. 2012;22(7):1266–1281. pmid:22472103
View Article
PubMed/NCBI
Google Scholar

[442] View Article

[443] PubMed/NCBI

[444] Google Scholar

[ref119] 119. Rodriquez J, Tang CHA, Khodor YL, Vodala S, Menet JS, Rosbash M. Nascent-Seq analysis of Drosophila cycling gene expression. Proc Natl Acad Sci USA. 2013;110(4):275–284.
View Article
Google Scholar

[446] View Article

[447] Google Scholar

[ref120] 120. Kumar S, Tunc I, Tansey TR, Pirooznia M, Harbison ST. Identification of Genes Contributing to a Long Circadian Period in Drosophila Melanogaster. Journal of Biological Rhythms. 2021;36(3):239–253. pmid:33274675
View Article
PubMed/NCBI
Google Scholar

[449] View Article

[450] PubMed/NCBI

[451] Google Scholar

[ref121] 121. Mackay TFC. Epistasis and quantitative traits: Using model organisms to study gene-gene interactions. Nature Reviews Genetics. 2014;15(1):22–33. pmid:24296533
View Article
PubMed/NCBI
Google Scholar

[453] View Article

[454] PubMed/NCBI

[455] Google Scholar

[ref122] 122. DiFrisco J, Jaeger J. Genetic Causation in Complex Regulatory Systems: An Integrative Dynamic Perspective. BioEssays. 2020;42(6):1900226. pmid:32449193
View Article
PubMed/NCBI
Google Scholar

[457] View Article

[458] PubMed/NCBI

[459] Google Scholar

Abstract

Author summary

Figures

Introduction

Materials and methods

Construction of outbred population

Artificial selection procedure for night sleep

Quantitative genetic analyses of selected and correlated phenotypic responses

RNA extraction and sequencing

RNA alignment of reads

Principal Component Analysis (PCA)

Gene normalization and filtering

Generalized Linear Model analysis of expression data

Calculation of non-parametric correlations between genes

Gaussian Process regression

Multi-channel Gaussian Processes

Bayesian MCMC inference of Gaussian Processes

Confirmatory experiments

Results

Phenotypic response to artificial selection

Correlated response of other sleep traits to selection for night sleep

Phenotypes in flies used for RNA-Seq

Hierarchical Generalized Linear Model analysis reveals that selection for night sleep impacts gene expression

Pairwise Spearman correlation is non-specific and significant for a large fraction of genes

Gaussian Process model analysis uncovers nonlinear trends and specifically identifies covariance in expression between genes

Mutational analyses confirms role of candidate genes and interacting gene expression networks in sleep

Discussion

Supporting information

S1 Fig. Principal Component Analysis (PCA).

S2 Fig. Correlated response to selection for long/short night sleep and associated coefficient of environmental variation.

S3 Fig. Correlation of night sleep between flies harvested for RNA and all flies in the population.

S4 Fig. Gaussian Process model fits to selected genes.

S5 Fig. Gene interaction networks.

S6 Fig. Gene expression in Minos mutants.

S7 Fig. Comparison of ratios of gene expression between genes with significant Gaussian Process correlations and unrelated genes for CG13793, Cyp6a16, and hiw Minos mutants.

S1 Appendix. Notes on multichannel Gaussian Processes.

S1 Table. Effects of Minos insertions on sleep.

S2 Table. Quantitative genetics of the response to selection for long or short night sleep and related sleep parameters.

S3 Table. Quantitative genetics of the response to selection for long or short night sleep per generation.

S4 Table. Quantitative genetics of control populations.

S5 Table. Correlated response of sleep trait coefficient of environmental variance (CVE) to selection for long or short night sleep duration.

S6 Table. GLM analysis results for females.

S7 Table. GLM analysis results for males.

S8 Table. Genes with opposite slopes for the short and long interaction terms of generation in females.

S9 Table. Genes with opposite slopes for the short and long interaction terms of generation in males.

S10 Table. Gene Ontology (GO) analysis results for 85 significant genes common to males and females.

S11 Table. Correlations obtained from normalizing Gaussian Process signal covariances (GP correlation) and from Spearman Correlation for each of the six sex and selection scheme combinations.

S12 Table. Expected values for the correlations obtained from normalizing Gaussian Process signal covariances (GP correlation) that do not overlap with controls for each of the six sex and selection scheme combinations.

S13 Table. Degree for each gene in the GP network.

S14 Table. Gene expression ratios calculated from Minos insertion line and control RNA-Seq data.

S15 Table. Night sleep phenotypes.

Acknowledgments

References

Cookie Preference Center

Customize Your Cookie Preference

S5 Table. Correlated response of sleep trait coefficient of environmental variance (CV_E) to selection for long or short night sleep duration.