Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Uncovering the genomic and metagenomic research potential in old ethanol-preserved snakes

  • Claus M. Zacho ,

    Contributed equally to this work with: Claus M. Zacho, Martina A. Bager

    Roles Formal analysis, Investigation, Methodology, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Lundbeck Foundation GeoGenetics Centre, GLOBE Institute, University of Copenhagen, Copenhagen, Denmark

  • Martina A. Bager ,

    Contributed equally to this work with: Claus M. Zacho, Martina A. Bager

    Roles Formal analysis, Investigation, Methodology, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Section for EvoGenomics, GLOBE Institute, University of Copenhagen, Copenhagen, Denmark

  • Ashot Margaryan,

    Roles Resources, Software, Supervision, Validation, Visualization

    Affiliations Section for EvoGenomics, GLOBE Institute, University of Copenhagen, Copenhagen, Denmark, Center for Evolutionary Hologenomics, University of Copenhagen, Copenhagen, Denmark

  • Peter Gravlund,

    Roles Conceptualization, Methodology, Project administration, Supervision, Writing – review & editing

    Affiliation Museum of Eastern Zealand, Faxe, Denmark

  • Anders Galatius,

    Roles Formal analysis, Methodology, Resources, Software, Supervision, Validation, Writing – review & editing

    Affiliation Department of Bioscience, Aarhus University, Roskilde, Denmark

  • Arne R. Rasmussen,

    Roles Conceptualization, Investigation, Methodology, Project administration, Resources, Supervision, Writing – review & editing

    Affiliation Institute of Conservation, Royal Danish Academy—Architecture, Design, Conservation, Copenhagen, Denmark

  • Morten E. Allentoft

    Roles Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Supervision, Validation, Writing – original draft, Writing – review & editing

    morten.allentoft@curtin.edu.au

    Affiliations Lundbeck Foundation GeoGenetics Centre, GLOBE Institute, University of Copenhagen, Copenhagen, Denmark, Trace and Environmental DNA (TrEnD) Laboratory, School of Molecular and Life Sciences, Curtin University, Perth, Australia

Abstract

Natural history museum collections worldwide represent a tremendous resource of information on past and present biodiversity. Fish, reptiles, amphibians and many invertebrate collections have often been preserved in ethanol for decades or centuries and our knowledge on the genomic and metagenomic research potential of such material is limited. Here, we use ancient DNA protocols, combined with shotgun sequencing to test the molecular preservation in liver, skin and bone tissue from five old (1842 to 1964) museum specimens of the common garter snake (Thamnophis sirtalis). When mapping reads to a T. sirtalis reference genome, we find that the DNA molecules are highly damaged with short average sequence lengths (38–64 bp) and high C-T deamination, ranging from 9% to 21% at the first position. Despite this, the samples displayed relatively high endogenous DNA content, ranging from 26% to 56%, revealing that genome-scale analyses are indeed possible from all specimens and tissues included here. Of the three tested types of tissue, bone shows marginally but significantly higher DNA quality in these metrics. Though at least one of the snakes had been exposed to formalin, neither the concentration nor the quality of the obtained DNA was affected. Lastly, we demonstrate that these specimens display a diverse and tissue-specific microbial genetic profile, thus offering authentic metagenomic data despite being submerged in ethanol for many years. Our results emphasize that historical museum collections continue to offer an invaluable source of information in the era of genomics.

Introduction

Natural history museum collections represent a tremendous resource for studying past and present biodiversity on our planet [1]. For centuries, researchers and naturalists have deposited specimens in such collections worldwide [e.g., 2, 3], often with the aim of studying and describing variation between and within species based on detailed morphological investigations. However, the many millions of organisms preserved with precise information on collection date and locality can be equally relevant for studying species phylogenetics [4, 5], temporal patterns (for example in relation to changes in species distributions or composition) [68], trends in pollution-levels [911], or for documenting changes to the gene pools of endangered, extinct, or locally-extinct species [e.g., 1217]. Examining DNA sequences from a species or population sampled at different points in space and time offer a plethora of analytical possibilities, allowing us to understand microevolutionary responses to our rapidly changing biosphere. Obtaining such genetic measures of the past can be important in conservation contexts and aid in present day management of threatened species [18, 19]. Moreover, museum collections often represent much easier access to tissue samples compared to getting new samples from individuals in the field. This is particularly important for population-scale projects where many individuals must be sampled to cover the genetic diversity in a given gene pool. Access to a museum collection allows the researcher to reduce fieldwork costs and effort, where the outcome is often unpredictable.

However, in a molecular context museum material has one major disadvantage compared to freshly obtained tissue: poor DNA preservation. DNA is an unstable molecule which degrades post-mortem, leaving the DNA strands progressively more fragmented over time [e.g., 20, 21]. Furthermore, the state of molecular preservation in a given specimen is likely to be heavily affected by the mode of preservation. Samples obtained from museum collections rarely involve fresh tissue sampled specifically for genetic analysis. Instead, samples are often stored in ethanol as is the case for a wide range of organisms including mammals, fish, amphibians, reptiles and various invertebrate taxa. Ethanol preservation is widely used in museum collections, as this is an affordable and non-toxic conservation agent that preserves samples against microbial activity and autolysis, thus ultimately enhancing the specimen’s longevity [22]. Because it reduces microbial and hydrolytic activity, ethanol preservation will also slow down the degradation of the DNA molecules, but it cannot completely prevent a slow and steady decay, in particular if the ethanol purity is low (i.e. below 70%). Therefore, it is expected that old museum specimens in ethanol will display degraded DNA molecules.

Formalin fixation adds another highly problematic component to collection-based DNA research. Formalin has been used as a fixation agent on biological samples since the early 1890s [23, 24] and has routinely been used in museum collections. Here the agent is usually injected into the specimen and/or it is being soaked in a 10% (or higher) formalin solution for up to several weeks. The treatment hardens the tissue by generating cross-links between amino acids in protein molecules [2325]. Thus, the fixation maintains the body shape of the specimen, which is crucial for morphological studies [26]. However, the treatment also creates cross-links between amino acids and DNA and RNA [27, 28] which will complicate later DNA extraction and introduce polymerase “blocks” that prevent successful PCR reactions [27]. Additionally, an unbuffered formalin treatment can lead to very low pH (<1), resulting in rapid fragmentation of the DNA [27]. Inevitably, the effects of formalin exposure, combined with prolonged ethanol storage, will result in degraded molecules that complicate genetic analyses. To overcome these challenges, a number of different protocol improvements have been proposed [e.g., 14, 2729]. Moreover, the switching to Next Generation Sequencing (NGS) has enabled the sequencing of short degraded molecules in museum material, where PCR-based technologies often fail [12, 13, 19, 3040].

In ancient DNA (aDNA) research, challenges with short and degraded molecules can reach extreme levels. Therefore, when working with problematic museum specimens, it may be fruitful to apply methodological advances that have occurred in the aDNA field in recent years. In our current era of genomics and NGS, the importance of being very selective when sampling ancient specimens has become increasingly clear. By obtaining samples with a high endogenous DNA content (i.e., the proportion of DNA molecules deriving from the target species), the amount of sequencing required to reach a desired level of genomic coverage can be reduced by orders of magnitude. However, the endogenous DNA content for many ancient samples has proven to be very low (often <1%), and represents only a minor fraction of the total quantity of sequences, in which the majority are microbial DNA [e.g., 41]. Thus, many recent aDNA efforts have been aimed at maximizing the endogenous DNA content through improved sampling [e.g., 42, 43], improved DNA extraction [4446], and improved library preparation protocols [47, 48]. Importantly, extensive differences have been found when comparing endogenous DNA content in different types of bones [42, 43, 49] or even within different compartments of the same tooth [45], highlighting the importance of very careful substrate selection.

By recognizing the obvious similarities between museum-based DNA and aDNA research, it seems likely that similar insights could be gained by testing DNA preservation in different tissue types in old museum specimens. In the current study, we employed an ancient DNA framework to test the DNA quality in different tissue types sampled from snakes that have been preserved in ethanol for many years. Because the shotgun sequencing approach we use here is not restricted to DNA of the target species, it can also provide a genetic profile of the microbial diversity in the tissue sample. Such metagenomic analyses have gained considerable attention in the aDNA community in recent years, uncovering information on past microbial communities and prehistoric pathogen infections in humans [5058]. To our knowledge, very few studies have investigated if the degraded DNA in old ethanol-preserved museum specimens represents a similar ‘goldmine’ of metagenomic information [albeit see 59, 60]. In this study we will therefore explore this potential by examining taxonomic diversity among the millions of sequences that cannot be identified as snake DNA.

In summary, this investigation employs an aDNA methodology to systematically explore the genomic and metagenomic potential in different types of tissue (liver, skin, bone) from old ethanol-stored museum specimens. We used a silica-in-solution extraction method, which has proven highly effective in recovering very short DNA fragments [44], combined with NGS shotgun sequencing. The many millions of generated sequences were bioinformatically mapped against a suitable reference genome, allowing us to statistically explore the state of molecular preservation. We also applied an enzymatic test [61] to assess if any of these specimens had previously been exposed to formalin as part of their preservation treatment.

When performing destructive sampling of valuable museum material, it is imperative to balance the pros and cons and use an approach that is optimized towards the specific research aims. As such, our presented results add to a growing number of observations that will ultimately allow us to establish best practice in terms of sampling, chemical processing, and sequencing of biological museum specimens.

Methods

Samples

Five Thamnophis sirtalis (common garter snake) specimens stored in the collections of the Natural History Museum of Denmark were used for this study (Table 1). This species was selected because of the availability of a reference genome (see below), allowing for detailed bioinformatic assessment of the DNA quality. All specimens were collected between 1842 and 1964. Two of the specimens were originally collected in the US, reflecting the natural distribution of T. sirtalis, which ranges from Mexico, throughout the United States and southern Canada [62]. Three specimens were collected in Denmark; two were from Copenhagen Zoo, whereas the last individual was collected on the island of Bornholm. Since T. sirtalis is not native to Denmark, it is assumed that the specimen from Bornholm died in captivity. Three samples were obtained from each specimen: a piece of liver (c. 100 mg), a piece of skin with muscle attached (c. 100 mg), and a piece of the vertebral column (c. 50 mg). Each of these three samples was divided into two equal subsamples allowing replication of all experiments.

thumbnail
Table 1. Five specimens of Thamnophis sirtalis included in this study.

https://doi.org/10.1371/journal.pone.0256353.t001

DNA extraction, library preparation and sequencing

To minimize contamination, all molecular work (pre-library amplification) was conducted in dedicated ancient DNA (aDNA) cleanlab facilities at the Lundbeck Centre for GeoGenetics, University of Copenhagen. The samples were first washed in molecular grade H2O to remove residual ethanol and then finely divided into smaller bits to facilitate a more efficient digestion. Samples were initially incubated for 48 hours at 42˚C on a rotator in 3.5mL digestion buffer, containing 3.25 mL 0.5 M EDTA buffer, 40 μL of Proteinase K (0.14–0.22 mg/mL, Roche, Basel, Switzerland), 175 μL 10% N-laurylsarcosyl, and 35 μL TE buffer (100x). After 24 hours the samples were not sufficiently digested and therefore an additional 40 μL of proteinase K was added to each sample. After further 24h incubation, the samples were centrifuged for 2 minutes at 2000×g and the supernatant was transferred to 50 mL tubes.

DNA extraction was done using silica-in-solution and a binding buffer that is efficient in recovering very short and degraded DNA fragments [44]. The buffer was prepared in bulk by mixing 500 mL buffer PB (Qiagen, Hilden, Germany) with 9 mL of sodium acetate (5 M), and 2.5 mL of sodium chloride (5 M), pH adjusted to 4–5 with 37% HCL. Supernatant from digestion was mixed with 40mL binding buffer, added 100μL silica suspension and incubated for one hour at room temperature. After DNA binding, samples were centrifuged at 2000×g for 5 minutes and the supernatant removed. The silica pellet with bound DNA was then resuspended in 1 mL of the same binding buffer, followed by two wash-steps with 1 mL 80% cold ethanol. Following 2 min of centrifugation at 2000×g, all the supernatant was removed and the pellets were left to dry in a laminar flow workbench (LAF) for 15 minutes. Finally, DNA from the washed and centrifuged silica pellets were eluted by adding 85 μL of EB buffer (Qiagen) supplemented with 0.05% Tween-20 detergent. After extraction, the DNA concentration (ng/μL) in the extracts was measured in two replicates for each sample by Qubit Fluorometric dsDNA high-sensitivity Quantification (ThermoFisher Scientific).

DNA extract was built into blunt-end libraries using the NEBNext DNA Sample Prep Master Mix Set E6070 (New England Biolabs, Ipswich, MA, USA) and Illumina-specific adapters (Illumina, San Diego, CA, USA). Protocol followed manufacturer guidelines with a few modifications as outlined in Orlando et al. [63] and Damgaard et al. [45], although the nebulization step was excluded since DNA in museum specimens must be expected to be sufficiently fragmented already. The end-repair step was performed in 25.5 μL reactions using 21.25 μL of DNA extract and incubated for 20 mins at 12°C and 15 min at 37°C. Products were purified using Qiagen MinElute silica spin columns according to manufacturer guidelines, except that PB buffer was replaced with 250 μL of the modified binding buffer in the binding step. DNA was eluted in 15 μL Qiagen EB buffer and adapters were ligated to the blunt-end DNA in 25 μL reactions, using 5 μL of ligation buffer, 0.5 μL of adapter mix, 2.5 μL of Quick T4 ligase and 2 μL of H2O. The reaction was incubated for 15 mins at 20°C and products were purified with PB buffer on Qiagen MinElute columns, before being eluted in 23.5 μL EB Buffer. The adapter fill-in reaction was performed in 30 μL reactions using 23.5 μL of template, 3 μL of fill-in buffer, 2 μL dNTP mix (2.5 mM) and 1.5 μL of Bst polymerase and incubated for 20 mins at 37°C followed by 20 mins at 80°C to inactivate the enzyme. One μL of library was qPCR quantified with SYBRGreen on a Roche LightCycler 480 to determine the optimal number of PCR cycles required. Libraries were amplified and indexed in 50 μL PCR reactions, using 21 μL library template, 25 μL 1X KAPA HiFi HotStart Uracil + ReadyMix (KAPA Biosystems, Woburn, MA, USA) and 2 μL of both forward and reverse custom index primers. The reaction was amplified with an initial minute at 98°C, followed by 7–14 cycles of 15 seconds at 98°C, 30 seconds at 65°C and 30 seconds at 72°C, ending with one minute at 72°C. The amplified libraries were purified using Agencourt AMPure XP beads (Beckman Coulter, Krefeld, Germany). DNA concentrations of the amplified libraries were quantified on an Agilent Bioanalyzer 2100 (Agilent Technologies) before making the pools for shotgun sequencing (80 bp, single read) on Illumina HiSeq 2500 at the Danish National High-throughput DNA Sequencing Centre.

Bioinformatics and metagenomic analyses

Data were base-called using the Illumina software CASAVA 1.8.2 and sequences were de-multiplexed using a requirement of a full match of the six nucleotide indices that were used. The raw reads were trimmed for adapters using AdapterRemoval2 [64], and trimmed reads shorter than 30 bp were discarded. In order to assess the amount of endogenous DNA the data was mapped against the draft genome sequence of T. sirtalis [65]. The trimmed reads were mapped using BWA v. 0.7.15 aln [66] with default settings and duplicate reads were removed using the “rmdup” function in SAMtools v. 1.5 [67]. To investigate the level of DNA degradation in our samples, the mapped sequences were analysed with mapDamage v.2.0.5 [68] which quantifies DNA damage patterns, including base misincorporation levels and fragments lengths. These snake specimens are likely to have been handled many times over the years so as proxy for DNA contamination in the various tissues, we mapped the sequencing data against the human reference genome, build 37 (GRCh37) and recorded the fraction of mapped sequences for each sample.

To identify DNA from microbial organisms and thus gauge the metagenomic research potential in these old tissues, we used the taxonomic sequence classifier implemented in Kraken v2.0.7 [69]. We analyzed all our DNA sequences with default parameters and a confidence scoring threshold of 0.5 based on the standard Kraken database, containing all genomes of human, bacteria, fungi and archaea in the NCBI RefSeq database as of January 2019. The output of the Kraken2 classification was used as input for Bracken v2 [70] to obtain the abundance of taxa for each sample. The results of the metagenomic analysis were processed and visualized with the Pavian R package [71] and KronaTools v2.7 [72], respectively. We used the prcomp function in R to run a PCA analysis based on weighted number of hits obtained from the Kraken2/Bracken classification. The weighting was done by transforming the raw number of classified hits into fractions to account for differences between total number of DNA reads among the samples. We used all thirty eigenvectors obtained from the PCA as input for the uniform manifold approximation and projection (UMAP) analysis using the “umap” R package. Venn diagrams were constructed using the “VennDiagram” 1.6 R package.

Test for formalin exposure

An enzymatic test was performed to assess if the specimens had been exposed to formalin during preservation [61]. Two muscle samples of 0.5 x 0.5 x 0.5 cm were obtained from each specimen and suspended in 10% Savinase. Two parallel tests were performed (S1 Table), A) a 20 mL reaction being stirred in a beaker for 3 hours, and B) a 20 mL reaction in a red cap bottle with no stirring for 24 hours. According to the protocol, if the tissue has not degraded after 3 hours using stirring or 24 hours without stirring, it can be concluded that the specimen has been exposed to formalin [61, 73].

Linear mixed effect models

Linear mixed effect models using maximum likelihood were used to analyze the effects of tissue matrix on yield of endogenous DNA, mapped length, CT damage, content of human DNA and DNA concentration. The models were constructed with specimen as random factor while tissue matrix was fixed factor. Bone was used as the reference matrix against which values for liver and skin were compared.

Another set of linear mixed effect models were used to investigate the effects of specimen age on the same independent variables. The models were constructed with specimen and tissue matrix as random factors while specimen age in years was fixed factor.

The raw values of both replicate samples from each specimen were used in the models. The variables human DNA content and DNA concentration were log-transformed to reduce skewness and approximate normal distribution. The analyses were performed using the R package nmle [74].

Results

In general, we observed a high degree of consistency between results of replicate samples and we will therefore use average values in all the metrics discussed below, apart from the data substrate for the mixed effect model, which was the raw data. S2 Table shows results from all the DNA extracts individually.

Formalin test

The results representing each of the two enzymatic treatments are presented in S1 Table. The specimen R602827 collected in 1923 was the only specimen that consistently showed clear evidence of exposure to formalin, whereas R602826 collected in 1920 was the only specimen that consistently showed clear evidence of no exposure. For the remaining three specimens, the results were inconclusive with the tissue being partly degraded (S1 Table).

DNA concentration

Except for specimen R603729, the DNA concentration was highest in the liver tissue extracts, with four liver samples showing DNA concentrations between 22.7–92.4 ng/μL, whereas the oldest individual from 1842 (R602799) yielded only 5.7 ng/μL (Table 2). DNA concentrations from skin extracts were low in three samples (0.3–2.1 ng/μL) and relatively high in two samples (12.1 and 25.9 ng/μL). The DNA content in bone reflected similar values with two samples displaying low concentrations (1.8 and 2.6 ng/μL) and two samples with higher concentrations (10.5 and 18.6 ng/μL). For R603729, the DNA concentration in the bone extract was the highest (83.3 ng/μL). Overall, the oldest sample from 1842 (R602799) performed worst in regard to DNA concentration (Table 2, Fig 1). The linear mixed effects models revealed that tissue matrix had significant effects on DNA concentrations, and liver samples were estimated to provide the highest concentrations of DNA with 221% (p = 0,01) higher content than bone, while skin samples provided 73% (p = 0,004) lower DNA content than bone (S4 Table).

thumbnail
Fig 1. DNA preservation.

DNA concentration measurement and DNA preservation metrics listed according to specimen age. Endo %, endogenous DNA content; Length, average length (in base pairs) of mapped reads; C-T %, C-to-T deamination damage fraction recorded at first position in the 5’end. Bars represent average values of two replicate DNA extracts with the range shown.

https://doi.org/10.1371/journal.pone.0256353.g001

Bioinformatic results

Approximately 570 million DNA sequences were generated for this project, ranging from 6 to 44 million, and averaging 19 million sequences per DNA extract (S2 Table). Following bioinformatic trimming and mapping against the T. sirtalis genome, between 29.0% (R603729, liver) and 55.6% (R602827, bone) of the sequences could be aligned (Table 2, Fig 1) thus representing the endogenous DNA content in these extracts. When clonal reads were removed, these values corresponded to an overall sequencing efficiency between 12.8% and 50.3% (S2 Table). For all five specimens, the endogenous DNA content was highest in bone tissue (Table 2, Fig 1), but in general the difference between tissue types was relatively modest. The linear mixed effects models, however, revealed that tissue matrix had significant effects on all qualitative metrics (Table 3). For endogenous DNA content, bone performed best with liver and skin estimated to produce 9 (p = 0.0008) and 5 (p = 0.0298) percentage points lower values, respectively (Table 3, Table 2, Fig 1). The difference in endogenous content between bone and liver obtained from each specimen ranged from 0.9% higher in bone in R602827 to 19.5% higher in bone in R603729 (Table 2, Fig 1). In four of the five specimens, skin and liver tissue samples showed almost equal levels of endogenous DNA content (Table 2, Fig 1). The average length of the mapped sequences was short and ranged from 40.4 to 63.4 bp per sample (Table 2, Fig 1). Notably, for all five specimens, bone also performed best, with liver and skin estimated to produce 5 (p = 0.003) and 8 bp (p = 0.0001) shorter fragments, respectively (Table 3). As with the endogenous DNA content, the sequence lengths in liver and skin were somewhat similar in length although these differences were significant (Table 3). We observed a high C-T deamination fraction, ranging from 9.1% to 20.5% at the first 5’ position of the sequences. Bone was estimated to perform best with regard to CT damage, although the difference relative to skin was not statistically significant (Table 3). Samples from liver tissue were estimated to contain 2.76 percentage points (p = 0.0057) more damage than bone, while skin samples were estimated at 0.81 percentage points (p = 0.38) higher (Table 3). In summary, we see a relatively high level of endogenous DNA content in these samples, but the data reflect short and damaged DNA molecules.

Overall, we found no obvious trend between the different measures of DNA preservation and the age of the specimen (Figs 1 and 2), apart from observing a very low DNA concentration in the extract of the oldest one from 1842 (R602799). The linear mixed effects model on sample age revealed that in our samples, age had very modest and statistically insignificant effects on all qualitative metrics (S4 Table).

thumbnail
Fig 2. Human DNA contamination.

Fraction of sequences that could be mapped to the human reference genome listed according to specimen age. Bars represent average values of two replicate DNA extracts and the range is shown.

https://doi.org/10.1371/journal.pone.0256353.g002

Only two specimens could be assigned with certainty with regard to the presence/absence of formalin, but since these two specimens have almost the same age (1920 and 1923), it allows for a direct comparison of the potential effects of formalin. However, due to the small sample size, these comparisons should only be observed as potential indications. The specimen without formalin exposure (R602826) displayed higher DNA concentration, longer average mapped read lengths, as well as less C-T damage, in all tissues, in comparison to the specimen exposed to formalin (R602827). The non-formalin exposed specimen also showed the longest average read length of all five samples. Conversely, the specimen with confirmed formalin exposure showed the highest endogenous DNA content of all the five specimens.

These snake specimens are likely to have been handled many times over the years so as proxy for DNA contamination in the various tissues, we mapped the sequencing data against the human reference genome. Based on this, we could estimate the proportion of DNA sequences with a human origin. Overall, the human DNA contamination levels were low, ranging from 0.2% to 1.7% of the reads (Table 2, Fig 2). Across individual tissue types, the average human contamination was 0.8% in the liver samples, 0.5% in the skin samples and 0.3% in bone tissue. The two oldest samples (R602799 and R602815) had the highest average contamination fraction of 0.9% and 1.2% respectively, whereas the remaining samples had average contamination between 0.2% and 0.5%. Despite these low values, we observe that DNA contamination levels across different tissue types are somewhat correlated when they represent the same specimen (Fig 2). Thus, if a specimen displays an increased level of human DNA contamination, this is likely the case for all three types of tissue. The linear mixed effects models also revealed that bone was superior with regard to human DNA content, where liver and skin, despite the low absolute values in all matrices, were estimated to contain 89% (p = <0.0001) and 50% (p = 0.0033) more contamination respectively (Table 3).

In the metagenomic analyses, we binned the sequences according to tissue types but combined them across individuals. We applied a species-level designation but to reduce background ‘noise’ we considered only taxa with >5 sequence hits against the reference database. It is beyond the scope of our study to provide an accurate and complete taxonomic characterization of the microbial community in these samples. We focused on the overall metagenomic differences in exploring the potential and we note that exact taxon matches should be interpreted with caution. Nonetheless, the Kraken2 analyses showed clear differences in diversity between the tissues (S3 Table), most clearly observed in the bacterial community (Fig 3) where 441 taxa are observed in the skin samples, 309 taxa in the bone samples, and 289 taxa in the liver samples. A considerable taxonomic overlap between tissue types is visualized in a Venn diagram (Fig 3), and it is also clear that a substantial part of the diversity is observed in the extraction blanks and thus represent DNA introduced with reagents or during sequencing despite working in clean-labs, and despite observing unmeasurable levels of DNA concentration in these negative extraction controls. Taking a conservative approach, by simply ignoring all taxa that are also observed in the extraction blanks, we still observe 106 different bacterial taxa in our samples, 30 of which are exclusively observed in the skin samples.

thumbnail
Fig 3. Metagenomic profiling.

Bacterial metagenomic diversity visualized in Krona and Venn (lower right) plots for the three tissue types. The diversity is presented at species-level and includes only NCBI refseq taxa with >5 sequence matches. Colors denote taxonomic groups found within the various bacterial phyla as according to Krona visualization. Blanks denote negative extraction control.

https://doi.org/10.1371/journal.pone.0256353.g003

We applied a principal component analysis (PCA), as well as adding further dimensionality using a Uniform Manifold Approximation and Projection (UMAP), in order to assess the relationship between the metagenomic profiles from all tissues and individuals. In both the PCA and most clearly observed in the UMAP results, we observe some clustering according to individuals (S1 Fig) and in the PCA analyses, the skin/muscle tissue displays the highest apparent metagenomic variation. Moreover, in the PCA we also see some clustering by tissue types (S1 Fig), but since there was much variation between extracts from the same samples, this indication should be interpreted with caution.

Discussion

Owing to NGS technology, genome sequencing has become an affordable and widely used element in many areas of biological research. Yet, the number of studies that thoroughly explore the genomic potential of old museum specimens is still limited. By using an ancient DNA methodology, we show that genomic analyses of old ethanol-preserved museum material are indeed feasible, displaying great promise for unlocking the wealth of molecular information stored in the millions of specimens found in museum collections around the world. The successful generation of NGS data from museum specimens is not unique [3437, 75, 76], but very few studies have systematically tested different tissue types with the aim of establishing best practice protocols when sampling valuable museum material for genomic research. Below we will discuss the various DNA preservation metrics included in our study.

DNA concentration

The DNA concentration serves as a basic quality measure in many molecular contexts, because it reveals if the extraction has been successful and it can inform on how the DNA should be prepared in downstream experimental work. It is, however, only a crude indicator of the usability in genome-scale sequencing because DNA concentration can be adjusted post-extraction, and most library methods require only very limited input DNA anyway–in particular methods that have been optimized for aDNA research as in the current setup. Regardless, DNA could be detected in all our extracts showing that the extraction method has been successful despite using old ethanol-preserved specimens. This includes the formalin-exposed specimen, where we observed high DNA content (53.6–55.8 ng/μL) in the liver tissue extracts. Except for sample R603729, the DNA concentration was highest in liver tissue, ranging from 22.7–92.4 ng/μL. Similar results were found by Hykin et al. [33], comparing DNA concentration in liver, muscle and tail tips from two specimens of Anolis caroliensis collected approximately 30 years ago.

We generally observed lower DNA concentration (0.3–26.9 ng/μL) in skin extracts. Since skin, including also snake scales, are partly made of dead keratinized cells [77], it is not unexpected that these extracts contain lower amounts of DNA than liver tissue. Similar results were also obtained by Casas-Marce et al. [78] who compared DNA contents in skin versus bone samples from museum Iberian lynx (Lynx pardinus) as well as Tsai et al. [32] who found the lowest DNA concentration from dried skin tissue compared to bone and toe-pad tissues. Moreover, skin, in comparison with bone and liver tissue, is also in direct contact with the environment, which could potentially elevate processes leading to DNA damage.

The linear mixed effects model showed significant differences in DNA concentrations from the various tissues, but as discussed above this is not a particularly informative metric when assessing the potential for genomic-scale analyses.

Endogenous DNA

The endogenous DNA content is the most important factor when working with degraded samples in a genomic context, because it defines how much ‘shotgun’ sequencing is required to reach a certain genomic coverage. This is particularly pertinent in the aDNA field since most biological tissues exposed to natural environments for centuries or millennia will display mostly DNA of a microbial origin and often less than 1% endogenous DNA [41, 79, 80]. Therefore, a considerable effort in aDNA research has been aimed at maximizing the endogenous DNA content through improved sampling, DNA extraction, and library preparation protocols [4245, 49, 81].

By mapping our many millions of DNA sequences to the T. sirtalis reference genome we found a relatively high fraction of endogenous DNA in all the individuals and tissues, ranging from 26% to 56%. Although this is still markedly lower compared to working with fresh tissue, where the endogenous DNA fraction is expected to be close to 100%, we argue that all tissues from the included specimens have adequate levels of endogenous DNA for genome-wide sequencing to be feasible. In support of the sufficiency of these levels for whole genomic sequencing, whole genomes (>1X depth of coverage) are routinely sequenced from endogenous content of similar values or lower, down to c.10% in the ancient DNA field [44, 8285]

The relatively high endogenous DNA content in our extracts shows that long-term storage in 70% ethanol does not preclude an effective shotgun sequencing approach and genome-scale research. Presumably this type of storage will limit the introduction of fresh/living microbes to the specimen, thereby maintaining a relatively high endogenous DNA fraction over decades and even centuries.

Many studies have reported the successful extraction and sequencing of DNA from old museum specimens (see Burrell et al. [12], Wandeler et al. [14], Holmes et al. [19]), but very few have systematically tested different samples and tissues for their endogenous DNA content based on millions of reads. Exceptions are human skeletal remains that have been extensively tested in recent years in the aDNA field. Concerning non-human taxa, endogenous DNA fractions from various tissues have been assessed in Great Apes from museum collections [86], displaying fractions between 1–99% in teeth and >80% in soft tissue samples (fingertips and ear cartilage). Similar variation is evident from other museum collections samples. For example, the genomic mapping rate to reference genomes was c. 40% in dried Alpine chipmunk (Tamias alpinus) skin samples [35] and between 21.8–38% from rat (Rattus norvegicus) molar and toe samples [34]. Likewise, Allentoft et al. [13] used an aDNA framework to extract and shotgun-sequence samples from ethanol-preserved Aesculapian snake tissues (muscle/skin, teeth and vertebrae), for phylogenetic purposes, but also tested the mapping rate against the T. sirtalis genome. Without a suitable reference genome, the mapping rate was much lower (0.6–8.6%) than we observe here, but all tissue types proved positive for snake DNA.

The analyses showed that the differences in endogenous DNA between all three tissues were significant (Table 3). However, when evaluating which of our included substrates performed best, it should be acknowledged that liver and skin showed almost equal levels of endogenous DNA in all samples in absolute values, except in extracts from the most recently collected individual from 1964 (R603729), where skin was 16.4 percentage points higher than liver tissue. In all five specimens, bone tissue contained the highest fraction of endogenous DNA. Bone can act as a barrier against autolytic, oxidative, and hydrolytic damage [87], and perhaps also limit DNA degradation caused by unfavorable pH conditions [88] therefore more intact and non-damaged molecules may be available from this substrate. Also, bone is a denser tissue than liver and skin, and therefore presumably less permeable to invading microbes post-mortem. These factors arguably explain why bone displays the highest endogenous DNA content in our study. Likewise, in the above-mentioned rat study, Rowe et al. [34] obtained the longest DNA fragment lengths and highest total nucleotide concentrations from bone samples. Similarly, Allentoft et al. [13] obtained the highest mapping efficiency from bone tissue, compared to muscle/skin and teeth. Moreover, previous studies on squamate reptiles from museum collections also succeeded in extracting and sequencing nuclear DNA from both skeletonized bone and bone preserved in 70% ethanol and presumably also exposed to formalin [89, 90].

However, it is important to note that the absolute values we have obtained from bone are only slightly higher than those from the other tissues (Table 3). For example, the average endogenous DNA content for bone was 50.4%, whereas the average was 45.2% and 41.2% for skin and liver respectively. This is relevant for sampling strategy purposes, since sampling of skin and liver tissue is generally less complicated and invasive than sampling bone from an intact specimen preserved in ethanol. In many instances, it may therefore be advisable to sample skin or liver, in particular when sampling rare and valuable specimens, such as type specimens or extinct species where materials are rare.

Interestingly, we observed the highest endogenous DNA content (evident in all three tissues), in the specimen that proved positive for formalin exposure. Low-coverage genomic DNA has previously been reported from formalin fixed museum specimens [30, 33], although these studies encountered mixed DNA extraction success rates and DNA quantities. Ruane and Austin [30] were able to sequence Ultra-Conserved Elements (UCEs) from a c. 100 years old fluid and formalin preserved snake, showing that it is indeed possible to obtain useful data from a century old formalin-exposed sample. In this study, however, we mapped sequences to a reference genome to obtain a measure of the actual endogenous DNA content, and thus we are able to document the genomic research potential in an old formalin preserved snake from 1923 (R602827). It is perhaps not entirely surprising that we find a relatively high level of endogenous DNA in a formalin fixed sample. Formalin fixation infers several crucial alterations that complicates DNA extractions and can also elevate DNA fragmentation rates [27]. These processes can ultimately reduce DNA concentration, increase deamination damage or reduce fragment length, but we do not have reason to expect that formalin will lower the endogenous DNA content–perhaps rather the opposite as formalin exposure is likely to reduce microbial attack and decomposition. It is well known that formalin fixation may hamper or completely prevent successful recovery of DNA from a given sample, but our results show that if DNA extraction is successful, the quality and purity (i.e. endogenous DNA content) of the recovered DNA is just as good–and perhaps even better–than what is observed in non-formalin exposed samples of similar age.

DNA damage and contamination

We also investigated other proxies for DNA quality, namely average fragment lengths and levels of C-to-T deamination damage at the first 5’ position of the sequences [91, 92]. In a normal metabolically active cell, the DNA molecules are rapidly and efficiently repaired by the hosts repair pathway, but post-mortem errors in the DNA accumulate, resulting in fragmentation and C-to-T misincorporation [20, 21]. The rate at which this happens depends on microenvironmental factors such as temperature, pH and salinity [9395].

Overall, we observe high C-to-T damage (9.1% to 20.5%) in all samples. These values are surprisingly high, reflecting severely damaged DNA, equivalent to what is often observed in bones that are thousands of years old (e.g., Allentoft et al. [44]). In all except one specimen (R603729) the C-T deamination was highest in liver tissue (11.1–20.5%), whereas bone had the lowest average deamination across tissue type (13.1%) but when assessing the absolute values the differences are subtle. The specimen from 1920 (R602826), from where formalin exposure could not be detected, had the lowest average C-T values (10.3%) across tissues. C-T deamination has been reported in countless aDNA studies but is less commonly reported in studies using more recent (i.e. decades, centuries) museum material. When investigating C-T deamination in a timeseries of monkey/gorilla specimens, Sawyer et al. [93] reported rates <4% for the majority of samples younger than 120 years. Only specimens that were burned or more than 500 years old showed C-T deamination rates >9.1% (our lowest value). Similar values have been obtained from herbarium museum samples covering the last 300 years, giving values ranging from c. 1,5–5% [96]. In contrast, a recent study by Allentoft et al. [13] found 11–15% C-T deamination in old (1810 to 1863) ethanol preserved Aesculapian snakes. These values are similar to what we observe here, indicating that ethanol-preservation (possibly combined with exposure to formalin) results in a higher damage rate than in dry museum material. C-T damage is not necessarily a major problem in downstream analyses, and it can be used to authenticate that the DNA is indeed of ancient or historical origin. However, a high level of C-T damage could result in a lower reference genome mapping affinity in the bioinformatical analyses, and hence the loss of relevant data. Moreover, if not accounted for analytically, C-T misincorporation could be wrongly interpreted as true genetic variation, resulting in problematic conclusions.

Similarly, the average DNA fragment lengths we observe (40–64 bp), also reflect highly degraded DNA. The longest average lengths were found in specimen R602826 (59.3–63.4 bp), where formalin exposure was not detected, but these values are only marginally larger in absolute values than in the other specimens. However, we will emphasize that such cross-study comparisons of fragment lengths should be carried out with caution as they can be influenced by the extraction method, the library method, the PCR and sequencing chemistry. Still, it is clear that these values represent highly degraded DNA, similar to what is often observed in ancient biological substrates. Such highly fragmented DNA also explains why PCR-based methods often fail with museum material [27, 28, 97] since there is simply not enough space for binding primers to the template DNA.

Following the discussion above, it seems that storage in 70% ethanol at room temperature is not ideal for long term DNA preservation. A reason could be that in a 70% ethanol solution (as widely used in museum collections) the presence of 30% water will promote spontaneous hydrolysis of the DNA [21, 98]. Long-term storage will likely lead to lowered ethanol concentration due to evaporation, and possibly also unfavorable pH conditions [88] eventually increasing the rate of DNA degradation further [27].

Because DNA degrades post-mortem, we would intuitively expect that the DNA quality would decline with specimen age. Previous results on museum animal remains [93] and herbarium samples [96] found strong correlation between C-T deamination and time. However, a correlation between average fragment lengths and time was only observed from the herbarium samples [96]. The oldest snake in our study had the lowest DNA concentration in all three tissues, but aside from this observation, we found no correlation between age and the measured variables in the linear mixed effects model on sample age (S4 Table). Similar lack of correlation between age and state of DNA preservation was recently also observed from museum samples covering c. 100 years [99], including both ethanol preserved and dried samples.

As a final measure of DNA purity, we examined the contamination levels. It is perceivable that many years of storage in museum collections combined with periodic handling of the specimen could result in human DNA contamination. We found low levels of human contamination (0.2–1.7%) in this study, and although the differences among tissues are significant (Table 3), the levels of the observed absolute differences suggest that this is not a major concern in our case. Previous studies based on historical museum material have identified similar low levels of human DNA contamination. Using the human mitochondrial genome as reference the fraction of contamination was estimated to c. 1% in [86] and 2,3% in [36] and when based on genome-wide shotgun data, contaminant fractions of 4.3–8.9% in [100], <0,3% in [35], <0,1% in [34], and in 0.27% [33] have been reported. In summary, human DNA contamination has not severely hampered the genomic potential, and in most cases, a small proportion of human DNA sequences will not affect downstream analyses as they can easily be filtered out bioinformatically. Cross-contamination between specimens preserved in the same jars for many years might present a larger problem, but to our knowledge this remains to be investigated.

Metagenomic potential

In order to assess the metagenomic profiles of these specimen and tissues, we examined the many shotgun-sequences that could not be identified as snake DNA. From the entire dataset, the bacterial DNA fraction accounted for just 0.01–0.02% of all sequences in bone and liver respectively (Fig 3), whereas the fraction in skin was higher at 0.06%. This seem to reflect a very low concentration of microbial DNA compared to snake DNA in the tested tissues and this is probably explained by preservation in ethanol which prevents post-mortem microbial growth. According to our knowledge, no previous study has performed shotgun metagenomics on ethanol-preserved museum specimens, limiting comparative assessments. A different explanation for the low fractions could also be that we simply do not have sufficient database coverage of the microbial taxa found on snakes. Only few studies have investigated snake metagenomics [101105] and these have all analyzed microbes from the digestive tract in fresh tissues, not from ethanol preserved museum specimens, and none of them have utilized shotgun sequencing. Thus a large fraction of snake-related bacterial genomes are likely to be absent from the databases [106]. Similar low fractions of reads being assigned to taxa have also been observed in aDNA studies [107109], and although these results are not directly comparable to those obtained from ethanol preserved specimens, they still underline missingness in reference databases.

If the low fraction of reads assigned to known microbial diversity is partly a consequence of preservation in ethanol preventing post-mortem microbial growth, it could be expected that the microbial DNA profiles reflect the diversity as it appeared in the snakes at the time of preservation. The fact that we observe tissue-specific metagenomic profiles (Fig 3), and to some extent also specimen-specific ones (S1 Fig), suggest that this is indeed the case.

Under the strict criteria applied we found 30 bacterial taxa uniquely present in the skin tissue, whereas we only found two taxa uniquely associated with liver tissue, and no taxa uniquely found in bone. Because the skin of the animals has more direct contact with the environment than the other tissues, this result is perhaps not surprising but again, it is indicative that these tissues have retained characteristics expected for their original microbial profiles. However, we cannot rule out that some proportion of the microbial diversity observed in the skin is derived from the ethanol used for preservation.

Our metagenomic profiling is intended as a proof-of-concept to evaluate the potential and we caution that the exact taxonomic identification of specific microbial taxa require rigorous follow-up investigations. Nonetheless, we highlight here a few examples of the microorganisms most frequently observed in the snake specimens (S3 Table). Edwardsiella tarda was observed in two of the snakes, with highest prevalence in the skin tissue. This bacterium is known to infect fish, amphibians, mammals and reptiles and causes Edwardsiella septicemia also known as fish gangrene [110]. Enterococcus faecalis, observed in several of the snakes but predominantly in skin tissue, form part of the commensal gastrointestinal microbiota of a diverse range of taxa, including mammals, birds, reptiles and insects [111]. Bacteria in this genus are fast-emerging as both human and reptile pathogens and have previously been connected with diseases in sea turtles [112]. Morganella morganii was observed in all tissues from one of the individuals (R603729) but with the highest levels in skin. This bacterium is commonly found in the environment and intestinal tracts of humans, mammals and reptiles as part of the normal flora [113] and has been found as the primary agent of secondary wound infections after snakebites [114116]. Interestingly the taxon displaying highest overall levels in all samples and tissues, covering 97–100% of all virus sequences, is Taterapox virus; which is a virus phylogenetically very closely related to variola—the etiological agent of smallpox. Only few studies have examined the genomes from this virus, and besides indications of the virus infecting Mongolian gerbils (Meriones unguiculatus) and mice [117] we cannot currently explain why this virus is present in all our samples, but it is an interesting observation that warrants further research.

Our results suggest that there is indeed authentic metagenomic information stored alongside the actual ethanol-preserved museum specimen. This could serve as a window into past microbial communities related to for example, infections and diet (i.e. microbiome), or to investigate the prevalence of certain pathogens through space and time. Because museum specimens always have a collection date and locality associated with them, these data also present an opportunity to directly assess the micro-evolution of historical bacteria and viruses. Thereby the potential benefit of applying metagenomic analyses similar to ours, or microbial metabarcoding approaches, could add another highly informative layer to the many possibilities from museum samples, that were previously not possible in the pre-genomic era.

Conclusion

By using an ancient DNA methodology, we were able to extract and analyse both genomic and microbial DNA from all these ethanol-preserved snakes, including a 178-year-old specimen and one exposed to formalin. Bone samples performed better by the applied quality standards (endogenous DNA content, DNA damage and contamination) and therefore, if we were to recommend a tissue type for genomic analyses, based solely on these values, bone would be preferable. However, although the observed differences between tissues were statistically significant, the absolute values were marginal. Given the difficulty and damaging nature of sampling a bone from a complete ethanol-preserved museum specimen, we will not, based on our results, advocate selective sampling of bone in favour of other tissues.

Importantly, all our samples displayed a relatively high endogenous DNA content, which confirms the genomic research potential from these museum specimens. We find that high endogenous content in combination with the aDNA and shotgun sequencing setup employed here, seems like a very advantageous setup for museum sample genomics. However, the endogenous DNA levels are still lower, and the DNA is much more fragmented than observed in fresh samples. Consequently, obtaining a certain level of genomic coverage will require a lot more sequencing compared to sequencing fresh samples, but with the values we have demonstrated here, generating useful genome-wide data is indeed possible.

Although our results are encouraging, further improvements could be explored. In all our samples, we found high levels of C-to-T deamination which may compromise downstream bioinformatic analyses. One way to lower these “error rates”, could be to utilize the Uracil-Specific Excision Reagent (USER enzyme), which removes uracil residues, as widely applied in aDNA research [118]. Moreover, single-stranded library preparation protocols have been shown to significantly improve the output from ancient samples [119, 120], fragmented samples [121] and tissues stored in formalin [122]. Applying these methods to samples from ethanol-preserved specimens, similar to those we have tested, could prove beneficial and increase the research potential even further. It is important to keep optimizing and testing these methods, because museum specimens represent a valuable and finite resource and when performing destructive sampling for genetic analyses, it is imperative that everything possible is done to maximize the output.

Supporting information

S1 Fig. PCA/UMAP.

PCA analysis based on weighted number of hits obtained from the Kraken2/Bracken classification for each extract. The weighting was done by transforming the raw number of classified hits into fractions to account for differences between total number of DNA reads among the samples. The different types of tissue are noted. Uniform Manifold Approximation and Projection (UMAP), was used to add further dimensionality in order to assess the relationship between the metagenomic profiles from all tissues and individuals. Blanks were excluded for the analysis.

https://doi.org/10.1371/journal.pone.0256353.s001

(PDF)

S1 Table. Complete formalin results.

Results of formalin test using two test methods.

https://doi.org/10.1371/journal.pone.0256353.s002

(DOCX)

S2 Table. Complete data overview for all extracts.

https://doi.org/10.1371/journal.pone.0256353.s003

(XLSX)

S3 Table. Kraken2/Bracken output.

Data was classified with Kraken2 and used as input for Bracken to obtain the abundance of all taxa for each sample. Total counts of tissue-specific taxa across all samples are shown to the right. Colors highlight the tissue type used for DNA extraction in the different extracts and blanks (blue = liver, green = vertebrate/bone, red = skin/muscle, orange = blanks).

https://doi.org/10.1371/journal.pone.0256353.s004

(XLSX)

S4 Table. Mixed models on sample age.

Linear mixed effect models were used to investigate the effects of specimen age on the same independent variables. The models were constructed with specimen and tissue matrix as random factors while specimen age in years was fixed factor.

https://doi.org/10.1371/journal.pone.0256353.s005

(XLSX)

Acknowledgments

We thank Jesper Stenderup and the staff at the Danish National High-throughput Sequencing Centre for technical assistance. We thank the Natural History Museum Denmark for providing access to their collections and allowing sampling of the specimens.

References

  1. 1. Meineke EK, Davies TJ, Daru BH, Davis CC. Biological collections for understanding biodiversity in the Anthropocene. Philos Trans R Soc Lond B Biol Sci. 2018;374(1763). Epub 2018. pmid:30455204; PubMed Central PMCID: PMC6282082.
  2. 2. Suarez AV, Tsutsui ND. The Value of Museum Collections for Research and Society. BioScience. 2004;54(1):66–74.
  3. 3. Bradley RD, Bradley LC, Garner HJ, Baker RJ. Assessing the Value of Natural History Collections and Addressing Issues Regarding Long-Term Growth and Care. BioScience. 2014;64(12).
  4. 4. Rasmussen AR, Elmberg J, Sanders KL, Gravlund P. Rediscovery of the rare sea snake Hydrophis parviceps Smith 1935: identification and conservation status. Copeia. 2012;2012(2):276–82.
  5. 5. Shapiro B, Sibthorpe D, Rambaut A, Austin J, Wragg GM, Bininda-Emonds OR, et al. Flight of the dodo. Science. 2002;295(5560):1683–. pmid:11872833
  6. 6. Pergams OR, Nyberg D. Museum collections of mammals corroborate the exceptional decline of prairie habitat in the Chicago region. Journal of Mammalogy. 2001;82(4):984–92.
  7. 7. McCarthy MA. Identifying declining and threatened species with museum data. Biological conservation. 1998;83(1):9–17.
  8. 8. Shaffer HB, Fisher RN, Davidson C. The role of natural history collections in documenting species declines. Trends in ecology & evolution. 1998;13(1):27–30. pmid:21238186
  9. 9. Dietz R, Riget F, Born EW, Sonne C, Grandjean P, Kirkegaard M, et al. Trends in mercury in hair of greenlandic polar bears (Ursus maritimus) during 1892− 2001. Environmental science & technology. 2006;40(4):1120–5. pmid:16572764
  10. 10. Pokorny B, Glinšek A, Ribarič-Lasnik C. Roe Deer Antlers as a Historical Bioindicator of Lead Pollution in the Šalek Valley, Slovenia. Journal of Atmospheric Chemistry. 2004;49(1):175–89.
  11. 11. Daugaard-Petersen T, Langebæk R, Rigét FF, Letcher RJ, Hyldstrup L, Jensen J-EB, et al. Persistent organic pollutants, skull size and bone density of polar bears (Ursus maritimus) from East Greenland 1892–2015 and Svalbard 1964–2004. Environmental research. 2018;162:74–80. pmid:29287182
  12. 12. Burrell AS, Disotell TR, Bergey CM. The use of museum specimens with high-throughput DNA sequencers. Journal of human evolution. 2015;79:35–44. pmid:25532801
  13. 13. Allentoft ME, Rasmussen AR, Kristensen HV. Centuries-Old DNA from an Extinct Population of Aesculapian Snake (Zamenis longissimus) Offers New Phylogeographic Insight. Diversity. 2018;10(1):14.
  14. 14. Wandeler P, Hoeck PE, Keller LF. Back to the future: museum specimens in population genetics. Trends in ecology & evolution. 2007;22(12):634–42. pmid:17988758
  15. 15. Allentoft ME, Heller R, Oskam CL, Lorenzen ED, Hale ML, Gilbert MTP, et al. Extinct New Zealand megafauna were not in decline before human colonization. Proceedings of the National Academy of Sciences. 2014;111(13):4922–7.
  16. 16. Bourke B, Frantz A, Lavers C, Davison A, Dawson D, Burke T. Genetic signatures of population change in the British golden eagle (Aquila chrysaetos). Conservation Genetics. 2010. pmid:20563244
  17. 17. Lorenzen ED, Nogues-Bravo D, Orlando L, Weinstock J, Binladen J, Marske KA, et al. Species-specific responses of Late Quaternary megafauna to climate and humans. Nature. 2011;479(7373):359–64. pmid:22048313
  18. 18. Austin JJ, Melville J. Incorporating historical museum specimens into molecular systematic and conservation genetics research. Molecular Ecology Notes. 2006;6(4):1089–92.
  19. 19. Holmes MW, Hammond TT, Wogan GO, Walsh RE, LaBarbera K, Wommack EA, et al. Natural history collections as windows on evolutionary processes. Molecular Ecology. 2016;25(4):864–81. pmid:26757135
  20. 20. Allentoft ME, Collins M, Harker D, Haile J, Oskam CL, Hale ML, et al. The half-life of DNA in bone: measuring decay kinetics in 158 dated fossils. Proceedings of the Royal Society B: Biological Sciences. 2012;279(1748):4724–33. pmid:23055061
  21. 21. Lindahl T. Instability and decay of the primary structure of DNA. Nature. 1993;362(6422):709–15. WOS:A1993KY45000044. pmid:8469282
  22. 22. Simmons JE. Fluid preservation: a comprehensive reference: Rowman & Littlefield; 2014.
  23. 23. Fox CH, Johnson FB, Whiting J, Roller PP. Formaldehyde fixation. Journal of Histochemistry & Cytochemistry. 1985;33(8):845–53. pmid:3894502
  24. 24. Schander C, Kenneth HM. DNA, PCR and formalinized animal tissue–a short review and protocols. Organisms Diversity & Evolution. 2003;3(3):195–205.
  25. 25. Thavarajah R, Mudimbaimannar VK, Elizabeth J, Rao UK, Ranganathan K. Chemical and physical basics of routine formaldehyde fixation. Journal of oral and maxillofacial pathology: JOMFP. 2012;16(3):400. pmid:23248474
  26. 26. Hargrave CW, Sutherland KG, Marsh-Matthews E, Matthews WJ, editors. Multiple interacting factors affect pH in museum storage solutions. Collection forum; 2005.
  27. 27. Gilbert MTP, Haselkorn T, Bunce M, Sanchez JJ, Lucas SB, Jewell LD, et al. The isolation of nucleic acids from fixed, paraffin-embedded tissues–which methods are useful when? PloS one. 2007;2(6).
  28. 28. Dietrich D, Uhl B, Sailer V, Holmes EE, Jung M, Meller S, et al. Improved PCR performance using template DNA from formalin-fixed and paraffin-embedded tissues by overcoming PCR inhibition. PLoS One. 2013;8(10). pmid:24155973
  29. 29. Casas‐Marce M, Revilla E, Godoy JA. Searching for DNA in museum specimens: a comparison of sources in a mammal species. Molecular Ecology Resources. 2010;10(3):502–7. pmid:21565049
  30. 30. Ruane S, Austin CC. Phylogenomics using formalin‐fixed and 100+ year‐old intractable natural history specimens. Molecular ecology resources. 2017;17(5):1003–8. pmid:28118515
  31. 31. McGuire JA, Cotoras DD, O’Connell B, Lawalata SZ, Wang-Claypool CY, Stubbs A, et al. Squeezing water from a stone: high-throughput sequencing from a 145-year old holotype resolves (barely) a cryptic species problem in flying lizards. PeerJ. 2018;6:e4470. pmid:29576952
  32. 32. Tsai WL, Schedl ME, Maley JM, McCormack JE. More than skin and bones: Comparing extraction methods and alternative sources of DNA from avian museum specimens. Molecular ecology resources. 2019. pmid:31478338
  33. 33. Hykin SM, Bi K, McGuire JA. Fixing formalin: a method to recover genomic-scale DNA sequence data from formalin-fixed museum specimens using high-throughput sequencing. PloS one. 2015;10(10). pmid:26505622
  34. 34. Rowe KC, Singhal S, Macmanes MD, Ayroles JF, Morelli TL, Rubidge EM, et al. Museum genomics: low‐cost and high‐accuracy genetic data from historical specimens. Molecular Ecology Resources. 2011;11(6):1082–92. pmid:21791033
  35. 35. Bi K, Linderoth T, Vanderpool D, Good JM, Nielsen R, Moritz C. Unlocking the vault: next‐generation museum population genomics. Molecular ecology. 2013;22(24):6018–32. pmid:24118668
  36. 36. Guschanski K, Krause J, Sawyer S, Valente LM, Bailey S, Finstermeier K, et al. Next-generation museomics disentangles one of the largest primate radiations. Systematic biology. 2013;62(4):539–54. pmid:23503595
  37. 37. Staats M, Erkens RH, van de Vossenberg B, Wieringa JJ, Kraaijeveld K, Stielow B, et al. Genomic treasure troves: complete genome sequencing of herbarium and insect museum specimens. PLoS One. 2013;8(7). pmid:23922691
  38. 38. Besnard G, Bertrand JA, Delahaie B, Bourgeois YX, Lhuillier E, Thébaud C. Valuing museum specimens: high-throughput DNA sequencing on historical collections of New Guinea crowned pigeons (Goura). Biological Journal of the Linnean Society. 2016;117(1):71–82.
  39. 39. Kehlmaier C, Zhang X, Georges A, Campbell PD, Thomson S, Fritz U. Mitogenomics of historical type specimens of Australasian turtles: clarification of taxonomic confusion and old mitochondrial introgression. Scientific reports. 2019;9(1):1–12. pmid:30626917
  40. 40. Derkarabetian S, Benavides LR, Giribet G. Sequence capture phylogenomics of historical ethanol‐preserved museum specimens: Unlocking the rest of the vault. Molecular ecology resources. 2019;19(6):1531–44. pmid:31448547
  41. 41. Carpenter Meredith L, Buenrostro Jason D, Valdiosera C, Schroeder H, Allentoft Morten E, Sikora M, et al. Pulling out the 1%: Whole-Genome Capture for the Targeted Enrichment of Ancient DNA Sequencing Libraries. American Journal of Human Genetics. 2013;93(5):852–64. PMC3824117. pmid:24568772
  42. 42. Pinhasi R, Fernandes D, Sirak K, Novak M, Connell S, Alpaslan-Roodenberg S, et al. Optimal ancient DNA yields from the inner ear part of the human petrous bone. PloS one. 2015;10(6):e0129102. pmid:26086078
  43. 43. Hansen HB, Damgaard PB, Margaryan A, Stenderup J, Lynnerup N, Willerslev E, et al. Comparing Ancient DNA Preservation in Petrous Bone and Tooth Cementum. PloS one. 2017;12(1):e0170940. pmid:28129388
  44. 44. Allentoft ME, Sikora M, Sjogren K-G, Rasmussen S, Rasmussen M, Stenderup J, et al. Population genomics of Bronze Age Eurasia. Nature. 2015;522(7555):167–72. pmid:26062507
  45. 45. Damgaard PB, Margaryan A, Schroeder H, Orlando L, Willerslev E, Allentoft ME. Improving access to endogenous DNA in ancient bones and teeth. Sci Rep. 2015;5. pmid:26081994
  46. 46. Gamba C, Hanghøj K, Gaunitz C, Alfarhan AH, Alquraishi SA, Al‐Rasheid KA, et al. Comparing the performance of three ancient DNA extraction methods for high‐throughput sequencing. Molecular Ecology Resources. 2016;16(2):459–69. pmid:26401836
  47. 47. Gansauge M-T, Meyer M. Single-stranded DNA library preparation for the sequencing of ancient or damaged DNA. Nature protocols. 2013;8(4):737. pmid:23493070
  48. 48. Meyer M, Kircher M. Illumina Sequencing Library Preparation for Highly Multiplexed Target Capture and Sequencing. Cold Spring Harbor Protocols. 2010;2010(6):pdb.prot5448. pmid:20516186
  49. 49. Gamba C, Jones ER, Teasdale MD, McLaughlin RL, Gonzalez-Fortes G, Mattiangeli V, et al. Genome flux and stasis in a five millennium transect of European prehistory. Nat Commun. 2014;5. pmid:25334030
  50. 50. Margaryan A, Hansen HB, Rasmussen S, Sikora M, Moiseyev V, Khoklov A, et al. Ancient pathogen DNA in human teeth and petrous bones. Ecology and evolution. 2018;8(6):3534–42. pmid:29607044
  51. 51. Schubert M, Ermini L, Der Sarkissian C, Jónsson H, Ginolhac A, Schaefer R, et al. Characterization of ancient and modern genomes by SNP detection and phylogenomic and metagenomic analysis using PALEOMIX. Nature protocols. 2014;9(5):1056. pmid:24722405
  52. 52. Key FM, Posth C, Krause J, Herbig A, Bos KI. Mining metagenomic data sets for ancient DNA: recommended protocols for authentication. Trends in Genetics. 2017;33(8):508–20. pmid:28688671
  53. 53. Harkins KM, Stone AC. Ancient pathogen genomics: insights into timing and adaptation. Journal of Human Evolution. 2015;79:137–49. pmid:25532802
  54. 54. Marciniak S, Poinar HN. Ancient pathogens through human history: A paleogenomic perspective. Paleogenomics: Springer; 2018. p. 115–38.
  55. 55. Rasmussen S, Allentoft ME, Nielsen K, Orlando L, Sikora M, Sjögren K-G, et al. Early divergent strains of Yersinia pestis in Eurasia 5,000 years ago. Cell. 2015;163(3):571–82. pmid:26496604
  56. 56. Mühlemann B, Margaryan A, de Barros Damgaard P, Allentoft ME, Vinner L, Hansen AJ, et al. Ancient human parvovirus B19 in Eurasia reveals its long-term association with humans. Proceedings of the National Academy of Sciences. 2018;115(29):7557–62. pmid:29967156
  57. 57. Mühlemann B, Jones TC, de Barros Damgaard P, Allentoft ME, Shevnina I, Logvin A, et al. Ancient hepatitis B viruses from the Bronze Age to the Medieval period. Nature. 2018;557(7705):418. pmid:29743673
  58. 58. Cappellini E, Prohaska A, Racimo F, Welker F, Pedersen MW, Allentoft ME, et al. Ancient biomolecules and evolutionary inference. Annual review of biochemistry. 2018;87:1029–60. pmid:29709200
  59. 59. Greiman SE, Cook JA, Tkach VV, Hoberg EP, Menning DM, Hope AG, et al. Museum metabarcoding: a novel method revealing gut helminth communities of small mammals across space and time. International journal for parasitology. 2018;48(13):1061–70. pmid:30315762
  60. 60. Holmes I, Rabosky ARD. Natural history bycatch: a pipeline for identifying metagenomic sequences in RADseq data. PeerJ. 2018;6:e4662. pmid:29682427
  61. 61. Sørensen M, Rasmussen AR, Simonsen KP, editors. Enzymatic detection of formalin-fixed museum specimens for DNA analysis and enzymatic maceration of formalin-fixed specimens. Collection Forum; 2016: The Society for the Experimental Analysis of Behavior.
  62. 62. Uetz P, Freed P, Hošek J. Thamnophis sirtalis, The Reptile Database. Accessed on 10 of December 2019, Available at: http://wwwreptile-databaseorg. 2019.
  63. 63. Orlando L, Ginolhac A, Zhang G, Froese D, Albrechtsen A, Stiller M, et al. Recalibrating Equus evolution using the genome sequence of an early Middle Pleistocene horse. Nature. 2013;499(7456):74–8. pmid:23803765
  64. 64. Schubert M, Lindgreen S, Orlando L. AdapterRemoval v2: rapid adapter trimming, identification, and read merging. BMC research notes. 2016;9(1):1–7.
  65. 65. Warren WC, Wilson R. Thamnophis sirtalis isolate EDBJR-23777, whole genome shotgun sequencing project. https://wwwncbinlmnihgov/nuccore/LFLD000000001. 2015.
  66. 66. Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009;25(14):1754–60. pmid:19451168
  67. 67. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25(16):2078–9. pmid:19505943
  68. 68. Jónsson H, Ginolhac A, Schubert M, Johnson P, Orlando L. mapDamage2.0: fast approximate Bayesian estimates of ancient DNA damage parameters. Bioinformatics. 2013. pmid:23613487
  69. 69. Wood DE, Lu J, Langmead B. Improved metagenomic analysis with Kraken 2. Genome biology. 2019;20(1):257. pmid:31779668
  70. 70. Lu J, Breitwieser F, Thielen P, Salzberg S. Bracken: estimating species abundance in metagenomics data. PeerJ Comput Sci 3: e104. 2017.
  71. 71. Breitwieser FP, Salzberg SL. Pavian: Interactive analysis of metagenomics data for microbiomics and pathogen identification. BioRxiv. 2016:084715.
  72. 72. Ondov BD, Bergman NH, Phillippy AM. Interactive metagenomic visualization in a Web browser. BMC bioinformatics. 2011;12(1):385.
  73. 73. Simonsen KP, Rasmussen AR, Mathisen P, Petersen H, Borup F. A fast preparation of skeletal materials using enzyme maceration. Journal of Forensic Sciences. 2011;56(2):480–4. pmid:21342190
  74. 74. Pinheiro J, Bates D, DebRoy S, Sarkar D, Team RC. Linear and nonlinear mixed effects models. R package version. 2020;3(57):1–89.
  75. 75. McCormack JE, Tsai WL, Faircloth BC. Sequence capture of ultraconserved elements from bird museum specimens. Molecular ecology resources. 2016;16(5):1189–203. pmid:26391430
  76. 76. Blaimer BB, Lloyd MW, Guillory WX, Brady SG. Sequence capture and phylogenetic utility of genomic ultraconserved elements obtained from pinned insect specimens. PloS one. 2016;11(8):e0161531. pmid:27556533
  77. 77. Alibardi L. Adaptation to the land: the skin of reptiles in comparison to that of amphibians and endotherm amniotes. Journal of Experimental Zoology Part B: Molecular and Developmental Evolution. 2003;298(1):12–41. pmid:12949767
  78. 78. Casas-Marce M, Revilla E, Godoy JA. Searching for DNA in museum specimens: a comparison of sources in a mammal species. Molecular Ecology Resources. 2010;10(3):502–7. pmid:21565049
  79. 79. Poinar HN, Schwarz C, Qi J, Shapiro B, MacPhee RD, Buigues B, et al. Metagenomics to paleogenomics: large-scale sequencing of mammoth DNA. science. 2006;311(5759):392–4. pmid:16368896
  80. 80. Der Sarkissian C, Ermini L, Jónsson H, Alekseev A, Crubezy E, Shapiro B, et al. Shotgun microbial profiling of fossil remains. Molecular ecology. 2014;23(7):1780–98. pmid:24612293
  81. 81. Gansauge M-T, Meyer M. Selective enrichment of damaged DNA molecules for ancient genome sequencing. Genome research. 2014;24(9):1543–9. pmid:25081630
  82. 82. Margaryan A, Lawson DJ, Sikora M, Racimo F, Rasmussen S, Moltke I, et al. Population genomics of the Viking world. Nature. 2020;585(7825):390–6. pmid:32939067
  83. 83. de Barros Damgaard P, Martiniano R, Kamm J, Moreno-Mayar JV, Kroonen G, Peyrot M, et al. The first horse herders and the impact of early Bronze Age steppe expansions into Asia. Science. 2018;360(6396). pmid:29743352
  84. 84. Prüfer K, Racimo F, Patterson N, Jay F, Sankararaman S, Sawyer S, et al. The complete genome sequence of a Neanderthal from the Altai Mountains. Nature. 2014;505(7481):43–9. pmid:24352235
  85. 85. Gaunitz C, Fages A, Hanghøj K, Albrechtsen A, Khan N, Schubert M, et al. Ancient genomes revisit the ancestry of domestic and Przewalski’s horses. Science. 2018;360(6384):111–4. pmid:29472442
  86. 86. van der Valk T, Lona Durazo F, Dalén L, Guschanski K. Whole mitochondrial genome capture from faecal samples and museum‐preserved specimens. Molecular ecology resources. 2017;17(6):e111–e21. pmid:28675688
  87. 87. Cooper A. DNA from museum specimens. Ancient DNA: Springer; 1994. p. 149–65).
  88. 88. Kotrba M, Schilling L-H, editors. Measurement of pH in Ethanol, Distilled Water, and their Mixtures: On the Assessment of pH in Ethanol-Based Natural History Wet Collections and the Detrimental Aspects of Dilution with Distilled Water. Collection Forum; 2017: Soc. for the Pres. of Natural History Collections.
  89. 89. Kearney M, Stuart BL. Repeated evolution of limblessness and digging heads in worm lizards revealed by DNA from old bones. Proceedings of the Royal Society of London Series B: Biological Sciences. 2004;271(1549):1677–83. pmid:15306287
  90. 90. Stuart BL, Dugan KA, Allard MW, Kearney M. Extraction of nuclear DNA from bone of skeletonized and fluid‐preserved museum specimens. Systematics and Biodiversity. 2006;4(2):133–6.
  91. 91. Orlando L, Ginolhac A, Raghavan M, Vilstrup J, Rasmussen M, Magnussen K, et al. True single-molecule DNA sequencing of a pleistocene horse bone. Genome research. 2011;21(10):1705–19. pmid:21803858
  92. 92. Dabney J, Meyer M, Pääbo S. Ancient DNA damage. Cold Spring Harbor perspectives in biology. 2013;5(7):a012567. pmid:23729639
  93. 93. Sawyer S, Krause J, Guschanski K, Savolainen V, Pääbo S. Temporal patterns of nucleotide misincorporations and DNA fragmentation in ancient DNA. PloS one. 2012;7(3). pmid:22479540
  94. 94. Orlando L, Gilbert MTP, Willerslev E. Reconstructing ancient genomes and epigenomes. Nature Reviews Genetics. 2015;16(7):395–408. pmid:26055157
  95. 95. Kistler L, Ware R, Smith O, Collins M, Allaby RG. A new model for ancient DNA decay based on paleogenomic meta-analysis. Nucleic Acids Research. 2017;45(11):6310–20. pmid:28486705
  96. 96. Weiß CL, Schuenemann VJ, Devos J, Shirsekar G, Reiter E, Gould BA, et al. Temporal patterns of damage and decay kinetics of DNA retrieved from plant herbarium specimens. Royal Society open science. 2016;3(6):160239. pmid:27429780
  97. 97. Miething F, Hering S, Hanschke B, Dressler J. Effect of fixation to the degradation of nuclear and mitochondrial DNA in different tissues. Journal of Histochemistry & Cytochemistry. 2006;54(3):371–4. pmid:16260588
  98. 98. Gates KS. An overview of chemical processes that damage cellular DNA: spontaneous hydrolysis, alkylation, and reactions with radicals. Chemical research in toxicology. 2009;22(11):1747–60. pmid:19757819
  99. 99. Margaryan A, Noer CL, Richter SR, Restrup ME, Bülow‐Hansen JL, Leerhøi F, et al. Mitochondrial genomes of Danish vertebrate species generated for the national DNA reference database, DNAmark. Environmental DNA. 2020.
  100. 100. Miller W, Drautz DI, Janecka JE, Lesk AM, Ratan A, Tomsho LP, et al. The mitochondrial genome sequence of the Tasmanian tiger (Thylacinus cynocephalus). Genome research. 2009;19(2):213–20. pmid:19139089
  101. 101. Colston TJ, Noonan BP, Jackson CR. Phylogenetic analysis of bacterial communities in different regions of the gastrointestinal tract of Agkistrodon piscivorus, the cottonmouth snake. PLoS One. 2015;10(6):e0128793. pmid:26039313
  102. 102. McLaughlin RW, Cochran PA, Dowd SE. Metagenomic analysis of the gut microbiota of the Timber Rattlesnake, Crotalus horridus. Molecular Biology Reports. 2015;42(7):1187–95. pmid:25663091
  103. 103. Krishnankutty SP, Muraleedharan M, Perumal RC, Michael S, Benny J, Balan B, et al. Next-generation sequencing analysis reveals high bacterial diversity in wild venomous and non-venomous snakes from India. Journal of Venomous Animals and Toxins including Tropical Diseases. 2018;24(1):1–11. pmid:30598660
  104. 104. Tang W, Zhu G, Shi Q, Yang S, Ma T, Mishra SK, et al. Characterizing the microbiota in gastrointestinal tract segments of Rhabdophis subminiatus: Dynamic changes and functional predictions. MicrobiologyOpen. 2019;8(7):e00789. pmid:30848054
  105. 105. Qin Z, Wang S, Guo D, Zhu J, Chen H, Bai L, et al. Comparative analysis of intestinal bacteria among venom secretion and non-secrection snakes. Scientific reports. 2019;9(1):1–9. pmid:30626917
  106. 106. Nasko DJ, Koren S, Phillippy AM, Treangen TJ. RefSeq database growth influences the accuracy of k-mer-based lowest common ancestor species identification. Genome biology. 2018;19(1):1–10. pmid:29301551
  107. 107. Eisenhofer R, Weyrich LS. Assessing alignment-based taxonomic classification of ancient microbial DNA. PeerJ. 2019;7:e6594. pmid:30886779
  108. 108. Sedlar K, Kupkova K, Provaznik I. Bioinformatics strategies for taxonomy independent binning and visualization of sequences in shotgun metagenomics. Computational and Structural Biotechnology Journal. 2017;15:48–55. pmid:27980708
  109. 109. Warinner C, Herbig A, Mann A, Fellows Yates JA, Weiß CL, Burbano HA, et al. A robust framework for microbial archaeology. Annual review of genomics and human genetics. 2017;18:321–56. pmid:28460196
  110. 110. Plumb J, Evans J. Edwardsiella septicemia. Aquaculture Compendium. 2006.
  111. 111. Rose K, Agius J, Hall J, Thompson P, Eden J-S, Srivastava M, et al. Emergent multisystemic Enterococcus infection threatens endangered Christmas Island reptile populations. PloS one. 2017;12(7). pmid:28727845
  112. 112. Innis CJ, Braverman H, Cavin JM, Ceresia ML, Baden LR, Kuhn DM, et al. Diagnosis and management of Enterococcus spp infections during rehabilitation of cold-stunned Kemp’s ridley turtles (Lepidochelys kempii): 50 cases (2006–2012). Journal of the American Veterinary Medical Association. 2014;245(3):315–23. pmid:25029311
  113. 113. Falagas M, Kavvadia P, Mantadakis E, Kofteridis D, Bliziotis I, Saloustros E, et al. Morganella morganii infections in a general tertiary hospital. Infection. 2006;34(6):315. pmid:17180585
  114. 114. Wagener M, Naidoo M, Aldous C. Wound infection secondary to snakebite. South African Medical Journal. 2017;107(4):315–9. pmid:28395682
  115. 115. Chen C-M, Wu K-G, Chen C-J, Wang C-M. Bacterial infection in association with snakebite: a 10-year experience in a northern Taiwan medical center. Journal of Microbiology, Immunology and Infection. 2011;44(6):456–60.
  116. 116. Jorge M, Ribeiro L. Infections in the bite site after envenoming by snakes of the Bothrops genus. Journal of Venomous Animals and Toxins. 1997;3(2):264–72.
  117. 117. Parker S, Crump R, Hartzler H, Buller RM. Evaluation of taterapox virus in small animals. Viruses. 2017;9(8):203. pmid:28763036
  118. 118. Briggs AW, Stenzel U, Meyer M, Krause J, Kircher M, Pääbo S. Removal of deaminated cytosines and detection of in vivo methylation in ancient DNA. Nucleic acids research. 2010;38(6):e87–e. pmid:20028723
  119. 119. Fu Q, Li H, Moorjani P, Jay F, Slepchenko SM, Bondarev AA, et al. Genome sequence of a 45,000-year-old modern human from western Siberia. Nature. 2014;514(7523):445–9. pmid:25341783
  120. 120. Meyer M, Kircher M, Gansauge MT, Li H, Racimo F, Mallick S, et al. A high-coverage genome sequence from an archaic Denisovan individual. Science. 2012;338(6104):222–6. Epub 2012 Aug 30. pmid:22936568
  121. 121. Troll CJ, Kapp J, Rao V, Harkins KM, Cole C, Naughton C, et al. A ligation-based single-stranded library preparation method to analyze cell-free DNA and synthetic oligos. BMC genomics. 2019;20(1):1–14. pmid:30606130
  122. 122. Gansauge M-T, Gerber T, Glocke I, Korlević P, Lippik L, Nagel S, et al. Single-stranded DNA library preparation from highly degraded DNA using T4 DNA ligase. Nucleic acids research. 2017;45(10):e79–e pmid:28119419