Evaluation of full-length nanopore 16S sequencing for detection of pathogens in microbial keratitis

Liying Low; Pablo Fuentes-Utrilla; James Hodson; John D. O’Neil; Amanda E. Rossiter; Ghazala Begum; Kusy Suleiman; Philip I. Murray; Graham R. Wallace; Nicholas J. Loman; Saaeha Rauz

doi:10.7717/peerj.10778

Evaluation of full-length nanopore 16S sequencing for detection of pathogens in microbial keratitis

Liying Low ^1,2, Pablo Fuentes-Utrilla³, James Hodson⁴, John D. O’Neil⁵, Amanda E. Rossiter⁶, Ghazala Begum^5,7, Kusy Suleiman¹, Philip I. Murray^1,2, Graham R. Wallace^1,2, Nicholas J. Loman ³, Saaeha Rauz^1,2, West Midlands Collaborative Ophthalmology Network for Clinical Effectiveness & Research by Trainees (WM CONCERT)²

1Academic Unit of Ophthalmology, Institute of Inflammation and Ageing, University of Birmingham, Birmingham, West Midlands, UK

2Birmingham and Midland Eye Centre, Sandwell and West Birmingham Hospitals National Health Service (NHS) Trust, Birmingham, West Midlands, UK

3MicrobesNG/School of Biosciences, University of Birmingham, Birmingham, West Midlands, UK

4Queen Elizabeth Hospital, University Hospitals Birmingham NHS Foundation Trust, Birmingham, West Midlands, UK

5Institute of Inflammation and Ageing, University of Birmingham, Birmingham, West Midlands, UK

6Institute of Microbiology and Infection, University of Birmingham, Birmingham, West Midlands, UK

7National Institute for Health Research Surgical Reconstruction and Microbiology Research Centre, Birmingham, UK

DOI: 10.7717/peerj.10778

Published: 2021-02-15
Accepted: 2020-12-22
Received: 2020-05-04

Academic Editor: Hossein Khiabanian

Subject Areas: Bioinformatics, Genomics, Microbiology, Infectious Diseases, Ophthalmology
Keywords: Nanopore sequencing, Eye infection, Microbial keratitis, Full length 16S rRNA sequencing, Cornea infection, Eye swab, 16S bioinformatics, Corneal infection, Ophthalmology, Molecular diagnostics

Copyright: © 2021 Low et al.
Licence: This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.

Cite this article: Low L, Fuentes-Utrilla P, Hodson J, O’Neil JD, Rossiter AE, Begum G, Suleiman K, Murray PI, Wallace GR, Loman NJ, Rauz S, West Midlands Collaborative Ophthalmology Network for Clinical Effectiveness & Research by Trainees (WM CONCERT). 2021. Evaluation of full-length nanopore 16S sequencing for detection of pathogens in microbial keratitis. PeerJ 9:e10778 https://doi.org/10.7717/peerj.10778

The authors have chosen to make the review history of this article public.

Abstract

Background

Microbial keratitis is a leading cause of preventable blindness worldwide. Conventional sampling and culture techniques are time-consuming, with over 40% of cases being culture-negative. Nanopore sequencing technology is portable and capable of generating long sequencing reads in real-time. The aim of this study is to evaluate the potential of nanopore sequencing directly from clinical samples for the diagnosis of bacterial microbial keratitis.

Methods

Using full-length 16S rRNA amplicon sequences from a defined mock microbial community, we evaluated and benchmarked our bioinformatics analysis pipeline for taxonomic assignment on three different 16S rRNA databases (NCBI 16S RefSeq, RDP and SILVA) with clustering at 97%, 99% and 100% similarities. Next, we optimised the sample collection using an ex vivo porcine model of microbial keratitis to compare DNA recovery rates of 12 different collection methods: 21-gauge needle, PTFE membrane (4 mm and 6 mm), Isohelix^™ SK-2S, Sugi^® Eyespear, Cotton, Rayon, Dryswab^™, Hydraflock^®, Albumin-coated, Purflock^®, Purfoam and Polyester swabs. As a proof-of-concept study, we then used the sampling technique that provided the highest DNA recovery, along with the optimised bioinformatics pipeline, to prospectively collected samples from patients with suspected microbial keratitis. The resulting nanopore sequencing results were then compared to standard microbiology culture methods.

Results

We found that applying alignment filtering to nanopore sequencing reads and aligning to the NCBI 16S RefSeq database at 100% similarity provided the most accurate bacterial taxa assignment. DNA concentration recovery rates differed significantly between the collection methods (p < 0.001), with the Sugi^® Eyespear swab providing the highest mean rank of DNA concentration. Then, applying the optimised collection method and bioinformatics pipeline directly to samples from two patients with suspected microbial keratitis, sequencing results from Patient A were in agreement with culture results, whilst Patient B, with negative culture results and previous antibiotic use, showed agreement between nanopore and Illumina Miseq sequencing results.

Conclusion

We have optimised collection methods and demonstrated a novel workflow for identification of bacterial microbial keratitis using full-length 16S nanopore sequencing.

Introduction

Microbial keratitis is a leading cause of preventable blindness worldwide and is the most common cause of acute medical ophthalmology admission (Centers for Disease Control and Prevention, 2014). Bacterial keratitis accounts for the majority of the microbial keratitis cases in the Western hemisphere, with a preponderance for Gram-positive organisms such as Staphylococcus aureus and Streptococcus pneumoniae, and to a lesser extent Gram-negative organisms such as Pseudomonas aeruginosa and Klebsiella pneumoniae (Ting et al., 2018; Tan et al., 2017; Ibrahim, Boase & Cree, 2009; Lichtinger et al., 2012). Prognosis is dependent on early identification of the causative organism and initiation of appropriate treatment, including antibiotic therapy (Austin, Lietman & Rose-Nussbaumer, 2017). Conventional sampling and culture technique is time-consuming, with culture results taking 48 hours and antimicrobial sensitivity results taking up to 5 days (Maurer et al., 2017). Approximately 40% of clinically suspected microbial keratitis cases are culture-negative (Sugita et al., 2013; Tananuvat et al., 2012), leading to the widespread initial implementation of empirical broad-spectrum antimicrobial therapeutic protocols, increasing the risk of antimicrobial resistance and poorer patient outcomes (Goldstein, Kowalski & Gordon, 1999), particularly in patients who do not have ready access to diagnostic laboratories, such as those in poor and low income countries and deployed military personnel (Musa et al., 2010).

Culture-independent molecular techniques such as polymerase chain reaction (PCR) has been utilised in the diagnosis of ocular infections (Kim et al., 2008), however, this requires a priori knowledge of the likely pathogenic micro-organism to determine which specific primer sets to use and is limited by the number of species that can be detected simultaneously in a single PCR assay (Bispo et al., 2018; Ung et al., 2020). Newer, high-throughput sequencing approaches can be broadly classified into two major options: targeted amplicon sequencing (selective amplification of the specific genetic region of interest such as 16S rRNA in bacteria or 18S rRNA in fungi) and metagenomic sequencing (untargeted amplification of all genomic DNA) (Ung et al., 2020). Untargeted metagenomic sequencing allows for the discovery of unexpected, novel pathogens, such as Vittaforma corneae in infectious conjunctivitis (Lalitha et al., 2019) and Torque teno virus in culture-negative endophthalmitis (Lee et al., 2015). Deep metagenomic sequencing techniques have enabled phylogenetic analysis of the temporal and geographic origin of ocular infection (Doan et al., 2016b; Kirstahler et al., 2018).

Several challenges have hindered the adoption of sequencing technologies in routine clinical practice for microbial keratitis, including sample collection methods that are frequently ineffective (Kaye et al., 2003) compounded by the low abundance of pathogens (Ung et al., 2020), high contamination from background host DNA or laboratory reagents (Gu, Miller & Chiu, 2019), lack of standardisation in sequencing and bioinformatics processing methods (Chiu & Miller, 2019). Sampling and DNA extraction methodology significantly impact upon the downstream sequencing data in samples with low biomass (Douglas et al., 2020; Sui et al., 2020). Microbial cells within the sample must be sufficiently lysed to liberate its DNA content, and this is particularly challenging for thick-walled microorganisms whereby mechanical disruption method such as bead-beating in conjunction with heat, chemical or enzymatic treatment is required in the DNA extraction protocol (De Boer et al., 2010; Ojo-Okunola et al., 2020). Clinical samples have high host DNA content, usually constituting more than 90% of the sequences, with relatively low abundance of pathogen DNA, and therefore requiring greater depth of sequencing (Ung et al., 2020; Lalitha et al., 2019). The prohibitively high running costs of large sequencing platforms mean that they are only available in select centres, and samples are generally pooled for batch processing, resulting in delayed turnaround time. Efforts have been directed to reduce the amount of host DNA present in clinical samples, especially for samples that can be obtained in abundance, such as saliva, sputum, and bronchoalveolar lavage and blood (Charalampous et al., 2019; Marotz et al., 2018; Feehery et al., 2013). However these techniques are still not compatible with a generic method that can be applied across all the different types of clinical samples.

Marker genes using conserved, housekeeping regions of the genome interspersed with variable regions have been utilised to infer phylogenetic links and microbial taxonomy (Clarridge, 2004; Raina et al., 2019). The universal 16S ribosomal RNA (rRNA) gene sequence in bacteria, which is approximately 1,550 base pairs long, composed of a highly conserved region interspersed with nine variable regions (V1-9), is the most commonly used marker gene for assessing bacterial profiles (Clarridge, 2004; Akram et al., 2017; Achtman & Wagner, 2008). Amplification of the 16S rRNA region by PCR reduces the background host contamination and by only sequencing a smaller part of the genome, this drastically reduces the sequencing depth requirements and cost. The vast majority of clinical studies have only sequenced part of the 16S gene, ranging from the single variable region of V4, V6 to three variable regions V1-3 or V3-5, because the widely used Illumina sequencing platform is only capable of producing short reads of less than 500 bases (Johnson et al., 2019; Holm et al., 2019). This short-read sequencing further compounds the taxonomic resolution of 16S rRNA sequencing, which is typically limited to genus-level resolution (Achtman & Wagner, 2008). Choice of the hypervariable region affects the taxonomic assignment, for example the V4 hypervariable region providing better whole bacterial diversity in human gut microbiome studies whereas the V1-V2 hypervariable region is more specific for skin microbiota profiling (Santos et al., 2020). Several studies have shown that full-length 16S rRNA reads provides better taxonomic resolution compared to reads that only target a certain region of the 16S rRNA gene (Winand et al., 2020; Nygaard et al., 2020). The choice of 16S reference database also impacts upon the taxonomic assignment (Nygaard et al., 2020; Park & Won, 2018; Szabó et al., 2016). For example, the expanded Human Oral Microbiome Database contains references to microbes specifically from the aerodigestive tract whilst the National Center for Biotechnology Information (NCBI) 16S RefSeq, the Ribosomal Database Project (RDP) and the SILVA rRNA databases include taxa from all sources (human and non-human hosts) and the environment (RefSeq, 2020; Cole et al., 2014; Quast et al., 2013; Escapa et al., 2018).

With the introduction of long-read sequencing technologies, such as the Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) sequencing platforms, we are now able to sequence in real-time the full-length of the 16S rRNA gene (V1–V9 regions) (Johnson et al., 2019; Winand et al., 2020). Major advantages of using nanopore sequencing include portability (pocket-sized ONT MinION sequencer vs large PacBio sequencer), relatively low cost (~£140 per sample by multiplexing 12 samples per run and £820 for the ONT MinION vs ~£280 per sample and £390,000 for the PacBio Sequel II System) and rapid workflow (less than an hour from sample preparation to data analysis with nanopore sequencing vs 3.5 hours with PacBio) (De Maio et al., 2019; 16S Sequencing and Analysis, 2020; Ashton et al., 2015; PacBio Sequel Systems, 2020; Workflow-PacBio, 2020). Nanopore sequencing has been utilised for on-the-field, real-time genomic surveillance of Ebola (Quick et al., 2016) and Zika (Quick et al., 2017; Faria et al., 2017) viruses. In addition, nanopore sequencing has also been used for pathogen detection in proof-of-concept clinical studies of sepsis (Leggett et al., 2020), lower respiratory tract (Charalampous et al., 2019; Yang et al., 2019), urinary tract (Schmidt et al., 2017) and prosthetic joint infections (Sanderson et al., 2018). However, there is relatively high error rates in nanopore sequencing (~95% raw read accuracy for nanopore) (Rang, Kloosterman & De Ridder, 2018), which affects the discriminatory power of 16S rRNA gene for species level classification (Winand et al., 2020). Strategies to reduce nanopore sequencing error rate are constantly evolving with improvements to the pore chemistry and basecalling software (Rang, Kloosterman & De Ridder, 2018).

The aim of this study was to evaluate the potential of full-length 16S nanopore sequencing directly from clinical samples of bacterial microbial keratitis, focussing on the optimisation of the DNA extraction and bioinformatics pipeline to make 16S nanopore sequencing feasible. Firstly, using a defined mock microbial community, we evaluated and benchmarked our bioinformatics analysis pipeline for taxonomic assignment. Then, we optimised the sample collection using an ex vivo porcine model of microbial keratitis. Next, we performed a proof-of-concept study in which we applied the sampling technique that provided the highest DNA recovery and the optimised bioinformatics pipeline to prospectively collected samples from patients with suspected microbial keratitis, comparing the nanopore sequencing results to standard microbiology culture methods (Fig. 1).

Figure 1: Overview of study workflow.
Study workflow starting from in silico studies for bioinformatics benchmarking on defined mock community, ex vivo study for optimising eye swab/collection methods and proof-of-concept clinical study on patients with microbial keratitis. Abbreviations: *Staphylococcus aureus (S. aureus); Klebsiella pneumoniae (K. pneumoniae); Enterococcus avium (E. avium);* National Center for Biotechnology Information (NCBI); Ribosomal Database Project (RDP); quantitative polymerase chain reaction (qPCR).

Download full-size image

DOI: 10.7717/peerj.10778/fig-1

Materials and Methods

Mock bacterial community

Bacterial species representative of the spectrum of microbial keratitis causative Gram-positive and Gram-negative pathogens (Staphylococcus aureus, Klebsiella pneumoniae and Enterococcus avium (previously classified as group D Streptococcus)) were used to define the mock bacterial community. Klebsiella pneumoniae and Staphylococcus aureus were grown overnight at 37 °C in lysogeny broth (LB) whilst Staphylococcus aureus was grown in Brain Heart Infusion (BHI) broth. Negative controls consisted of the LB and BHI broth without any inoculum, respectively. Overnight cultures were diluted into fresh medium to an optical density 600 nm (OD₆₀₀) of 0.05 and incubated at 37 °C with aeration. The culture OD₆₀₀ was measured every 30 min for 5 h using spectrophotometer (Ultrospec^™ 2100 pro, Amersham Biosciences, UK). For enumeration of bacteria, cultures were plated onto agar plates (Enterococcus avium, blood agar; Staphylococcus aureus and Klebsiella pneumoniae, LB agar respectively) and incubated overnight at 37 °C. The colony-forming units (CFUs) for each species were enumerated the following day. Bacterial growth curves were plotted and the mid-exponential growth phases of the samples were taken. The mock bacterial community consisted of 1 × 10⁵ CFU/ml each of Klebsiella pneumoniae, Enterococcus avium and Staphylococcus aureus.

Ex vivo porcine model of microbial keratitis

Freshly enucleated porcine eyes (Sus scrofa domestica) were obtained as a by-product of the meat industry and transported to the laboratory under storage at 4 °C. Each eye was disinfected with Povidone-iodine 10% w/w for 1 min, followed by two rinses of sterile 0.9% Sodium Chloride for 1 min, and placed in an individual chamber of a sterile 6-well culture plate (Sigma–Aldrich, Merck KGaA, Darmstadt, Germany). Using stereoscopic surgical loupes, a 4mm trephine punch (Acu-Punch^®, Acuderm, Fort Lauderdale, FL, USA) was used to create a single central anterior stromal corneal lesion (with debridement of the central 4 mm). The area was sampled with the respective swabs/collection methods pre-inoculation, as a background control. Each eye was then inoculated with 20 µL of 1 × 10⁵ CFU/ml each of the mock community (Enterococcus avium, Staphylococcus aureus and Klebsiella pneumoniae). The amount of inoculum was determined according to the estimated density of bacteria encountered in vivo in clinical infections (Kaye et al., 2003). The area was re-sampled with the respective collection methods after 30 min to prevent bacterial overgrowth, and placed immediately into a ZR BashingBead^™ Lysis Tube containing 750 µl of DNA Shield^™ (Zymo Research, Irvine, CA, USA) and stored at −80 °C until DNA extraction.

Collection method/swabs

The different collection methods evaluated are Isohelix^™ SK-2S, Sugi^® Eyespear, Cotton, Rayon, Dryswab^™, Hydraflock, Albumin-coated, Purflock, Purfoam, Polyester swabs, 21-gauge needle, and Polytetrafluoroethylene (PTFE) membranes (Table 1). Sampling order was varied. A total of 60 ex-vivo porcine eye models were used. Each sampling replicate was performed on a separate eye. Swabs were not pre-moistened, as a previous pilot study that we have conducted showed that dry swabs provided higher DNA yield, compared to pre-moistened swabs (Table S1).

Table 1:

Summary of the collection methods used.

Name	Manufacturer	Composition of bud	Shaft	Catalogue number
Isohelix^™ SK-2S	Isohelix, Kent, UK	Viscose rayon	–	SK-2S
Sugi^® Eyespear	Kettenbach GmbH and Co. KG, Escehnburg, Germany	Cotton and cellulose	–	30901
MW1021 Dryswab^™ Rayon	MWE Medical Wire, Corsham, UK	Rayon	Plastic	MW1021
MW1041 Cotton	MWE Medical Wire, Corsham, UK	Cotton	Wood	MW1041
MW1021D Dryswab^™ Polyester	MWE Medical Wire, Corsham, UK	Polyester	Plastic	MW1021D
MW821 Dryswab^™ Flock	MWE Medical Wire, Corsham, UK	Purflock^®	Plastic	MW831
MW100 Fine tip Dryswab^™ Rayon	MWE Medical Wire, Corsham, UK	Rayon	Plastic	MW100
MW130 Hospiswab^™ Albumin	MWE Medical Wire, Corsham, UK	Albumin coated	Wood	MW130
MW840 Hydraflock^® Plastic	MWE Medical Wire, Corsham, UK	Hydraflock^®	Plastic	MW840
MW946 Sigma Swab^® Purfoam	MWE Medical Wire, Corsham, UK	Purfoam	Plastic	MW946
BD Needle 21G	Becton-Dickinson (BD) & Co, New Jersey, USA	Stainless steel	–	BD301155
Biopore^® PTFE 6 mm	Merck & Co, New Jersey, USA	PTFE	–	BGCM00010
Biopore^® PTFE 4 mm	Merck & Co, New Jersey, USA	PTFE	–	BGCM00010

DOI: 10.7717/peerj.10778/table-1

Note:

Becton-Dickinson (BD); Medical Wire and Equipment (MWE); Polytetrafluoroethylene (PTFE).

DNA extraction

DNA from the mock community, conjunctival swabs and negative controls were extracted using ZymoBIOMICS DNA Miniprep kit (Zymo Research, Irvine, CA, USA) according to the manufacturer’s instructions. Negative control DNA extraction was performed on reagents without a DNA template and also on sterile 0.9% Sodium Chloride that was used to clean the porcine eyes, whilst positive control DNA extraction was performed on 20 µL of 1 × 10⁵ CFU/ml each of Enterococcus avium, Staphylococcus aureus and Klebsiella pneumoniae inserted directly into the lysis tube containing 750 µl of DNA Shield^™.

DNA concentration was determined fluorometrically using a Qubit dsDNA high-sensitivity assay (Thermo Fisher Scientific, Waltham, MA, USA).

16S qPCR for 16S and beta-actin

16S qRT-PCR for 16S and beta-actin genes was done to compare the host to microbial DNA ratio across the different sampling methods.

The primers used to amplify the 16S rRNA gene were: forward primer 341F 5′-CCTACGGGAGGCAGCAG-3′, and reverse primer 534R 5′-ATTACCGCGGCTGCTGGCA-3′. These primers are complementary to the conserved regions in the 16S rRNA gene, nucleotide positions 290–484 in Escherichia coli, producing a fragment of 195 bp.

The primers used to amplify the beta-actin Sus scrofa gene were: Forward primer 5′-CCAAGCCTGGACTACCTCCT-3′, and reverse primer 5′-AAACCTGGAGAGGTTCACCG-3′. These primers were complementary to the Sus scrofa actin beta (ACTB) transcript RNA gene, spanning the nucleotide positions 1,348–1,535, producing an amplicon length of 188 bp.

Primers were synthesized by Eurofins Genomics (Ebersberg, Germany). The final PCR mix contained 0.4 μl each of forward and reverse primers (total concentration of 0.4 nmol each), 5 μl of SYBR Green Master Mix (TaKaRa BioTech Corporation, Dalian, China), 3.2 μl of UltraPure^™ DNase/RNase-Free distilled water (ThermoFisher Scientific, Waltham, MA, USA) and 1 μl of unamplified genomic DNA, giving a final reaction volume of 10 μl. All samples were performed in triplicate. qPCR was performed using the Roche LightCycler^®480 Instrument (Roche Diagnostics, Meyland, France) on the following programme: 1 cycle of 30 s at 95 °C, 40 cycles of 10 s at 95 °C, 30 s at 59 °C and 20 s at 78 °C.

16S gene copy number quantification

The genomic 16S copy number was quantified using Eubacteria 16S Ribosomal gene genesig^® Standard kit (Primerdesign^™ Ltd, Camberley, UK). The final PCR mix contained 10 μl of PrecisionPLUS 2X qPCR Master Mix, 1 μl of Eubacteria/probe mix and 5 μl of unamplified genomic DNA, and 4 μl of RNase/DNase free water giving a final reaction volume of 15 μl. Samples were divided and performed in triplicates. For the standard curve, the positive control template (16S Eubacteria) was serially diluted 10-fold to obtain copy numbers ranging from 2 × 10⁵ to 2 copy/μL. Quantitative PCR was performed with the Roche LightCycler^®480 Instrument (Roche Diagnostics, Meyland, France) on the following programme: 1 cycle of 2 min at 95 °C, 50 cycles of 10 s at 95 °C and 1 min at 60 °C. The C_T number of experimental samples were interpolated against the standard curve to calculate the corresponding 16S copy number.

Nanopore 16S barcoding, sequencing and bioinformatics

The full-length 16S rRNA genes (16S sequencing primers: 27F-AGAGTTTGATCMTGGCTCAG; 1492R-CGGTTACCTTGTTACGACTT) were amplified and samples with amplicons above 1nM were subsequently sequenced on GridION (ONT, Oxford, UK) using the R9 flow cell (FLO-MIN106; ONT, Oxford, UK), as per manufacturer’s protocol. Raw sequence reads were basecalled using ONT’s MinKNOW software Guppy v.3.2.4 with the R9.4 high accuracy model. Raw reads were demultiplexed using qcat version 1.0.1. Sequence summary and read length histograms were generated using NanoPlot version 1.30.1. 16S rRNA databases were obtained from NCBI 16S RefSeq (RefSeq, 2020) (Nucleotide search details: 33,175 (BioProject) or 33,317 (BioProject)); the RDP (Cole et al., 2014) release 11, update 5; and the SILVA rRNA database project (Cole et al., 2014) version 132 repositories respectively. Differences in the characteristics of the three databases are summarised in Table S2. Reference sequences from the 16S rRNA databases were then clustered into 97%, 99% and 100% similarity thresholds using heuristic clustering method (greedy incremental clustering algorithm) on CD-HIT (Fu et al., 2012) version 4.8.1 (commands: -c 0.97, 0.99, 1.0, -M 62900, -d 250). We used Minimap2 (Li, 2018) version 2.12-r87 (commands: -K 100M, -ax map-ont) to align the demultiplexed reads to the respective 16S databases and processed the resulting files using the Sequence Alignment/Map (SAM)tools (Li et al., 2009) version 1.9 (commands: samtools view -b -F 2308 (to remove unmapped, non-primary and supplementary reads), samtools sort, samtools index, samtools idxstats). Bioinformatics scripts and the FASTQ files are available at DOI 10.6084/m9.figshare.13213898.v1.

Clinical sample collection from patients presenting with microbial keratitis

Patients (n = 2) presenting to the Birmingham and Midland Eye Centre, Birmingham, West Midlands, UK with suspected microbial keratitis affecting one eye were recruited to the study. Informed written consent was obtained. The research followed the tenets of the Declaration of Helsinki and was approved by the Health Research Authority Ethics Committee (Rapid Diagnosis of Ocular Infections (RADAR); Reference: 11/EM/0274).

Standard clinical microbiology corneal scrape culture results were compared with nanopore sequencing. For each patient, corneal scrapes were taken for standard clinical microbiology as per routine clinical practice (Lin et al., 2019). Corneal scrapes were taken at the base and edge of the lesion, inoculated onto blood, chocolate and Sabaraud’s agar plates, and processed at the local clinical microbiology laboratory (The agar plates were incubated for 5 days—blood and chocolate (6% CO₂ at 37 °C); Sabourand’s (air at 30 °C)). The collection method with the highest DNA recovery in the ex vivo porcine study was used to collect corneal, conjunctival and negative control samples from the patients. Swabs were taken from the affected cornea and unaffected contralateral conjunctiva, together with a negative control ‘air swab’ of the examination room at the time point of participant sampling, to exclude contamination. Swabs were placed immediately into a ZR BashingBead^™ Lysis Tube containing 750 ul of DNA Shield^™ (Zymo Research, Irvine, CA, USA) and stored at −80 °C until DNA extraction, as described above. Sequence data were compared with the corresponding microbiology culture results of the hospital corneal scrapes. In cases where there was no concordance between culture and nanopore sequencing results, Illumina MiSeq 16S rRNA V4 sequencing was performed.

Miseq 16S sequencing and bioinformatics

The V4 variable region of the 16S rRNA gene was amplified from DNA extracts using primer sets (515F: GTGCCAGCMGCCGCGGTAA; and 806R: GGACTACHVGGGTWTCTAAT) and sequenced on the Illumina MiSeq platform (Kozich et al., 2013). Raw data were filtered and analysed with Mothur and QIIME software packages, as previously described (Kozich et al., 2013; Caporaso et al., 2010).

Statistical analysis

Comparisons of DNA concentrations and C_T across the collection materials were performed using Kruskal-Wallis tests. The mean ranks for the DNA concentrations were then used to sort the collection material in order, and data were summarised using medians and ranges. All analyses were performed using IBM SPSS 22 (IBM Corp. Armonk, NY, USA), with p < 0.05 deemed to be indicative of statistical significance throughout.

Results

Evaluation of bioinformatics analysis pipeline on defined mock microbial community

To evaluate the different methods of performing taxonomic assignment for the most reliable bacterial taxonomic assignment on 16S rRNA nanopore sequencing reads, we compared the effects of removal of unmapped, non-primary and supplementary reads on the taxonomic identification of the mock community (Table 2; Figs. S1 and S2). Removal of unmapped, non-primary and supplementary reads provided a more accurate taxonomic assignment of the mock community.

Table 2:

Comparing the total mapped reads and taxa identification of reads that have not been filtered vs. reads that have been filtered to remove unmapped, non-primary and supplementary reads.

Without removal of unmapped, non-primary and supplementary reads			With removal of unmapped, non-primary and supplementary reads
Total alignments	705,919		Total alignments	117,827
Taxa	#Mapped	Relative abundance	Taxa	#Mapped	Relative abundance
Enterococcus avium*	204,171	28.92	Enterococcus avium*	64,172	54.46
Klebsiella pneumoniae*	183,274	25.96	Klebsiella pneumoniae*	32,248	27.37
Enterococcus pseudoavium	81,810	11.59	Staphylococcus aureus*	14,772	12.54
Staphylococcus aureus*	72,691	10.30	Enterococcus malodoratus	913	0.77
Enterococcus malodoratus	46,019	6.52	Enterococcus pseudoavium	868	0.74
Enterococcus devriesei	21,978	3.11	Enterococcus viikkiensis	535	0.45
Enterococcus viikkiensis	15,614	2.21	Enterococcus hirae	410	0.35
Staphylococcus simiae	13,980	1.98	Enterococcus devriesei	294	0.25
Enterococcus gilvus	7,862	1.11	Enterococcus durans	203	0.17
Enterococcus hirae	6,380	0.90	Enterobacter cloacae	170	0.14
Taxa			Taxa
Enterococcus	409,683	58.04	Enterococcus	68,665	58.28
Klebsiella	187,581	26.57	Klebsiella	32,482	27.57
Staphylococcus	90,775	12.86	Staphylococcus	15,184	12.89
Serratia	3,078	0.44	Enterobacter	214	0.18
Enterobacter	3,011	0.43	Serratia	165	0.14
Pectobacterium	1,238	0.18	Pectobacterium	124	0.11
Streptococcus	763	0.11	Streptococcus	83	0.07
Citrobacter	653	0.09	Citrobacter	76	0.06
Bacillus	601	0.09	Bacillus	73	0.06
Lactobacillus	579	0.08	Lactobacillus	51	0.04

DOI: 10.7717/peerj.10778/table-2

Note:

* Mock microbial community in the positive control sample—Enterococcus avium, Staphylococcus aureus and Klebsiella pneumoniae Reads were mapped against the unclustered NCBI 16S database Relative abundance is defined as the number of reads mapped to the taxa divided by the total number of mapped reads.

Clustering of reference databases by a threshold of sequence similarity may reduce search time and make mapping faster (Li, Jaroszewski & Godzik, 2002). To determine the effects of clustering, the reference sequences of three different 16S databases were reviewed (the NCBI Reference Sequence, NCBI 16S RefSeq; the RDP; and the SILVA ribosomal RNA gene database) (Table 3). We found that for all three databases, clustering at a threshold of 99% and above provided more accurate operational taxonomic unit assignment (OTU) compared to clustering at 97% similarity. The NCBI 16S RefSeq database with reference sequences clustered at 100% similarity provided the most accurate OTU assignment, with more than 95% of the reads corresponding to the mock community. Based on these results, we chose to align our sequencing reads to the NCBI 16S RefSeq database at 100% similarity (reference database sequences clustered at 100% similarity) and removed any unmapped, non-primary or secondary alignments.

Table 3:

Comparing the effects of using NCBI 16S, RDP and SILVA databases at different levels of clustering (97%, 99% and 100% similarity).

NCBI 16S RefSeq				RDP				SILVA
Number of reads	Cluster	#Mapped	Relative abundance	Number of reads	Cluster	#Mapped	Relative abundance	Number of reads	Cluster	#Mapped	Relative abundance
482,900	97% similarity			483,062	97% similarity			481,985	97% similarity
	Enterococcus hirae	183,027	37.90		Enterobacter soli	173,164	35.85		Klebsiella quasipneumoniae	114,637	23.78
	Enterobacter soli	172,739	35.77		Enterococcus faecium	123,393	25.54		Enterococcus faecium	109,825	22.79
	Staphylococcus aureus*	99,582	20.62		Staphylococcus aureus*	99,695	20.64		Staphylococcus aureus*	99,153	20.57
	Salmonella enterica	7,266	1.50		Enterococcus asini	21,726	4.50		Klebsiella pneumoniae*	54,508	11.31
	Enterococcus casseliflavus	2,761	0.57		Enterococcus phoeniculicola	19,122	3.96		Enterococcus casseliflavus	34,290	7.11
483,211	99% similarity			483,374	99% similarity			482,392	99% similarity
	Klebsiella pneumoniae*	184,328	38.15		Klebsiella pneumoniae*	187,465	38.78		Klebsiella pneumoniae*	164,929	34.19
	Enterococcus avium*	173,319	35.87		Enterococcus avium*	182,655	37.79		Enterococcus avium*	109,569	22.71
	Staphylococcus aureus*	97,455	20.17		Staphylococcus aureus*	96,923	20.05		Staphylococcus aureus*	98,477	20.41
	Enterococcus hirae	4,975	1.03		Enterococcus malodoratus	1,281	0.27		Enterococcus devriesei	39,961	8.28
	Enterococcus malodoratus	4,401	0.91		Enterobacter cloacae	1,235	0.26		Enterococcus raffinosus	24,276	5.03
483,382	100% similarity			483,432	100% similarity			482,599	100% similarity
	Klebsiella pneumoniae*	187,745	38.84		Klebsiella pneumoniae*	179,200	37.07		Klebsiella pneumoniae*	184,252	38.18
	Enterococcus avium*	175,497	36.31		Enterococcus avium*	176,822	36.58		Enterococcus avium*	112,163	23.24
	Staphylococcus aureus*	97,505	20.17		Staphylococcus argenteus	90,126	18.64		Staphylococcus aureus*	98,461	20.40
	Enterococcus pseudoavium	3,758	0.78		Staphylococcus aureus*	8,510	1.76		human gut	55,245	11.45
	Enterococcus malodoratus	3,279	0.68		Klebsiella quasipneumoniae	7,614	1.57		Enterococcus gilvus	4,491	0.93

DOI: 10.7717/peerj.10778/table-3

Note:

* Mock microbial community in the positive control sample—Enterococcus avium, Staphylococcus aureus and Klebsiella pneumoniae.

Optimising sampling technique for collection of microbial keratitis on ex-vivo porcine model

To optimise the sampling technique on ex-vivo porcine model of microbial keratitis, comparisons were made between the different collection materials, both pre- and post-inoculation. Pre-inoculation, the average C_T of 16S was not found to differ significantly between the collection materials (p = 0.909) (Table S3). Post-inoculation, the DNA concentration was found to differ significantly across the collection materials (p < 0.001) (Table S4; Fig. S3). The highest concentrations were observed for the Sugi^® Eyespear and Isohelix^™ SK-2S, with medians of 82.2 and 96.6 ng/ul respectively, whilst the lowest concentrations were observed for the PTFE 4mm, at a median of 0.19 ng/ul. For the average C_T, no significant difference across the collection methods was detected for 16S (p = 0.242). The 16S: β-Actin ratio was consistent across all of the collection materials (p = 1.000).

Six out of the twelve collection methods provided sufficient DNA yield for 16S Nanopore sequencing and 16S copy number quantification—Sugi^® Eyespear, Isohelix^™ SK-2S, MW1021D Dryswab^™ Polyester, MW130 Hospiswab^™ Albumin, MW1041 Cotton and MW1021 Dryswab^™ Rayon (Fig. 2). All three species of the mock microbial community inoculated onto the ex-vivo porcine eyes were detectable on nanopore sequencing. The Sugi^® Eyespear swab was chosen for patient sample collection as it provided the highest mean rank DNA concentration that was sufficient for nanopore sequencing.

Figure 2: Comparison of swab collection yield, defined by relative abundances of mock community normalised by 16S copy number.

Download full-size image

DOI: 10.7717/peerj.10778/fig-2

Proof-of-concept clinical study on microbial keratitis patient samples

The optimised sampling technique and bioinformatics pipeline were applied directly on two test patients with suspected microbial keratitis. Corneal, conjunctival and negative control swabs were collected from two consecutive patients presenting to the emergency department (Table 4; Table S5). Patient A had a majority of reads (497,995 out of total 613,482 reads, 81.2%) mapping to Serratia marcescens, which was in agreement with culture results. Patient B, who had been started on topical antibiotics prior to sampling, had no growth on culture after 5 days, but the majority of reads mapped to Bacillus subtilis (33,188 out of a total of 92,810 reads, 35.8%) on nanopore sequencing. Further Illumina Miseq 16S rRNA sequencing confirmed the presence of Bacillus in the affected eye of Patient B. Control samples from the unaffected contralateral conjunctival swabs and the respective negative control ‘air swabs’ had negligible reads (<1%) compared to the number of reads generated from the affected eye.

Table 4:

Results of culture and 16S rDNA sequencing of patient samples aligned to NCBI 16S RefSeq Database at 100% similarity.

Sample	Total mapped reads of sample (Proportion of sample reads/total reads in sequencing run)	Bacterial taxa identified on nanopore sequencing (number of reads, relative abundance)	Culture results	Most abundant bacterial taxa identified on 16S V4 MiSeq (relative abundance)
Patient A—total reads of sequencing run: 1,097,318
Affected eye (Right cornea)	613,482 (55.9%)	Serratia marcescens (497,995, 81.2%); Serratia nematodiphila (60,000, 9.78%); Klebsiella aerogenes (9,247, 1.51%); Kluyvera ascorbate (1,981, 0.32%); Cutibacterium acnes (1,942, 0.32%)	Serratia marcescens	–
Unaffected eye (Left conjunctiva)	28 (0.0026%)	Serratia marcescens (6, 21.4%); Streptococcus cristatus (3, 10.7%); Hungateiclostridium cellulolyticum (2, 7.14%); Streptococcus parauberis (2, 7.14%); Enterococcus avium (2, 7.14%)	–	–
Negative control	26 (0.0024%)	Klebsiella pneumoniae (6, 23.1%); Enterococcus avium (4, 15.4%); Serratia marcescens (4, 15.4%); Staphylococcus aureus (2, 7.69%); Kluyvera ascorbate (1, 3.85%)	–	–
Patient B—total reads of sequencing run: 273,987
Affected eye (Left cornea)	92,810 (33.9%)	Bacillus subtilis (33,188, 35.8%); Staphylococcus caprae (5,875, 6.3%); Staphylococcus saccharolyticus (5,206, 5.61%); Aggregatibacter segnis (4,109, 4.43%); Cutibacterium acnes (2,998, 3.23%)	No growth in culture after 5 days	Bacillus (13.9%); Dialister (8.6%); Actinobacter (6.9%); Rubrobacter (5.5%); Staphylococcus (4.3%)
Unaffected eye (Right conjunctiva)	1,301 (0.45%)	Snodgrassella alvi (935, 71.9%); Escherichia fergusonii (48, 3.69%); Anoxybacillus flavithermus (35, 2.69%); Eikenella corrodens (24, 1.84%); Thermicanus aegyptius (20, 1.54%)	–	–
Negative control	6 (0.0022%)	Klebsiella pneumoniae (3, 50%); Staphylococcus aureus (2, 33.3%); Enterococcus avium (1, 16.7%)	–	–

DOI: 10.7717/peerj.10778/table-4

Discussion

We have developed a proof-of-concept study optimising the sample collection method and downstream bioinformatics pipeline for full-length 16S rRNA gene identification by nanopore sequencing in the setting of microbial keratitis.

The use of different swabs and collection methods for microbial keratitis has a direct effect on the DNA yield—the Sugi^® Eyespear and Isohelix^™ SK-2S swabs provided the highest DNA concentration. However, the ratio of host to microbial DNA recovery is similar across all collection methods. Differences in the DNA yield could be explained by the absorption efficacy of the swabs (Bruijns, Tiggelaar & Gardeniers, 2018). The absorption capacity of the swab materials is dependent upon the swab tip dimensions and morphology of the sorbent material—how tightly wound the sorbent material is to the shaft (Bruijns, Tiggelaar & Gardeniers, 2018). The Sugi^® Eyespear swab, primarily designed for use in ophthalmic theatres due to its high tensile strength in absorbing fluids, has been proven to be effective in recovering DNA from corneal tissue in our study. Previous studies have also demonstrated that DNA recovery is inversely proportional to the fiber density on the swabs (Brownlow, Dagnall & Ames, 2012; Verdon, Mitchell & van Oorschot, 2014). Both Sugi^® Eyespear and Isohelix^™ SK-2S have swab tips made of cellulose fiber, which have high DNA-binding capacity (Su & Comeau, 1999).

Choice of primers affects the amplification efficiency of the 16S rRNA gene region, with primers that amplify the entire 16S rRNA gene spanning the V1-9 variable regions (27F and 1492R) providing better classification of reads compared to primers that only amplify a portion of the 16S rRNA region, as shown in studies by Winand et al. (2020) and Nygaard et al. (2020). Therefore, we have used the 27F and 1492R primer pairs provided in the commercially available ONT Rapid 16S barcoding kit. Other preliminary studies have also shown that sequencing the whole rrn operon (~4,300 bp), which includes the 16S rRNA gene, internal transcribed spacer (ITS) region, and the 23S rRNA gene, may provide better taxonomic resolution (Cuscó et al., 2019). However, as there is a lack of updated and curated rrn operon databases, users will need to retrieve and compile whole ribosomal operon reference database for their specific usage (Benítez-Páez & Sanz, 2017), whereas, curated or updated 16S rRNA reference databases are more readily available (RefSeq, 2020; Quast et al., 2013).

Another factor influencing the quality of nanopore sequencing is the choice of basecalling software (Rang, Kloosterman & De Ridder, 2018). We used the Guppy ‘flip-flop’ high accuracy model for basecalling, based on Wick et al’s study showing that the model, which utilises the recurrent neural network algorithm, performed better than the other basecalling programs (Albacore, Scrappie and Flappie) (Wick, Judd & Holt, 2019). Using Minimap2 (Li, 2018), we aligned our nanopore sequencing reads against the three different publicly available databases, NCBI 16S RefSeq, RDP and SILVA. Performance matrix comparison of thirteen different classification tools by Urban and colleagues revealed that Minimap2 provided robust alignments that were closely aligned to their mock community taxa (Urban et al., 2020). However, similar to the challenges encountered by Urban et al. (2020), we have had issues of high memory usage on Minimap2, which necessitated a reduction in the number bases loaded into memory to process in the query batch (command -K 100M). By comparing against a defined mock community, we observed differences in the taxonomic assignments between the databases—with the NCBI 16S RefSeq database clustered at 100% providing the most accurate assignments, which could be attributed to the differences in the database size and sequence validation steps (Balvočiūtė & Huson, 2017; Park & Won, 2018). The NCBI 16S RefSeq database is manually curated and near full length 16S sequences are preferentially selected (RefSeq, 2020). In our benchmarking steps, we have used the CD-HIT program (Li & Godzik, 2006), which employs heuristic greedy incremental clustering algorithm to cluster the reference sequences of the databases into 97%, 99% and 100% similarity to approximate taxonomic assignments. Schmidt, Matias Rodrigues & Von Mering (2015) assessed different clustering programs (hierarchical and heuristic algorithms) and observed that the CD-HIT program was robust, computationally efficient and provided reproducible clusterings. In our study, we have demonstrated that the 100% identity threshold provided a more optimal OTU assignment compared to 97% or 99% identity thresholds, which is consistent with previously published studies (Edgar, 2018; Mysara et al., 2017).

Nanopore sequencing reads were in concordance with current gold standard clinical microbiology culture techniques in Patient A. In the case of Patient B where cultures were negative, in the setting of previous antibiotic use, the nanopore sequencing result was in agreement with Illumina short-read sequencing suggesting an identification of a putative organism in the context of a false negative culture. The human ocular surface is paucibacterial. The conjunctival microbiome predominantly consists of Corynebacteria, Propionibacteria, Staphylococcus, and Streptococcus—with ‘approximately 1 bacterium for every 20 human conjunctival epithelial cell collected on conjunctival swab’ (Doan et al., 2016a). It is likely that the presence of Serratia marcescens and Bacillus subtilis at such high relative abundance in the patient samples, with correlating clinical signs and symptoms of infective keratitis, would constitute a positive test. A major challenge in applying high-throughput sequencing in clinical practice is distinguishing between true polymicrobial keratitis, ocular commensal or contaminant (Ung et al., 2020). Hence, we have taken meticulous steps to account for any potential contamination in our study—swabs of the unaffected contralateral eye and negative control swabs at the same time point and clinical environment of the patient had been taken, processed and sequenced in the same manner as the swabs of the affected eye. These negative control swabs had significantly fewer reads, less than 1% of the reads from the affected eye.

Another challenge in using full-length 16S rRNA sequencing is the difficulty in differentiating species and subspecies strains in certain bacterial genera with high sequence homology and similarity, notably the Bacillus subtilis species complex (Public Health England, 2018). This is illustrated in Patient B, whereby different taxonomic species within the Bacillus subtilis group such as Bacillus amyloliquefacians, Bacillus licheniformis, Bacillus velenzensis and Bacillus halotolerans have been assigned. Classical phenotypic tests by colonial appearance on culture, presence of ß-haemolysis, or biochemical tests to discriminate between these subspecies are unreliable. Other laboratory identification methods such as Matrix-Assisted Laser Desorption Ionisation–Time of Flight mass spectrometry to detect microbial protein composition is highly variable dependent upon endospore formation (Shu & Yang, 2017), whilst Multilocus Sequence Typing, which relies on PCR amplification and sequencing of six or seven well-conserved, housekeeping genes within the bacterial genome, is unable to provide distinct phylogenetic typing of Bacillus owing to the difficulty in designing primers for genetic sequences with high similarity. Although Pulse Field Gel Electrophoresis, which utilises endonuclease restriction enzymes, subsequent separation of DNA fragments by gel electrophoresis and staining under ultraviolet light for bands, is highly discriminatory, it is time consuming (over 30 h) and requires specialist laboratory equipment (Public Health England, 2018). Hence, Public Health England advocates initial clustering and genus identification by 16S rDNA for Bacillus, which could then be followed by more in-depth whole genome sequencing for more accurate strain characterisation (Public Health England, 2018).

Nanopore sequencing technology has the potential to provide rapid, real-time diagnosis of causative pathogens in a healthcare setting, with relatively low cost. Although in this proof of concept study, we have used 16S primers which specifically amplify bacterial DNA, non-biased deep metagenomic detection of pathogens and its antimicrobial resistance genomes from cultured clinical isolates using nanopore sequencing have previously been demonstrated (Schmidt et al., 2017; Szabó et al., 2016). Direct RNA sequencing on the nanopore platform would also enable identification of ‘live’ pathogens and host gene profiling for transcriptome signatures related to the infection (Lalitha et al., 2019). However, challenges still remain in terms of limiting background contamination (Glassing et al., 2016), reducing the error rates of sequencing and improving base-calling algorithms (Wick, Judd & Holt, 2019). Performing molecular diagnostics on ocular samples is inherently more difficult compared to other types of clinical samples, such as blood, urine or cerebrospinal fluid, as ocular samples are many magnitudes smaller in volume (Doan et al., 2016b) and are highly abundant in human cells (Lalitha et al., 2019).

This is the first study to evaluate the use of full-length 16S nanopore sequencing for detection of pathogens in microbial keratitis. We have optimised collection methods and demonstrated the bioinformatics pipeline for bacterial microbial keratitis. Our study is limited by the small sample size of patient cohort and the use of 16S rRNA primers, which specifically amplifies bacterial genome. To resolve this, larger clinical sample studies involving unbiased metagenomic sequencing are required to determine the sensitivity and specificity, as well as the cost effectiveness of nanopore sequencing.

Conclusion

We have optimised collection methods and demonstrate a novel workflow for identification of bacterial microbial keratitis using nanopore sequencing.

Supplemental Information

Pilot data comparing DNA yield of dry vs. pre-moistened swab at different time points.

Freshly enucleated porcine eyes (Sus scrofa domestica) were obtained as a by-product of the meat industry and transported to the laboratory under storage at 4 °C. Each eye was disinfected with Povidone-iodine 10% w/w for 1 min, followed by two rinses of sterile 0.9% Sodium Chloride for 1 min, and placed in an individual chamber of sterile 6-well culture plate (Sigma–Aldrich, Merck KGaA, Darmstadt, Germany). Using stereoscopic surgical loupes, a 4mm trephine punch (Acu-Punch^®, Acuderm, Fort Lauderdale, USA) was used to create a single central anterior stromal corneal lesion (with debridement of the central 4mm). Each eye was then inoculated with 20 µL of 1 × 10⁵ CFU/ml each of the mock community (Enterococcus avium, Staphylococcus aureus and Klebsiella pneumoniae). Negative control eyes were not inoculated with the mock community. The area was re-sampled with the respective swab conditions (dry swab vs. pre-moistened swab with sterile 0.9% Sodium Chloride) at two time points (30 min vs. 12 h) using Purflock® Ultra Standard (MWE Medical Wire, Corsham, UK) at room temperature, and placed immediately into a ZR BashingBead^™ Lysis Tube containing 750 µl of DNA Shield^™ (Zymo Research, Irvine, CA, USA) and stored at −80 °C until DNA extraction. DNA was extracted using ZymoBIOMICS DNA Miniprep kit (Zymo Research, Irvine, CA, USA) according to the manufacturer’s instructions. DNA concentration was determined fluorometrically using a Qubit dsDNA high-sensitivity assay (Thermo Fisher Scientific, Waltham, MA, USA).

DOI: 10.7717/peerj.10778/supp-1

Download

Comparison of the 16S rRNA databases.

References: 1. 16S RefSeq records processing and curation. https://www.ncbi.nlm.nih.gov/refseq/targetedloci/16S_process/. Accessed February 17, 2020. 2. Cole J. R., Wang Q., Fish J. A., et al. Ribosomal Database Project: Data and tools for high throughput rRNA analysis. Nucleic Acids Res. 2014;42(D1):D633. doi:10.1093/nar/gkt1244 3. Quast C., Pruesse E., Yilmaz P., et al. The SILVA ribosomal RNA gene database project: Improved data processing and web-based tools. Nucleic Acids Res. 2013;41(D1):D590. doi:10.1093/nar/gks1219

DOI: 10.7717/peerj.10778/supp-2

Download

Pre-inoculation average C_T by collection material.

Data are reported as medians and ranges, with p-values from Kruskal–Wallis tests; bold p-values are significant at p < 0.05.

DOI: 10.7717/peerj.10778/supp-3

Download

Post-inoculation average C_T and DNA concentrations by collection material.

Data are reported as medians and ranges, with p-values from Kruskal–Wallis tests; bold p -values are significant at p < 0.05. *The ordering of collection materials, sorted from largest to smallest on the mean rank of the DNA concentration.

DOI: 10.7717/peerj.10778/supp-4

Download

Comparing the effects of using NCBI 16S, RDP and SILVA databases at different levels of clustering (97%, 99% and 100% similarity) on patient samples.

DOI: 10.7717/peerj.10778/supp-5

Download

Bar graph comparing the effects of removal of unmapped, non-primary and supplementary reads on the taxonomic identification of the mock community.

DOI: 10.7717/peerj.10778/supp-11

Download

Bioinformatics workflow.

Schematic diagram of bioinformatics workflow.

DOI: 10.7717/peerj.10778/supp-12

Download

Post-inoculation DNA concentration by collection material.

Collection materials are sorted in order of the mean rank of DNA concentration.

DOI: 10.7717/peerj.10778/supp-13

Download

[1] 16S Sequencing and Analysis. 2020. 16S analysis using real-time, long-read nanopore sequencing. (accessed 21 July 2020)

[2] Achtman M, Wagner M. 2008. Microbial diversity and the genetic nature of microbial species. Nature Reviews Microbiology 6(6):431-440

[3] Akram A, Maley M, Gosbell I, Nguyen T, Chavada R. 2017. Utility of 16S rRNA PCR performed on clinical specimens in patient management. International Journal of Infectious Diseases 57:144-149

[4] Ashton PM, Nair S, Dallman T, Rubino S, Rabsch W, Mwaigwisya S, Wain J, O’Grady J. 2015. MinION nanopore sequencing identifies the position and structure of a bacterial antibiotic resistance Island. Nature 33:296-300

[5] Austin A, Lietman T, Rose-Nussbaumer J. 2017. Update on the management of infectious keratitis. Ophthalmology 124(11):1678-1689

[6] Balvočiūtė M, Huson DH. 2017. SILVA, RDP, greengenes, NCBI and OTT—how do these taxonomies compare? BMC Genomics 18(S2):114

[7] Benítez-Páez A, Sanz Y. 2017. Multi-locus and long amplicon sequencing approach to study microbial diversity at species level using the MinIONTM portable nanopore sequencer. Gigascience 6(7):1

[8] Bispo PJM, Davoudi S, Sahm ML, Ren A, Miller J, Romano J, Sobrin L, Gilmore MS. 2018. Rapid detection and identification of uveitis pathogens by qualitative multiplex real-time PCR. Investigative Opthalmology & Visual Science 59(1):582-589

[9] Brownlow RJ, Dagnall KE, Ames CE. 2012. A comparison of DNA collection and retrieval from two swab types (cotton and nylon flocked swab) when processed using three QIAGEN extraction methods. Journal of Forensic Sciences 57(3):713-717

[10] Bruijns BB, Tiggelaar RM, Gardeniers H. 2018. The extraction and recovery efficiency of pure DNA for different types of swabs. Journal of Forensic Sciences 63(5):1492-1499

[11] Caporaso JG, Kuczynski J, Stombaugh J, Bittinger K, Bushman FD, Costello EK, Fierer N, Peña AG, Goodrich JK, Gordon JI, Huttley GA, Kelley ST, Knights D, Koenig JE, Ley RE, Lozupone CA, McDonald D, Muegge BD, Pirrung M, Reeder J, Sevinsky JR, Turnbaugh PJ, Walters WA, Widmann J, Yatsunenko T, Zaneveld J, Knight R. 2010. QIIME allows analysis of high-throughput community sequencing data. Nature Methods 7(5):335-336

[12] Centers for Disease Control and Prevention. 2014. Estimated Burden of Keratitis—United States, 2010. (accessed 3 June 2019)

[13] Charalampous T, Kay GL, Richardson H, Aydin A, Baldan R, Jeanes C, Rae D, Grundy S, Turner DJ, Wain J, Leggett RM, Livermore DM, O’Grady J. 2019. Nanopore metagenomics enables rapid clinical diagnosis of bacterial lower respiratory infection. Nature Biotechnology 37(7):783-792

[14] Chiu CY, Miller SA. 2019. Clinical metagenomics. Nature 20:341-355

[15] Clarridge JE. 2004. Impact of 16S rRNA gene sequence analysis for identification of bacteria on clinical microbiology and infectious diseases. Clinical Microbiology Reviews 17(4):840-862

[16] Cole JR, Wang Q, Fish JA, Chai B, McGarrell DM, Sun Y, Brown CT, Porras-Alfaro A, Kuske CR, Tiedje JM. 2014. Ribosomal database project: data and tools for high throughput rRNA analysis. Nucleic Acids Research 42(D1):D633-D642

[17] Cuscó A, Catozzi C, Viñes J, Sanchez A, Francino O. 2019. Microbiota profiling with long amplicons using Nanopore sequencing: full-length 16S rRNA gene and the 16S-ITS-23S of the rrn operon. F1000Research 7:1755

[18] De Boer R, Peters R, Gierveld S, Schuurman T, Kooistra-Smid M, Savelkoul P. 2010. Improved detection of microbial DNA after bead-beating before DNA isolation. Journal of Microbiological Methods 80(2):209-211

[19] De Maio N, Shaw LP, Hubbard A, George S, Sanderson ND, Swann J, Wick R, Oun MA, Stubberfield E, Hoosdally SJ, Crook DW, Peto TEA, Sheppard AE, Bailey MJ, Read DS, Anjum MF, Sarah Walker A, Stoesser N. 2019. Comparison of long-read sequencing technologies in the hybrid assembly of complex bacterial genomes. Microbial Genomics 5(9):e000294

[20] Doan T, Akileswaran L, Andersen D, Johnson B, Ko N, Shrestha A, Shestopalov V, Lee CS, Lee AY, Van Gelder RN. 2016a. Paucibacterial microbiome and resident DNA virome of the healthy conjunctiva. Investigative Opthalmology & Visual Science 57(13):5116

[21] Doan T, Wilson MR, Crawford ED, Chow ED, Khan LM, Knopp KA, O’Donovan BD, Xia D, Hacker JK, Stewart JM, Gonzales JA, Acharya NR, DeRisi JL. 2016b. Illuminating uveitis: metagenomic deep sequencing identifies common and rare pathogens. Genome Medicine 8(1):90

[22] Douglas CA, Ivey KL, Papanicolas LE, Best KP, Muhlhausler BS, Rogers GB. 2020. DNA extraction approaches substantially influence the assessment of the human breast milk microbiome. Scientific Reports 10(1):1-10

[23] Edgar RC. 2018. Updating the 97% identity threshold for 16S ribosomal RNA OTUs. Bioinformatics 34(14):2371-2375

[24] Escapa IF, Chen T, Huang Y, Gajare P, Dewhirst FE, Lemon KP. 2018. New insights into human Nostril microbiome from the expanded human oral microbiome database (eHOMD): a resource for the microbiome of the human aerodigestive tract. mSystems 3(6):8

[25] Faria NR, Quick J, Claro IM, Thézé J, De Jesus JG, Giovanetti M, Kraemer MUG, Hill SC, Black A, Da Costa AC, Franco LC, Silva SP, Wu CH, Raghwani J, Cauchemez S, Du Plessis L, Verotti MP, De Oliveira WK, Carmo EH, Coelho GE, Santelli ACFS, Vinhal LC, Henriques CM, Simpson JT, Loose M, Andersen KG, Grubaugh ND, Somasekar S, Chiu CY, Muñoz-Medina JE, Gonzalez-Bonilla CR, Arias CF, Lewis-Ximenez LL, Baylis SA, Chieppe AO, Aguiar SF, Fernandes CA, Lemos PS, Nascimento BLS, Monteiro HAO, Siqueira IC, De Queiroz MG, De Souza TR, Bezerra JF, Lemos MR, Pereira GF, Loudal D, Moura LC, Dhalia R, França RF, Magalhães T, Marques ET, Jaenisch T, Wallau GL, De Lima MMC, Nascimento V, De Cerqueira EM, De Lima MMC, Mascarenhas DL, Neto JPM, Levin AS, Tozetto-Mendoza TR, Fonseca SN, Mendes-Correa MC, Milagres FP, Segurado A, Holmes EC, Rambaut A, Bedford T, Nunes MRT, Sabino EC, Alcantara LCJ, Loman NJ, Pybus OG. 2017. Establishment and cryptic transmission of Zika virus in Brazil and the Americas. Nature 546(7658):406-410

[26] Feehery GR, Yigit E, Oyola SO, Langhorst BW, Schmidt VT, Stewart FJ, Dimalanta ET, Amaral-Zettler LA, Davis T, Quail MA, Pradhan S. 2013. A method for selectively enriching microbial DNA from contaminating vertebrate host DNA. PLOS ONE 8(10):76096

[27] Fu L, Niu B, Zhu Z, Wu S, Li W. 2012. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics 28(23):3150-3152

[28] Glassing A, Dowd SE, Galandiuk S, Davis B, Chiodini RJ. 2016. Inherent bacterial DNA contamination of extraction and sequencing reagents may affect interpretation of microbiota in low bacterial biomass samples. Gut Pathogens 8(1):734

[29] Goldstein MH, Kowalski RP, Gordon YJ. 1999. Emerging fluoroquinolone resistance in bacterial keratitis: a 5-year review. Ophthalmology 106(7):1213-1318

[30] Gu W, Miller S, Chiu CY. 2019. Clinical metagenomic next-generation sequencing for pathogen detection. Annual Review of Pathology: Mechanisms of Disease 14(1):319-338

[31] Holm JB, Humphrys MS, Robinson CK, Settles ML, Ott S, Fu L, Yang H, Gajer P, He X, McComb E, Gravitt PE, Ghanem KG, Brotman RM, Ravel J. 2019. Ultrahigh-throughput multiplexing and sequencing of >500-base-pair amplicon regions on the Illumina HiSeq 2500 platform. mSystems 4(1):6

[32] Ibrahim YW, Boase DL, Cree IA. 2009. Epidemiological characteristics, predisposing factors and microbiological profiles of infectious corneal ulcers: the portsmouth corneal ulcer study. British Journal of Ophthalmology 93(10):1319-1324

[33] Johnson JS, Spakowicz DJ, Hong BY, Petersen LM, Demkowicz P, Chen L, Leopold SR, Hanson BM, Agresta HO, Gerstein M, Sodergren E, Weinstock GM. 2019. Evaluation of 16S rRNA gene sequencing for species and strain-level microbiome analysis. Nature Communications 10(1):1-11

[34] Kaye SB, Rao PG, Smith G, Scott JA, Hoyles S, Morton CE, Willoughby C, Batterbury M, Harvey G. 2003. Simplifying collection of corneal specimens in cases of suspected bacterial keratitis. Journal of Clinical Microbiology 41(7):3192-3197

[35] Kim E, Chidambaram JD, Srinivasan M, Lalitha P, Wee D, Lietman TM, Whitcher JP, Van Gelder RN. 2008. Prospective comparison of microbial culture and polymerase chain reaction in the diagnosis of corneal ulcer. American Journal of Ophthalmology 146(5):714-723.e1

[36] Kirstahler P, Bjerrum SS, Friis-Møller A, La Cour M, Aarestrup FM, Westh H, Pamp SJ. 2018. Genomics-based identification of microorganisms in human ocular body fluid. Scientific Reports 8(1):1-14

[37] Kozich JJ, Westcott SL, Baxter NT, Highlander SK, Schloss PD. 2013. Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform. Applied and Environmental Microbiology 79(17):5112-5120

[38] Lalitha P, Seitzman GD, Kotecha R, Hinterwirth A, Chen C, Zhong L, Cummings S, Lebas E, Sahoo MK, Pinsky BA, Lietman TM, Doan T. 2019. Unbiased pathogen detection and host gene profiling for conjunctivitis. Ophthalmology 126(8):1090-1094

[39] Lee AY, Akileswaran L, Tibbetts MD, Garg SJ, Van Gelder RN. 2015. Identification of torque teno virus in culture-negative endophthalmitis by representational deep DNA sequencing. Ophthalmology 122(3):524-530

[40] Leggett RM, Alcon-Giner C, Heavens D, Caim S, Brook TC, Kujawska M, Martin S, Peel N, Acford-Palmer H, Hoyles L, Clarke P, Hall LJ, Clark MD. 2020. Rapid MinION profiling of preterm microbiota and antimicrobial-resistant pathogens. Nature Microbiology 5(3):430-442

[41] Li H. 2018. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics 34(18):3094-3100

[42] Li W, Godzik A. 2006. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics 22(13):1658-1659

[43] Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. 2009. The sequence alignment/map format and SAMtools. Bioinformatics 25(16):2078-2079

[44] Li W, Jaroszewski L, Godzik A. 2002. Sequence clustering strategies improve remote homology recognitions while reducing search times. Protein Engineering, Design and Selection 15(8):643-649

[45] Lichtinger A, Yeung SN, Kim P, Amiran MD, Iovieno A, Elbaz U, Ku JYF, Wolff R, Rootman DS, Slomovic AR. 2012. Shifting trends in bacterial keratitis in Toronto: an 11-year review. Ophthalmology 119(9):1785-1790

[46] Lin A, Rhee MK, Akpek EK, Amescua G, Farid M, Garcia-Ferrer FJ, Varu DM, Musch DC, Dunn SP, Mah FS. 2019. Bacterial keratitis preferred practice pattern^®. Ophthalmology 126(1):P1-P55

[47] Marotz CA, Sanders JG, Zuniga C, Zaramela LS, Knight R, Zengler K. 2018. Improving saliva shotgun metagenomics by chemical host DNA depletion. Microbiome 6(1):42

[48] Maurer FP, Christner M, Hentschke M, Rohde H. 2017. Advances in rapid identification and susceptibility testing of bacteria in the clinical microbiology laboratory: implications for patient care and antimicrobial stewardship programs. Infectious Disease Reports 9(1):6839

[49] Musa FU, Tailor R, Gao A, Hutley E, Rauz S, Scott RAH. 2010. Contact lens-related microbial keratitis in deployed British military personnel. British Journal of Ophthalmology 94(8):988-993

[50] Mysara M, Vandamme P, Props R, Kerckhof F-MM, Leys N, Boon N, Raes J, Monsieurs P. 2017. Reconciliation between operational taxonomic units and species boundaries. FEMS Microbiology Ecology 93(4):431

[51] Nygaard AB, Tunsjø HS, Meisal R, Charnock C. 2020. A preliminary study on the potential of Nanopore MinION and Illumina MiSeq 16S rRNA gene sequencing to characterize building-dust microbiomes. Scientific Reports 10(1):3209

[52] Ojo-Okunola A, Claassen-Weitz S, Mwaikono KS, Gardner-Lubbe S, Zar HJ, Nicol MP, Du Toit E. 2020. The influence of DNA extraction and lipid removal on human milk bacterial profiles. Methods Protoc 3(2):39

[53] PacBio Sequel Systems. 2020. Sequence with confidence. (accessed 8 September 2020)

[54] Park S-C, Won S. 2018. Evaluation of 16S rRNA databases for taxonomic assignments using a Mock community. Genomics & Informatics 16(4):e24

[55] Public Health England. 2018. UK standards for microbiology investigations identification of bacillus specie. England: Public Health England.

[56] Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, Peplies J, Glöckner FO. 2013. The SILVA ribosomal RNA gene database project: Improved data processing and web-based tools. Nucleic Acids Research 41(D1):D590-D596

[57] Quick J, Grubaugh ND, Pullan ST, Claro IM, Smith AD, Gangavarapu K, Oliveira G, Robles-Sikisaka R, Rogers TF, Beutler NA, Burton DR, Lewis-Ximenez LL, De Jesus JG, Giovanetti M, Hill SC, Black A, Bedford T, Carroll MW, Nunes M, Alcantara LC, Sabino EC, Baylis SA, Faria NR, Loose M, Simpson JT, Pybus OG, Andersen KG, Loman NJ. 2017. Multiplex PCR method for MinION and Illumina sequencing of Zika and other virus genomes directly from clinical samples. Nature Protocols 12(6):1261-1266

[58] Quick J, Loman NJ, Duraffour S, Simpson JT, Severi E, Cowley L, Bore JA, Koundouno R, Dudas G, Mikhail A, Ouédraogo N, Afrough B, Bah A, Baum JHJ, Becker-Ziaja B, Boettcher JP, Cabeza-Cabrerizo M, Camino-Sánchez Á, Carter LL, Doerrbecker J, Enkirch T, Garciá-Dorival I, Hetzelt N, Hinzmann J, Holm T, Kafetzopoulou LE, Koropogui M, Kosgey A, Kuisma E, Logue CH, Mazzarelli A, Meisel S, Mertens M, Michel J, Ngabo D, Nitzsche K, Pallasch E, Patrono LV, Portmann J, Repits JG, Rickett NY, Sachse A, Singethan K, Vitoriano I, Yemanaberhan RL, Zekeng EG, Racine T, Bello A, Sall AA, Faye OO, Faye OO, Magassouba N, Williams VCJC, Amburgey V, Winona L, Davis E, Gerlach J, Washington F, Monteil V, Jourdain M, Bererd M, Camara AA, Somlare H, Camara AA, Gerard M, Bado G, Baillet B, Delaune D, Nebie KY, Diarra A, Savane Y, Pallawo RB, Gutierrez GJ, Milhano N, Roger I, Williams VCJC, Yattara F, Lewandowski K, Taylor J, Rachwal P, Turner DJ, Pollakis G, Hiscox JA, Matthews DA, O’Shea MK, Johnston AMD, Wilson D, Hutley E, Smit E, Di Caro A, Wolfel R, Stoecker K, Fleischmann E, Gabriel M, Weller SA, Koivogui L, Diallo B, Keita S, Rambaut A, Formenty P, Gunther S, Carroll MW. 2016. Real-time, portable genome sequencing for Ebola surveillance. Nature 530(7589):228-232

[59] Raina V, Nayak T, Ray L, Kumari K, Suar M. 2019. A polyphasic taxonomic approach for designation and description of novel microbial species. In: Microbial Diversity in the Genomic Era. Amsterdam: Elsevier. 137-152

[60] Rang FJ, Kloosterman WP, De Ridder J. 2018. From squiggle to basepair: computational approaches for improving nanopore sequencing read accuracy. Genome Biology 19(1):90

[61] RefSeq. 2020. 16S RefSeq records processing and curation. (accessed 17 February 2020)

[62] Sanderson ND, Street TL, Foster D, Swann J, Atkins BL, Brent AJ, McNally MA, Oakley S, Taylor A, Peto TEA, Crook DW, Eyre DW. 2018. Real-time analysis of nanopore-based metagenomic sequencing from infected orthopaedic devices. BMC Genomics 19(1):714

[63] Santos A, Van Aerle R, Barrientos L, Martinez-Urtaza J. 2020. Computational methods for 16S metabarcoding studies using Nanopore sequencing data. Computational and Structural Biotechnology Journal 18:296-305

[64] Schmidt TSBB, Matias Rodrigues JF, Von Mering C. 2015. Limits to robustness and reproducibility in the demarcation of operational taxonomic units. Environmental Microbiology 17(5):1689-1706

[65] Schmidt K, Mwaigwisya S, Crossman LC, Doumith M, Munroe D, Pires C, Khan M, Woodford A, Saunders N, Wain NJ, O’Grady J, Livermore DM. 2017. Identification of bacterial pathogens and antimicrobial resistance directly from clinical urines by nanopore-based metagenomic sequencing. Journal of Antimicrobial Chemotherapy 72(1):104-114

[66] Shu LJ, Yang YL. 2017. Bacillus classification based on matrix-assisted laser desorption ionization time-of-flight mass spectrometry—effects of culture conditions. Scientific Reports 7(1):1

[67] Su X, Comeau AM. 1999. Cellulose as a matrix for nucleic acid purification. Analytical Biochemistry 267(2):415-418

[68] Sugita S, Ogawa M, Shimizu N, Morio T, Ohguro N, Nakai K, Maruyama K, Nagata K, Takeda A, Usui Y, Sonoda KH, Takeuchi M, Mochizuki M. 2013. Use of a comprehensive polymerase chain reaction system for diagnosis of ocular infectious diseases. Ophthalmology 120(9):1761-1768

[69] Sui H, Weil AA, Nuwagira E, Qadri F, Ryan ET, Mezzari MP, Phipatanakul W, Lai PS. 2020. Impact of DNA extraction method on variation in human and built environment microbial community and functional profiles assessed by shotgun metagenomics sequencing. Frontiers in Microbiology 11:953

[70] Szabó M, Nagy T, Wilk T, Farkas T, Hegyi A, Olasz F, Kiss J. 2016. Characterization of two multidrug-resistant IncA/C plasmids from the, 1960s, by using the MinION sequencer device. Antimicrob Agents and Chemotherapy 60(11):6780-6786

[71] Tan SZ, Walkden A, Au L, Fullwood C, Hamilton A, Qamruddin A, Armstrong M, Brahma AK, Carley F. 2017. Twelve-year analysis of microbial keratitis trends at a UK tertiary hospital. Eye 31(8):1229-1236

[72] Tananuvat N, Salakthuantee K, Vanittanakom N, Pongpom M, Ausayakhun S. 2012. Prospective comparison between conventional microbial work-up vs PCR in the diagnosis of fungal keratitis. Eye 26(10):1337-1343

[73] Ting DSJ, Settle C, Morgan SJ, Baylis O, Ghosh S. 2018. A 10-year analysis of microbiological profiles of microbial keratitis: the North East England Study. Eye 32(8):1416-1417

[74] Ung L, Bispo PJM, Doan T, Van Gelder RN, Gilmore MS, Lietman T, Margolis TP, Zegans ME, Lee CS, Chodosh J. 2020. Clinical metagenomics for infectious corneal ulcers: rags to riches? Ocular Surface 18(1):1-12

[75] Urban L, Holzer A, Baronas JJ, Hall M, Braeuninger-Weimer P, Scherm MJ, Kunz DJ, Perera SN, Martin-Herranz DE, Tipper ET, Salter SJ, Stammnitz MR. 2020. Freshwater monitoring by nanopore sequencing. bioRxiv

[76] Verdon TJ, Mitchell RJ, van Oorschot RAH. 2014. Swabs as DNA collection devices for sampling different biological materials from different substrates. Journal of Forensic Sciences 59(4):1080-1089

[77] Wick RR, Judd LM, Holt KE. 2019. Performance of neural network basecalling tools for Oxford Nanopore sequencing. Genome Biology 20(1):129