Introduction
Arthropods can act as biological vectors that transmit infectious agents and thereby cause diseases in humans and animals. After mosquitoes, ticks are the most common arthropod vector for viruses, bacteria, and other parasites causing different vector-borne diseases. Ticks from different parts of the world have been shown to carry viruses belonging to, for example, the Bunyaviridae, Flaviviridae, Asfaviridae and Orthomyxoviridae families. Additionally, ticks carry many non-pathogenic microbes, and some of these microbes have formed symbiotic relationships with their hosts and transmit by vertical transmission [2–5].
The genus Rhipicephalus, belonging to the Ixodidae family (family of hard ticks), is widely distributed worldwide and is considered a potential vector of several emerging pathogens. Different studies have shown that viral pathogens, such as Thogoto viruses, Wad Medani virus, Nairobi sheep disease virus, Crimean-Congo hemorrhagic fever virus, African swine fever virus and Tick-borne encephalitis virus [1,6–8], can be found in Rhipicephalus ticks. Borrelia, Anaplasma, Rickettsia, Ehrlichia, and Babesia are a few examples of the pathogenic bacteria that have been identified in these types of ticks.
With the advent of metagenomic approaches, the limitations of culture-based methods have been overcome and have enabled the characterization of the entire microbiota associated with the host. Numerous studies have used metagenomics to explore viral communities in different arthropod species and have in these identified viruses associated with a broad range of animals, plants and insects. Many of the identified viruses have been novel, for example, highly divergent viruses belonging to nairoviruses and phleboviruses were reported in Amblyomma and Ixodes ticks from the USA [11–13] and in Rhipicephalus ticks.
Previous reports indicate that many tick-borne disease-causing agents exist in Mozambique, such as Nairobi sheep disease virus, Theleiria and Anaplasma. Since ticks are known vectors for potentially pathogenic and non-pathogenic viruses and the virome of ticks is understudied in this region, the current study used viral metagenomics to identify viruses associated with Rhipicephalus ticks collected in the Zambezi Valley of Mozambique.
Tick collection and identification
Fifty-one adult ticks were collected in October 2014 from small ruminants in the Cuacua village of the Zambezia province in central Mozambique (Geographical coordinates S 17°48.043ʹ E 035°24.730ʹ). Tick collection was carried out on private land with the permission from the landowner and the local farmers. The collected ticks were stored in RNAlater (Invitrogen), transported to the laboratory, and stored at −80°C until further use. Individual ticks were identified morphologically to determine the genus; however, the sex of the ticks was not determined. The ticks were surface-sterilized with 95% ethanol and rinsed twice in water before nucleic acid extraction.
Nucleic acid extraction
Each tick pool (up to 3 ticks/pool) was mechanically homogenized using the Tissuelyser II (Qiagen) for 30 cycles/sec with 1 ml of TRIzol LS reagent (Invitrogen) and two 5 mm stainless steel beads. The supernatant was collected after centrifugation at 13,000 x g for 10 min at 4°C. Total RNA was extracted from the homogenate according to the manufacturer’s instructions, and the RNA pellet was dissolved in 40 µl of nuclease-free water. After extracting RNA from the aqueous phase of the TRIzol homogenate, the remaining slurry containing the interphase/organic phase was saved at −80°C. Next, 5 µl of RNA from each tube was combined into a single pool, and the RNA was treated with DNase from the RNase-free DNase set (Qiagen) and purified with the RNeasy MinElute Cleanup kit (Qiagen). Ribosomal RNA was depleted using the RiboZero kit (RiboZero Gold Human/Mouse/Rat, Illumina) according to the manufacturer’s protocol, and the RNA was again concentrated using the RNeasy MinElute Cleanup kit (Qiagen).
cDNA labeling, amplification and sequencing
First strand cDNA synthesis, followed by labeling of cDNA and random amplification, was performed with 10 µl of RNA, as described by Cholleti et al. 2016. The final amplified product were submitted to the National Genomics Centre (SciLife Lab, Uppsala, Sweden) for library preparation and sequencing. High-throughput sequencing was performed with the Ion Torrent PGM sequencing platform using an Ion 318TM chip (v2) and 400 bp read length chemistry. The raw sequencing data are accessible through NCBI’s Sequence Read Archive, SRA: SRP109282.
Sequence processing and taxonomy assignment
The sequences produced from the Ion Torrent platform were quality checked by filtering reads with low quality scores (Q < 20), removing exact duplicate reads and trimming the ends of the reads with PRINSEQ. Good quality reads were mapped to different tick genomes Ixodes scapularis (GenBank assembly accession no. GCF_000208615.1) and Rhipicephalus sanguineus complete mitochondrial genome (GenBank accession no. NC_002074.1), using the default settings of Bowtie2. Unmapped reads were subjected to BLAST searches querying against NCBI nucleotide (nt) and protein sequence (nr) databases with an e-value cutoff of 1e-03. Reads that were classified as viruses were further assembled to generate longer sequences using the de novo assembler in CodonCode Aligner 6.02 (CodonCode Corporation) and SeqMan 11.2.1 (DNASTAR).
Confirmation of sequences and recovery of genomic ends
The viral contigs were used as a reference for designing specific primers to confirm and analyze the sequences of terminal ends, performed by the rapid amplification of cDNA ends (RACE). All of the primers used in this study are listed in Supplementary Table S2. PCR amplification was performed at the following conditions: 95°C for 10 min, followed by 35 cycles of 95°C for 30 sec, 55–60°C for 30 sec, 72°C for 1 min and finally holding at 72°C for 7 min. RACE products were purified with a GeneJet PCR purification kit (ThermoFisher Scientific) and sequenced at Macrogen Europe (Macrogen Inc.).
Phylogenetic analysis
The phylogenetic tree was constructed using amino acid sequences of selected viruses. For this, viral sequences were downloaded from GenBank and multiple sequence alignment (MSA) was performed using ClustalW with default parameters. The gaps and missing data were eliminated prior to phylogenetic analysis and the tree was generated using maximum likelihood (ML) method using MEGA 7 (version 7.0.26). The statistical significance of the tree topologies was evaluated by 1000 bootstrap replicates.
Orthomyxoviridae family
In total, 4008 reads (98% of all viral reads) were classified within the Orthomyxoviridae family (Figure 2). At the amino acid level, the reads showed closest similarity to members of the Quaranjavirus genus, such as the Wellfleet Bay virus (WFBV), Tjuloc vius (TLV), Quaranfil virus (QRFV), Johnstol Atoll virus (JAV) and unclassified quaranjaviruses. No significant similarities were observed at the nucleotide level. Assembly of these reads generated 6 contigs with high sequencing depth and lengths ranging from 218 to 2353 nucleotides (nt) (Table 3). BLAST searches of these contigs revealed that they show similarities to the nearly full-length segments of Wellfleet Bay virus and Tjuloc virus ORFs: Hemagglutinin (HA), Nucleoprotein (NP), Polymerase basic 1 protein (PB1), Polymerase basic 2 protein (PB2), and Polymerase Acidic protein (PA) (Figure 2). However, the identified ORFs exhibit high genetic diversity to known quaranjavirus genomes, with an amino acid identity of only 32–55%, indicating that these represent novel viral sequences belonging to the Quaranjavirus genus. One of the contigs (contig6, 218 nt long) showed an amino acid identity of 55% to the NP protein of WFBV; however, it did not assemble with the other NP-classified reads in the data set. The BLAST hit results for this contig included a hypothetical protein of the Tjuloc virus and Quaranfil virus with an identity of 32%. This contig could, therefore, be a matrix protein or another viral protein divergent from previously known quranjaviral segments. The 3′ UTR sequences of the HA and NP sequences were recovered using RACE analysis. The 3′ UTR could not be recovered from the other segments, nor could any 5′ UTR be recovered.
Evolutionary relationships was analyzed using PB1 as it is considered the most conserved of the orthomyxovirus genes. The phylogenetic analysis of the PB1 protein sequences, including Influenza A, B, C, Thogotovirus and Quaranjavirus, showed that PB1 from the novel quaranjavirus, identified in this study, formed a separate branch in the quaranjavirus group and was most closely related to WFBV (with 100% bootstrap support) (Figure 3).
Other viral families
Sequences similar to Parvoviridae, Retroviridae, Coronaviridae, unclassified dsRNA viruses and environmental samples were also identified in the ticks (Table 2). Assembly of the sequences belonging to the Parvoviridae family generated 5 contigs ranging between 354–870 nt. These contigs showed an amino acid identity (26–62%) to the non-structural protein 1 of Cherax quadricarinatus densovirus (accession no. YP009134732.1), Lonestar tick densovirus 1 (ASU47551.1) and Ambidensovirus CaaDV1 (AR146481.1). A single contig (448 nt) was generated from the assembly of the coronavirus reads. This contig showed the closest identity to ORF1a of Duck coronavirus (AKF17723.1); however, the similarity at the amino acid level was only 24% (Supplementary Table S1). Sequences classified as Retroviridae family showed low sequence identity (25–35%) to known viral proteins (polymerase of Feline foamy virus, gag protein of human immune deficiency virus 1 and Simian immune deficiency virus) and it was not possible to assemble them into longer contigs, possibly due to the absence of overlaps or high sequence diversity.
Discussion
In this study, a metagenomic approach was used to identify viruses associated with Rhipicephalus ticks in the Zambezi Valley of Mozambique where the surveillance of tick-associated pathogens is limited. To the best of our knowledge, this is the first study where high-throughput sequencing have been used to explore the viruses present in ticks from Mozambique. The majority of the viral sequences belonged to the Orthomyxoviride family and was therefore studied in more detail; however, other viral families such as Parvoviridae and Coronaviridae were also identified. Many of these viral sequences showed very low sequence identity to known viruses, indicating that they most likely correspond to novel viruses.
The Orthomyxoviridae family consists of six recognized genera: Influenza A, B, and C, Isavirus, Thogotovirus, Quaranjavirus and the newly proposed Influenza D. Viruses in this family infect a broad range of hosts including humans, birds, swine, fish and arthropods, and transmission occurs by different routes such as fecal-oral, water, air, and direct contact. The genus Quaranjavirus was approved by ICTV in 2011 and included two new species, Quaranfil virus (QRFV) and Johnston Atoll virus (JAV), as well as a tentative member, Lake Chad virus (LKCV). These viruses were isolated in the 1950s and 1960s but were only recently recognized as quaranjaviruses. They appear to be mainly associated with ticks (QRFV and JAV) and birds (LKCV) [21–24]. In the present study, novel viral sequences showing sequence identity to quaranjaviruses were characterized from Rhipicephalus spp. ticks in Mozambique using high-throughput sequencing. These sequences are highly divergent from all known quaranjaviruses, with approximately 50% amino acid identity. BLASTx search analyses show that these sequences are most closely related to WFBV and TLV. In the phylogeny, the PB1 protein sequence from this study clustered with quaranjaviruses and showed closest relationship to WFBV. At this point, we are not sure if the sequences found in this study belong to one, two or more viruses. Future studies are needed to show this. The mechanism of transmission is uncertain, but it is speculated that soft and hard ticks are the primary vectors. Several studies have identified quaranjaviruses in other arthropod vectors, such as spiders, mosquitoes and horseflies. Little is known about the molecular pathogenesis of quaranjaviruses. The viruses in this genus are globally distributed throughout the Middle East, Africa and Pacific regions.
Parvoviruses are associated with a variety of chronic diseases in humans and animals. However, these viral sequences have the ability to integrate into the chromosomal DNA of a wide range of hosts and may transmit to offspring. The parvovirus sequences identified in the current study had closest similarity to non-structural protein 1 of different densoviruses, which were shown previously to integrate into tick genomes such as in Ixodes, Amblyomma and Rhipicephalus genera. We assume that these sequences are either endogenous viral elements of the tick genome or DNA from densoviruses remaining in the RNA due to incomplete DNase treatment during the sample preparation step.
Ticks can be co-infected with different bacteria and viruses and may thus be involved in co-transmission of different pathogens, which may enhance disease severity as reported previously. In contrast, endosymbionts may interact with arthropod-borne pathogens and play a crucial role in the fitness of the arthropod vector. For instance, Coxiella endosymbionts of Amblyomma ticks impair the transmission of Ehrlichia chaffiensis. Further, in another important arthropod vector, the mosquito, the Wolbachia symbiont can limit the replication of several pathogenic viruses such as Dengue virus, yellow fever virus, West Nile virus and different protozoans such as Plasmodium [32–34]. It has also been demonstrated that pathogens may influence the microbiome of arthropod vectors by modulating immune responses for its survival and infection. Thus, characterizing the microbiome of ticks and other arthropods is just the first step to understanding the complex interactions between the different microorganisms that reside within the vector.
In summary, we have identified a novel and highly divergent orthomyxovirus in Rhipicephalus spp. ticks in Mozambique. This study constitutes the initial step towards a more comprehensive understanding of the viruses circulating in ticks. Additional investigations are warranted on the functional and ecological aspects of the identified orthomyxovirus. This increased understanding of the tick-borne viruses highlights the role in emergence and transmission of disease-causing agents.