Introduction
Coronaviruses are positive-sense, single-stranded RNA viruses found in humans and a wide variety of animals. In humans, coronaviruses are mainly causes of respiratory tract infections, whereas in animals, they can cause respiratory, enteric, hepatic and neurological diseases of varying severity. As in other RNA viruses, the infidelity of RNA-dependent RNA polymerase results in high mutation rates. In addition, coronaviruses possess a unique mechanism of viral replication that leads to high frequencies of recombination, as well as the largest genome size (26.4 to 31.7 kb) among all known RNA viruses that gives this family of viruses extra plasticity. All these factors have allowed the coronaviruses to adapt to new hosts and ecological niches. Traditionally, coronaviruses were classified into groups 1, 2 and 3, with groups 1 and 2 consisting of mammalian coronaviruses and group 3 being avian coronaviruses. In 2008, the Coronavirus Study Group of the International Committee for Taxonomy of Viruses proposed three genera, Alphacoronavirus, Betacoronavirus and Gamma-coronavirus, to replace these three traditional groups of coronaviruses (http://talk.ictvonline.org/cfs-filesystemfile.ashx/__key/CommunityServer.Components.PostAttachments/00.00.00.06.26/2008.085_2D00_122V.01.Coronaviridae.pdf).
The SARS epidemic that originated from southern China in 2003 has boosted interest in all areas of coronavirus research, most notably, coronavirus biodiversity and genomics. Before 2003, there were only 10 coronaviruses with complete genomes available, including only two human coronaviruses, human coronavirus 229E (HCoV-229E) and human coronavirus OC43 (HCoV-OC43). These two human coronaviruses were discovered in the 1960s, with HCoV-229E being a group 1 coronavirus and HCoV-OC43 a group 2 coronavirus [17, 34]. After the SARS epidemic, up to December 2008, 16 novel coronaviruses were discovered and their complete genomes sequenced. Among these 16 previously unrecognized coronaviruses were two more human coronaviruses, human coronavirus NL63 (HCoV-NL63) and human coronavirus HKU1 (HCoV-HKU1), ten other mammalian coronaviruses and four avian coronaviruses. HCoV-NL63 is a group 1 coronavirus whereas HCoV-HKU1 is a group 2 coronavirus. In just a few years after their discoveries, numerous reports throughout the world had described the presence of HCoV-NL63 and HCoV-HKU1 in patients with respiratory infections in their corresponding countries. In this article, we reviewed our current understanding of the classification, virology, epidemiology, clinical diseases, laboratory diagnosis, treatment and prevention of HCoV-HKU1.
Classification and virology
In the current system of classification in which group 2 coronaviruses are classified into group 2a, group 2b and the recently proposed groups 2c and 2d, HCoV-HKU1 is a group 2a coronavirus. Phylogenetically, no close relative with less than 10% nucleotide difference in any of the genes is present (Figure 1)
At the moment, 22 HCoV-HKU1 genomes have been sequenced. The sizes of the genomes of HCoV-HKU1 range from 29,295 to 30,097 nucleotides. The G + C contents of HCoV-HKU1 genomes are 32%, the lowest among all known coronaviruses with complete genome sequences available. The genome organization of HCoV-HKU1 is similar to that of other coronaviruses, with the characteristic gene order 5′-replicase ORF1ab, spike (S), envelope (E), membrane (M), nucleocapsid (N)-3′ (Figure 2). Additional genomic features of HCoV-HKU1 that are similar to all other group 2a coronaviruses include a putative transcription regulatory sequence (TRS) of CUAAAC, two papain-like protease domains (PL1pro and PL2pro) in nsp3 of ORF1ab, a haemagglutinin esterase (HE) gene between ORF1ab and S, and an internal ribosomal entry site upstream to the initiation codon of E. Among all coronaviruses with complete genomes available, HCoV-HKU1 shows the most extreme codon usage bias, attributed most importantly to severe cytosine deamination that has led to C → U changes.
The ORF1ab replicase polyprotein is putatively cleaved by its papain-like proteases and 3C-like protease (3CLpro) into 16 nonstructural proteins (nsp1 to nsp16) homologous to the corresponding ones in other coronaviruses. Analysis of the putative cleavage sites of the 3CLpro revealed a unique putative cleavage site at the junction between nsp13 and nsp14, which was also recognized in HCoV-NL63. In addition to this unique putative cleavage site, all HCoV-HKU1 genomes contain tandem copies of a 30-base acidic tandem repeat (ATR) (variable numbers of perfect repeats of NDDEDVVTGD in the amino acid sequence followed by variable numbers of imperfect repeats) at the N-terminus of nsp3 inside the acidic domain upstream of PL1pro. This phenomenon was also observed in the two strains of HCoV-HKU1 found in France with this region of the genome characterized. Although the function of the ATR is not known, these tandem copies of ATR have made the nsp3 of HCoV-HKU1 the longest nsp3 among all coronaviruses with complete genomes available.
At least three genotypes, genotype A, B and C, of HCoV-HKU1, with inter-genotypic homologous recombination, have been observed. When the pol, S and N genes of clinical strains HCoV-HKU1 were analyzed, it was observed that the strains fell into two clusters, designated genotypes A and B. Interestingly, there were a few strains in which the sequences of pol genes were clustered with genotype A but those of S and N genes were clustered with genotype B. Subsequent sequencing and analysis of 22 complete HCoV-HKU1 genomes found that there were three genotypes of HCoV-HKU1 and significant numbers of homologous recombination events have occurred among the three genotypes. The most notable example was in a stretch of 29 bases at the 3′ end of nsp16, in which recombination between genotype A and genotype B has led to the generation of genotype C. This represented the first example of homologous recombination in human coronavirus, and was also the first study to describe a distribution of natural recombination spots in the entire genome of field isolates of a coronavirus. Although no complete genome sequence is available for HCoV-HKU1 strains found outside Hong Kong, sequences of fragments of pol, S and N of the strains suggested that all three genotypes are probably distributed globally. Analysis of a single gene is not sufficient for genotyping of HCoV-HKU1, but would require sequencing of at least two gene loci, one from nsp10 to nsp16, such as pol or helicase, and another from HE to N, such as S or N.
Although HCoV-HKU1 is still not cultivable using a wide variety of cell lines, neuron-glia culture and intracerebral inoculation of suckling mice, the biogenesis, subcellular localization and intracellular trafficking of S in HCoV-HKU1 as well as its interaction with major histocompatibility complex class I C molecule (HLA-C) were characterized recently [4, 6]. Results of these studies confirmed that S of HCoV-HKU1 was N-glycosylated in the endoplasmic reticulum (ER) with high-mannose N-glycans, and it was first distributed in the ER and Golgi, subsequently also detected in vesicles throughout the cytoplasm, and finally on the cell surface. Cleavage of S into S1 (Endo H deglycosylation resistant) and S2 (Endo H deglycosylation sensitive) that was inhibited by the furin or furin-like enzyme inhibitor, peptidyl chloromethylketone was also observed, confirming the bioinformatics prediction of the presence of cleavage site between S1 and S2. Using HCoV-HKU1 S pseudotyped virus, human alveolar epithelial A549 cells were shown to be the most susceptible cell line. Using an A549 cDNA expression library transduced into the non-permissive, baby hamster kidney cell line BHK-21 for detecting proteins that bind HCoV-HKU1 S1-600 glycoprotein, independent clones with inserts encoding HLA-C were fished out. Further experiments also suggested that HLA-C is involved in the attachment of HCoV-HKU1 to A549 cells and is a potential candidate to facilitate cell entry.
Epidemiology
In our studies on animal coronaviruses, no HCoV-HKU1 was detected in screening more than 10,000 animal specimens in a variety of mammalian and avian species [23, 40, 41, 44]. The dn/ds ratios of all ORFs in the genomes of HCoV-HKU1 are low. Therefore, HCoV-HKU1 is stably evolving in humans, probably the only known reservoir. Similar to other respiratory viruses, HCoV-HKU1 is presumably transmitted through exchange of respiratory secretions. The incidence of HCoV-HKU1 infections is the highest in winter. Similar to other human coronaviruses, HCoV-HKU1 infections have been reported globally (Table 1) [1,3,8,9,11–15,20,21,25,29,30,32,35,36,42]. From the studies with the incidence of all four human coronaviruses examined (Table 1), the median (range) incidence of HCoV-HKU1 was 0.9 (0 – 4.4) %, with no significant difference to those of HCoV-OC43 [1.9 (0 – 6.3) %], HCoV-229E [0.4 (0 – 6.9) %] and HCoV-NL63 [1.1 (0 – 8.0) %]. Moreover, different human coronaviruses were more prevalent than the others in different studies.
The seroprevalence of HCoV-HKU1 antibody varied widely in different studies that used different antigens and methodologies for antibody detection. In a recent study on seroepidemiology of HCoV-HKU1 using Escherichia coli BL21 expressed recombinant S-based enzyme-linked immunosorbent assay (ELISA) and line immunoassay, it was observed that the seroprevalence of HCoV-HKU1 antibody increased from 0% in patients who were <10 years old to a plateau of 22% in patients who were 31 to 40 years old. In another study on seroepidemiology of the four human coronaviruses in Germany using E. coli BL21-expressed recombinant N-based line immunoassay, it was noted that the seropositivity for HCoV-HKU1 in 25 healthy blood donors was 48%, similar to those for HCoV-OC43 (52%), HCoV-229E (56%) and HCoV-NL63 (60%). On the other hand, in another study of a US metropolitan population using baculovirus-expressed recombinant N-based ELISA, it was observed that the proportion of HCoV-HKU1 seropositive adults was 59.2%, significantly lower than those for HCoV-OC43 (90.8%), HCoV-229E (91.3%) and HCoV-NL63 (91.8%). Moreover, it was also observed that significantly different seropositivity rates for the various human coronaviruses were observed in individuals of different races, smoking status and socioeconomic status. Further studies have to be performed to delineate whether these demographic factors confer differential risks of susceptibility to different human coronaviruses.
Clinical diseases
Similar to other human coronaviruses, HCoV-HKU1 is associated with both upper and lower respiratory tract infections. Respiratory tract infections associated with HCoV-HKU1 are indistinguishable from those associated with other respiratory viruses. For upper respiratory tract infections, most patients present with fever, running nose and cough; while for lower respiratory tract infections, fever, productive cough and dyspnea are common presenting symptoms. Most HCoVHKU1 infections are self-limiting, with only two deaths reported in patients with HCoV-HKU1 pneumonia. Both had underlying diabetes mellitus, cardiovascular diseases (myocardial infarction in one and cerebrovascular accident in the other) and cancers (gastric lymphoma in one and prostatic carcinoma in the other), lymphopenia and airspace shadows in both lungs. Interestingly, a recent study from rural Thailand that involved control patients showed the presence of human coronaviruses in >2% of control patients, which raised questions about the role of human coronaviruses in pneumonia. At the moment, no antiviral drugs or vaccines for HCoV-HKU1 and the other human coronaviruses are available. Symptomatic and supportive treatment is the mainstay of therapy given to patients suffering from infections caused by these viruses.
In addition to respiratory tract infections, HCoV-HKU1 has been found in other illnesses. In our one-year prospective study, it was observed that HCoV-HKU1 infection was associated with febrile seizures. On the other hand, in another French study, although six (17.6%) of the 34 HCoV-HKU1 infected children were admitted for epileptic seizures, HCoV-HKU1 infections were not shown to be associated with febrile seizures. In one study, HCoV-HKU1 was detected in the stool samples of two patients with respiratory tract infections but no gastrointestinal tract symptoms, but it was not detected in patients with diarrhea. In another study, HCoV-HKU1 was detected in a liver transplant recipient with hepatitis, which other causes of hepatitis, such as graft rejection, cytomegalovirus, etc. were excluded. The significance of HCoV-HKU1 febrile seizures, gastroenteritis and hepatitis remains to be determined.
Laboratory diagnosis
The most common method for making a diagnosis of HCoV-HKU1 infection is RT-PCR or real-time RT-PCR using RNA extracted from respiratory tract samples such as nasopharyngeal aspirates (NPA). Both the pol and N genes have been used as the targets for amplification (Table 2). In our index patient, the viral loads in the NPA of the patient were 106 to 107 and 105 copies/ml of NPA in the first and second weeks of illness respectively, and became undetectable from the third week of illness onwards. Recently, DNA microarrays have also been used for detection of HCoV-HKU1 and other human coronaviruses, and the sensitivity was found to be similar to that of individual real-time RT-PCR. In addition to nucleic acid detection, monoclonal antibodies have been generated for direct antigen detection in NPA, although the targets for the binding of the monoclonal antibodies are still not known.
For antibody detection, E. coli BL21- and baculovirus-expressed recombinant N of HCoV-HKU1 has been used for IgG and IgM detection in sera of patients and normal individuals, using Western blot and ELISA. In our index patient, his serum IgG titer rose from <1:1,000 in the first week of his illness to 1:2,000 and 1:8,000 in the second and fourth weeks of his illness respectively, whereas the IgM titers in the first, second and fourth weeks were 1:20, 1:40, and 1:80 respectively. Furthermore, in our retrospective study of patients with community acquired pneumonia, all the six patients with serum samples available showed four-fold changes in IgG titer and/or presence of IgM against HCoV-HKU1. Although HCoV-HKU1 is still not cultivable, a neutralization antibody test was also developed recently using HCoV-HKU1 pseudotyped virus.
Concluding remarks
The discovery of SARS coronavirus marked the beginning of the race of coronavirus hunting in humans and animals. In just a few years, the number of “coronavirus” papers found by Medline search has doubled and the number of coronaviruses with complete genomes available has tripled. With this increase in the number of coronaviruses and genomes, we are starting to appreciate the diversity of coronaviruses. Moreover, comprehensive and user-friendly databases for efficient sequence retrieval and the ever-improving bioinformatics tools have further enabled us to improve our understanding of the phylogeny and genomics of coronaviruses. With the increasing number of coronaviruses, more and more closely related coronaviruses from distantly related animals have been observed. The most notable example related to human coronaviruses is the clustering of HCoV-OC43, bovine coronavirus and porcine hemagglutinating encephalomyelitis virus. With more and more coronaviruses discovered, we will be able to understand the origin of the various human coronaviruses, and more importantly, the secret behind their mechanisms of interspecies transmission.