Introduction
Recently several lines of evidence have demonstrated that a new strain of coronaviruses, SARS-CoV (severe acute respiratory syndrome-associated coronavirus), is the pathogen of SARS 1., 2.. Within a short period, SARS-CoV was decoded completely, providing a fundamental basis for understanding its pathogenesis and developing effective diagnostic and therapeutic approaches 3., 4., 5.. However, although the initial SARS outbreak has ended, people still have quite a limited knowledge about this infectious disease (6). Worries about the recurrence of SARS in this winter have been expressed by medical professionals and how to diagnose cases in the early stages of infection is their major concern. To develop a sensitive assay for detecting the diluted virus in body fluid, we must first address the questions about which biological roles the SARS-CoV structural proteins play and the strength of the immunoresponses caused by these proteins.
Of all the coronaviral structural proteins, the N protein is the most abundant throughout infection, both in mRNA and protein levels (7). Compared to the mRNA levels of other structural genes, the mRNA of the N protein is expressed three to ten times higher at 12-hour post-infection (8). The N protein has a high composition of polar amino acid residues. Most of them display hydrophilicity and immunogenicity, as, for example, the murine coronavirus (MCV; ref. 9), the turkey coronavirus (TCV; ref. 10), and the infectious bronchitis virus (IBV; ref. 11). In contrast to other SARS-CoV structural proteins that contain multiple sites for glycosylation during infection, which may cause different immunoresponses, the N protein is free of glycosylation sites and does not change its immunological characteristics even expressed in a prokaryote system (12). Importantly, the SARS-CoV N protein is highly conserved with an almost complete identity among various strains that have been sequenced so far (5). These features make the N proteins useful for group-specific serologic assays. For example, the N gene in IBV was expressed in bacteria with a histidine tag at the N-terminus, and the expressed IBV N protein was then used as antigen for developing an assay to detect IBV-specific antibody. Collisson et al. found that this antigen could be successfully applied to diagnose IBV viruses in different chicken strains (13). Utilizing the baculovirus expression system, the TCV N protein was expressed and applied to specific serologic tests for detecting TCV antibodies in turkeys (14). A high degree of concordance was observed between the ELISA and the indirect fluorescent antibody (IFA) test with 96% of specificity (14). Therefore, studying the immunogenic properties of the N protein may form the basis for developing immunodetection assays of SARS, and also for formulating strategies of future vaccine development against this disease.
The present study was undertaken to explore the epitope sites located in the SARS-CoV N protein. We have generated three different lengths of the N protein fragments and synthesized nine peptides derived from the N proteins. We observed that all recombinant proteins could cross-react with the antibodies in SARS sera. To define the antigenic site(s) on the N protein, we further screened the synthesized peptides in SARS sera using Western blot as well as ELISA. The data obtained from these immunoassays demonstrated that an epitope site is within the C-terminus of the N protein.
PCR amplification and expression constructs
To amplify the N gene fragments, we used the cDNA reverse-transcripted from SARS-CoV mRNA as the template, and three pairs of primers, N-full, ΔN256 and ΔN124, as primers in PCR reactions. As shown in Figure 1, the sizes of PCR products coincide well with the theoretical estimation based upon the genomic sequence, 1.2 Kb, 490 bp, and 890 bp, for N-full, ΔN256, and ΔN124, respectively. Since all primers contain additional BamH I site at 5’ end and Not I at 3’ end, both restriction enzymes were used to completely digest these PCR products and pET30a vector. The digested and harvested N gene fragments were ligated with the linearized pET30a to form the expression vectors, pET30-N-full, pET30-ΔN256 and pET30-ΔN124. In Figure 1, the digestion of these vectors with BamH I and Not I generated three DNA fragments with different molecular weights, which were identical to the PCR products, indicating that these N gene fragments were inserted into pET30a correctly. Finally all insertions were confirmed by DNA sequencing (data not shown).
Expression and purification of recombinant N proteins
The three expression vectors were transformed into the BL-21 strain and the proteins were expressed by inducement of IPTG. Interestingly, pET30-ΔN124 was able to produce some recombinant proteins even without the inducement of IPTG, whereas pET30-N-full and pET30-ΔN256 only generated their proteins after the presence of IPTG. According to the hydrophobic analysis, the N protein contains several hydrophilic regions spanning the whole N protein (Figure 2). Thus, these recombinant proteins were expected to be highly soluble. On the contrary, all the three recombinant proteins expressed in BL-21 mainly formed inclusion body and released limited soluble forms in cytoplasm. The recombinants could not be purified directly from soluble fractions. To obtain a high protein yield, the bacterial pellets were treated with 8 M urea followed by a strong probe sonication. The denatured proteins retained the affinity to Ni2+, and hence Ni-NTA column was effectively applied into the purification of these recombinants. As shown in Figure 3, high purity proteins from all the three N recombinants have been obtained through one step of affinity chromatography (>95%).
Immunoresponses of the recombinant N proteins to SARS sera
The lysate of Vero-E6 infected by SARS-CoV was tested by Western blot with SARS sera as the primary antibody. An obvious immunoprecipitate band appeared around 50 kDa (Figure 4A), close to the molecular weight of the N protein based upon the theoretical estimation. The same experiments were repeated with eleven sera from SARS patients, giving a consistent result that a major immunoreactive band is located at about 50 kDa (data not shown). To check the immunoreactions of the recombinant N proteins, all were examined by Western blot using the sera from SARS patients. Figure 4B depicts that all three N recombinants have strong immunoresponses to the SARS sera. Impressively, the size of the N-full protein is almost identical to this immuno-positive band found from Vero-E6 lysate. Thus, the evidence supports the conclusion that the N protein expressed from Vero-E6 cells that had been infected by SARS-CoV is an antigenic protein. Another potentially important phenomenon is that the immunoreactive intensities of three recombinant proteins are quite similar. The N-full protein contains the whole amino acid sequence of the N protein. However, the other two recombinants, ΔN256 and ΔN124, are truncated N proteins. The ΔN256 protein only has 166 amino acids at the C-terminus of the N protein, whereas the ΔN124 protein is lack of 124 amino acids at the N-terminus of the N protein but contains 298 amino acids at C-terminus. Hence, all of the N recombinants share the same C-terminus and are distinguished from each other by the lengths of their N-terminus. Since similar immunoreactivities were observed in the three N recombinants, they may share similar epitope site(s), possibly within 166 amino acids at the C-terminus that are shared by the three recombinants. The N proteins, with the truncated N-terminus ΔN256 and ΔN124, did not seem to lose their immunoresponses to SARS sera, indicating that the antigenicity of N-terminus of the N protein might be much weaker than that of the C-terminus. This hypothesis was supported by ELISA experiments. A number of SARS sera were tested by ELISA to examine the immunoreactivities of these N recombinants, and all the three N proteins displayed similar positive ELISA reactivities (data not shown).
Screening the epitope site at the C-terminus of the N protein
Having established that the three N recombinants share 166 amino acids of the N protein at the C-terminus and share similar immunoresponses to SARS sera, the next question to address is how to precisely localize the epitope site(s) within this region. On the basis of hydropathy plot, nine peptides located in this region were selected and synthesized. These synthesized peptides were screened in SARS sera (31 cases) with ELISA. The lysate of infected Vero-E6 cells was used as positive control for ELISA reactivity. A peptide showing comparable ELISA reactivity with this control would be considered as an immunoreactive N fragment. Table 1 summarizes the ELISA data from the screening of the nine peptides, in which p values are very important, indicating whether there is a significant difference in ELISA reactivities between a synthesized peptide and SARS-CoV as the antigen. Two peptides, N371 and N385, have comparable immunoreactivities with the Vero-E6 lysate, with 97% and 94% positive detection rates, respectively. Thus the epitope site at the C-terminus of the N protein is likely to be located at Codons 371–407.
Confirming N371 at epitope site by competitive inhibition using ELISA and Western blot
Since the antibodies in sera from SARS patients can recognize these viral proteins, the synthesized peptides, if they indeed mimic some epitope sites in this virus, should elicit immunoresponses and neutralize the immunoreaction in ELISA. Peptide N371 is the most comparable antigen among the nine synthesized peptides. Hence, it is deduced to be a good competitor against SARS-CoV virus for antibody binding, if it does exist at the epitope site. Prior to ELISA determinations, peptide N371 was incubated with or without SARS serum for 30 min, respectively, and was then added to a microtiter plate coated with the extracted proteins from SARS-CoV. As predicted, the ELISA reactivities were attenuated with the increase of peptide concentrations in the SARS sera (Figure 5A). Furthermore, the competition experiments were conducted in Western blot. The PVDF (Polyvinylidene Fluoride) membrane transblotted with three recombinant N proteins were incubated with the SARS serum containing the peptide, N371. Compared with the immunoresponses showed in Figure 3B, the cross-reactions between the recombinants and SARS antibodies were significantly curtailed, indicating that this peptide inhibited the recognition between SARS-CoV and its antibodies (Figure 5B). This peptide, therefore, is confirmed to be an epitope site that exists in SARS-CoV as well as in the three recombinant proteins.
Discussion
Antigenicity of the N proteins in the coronavirus family has been extensively studied 15., 16.. Usually these N proteins contain multiple epitopic sites. However, they commonly have an epitopic site at the C-terminus. For instance, the epitope sites for TGEV (transmissible gastroenteritis virus), MHV (mouse hepatitis virus), and IBV are located around Codons 360–382 (17), 381–405 (18), and 360–409 at C-termini (19), respectively. Similar to other coronaviruses, the present study has confirmed that an antigenic site is located at the C-terminus of SARS-CoV. In Figure 2, there is a hydropathy plot for the SARS-CoV N protein. A lot of regions in this protein have low hydrophobicity scores (<0), and the C-terminus is specifically strong in hydrophilicity. Two major forces, charge-charge interactions and hydrogen bonds caused by hydrophilicity, have been hypothesized to be crucial for the formation of epitope site. Peptides N371 and N385 are both located at the hydrophilic region of C-terminus and contain a high content of polar amino acids (~ 38%). Further more, their biophysical parameters correlated well with the values of ELISA reactivity as well as the intensity of Western blot when the two peptides were used as antigens. These findings, thus, support the theory that both hydrophilicity and peptide charge are important in determining immunoactive sites in SARS-CoV N proteins.
As described above, the N protein of SARS-CoV is hydrophilic and chargeable. The major function of the N protein in coronaviruses is to stabilize genomic RNA by forming a helical ribonucleocapsid (RNP), which is located at the viral core, and is highly stable under conditions of high ionic strength and highly resistant to the actions of RNase (20). Because of its location in the virus, the N protein is rarely glycosylated in an infected cell. Hence, its recombinant proteins should be soluble during expression in the bacterial system. On the contrary, all the three recombinant N proteins did not express in soluble forms but mainly generated the inclusion bodies. Although a number of approches were taken to improve the solubility of these recombinants, such as altering expression temperature or decreasing inducer concentrations, the formations of inclusions were found in BL-21 strains consistently. Some misfolding mechanisms are believed to be behind the phenomenon. Moreover, this phenomenon may explain the role of the N protein in the virus because the N protein and viral genomic RNA interact reciprocally to stabilize each other. In the bacterial expression system, the precipitation of the N protein could result from the absence of the RNA molecules.
Serological assay is widely used for the diagnosis of virus infection in the host for its low cost, fast speed and high accuracy (21). In some urgent situations, a common way in serological assay is to use viral lysates as antigens. Nevertheless, the use of viral lysates brings some drawbacks. For instance, people preparing SARS-CoV lysates are at risk of infection. Instead of viral lysate, recombinant proteins can be used to avoid such problems. However, to generate a proper antigen specific to SARS-CoV is a tough challenge. A major problem is how to discover high antigenic fragments in viral proteins. In this study, a specific epitope site has been identified. Significantly, these recombinant proteins and synthesized peptides (N371 and N385) were confirmed to have strong immunoresponse to the patient sera determined by either ELISA or Western blot. Screening with a total of 31 SARS patient and 24 normal samples, these antigens, either from recombinants or from peptides, have shown a potential application in clinical diagnosis, with under 2% negative detection rate and over 95% positive detection rate. Therefore, the information provided in this report regarding the epitope site of the N protein will be of potential benefit in developing diagnostic techniques.
Materials
The E. coli strains, DH5α and BL-21(λCE6), were purchased from Beijing Dingguo Corporation (Beijing, China). All primers were synthesized by Shanghai Ding’an Corporation (Shanghai, China). The bacterial protein expression vector, pET-30a, was purchased from Novagen (Darmstadt, Germany). Restriction enzymes and nitro-blue tetrazolium chloride (NBT) and 5-bromo-4-chloro-3’-indolyphosphate p-toluidine salt (BCIP) were obtained from Promega (Madison, USA). Taq DNA polymerase and related PCR reagents were from Invitrogen (Carlsbad, USA). Ni-NTA resin was purchased from Qiagen (Hilden, Germany). The antibody anti-human IgG conjugated with alkaline phosphatase was purchased from Beijing Zhongshan Company (Beijing, China).
Serum specimens
Sera from SARS patients were from the hospitals in Beijing, and the control sera were obtained from healthy volunteers. The clinical diagnostic criteria for SARS followed the Clinical Description of SARS released by WHO (http://www.who.int/csr/sars/guidelines/en/).
Virus resource
The SARS-CoV BJ01 strain used in this study was cultured by the Microbe Epidemic Institute in the Academy of Military Medical Sciences (22). The BJ01 genome was sequenced by Beijing Genomics Institute (BGI). All of the N protein fragments were designed based upon the genome sequence.
SARS-CoV infection and protein extraction from the infected cells
The SARS-CoV was propagated on Vero-E6 cells as described above at the Microbe Epidemic Institute in the Academy of Military Medical Sciences. After viral propagation, the cells were harvested and placed at 70°C for 2 h inactivate the virus. Then, the infected Vero-E6 cells in culture medium were concentrated by polyethylene glycol (PEG) 20,000 followed by cell lysis and protein denaturation with 8 M urea. The cell lysate was further sonicated with a probe sonicator and centrifuged at 13,000 g to remove the insoluble debris. The supernatant was used for Western blot and ELISA determinations.
Peptide design and synthesis
The amino acid sequence of the N protein was downloaded into the ProtScale program at Swiss Institute of Bioinformatics (SIB) to analyze the physical characteristics of the proteins, such as hydrophilicity, hydrophobicity, accessible residues, buried residues, molecular weight, and pI values. A total of nine peptides ranging in size from 16 to 25 amino acid residues were selected for synthesis. All the peptides were synthesized commercially by Chinese Peptide Corporation (Hangzhou, China). The synthesized peptides were characterized by HPLC and mass spectrometry.
Plasmid constructions
The viral genomic RNA was prepared using TRIzol reagent (Invitrogen). First-strand cDNA synthesis was carried out by a SuperScript system as described in the products manual (Invitrogen). Primers for PCR amplification were synthesized by Shanghai Ding’an Co. The full length of the N gene was amplified by PCR using the primer pair, 3’ primer (ATAAGAATGCGGCCGCTTATGCCTGAGTTGAA) with a Not I restriction site and 5’ primer (CGGGATCCATGTCTGATAATGGACCCCA) with a BamH I site. To generate the ΔN256 and ΔN124 fragments, the pairs of the primer were designed as follows. 3’ primer (ATAAGAATGCGGCCGCTTATGCCTGAGTTGAA) with a Not I site and 5’ primer (CGGGATCCCCTCGCCAAAAACGTACT) with a BamH I site for ΔN256, and 3’ primer (AAGAATGCGGCCGCTTATGCCTGAGTTGAA) with a Not I site and 5’ primer (CGGGATCCGCTAACAAAGAAGGCATCGTA) with a BamH I site for ΔN124. After restriction digestion, the digestive fragment containing the N gene was ligated with a linearized pET30a vector (23).
Expression and purification of N recombinant proteins
Three transformed BL-21 stains containing the expression vectors, pET30-N-full, pET30-ΔN256 and pET30-ΔN124, were inoculated into 500 mL of LB broth containing 100 μg/mL of kanamycin. Being shaked, the cultures grew to an optical density (OD) at 600 nm of 0.6–0.8 at 20°C, and IPTG was added to a final concentration of 1 mM. The bacteria were incubated at 37°C for additional 4–6 h, and followed by centrifugation at 4,000 g for 10 min to get the bacterial pellets. The pellets were resuspended in a 10 mL binding buffer containing 20 mM Tris-HCl, pH 7.9, 200 mM NaCl, and 5 mM imidazole, and were lysed by sonication for 3 min. The resulting lysates were centrifuged at 12,000 g at 4°C for 20 min. The supernatant and pellet were examined by SDS-PAGE (SDS-PolyAcrylamide Gel Electrophoresis) and Western blot to check the solubility of the expressed protein. If the inclusion was formed, the bacterial pellets were dissolved in the binding buffer containing 8 M urea. The recombinant proteins were purified by an affinity chromatography, Ni-NTA (Qiagen), with a linear elution of imidazole.
Western blot
The recombinant proteins were separated by SDS-PAGE (10%) and transferred onto PVDF membranes. The membranes were blocked by 3% BSA in Tris 100 mM, NaCl 120 mM, 0.1% Tween-20, pH 7.9 (TTBS), and then were incubated with sera from SARS patients as primary antibody. An anti-human IgG conjugated with alkaline phosphatase was used as the second antibody. The immunoprecipitated bands were developed using a substrate mixture of NBT and BCIP.
ELISA measurement
ELISA tests were carried out according to conventional protocol. In brief, the purified fusion protein was mixed with the sample dilution buffer, embedded in 96-well ELISA plates, and incubated at 37°C for 30 min. Then the wells were washed for five times with the washing buffer, and incubated with sera from SARS patients or normal controls. Each well was washed and incubated with peroxidase-conjugated goat anti-human IgG at 37°C for 20 min. Finally the wells were washed again with PBS containing 0.5% Tween-20. The peroxidase reaction was visualized using the o-pheylenediamine solution as substrate. After 10 min of incubation at 37°C, the reaction was stopped by adding 50 μL of 4 M sulphuric acid, and OD at 450 nm with a reference wavelength of 630 nm was measured by using an automatic ELISA plate reader (Multiskan Microplate Photometer, Finland).
