Dataset: 11.1K articles from the COVID-19 Open Research Dataset (PMC Open Access subset)
All articles are made available under a Creative Commons or similar license. Specific licensing information for individual articles can be found in the PMC source and CORD-19 metadata
.
More datasets: Wikipedia | CORD-19

Made by DATEXIS (Data Science and Text-based Information Systems) at Beuth University of Applied Sciences Berlin

Deep Learning Technology: Sebastian Arnold, Betty van Aken, Paul Grundmann, Felix A. Gers and Alexander Löser. Learning Contextualized Document Representations for Healthcare Answer Retrieval. The Web Conference 2020 (WWW'20)

Funded by The Federal Ministry for Economic Affairs and Energy; Grant: 01MD19013D, Smart-MD Project, Digital Technologies

# Highlight for Query ‹Coronavirussymptoms›

## Mesodynamics in the SARS nucleocapsid measured by NMR field cycling

Introduction

The dynamic nature of proteins is one of the key elements of their function. Although this principle is widely accepted, the detailed characterization of protein motions at atomic resolution is much more challenging than determining the average static structure. High field NMR is a particularly powerful method for studying protein dynamics at atomic resolution over a wide array of time scales and has therefore extensively been used in the last two decades. The major emphasis on the development of ultra-high-field magnets for an increase in sensitivity and resolution has allowed investigations on ever-larger biomolecular systems. Ironically, this high-field regime is not as sensitive to mesodynamics as relaxation data at lower fields. Low-field R1 relaxation experiments covering 1H Larmor frequencies up to several tens of MHz have been performed using field-cycling relaxometers (Kimmich 1996; Luchinat and Parigi 2007). Due to the low resolution, these experiments have usually probed the relaxation of solvent protons as reporters, and are therefore limited. Here we report R1 relaxation data of a protein at low field, but with atomic resolution, using a field-cycling apparatus in a commercial 500 MHz magnet (Redfield 2003). By physically moving the sample in the magnet bore the relaxation behavior can be investigated at various low fields; however, the return of the sample to its initial position allows these measurements with the resolution and sensitivity of the high field.

This approach has been successfully applied to studies of lipids and DNA, using one-dimensional 31P and 13C spectra (Roberts et al. 2004; Sivanandam et al. 2009). Here we use two-dimensional 1H 15N heteronuclear experiments spanning from 17.3 to 91.2 MHz (15N Larmor frequency will be used throughout, corresponding to 170–900 MHz “spectrometers”) to investigate the dynamics of the N-terminal domain of the SARS (Severe Acute Respiratory Syndrome) nucleocapsid protein (SARSN).

SARSN is a multifunctional protein (Surjit and Lal 2008) implicated in RNA binding (Tan et al. 2006; Zuniga et al. 2007), capsid assembly (Hsieh et al. 2005; Saikatendu et al. 2007), cell-cycle disruption (Surjit et al. 2006), and immune response to the SARS virus (Zhang et al. 2007). Previous experiments have indicated a flexible β-hairpin in addition to a relatively rigid antiparallel β-sheet based on the {1H}-15N heteronuclear NOE at 500 MHz (Huang et al. 2004). Sampling the spectral density over this wider range enabled us to robustly determine the time scale of motions of the hairpin and additional loops (Fig. 1). Heteronuclear {1H}-15N NOE at differing fields (50.7–91.2 MHz) were particularly useful for establishing the internal correlation times in the disordered regions. Matching internal correlation times of about 0.8 ns for many of these residues are suggestive of correlated motions, which were buttressed by molecular dynamics simulations in explicit solvent.

Relaxation measurements

High resolution 2D 15N R1 relaxation measurements at 17.3 and 30.2 MHz were obtained using a Varian INOVA 500 equipped with a Varian indirect probe and a homebuilt field-cycling device (Redfield 2003) to move the sample from the center of the magnet to the desired lower fringe fields. A standard thin-walled 5 mm NMR tube is modified with plastic inserts epoxied above and below the sample to avoid bubbles. Previous iterations of this device used a pneumatic driver for the shuttle, but the violent deceleration of the sample at either end of transit apparently led to protein denaturation, even though samples are tightly sealed and degassed to minimize turbulence as described previously (Redfield 2003). The system has therefore been rebuilt to use a stepper motor to drive a timing belt and push rod to move the sample, allowing for more controlled deceleration (Redfield, in preparation). Experiments with ferro-cytochrome c were performed to verify that temperature variations during field-cycling could be neglected (data not shown).

Fringe field strengths versus height were measured with magnetometers as described (Redfield 2003) and checked with a commercial Gaussmeter (Lake Shore). The position of the sample in the fringe field was controlled by digital input to the stepper motor. The field strengths reported in the results reflect the measured field at the center of the sample. R1 measurements at low field used a standard experiment (Farrow et al. 1994) with the addition of commands to control the field-cycling apparatus, and 100–150 ms delays to accommodate the raising and lowering of the sample. This transit delay was adjusted to account for the greater distance necessary to sample low fields. In order to compensate for signal attenuation due to relaxation in transit, 4–8 times as many transients were acquired for these experiments as at high field. Field-cycling did not affect the lock to a greater extent than did a strong gradient pulse, and no degradation of the lock was observed over the course of the experiment. Proton inversion pulses during the relaxation period were not feasible because of the inhomogeneity of the fringe field and absence of a transmitter coil at the relevant height.

High-field NMR spectra were acquired on Varian INOVA 500, 600, and 900 MHz spectrometers equipped with triple resonance probes and z-axis gradients, as well as a Bruker AVANCE-800 spectrometer equipped with a cryoprobe. R1, R2, and NOE data were collected using standard 15N relaxation experiments (Farrow et al. 1994), in an interleaved manner.

Processing and fitting of relaxation data

Spectra were processed using NMRPipe (Delaglio et al. 1995) software and analyzed using NMRViewJ (Johnson and Blevins 1994). For R1 and R2, peak intensities were fit to mono-exponential decay equations using an in-house program. Errors in the measurements were assessed from base-plane noise and duplicated points; because of the interleaved acquisition, both approaches resulted in similar estimated errors. Model-free fits were performed using relxn2.2 (Lee et al. 1999) and the graphical front-end rvi (Clarkson et al. 2006); only relaxation data from 50.7 MHz and above were used in the final fits. Data were weighted by error for χ2 minimization, which for purposes of fitting was defined as the larger of the fitted error or 5%. Excluding flexible residues (Vuister et al. 1993), a τm of 11.9 ns was determined by global minimization and used in all subsequent fits. An estimation of tumbling anisotropy using local fits of τm and the 1SSK structure (Huang et al. 2004) with the program qfit (Lee et al. 1999) indicated D‖/D⊥ of 0.94, and anisotropy was therefore disregarded in subsequent fitting procedures. Errors in single fits were assessed using a Monte Carlo method in which random values consistent with the measured errors were added to the original data and the fit repeated. The total number of Monte Carlo simulations used was 150 for each residue.

Model selection was performed using the Bayesian information criterion (BIC) with model elimination (d’Auvergne and Gooley 2003) and a modified jackknife approach in which 1–3 relaxation rates or NOEs were eliminated in each attempted fit. Residues for which more than one violation of model-free constraints (order parameters greater than one or less than zero, or τe/τs ≥ τm) occurred during the jackknife procedure were shifted to a simpler model. No residues experienced multiple violations during the jackknife when model 2 was used. The initial rounds of model selection included models that incorporate REX corrections to the R2 data (models 3 and 4). However, these models were not ultimately selected for any residues because CPMG relaxation dispersion gave no indication of REX (data not shown), and refinement of model selection excluded those models. Accordingly, the only models used in the final analysis were model 2 (S2, τe) (Lipari and Szabo 1982), and model 5 (\documentclass[12pt]{minimal}

\usepackage{amsmath}

\usepackage{wasysym}

\usepackage{amsfonts}

\usepackage{amssymb}

\usepackage{amsbsy}

\usepackage{mathrsfs}

\usepackage{upgreek}

\setlength{\oddsidemargin}{-69pt}

\begin{document}$$S^{ 2}_{\text{s}}$$\end{document}, \documentclass[12pt]{minimal}

\usepackage{amsmath}

\usepackage{wasysym}

\usepackage{amsfonts}

\usepackage{amssymb}

\usepackage{amsbsy}

\usepackage{mathrsfs}

\usepackage{upgreek}

\setlength{\oddsidemargin}{-69pt}

\begin{document}$$S^{ 2}_{\text{f}}$$\end{document}, τs) (Clore et al. 1990). The final model-free fits excluded the low-field data in the interest of accuracy (see below), but including these data did not generally change the fitted order parameter by more than 5%.

Parameter variability in the jackknife was heterogeneous. The standard deviations of the order parameter (S2 or \documentclass[12pt]{minimal}

\usepackage{amsmath}

\usepackage{wasysym}

\usepackage{amsfonts}

\usepackage{amssymb}

\usepackage{amsbsy}

\usepackage{mathrsfs}

\usepackage{upgreek}

\setlength{\oddsidemargin}{-69pt}

\begin{document}$$S^{ 2}_{\text{s}}$$\end{document} × \documentclass[12pt]{minimal}

\usepackage{amsmath}

\usepackage{wasysym}

\usepackage{amsfonts}

\usepackage{amssymb}

\usepackage{amsbsy}

\usepackage{mathrsfs}

\usepackage{upgreek}

\setlength{\oddsidemargin}{-69pt}

\begin{document}$$S^{ 2}_{\text{f}}$$\end{document}) and internal correlation time (τe or τs) were 4.6 and 96% of the average value for a residue, respectively. For residues with a narrow R1 distribution, these errors were reduced to 2.9 and 11.9%, respectively, suggesting that the internal correlation time in particular is fit much more robustly for these residues.

Molecular dynamics simulation

Solvent molecules and counter-ions were added to the PDB file 2OFZ (Saikatendu et al. 2007) by the program solvate 1.0 (Grubmuller 1995). Atoms beyond 42 Å from the center were deleted; the minimum distance between any protein atom and the spherical surface was 16 Å. The stochastic boundary condition (Brooks and Karplus 1983; Brunger et al. 1984) was imposed to prevent solvent from departing from the water sphere. The CHARMM program (Brooks et al. 1983) with the all-atom force field (MacKerell et al. 1998) and CMAP correction (Mackerell et al. 2004) was used for minimization and simulation. The non-bonded interaction cut-off distance was set to be 14 Å, and the simulation timestep was 2 fs. Hydrogen atoms were constrained by the SHAKE algorithm. The system was minimized, then heated from 50 to 300 K in 200 ps. The two production runs were 15 ns in duration.

Protein expression and purification

The SARSN construct was a gift from Dr. Hualiang Jiang (Luo et al. 2004). E. Coli BL21 (DE3) cells were grown in 15N M9 medium to an OD of 0.6 and induced with 1 mM IPTG. Cells were harvested by centrifugation and resuspended in lysis buffer (0.1 M Tris, 0.5 M NaCl, pH 7.0) with a protease inhibitor cocktail (Calbiochem). Following lysis by sonication, cellular debris was removed by centrifugation. The lysate was then dialyzed against 50 mM Na4H2PO4, 50 mM NaCl pH 7.0 and purified over SP-sepharose equilibrated in the same buffer, eluting with 2 M NaCl. Following a repeat dialysis step, the protein was purified over Q-sepharose using the same buffer system. Following dialysis into the NMR buffer (50 mM HEPES, 150 mM NaCl, 0.02% NaN3 pH 7.0), the protein was purified over S-100 sepharose and concentrated to 1 mM. NMR samples contained 10% D2O. All separation media were purchased from GE Healthcare.

Field dependence of R1 relaxation rates

At 500 MHz (11.7 T), R1 values are roughly constant across the entire SARSN, but R1 heterogeneity significantly increased as the magnitude of the field was changed in either direction (Fig. 2). When R1 was measured at a 17.3 MHz field sampled by the field-cycling apparatus, for several residues R1 was decreased by as much as 25% relative to the majority of the residues (Fig. 2a). This reduction is also observed at 7 T, but to a lesser extent. These residues were primarily localized to the β-hairpin (residues 90–108) and a loop region spanning residues 60–64. On the contrary, increasing the field strength above 11.7 T resulted in an enhanced R1 for the same set of residues relative to the rest of the protein. At the highest field employed in this study, 91.2 MHz, the enhancement was as high as 40% (Fig. 2b).

Qualitatively, this “R1 crossover” implies that these residues have greater flexibility than the remainder of the protein. Lower R1 values are expected for flexible residues at low field because the internal motion depresses the spectral density in the extreme narrowing limit (ωτm ≪ 1). On the contrary, in the spin-diffusion limit (ωτm ≫ 1), flexible residues are expected to have a greater spectral density and thus a higher R1 than rigid residues. Moreover, the particular frequency at which the crossover occurs defines an approximate range of internal correlation times for these residues. For a protein of this size, the observed R1 crossover point implies a motion with an internal correlation time in the range of 0.5–1.5 ns.

The data collected at low field using field cycling show larger differences in R1 between rigid and flexible residues than the high and ultra-high-field data. Uncertainties for the low-field data are larger because relaxation during the 200–300 ms transit time and increased R1 relaxation rates at lower fields diminishes the intensity of the signals from the protein. Despite these imperfections, the field-cycling data are of sufficient quality to provide valuable insight into the dynamics of this protein at atomic resolution.

Field dependence of the {1H}-15N heteronuclear NOEs

The {1H}-15N heteronuclear NOEs for the β-hairpin also display a pronounced field-dependence between 50.7 and 91.2 MHz (Fig. 3) with a confirmation of the previously reported strong NOEs at 50.7 MHz (Huang et al. 2004). The attenuation of the NOEs at higher fields strongly suggests an internal mesodynamic fluctuation. The 15N NOE is the ratio RNOEγH/R1 γN, where RNOE is approximately proportional to J(ωH) since ωN ≪ ωH, and R1 is proportional to J(ωN) (Abragam 1961). This ratio changes little for rigid residues over a twofold change in Larmor frequency because the respective spectral density values are essentially constant over this frequency range. In contrast, the observed field-dependence of the NOE for flexible residues indicates the presence of a transition in the spectral density over this frequency range. Moreover, the range of Larmor frequencies sampled defines upper and lower bounds for the internal correlation time τ (ωτ ≈ 1 for the center of the transition) of 150 ps < τ < 3 ns. Without knowing where the RNOEγH/R1γN ratio flattens out, however, the frequency of the motion cannot be determined with great precision.

Model-free analysis to characterize the dynamics of SARSN

While dispersion patterns may imply gross features of the dynamics, a detailed characterization of protein motions requires a more complete survey of relaxation data and interpretation using motional models. For this goal, a complete set of high-field R1 (Fig. 2b), NOE (Fig. 3) and R2 (Fig. 4) data was acquired and used to determine model-free parameters for all residues of SARSN (Fig. 5). All residues were fit to a local dynamics model involving either a single order parameter (S2) and internal correlation time (τe) (model 2, or simple model-free) (Lipari and Szabo 1982) or two order parameters (\documentclass[12pt]{minimal}

\usepackage{amsmath}

\usepackage{wasysym}

\usepackage{amsfonts}

\usepackage{amssymb}

\usepackage{amsbsy}

\usepackage{mathrsfs}

\usepackage{upgreek}

\setlength{\oddsidemargin}{-69pt}

\begin{document}$$S^{ 2}_{\text{f}}$$\end{document} and \documentclass[12pt]{minimal}

\usepackage{amsmath}

\usepackage{wasysym}

\usepackage{amsfonts}

\usepackage{amssymb}

\usepackage{amsbsy}

\usepackage{mathrsfs}

\usepackage{upgreek}

\setlength{\oddsidemargin}{-69pt}

\begin{document}$$S^{ 2}_{\text{s}}$$\end{document}) and internal correlation times (τs and τf), where τf is assumed to be ~0 (model 5, or extended model-free) (Clore et al. 1990).

Models including an REX term were not selected by information criteria for any residues. This is in agreement with R2 relaxation-dispersion experiments performed on SARSN, which also do not identify any significant μs-ms fluctuations (data not shown). The pronounced decrease in R2 values in the loop and β-hairpin regions (Fig. 5) is consistent with significant ps-ns motions, but not slower motions.

While the majority of residues have high order parameters and short internal correlation times typical for a stable, folded structure, residues with the above described unusual R1 and NOE dispersions fit to model 5. These model 5 residues are characterized by high \documentclass[12pt]{minimal}

\usepackage{amsmath}

\usepackage{wasysym}

\usepackage{amsfonts}

\usepackage{amssymb}

\usepackage{amsbsy}

\usepackage{mathrsfs}

\usepackage{upgreek}

\setlength{\oddsidemargin}{-69pt}

\begin{document}$$S^{ 2}_{\text{f}}$$\end{document} and low \documentclass[12pt]{minimal}

\usepackage{amsmath}

\usepackage{wasysym}

\usepackage{amsfonts}

\usepackage{amssymb}

\usepackage{amsbsy}

\usepackage{mathrsfs}

\usepackage{upgreek}

\setlength{\oddsidemargin}{-69pt}

\begin{document}$$S^{ 2}_{\text{s}}$$\end{document} values, indicative of local order combined with larger-amplitude slower fluctuations. Critically, the field dependence of the R1 and NOE data enabled us to robustly determine the correlation time τs of these slower fluctuations. The striking similarity of τs of about 0.8 ns for the entire β-hairpin and adjacent loops (Figs. 1, 5) suggests that these residues undergo a correlated motion as a stable structural unit. This is consistent with the structure showing a regular hydrogen-bonded β-hairpin as well the chemical shifts in this region (Cornilescu et al. 1999).

The high accuracy of the determined τs values arises from the crossover in the R1 data and the pronounced NOE dispersion data as illustrated by simulated relaxation parameters (Fig. 6). These theoretical curves emphasize the value of field-cycling data in identifying the R1 crossover and thereby defining the timescale of the collective mesodynamics.

We want to note that although the observed relaxation rates at 17.3 and 30.2 MHz, obtained by field-cycling, qualitatively agree with high-field observations, the low-field R1 values are 45 and 30% lower than predicted from model-free parameters using high-field data, respectively. Error from interference between CSA and dipolar relaxation mechanisms is of particular concern here, due to the inability to regularly invert proton polarization during the relaxation period in the field-cycled experiments. While this effect certainly contributes to the observed bias towards too low R1 values, it is not possible to account for the entire effect at 17.3 and 30.2 MHz using analytical calculations of ηz, implying that the observed bias cannot be attributed to any single source of interference.

Additionally, in the fringe field of a magnet, the field strength decreases constantly with increasing distance from the center of the coil. As a result, the field is inhomogeneous over the 1.5 cm length of the sample, with a maximal variation of 1.4 T at 30.2 MHz (7 T). The remaining difference between prediction and experiment could in part result from this field inhomogeneity, but could also arise from technical error or from some previously unrecognized shortcoming of the model-free theory. These questions should be investigated further in future by a more complete survey between 5 and 9 T, or possibly at lower field using 13C carbonyl relaxation. Below 4 T the relaxation rate is expected to be too fast to be measured with our shuttle device for any nuclear spin adjacent to a geminal proton because of relaxation of the measured nuclear spin by the proton. For the results presented here, this uncertainty does not significantly affect the conclusions.

MD simulations of SARSN in explicit solvent

Although the similarity in correlation times for the entire β-hairpin and a number of residues that are far away in sequence but close in structure suggests a collective motion, the ensemble nature of NMR measurements precludes a direct verification. For this goal, we performed a 15 ns all-atom MD simulation of the SARSN N-terminal domain in explicit solvent. Here, correlated motions can be directly identified, since the simulations are performed on a single molecule.

Consistent with the high order parameters fitted for the majority of the domain, the globular part of the protein is stable, without detectable structural drift (Fig. 7a, b). The β-hairpin is also internally stable—although it is exposed to solvent, it does not show signs of partial unfolding during the simulation. When the hairpin is superimposed for all snapshots from the simulation, the fluctuations remain within 2 Å for the β-hairpin throughout the trajectory (Fig. 7b). However, the simulations show a large amplitude motion of the β-hairpin as a rigid body relative to the globular portion of the protein (Fig. 7a, b). The low internal fluctuation within the β-hairpin is in agreement with high \documentclass[12pt]{minimal}

\usepackage{amsmath}

\usepackage{wasysym}

\usepackage{amsfonts}

\usepackage{amssymb}

\usepackage{amsbsy}

\usepackage{mathrsfs}

\usepackage{upgreek}

\setlength{\oddsidemargin}{-69pt}

\begin{document}$$S^{ 2}_{\text{f}}$$\end{document}, while the large displacement of the β-hairpin corresponds to the low \documentclass[12pt]{minimal}

\usepackage{amsmath}

\usepackage{wasysym}

\usepackage{amsfonts}

\usepackage{amssymb}

\usepackage{amsbsy}

\usepackage{mathrsfs}

\usepackage{upgreek}

\setlength{\oddsidemargin}{-69pt}

\begin{document}$$S^{ 2}_{\text{s}}$$\end{document} values calculated form the NMR experiments. Moreover, the timescale of the β-hairpin displacement is in the ns time regime, in agreement with the τs of 0.8 ns from the NMR experiments.

In addition to extensive mesodynamics in the β-hairpin, the NMR relaxation data revealed motions with similar correlation times for loops encompassing residues 60–64 and 127–130. The MD simulations now provide a rationale for this observation: during the simulations, direct and water-mediated hydrogen bonds form between several residues in the hairpin and these flexible loops. For example, the side-chain of R90 hydrogen-bonds temporarily to residues in both loops (Fig. 7c). These interactions seem to be responsible for coordinating motions of the hairpin and those loops, explaining the similarity of relaxation dispersions at these sites. Strikingly, the MD simulations elucidate the collective nature of the mesodynamics of the entire β-hairpin and the correlation of motions between the hairpin and the 60–64 loop. These correlations can be computed from the time traces of atomic fluctuations within one molecule through a covariance matrix (Fig. 7d), emphasizing the value of MD simulations to complement the NMR experiments.

Conclusions

The advantages of examining relaxation properties of proteins at multiple field strengths are well-known, and due to the limitations of resolution and configuration, unidirectional expansion of measurements to higher fields has been the natural progression. Here we have expanded our sampling of the spectral density in both directions using a field-cycling apparatus and high-field magnets to measure R1 relaxation in a heterogeneously dynamic domain from the SARS coronavirus at atomic resolution.

Besides the proof of principle that site-specific relaxation rates can be determined at low field while retaining much of the sensitivity and all of the resolution of a high-field spectrometer, the experiments allowed a precise determination of the internal correlation times of residues with complex motional models (model 5). Given that these residues have very similar internal correlation times and cluster structurally together, we suggest that the relaxation behavior is the result of correlated motions of the hairpin and the 60–64 loop. This model is buttressed by MD simulations. The β-hairpin and the 60–64 loop coincide with several previously-identified sites where SARSN engages in protein-protein and protein-nucleic acid interactions, including a ubiquitination site at K62. That areas of significant flexibility in SARSN coincide with residues important for binding interactions is not surprising, because disordered regions in proteins are often associated with promiscuous binding activity (Fink 2005). However our current data do not provide mechanistic insight into the relation between dynamics and SARS function.

Although the technique of measuring relaxation in the fringe field limits the accuracy of the relaxation rates, the data from low-field experiments are qualitatively consistent with the findings from high-field and ultra-high-field spectrometry. With further refinement of equipment and technique it should be possible to improve the sensitivity and accuracy of these measurements substantially. The methods of field-cycling (Roberts et al. 2004; Sivanandam et al. 2009) will continue to be very useful in other contexts: for example, experiments with 13C carbonyls where the absence of an attached proton will probably make measurements to zero field feasible, as they are for 31P.