Identifying Peaks

Analysis of the peaks in the NMR spectra is obviously much more useful if one has the assignment of which atoms in the molecule (or molecules) give rise to which peaks in the spectrum. Section 7 will focus on some tools in NMRViewJ for the initial assignment of peaks from triple-resonance (13C, 15N, 1H) spectra and Section 5.7 on the more specific problem of identifying the pairs of hydrogen atoms that give rise to NOE peaks. Here we will describe some tools that are of general utility in working with the identification of NMR peaks.

Every peak list stores within itself several parameters that facilitate peak searching and matching. The peak template parameter is a list of dimension name and tolerance pairs that are used in searching peak lists for peaks with certain chemical shift values. The search template can have fewer dimensions than the peak list. So, for example, a 3D HNCO experiment could have its template set with nv_peak template hnco 1HN 0.1 15N 0.3. Searching this peak list with two chemical shifts, nv_peak find hnco 8.3 117.0, would find any peaks whose 1HN chemical shift is between 8.2 and 8.4 ppm, and whose 15N chemical shift is between 116.7 and 117.3 ppm. Because the 13C dimension is not specified in the template the 13C chemical shift is ignored in the search.

The peak-list's pattern and tolerance parameters are used when a search is done to find atoms whose assigned chemical shifts are consistent with those of a given peak. The peak pattern of an individual dimension has the format sequenceNumber.atomType. For example, a peak pattern could be set for a peak list named hnco with the command: nv_peak pattern hnco i.hn i.n i-1.c. This implies that any atom assignments for the first dimension should be of atom type hn, for the second dimension should be of type n, and the third dimension of type c. Because the first and second dimensions share the same symbolic sequence number (which can be either an i or a j the atom assignments for these two dimensions should always have the same sequence number. The third dimensions atom assignment should have a residue that is one less than the residue assignment for the first and second dimensions. The tolerance value indicates the allowable deviation between the peaks chemical shifts and the chemical shift of any atoms assigned to the peak. For example, nv_peak tolerance hnco 0.1 0.3 0.4, would set the tolerance for the first dimension to 0.1, the second dimension to 0.3, and the third to 0.4. The peak pattern and tolerance values are most readily set within the Peak Reference panel. Note: the peak pattern and tolerance values are not enforced for peaks in the present versions of NMRViewJ. They exist as storage for parameters that can be used by Tcl scripts for the analysis and assignment of peaks.

The following shows the Peak Reference panel for an 15N NOESY-HSQC experiment. The first and third dimensions are for the amide proton and nitrogen, respectively, as can be seen from the patterns "i.h" and "i.n". The second dimension is for the indirectly detected proton which could be on the same or a different residue, and is not necessarily an amide proton, so it has the pattern "j.h*". The proton in the first dimension has relation D3, this means it is bonded (the D comes from descendant, in a tree structure) to the atom in the third dimension (that is, the amide proton is bonded to the amide nitrogen). The relation field is not necessary in this case as the only atoms that would match the i.h pattern are the amide protons, but in many cases it is necessary to explicitly indicate which dimensions correspond to atoms which must be bonded to the the atom in another dimension.

The pattern and tolerance parameters are central to the use of the Peak Identification Dialog. This dialog, brought up from the Edit->Identify menu of the Peak Analysis panel, can be used to interactively assign atom identifiers to individual peaks for a molecule with existing chemical shift assignments. If this dialog is open, then each time you step to a new peak in the Peak Analysis panel this PeakID panel will be updated with a list of the atoms. The atoms listed are those atoms for which their chemical shifts are within the peaklist's tolerance of the peaks chemical shift values, and for which their atom names are consistent with the peaklist?s pattern.

Each possible assignment will be listed in the table with a set of scores indicating how close the chemical shifts of the atoms in that assignment are with the chemical shifts of the corresponding dimensions of the current peak. If three-dimensional coordinates of the atoms are available, and two of the peak dimensions correspond to hydrogen atoms, then additional information is given about the distance between the pair of hydrogen atoms. The columns of the table contain the following information (the column numbers given are for the example 3D peak list, and will differ for peak lists with a different number of dimensions).

The most probable assignments, based on the distances in the current structure set are those with the highest assignment contribution, as given in the last column of the table. The column is sorted by this value, so the most probable assignments are at top. After the table is populated a subset of the rows will be selected (highlighted in yellow in above example). This is the minimum set of assignments required for the sum of the assignment contribution to exceed a target value. This value, by default 1.0, can be set by entering a value in the CLimit field. The exponent applied to the distances when calculating the contributeions is by default 6.0, and can be set by entering a value in the Exp field.

Only those assignments where at least one structure has a distance less than a specified cutoff will be shown. By default, all assignments are shown. The cutoff can be set by entering a value in the CutOff field.

If the Spectrum1 or Spectrum2 checkboxes are activated then the region of the spectrum (of the dataset assigned to the peaklist) around the peak can be displayed in the panels on the right side of the PeakId panel. Double-clicking on an entry in the atoms list will refresh the display of the spectrum and draw the crosshair cursor at the position of the chemical shifts of the atoms in that entry.

NOE spectra are often collected so that the experimental sweep width is less, in the indirect dimensions, than the sweep width needed to include all the peak resonances. In this case the peaks will appear at a position folded by some number of sweep widths from their actual position. These peaks would not normally be matched with atoms whose chemical shifts are at their correct, unfolded, position. The peak identifier can be configured to search not only at the actual peak position, but at one or more folded chemical shifts. This can be configured using the dialog brought up by clicking Configure Folding.

The NMRViewJ user should themselves synthesize the information from the chemical shift deviations, hydrogen pair distances, and spectral display to reach a conclusion about which, if any, of the atom entries are the correct assignment for the peak. Selecting the entry and then clicking the Assign button will update the assignment labels for the current peak with the names of the atoms in that entry.

At any given point in a project it may not be possible to assign a unique set of atoms to a particular peak. Instead, because of overlap of chemical shifts there may be an ambiguous group of atoms that could give rise to the peak. NMRViewJ, supports the explicit assignment of an ambiguous set of atoms to a particular peak. This can be done within the PeakId panel by selecting more than one row of atom sets in the list and clicking the assign button. The entire group of atoms will be stored with the peak. These assignments can be viewed within the Peak Analysis panel by using the arrows adjacent to the peak label entry.