Jump to contentJump to page navigation: previous page [access key p]/next page [access key n]
Applies to MineXpert3 11.4.2

5 Mass Spectral Deconvolutions

When analysing a mass spectrum, there are two available deconvolution modes:

Deconvolutions are performed in order to get back to the Mr mass of the analyte while reading m/z values. In the following sections, all the avaiable deconvolution processes are described.

Before delving into the deconvolutions, it is necessary to present two menu options that are found in the plot widgets contained in the Mass spectra window: the menu items under the Centroidation and deconvolutions menu (Figure 5.1, “Mass spectrum plot widget-specific deconvolution menu” ).

Mass spectrum plot widget-specific deconvolution menu

The two menu items are needed to configure the mouse-based deconvolution of mass spectra.

Figure 5.1: Mass spectrum plot widget-specific deconvolution menu

These two menus allow one to set parameters for the manual deconvolutions (see text for details).

5.1 Deconvolution based on charge state

In this kind of deconvolution, at the present time, the software assumes that the ionization agent is the proton and that the ionization is positive.

The deconvolution is based on the determination of the distance between two peaks —consecutive or not— of a given charge state envelope. When the user left-mouse-button-click-drags the cursor from one peak to another, the program tries to calculate if the distance between two peaks matches one or more charge difference(s). If so, it computes the molecular (Mr [12C]-relative molecular mass) mass of the analyte.

Note
Note: The mouse drag movement is significant

The Mr value that is computed is for the analyte below the mass peak at which the mouse drag moment operation started.

Top panel of Figure 5.2, “Charge state-based mass deconvolution” shows the charge state envelope-based deconvolution process for a protein of Mr≍8566 Da. In the top panel, the deconvolution has involved two consecutive peaks ( span> is 1). The mouse drag movement occurred from left to right. Thus, the m/z value chosen for the computation is that under the left peak (start point of the movement). The status line at the bottom of the plot indicates the selection range, the delta movement on the x-axis, the computed charge, the Mr value chosen for the calculation (the Mr value at the start of drag movement) and finally the calculated Mr value.

This two consecutive mass peaks method is the default method. However, it might happen that no clearly visible mass peak is available around one nice peak that might be chosen as the start of the mouse drag operation. In this case it is possible to define a different span> between two peaks elected for the deconvolution (see Figure 5.1, “Mass spectrum plot widget-specific deconvolution menu” ). In the figure, than span has been set to 2, which means that the mouse drag movement encompasses three mass peaks: the movement start peak, one peak in the middle and finally the movement stop peak (the span is thus of two intervals between the extreme peaks).

The bottom panel of the figure now displays the same Mr value for the protein even if the span is now of two intervals.

Charge state-based mass deconvolution

Deconvolution approach using two peaks belonging to the same charge state envelope. The top deconvolution involves two consecutive mass peaks (peak span value is 1). The bottom deconvolution involves two non-consecutive peaks (peak span value is 2). The Mr value, expectedly, did not change whatever the configured span.

Figure 5.2: Charge state-based mass deconvolution
Note
Note

The charge calculation, which is at the heart of the deconvolution, almost never produces an integer value with no fractional part (say, charge z=15.0) because it is almost impossible to drag the mouse cursor the exact number of pixels that would match a m/z range leading to such an integral charge value. Almost always, the charge that is calculated looks like 14.995 or 15.001, for example. This is due to the fact that the mouse moves at discrete positions on the screen and these positions might be more or less far apart, depending on the mouse capabilities and on the current zoom factor over the mass spectrum region of interest.

It is advised to zoom-in as much as possible over the peaks at hand so as to minimize the difficulties above. It may happen, however, that even zoomed-in peaks are not sufficiently distant to allow a charge calculation. In this case, reduce the stringency over the fractional part that is allowed in the charge (see menu item Set charge minimal fractional part at Figure 5.1, “Mass spectrum plot widget-specific deconvolution menu” ). By default, the stringency is set at 0.99, that is, any calculated value that has a fractional part either superior or equal to 0.99 or inferior or equal to 0.01 would lead to a successful round-up/round-down to the nearest integer value. Outside of the [0.01-0.99] interval, no charge calculation is performed and thus no deconvolution is performed. When the stringency is too high, reducing it will allow the deconvolution to be carried-over. General experience is that setting that value to 0.997 is fine for most situations and provides very reliable results.

5.2 Deconvolution based on isotopic cluster peaks

In this kind of deconvolution, the user left-mouse-button-click-drags the cursor between the first two peaks (when possible) of the isotopic cluster. The charge state of the ion is the inverse of the distance between the two consecutive peaks (that is, the m/z delta value). Figure 5.3, “Isotopic cluster-based mass deconvolution” shows that deconvolution process at work.

Isotopic cluster-based mass deconvolution

Top panel: deconvolution involving a left-to-right mouse drag movement. Bottom panel: deconvolution involving a right-to-left mouse drag movement. The calculated Mr values differ.

Figure 5.3: Isotopic cluster-based mass deconvolution
Note
Note: The mouse drag movement start position is significant

The left-mouse-button-click-dragging direction (left→right or right→left) determines the final Mr that is computed because that value is calculated for the peak under the mouse when the mouse drag movement is initiated. This is visible in the two panels of Figure 5.3, “Isotopic cluster-based mass deconvolution” , where the top panel shows the Mr computed for the left peak and the bottom panel shows the Mr computed for the right peak. Since the ion is monocharged, the difference is 1 Da.

This is a significant departure from the previous versions, where the postulate was that the single real peak of interest in an isotopic cluster was the left-most monoisotopic peak. Since this software has been used by scientists in research projects using almost 100 % labelled bacteria (with [13C] and [15N]), that concept has become moot. Indeed, analytes from these bacteria have their monoisotopic peak at the far right end of the isotoopic cluster.

The new behaviour allows the scientists to compute the Mr value of the peak of interest in an isotopic cluster, be that for a non-labelled or for a labelled analyte. See the following articles as examples of heavy isotope almost full labelling of bacteria.

Heavy isotope labeling and mass spectrometry reveal unexpected remodeling of bacterial cell wall expansion in response to drugs. Atze H, Liang Y, Hugonnet JE, Gutierrez A, Rusconi F, Arthur M. Elife, 2022, doi: 10.7554/eLife.72863, PMID: 35678393 .

Peptidoglycan-tethered and free forms of the Braun lipoprotein are in dynamic equilibrium in Escherichia coli. Liang Y, Hugonnet JE, Rusconi F, Arthur M. Elife, 2024, doi: 10.7554/eLife.91598, PMID:39360705 .

(p)ppGpp modifies RNAP function to confer β-lactam resistance in a peptidoglycan-independent manner. Voedts H, Anoyatis-Pelé C, Langella O, Rusconi F, Hugonnet JE, Arthur M. Nat Microbiol, 2024, PMID:38443580 .

5.3 Automated deconvolution based on isotopic cluster peaks

The automated deconvolution that is based on the isotopic cluster peaks is designed to be reliable for low-mass analytes (typically below 5 kDa). The process involves two separate steps: first, the centroids are extracted from the mass spectrum (see Section 4.1.7, “ Centroid Extraction from Mass Spectra ”); second, the extracted centroids are used for the deconvolution proper. The deconvolution is a highly complex process that would require extensive knowldedge to understand. The main concepts are the following:

  • Iterate in each observed centroid (from smallest (m/z,i) values to greatest) of the input centroided mass spectrum and, for each centroid, assume that is has a given charge (in the charges range provided in the configuration; see below). With that assumed charge, compute a neutral analyte mass.

  • The neutral analyte mass is then used to construct a theoretical neutral averagine-based isotopic cluster. Each centroid of the isotopic is then taken in sequence and subject to a matching process.

  • The matching operation takes place by first converting back the theoretical neutral analyte cluster centroids to a charged analyte m/z value (same charge as that initially used to craft the neutral mass); second, the m/z value is searched for in the input mass spectrum centroids. When an input spectrum m/z value matches that of the theoretical isotopic cluster, a match is stored.

  • When all the centroids of the theoretical isotopic cluster have been iterated into, the set of found matches is checked for reliability (typically, there must have been at least three matches in the cluster, albeit not necessarily contiguous). Other checks are performed also. If the checks are all positive, then the match is stored for the next step.

  • At then end of the processing of all the input centroids, all the found matches are processed in order to consolidate them to a neutral mass. Each neutral mass is listed along with all the m/z value that was actually matched or the set of m/z values corresponding to different charges of the same analyte molecule.

The low mass deconvolution algorithm can be tightly configured using a large set of parameter settings that are detailed below and that are accessible via the application preferences window shown in Figure 5.4, “” and detailed below.

Interface to the configuration of the low mass deconvolution process.

Figure 5.4:
  • Min. charge (Max. charge): the charge range that is explored.

  • Mass tolerance (ppm): the tolerance with which all the mass matches are performed. A typical value is 20.

  • Min. mass (Max. mass): the neutral mass range that is explored.

  • Min. intensity: the intensity value below which input mass spectrum centroids are discarded.

  • Explained intensity threshold: Once the matches between the centroids in the theoretical isotopic cluster and those in the input centroided mass spectrum have been performed, the reliability of these matches is checked like so: select the monoisotopic centroid and its contiguous centroid in both the theoretical isotopic cluster and in the matched centroided mass spectrum. Compute the ratio between the intensities of these centroids. That ratio is used to normalize the theoretical centroids intensities so they can be compared to the matched input mass spectrum centroid intensities. Compute the sum of the matched centroids intensities for all the matches (input mass spectrum vs theoretical). Compute the [input / theoretical] summed intensities ratio, which reflects the 'explained intensity'. The greater the ratio, the better the intensity correspondence between the theoretical and experimental isotopic clusters. That ratio value is required to be greater than the value set for this parameter for the matches to be considered reliable. Typical values: 0.5-0.7.

  • Relative cluster shape tolerance: Once the matches between the centroids in the theoretical isotopic cluster and those in the input centroided mass spectrum have been performed, the reliability of these matches is checked like so: get the monoisotopic (highest probability centroid, isotopologue 0) match. Get the isotopologue 1 match. For each one of these two matches compute the ratio of the intensities for the theoretical isotopic cluster data and for the input centroided mass spectrum (observed data) This yields the following intensity ratios: [ theor_ratio = theor. intensity iso 1 / theor. intensity iso 0 ] and [ obser_ratio -> obser. intensity iso 1 / obser. intensity iso 0 ]. Compute the ratio [ abs(obser_ratio - theor_ratio) / theor_ratio ] which amounts to a 'normalized difference'. If the ratio above is smaller than this parameter setting, then the matches are considered to be reliable. Typical values: 0.6-0.7.

  • Min Pearson corr. score: Once the matches between the centroids in the theoretical isotopic cluster and those in the input centroided mass spectrum have been performed, the reliability of these matches is checked like so: A Pearson correlation factor is computed to check that the observed and theoretical isotopic cluster centroids are correlated by their intensity. If the obtained correlation factor is greater or equal to this setting, then the matches are considered to be reliable. Typical values: 0.7-0.90.

  • Neutral averagine formula: This the averagine formula to be used to model the isotopic clusters starting from a neutral analyte mass.

  • Isotopic data file: The path name of the file holding the isotopic data. This is a file having the following format (removed data for clarity):

                      1,hydrogen,H,1,1.007825,1,0,0.999884,-0.000115,0
                      1,hydrogen,H,1,2.014101,2,1,0.0001157,-9.0644,0
                      2,helium,He,2,3.0160293,3,0,0.00000,-13.5206046 0 [
                      ... ] 6,carbon,C,6,12.00000,12,0,0.9892119,-0.0108466,0
                      6,carbon,C,6,13.003354,13,1,0.0107,-4.529315,0
                      7,nitrogen,N,7,14.0030,-0.003648,0
                      7,nitrogen,N,7,15.00010,15,1,0.0036419,-5.615226,0
                      8,oxygen,O,8,15.99491,16,0,0.9975676 -0.00243,0
                      8,oxygen,O,8,16.999131,17,1,0.00038093,-7.872715,0
                      8,oxygen,O,8,17.999159,18,2,0.0020513,-6.189236,0
    Note
    Note

    It is possible to use an isotopic data file that lists heavy isotopes as the most probable isotopes for any chemical element. In this manual author's research, bacteria are fully labelled with the heavy stable isotopes of carbon and nitrogen, which produces peptidoglycan structural elements that display as inverted isotopic clusters (with respect to natural isotopic abundance analytes). The deconvolver works also in this case by only detecting the input mass spectrum isotopic cluster having the proper inverted shape, and reports the analytes with the diagnostic ions at monoisotopic m/z values corresponding to the highest value in the isotopic cluster that was correctly recapitulated by the averagine-based theoretical isotopic cluster. This means that whatever the isotopic abundance configuration, the reported ions are correct: their monoisotopic m/z value is reported.

  • The last field may be used to indicate the path name to a file where all the processing details are stored. Perusing these data might be interesting when the deconvolution results do not correspond to expected results.

When the deconvolution process has ended, the results are shown in the console window.

5.4 Reading the resolving power based on mass spectral data

When left-mouse-button-click-dragging the mouse cursor between two mass spectrum locations of interest, the program computes the apparent resolving power. This process is shown on Figure 5.5, “Calculation of the resolving power” , where the resolving power is calculated by dragging the mouse cursor from one edge of a peak to the other at half maximum height (this is called full width at half maximum [FWHM] resolution).

Calculation of the resolving power

Click-dragging the mouse cursor will trigger the calculation of the resolving power of the instrument. That value is printed in the status bar.

Figure 5.5: Calculation of the resolving power
Print this page