Week 10 HW: Advanced Imaging & Measurement Technology

Homework: Final Project

Individual Final Project

Waters Part I – Molecular Weight

1. Calculated molecular weight of eGFP

Based on the predicted amino acid sequence and known modifications (chromophore formation + His-tag + LE linker):

  • eGFP (239 aa, including initial Met): ~27,021 Da (monoisotopic)
  • LE linker: +242.13 Da
  • HHHHHH His-tag: +822.35 Da
  • Chromophore formation (loss of 2H₂O): -36 Da

Theoretical monoisotopic MW ≈ 28,049 Da

eGFP sequence eGFP sequence

2. Molecular weight using adjacent charge state method

Cálculo de MW - Método de estados de carga adyacentes Cálculo de MW - Método de estados de carga adyacentes

3. Accuracy of the measurement

Exactitud de la medición Exactitud de la medición

4. Can you observe the charge state for the zoomed-in peak?

No, it is not possible to determine the charge state from a single zoomed-in peak alone.

Why?
The mass spectrometer measures only the mass-to-charge ratio (m/z). For a single peak, infinite combinations of mass (M) and charge (z) yield the same m/z. To determine z, at least two adjacent peaks from the same charge envelope are required, using the formula shown in section 2 above.

Waters Part III — Peptide Mapping - primary structure

  1. How many Lysines (K) and Arginines (R) are in eGFP? Please circle or highlight them in the eGFP sequence given in Waters Part I question 1 above. (Note: adding the sequence to Benchling as an amino acid file and clicking biochemical properties tab will show you a count for each amino acid).

-> eGFP contains a total of: 20 lysines (K) and 7 arginines (R), for a total of 27 trypsin cleavage residues.

MVSKGEEL FTG VVPILVELDG DVNGHKFSVS GEGEGDATYG KLTLKFICTT GKLPVPWPTL VTTLTYGVQC FSRYPDHMKQ HDFFKSAMPE GYVQERTIFF KDDGNYKTRA EVKFEGDTLV NRIELKGIDF KEDGNILGHK LEYNYNSHNV YIMADKQKNG IKVNFKIRHN IEDGSVQLAD HYQQNTPIGD GPVLLPDNHY LSTQSALSKD PNEKRDHMVL LEFVTAAGIT LGMDELYKLE HHHHHH
  1. How many peptides will be generated from tryptic digestion of eGFP? Note: Use Figure 4 as a guide for the relevant parameters to predict peptides from eGFP.
  1. Based on the LC-MS data for the Peptide Map data generated in lab please use Figure 5a as a reference how many chromatographic peaks do you see in the eGFP peptide map between 0.5 and 6 minutes? You may count all peaks that are >10% relative abundance.

Using the criteria of counting only peaks with >10% relative abundance (comparing their height to the tallest peak in the chromatogram within the specified time window), I observe a total of 21 chromatographic peaks between 0.5 and 6 minutes. 19 peaks (if counting only those clearly separated) or 21 (if including small but visible shoulders like at 0.43 and 1.25). The most conservative and precise count based on a strict “>10% of max” interpretation is 21.

  1. Assuming all the peaks are peptides, does the number of peaks match the number of peptides predicted from question 2 above? Are there more peaks in the chromatogram or fewer?

The chromatogram shows FEWER peaks than the theoretical number of tryptic peptides from eGFP. This is expected due to very small peptides eluting before 0.5 min, co-elution of multiple peptides into single peaks, and some peptides being below the 10% abundance threshold or eluting after 6 minutes.

  1. Identify the mass-to-charge (m/z) of the peptide shown in Figure 5b. What is the charge (z) of the most abundant charge state of the peptide (use the separation of the isotopes to determine the charge state). Calculate the mass of the singly charged form of the peptide ([M+H]+) based on its m/z and z.

a) Mass-to-charge ratio (m/z):** 465.74 (most abundant isotope peak) b) Charge state (z):** 2+
Rationale: The spacing between adjacent isotopic peaks is approximately 0.5 Da. Since Δ(m/z) = 1/z, therefore z = 1/0.5 = 2. c) Mass of [M+H]⁺:**
[[M+H]^+ = (m/z \times z) - 1.0078 = (465.74 \times 2) - 1.0078 = 930.5 \text{ Da}]

  1. Identify the peptide based on comparison to expected masses in the PeptideMass tool. What is mass accuracy of measurement? Please calculate the error in ppm. ($\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}$) ($$\text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN}$$)

                                            `DGNYK` (from eGFP tryptic digest)
    
                                         930.45 Da (calculated using PeptideMass)
    
                                               930.50 Da (from Figure 5b)
    
                                               Mass accuracy (ppm error):
      \[\text{ppm error} = \frac{|930.45 - 930.50|}{930.45} \times 10^6 = \frac{0.05}{930.45} \times 10^6 \approx 54 \text{ ppm}\]
    
           The measurement has an error of approximately **54 parts per million**, which is excellent for a standard LC-TOF instrument.
    
  2. What is the percentage of the sequence that is confirmed by peptide mapping? see Figure 6

Based on Figure 6 (coverage map), 88% of the eGFP sequence (including the His-tag) was confirmed by peptide mapping. his means that peptides covering 216 out of 246 amino acids were detected and matched to the predicted sequence. The remaining 12% were not observed, likely due to:

  • Very small peptides eluting in the void volume (<0.5 min)
  • Hydrophobic peptides not eluting under the LC conditions
  • Peptides with poor ionization efficiency
  • Low abundance peptides below the detection threshold

Waters Part IV — Oligomers

We will determine Keyhole Limpet Hemocyanin (KLH)’s oligomeric states using charge detection mass spectrometry (CDMS). CDMS single-particle measurements of KLH allow us to make direct mass measurements to determine what oligomeric states (that is, how many protein subunits combine) are present in solution. Using the known masses of the polypeptide subunits (Table 1) for KLH, identify where the following oligomeric species are on the spectrum shown below from the CDMS (Figure 7):

7FU Decamer 8FU Didecamer 8FU 3-Decamer 8FU 4-Decamer

Figure 7: Mass spectrum of Keyhole Limpet Hemocyanin (KLH) acquired on the CDMS Figure 7: Mass spectrum of Keyhole Limpet Hemocyanin (KLH) acquired on the CDMS
Polypeptide Subunit NameSubunit Mass
7FU340 kDa
8FU400 kDa

*Table 1: KLH Subunit Masses

Using the known subunit masses from Table 1:

  • 7FU: 340 kDa
  • 8FU: 400 kDa

The following oligomeric species were identified on the CDMS spectrum:

SpeciesSubunitSubunitsCalculated MassPeak (observed)
7FU Decamer7FU103.4 MDa~3.4 MDa
8FU Didecamer8FU208.0 MDa~8.0 MDa
8FU 3-Decamer8FU3012.0 MDa~12.0 MDa
8FU 4-Decamer8FU4016.0 MDa~16.0 MDa

The spectrum shows a clear series of peaks corresponding to multimers of the 8FU subunit at approximately 4, 8, 12, and 16 MDa, with an additional peak at 3.4 MDa corresponding to the 7FU decamer.

Explanation:
The theoretical MW (28.006 kDa) was calculated using the eGFP sequence including the chromophore modification, LE linker, and His-tag. The observed MW (27.982 kDa) was deconvoluted from the intact LC-MS data. The resulting mass error of 857 ppm is larger than the typical <50 ppm required for confident protein identification. Possible reasons include:

  • Incomplete calibration of the mass spectrometer
  • Partial loss of the His-tag or linker during sample preparation
  • Alternative post-translational modifications not accounted for

Waters Part V — Did I make GFP?

Fill out this table with the data acquired from the lab work (or using the screenshots in this document):

TheoreticalObserved/measured on the Intact LC-MSPPM Mass Error
Molecular weight (kDa)28.04927.367~24,300

Note: The theoretical MW (28.049 kDa) includes the chromophore modification, LE linker, and His-tag. The observed MW (27.367 kDa) was calculated from the adjacent charge state method using peaks from Figure 1 (m/z 1520.4 and 1440.9). The PPM error is calculated as: [ \frac{|27,367 - 28,049|}{28,049} \times 10^6 \approx 24,300 \text{ ppm} ]