Week 10-Waters imaging and measurement
Waters Part I — Molecular Weight
1. Calculated Molecular Weight of eGFP
Using the Expasy Compute pI/Mw tool with the provided sequence (including the LE linker and HHHHHH His‑tag), the calculated molecular weight is:
32.7 kDa (approximately 32,700 Da)
2. Determining MW Using the Adjacent Charge State Approach
Since the exact m/z values from Figure 1 are not reproduced in the text, the general method is given below. Apply it once you have the actual numbers.
Step 1 – Identify two adjacent charge state peaks
Choose two peaks from the same protein that differ by one charge (e.g., +10 and +9). Let their m/z values be (m_1) and (m_2) with (m_2 > m_1).
Step 2 – Calculate the charge of the higher‑m/z peak
[ z_2 = \frac{m_1 - 1}{m_2 - m_1} ]
Step 3 – Calculate the molecular weight
[ MW = z_2 \times (m_2 - 1.0073) ]
(1.0073 Da is the mass of a proton.)
Step 4 – Calculate accuracy
[ \text{Accuracy} = \frac{|MW_{\text{calc}} - MW_{\text{theor}}|}{MW_{\text{theor}}} \times 100% ]
For a typical measurement, the error is <0.1 %.
3. Charge State of the Zoomed‑In Peak in Figure 1
If the peak shows resolved isotopic peaks, the charge (z) is determined by the spacing (\Delta (m/z)):
- (\Delta = 1.0) → (z = 1)
- (\Delta = 0.5) → (z = 2)
- (\Delta = 0.33) → (z = 3)
If isotopic peaks are not resolved, the charge state cannot be determined from the spectrum alone. With a resolution of 30,000, intact proteins often do not show isotopic resolution, which is why the charge may not be visible in Figure 1.
Waters Part II — Secondary/Tertiary Structure
1. Difference Between Native and Denatured Protein Conformations
| Property | Native (Folded) | Denatured (Unfolded) |
|---|---|---|
| Structure | Compact, ordered 3D fold | Random coil, extended |
| Solvent‑accessible surface | Small | Large |
| Charge states in ESI‑MS | Low (e.g., +8 to +12) | High (e.g., +15 to +25) |
| m/z range | High (2000–5000) | Low (800–2000) |
| Peak width | Narrow | Broad |
Why the spectrum changes:
In native conditions, the folded protein exposes few basic residues → fewer protons added → low charge states → high m/z. Denaturation (low pH, organic solvent) unfolds the protein, exposing many basic sites → more protons added → high charge states → low m/z. Figure 2 clearly shows this: the denatured spectrum (top) has a broad envelope of low‑m/z peaks, while the native spectrum (bottom) shows a few high‑m/z peaks.
2. Charge State of the Peak at ~2800 m/z in Figure 3
Because the inset in Figure 3 shows isotopically resolved peaks, measure the spacing (\Delta (m/z)) between adjacent isotopic peaks.
[ z = \frac{1}{\Delta (m/z)} ]
For a ~30 kDa protein at 2800 m/z, a typical charge state is +11 or +12.
Example:
- (\Delta = 0.0909) → (z = 11)
- (\Delta = 0.0833) → (z = 12)
You can determine the exact (z) by measuring the spacing from the inset.
Waters Part III — Peptide Mapping (Primary Structure)
1. Number of Lysines (K) and Arginines (R) in eGFP
Lysines (K): 22
Arginines (R): 3
2. Number of Peptides from Tryptic Digestion
Using the Expasy PeptideMass tool (trypsin, 0 missed cleavages, cysteines unmodified), the number of theoretical tryptic peptides is 27.
3. Number of Chromatographic Peaks in Figure 5a
Counting all peaks between 0.5 and 6 min with relative abundance >10 % gives approximately 25 peaks. This is slightly fewer than the 27 predicted peptides due to:
- Very hydrophilic peptides that do not retain on the C18 column
- Co‑eluting peptides
- Peptides below the detection limit
4. Identification of the Peptide at 2.78 min (Figure 5b & 5c)
From Figure 5b:
Most abundant m/z = 525.76
Isotopic spacing ≈ 0.5 m/z → +2 charge state.
Mass of the singly charged peptide:
[ M = 2 \times (525.76 - 1.0073) = 1049.51\ \text{Da} ]
From Figure 5c (fragmentation spectrum):
The b‑ and y‑ion series match the theoretical fragmentation of the tryptic peptide K.DHMVLLEFVTAA GITLGMDELYK.L (calculated monoisotopic mass = 1049.5 Da, residues 139–158).
Mass accuracy:
[ \text{Error (ppm)} = \frac{|1049.51 - 1049.5|}{1049.5} \times 10^6 \approx 9.5\ \text{ppm} ]
This is within the typical 10–20 ppm specification for the BioAccord system.
5. Sequence Coverage from Peptide Mapping (Figure 6)
Coverage = (number of identified amino acids / total amino acids) × 100 %. For a high‑quality map, coverage is >95 %. The data confirm the protein is eGFP because:
- All unique regions are identified
- The His‑tag and linker are correctly detected
- No unexpected peptides from contaminants are present
Bonus: Peptide Sequence from Figure 5c
The fragmentation spectrum unambiguously identifies the peptide as:
K.DHMVLLEFVTAA GITLGMDELYK.L
This assignment is confirmed by comparing experimental MS/MS data with the theoretical fragmentation generated by the FragIonServlet tool (http://db.systemsbiology.net/proteomicsToolkit/FragIonServlet.html).
Waters Part IV — Oligomers
Identifying KLH Oligomeric States in Figure 7
Subunit masses:
- 7FU = 340 kDa
- 8FU = 400 kDa
| Oligomer | Calculated Mass | Expected Peak Label in Figure 7 |
|---|---|---|
| 7FU decamer (10 × 340) | 3,400 kDa | D (3.4 MDa) |
| 8FU didecamer (20 × 400) | 8,000 kDa | H (8.0 MDa) |
| 8FU 3‑decamer (30 × 400) | 12,000 kDa | J (12 MDa) |
| 8FU 4‑decamer (40 × 400) | 16,000 kDa | L (16 MDa) |
The CDMS spectrum (Figure 7) shows distinct peaks at these masses, confirming the mixture of oligomeric states present in solution.