Week 5 HW: Protein Design Part II

Part 1: Generate Binders with PepMLM 🐧

Human SOD1 Sequence from UniProt (154 amino acids): https://www.uniprot.org/uniprotkb/P00441/entry https://www.uniprot.org/uniprotkb/P00441/entry#sequences

Click to view Human SOD1 Sequence
MATKAVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFHVHEFGDNTAGCTSAGPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTLVVHEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQ
Click to view Human SOD1 Sequence with A4V mutation which causes ALS
MATKVVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFHVHEFGDNTAGCTSAGPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTLVVHEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQ

I had to manually reset the code as I was not able to change the parameters with the sliding scales. I was stuck with generating a single 15 peptide long binder at a time. Thankfully I was able to change this by editing the back end of the form, but also by forcing the code to make 4 binders that were 12 peptides long.

Here is a table with the binders ranked and compared against a known binder:

RankPeptide SourceSequencePseudo Perplexity
1Reference (Experimental)FLYRWLPSRRGG2.2833
2PepMLM (Candidate 0)KLVPAVVLAHKX7.4714
3PepMLM (Candidate 1)KRSYPTALRHWX10.1367
4PepMLM (Candidate 2)WRYPVAABHGK11.0383
5PepMLM (Candidate 3)WHVYVVGLRHKE25.8914

The perplexity metric measures how perplexed or “surprised” as it were, a model is by a sequence. Hence a lower score represents higher model confidence or predicted affinity. Here, the known binder FLYRWLPSRRGG acts as a benchmark, scoring 2.28 on the pseudo perplexity rating, which is significantly lower than the newly generated designs. As you can see, I have ranked the binders in order of their respective perplexity ratings.

Part 2: Evaluate Binders with AlphaFold3 🦐
RankJob NameipTMpTMPrimary Binding LocationTarget Engagement
1SOD1 and KLVPAVVLAHK0.580.82N-terminus GrooveHigh (Pocket)
2SOD1 and WHVYVVGLRHKE0.490.81Upper Ξ²-barrel RidgeModerate (Surface)
3SOD1 and KRSYPTALRHW0.440.90Ξ²-barrel LoopsModerate (Surface)
4SOD1 and WRYPVAABHGK0.390.83Lower Dimer InterfaceLow/Mod (Surface)
5SOD1 and FLYRWLPSRRGG (Ref)0.260.81Surface LoopsLow (Transient)

Key

Confidence LevelpLDDT RangeEmojiCorresponding Color
Very HighpLDDT > 90πŸ’™Dark Blue
Confident90 > pLDDT > 70🩡Light Blue (Cyan)
Low70 > pLDDT > 50πŸ’›Yellow
Very LowpLDDT < 50🧑Orange

Protein-peptide complex Models using AlphaFold3 and Residue Alignment Charts (Green)

They are ordered according to their ipTM score, with the first (KLVPAVVLAHK) having the greatest score (0.58) etc Please refer to the relevant table above if necessary

SOD1 & KLVPAVVLAHK

SOD1 & WHVYVVGLRHKE

SOD1 & KRSYPTALRHW

SOD1 & WRYPVAABHGK

SOD1 & FLYRWLPSRRGG - Reference

Alignment: SOD1 & KLVPAVVLAHKX

Alignment: SOD1 & WHVYVVGLRHKE

Alignment: SOD1 & KRSYPTALRHW

Alignment: SOD1 & WRYPVAABHGK

Alignment: SOD1 & FLYRWLPSRRGG - Reference

So the AlphaFold 3 modelling confirmed the potential of the peptides I designed using PepMLM. Interestingly all four peptides that I generated using the model outperformed FLYRWLPSRRGG - the known experimental reference, in terms of ipTM (interface confidence). This was despite it having the lowest pseudo perplexity score. Candidate 0 (KLVPAVVLAHK) emerged as having the greatest ipTM score with 0.58; it also had the lowest perplexity score of 7.4714. It’s high ipTM score demonstrates its unique ability to dock deeply within the N-terminus groove where the A4V (ALS causing) mutation of SOD1 sits. The other peptides showed varying affinities for the Ξ²-barrel and dimer interface.

The second highest performing binder was Candidate 3 (WHVYVVGLRHKE) - which had an ipTM score of 0.49, though it had the highest pseudo perplexity score of 25.8914. This means that it binds quite well to the mutated SOD1 though it is less likely to be found in nature than the other peptides.

Part 3: Evaluate Properties of Generated Peptides in the PeptiVerse πŸͺ

The first thing I noticed was that the PeptiVerse Website Interface is so cute and easy to use. Wow! Pranam and his team really popped off with this one πŸ‘πŸΌ so hats 🎩 🧒 πŸ‘’ off to them!

So kawaii!

In the search for a peptide that can stabilize the SOD1 proteinβ€”a primary target in ALS research we move from structural modeling in AlphaFold 3 to therapeutic profiling in PeptiVerse. While a 3D model shows us how a peptide looks, these 11 metrics provided by PeptiVerse tell us how it will behave in a human body. Here is the result of testing the four PepMLM-designed candidates against a known reference binder (FLYRWLPSRRGG) in order of ipTM score (highest to lowest):

Metric / PropertyKLVPAVVLAHKWHVYVVGLRHKEKRSYPTALRHWWRYPVAABHGKFLYRWLPSRRGG (Ref)
🐊 ipTM (Structural)0.580.490.440.390.26
πŸ’§ Solubility1.0001.0001.0001.0001.000
πŸ”¬ Permeability0.2420.1430.8490.3590.862
🩸 Hemolysis0.0320.0520.0220.0100.047
πŸ‘― Non-Fouling0.2850.2970.5490.4800.666
⏱️ Half-Life (hrs)0.4380.4120.3420.3390.310
πŸ”— Binding (pKd)5.5285.9195.9655.3005.968
πŸ“ Length (aa)1112111112
βš–οΈ Mol. Weight (Da)1174.51522.81414.61166.51507.7
⚑ Net Charge (pH 7)+1.59+0.94+2.85+1.85+2.76
🎯 Isoelectric Point10.008.6011.009.9911.71
πŸ’¦ GRAVY (Hydrophobicity)1.02-0.38-1.44-0.73-0.71

The results revealed a fascinating trade-off between structural integrity and medicinal viability!

As we know Candidate 0 (KLVPAVVLAHK) had the greatest ipTM score of 0.58, meaning that AlphaFold3 has great confidence in its structural fit to the mutated SOD1. However, based on PeptiVerse’s therapeutic metrics, Candidate 1 (KRSYPTALRHW) comes out on top though it had a lower ipTM score of 0.44.

Why is this?

Candidate 1 (KRSYPTALRHW) had the second lowest pseudo perplexity score of the generated peptides with 7.4714, meaning that it is reasonable to assume that it could occur in nature. Regarding the generated peptides it achieved the highest predicted affinity score with 5.965 pKd, as well as the best permeability score of 0.849, suggesting that it could actually reach the mutated SOD1 proteins inside cells. It also has the highest positive net charge of all the peptides (including the reference), with a score of +2.85 which makes it uniquely suited to crossing the blood-brain barrier and binding to the negatively charged aggregates of mutant SOD1.

Therefore I would advance Candidate 1 (KRSYPTALRHW).

Part 4: Generate Optimized Peptides with moPPIt πŸ§ΌπŸ§½πŸ«§πŸ’¦πŸ§ΉπŸͺ£
RunSequenceAffinity (pKd)SolubilitySpecificityMotif ScoreHemolysis
#1RFKCIVKVMVRR8.8810.5000.6150.5530.944
#2KRLQLYRKKCAE7.1930.7500.7370.6340.964
#3QRACDYFRDDED7.7830.8330.6790.0590.895
#4KEKEGPCWESEK7.3600.8330.8710.0020.962
Part 5: L-Protein Mutants ☣️

Lysis Protein Sequence from UniProt (75 amino acids): https://www.uniprot.org/uniprotkb/P03609/entry

Click to view Lysis Protein Sequence
METRFPQQSQQTPASTNRRRPFKHEDYPCRRQQRSSTLYVLIFLAIFLSKFTNQLLLSLLEAVIRTVTTLQQLLT

DnaJ Protein Sequence from UniProt (376 amino acids): https://www.uniprot.org/uniprotkb/P08622/entry

Click to view Lysis Protein Sequence
MAKQDYYEILGVSKTAEEREIRKAYKRLAMKYHPDRNQGDKEAEAKFKEIKEAYEVLTDSQKRAAYDQYGHAAFEQGGMGGGGFGGGADFSDIFGDVFGDIFGGGRGRQRAARGADLRYNMELTLEEAVRGVTKEIRIPTLEECDVCHGSGAKPGTQPQTCPTCHGSGQVQMRQGFFAVQQTCPHCQGRGTLIKDPCNKCHGHGRVERSKTLSVKIPAGVDTGDRIRLAGEGEAGEHGAPAGDLYVQVQVKQHPIFEREGNNLYCEVPINFAMAALGGEIEVPTLDGRVKLKVPGETQTGKLFRMRGKGVKSVRGGAQGDLLCRVVVETPVGLNERQKQLLQELQESFGGPTGEHNSPRSKSFFDGVKKFFDDLTR