Week 5 HW:Protein-design-part-ii

Part 1: Generate binders with PepMLM

Step 1 — Get the mutant SOD1 sequence

Original SOD1 sequence from Uniprot

sp|P00441|SODC_HUMAN Superoxide dismutase [Cu-Zn] OS=Homo sapiens OX=9606 GN=SOD1 PE=1 SV=2 MATKAVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFHVHEFGDNTAGCTS AGPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTLVV HEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQ

A4V mutant SOD1 sequence

MATKVVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFHVHEFGDNTAGCTS AGPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTLVV HEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQ

Step 2 — Run PepMLM

Used google colab notebook for the step.

https://colab.research.google.com/drive/1j4HO5MPdIlCLZFvqYnjy_Ug--8oxTB9Z#scrollTo=VtfbXYndhyle

#BinderPseudo Perplexity
1WLSYPVVLEWGE16.4334209
2WHYYVVAVRWKE31.112519
3WHSYAAAAALWE12.9540785
4WLYYAVGLAWKX14.2779037
5FLYRWLPSRRGGXXXX

(lower perplexity = the model is more confident the peptide is a good binder)

Part 2: AlphaFold3 structure prediction

For each result, I looked at the below: The ipTM score (higher is better, closer to 1.0 means confident binding prediction)

BinderPseudo PerplexityipTM score
WLSYPVVLEWGE16.43342090.21
WHYYVVAVRWKE31.1125190.35
WHSYAAAAALWE12.95407850.27
WLYYAVGLAWKX14.2779037Could not be generated because it had a letter X
FLYRWLPSRRGGXXXX0.3

The ipTM score (interface predicted TM-score) measures how confidently AlphaFold3 predicts the two chains interact. It ranges from 0 to 1. Below 0.4 = poor 0.4–0.6 = moderate Above 0.6 = good

Where the peptide appears to dock (look at the 3D viewer — is it near position 4 at the N-terminus? At the dimer interface? Surface-exposed?)

I have a question on the fact that my second binder has a high Pseudo Perplexity value of 31 therefore the assumption is that the binding confidence level is very low. However through alphafold it has the best ipTM score suggesting that it had the best confident binding prediction

Part 3: PeptiVerse therapeutic properties

BinderSolubilityHemolysisBinding AffinityLengthMolecular WeightNet Charge (pH 7)Isoelectric PointHydrophobicity (GRAVY)
WLSYPVVLEWGESoluble - 1.00Non-hemolytic 0.112Weak binding - 5.952121477.7-2.234.240.26
WHYYVVAVRWKESoluble - 1.00Non-hemolytic 0.069Weak binding - 6.345121635.90.858.5-0.42
WHSYAAAAALWESoluble - 1.00Non-hemolytic 0.035Weak binding - 5.863121375.5-1.155.470.18
WLYYAVGLAWKXSoluble - 1.00Non-hemolytic 0.115Medium binding - 7.101121351.80.768.50.56

the best one balances strong binding + not hemolytic + good solubility.

Part 4: moPPIt optimized design