Week-05-hw-protein-design-part-2

Homework: Protein Design II

Part A: SOD1 Binder Peptide Design (From Pranam)

Part 1: Generate Binders with PepMLM

1.Begin by retrieving the human SOD1 sequence from UniProt (P00441) and introducing the A4V mutation.

MATKVVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFHVHEFGDNTAGCTSAGPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTLVVHEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQ

2.Using the PepMLM Colab linked from the HuggingFace PepMLM-650M model card:

3.Generate four peptides of length 12 amino acids conditioned on the mutant SOD1 sequence.

4.To your generated list, add the known SOD1-binding peptide FLYRWLPSRRGG for comparison.

5.Record the perplexity scores that indicate PepMLM’s confidence in the binders.

Part 2: Evaluate Binders with AlphaFold3

1.For each peptide, submit the mutant SOD1 sequence followed by the peptide sequence as separate chains to model the protein-peptide complex.

3.Record the ipTM score and briefly describe where the peptide appears to bind. Does it localize near the N-terminus where A4V sits? Does it engage the β-barrel region or approach the dimer interface? Does it appear surface-bound or partially buried?

In all the figures the peptide appears to bind to the β-barrel region or approach the dimer interface. It appears surface bound.

4.In a short paragraph, describe the ipTM values you observe and whether any PepMLM-generated peptide matches or exceeds the known binder.

The above image of known binder shows ipTM value of 0.38 and pTm of 0.81.

The ipTm values observed for otehr binders only one comes close to ipTMof 0.31 and pTm of 0.87.The rest are all binders have higher ipTm in the range of 051-0.60.The (KRYYAAAVELWK) due to its balanced high ipTM, affinity, solubility, and specificity for SOD1 A4V—ideal for therapeutic prototyping in cell-free systems or biosensor integration may be a good hit. None deeply penetrate pockets, typical for PepMLM 12-mers on structured targets like SOD1.

Part 3: Evaluate Properties of Generated Peptides in the PeptiVerse

Structural confidence alone is insufficient for therapeutic development. Using PeptiVerse, let’s evaluate the therapeutic properties of your peptide! For each PepMLM-generated peptide:

1.Paste the peptide sequence.

2.Paste the A4V mutant SOD1 sequence in the target field.

3.Check the boxes

1,Predicted binding affinity

2.Solubility

3.Hemolysis probability

4.Net charge (pH 7)

5.Molecular weight

Compare these predictions to what you observed structurally with AlphaFold3. In a short paragraph, describe what you see. Do peptides with higher ipTM also show stronger predicted affinity? Are any strong binders predicted to be hemolytic or poorly soluble? Which peptide best balances predicted binding and therapeutic properties?

The iPTM scores and binding affinity are related, the highter the score the higher the bonding affinity.I could not find any strong binder only found medium binding binders. All of them were soluble and non-haemolytic.

Choose one peptide you would advance and justify your decision briefly.

I would like to go with KRYYAAAVELWK binder as it has 7.168 binding affinity, higher than all four. Has high specificity for target SOD1 mutant protein. It has high iPTM score of 0.58.

Part 4: Generate Optimized Peptides with moPPIt

4.After generation, briefly describe how these moPPit peptides differ from your PepMLM peptides. How would you evaluate these peptides before advancing them to clinical studies?

Higher values (closer to 1) indicate better performance; affinity measures binding likelihood to the target motif, the specificty of binder 1 is best compared to the other two.

Evaluating peptides like KKKRGGST and TEKVQAGCF from moPPIt for clinical advancement requires a rigorous, multi-stage preclinical pipeline to confirm computational predictions and ensure safety/efficacy.Sequences can be refined using ProteinMPNN for improved affinity/specificity.

Part B: BRD4 Drug Discovery Platform Tutorial (Gabriele)

Part 0: Sign-up to Boltz Lab

Part 1: Structural Predictions in the Sandbox

Running Your Three Predictions Navigate to the Boltz Sandbox at lab.boltz.bio and log in to your account.

  1. Go to Sandbox → New Prediction

  2. Name this BRD4 binder JQ1

  3. Select ‘Complex’, add ‘Sequence from RCSB’, and add 3MXF

  4. Continue through Constraints (not needed for this example), and select Jq1 as the Binder for an affinity prediction.

  5. Submit the prediction.

  6. Use the ‘Duplicate Prediction’ in the results review, and remove the small molecule.

  7. Add in the SMILES for the Hit and Lead.

  8. When predictions complete, record your results in the table below

Does Binding Confidence increase as you move from hit to clinical candidate? What would you expect, and why might it deviate?

Yes the binding confidence increases from hit to clinical candidate. The goal of drug discovery is to incraese the binding affinity and selectivity of the drug without comprimising on efficacy,potency, pharmocological properties, its a balance of all tehse properties.

It might deviate due to the in vitro environment assay not matching with teh complex environment in body and due to teh balancing of all drug properties liek binding strength, potency etc.

Inspect the predicted binding pose for JQ1. Can you identify potential key binding interactions.

The approach is there towards the helices of the BRD4 domain.

Compare the Optimization Scores. How do the scores compare for JQ1 vs the Lead.

The optimization scores are 0.27 for lead and 0.44 for JQ1. The difference in teh values tell us that the lead has initial activity but compared to JQ1 requires more modifications for binding affinity, selectivity and pharmocological properties.

Part 2: Setting Up a BRD4 Design Project

Now you will create a small molecule Design Project - the Boltz Lab workflow for virtual screening and lead optimisation. We will set up BRD4 as a target using the clinical candidate as our structural reference. 2.1 Creating the Target

  1. From the dashboard, create a Design Projects via ‘New Project’
  2. Name your project: ‘BRD4 Workshop '
  3. Select ‘Small Molecule’
  4. Click Add Target and add the protein structure as in the Sandbox using PDB code 3MXF
  5. Continue and let the apo structure complete. Continue if the structure looks good.
  6. Leave binding residue selection blank, the platform will auto-detect the pocket
  7. In the Molecular Probe field, paste the JQ1 SMILES.
  8. Predict Pocket Structure and complete the Target Set-Up

Part 3: Running Your Virtual Screen

Run a Generative Design Campaign We will utilize the Boltz Lab small-molecule generative workflow. This generates novel molecules optimised for BRD4 binding using Boltz-2 as the scoring function.

  1. After creating the design project, Boltz Lab will prompt you to Generate binders with AI.
  2. Name your experiment, provide a relevant hypothesis, and Create the Experiment.
  3. The New Virtual Screen will be pre-configured with a Generative screen using the Enamine REAL space.
  4. Keep ‘Normal Filtering’ selected. This will ensure we only generate molecules acceptable to a medicinal chemist.
  5. Decide if you would like to apply any Molecule Filters. We recommend the ‘Drug-Like’ Preset.
  6. Select a custom number of Binders and enter 1K.
  7. Start the Virtual Screen.
  8. Allow binders to be generated, and View Results in Experiment

Part 4: Analysis and Discussion

As your experiment completes, use the ‘Quick Add Candidates’ on the experiment screen to add JQ1 as a benchmark for generated designs.

Interpreting Your Results

As your experiment completes, use the ‘Quick Add Candidates’ on the experiment screen to add JQ1 as a benchmark for generated designs. From your screen output, identify three categories of molecules:

There is only 1 candiadate as high confidence binder SM-Z2EJ9VV7

22 of them are in teh moderate confindence binder category, and rest in low confidence / no binders category.

Discussion: As the virtual screen completes, assess the following: • How does JQ1 in the Design Project screen alongside the library. Does it score as the top compound?

Binding confidence is 0.96 as compared to 0.84 of the candidate compound.

Yes it scores as the top compound.

• How do the top scoring binders compare in binding pose to JQ1?

The binding pose of the top binder is similar to JQ1.Key differences include denser packing and fewer disordered loops (less yellow) in the high candidate, suggesting improved prediction reliability. Perceptual image similarity is moderate, confirming related but distinct poses.

• Try adding a second target to your project via the dropdown in the structure viewer, for example, BRD2 (PDB: 5UEN). Re-run the top scoring binders against BRD2 and compare which compounds score highly for BRD4 but not BRD2. This is a selectivity analysis - a key part of real BET inhibitor programs.

The top binder for BRD4 binds with binding confidence of 0.30 to the BRD2 domain.

SM-MMS4D6XV is the top binder for BRD2 domain but binding confidence of same is 0.43 with the BRD4 domain.

Part C: Final Project: L-Protein Mutants

Based on the Clustal omega results mutation was done on the below sites and the protein co folded with DnaJ.

Coutesy: Neurosnap and tamarindBio