Projects

Final projects:

  • AI-guided de novo design of peptides targeting LigA to disrupt adhesion and virulence in Leptospira interrogans SECTION 1: ABSTRACT Leptospirosis is a neglected zoonotic disease caused by Leptospira species and remains an important public health concern, particularly in low-resource regions. This project focuses on the LigA protein from Leptospira interrogans, specifically its 11th domain, which is associated with bacterial adhesion and virulence.

Subsections of Projects

Individual Final Project

cover image

AI-guided de novo design of peptides targeting LigA to disrupt adhesion and virulence in Leptospira interrogans


SECTION 1: ABSTRACT

Leptospirosis is a neglected zoonotic disease caused by Leptospira species and remains an important public health concern, particularly in low-resource regions. This project focuses on the LigA protein from Leptospira interrogans, specifically its 11th domain, which is associated with bacterial adhesion and virulence.

The main objective is to design de novo peptides capable of binding to LigA using computational protein engineering tools such as PepMLM and AlphaFold2 Multimer. Selected peptide candidates will later be incorporated into expression constructs designed in Benchling for transformation into E. coli and downstream validation.

Overall, the project aims to establish a low-cost computational and experimental workflow for peptide-based anti-virulence therapies with potential applications against neglected infectious diseases and antimicrobial resistance.


SECTION 2: PROJECT AIMS

Aim 1: Experimental Aim

Design and identify peptide candidates capable of interacting with the 11th domain of the LigA protein using computational tools such as PepMLM and AlphaFold Multimer.

Aim 2: Development Aim

Experimentally evaluate selected peptides through cloning, bacterial expression in E. coli, and preliminary interaction studies with the LigA protein.

Aim 3: Visionary Aim

Develop an accessible and low-cost workflow for peptide-based therapeutic discovery against neglected infectious diseases.


SECTION 3: BACKGROUND

Literature Context

Previous studies identified the LigA protein as an important virulence factor involved in immune evasion and host interaction in leptospirosis [1]. Additionally, recent advances in artificial intelligence have enabled the computational design of peptide binders against protein targets using models such as PepMLM [2].

This project combines both approaches by applying AI-driven peptide engineering to a neglected infectious disease target that currently lacks alternative therapeutic strategies.

Innovation

The project introduces an anti-virulence strategy targeting LigA instead of relying exclusively on antibiotics. It also integrates modern computational tools such as PepMLM, AlphaFold, and Benchling to accelerate peptide discovery while reducing experimental cost and workload.

Impact

Leptospirosis affects more than one million people annually and antimicrobial resistance continues to increase [4], [5]. This project could contribute to alternative therapeutic approaches based on peptide engineering and synthetic biology. Additionally, the proposed workflow could be adapted for other neglected diseases and resistant bacterial pathogens.

Ethical Considerations

The project involves ethical considerations related to biosafety, responsible AI use, and equitable access to biotechnology. Peptide candidates must be carefully evaluated to minimize risks such as toxicity or inaccurate computational predictions.

Furthermore, accessibility and public health impact should remain priorities when developing emerging therapeutic technologies, especially for vulnerable populations affected by neglected infectious diseases.


SECTION 4: EXPERIMENTAL DESIGN, TECHNIQUES, TOOLS, AND TECHNOLOGY

Experimental Plan

Week 1: Computational Analysis

  • Retrieve LigA sequences from UniProt and NCBI.
  • Identify conserved regions using BLAST, SMART, and InterPro.
  • Generate candidate peptides using PepMLM.

Week 2: Structural Validation

  • Evaluate peptide-LigA interactions using AlphaFold2 Multimer.
  • Visualize complexes in PyMOL.
  • Design DNA constructs in Benchling.

Week 3: Peptide Expression

  • Synthesize plasmids through Twist Bioscience.
  • Transform constructs into E. coli BL21(DE3).
  • Induce recombinant peptide expression with IPTG.

Week 4: Purification and Validation

  • Purify peptides using IMAC chromatography.
  • Validate expression through SDS-PAGE and Western blotting.

Week 5: Binding Assays

  • Perform fluorescence-based binding assays with Alexa Fluor 488.
  • Analyze binding affinity values using GraphPad Prism or R.

Techniques Relevant to the Project

  • Pipetting
  • Lab Safety
  • Bioethics
  • DNA Design
  • Gel Electrophoresis
  • Databases
  • Lab Automation
  • Twist Orders
  • Protein Design
  • PepMLM
  • Benchling
  • Modeling
  • Bioproduction
  • Plasmid Prep
  • Bacterial Culture
  • Quality Control
  • Protein Purification
  • Restriction Cloning

Selected Techniques

PepMLM was used to generate de novo peptide candidates targeting the LigA protein. Candidate peptides were screened according to predicted affinity and confidence scores.

Benchling was utilized to organize DNA construct design, plasmid mapping, sequence analysis, and cloning workflow preparation.


Associated Companies

  • Addgene
  • Asimov (Kernel)
  • Boltz.bio
  • Ginkgo Bioworks
  • New England Biolabs
  • Opentrons
  • Thermo Fisher Scientific
  • Twist Biosciences

SECTION 5: RESULTS & QUANTITATIVE EXPECTATIONS

Validation Strategy

The first aim of the project was validated through an in silico workflow involving computational peptide design and screening against the LigA protein.

Validation Workflow

The validation workflow included:

  1. Retrieving the LigA sequence from UniProt.
  2. Identifying the LigA 11th domain.
  3. Generating peptide candidates using PepMLM.
  4. Screening peptides based on confidence and affinity predictions.
  5. Evaluating peptide-LigA complexes using AlphaFold2 Multimer.
  6. Visualizing interactions in PyMOL.

Techniques Used

Protein design tools such as PepMLM and AlphaFold were used to generate and analyze peptide candidates. UniProt and iGEM databases provided sequence and biological information, while Benchling supported sequence organization and construct design.

Results Summary

The LigA sequence was obtained from UniProt (ID: C0J1Q2), and the 11th domain was selected for peptide targeting, which is:

KVVSIEVIPNNISFAKGNSYQFKATGIYTDHSEADITEQVTWSSSNPKIASVENTFGSAGLVNTTNIGSTNITAKLSDTVSGSSVL
Figure 1. LigA structure with 11th domain.

PepMLM generated multiple candidate peptides:

Table 1: PepMLM peptides

IDBinderPerplexity
0SKPISIIVKDSSSLLKKIKK13.214
1KSVVGKISSITGTAKIKKIK16.319
2GKSVVSISSDTGSKLLILIS22.824
3KPSGSIVSSITTSKAIILLS21.513

Peptide 0 was selected based on its low perplexity score. AlphaFold modeling produced an iPTM score of 0.64 and a pTM score of 0.69, suggesting a moderate-confidence interaction.

Figure 2. Peptide-protein complex.

PyMOL visualization identified several residues in close spatial proximity, supporting the feasibility of peptide-LigA binding.

Figure 3. Peptide-protein complex in PyMOL. Peptide is coloured in blue, LigA in green, and the red dots are residues at the binding interface.

A final expression cassette was designed in Benchling using iGEM biobricks and cloned into the pET-21a(+) plasmid using XhoI and BamHI restriction sites.

Below is the list of the biobricks used:

Table 2: Cassette components

PartBiobrick NameNucleotide Sequence
PromoterBBa_J23100TTGACGGCTAGCTCAGTCCTAGGTACAGTGCTAGC
RBSBBa_B0032TCACACAGGAAAG
Start codonNAAUG
SequenceNATCAAAACCGATCTCGATTATCGTGAAAGATAGCTCCAGTTTACTGAAAAAGATTAAAAAAAT
HisTagNACATCACCATCACCATCAC
Stop codonNATAA
TerminatorBBa_B0010GAAGCTGCTGCAAGAGAAGCTGCAGCTAGGGAGGCTGCAGCTAGGGAGGCTGCTGCAAGA
Figure 4. Construct and plasmid in Benchling.
---

Challenges and Limitations

One major challenge was identifying the exact location of LigA domain 11, since literature descriptions were limited. Additional difficulties included interpreting AlphaFold confidence scores and designing cloning workflows with limited prior molecular biology experience.

These limitations were addressed through literature review, database analysis, and computational modeling. Future improvements could include molecular dynamics simulations and experimental in vitro validation.


SECTION 6: ADDITIONAL INFORMATION

References

[1] A. Kumar et al., “Deciphering the role of leptospira surface protein LIGA in modulating the host innate immune response,” Frontiers in Immunology, vol. 12, p. 807775, Dec. 2021, doi: 10.3389/fimmu.2021.807775.

[2] L. T. Chen et al., “Target sequence-conditioned design of peptide binders using masked language modeling,” Nature Biotechnology, Aug. 2025, doi: 10.1038/s41587-025-02761-2.

[3] J. Jumper et al., “Highly accurate protein structure prediction with AlphaFold,” Nature, vol. 596, no. 7873, pp. 583–589, Jul. 2021, doi: 10.1038/s41586-021-03819-2.

[4] K. B. Karpagam and B. Ganesh, “Leptospirosis: a neglected tropical zoonotic infection of public health importance—an updated review,” European Journal of Clinical Microbiology & Infectious Diseases, vol. 39, no. 5, pp. 835–846, Jan. 2020, doi: 10.1007/s10096-019-03797-4.

[5] S. Pineda, J. M. M. Garro, J. E. S. Flórez, S. Agudelo-Pérez, F. P. Monroy, and R. G. P. Sánchez, “Detection of genes related to antibiotic resistance in leptospira,” Tropical Medicine and Infectious Disease, vol. 9, no. 9, p. 203, Sep. 2024, doi: 10.3390/tropicalmed9090203.

[6] Beacon Bio, “Leptospirosis cases in Peru increase to 1045 with five deaths reported; Ministry of Health issues epidemiological alert amid rainfall conditions,” Beacon Bio, 2026. [Online]. Available: https://beaconbio.org/en/report/?reportid=9f425260-15d4-4032-986e-aa2fbd458db0&eventid=72171b38-461c-4b00-a2f8-dd93ed09c811. [Accessed: May 24, 2026].

[7] L. Czaplewski et al., «Alternatives to antibiotics—a pipeline portfolio review», The Lancet Infectious Diseases, vol. 16, n.o 2, pp. 239-251, ene. 2016, doi: 10.1016/s1473-3099(15)00466-1.


Supply List and Budget

Table 3: Supplies and estimated costs

CategoryItemCost (USD)
DNA SynthesisTwist Bioscience plasmid synthesis$1,800
Bacterial ExpressionE. coli BL21(DE3) cells$60
Protein PurificationNi-NTA magnetic beads$240
Protein ValidationSDS-PAGE and antibodies~$900
Binding AssaysAlexa Fluor 488 reagent$420
ConsumablesPipette tips and plates~$476
Equipment AccessAutomation and imaging systems$700
Estimated Total Budget~$7,600-$9,000 USD

Here is the full documentation in case you might want to refer to it

Group Final Project

cover image cover image