Subsections of Homework
Week 1 HW: Principles and Practices

1) Describe a biological engineering application or tool you want to develop and why.
Throughout my previous research as a new biodesigner, I have been particularly drawn to two themes: the intricate relationship between the gut microbiota and its effects on human physical and mental health, and the fascinating world of fungi and their broader implications for planetary health. I have deepened my research into psilocybin and related tryptamine alkaloids (substances that activate serotonin 5-HT receptors), which are already being investigated clinically for neuropsychiatric conditions (Xi et al., 2023), but what especially interests me is emerging evidence that some of their effects may be mediated by the gut microbiota rather than the brain alone (Caspani et al., 2024). There is still a lack of research on the precise impact that serotonergic psychedelics have on the structure and composition of the gut microbiota, even though animal studies suggest possible links (Császár-Nagy et al., 2022). A survey of wild‑type rats has shown changes in the abundance of spore‑forming gut bacteria following oral administration of serotonin, suggesting that serotonergic compounds such as psilocybin could directly or indirectly reshape microbial communities (Fung et al., 2019).
Beyond psilocybin, the traditional use of other psychoactive organisms also points to underexplored antimicrobial and immunomodulatory effects. For instance, the peyote cactus (Lophophora williamsii) has been reported to display activity against multiple penicillin‑resistant strains of Staphylococcus aureus (McCleary et al., 1960). Such examples raise broader questions about how these organisms interact with microbes and the host’s physiology. These kinds of plants, fungi, and compounds have long histories in traditional medicine yet remain only partially understood within Western biomedical frameworks (Doesburg-van Kleffens et al., 2023). I believe that the intersection of psychedelics, microbiota, and immune function is a promising area for synthetic biology yet to be further explored, with potential not only for already studied mental health conditions but also for inflammatory bowel diseases (IBD) (Qureshi et al., 2025). Rather than viewing these compounds solely through a brain‑centric lens, it would be interesting to investigate their therapeutic potential through the gut microbiota as a primary interface rather than a secondary effect.
Psychedelic‑assisted therapy is currently highly regulated, expensive, and accessible to very few people (Rea et Wallace, 2021). From a biodesign and synthetic biology perspective, biosynthesising psilocybin in a bacterial host such as Escherichia coli through recombinant DNA technology (Keller et al., 2025) could potentially provide a more controllable, scalable, and eventually more affordable route to producing psilocybin or related analogues.
In the context of the HTGAA course, I therefore propose to focus on a minimal and safe fragment of this broader concept, with the following objectives:
- Investigate how natural serotonergic psychedelics and related natural compounds might interact with the gut microbiota- immune axis, with a focus on IBD and gut inflammations.
- Design a conceptual framework for microbial psilocybin production in E. coli (Adams et al., 2019) as a platform for future microbiome‑targeted therapies.
- Prototype an E. coli experimental system expressing a heterologous lyase enzyme from a psilocybin-related tryptamine biosynthetic pathway (Abrahms et al., 2025), assembled via PCR-based cloning methods.
- Apply CRISPR‑based editing to modulate the expression of the heterologous enzyme and characterise the resulting change in a readout (through colour or fluorescence) to determine if the right sequence in E.Coli is being targeted.
- Develop a simple in silico pathway model to explore where metabolic bottlenecks might occur in a full psilocybin biosynthesis pathway in E. coli (Irvine et al., 2023).
- Reflect on ethical, regulatory, and accessibility questions around engineering psychoactive compounds and further document how these considerations shape the design of the project.
Reference List:
- Abrahms, Z.N., Sen, A.K. and Jones, J.A. (2025).Pathway engineering for the biosynthesis of psychedelics. Current Opinion in Biotechnology, 94, pp.103314–103314. doi:https://doi.org/10.1016/j.copbio.2025.103314.
- Adams, A.M., Kaplan, N.A., Wei, Z., Brinton, J.D., Monnier, C.S., Enacopol, A.L., Ramelot, T.A. and Jones, J.A. (2019). In vivo production of psilocybin in E. coli. Metabolic Engineering, [online] 56, pp.111–119. doi:https://doi.org/10.1016/j.ymben.2019.09.009.
- Caspani, G., Ruffell, S.G.D., Tsang, W., Netzband, N., Rohani-Shukla, C., Swann, J.R. and Jefferies, W.A. (2024). Mind over matter: the microbial mindscapes of psychedelics and the gut-brain axis. Pharmacological Research, 207, p.107338. doi:https://doi.org/10.1016/j.phrs.2024.107338.
- Császár-Nagy, N., Bob, P. and Bókkon, I. (2022). A Multidisciplinary Hypothesis about Serotonergic Psychedelics. Is it Possible that a Portion of Brain Serotonin Comes From the Gut? Journal of Integrative Neuroscience, 21(5), p.148. doi:https://doi.org/10.31083/j.jin2105148.
- Doesburg-van Kleffens, M., Zimmermann-Klemd, A.M. and Gründemann, C. (2023). An Overview on the Hallucinogenic Peyote and Its Alkaloid Mescaline: The Importance of Context, Ceremony and Culture. Molecules, [online] 28(24), p.7942. doi:https://doi.org/10.3390/molecules28247942.
- Fung, T.C., Vuong, H.E., Luna, C.D.G., Pronovost, G.N., Aleksandrova, A.A., Riley, N.G., Vavilina, A., McGinn, J., Rendon, T., Forrest, L.R. and Hsiao, E.Y. (2019). Intestinal serotonin and fluoxetine exposure modulate bacterial colonization in the gut. Nature Microbiology. doi:https://doi.org/10.1038/s41564-019-0540-4.
- Keller, M.R., McKinney, M.G., Sen, A.K., Guagliardo, F.G., Hellwarth, E.B., Islam, K.N., Kaplan, N.A., Gibbons, W.J., Kemmerly, G.E., Meers, C., Wang, X. and Jones, J.A. (2025). Psilocybin biosynthesis enhancement through gene source optimization. Metabolic Engineering, [online] 91, pp.119–129. doi:https://doi.org/10.1016/j.ymben.2025.04.003.
- McCleary, J.A., Sypherd, P.S. and Walkington, D.L. (1960). Antibiotic activity of an extract of peyote (Lophophora Williamii (Lemaire) Coulter). Economic Botany, 14(3), pp.247–249. doi:https://doi.org/10.1007/bf02907956.
- Qureshi, O., Cowley, J., Pegg, A., Cooper, A.J., Gordon, J., Brady, C.A., Belli, A., Butterworth, S., Upthegrove, R., Andrews, N. and Barnes, N.M. (2025). Are we hallucinating or can psychedelic drugs modulate the immune system to control inflammation? British journal of pharmacology, [online] p.10.1111/bph.70138. doi:https://doi.org/10.1111/bph.70138.
- Xi, D., Berger, A., Shurtleff, D., Zia, F.Z. and Belouin, S. (2023). National Institutes of Health psilocybin research speaker series: State of the science, regulatory and policy landscape, research gaps, and opportunities. Neuropharmacology, [online] 230, p.109467. doi:https://doi.org/10.1016/j.neuropharm.2023.109467.
2) Next, describe one or more governance/policy goals related to ensuring that this application or tool contributes to an “ethical” future, like ensuring non-malfeasance (preventing harm). Break big goals down into two or more specific sub-goals.
Policies Goal: To ensure microbial psilocybin production for microbiome-targeted IBD therapy/medication prioritises safety, equitable access despite microbiome variability, respect for traditional knowledge and transparency to build public trust and prevent harm.
- Risk assessment for gut-specific harm: to safely develop this kind of medication or therapy, it will require a longitudinal patient gut-microbome monitoring prior to and post-dosing to refine safety protocols and the detection of any adverse shifts.
- Accessibility whilst respecting indviduality of the gut microbiome: creating an accessible medication targeting something so unique to the individual will be a challenge. There are now affordable baseline microbiome profiling kits (e.g. Feel Gut - Microbiome Test Kit - 250£) to screen high-risk patients pre-treatment, while developing 2–3 engineered E. coli strains producing dosage variants (low/medium/high) based on symptom severity or gut sensitivity.
- Value of traditional knowledge: crediting and collaborating with communities that have historically used psychedelic organisms for medical approaches through benefit-sharing agreements (BSAs).
- Transparency and accountability: Create traceable audit trails for strains and products to ensure responsibility if misuse occurs and build public trust through open data sharing.
Sources:
- DiliTrust (2025). Understanding Audit Trails: Implementation, Types, and Best Practices. [online] Dilitrust. Available at: https://www.dilitrust.com/audit-trail/ [Accessed 7 Feb. 2026].
- FeelGut (2026). Gut Microbiome Health Test. [online] Feel Gut. Available at: https://feelgut.co.uk/products/gut-microbiome-health-test?gad_source=1&gad_campaignid=21790867659&gbraid=0AAAAAqF2ctBqNuD8W2LgDdGB8xmh-dJkI&gclid=Cj0KCQiAhaHMBhD2ARIsAPAU_D4EnlJcLWOuT0Ol-Gz0qDHbGOnE0a_08EaWu5OmccHvfelHGAJLgeUaAmI0EALw_wcB [Accessed 7 Feb. 2026].
- Ogilvy.com.au (2024). Shared Success: What is benefit sharing and why does it matter? - Insight - MinterEllison. [online] Minterellison.com. Available at: https://www.minterellison.com/articles/what-is-benefit-sharing-and-why-does-it-matter [Accessed 7 Feb. 2026].
- Secretariat of the Convention on Biological Diversity (2011). Convention on Biological Diversity: ABS Theme Access and benefit-sharing. [online] Available at: https://www.cbd.int/abs/infokit/revised/web/factsheet-abs-en.pdf [Accessed 7 Feb. 2026].
3) Next, describe at least three different potential governance “actions” by considering the four aspects below (Purpose, Design, Assumptions, Risks of Failure & “Success”).
To meet these policy goals, I propose the following three potential governance actions:
- COLLECTIVE DATABASE:
Call for the development of a collective database where all biologists who have worked on the E. Coli psiloyibin strains would share their protocol and CRISPR history. This calls for the collaboration of universities and researchers worldwide for future research to optimise time by limiting repetitions and consequently reducing costs. The development of controlled access will also be necessary to ensure the information in the database is used ethically and for the correct reasons.- PATIENT GUT-MICROBIOME SCREENING:
Call for the pre-screening of patients’ gut microbiome to ensure appropriate candidates. This also calls for collaboration with a pharmaceutical partner to deliver these kits directly to potential patients or to hospitals and institutions where the samples will be analysed to determine whether they are suitable for treatment of the patient's gut microbiome. This will ensure safety and a reduction of side effects, and better results by than accordingly selecting which strain would benefit the patient the most. There are still some risks considering ‘false negative’ testing, which could therefore make it difficult to fully confirm the appropriate treatment, as well as the potential increase in price of the treatment if the gut-microbiome data were to become a Pharma IP monopoly, limiting access due to often increasing prices (Dosi et al., 2023).
- CROSS COLLABORATION
Call for the implementation of royalty funds for indigenous cultures that have contributed to the development of this project with their knowledge on the medical application of psychedelics to ensure an ethical and transparent development of the project. Furthermore, developing a wider web of collaboration with governments, scientists, hospitals, psychologists, and doctors to meet and discuss/review strains, trials and dosages. This complex cross-collaboration would only be possible with an FDA approval of the usage of such psychedelics to begin with, which would require a long time, many trials and solid evidence of success at a high success rate level (Lamkin, 2022).
References:
- Dosi, G., Marengo, L., Staccioli, J. and Virgillito, M.E. (2023). Big Pharma and Monopoly capitalism: a long-term View.Structural Change and Economic Dynamics, [online] 65, pp.15–35. doi:https://doi.org/10.1016/j.strueco.2023.01.004.
- Lamkin, M. (2021). Prescription Psychedelics: The Road from FDA Approval to Clinical Practice. The American Journal of Medicine, 135(1). doi:https://doi.org/10.1016/j.amjmed.2021.07.033.
4) Next, score (from 1-3 with, 1 as the best, or n/a) each of your governance actions against your rubric of policy goals. The following is one framework but feel free to make your own:
Scale: 1-3 (1: Most effective, 2: Moderately effective, 3: Least effective, or N/A)

5) Last, drawing upon this scoring, describe which governance option, or combination of options, you would prioritize, and why. Outline any trade-offs you considered as well as assumptions and uncertainties.
I propose the prioritisation of the strain database, as well as the microbiome pre-screening, as the foundational governance actions for the development of microbial psilocybin. Given the strict regulations surrounding the use of psychedelic drugs in the medical world, transparency, feasability and safety are essential for governmental approval. Strategically, developing this project in countries like Switzerland or the Netherlands would offer optimal conditions. In Zurich and Geneva (Switzerland), research on the medical use of psychedelic compounds, such as psilocybin or LSD, in clinical psychiatric trials is continuously growing (Elçi, 2025). Additionally, their biotechnological infrastructure would allow for an ideal environment to conduct the development of such a project. On the other hand, the legal use of psilocybin truffles in the Netherlands should also be considered as a place of interest. Overlooking this as phases, the initial phase should consist of the launch of the strain database for safety and optimisation, followed by mandating affordable gut-microbiome screening. Lastly, validating via in-silico computational models the prediction of human gut-microbiome-psilocybin interactions through pharmacokinetic data (Grogan and Preuss, 2023) to build towards the potential commercialisation of such a product.
List of References:
- Grogan, S. and Preuss, C. (2023). Pharmacokinetics. [online] PubMed. Available at: https://www.ncbi.nlm.nih.gov/books/NBK557744/ [Accessed 9 Feb. 2026].
- Aylin Elçi (2025). Switzerland is home to Europe’s only psychedelics treatment. [online] SWI swissinfo.ch. Available at: https://www.swissinfo.ch/eng/multinational-companies/switzerland-is-home-to-europes-only-psychedelics-treatment/89195943 [Accessed 7 Feb. 2026].
Assignment (Week 2 Lecture Prep)
Homework Questions from Professor Jacobson:
1)Nature’s machinery for copying DNA is called polymerase. What is the error rate of polymerase? How does this compare to the length of the human genome. How does biology deal with that discrepancy?
The error rate ratio of polymerase is 1:10^6 base pairs, equivalent to one mistake every million bases. In comparison, the human genome consists of approximately 3 billion base pairs (3 x 10^9) (National Human Genome Research Institute, 2026), meaning each replication introduces around 3000 errors (3x10^9 x 10^6), leading to genetic mutations. In synthetic biology, high-fidelity DNA polymerases have proven to have much lower error rates for PCR cloning of long biosynthetic pathways. These polymerases possess 3’ to 5’ exonuclease proofreading domains that detect mismatches through structural perturbations and immediately remove them before continuing synthesis. (Clent Life Science, 2024).
2)How many different ways are there to code (DNA nucleotide code) for an average human protein? In practice, what are some of the reasons that all of these different codes don’t work to code for the protein of interest?
There are astronomically many possible ways to code for an average human protein. For example, a 300-amino-acid protein may have 10^100 different synonymous codon combinations (Rajbanshi and Guruacharya, 2025). However, most fail in practice because cells prefer specific codons (codon usage bias), which causes rare codons to slow down the tRNA process. Consequently, this creates nonuniform ribosome decoding rates on mRNAs and in turn disrupts the contranslational protein folding process, which is essential for proper protein function (Liu et al., 2021).
References:
- Bates, S. (2019). Base Pair. [online] Genome.gov. Available at: https://www.genome.gov/genetics-glossary/Base-Pair [Accessed 8 Feb. 2026].
- Clent Life Science (2024). High-fidelity DNA Polymerases & When to use them. [online] Clent Life Science. Available at: https://clentlifescience.co.uk/high-fidelity-dna-polymerases-and-when-to-use-them/ [Accessed 8 Feb. 2026].
- Liu, Y., Yang, Q. and Zhao, F. (2021). Synonymous but Not Silent: The Codon Usage Code for Gene Expression and Protein Folding. Annual Review of Biochemistry, 90(1), pp.375–401. doi:https://doi.org/10.1146/annurev-biochem-071320-112701.
- Rajbanshi, B. and Guruacharya, A. (2025). codonGPT: reinforcement learning on a generative language model enables scalable mRNA design. Nucleic Acids Research, [online] 53(22). doi:https://doi.org/10.1093/nar/gkaf1345.
Homework Questions from Dr. LeProust:
1)What’s the most commonly used method for oligo synthesis currently?
Solid-phase phosphoramidite is currently the most commonly used method for oligo synthesis. It is a cyclical four-step phosphoramidite synthesis method (McLaughlin, 2025), which was developed in 1981 by Marvin Caruthers.
2) Why is it difficult to make oligos longer than 200nt via direct synthesis?
Making oligos longer than 200 nucleotides via direct synthesis is difficult due to cumulative yield losses. During this synthesising method, one nucleotide is added at a time. Although there is a high success rate (99%), each added nucleotide will contribute to yield loss (ATDBio Ltd, 2005). At the 200th nucleotide added (0.99^200), the overall yield production will have a new success rate of 0.135%, resulting in extremely low yield and consequently resulting in a waste of expensive reagents and purification time (Mühlegger,2025).
3) Why can’t you make a 2000bp gene via direct oligo synthesis?
As mentioned in the last question, chemical DNA synthesis adds only one nucleotide at a time, at a 99% success rate. Taking into consideration that we only use one strand of DNA for the synthesis, 0.99^2000 would reduce the success rate by 0.0002% yield for a 2000bp gene. Considering this extremely low amount, it would make it practically impossible for the yield to be detected for the purification process.
References:
- ATDBio Ltd (2005). ATDBio - Solid-phase oligonucleotide synthesis. [online] atdbio.com. Available at: https://atdbio.com/nucleic-acids-book/Solid-phase-oligonucleotide-synthesis [Accessed 8 Feb. 2026].
- Glen Research (2026). Glen Report 21.211 - TECHNICAL BRIEF – Synthesis of Long Oligonucleotides. [online] Glenresearch.com. Available at: https://www.glenresearch.com/reports/gr21-211 [Accessed 8 Feb. 2026].
- McLaughlin, L. (2025). What Is Oligonucleotide Synthesis? Phosphoramidite oligonucleotide synthesis. [online] Biotechnologyreviews.com. Available at: https://www.biotechnologyreviews.com/p/what-is-oligonucleotide-synthesis [Accessed 8 Feb. 2026].
- Michael Mühlegger (2025). Oligonucleotide manufacturing – challenges & solutions. [online] Single Use Support: Pionneering Biopharma. Available at: https://www.susupport.com/blogs/biopharmaceutical-products/oligonucleotide-manufacturing-challenges-solutions [Accessed 8 Feb. 2026].
Homework Question from George Church:
1) What are the 10 essential amino acids in all animals and how does this affect your view of the “Lysine Contingency”?
The essential amino acids are: Histidine (His), Isoleucine (Ile), Leucine (Leu), Lysine (Lys), Methionine (Met), Phenylalanine (Phe), Threonine (Thr), Tryptophan (Trp), Valine (Val), and Arginine (ARG). (Lopez and Mohiuddin, 2024).
The Lysine Contingency was a genetically engineered fail‑safe created by Dr Henry Wu in the late 1980s to prevent Jurassic Park’s dinosaurs from synthesising lysine. The idea was that park staff would have to supplement the animals with lysine to keep them alive, and if any dinosaur escaped Isla Nublar, it would die without this dietary supply. This was intended as a strategy to protect outside ecosystems. However, this design was fundamentally flawed. In 1997, Dr Sarah Harding showed that the dinosaurs could obtain enough lysine from their environment through their normal diet, making the contingency ineffective in practice. Since all animals already depend on dietary lysine in the first place, the Contingency never provided the precise control the park claimed (Jurassic-Pedia, 2024).
I agree with Dr Sorkin’s view that, even though these dinosaurs were cloned, they still deserved rights as living animals. Humans chose to recreate this extinct species, so it seems ethically wrong that we should also reserve the right to let them die for our convenience or as a simplistic safety measure. This raises a wider question about how much power humans should have over organisms they create or modify. In my view, these ethical discussions should happen before such cloning work begins, not after problems appear. Overall, the Lysine Contingency feels poorly thought through and aimed at solving a problem that should never have been framed that way in the first place.
References:
- Lopez, M.J. and Mohiuddin, S.S. (2024). Biochemistry, Essential Amino Acids. [online] PubMed. Available at: https://www.ncbi.nlm.nih.gov/books/NBK557845/ [Accessed 8 Feb. 2026].
- Jurassic-Pedia (2024). Lysine Contingency (S/F) / (S/F-T/G) – Jurassic-Pedia. [online] Jurassic-Pedia. Available at: https://www.jurassic-pedia.com/lysine-contingency-sf/ [Accessed 9 Feb. 2026].
Week 2 HW: Dna-Read-Write-and-Edit
Part 1: Benchling & In-silico Gel Art
I simulated the Restriction Enzyme Digestion in Benchling to create a design. I found it initially difficult to visualise patterns or images with the 7 restriction enzymes. I therefore decided to mix certain enzymes in the same wells to generate more DNA fragments and explore shapes further.


Part 3: DNA Design Challenge
3.1 Which protein have you chosen and why? Using one of the tools described in the recitation (NCBI, UniProt, Google), obtain the protein sequence for the protein you chose.Name of protein: psiH (tryptamine 4-monooxygenase)
I chose this specific protein as it relates to my project idea from homework 1. PsiH catalyses the 4-hydroxylation of tryptamine to 4-hydroxytryptamine, which is an essential and unique part of psilocybin biosynthesis that allows for the production of psilocin (the active therapeutic metabolite). This P450 enzyme (psiH) acts as a critical rate‐limiting step of psilocybin production. Furthermore, it needs exact heme binding and substrate fit, which is rare in nature and tough to engineer in E. coli (Huang et al., 2025). This makes PsiH the technical core of my biosynthetic pathway design from Homework 1, where engineered E. coli would produce psilocybin locally to activate gut serotonin signalling for IBD treatment (Robinson et al., 2023).
Refrences:
Huang, Z., Yao, Y., Di, R., Zhang, J., Pan, Y. and Liu, G. (2025). De Novo Biosynthesis of Antidepressant Psilocybin in Escherichia coli.Microbial biotechnology, [online] 18(4), p.e70135. doi:https://doi.org/10.1111/1751-7915.70135.Gregory Ian Robinson, Li, D., Wang, B., Rahman, T., Gerasymchuk, M., Hudson, D., Kovalchuk, O. and Kovalchuk, I. (2023). Psilocybin and Eugenol Reduce Inflammation in Human 3D EpiIntestinal Tissue.Life, 13(12), pp.2345–2345. doi:https://doi.org/10.3390/life13122345.Amino Acid Protein Sequence:
MIAVLFSFVIAGCIYYIVSRRVRRSRLPPGPPGIPIPFIGNMFD
MPEESPWLTFLQWGRDYNTDILYVDAGGTEMVILNTLETITDLLEKRGSIYSGRLEST
MVNELMGWEFDLGFITYGDRWREERRMFAKEFSEKGIKQFRHAQVKAAHQLVQQLTKT
PDRWAQHIRHQIAAMSLDIGYGIDLAEDDPWLEATHLANEGLAIASVPGKFWVDSFPS
LKYLPAWFPGAVFKRKAKVWREAADHMVDMPYETMRKLAPQGLTRPSYASARLQAMDL
NGDLEHQEHVIKNTAAEVNVGGGDTTVSAMSAFILAMVKYPEVQRKVQAELDALTNNG
QIPDYDEEDDSLPYLTACIKELFRWNQIAPLAIPHKLMKDDVYRGYLIPKNTLVFANT
WAVLNDPEVYPDPSVFRPERYLGPDGKPDNTVRDPRKAAFGYGRRNCPGIHLAQSTVW
IAGATLLSAFNIERPVDQNGKPIDIPADFTTGFFRHPVPFQCRFVPRTEQVSQSVSGP
Source:National Library of Medicine (2026). Psilocybe cubensis strain FSU 12409 putative monooxygenase (psiH) gene - Nucleotide - NCBI. [online] Nih.gov. Available at: https://www.ncbi.nlm.nih.gov/nuccore/MF000993 [Accessed 16 Feb. 2026].3.2 DNA Reverse Translation:
atgattgcggtgctgtttagctttgtgattgcgggctgcatttattatattgtgagccgc
cgcgtgcgccgcagccgcctgccgccgggcccgccgggcattccgattccgtttattggc
aacatgtttgatatgccggaagaaagcccgtggctgacctttctgcagtggggccgcgat
tataacaccgatattctgtatgtggatgcgggcggcaccgaaatggtgattctgaacacc
ctggaaaccattaccgatctgctggaaaaacgcggcagcatttatagcggccgcctggaa
agcaccatggtgaacgaactgatgggctgggaatttgatctgggctttattacctatggc
gatcgctggcgcgaagaacgccgcatgtttgcgaaagaatttagcgaaaaaggcattaaa
cagtttcgccatgcgcaggtgaaagcggcgcatcagctggtgcagcagctgaccaaaacc
ccggatcgctgggcgcagcatattcgccatcagattgcggcgatgagcctggatattggc
tatggcattgatctggcggaagatgatccgtggctggaagcgacccatctggcgaacgaa
ggcctggcgattgcgagcgtgccgggcaaattttgggtggatagctttccgagcctgaaa
tatctgccggcgtggtttccgggcgcggtgtttaaacgcaaagcgaaagtgtggcgcgaa
gcggcggatcatatggtggatatgccgtatgaaaccatgcgcaaactggcgccgcagggc
ctgacccgcccgagctatgcgagcgcgcgcctgcaggcgatggatctgaacggcgatctg
gaacatcaggaacatgtgattaaaaacaccgcggcggaagtgaacgtgggcggcggcgat
accaccgtgagcgcgatgagcgcgtttattctggcgatggtgaaatatccggaagtgcag
cgcaaagtgcaggcggaactggatgcgctgaccaacaacggccagattccggattatgat
gaagaagatgatagcctgccgtatctgaccgcgtgcattaaagaactgtttcgctggaac
cagattgcgccgctggcgattccgcataaactgatgaaagatgatgtgtatcgcggctat
ctgattccgaaaaacaccctggtgtttgcgaacacctgggcggtgctgaacgatccggaa
gtgtatccggatccgagcgtgtttcgcccggaacgctatctgggcccggatggcaaaccg
gataacaccgtgcgcgatccgcgcaaagcggcgtttggctatggccgccgcaactgcccg
ggcattcatctggcgcagagcaccgtgtggattgcgggcgcgaccctgctgagcgcgttt
aacattgaacgcccggtggatcagaacggcaaaccgattgatattccggcggattttacc
accggcttttttcgccatccggtgccgtttcagtgccgctttgtgccgcgcaccgaacag
gtgagccagagcgtgagcggcccg
Source
The Sequence Manipulation Suite (2024).Reverse Translate. [online] www.bioinformatics.org. Available at: https://www.bioinformatics.org/sms2/rev_trans.html [Accessed 16 Feb. 2026].3.3 Codon optimisation
In your own words, describe why you need to optimise codon usage. Which organism have you chosen to optimise the codon sequence for and why?
Although different codons can code for the same amino acid, each species/organism has a bias for its codon preferences. This is done by changing/optimising the DNA codon sequence (not the amino-acid sequence) of the protein in order to match the codon preferences of the host organism (Cheema et al., 2022). If I were to take a human gene and insert it into a bacterium, it might use certain codons that the bacterium wouldn’t/would rarely use. This, in turn, makes the translation process slower or incomplete, resulting in a low protein yield (Creative BioLabs,2025). Therefore, by optimising the sequence with a specific host, I can make the translation process faster and more reliable.
For this specific exercise, I chose to optimise the psiH protein for E. Coli. I made this choice because E. coli is one of the most commonly used hosts for genetic engineering due to its rapid culture rate, simple nutritional needs and well-understood genetics (Adamczyk and Reed, 2017). Additionaly it is relatively cheap to culture
(Francis and Page,2010). In relation to Homework 1, during the time of this course, E. coli is an appropriate host for prototyping the psilocybin pathway to conceptually extend toward microbiome-targeted therapies.
sources
Creative BioLabs (2025). Codon Optimization and Its Impact on mRNA Translation Efficiency. [online] Creative-biolabs.com. Available at: https://ribosome.creative-biolabs.com/codon-optimization-and-its-impact-on-mrna-translation-efficiency.htm [Accessed 17 Feb. 2026].Cheema, N., Georgios Papamichail and Dimitris Papamichail (2022). Computational tools for synthetic gene optimization. Elsevier eBooks, pp.171–189. doi:https://doi.org/10.1016/b978-0-12-824469-2.00018-x.Adamczyk, P.A. and Reed, J.L. (2017). Escherichia coli as a model organism for systems metabolic engineering. Current Opinion in Systems Biology, [online] 6, pp.80–88. doi:https://doi.org/10.1016/j.coisb.2017.11.001.Francis, D.M. and Page, R. (2010). Strategies to optimize protein expression in E. coli. Current protocols in protein science, [online] Chapter 5(1), p.Unit 5.24.1-29. doi:https://doi.org/10.1002/0471140864.ps0524s61.What technologies could be used to produce this protein from your DNA? Describe in your own words how the DNA sequence can be transcribed and translated into a protein.
A variety of technologies could be used to produce the psiH protein from its DNA. One of the most suitable that we have discussed in lectures would be Gibson Assembly. It would allow us to stitch together multiple DNA fragments (promoter, Ribosome binding site, my optimised DNA sequence of psiH, terminator…) inside the plasmid of the E. Coli without the use of restriction enzymes. You may also use external platforms like Twist Bioscience to order your full plasmid by giving them the appropriate optimised sequence.
The DNA sequence may be transcribed and translated into a protein with the following steps:
Transcription: The RNA Polymerase will bind to the promoter sequence of the DNA. The double helix of the DNA unwinds and allows for the RNA polymerase to create a complementary mRNA strand following the base-pairing rules.Translation: Ribosomes then bind to the mRNA at the RBS near the start codon (ATG). tRNA molecules then match their anticodons to the mRNA codons, resulting in specific amino acids. Lastly, the ribosomes will continue to read through the sequence until a stop codon is reached, causing the release of the protein chain.Sources:
Nature Education (2014). The Information in DNA Determines Cellular Function via Translation | Learn Science at Scitable. [online] Nature.com. Available at: https://www.nature.com/scitable/topicpage/the-information-in-dna-determines-cellular-function-6523228/ [Accessed 15 Feb. 2026].Webster, M.W. and Weixlbaumer, A. (2021). The intricate relationship between transcription and translation. Proceedings of the National Academy of Sciences, [online] 118(21). doi:https://doi.org/10.1073/pnas.2106284118.Part 4: Prepare a Twist DNA Synthesis Order
Building my DNA Insert Sequence
I began by optimising my psiH translated protein DNA sequence in Benchling with a linear topology and optimising it for E. coli. I then added the reading direction (forward), and the given DNA sequences highlighted in the homework (Promoter, RBS, Coding Sequence, 7x His Tag, Stop Codon, Terminator).

Linear Map of the entire sequence:

Final Sequence Benchling link!
Building my Full Plasmid Sequence
After downloading my insert sequence (expression cassette) as a FASTA file and uploading it into my Twist account, selecting an appropriate vector (pTwist Amp High Copy), I was able to download the full plasmid sequence (GenBank). I then imported the GenBank file of my plasmid back into Benchling.

Part 5: DNA Read/Write/Edit
5.1 DNA Read
1) What DNA would you want to sequence (e.g., read) and why?
I want to sequence the 3955 bp E. coli plasmid containing the codon-optimised PsiH gene (tryptamine 4-monooxygenase from Psilocybe cubensis) that I developed with Twist Bioscience technology for exercise 4. This would allow for the verification of the construct (Gibson Assembly) of the plasmid, to verify that there are no mutations in the critical heme-binding site essential for the following steps of psilocibin synthesis. Ultimately, the goal would be to create a baseline sequence for future IBD therapy-scale engineering (Adams et al., 2019).
Sources
Adams, A.M., Kaplan, N.A., Wei, Z., Brinton, J.D., Monnier, C.S., Enacopol, A.L., Ramelot, T.A. and Jones, J.A. (2019). In vivo production of psilocybin in E. coli. Metabolic Engineering, [online] 56, pp.111–119. doi:https://doi.org/10.1016/j.ymben.2019.09.009.1) In the lecture, a variety of sequencing technologies were mentioned. What technology or technologies would you use to perform sequencing on your DNA, and why?
I initially considered Sanger sequencing due to its high accuracy (~99.9%) and effectiveness for targeted validation of small DNA regions. However, my E. coli psiH plasmid has 3955 bp, which exceeds Sanger’s read length of ~800 bp per reaction. Full coverage would require multiple reads; consequently, needing the development of multiple primers, which creates a time-consuming and costly process (AAT Bioquest, 2024).
I therefore continued my research and thought Oxford Nanopore Technologies' MinION device was a more appropriate fit (Oxford Nanopore Technologies, 2024). This technology generates long reads (>20 kb) capable of sequencing my entire 3955 bp plasmid in 1-2 reads with >99% accuracy (Brown et al., 2023). This approach would allow for the verification of the complete PsiH integration in the plasmid, promoter/RBS/terminator junctions and detect assembly errors.
Sources
AAT Bioquest (2024). What are the limitations of the Sanger Sequencing method?W | AAT Bioquest. [online] Aatbio.com. Available at: https://www.aatbio.com/resources/faq-frequently-asked-questions/what-are-the-limitations-of-the-sanger-sequencing-method [Accessed 15 Feb. 2026].Brown, S.D., Dreolini, L., Wilson, J.F., Miruna Balasundaram and Holt, R.A. (2023). Complete sequence verification of plasmid DNA using the Oxford Nanopore Technologies’ MinION device.BMC Bioinformatics , 24(1). doi:https://doi.org/10.1186/s12859-023-05226-y.Oxford Nanopore Technologies (2024). Plasmidsaurus redefine the gold standard: whole-plasmid sequencing with Oxford Nanopore. [online] Oxford Nanopore Technologies. Available at: https://nanoporetech.com/blog/plasmidsaurus-redefine-the-gold-standard-whole-plasmid-sequencing-with-oxford-nanopore [Accessed 17 Feb. 2026].1) Is your method first-, second-, or third-generation or other? How so?
Oxford Nanopore Technologies' MinION is a 3rd generaation sequencer. This means that it can read much longer sequences than any 1st or 2nd generation sequencing technologies (Hilt and Ferrieri, 2022). Additionally, it is one of the rare sequencing technologies that allows for real-time analysis. (Oxford Nanopore Technologies,2021)
Sources
Hilt, E.E. and Ferrieri, P. (2022). Next Generation and Other Sequencing Technologies in Diagnostic Microbiology and Infectious Diseases. Genes, 13(9), p.1566. doi:https://doi.org/10.3390/genes13091566.Oxford Nanopore Technologies (2021). How Nanopore Sequencing Works. [online] Oxford Nanopore Technologies. Available at: https://nanoporetech.com/platform/technology [Accessed 17 Feb. 2026].2) What is your input? How do you prepare your input (e.g. fragmentation, adapter ligation, PCR)? List the essential steps.
Input: Purified plasmid DNA from my engineered E. coli (3955 bp PsiH construct)
Preparation
Grow Escherichia coli strain DH5α (Huan et al., 2025)Introduce the plasmid (containing the optimised psiH gene) through an overnight culture.Extraction of the plasmid to obtain the pure PsiH-plasmid DNA out of E. coli cells.Measuring DNA through NanoDrop (spectrophotometer): usually around 1-2 µL of sample to measure concentration and purity. (Thermo Fisher Scientific, 2026).For Nanoprre preparation, add motor proteins (acting as enzymes that control the speed and direction of DNA as it moves through the nanopore). (Oxford Nanopore Technologies, 2025)Sources
Huang, Z., Yao, Y., Di, R., Zhang, J., Pan, Y. and Liu, G. (2025). De Novo Biosynthesis of Antidepressant Psilocybin in Escherichia coli. Microbial biotechnology, [online] 18(4), p.e70135. doi:https://doi.org/10.1111/1751-7915.70135.Oxford Nanopore Technologies (2025). How Oxford Nanopore sequencing works. [online] Oxford Nanopore Technologies. Available at: https://nanoporetech.com/blog/how-oxford-nanopore-sequencing-works [Accessed 17 Feb. 2026].Thermo Fisher Scientific (2026). NanoDrop Microvolume Spectrophotometers - US. [online] www.thermofisher.com. Available at: https://www.thermofisher.com/uk/en/home/industrial/spectroscopy-elemental-isotope-analysis/molecular-spectroscopy/uv-vis-spectrophotometry/instruments/nanodrop.html [Accessed 17 Feb. 2026].3) What are the essential steps of your chosen sequencing technology? How does it decode the bases of your DNA sample (base calling)?
Miniprep of plasmid: isolates and purifies plasmid DNA from bacterial culture. Includes a colour buffer system for a visual quality check to ensure my extraction worked successfully.Quantify DNA (spectrophotometer)Library preparation: Addition of barcode and motor proteins.Priming of the Nanopore flow cell: removing air and addition of bufferLoad the library onto the MinION flow cell.Start running through the software and wait for sequencing (usually around 24-48 hours).
Image: Instructions from Monarch Spin Plasmid Miniprep Kit, 2026.
Sources
New England Biolabs (2025). [online] Neb.com. Available at: https://www.neb.com/en-gb/products/t1110-monarch-spin-plasmid-miniprep-kit [Accessed 17 Feb. 2026].PANDORA-ID-NET Consortium (2021). Oxford Nanopore flow cell priming and loading tutorial. [online] YouTube. Available at: https://www.youtube.com/watch?v=IknVaEnuDz0 [Accessed 17 Feb. 2026].4) What is the output of your chosen sequencing technology?
Output: A FASTQ file containing the complete 3955 bp sequence of my optimised E.coli PsiH plasmid.
Sources
Oxford Nanopore Technologies plc. (2020). Output Structure - Oxford Nanopore Output Specifications. [online] Github.io. Available at: https://nanoporetech.github.io/ont-output-specifications/latest/minknow/output_structure/ [Accessed 19 Feb. 2026].5.2 DNA Write
I will synthesise the recombinant DNA of the engineered E. Coli as it contains the introduced plasmid from the psilocybin producing fungi.
5.2 DNA Write
(i)What DNA would you want to synthesise (e.g., write) and why?
I would like to synthesise the codon-optimised PsiH gene. This is an essential step for the development of this project, as fungal codons usually don’t express well in bacteria (Naqvi et al., 2016). Synthesising it will allow for a higher level of psiH with the ultimate goal of psilocin production.
Sources
Naqvi, S.H.Z., Cord‐Landwehr, S., Singh, R., Frank, B., Kolkenbrock, S. and Moerschbacher, B.M. (2016). A Recombinant Fungal Chitin Deacetylase Produces Fully Defined Chitosan Oligomers with Novel Patterns of Acetylation. Applied and Environmental Microbiology, 82(22), pp.6645–6655. doi:https://doi.org/10.1128/aem.01961-16.(ii)What technology or technologies would you use to perform this DNA synthesis, and why?
I would use the services of Twist Bioscience to ensure a correct synthesis with no PCR errors, and be ready for the Gibson Assembly of my plasmid.
1) What are the essential steps of your sequencing methods?
Twist uses a Phosphoramidite synthesis method. These are the four essential steps:
Coupling: the first phosphoramidite in the chain is attached to the surface with a catalysed condensation reaction.Oxidation: the phosphite triester is unstable, so it is converted to a phosphate to improve the sequenceCoupling: the next phosphoramidite is coupled to the available -OH on the previous deblocked molecule.Capping: Sometimes the coupling is not 100% efficient, and therefore, the coupling fails. To stop this, an unreactive group is added, blocking further extension.Repetition: Oxidation is repeated to extend the oligonucleotide molecule in a desired sequence.
Image: Twist Bioscience Website, 2026.
Sources
Twist Bioscience (2018). Phosphoramidite Chemistry for DNA Synthesis | Twist Bioscience. [online] www.twistbioscience.com. Available at: https://www.twistbioscience.com/blog/science/simple-guide-phosphoramidite-chemistry-and-how-it-fits-twist-biosciences-commercial [Accessed 18 Feb. 2026].2) What are the limitations of your sequencing method (if any) in terms of speed, accuracy, and scalability?
Phosphoramidite synthesis makes short DNA pieces (200-1500 bp) (Hughes and Ellington,2017), but Twist Bioscience assembles them into my full 2155 bp PsiH gene with error correction anyway, so it wouldn’t affect the actual synthesis process of my DNA. However, another issue I would like to raise is the use of hazardous reagents in reactions, washing and purification processes of this type of synthesis, which raises significant safety and environmental issues which shouldn’t be undermined (Gao et al., 2025).
Sources
Gao, N., Yu, A., Yang, W., Zhang, X., Shen, Y. and Fu, X. (2025). Enzymatic de novo oligonucleotide synthesis: Emerging techniques and advancements. Biotechnology Advances, [online] 82, p.108604. doi:https://doi.org/10.1016/j.biotechadv.2025.108604.Hughes, R.A. and Ellington, A.D. (2017). Synthetic DNA Synthesis and Assembly: Putting the Synthetic in Synthetic Biology. Cold Spring Harbor Perspectives in Biology, [online] 9(1), p.a023812. doi:https://doi.org/10.1101/cshperspect.a023812.5.3 DNA Edit
What DNA would you want to edit and why?
I’d like to edit the PsiH gene via mutagenesis. This would allow for the optimisation of tryptamine binding, which in turn will enhance enzyme activity in E. coli without needing host genome changes (Huang et al., 2025).
Sources
Huang, Z., Yao, Y., Di, R., Zhang, J., Pan, Y. and Liu, G. (2025). De Novo Biosynthesis of Antidepressant Psilocybin in Escherichia coli. Microbial biotechnology, [online] 18(4), p.e70135. doi:https://doi.org/10.1111/1751-7915.70135.(ii)What technology or technologies would you use to perform these DNA edits and why?
I would use the site-directed mutagenesis method to examine the relationship between function and structure of my selected protein (Zhang et al., 2009). To do so, I would amplify my Twist PsiH plasmid with mutant primers in order to enhance the tryptamine binding in E.coli host.
Sources
Zhang, B., Zhang, X., An, X., Ran, D., Zhou, Y., Lu, J. and Tong, Y. (2009). An easy-to-use site-directed mutagenesis method with a designed restriction site for convenient and reliable mutant screening. Journal of Zhejiang University SCIENCE B, 10(6), pp.479–482. doi:https://doi.org/10.1631/jzus.b0820367.1) How does your technology of choice edit DNA? What are the essential steps?
PCR site-directed mutagenesis edits PsiH using custom primers carrying my mutation (one base change to improve tryptamine binding) to amplify the full Twist plasmid during PCR, producing nicked circular copies with the incorporated alteration.
2) What preparation do you need to do (e.g. design steps) and what is the input (e.g. DNA template, enzymes, plasmids, primers, guides, cells) for the editing?
Design psiH primers with mutations.Use PCR to amplify the full plasmid.The DpnI enzyme is used to digest parental DNA, removing the original psiH plasmid.Verify the mutant through sequencing.
image: From Addgene.org
Sources
Kristian Laursen (2016). Site-directed mutagenesis by PCR. [online] Addgene.org. Available at: https://blog.addgene.org/site-directed-mutagenesis-by-pcr [Accessed 20 Feb. 2026].3) What are the limitations of your editing methods (if any) in terms of efficiency or precision?
Considering that I am only editing a small part of my DNA, using this method has a relatively high precision rate for a single alteration. Prior steps, though, like a mistake in the primer preparation, which will cause wrong mutations (Alvarez, 2024), as well as a very large plasmid, will cause a drop in accuracy (Jacobs et al., 2011).
Sources
Alvarez, D. (2024). 5 Common Challenges in Site-Directed Mutagenesis and How to Overcome Them. [online] TeselaGen Biotechnology. Available at: https://teselagen.com/blog/challenges-site-directed-mutagenesis/ [Accessed 21 Feb. 2026].Jacobs, J.S., Hong, X. and Eberl, D.F. (2011). A mesmerising new approach to site-directed mutagenesis in large transformation-ready constructs. Fly, [online] 5(2), pp.162–169. doi:https://doi.org/10.4161/fly.5.2.15092.Week 3 HW: Lab Automation
Assignment: Python Script for Opentrons Artwork
1) Generate an artistic design using the GUI at opentrons-art.rcdonovan.com.

My OpenTron design is inspired by the nudibranch (sea slug) from the Mollusca phylum.
Image source: Siewert, I. (2014). Nudibranch - Marine Life in Thailand. [online] Diving in Phuket Thailand. Available at: https://www.diving-thailand-phuket.com/nudibranch-marine-life-thailand/.
Using the coordinates from the GUI, follow the instructions in the HTGAA26 Opentrons Colab to write your own Python script which draws your design using the Opentrons.
After following the instructions, I downloaded the Python script from the GUI website to insert it into my Google Drive code. The code seemed to be all correct, but I encountered an error when it came to the proteins available on the GUI website and the actual colours in the code. I therefore used the Gemini function available in Google Colab to debug my code when it couldn’t recognise some of the colours. Gemini helped identify missing arguments in the load_labware function and suggested remapping custom protein names to standard colours for visualisation. This is why my final image colours don't match those on the OpenTrons art website.
Final image:

Post Lab-Questions
1) Find and describe a published paper that utilises the Opentrons or an automation tool to achieve novel biological applications.
I found a 2025 paper named 'Real‑time AI‑driven quality control for laboratory automation: a novel
computer vision solution for the opentrons OT‑2 liquid handling robot'that was
I found a 2025 paper named: Real‑time AI‑driven quality control for laboratory automation: a novel
computer vision solution for the Opentrons OT‑2 liquid handling robot (Khan et al., 2025), which I found particularly interesting.
This paper explores a novel AI-driven computer vision model called YOLOv8 (object detection model), created for the enhancement of quality that limits the accuracy and reliability of liquid handling robots like Opentrons OT-2. Combining these two systems would allow for the precise detection of pippette tips as well as liquid volumes, consequently providing real-time feedback on potential errors like incorrect placement or the absence of pipette tips as well as liquid levels. The paper highlights that the results obtained with their model present an effective, accessible and affordable solution for the improvement of laboratory automation for both academic and research laboratories.
In the introduction, the authors discuss the advantages ( enhancing reproducibility, efficiency and safety) but also disadvantages (high costs, protocol variability and limited expertise) of automation as well as how using it in life sciences remains limited compared to its high success rates in other industries like manufacturing and food production. The paper, therefore, addresses this gap through their YOLOv8 model in combination with the Opentrons OT-2 liquid handling robot.
The paper continues into the Methodology section, which was divided into 4 sections:
Data collection: taking images of pipette tips and liquid volumes using a camera in the OT-2 robot under a variety of lab conditions.Image annotation: used the Computer Vision Annotation Tool for the labelling of pipette tips and liquid volumes for object detection.Model Training: trained the YOLOv8 on the annotated dataset.Real-time Integration and Error Detection: using the trained YOLOv8 in a server-client setup for real-time error detection and feedback of the OT-2 pipetting system.The following are the experimental results and analysis, where they began by evaluating the YOLOv8 performance and concluding that the trend indicates the model had progressively enhanced its ability to recognise and, in turn, categorise objects more accurately throughout the training. They continued by looking at the readings from the detected tips and liquids by testing the YOLOv8 system through the conduction of 50 experiments. The experiments consisted of running the OT-2 robot in real time, and intentionall exclude certain pipette tips. The tests resulted in a 98% accuracy of identifying missing tips and their exact location. In addition, the authors tested the measurement of liquids within the pipette tips by conducting 100 experiments using a variety of liquid quantities. Their results showed a 95% accuracy in the assessment of liquid within the tips, which suggested that it was an effective approach but that there is still room for improvement. The authors suggest the implementation of liquid segmentation techniques to do so.
In the discussion section, the authors mainly highlight the potential of their technology as an affordable alternative to costly proprietary solutions and how it aids in the improvement of the quality of produced results, which consequently increases the throughput of the laboratory.
Lastly, in the conclusion and future direction section, the paper describes the potential of the YOLOv8 coupled with OT-2 in the advamncement in life science laboratory automation. The authors also state the limitations of their study, such as their experiment focusing on specific pipette types and liquid volumes. They suggest that future works should address these by expanding the dataset to include a wider range of experimental conditions as well as additional liquid handling tasks. Furthermore, they aim to further develop the YOLOv8 AI computer to detect challenges such as air bubbles, which also affect liquid handling accuracy.
Paper Reference:
Khan, S.U., Møller, V.K., Frandsen, R.J.N. and Mansourvar, M. (2025). Real-time AI-driven quality control for laboratory automation: a novel computer vision solution for the Opentrons OT-2 liquid handling robot. Applied Intelligence, 55(6). doi:https://doi.org/10.1007/s10489-025-06334-3.
2) Write a description about what you intend to do with automation tools for your final project.
For my final project (if I continue the direction of my initial project proposition), I would use a Python script to automate E. coli transformation for screening Psilocybe-derived psiH pathway variants.(Bryant, 2022)
The automation tools I would need/ want to use apart from Python:
Opentrons OT-2 robot: handles precise pipetting of cells, plasmids, and media across all the wells simultaneously(Opentrons,2026).Thermocycler module: automates heat shock for DNA uptake into E. coli cells. (Bryant, 2022).Ginkgo Nebula service: for providing a library of codon-optimised (Saras, 2023) psiH variant plasmids in the correct well format for the OT-2 robot.Custom 3D-printed inserts made in PrusaSlicer (Pamidi et al., 2024). Could be used for the cultivation or automation process of the project.As a biodesign student, I want to combine industrial automation (OT-2) with hands-on fabrication challenges. The 3D modelling is a new territory for me, but pushing these design boundaries alongside molecular automation is core to biodesign practice. Additionally, since lab access through my node is uncertain and equipment availability unknown, I'll prioritise developing the Python script and 3D printed components, as these I can prototype independently while planning for potential laboratory access.
Sources
Bryant, J.A., Kellinger, M., Longmire, C., Miller, R. and R Clay Wright (2022). AssemblyTron: flexible automation of DNA assembly with Opentrons OT-2 lab robots. Synthetic biology, [online] 8(1). doi:https://doi.org/10.1093/synbio/ysac032.Saras, N. (2023). Arbor® Biotechnologies. [online] Arbor Biotechnologies®. Available at: https://arbor.bio/arbor-biotechnologies-announces-collaboration-with-ginkgo-bioworks-to-advance-the-discovery-and-development-of-precision-gene-editors/ [Accessed 22 Feb. 2026].Opentrons (2026). OT-2 Robot - Opentrons. [online] Opentrons.com. Available at: https://opentrons.com/products/ot-2-robot?sku=999-00111 [Accessed 22 Feb. 2026].Pamidi, A.S., Spano, M.B. and Weiss, G.A. (2024). A Practical Guide to 3D Printing for Chemistry and Biology Laboratories. Current Protocols, 4(10). doi:https://doi.org/10.1002/cpz1.70036.Final Project Ideas

Week 4 HW: Protein Design I
Part A. Conceptual Questions
1. How many molecules of amino acids do you take with a piece of 500 grams of meat? (on average, an amino acid is ~100 Daltons)There are a variety of meats depending on the kind. Let’s take an average of about 20% of protein by mass of meat (Day, 2016). This would therefore mean that 500g of meat would contain roughly 100g of protein. The average molecular weight of an amino acid residue is 100 Daltons, which translates to about 100g per mole of the amino acid unit (100g/100g/mol = 1 mole of amino acid units) (ProPrep, 2019). 1 mole contains 6.022 x 10^23 (Avogadro’s number) amount of particles (The ChemTeam, 2026). Therefore, if you consume 500g of meat, you are also ingesting approximately 6 x 10^23 amino acid residues.
References
Day, L. (2016). Protein: Food Sources. Encyclopedia of Food and Health, pp.530–537. doi:https://doi.org/10.1016/b978-0-12-384947-2.00576-6.Dawson, J. (2021). Biological Macromolecules and Amino Acids.ecampusontario.pressbooks.pub. [online] Available at: https://ecampusontario.pressbooks.pub/bioc2580/chapter/bioc2580-lecture-1-biological-macromolecules-amino-acids/ [Accessed 28 Feb. 2026].ProPrep (2019).How can molecular weight conversion be achieved from dalton to g/mol, and why is this unit of measurement significant in molecular biology? [online] Proprep.com. Available at: https://www.proprep.com/questions/how-can-molecular-weight-conversion-be-achieved-from-dalton-to-gmol-and-why-is-this-unit-of-measurem [Accessed 1 Mar. 2026].The ChemTeam (2026). Welcome To Zscaler Directory Authentication. [online] Chemteam.info. Available at: https://www.chemteam.info/Mole/MolarMass.html.12.Why do humans eat beef but do not become a cow, eat fish but do not become fish?
There are several reasons why eating another animal does not make us become that animal, and they centre on two key concepts. First, what makes us human rather than another organism is the DNA in our own cells and how our genes are regulated (Brown, 2002). Our genome encodes for specific human proteins, cells, and tissues, and this blueprint does not change just because we eat another species. Second, chemical digestion breaks food down into small building blocks before absorption. When a human consumes an organism with a different genotype, they do not absorb that organism’s genome. Instead, digestive enzymes break down animal proteins into individual amino acids and degrade the animal’s DNA into nucleotides (Patricia and Dhamoon, 2022). Because virtually all life is built from the same basic set of around 20 natural amino acids, what really matters is how these building blocks are arranged in the polypeptide chain. The human body uses the amino acids it absorbs to make human proteins according to its own genetic instructions.
References
Brown, T.A. (2002). The Human Genome. [online] Nih.gov. Available at: https://www.ncbi.nlm.nih.gov/books/NBK21134/ [Accessed 1 Mar. 2026].Patricia, J.J. and Dhamoon, A.S. (2022). Physiology, Digestion. [online] National Library of Medicine. Available at: https://www.ncbi.nlm.nih.gov/books/NBK544242/ [Accessed 1 Mar. 2026].3. Why are there only 20 natural amino acids?
Ultimately, a set of fundamental physicochemical imperatives (molecular properties such as solubility, acidity, and basicity) dictates the prebiotic selection of the canonical 20 amino acid types. The guiding principle was parsimony, where nature retained only the simplest structures with high functional value, avoiding redundancy and unnecessary complexity (Moldoveanu and David, 2022). Energy efficiency also played a key role. The study ‘Why twenty amino acid residue types suffice(d) to support all living systems (2018) used quantum chemistry and chemoinformatics to analyse a large panel of candidate chemicals, followed by statistical analysis of their complexity and property scores. The results showed that the 20 canonical amino acids were the most likely to form under prebiotic conditions and had optimal physicochemical properties (Bywater, 2018).
References
Bywater, R.P. (2018). Why twenty amino acid residue types suffice(d) to support all living systems. PLOS ONE, 13(10), p.e0204883. doi:https://doi.org/10.1371/journal.pone.0204883.Moldoveanu, S. and David, V. (2022). Characterization of analytes and matrices. Essentials in Modern HPLC Separations, 2, pp.179–205. doi:https://doi.org/10.1016/b978-0-323-91177-1.00003-x.4. Can you make other non-natural amino acids? Design some new amino acids.
Yes, you can make other non-natural amino acids, known as non-proteinogenic or non-canonical amino acids (ncAAs), which have been routinely synthesised and engineered in labs to be added into proteins to alter certain physicochemical and biological properties for an optimal protein (e.g., reactivity), which has been a key development for enzymology and drug discovery (Adhikari et al., 2021). To design ncUAAs, you need to analyse their structure and determine what must remain the same and what may be altered. The core structure has a central alpha carbon that is bonded to an amino group, a carboxyl group, a hydrogen atom, and an R group (custom side chain). The R group can be hydrophobic, polar, or charged. The size, shape and potential for covalent interactions determine how each amino acid residue interacts with its environment (Whiteburn, 2024). It is also the area that needs to be altered/engineered to create an ncUAA.
References
Adhikari, A., Bhattarai, B.R., Aryal, A., Thapa, N., KC, P., Adhikari, A., Maharjan, S., Chanda, P.B., Regmi, B.P. and Parajuli, N. (2021). Reprogramming natural proteins using unnatural amino acids.RSC Advances , 11(60), pp.38126–38145. doi:https://doi.org/10.1039/d1ra07028b.Whitburn, T. (2024). Exploring Biological Molecules: Amino Acids, Protein Structure, and Function. [online] Online A-level Biology Tutor. Available at: https://www.alevelbiologytutor.com/tutoring-blog/2024/3/1/exploring-biological-molecules-understanding-amino-acids-protein-structure-and-function [Accessed 27 Feb. 2026].5.Where did amino acids come from before enzymes that make them, and before life started?
Amino acids on prebiotic Earth formed through non-biological chemical reactions (abiogenesis), both from space delivery and surface synthesis (Cowing, 2023). Meteorites (carbonaceous chondrites) delivered large quantities of abiotically formed amino acids like glycine, alanine, α-amino-n-butyric acid, isovaline, and β-alanine, which landed in early oceans (Strasdeit, 2009). Atmospheric sparks from electric discharges (lightning) and volcanic ash-gas clouds also synthesised amino acids in water droplets (Cowing, 2025). Over time, wet-dry cycles in lagoons or tidal pools concentrated these amino acids and polymerised them into peptides, creating the building blocks for early life.
References
Cowing, K. (2023). How Were Amino Acids Formed Before The Origin Of Life On Earth? - Astrobiology. [online] Astrobiology. Available at: https://astrobiology.com/2023/04/how-were-amino-acids-formed-before-the-origin-of-life-on-earth.html [Accessed 1 Mar. 2026].Cowing, K. (2025). Microlightning In Water Droplets May Have Sparked Life On Earth - Astrobiology. [online] AstrobiologyAvailable at: https://astrobiology.com/2025/03/microlightning-in-water-droplets-may-have-sparked-life-on-earth.html [Accessed 1 Mar. 2026].Strasdeit, H. (2009). Prebiotic amino acid chemistry on the early Earth. EPSC Abstracts, [online] 4. Available at: https://meetingorganizer.copernicus.org/EPSC2009/EPSC2009-70.pdf [Accessed 1 Mar. 2026].6.If you make an α-helix using D-amino acids, what handedness (right or left) would you expect?
Most amino acids come in at least two forms, whose structures are mirror images of each other and are referred to as the right-handed or left-handed optical isomers. Natural proteins use L-amino acids and form right-handed alpha helices, whereas D-amino acids are the mirror image, therefore producing the left-handed helices (enantiomers).
(BOC Sciences, 2025).
References
BOC Sciences (2025). D-Amino Acids. [online] Bocsci.com. Available at: https://aapep.bocsci.com/amino-acids/d-amino-acids-3260.html [Accessed 1 Mar. 2026].7. Can you discover additional helices in proteins?
There may be additional helices in proteins, such as secondary-structure helices, that may be discovered. For example, the Pi-Helices, often described as rare or uncommon, are found in approximately 15% of proteins and are often overlooked despite their important functions (Cooley et al., 2010).
References
Cooley, R.B., Arp, D.J. and Karplus, P.A. (2010). Evolutionary Origin of a Secondary Structure: π-Helices as Cryptic but Widespread Insertional Variations of α-Helices That Enhance Protein Functionality. Journal of Molecular Biology, 404(2), pp.232–246. doi:https://doi.org/10.1016/j.jmb.2010.09.034.8. Why are most molecular helices right-handed?
Most protein α-helices are right-handed because L-amino acids adopt dihedral angles that minimise steric clashes between side chains and the backbone. Left-handed alpha-helices are possible on the Ramachandran plot (2D visualisation of backbone dihedral angles for amino acid residues in proteins) but suffer more side chain-backbone collisions for L-amino acids, making them energetically unfavourable and rare in nature (Robinson and Afzal, 2014).
References
Robinson, S.W., Afzal, A.M. and Leader, D.P. (2014). Bioinformatics: Concepts, Methods, and Data. Handbook of Pharmacogenomics and Stratified Medicine, pp.259–287. doi:https://doi.org/10.1016/b978-0-12-386882-4.00013-x.9. Why do β-sheets tend to aggregate?
Beta-Sheets tend to aggregate because their exposed edges promote intermolecular hydrogen bonding that readily pairs with other beta-strands from separate proteins and misfolded chains. These associations form stable cross-beta structures, often aggregating into amyloid fibrils linked to diseases such as Alzheimer’s. (Richardson and Richardson, 2002).
References
Richardson, J.S. and Richardson, D.C. (2002). Natural β-sheet proteins use negative design to avoid edge-to-edge aggregation. Proceedings of the National Academy of Sciences, [online] 99(5), pp.2754–2759. doi:https://doi.org/10.1073/pnas.052706099.Part B: Protein Analysis and Visualization
1. Briefly describe the protein you selected and why you selected it.
For this exercise, I’m looking at Huwentoxin-IV (HwTx-IV) from Chinese bird spider venom (Cyriopagopus schmidti). It is a 35-amino-acid peptide that blocks pain signals by hitting the NaV1.7 channels (Sermadiras et al., 2013). These channels are voltage-gated sodium channels expressed in certain neurons, where they play an important role in the generation and transmission of pain-related information (Fouillet et al., 2017). This fits into one of my project propositions for the development of an analgesic cream for the treatment of arthritis pain as a non-opioid painkiller alternative. Understanding its 3D structure shows me exactly where the disulfide bonds need to form and which face binds to the NaV1.7 pain receptor, so when I clone it into E. coli, I will reduce the risks of misfolds in the protein or inactive peptide (Weiss et al., 2022). Essentially, the structure serves as the blueprint for developing a topical formulation for my design proposition.
References
Sermadiras, I., Revell, J., Linley, J.E., Sandercock, A. and Ravn, P. (2013). Recombinant Expression and In Vitro Characterisation of Active Huwentoxin-IV. PLoS ONE, 8(12), p.e83202. doi:https://doi.org/10.1371/journal.pone.0083202.Fouillet, A., Watson, J.F., Piekarz, A.D., Huang, X., Li, B., Priest, B.T., Nisenbaum, E.S., Sher, E. and Ursu, D. (2017). Characterisation of Nav1.7 functional expression in rat dorsal root ganglia neurons by using an electrical field stimulation assay. Molecular Pain, 13, p.174480691774517-174480691774517. doi:https://doi.org/10.1177/1744806917745179.Weiss, K., Racho, J. and Riemer, J. (2022). Compartmentalized disulfide bond formation pathways. Redox Chemistry and Biology of Thiols, pp.321–340. doi:https://doi.org/10.1016/b978-0-323-90219-9.00020-0.2. Identify the amino acid sequence of your protein.
Full precursor sequence:
MVNMKASMFLALAGLVLLFVVCYASESEEKEFSNELLSSVLAVD
DNSKGEERECLEIFKACNPSNDQCCKSSKLVCSRKTRWCKYQIGK
(National Library of Medicine, 2026)
The bold section of the amino acid sequence is the active section of the protein, as seen below on the RCSB PDB website, highlighted in purple:

References
PDB (2017). RCSB PDB - 1MB6:Three-dimensional solution structure of huwentoxin-IV by 2D 1H-NMR. [online] Rcsb.org. Available at: https://www.rcsb.org/structure/1MB6#entity-1 [Accessed 2 Mar. 2026].National Library of Medicine (2026). Ornithoctonus huwena huwentoxin-IV precursor, mRNA, complete cds - Nucleotide - NCBI. [online] Nih.gov. Available at: https://www.ncbi.nlm.nih.gov/nuccore/30575583 [Accessed 2 Mar. 2026].- How long is it? What is the most frequent amino acid?
The full precursor sequence is 89aa long (UniProt, 2026) whilst the active protein is 35aa (PDB, 2017). Additionally, the most frequent amino acid is Cysteine (C) that appears 6 times in the active protein.
Below is the Python Code as well as the initial error I got, which was fixed through Gemini Built-In Tool:


- How many protein sequence homologs are there for your protein?
When focusing specifically on the Huwentoxin-IV protein from Cyriopagopus schmidti (C. schmidti)and restricting the search by taxonomy, I identified 13 homologous sequences sharing between 42.0% and 73.8% sequence similarity with the selected protein.
When all taxonomic groups were included in the analysis, the number of homologous sequences increased immensely to 193 across spider species. Among these, five sequences showed the highest similarity to Huwentoxin-IV, ranging from 89.5% to 90.7%. Notably, these highly similar sequences were derived from Cyriopagopus hainanus, another species classified as a Chinese bird spider. (Uniprot: Blast Tool, 2026)
References
UniProt (2026). UniProt Blast. [online] UniProt. Available at: https://www.uniprot.org/blast/uniprotkb/ncbiblast-R20260302-171309-0587-87679188-p1m/overview [Accessed 2 Mar. 2026].- Does your protein belong to any protein family?
Yes, it belongs to the neurotoxin 10 (Hwtx-1) family, furthermore to the 22 (Htx-4) subfamily. I found this information through the Family & Domains section of my selected protein (UniProt, 2026).
3.Identify the structure page of your protein in RCSB
- When was the structure solved? Is it a good quality structure? A good quality structure is the one with good resolution. Smaller the better (Resolution: 2.70 Å)The structure was solved in August 2002 by authors Peng, K., Shu, Q., Liang, S.P. (PDB, 2017). In terms of resolution, the RCSB entry does not clearly indicate the resolution for the isolated toxin structure. However, according to the study “Employing NaChBac for cryo-EM analysis of toxin action on voltage-gated Na⁺ channels in nanodisc” by Gao et al. (2020), the structure of HWTX-IV bound to human Nav1.7 was obtained at an overall resolution of 3.2 Å, with the local resolution of the toxin improving from approximately 6 Å to approximately 4 Å. In structural biology, lower resolution values indicate higher structural quality. While a resolution of 2.70 Å would generally be considered good quality, the reported resolutions of 3.2–3.5 Å are moderate rather than high. Nevertheless, these resolutions are sufficient for visualising overall toxin docking.
References
PDB (2017). RCSB PDB - 1MB6: Three-dimensional solution structure of huwentoxin-IV by 2D 1H-NMR. [online] Rcsb.org. Available at: https://www.rcsb.org/structure/1MB6#entity-1 [Accessed 2 Mar. 2026].Gao, S., Valinsky, W.C., On, N.C., Houlihan, P.R., Qu, Q., Liu, L., Pan, X., Clapham, D.E. and Yan, N. (2020). Employing NaChBac for cryo-EM analysis of toxin action on voltage-gated Na+ channels in nanodisc. Proceedings of the National Academy of Sciences, [online] 117(25), pp.14187–14193. doi:https://doi.org/10.1073/pnas.1922903117.- Does your protein belong to any structure classification family?
Yes, it belongs to the inhibitor cystine knot structural family (Peng et al., 2002).
References
Peng, K., Shu, Q., Liu, Z. and Liang, S. (2002). Function and Solution Structure of Huwentoxin-IV, a Potent Neuronal Tetrodotoxin (TTX)-sensitive Sodium Channel Antagonist from Chinese Bird Spider Selenocosmia huwena. Journal of Biological Chemistry, 277(49), pp.47564–47571. doi:https://doi.org/10.1074/jbc.m204063200.4. Open the structure of your protein in any 3D molecule visualization software:
- Visualize the protein as “cartoon”, “ribbon” and “ball and stick”.After downloading my protein structure from RCSB PDB as a .cif file, I was able to open it in PyMol. I then played around with the commands on the right in the ’S’ section to alter the visual structure of my selected protein.


- Color the protein by secondary structure. Does it have more helices or sheets?

I coloured the protein by secondary structure in the cartoon representation. The green represents coils (loops), the yellow arrows represent sheets, and red represents helices. No red helices appear in the structure, while yellow sheet arrows are present. To verify this, I used the PyMOL count_items function to count residues assigned to each secondary structure type. The results showed 0 residues classified as helices and 6 residues classified as β-sheets (based on Cα atom counts). Therefore, the protein contains sheets but no helices. The remaining residues (274 atoms total) are classified as loops. Below is the following code used to colour and to verify this through PyMOL’s counting function:

- Color the protein by residue type. What can you tell about the distribution of hydrophobic vs hydrophilic residues?
Colour Code: Yellow = hydrophobic, Green = polar (uncharged), Blue = positively charged, Red = negatively charged.

ANALYSIS: The positively charged residues (blue and red) are strongly hydrophilic and seem to be concentrated more on the outer loops than the centre. The polar residues are usually also hydrophilic and appear spread throughout the protein structure. The hydrophobic residues (yellow) appear to be seen in the core of the sheet, but are also spread out around the full structure.
- Visualise the surface of the protein. Does it have any “holes” (aka binding pockets)?

ANALYSIS: Across the whole surface of the protein, only a single small cavity is apparent and could be identified as a binding pocket. Overall, the protein surface is relatively smooth and compact, which would explain why the protein doesn’t have any other molecules, as it may be limiting interactions by providing an unstable environment for binding.
Part C: Using ML-Based Protein Design Tools
1. Choose your favorite protein from the PDB

C.1 Protein Language Modeling
a. Use ESM2 to generate an unsupervised deep mutational scan of your protein based on language model likelihoods.

b. Can you explain any particular pattern? (choose a residue and a mutation that stands out)
Overall, the heatmap shows many positions with predominantly high scores (yellow/green), indicating mutation-tolerant sites, which contradicts the idea that smaller proteins would be more susceptible to mutations, disrupting protein function. Positions 8–16 stand out with a high concentration of blue (-4/-6), suggesting high mutation intolerance here. W mutations seem to be specifically prone to mutations wherever they are located in the sequence implyging implying that mutations in these amino acids could critically disrupt core stability or binding despite the region's flexibility.
C.2 Latent Space
a. Use the provided sequence dataset to embed proteins in reduced dimensionality.

b. Analyse the different formed neighbourhoods: do they approximate similar proteins?
The latent space map groups proteins into neighbourhoods where closer points share similar sequences, structures, or functions. For example, the circled group in the image below includes proteins from soil bacteria like Bacillus subtilis and Enterococcus faecalis, which have overlapping membrane and stress-response features, showing that the map captures biological similarity.

c. Place your protein in the resulting map and explain its position and similarity to its neighbours.

Below are the names and structures of three proteins in the same neighbourhood as my selected protein:
Image references:
- Topoisomerase IV subunit B and Focal adhesion kinase 1 from RCSB PDB Website
- Synthase alpha sunbunit, domain 1 from Uniprot Website
In the protein latent space, Huwentoxin-IV from Cyriopagopus schmidti sits in a region populated more by regulatory and interaction-focused proteins than by purely structural ones. Its closest neighbours include an ATPase domain from topoisomerase IV, along with fragments of chicken focal adhesion kinase and a synthase alpha subunit. Despite their differences in size, fold, and cellular context, these proteins all seem to rely on chemically specialised surfaces to influence the behaviour of other macromolecules. This suggests that the model is picking up on less global structural similarity and more on shared biophysical features, such as molecular interactions and regulation.
C.2 Protein Folding
1. Fold your protein with ESMFold. Do the predicted coordinates match your original structure?
The 35aa active part of the 89aa Huwentoxin-IV mini-protein, which I looked at in my Part B homework:

Yes, the predicted coordinates from ESMFold appear to match the original structure quite well. The overall beta sheet-rich fold is preserved, and the main secondary structure elements align closely with the original 35-aa active region of Huwentoxin-IV. While there may be minor differences in loop orientation or flexible regions, the core remains essentially the same.
I decided to also look at the full precursor protein, since I had not done so in Section B because of some initial confusion. Most papers mainly mention and highlight the 35-amino-acid mature toxin, and although the 89-aa precursor is listed on the PDB mini-protein page, opening the .cif file in PyMOL only shows the structure of the active toxin region. Since this is also the main focus of my final project idea (Option 3), it makes sense that this is where most researchers have concentrated their attention. Looking at the full precursor, we can see that what was previously a structure dominated by beta sheets now also includes a long alpha helix, which is therefore not part of the active toxin site.

2.Try changing the sequence: first try some mutations, then large segments. Is your protein structure resilient to mutations?
Processor Sequence:
MVNMKASMFLALAGLVLLFVVCYASESEEKEFSNELLSSVLAVDDNSKGEERECLEIFKACNPSNDQCCKSSKLVCSRKTRWCKYQIGK
(Bold: active protein)
MUTATION 1
The active part of the protein, located at 50-85 in the full precursor sequence, Amino acids (Tryptophan) W, Histidine (H) and Methionine (M) seem to have lower tolerance to mutations throughout that section of the sequence. I want to see if altering one of these specific amino acids in the active protein will produce a visible change in the structure.
Mutated Sequence 1: Changed Lysine (K) to Tryptophan (W) on 57aa
MVNMKASMFLALAGLVLLFVVCYASESEEKEFSNELLSSVLAVDDNSKGEERECLEIFWACNPSNDQCCKSSKLVCSRKTRWCKYQIGK

Analysis: It appears that even changing a singular amino acid in the active site of the protein has already significantly altered the local structure.
MUTATION 2
Secondly, I changed 3 amino acids in the protein’s active region, once again focusing on those at higher risk of mutation.
Mutated Sequence 2: Changed KAC to WHM on 57-59aa
MVNMKASMFLALAGLVLLFVVCYASESEEKEFSNELLSSVLAVDDNSKGEERECLEIFWHMNPSNDQCCKSSKLVCSRKTRWCKYQIGK

Analysis: Similar to the previous mutation, altering amino acids in the active site continues to modify the local structure. However, the active-site fold remains broadly similar across both mutations. Relative to the original structure, the beta strands seem to become progressively more separated, suggesting that increasing the number of mutations may gradually reduce the compactness of the native fold.
MUTATION 3
Additionally, I wanted to focus on the active protein as it also showed interesting results on the mutation heat map. Specifically, in amino acids E and D towards the start of the sequence (positions 11 and 13), which are located in the alpha helix section of the protein, show significantly low numbers on the scale (-6), suggesting high mutational intolerance. I therefore predict this would alter the structure/position of the helix.
Mutated sequence 3: Changed MFLALAGL to EDEDDEED on 8-16aa
MVNMKASEDEDDEEDVLLFVVCYASESEEKEFSNELLSSVLAVDDNSKGEERECLEIFWHMNPSNDQCCKSSKLVCSRKTRWCKYQIGK

Analysis: The mutation caused a major alteration in the protein’s overall structure, leading to significant distortion of the alpha helices and beta sheets and a noticeable change in the overall folding pattern.
MUTATION 4
Lastly, I also wanted to look at changing a larger section of my sequence and opted for the first 20 amino acids, as overall, the mutation heat map suggests a high tolerance to mutations. Therefore, I predict that altering this part of the sequence shouldn’t alter the overall structure too much.
Mutated Sequence 4: Changed MVNMKASMFLALAGLVLLFV to EDWWHMELFALALSQKICVD on 1-20aa
EDWWHMELFALALSQKICVDVCYASESEEKEFSNELLSSVLAVDDNSKGEERECLEIFWHMNPSNDQCCKSSKLVCSRKTRWCKYQIGK

Analysis: The mutation in the first 20 amino acids resulted in a significant alteration of the overall protein structure, with major disruptions to the alpha helices and beta sheets. This indicates that, despite predictions from the mutation heat map, this region is less tolerant to sequence changes than expected.
Conclusion: Overall, the protein shows very low resistance to mutations, likely due to its small size and peptide nature, which makes structural stability more sensitive to sequence changes.
C3. Protein Generation
Inverse-Folding a protein: Let’s now use the backbone of your chosen PDB to propose sequence candidates via ProteinMPNN
1. Analyze the predicted sequence probabilities and compare the predicted sequence vs the original one.

Using the backbone from Huwatoxin (PDB: 1MB6, active 35aa region) in ProteinMPNN, I generated a new sequence candidate:
EPKGENTPCTEENQNCDKEKNVECSPEKGACAPP
It scored 1.1293, lower than the original backbone’s 2.3287, showing it’s a plausible but less optimal fit to the structure. The sequence recovery of 0.2286 (23%) indicates low similarity to the wild-type (~9% identity), meaning a substantial redesign. Compared to the original’s mutation heat map, this designed sequence appears much more mutation-tolerant, suggesting improved structural robustness for engineering.
Heat map of the generated sequence candidate:

2. Input this sequence into ESMFold and compare the predicted structure to your original.

The sequence candidate model appears more looped and flexible, while the inverse-folded model on a fixed backbone is more compact and helical. Overall, this confirms a high fidelity visual overlap despite sequence changes, validating the design’s structural stability.
Part D. Group Brainstorm on Bacteriophage Engineering
Group of Commited Listeners LifeFabs: Sara Gaviria Escobar, Ruben Janssen, Justine de RiedmattenProposal: Engineering the MS2 Lysis Protein L to Enhance Stability Background
The MS2 bacteriophage lysis (L) is a 75 amino acid long-protein, and it is responsible for triggering host cell lysis; this is why it is also called a toxin from the group of bacteriophages (Mezhyrova, 2023). It is a powerful protein that has been widely used in studies where researchers seek to control cell death, but it is difficult to do so due to its instability (Mylon, 2010).
Objectives
To use computational protein design tools to engineer possible stable variants of the MS2 L-protein.
To identify variants where structure is preserved but higher stability is shown
Analyse if the variants can fold like the original protein and if they interact correctly with DnaJ, its chaperone
MethodsObtain an initial protein backbone for MS2 L-protein using ESMFold.Obtain alternative sequences to the protein using ProteinMPNN.Mutate the alternative sequences to the protein using ESM-2.Model the variants using ESMFold and analyse if the folding is maintained in comparison to the initial protein (MS2 L-protein). Create a ranking based on the confidence metric.sAssess the top 3 variants’ interactions with DnaJ using AlphaFold-Multimer to predict 3D structures of protein complexes (co-folding multiple chains)Expected Outcomes
Hopefully, these methods are able to identify some stabilized MS2 L-protein variants with correct folding and interaction with its chaperone, DnaJ. If successful, these designs could serve as templates for further experimental testing in E. coli and provide a methodology adaptable to other phage‑derived membrane proteins that also show decreased stability.Potential Challenges
The main concern would be generating variants with correct folding, but incorrect interaction with DnaJ, as computational models sometimes can not predict the dynamics of those interactions (Chamakura et al., 2017 & Mondal et al., 2024), thus disrupting the lysis mechanism.References
Mezhyrova, J., Martin, J., Börnsen, C., Dötsch, V., Frangakis, A. S., Morgner, N., & Bernhard, F. (2023). In vitro characterization of the phage lysis protein MS2-L. Microbiome Research Reports, 2(4), 28.Mylon, S. E., Rinciog, C. I., Schmidt, N., Gutierrez, L., Wong, G. C., & Nguyen, T. H. (2010). Influence of salts and natural organic matter on the stability of bacteriophage MS2. Langmuir, 26(2), 1035-1042.Chamakura, K. R., Tran, J. S., & Young, R. (2017). MS2 lysis of Escherichia coli depends on host chaperone DnaJ. Journal of Bacteriology, 199(12), 10-1128.Mondal, A., Singh, B., Felkner, R. H., De Falco, A., Swapna, G. V. T., Montelione, G. T., … & Perez, A. (2024). A Computational Pipeline for Accurate Prioritization of Protein‐Protein Binding Candidates in High‐Throughput Protein Libraries.Angewandte Chemie International Edition, 63(24), e202405767.Week 5: Protein Design II
Part 1: Generate Binders with PepMLM
1. Begin by retrieving the human SOD1 sequence from UniProt (P00441) and introducing the A4V mutation.Human SOD1 Sequence (154AA per monomer - 308AA):
MATKAVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFHVHEFGDNTAGCTSAGPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTLVVHEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQMATKAVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFHVHEFGDNTAGCTSAGPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTLVVHEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQ
A4V mutation: The alanine to valine mutation at codon 4 (A4V) of SOD1 causes a rapid and progressive form of amyotrophic lateral sclerosis (ALS). In Uniprot, however, it appears that the sequence starts with Methionine, making the mutation from Alanine to Valine actually at codon 5.
Mutation A4V (one of the monomers):
MATKVVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFHVHEFGDNTAGCTSAGPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTLVVHEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQ
References:
Saeed, M., Yang, Y., Deng, H-X., Hung, W-Y., Siddique, N., Dellefave, L., Gellera, C., Andersen, P.M. and Siddique, T. (2009). Age and founder effect of SOD1 A4V mutation causing ALS. Neurology, 72(19), pp.1634–1639. doi:https://doi.org/10.1212/01.wnl.0000343509.76828.2a.Using the PepMLM Colab linked from the HuggingFace PepMLM-650M model card:
2. Generate four peptides of length 12 amino acids conditioned on the mutant SOD1 sequence.
3. To your generated list, add the known SOD1-binding peptide FLYRWLPSRRGG for comparison.

Record the perplexity scores that indicate PepMLM’s confidence in the binders.
Asked for the built-in Gemini AI feature on the Colab for another code to get the pseudo perplexity of the known SOD1-binding peptide:

Part 2: Evaluate Binders with AlphaFold3
1. Navigate to the AlphaFold Server: alphafoldserver.com
For each peptide, submit the mutant SOD1 sequence followed by the peptide sequence as separate chains to model the protein-peptide complex.
When entering the pepitdes, when adding sequence numbered 2 in the table above, the AlphaFold Server wasn’t letting me enter the ‘X’ Amino Acid as it isn’t an actual amino acid and will be used in a sequence for an unknown or undetermined AA. I therefore had to alter this sequence:

2. Record the ipTM score and briefly describe where the peptide appears to bind. Does it localise near the N-terminus where A4V sits? Does it engage the β-barrel region or approach the dimer interface? Does it appear surface-bound or partially buried?

Binder 1: WLHKHHVVHLKK

The peptide doesn’t bind directly to the protein and is not located close to the N terminus (circled in red in the image above, always the orange end). It engages with the Beta barrel of the lefmonomer. It also appears to be surface-bound, as it is essentially exposed to the solvent and not in contact with the protein. I was initially confused about why I had two identical monomers, but after some research, I learned that SOD1 is a homodimer composed of two identical subunits that mirror one another.
Binder 2: WLRHHAAAHHKK

Once again, the peptide doesn’t localise close to the N terminus; it is close to the Beta barrel but doesn’t interact with it and is surface-bound.
Binder 3: WLRHHHVIHHKH

The peptide doesn’t localise close to the N terminus, and doesn’t engage with the
Beta barrel or the dimer interface and is surface-bound.
Binder 4: WLRHSAAIKHKK

The peptide doesn’t localise close to the N terminus, and is the closest out of all the other peptides to the Beta barrel. It is also surface-bound.
Controlled Binder: FLYRWLPSRRGG

The peptide doesn’t localise close to the N terminus, is slightly partially buried and approaches the dimer surface, where the two monomers join.
3. In a short paragraph, describe the ipTM values you observe and whether any PepMLM-generated peptide matches or exceeds the known binder.
The ipTM values of my generated peptides are in a range of 0.26 to 0.4, with an average of 0.33. In comparison, the known (control) binder has a score of 0.3. Ultimately, they are relatively similar, but 3/4 of my generated peptides have a higher score than the known binder. Overall, all the binders have a relatively low confidence in the predicted interface, translating to weak interactions between the protein and the binder.
Evaluate Properties of Generated Peptides in the PeptiVerse
1. Structural confidence alone is insufficient for therapeutic development. Using PeptiVerse, let’s evaluate the therapeutic properties of your peptide! For each PepMLM-generated peptide:Binder 1:
Binder 2:
Binder 3:
Binder 4:

Compare these predictions to what you observed structurally with AlphaFold3. In a short paragraph, describe what you see. Do peptides with higher ipTM also show stronger predicted affinity? Are any strong binders predicted to be hemolytic or poorly soluble? Which peptide best balances predicted binding and therapeutic properties?
Overall, all the binders have a weak binding affinity, which matches the ipTM scores described earlier. This also backs up the 3D structures where none of the peptides was closely interacting with the protein. In terms of solubility, all of them are soluble, which is therapeutically beneficial as it improves dissolution rates and increases absorption (Saeed et al., 2009). Furthermore, they are all non-hemolytic (low hemolysis probability), which is also therapeutically beneficial, as it ensures minimum red blood cell destruction (Cleveland Clinic, 2022). Looking at the net charge, SOD1 is negatively charged (-6 monomers) with the physiological pH of 7.4 (Shi et al., 2014), and all the binders are positively charged with a pH ranging from 2.28 to 3.93. This is generally advantageous as the positively charged binders will be attracted to the negatively charged SOD1 surface; however, when they are too positively charged, it may also cause the binders to bind to other negatively charged molecules. Lastly, in terms of molecular weight, all binders have a relatively low value, ranging from 1474.8 to 1636.9 Da, which minimises aggregation risks.
I believe the peptide which best balances predicted binding and therapeutic properties would be peptide 3 (WLRHHHVIHHKH), firstly due to having the highest ipTM score. Secondly, in terms of its therapeutic properties, it has the highest binding affinity out of all the peptides, as well as having high solubility and being non-hemolytic. Although all peptides had similar properties and scores, it appears to be the most appropriate option for further development.
References:
Cleveland Clinic (2022). Hemolysis: Types, causes & symptoms. [online] Cleveland Clinic. Available at: https://my.clevelandclinic.org/health/diseases/24108-hemolysis [Accessed 20 Apr. 2026].Saeed, M., Yang, Y., Deng, H-X., Hung, W-Y., Siddique, N., Dellefave, L., Gellera, C., Andersen, P.M. and Siddique, T. (2009). Age and founder effect of SOD1 A4V mutation causing ALS. Neurology, 72(19), pp.1634–1639. doi:https://doi.org/10.1212/01.wnl.0000343509.76828.2a.
with a value of around 1.0 and are non-hemolytic.Shi, Y., Abdolvahabi, A. and Shaw, B.F. (2014). Protein charge ladders reveal that the net charge of ALS‐linked superoxide dismutase can be different in sign and magnitude from predicted values. Protein Science, 23(10), pp.1417–1433. doi:https://doi.org/10.1002/pro.2526.Generate Optimised Peptides with moPPIt
Settings for moPPIt peptide generation towards specific residues and optimisation of the therapeutic properties:

I focused on Motif positions 1–8 to keep the design space close to the mutation site at position 4. I increased the weight of binding affinity to 2, as this is ultimately the most critical objective, while assigning similar weights to hemolysis, solubility, and motif to preserve overall therapeutic properties.
Generated moPPit peptides (37-minute runtime):
Briefly describe how these moPPit peptides differ from your PepMLM peptides. How would you evaluate these peptides before advancing them to clinical studies?The moPPit peptides generated differ from the ones I have generated on PepMLM in their binding affinity, solubility and hemolysis risks. moPPit peptides have a significantly higher affinity score (6.93-7.26) in comparison to the PepMLM ones. This could essentially be due to the motif position, as well as increasing the weight of affinity in the settings. In terms of solubility, the moPPit peptides have a lower solubility value than the PepMLM peptides, which makes them slightly less advantageous in this specific therapeutic property. Additionally, they have a significantly higher hemolysis value, resulting in potentially higher risks of red blood cell destruction. Ultimately, the moPPit peptides appear to bind more strongly but appear less favourable for safety and developability. To evaluate these peptides before advancing them to clinical studies, I would begin by testing their predictions in an in-silico model through in vitro experiments before any animal or human trial. Furthermore, I would conduct a safety screening (Cyprotex in vitro hemolysis testing) to evaluate the level of toxic hemolysis (Bauch, 2026).
References:
Bauch, C. (2026). Hemolysis | Cyprotex ADME-Tox Solutions. [online] Evotec Website (English). Available at: https://www.evotec.com/solutions/drug-discovery-preclinical-development/cyprotex-adme-tox-solutions/toxicology/mechanistic-toxicity/hemolysis [Accessed 25 Mar. 2026].Final Project: L-Protein Mutants
Option 1: Mutagenesis
Step 1: Gathering information about Lysis proteins
DNA sequence of Lysis Protein Phage MS2 (E. coli) 225bp:
atggaaacccgctttccgcagcagagccagcagaccccggcgagcaccaaccgccgccgcccgtttaaacatgaagattatccgtgccgccgccagcagcgcagcagcaccctgtatgtgctgatttttctggcgatttttctgagcaaatttaccaaccagctgctgctgagcctgctggaagcggtgattcgcaccgtgaccaccctgcagcagctgctgacc
Reference: EMBL-EBI (2026). ENA Browser. [online] Ebi.ac.uk. Available at: https://www.ebi.ac.uk/ena/browser/view/V00642 [Accessed 26 Mar. 2026].
Chaperone DNAj sequence (E. coli, strain K12) 1358bp:
TCGACGCTGAATTTGAAGAAGTCAAAGACAAAAAATAATCGCCCTATAAACGGGTAATTA
TACTGACACGGGCGAAGGGGAATTTCCTCTCCGCCCGTGCATTCATCTAGGGGCAATTTA
AAAAAGATGGCTAAGCAAGATTATTACGAGATTTTAGGCGTTTCCAAAACAGCGGAAGAG
CGTGAAATCAGAAAGGCCTACAAACGCCTGGCCATGAAATACCACCCGGACCGTAACCAG
GGTGACAAAGAGGCCGAGGCGAAATTTAAAGAGATCAAGGAAGCTTATGAAGTTCTGACC
GACTCGCAAAAACGTGCGGCATACGATCAGTATGGTCATGCTGCGTTTGAGCAAGGTGGC
ATGGGCGGCGGCGGTTTTGGCGGCGGCGCAGACTTCAGCGATATTTTTGGTGACGTTTTC
GGCGATATTTTTGGCGGCGGACGTGGTCGTCAACGTGCGGCGCGCGGTGCTGATTTACGC
TATAACATGGAGCTCACCCTCGAAGAAGCTGTACGTGGCGTGACCAAAGAGATCCGCATT
CCGACTCTGGAAGAGTGTGACGTTTGCCACGGTAGCGGTGCAAAACCAGGTACACAGCCG
CAGACTTGTCCGACCTGTCATGGTTCTGGTCAGGTGCAGATGCGCCAGGGATTCTTCGCT
GTACAGCAGACCTGTCCACACTGTCAGGGCCGCGGTACGCTGATCAAAGATCCGTGCAAC
AAATGTCATGGTCATGGTCGTGTTGAGCGCAGCAAAACGCTGTCCGTTAAAATCCCGGCA
GGGGTGGACACTGGAGACCGCATCCGTCTTGCGGGCGAAGGTGAAGCGGGCGAGCATGGC
GCACCGGCAGGCGATCTGTACGTTCAGGTTCAGGTTAAACAGCACCCGATTTTCGAGCGT
GAAGGCAACAACCTGTATTGCGAAGTCCCGATCAACTTCGCTATGGCGGCGCTGGGTGGC
GAAATCGAAGTACCGACCCTTGATGGTCGCGTCAAACTGAAAGTGCCTGGCGAAACCCAG
ACCGGTAAGCTATTCCGTATGCGCGGTAAAGGCGTCAAGTCTGTCCGCGGTGGCGCACAG
GGTGATTTGCTGTGCCGCGTTGTCGTCGAAACACCGGTAGGCCTGAACGAAAGGCAGAAA
CAGCTGCTGCAAGAGCTGCAAGAAAGCTTCGGTGGCCCAACCGGCGAGCACAACAGCCCG
CGCTCAAAGAGCTTCTTTGATGGTGTGAAGAAGTTTTTTGACGACCTGACCCGCTAACCT
CCCCAAAAGCCTGCCCGTGGGCAGGCCTGGGTAAAAATAGGGTGCGTTGAAGATATGCGA
GCACCTGTAAAGTGGCGGGGATCACTCCCATAAGCGCT
Reference: EMBL-EBI (2026). ENA Browser. [online] Ebi.ac.uk. Available at: https://www.ebi.ac.uk/ena/browser/view/M12565
Conserved sites and known mutational effects:
Lysis proteins, and in particular single-gene lysis (Sgls) proteins, often feature conserved sequence sites despite their diversity. Analysis of L sequences from related F-specific MS2-like phages reveals four key elements in the L protein: a basic N-terminus, a hydrophobic region, a conserved Leu(48)-Ser(49) (LS) dipeptide essential for type II Sgl lytic function, and a variable C-terminus.(Antillon et al., 2024). Additionally, mutational studies of the MS2 lysis protein L demonstrate that several residues in the central domains are highly conserved across most homologs (Chamakura et al., 2017).
Figure: Conserved LS and similarity of central domains
Reference: Mutational analysis of the MS2 lysis protein L. Microbiology
doi:https://doi.org/10.1099/mic.0.000485.
The figure below analyses the mutational analysis of MS2 L. The sequence is separated into its four domains through the numbered boxes at the bottom of the figure. The Amino acids above the L sequence show single amino acid mutations (missense mutation) that have a lysis defect, but don’t cause protein accumulation. Below the sequence are single amino acid mutations that don’t affect lysis. The amino acids with green asterisks in the L sequence show where a single nucleotide of the sequence could be changed by a stop codon (nonsense mutation). (Chamakura et al., 2017)
Figure: Mutational Analysis of MS2 L.
Reference: Mutational analysis of the MS2 lysis protein L. Microbiology
doi:https://doi.org/10.1099/mic.0.000485.
References:
Chamakura, K.R., Edwards, G.B. and Young, R. (2017). Mutational analysis of the MS2 lysis protein L. Microbiology, [online] 163(7), pp.961–969. doi:https://doi.org/10.1099/mic.0.000485.S. Francesca Antillon, Bernhardt, T.G., Karthik Chamakura and Young, R. (2024). Physiological characterisation of single-gene lysis proteins. Journal of bacteriology. doi:https://doi.org/10.1128/jb.00384-23.Step 2: Approach and Sequence variants
Below is the downloaded top 20 L-protein mutants dataset that I got from the ESM-2 Colab:

The LLR Scores: A higher log-likelihood ratio is considered to mean the protein mutation of that amino acid in the specific location of the sequence is more supported, whereas a lower LLR score shows a low tolerance for a mutation.
Now continuing through the Colab, I managed to get the top 10 of the mutations and added the experimental data as a cvs file to also gather the top 10 mutations from the data:
Top 10 from Colab:

Experimental data:

Top 10 of experimental Data:

Step 3: Filtering and Ranking sequences
To determine the location of the mutations, it is important to identify the soluble N-terminus (1–39 aa) and the transmembrane domain (40–75 aa) (Mezhyrova et al., 2023). As shown in the tables above, all experimental and generated mutations are located within these regions of the sequence.
By comparing the experimental data with the mutations generated in Colab, it is possible to identify the most likely mutations present in both datasets. The highest LLR-scored mutations are mainly located in the N-terminal region, particularly at position 29, with one mutation also found in the transmembrane domain at position 50. Since higher LLR scores indicate greater mutational tolerance, these mutations appear to be the most tolerated. The three mutations that appear in both datasets are:
K50LC29LY39LC29RC29SReferences:
Mezhyrova J., Martin, J., Börnsen, C., Volker Dötsch, Achilleas Stefanos Frangakis, Morgner, N. and Bernhard, F. (2023). In vitro characterization of the phage lysis protein MS2-L. Microbiome Research Reports, 2(4). doi:https://doi.org/10.20517/mrr.2023.28.p>Step 4: Submit 5 mutated sequences
*Note: in all the structure images below, the top protein is the original, and the bottom one is the mutated structure*
Mutation 1: K50L
Mutated sequence:
METRFPQQSQQTPASTNRRRPFKHEDYPCRRQQRSSTLYVLIFLAIFLSLFTNQLLLSLLEAVIRTVTTLQQLLT

The mutated sequence seems slightly more elongated and compacted than the original sequence.
Mutation 2: C29L
Mutated sequence:
METRFPQQSQQTPASTNRRRPFKHEDYPLRRQQRSSTLYVLIFLAIFLSKFTNQLLLSLLEAVIRTVTTLQQLLT

My observations reveal no noticeable structural differences between the original and mutated sequences.
Mutation 3: Y39L
Mutated sequence:
METRFPQQSQQTPASTNRRRPFKHEDYPCRRQQRSSTLLVLIFLAIFLSKFTNQLLLSLLEAVIRTVTTLQQLLT

The original sequence exhibits more elongated yet compacted alpha helices compared to the mutated sequence.
Mutation 4: C29R
Mutated sequence:
METRFPQQSQQTPASTNRRRPFKHEDYPRRRQQRSSTLYVLIFLAIFLSKFTNQLLLSLLEAVIRTVTTLQQLLT

The mutated sequence produces a slightly more elongated structure compared to the more compact alpha-helical fold of the original protein. This suggests the mutation C29S in the N-terminus slightly impacts secondary structure without fully destabilising the fold.
Mutation 5: C29S
Mutated sequence:
METRFPQQSQQTPASTNRRRPFKHEDYPSRRQQRSSTLYVLIFLAIFLSKFTNQLLLSLLEAVIRTVTTLQQLLT

Similarly to the previous mutation in the same location, the mutated sequence produces a slightly more elongated structure compared to the more compact alpha-helical fold of the original protein.
Week 6 HW: Genetic Circuits Part I: Assembly Technologies
Assignment: DNA Assembly
1. What are some components in the Phusion High-Fidelity PCR Master Mix, and what is their purpose?
Phusion High-Fidelity PCR Master Mix offers high fidelity and performance for PCR. It consists of Phusion DNA polymerase (1), deoxynucleotides and reaction buffer that has been optimised and also includes MgCl2 (New England Biolabs, 2026). The DNA Polymerase allows for the rapid synthesis of a new DNA strand with high accuracy, as it generates long templates with a single enzyme (Thermo Scientific, 2018). The deoxynucleotides are the essential building blocks for the DNA polymerase to proceed with the synthesis of a new DNA strand, whilst the buffer is what provides the ideal environment for the DNA polymerase to function, resulting in high-yield and fidelity.
References:
New England Biolabs (2026).PhusionTM High-Fidelity PCR Master Mix with HF Buffer. [online] Neb.com. Available at: https://www.neb.com/en-gb/products/m0531-phusion-high-fidelity-pcr-master-mix-with-hf-buffer?srsltid=AfmBOooj2e0WKxJyWmTMyut6egCuogqvdwaFvEZFcWb2_X7GBZSD_15h [Accessed 20 Mar. 2026].Thermo Scientific (2018). Thermo Scientific Phusion High-Fidelity PCR Master Mix #_ Lot _ Expiry Date _ Store at -20 °C Ordering information. [online] Available at: https://documents.thermofisher.com/TFS-Assets/LSG/manuals/MAN0012771_Phusion_HiFi_PCR_MasterMix_100rxn_UG.pdf [Accessed 20 Mar. 2026].2. What are some factors that determine primer annealing temperature during PCR?
The main factor that determines primer annealing temperature during PCR is the calculation of the melting temperature of the primers selected for the amplification. The general rule is to start with an annealing temperature of 3-5°C lower than the lowest melting temperature of the primers (ThermoFisher Scientific, 2019). Another factor that will further determine the temperature is the length of the primers, as they have higher melting temperatures. If the annealing temperature exceeds a temperature of 65°C instead of an optimal temperature of approximately 54°C, there are risks of secondary annealing (Andreas Ebertz, 2022). Furthermore, salt concentration (NA+) further impacts primer annealing as higher levels allow for higher annealing temperature (ThermoFisher Scientific, 2019).
References:
ThermoFisher Scientific (2019). PCR Cycling Parameters—Six Key Considerations for Success | Thermo Fisher Scientific - UK. [online] Thermofisher.com. Available at: https://www.thermofisher.com/uk/en/home/life-science/cloning/cloning-learning-center/invitrogen-school-of-molecular-biology/pcr-education/pcr-reagents-enzymes/pcr-cycling-considerations.html [Accessed 20 Mar. 2026].Ebertz, A. (2022). Primer Design Guide. [online] The DNA Universe BLOG. Available at: https://the-dna-universe.com/2022/09/05/primer-design-guide-the-top-5-factors-to-consider-for-optimum-performance/ [Accessed 20 Mar. 2026].3. There are two methods from this class that create linear fragments of DNA: PCR, and restriction enzyme digests. Compare and contrast these two methods, both in terms of protocol as well as when one may be preferable to use over the other.
Polymerase Chain Reaction (PCR) is a laboratory nucleic acid amplification technique used to generate multiple copies of a specific target DNA sequence in vitro. It relies on repeated cycles of thermal denaturation, primer annealing, and extension by a DNA polymerase, leading to exponential amplification of the target DNA section and enabling its detection and analysis. (Khehra et al., 2023).
Protocol(Khehra et al., 2023):
- PCR begins with the extraction of a small nucleic acid sample of DNA or RNA into a reaction tube.
- Denaturation: the DNA is heated to 95°C to break the hydrogen bonds between its complementary base pairs of the double-stranded DNA, separating them into two seperate strands.
- Annealing: the denatured DNA is then cooled down to a temperature of 55°C to 72°C, which allows the binders to bind to each complementary sequence on single-stranded DNA through pairing their 3’ ends to the template strand, providing a start point for DNA synthesis.
- Extension: during the final stage, the temperature is raised to 75°C to 80°C to optimise the DNA polymerase, which promotes strand elongation of the new strand.
DNA polymerase synthesises in a 3’ to 5’ direction, generating new sequences complementary to the template strand. The full process is regulated by the use of a thermal cycler that regulates the time and temperature of each step mentioned above. Conducting multiple cycles results in the amplification of multiple copies of the target DNA within the tube.
Restriction enzyme digestion is a process that involves the cutting of DNA into fragments by restriction enzymes (endonucleases) at specific recognition sequences. This results in DNA fragments that vary in size depending on their location in these recognition sequences. It is usually performed in a microcentrifuge tube with the necessary components: the template DNA, restriction enzymes and Mg2+ under the right conditions (Shen, 2019). There are two possible products depending on the used restriction enzymes; the first one creates staggered cuts (majority of cuts), leaving single-stranded overhangs that will easily anneal to complementary DNA strands (sticky) and the second results in cuts in the middle of the recognition sequence, leaving no overhangs (blunt-end) causing them to ligate less and are therefore harder to clone (QIAGEN, 2013).
Protocol (QIAGEN, 2013):
- Pipette reaction components into a tube (water, DNA, buffer, enzyme)
- Centrifuge the tube.
- Incubate the content in a water bath (usually around 37°C for 1-4 hours).
- In certain applications, it is needed to heat-inactivate the enzyme after digestion ( around 65°C for 20 minutes).
Ultimately, PCR is the amplification of specific DNA sequences to create large amounts of copies, whilst restriction enzyme digestion uses endonucleases to cut DNA in specific sites, meaning they don’t serve the same purpose. PCR is quantity-based, whilst restriction enzyme digestion is used to create smaller DNA fragments. Furthermore, PCR synthesises new DNA strands whilst PCR cuts already existing DNA strands. Lastly, PCR relies on the binding of primers to the DNA strands, whilst restriction enzyme digestion relies on the recognition site of the enzyme.
References:
Khehra, N., Padda, I.S. and Swift, C.J. (2023). Polymerase Chain Reaction (PCR). [online] PubMed. Available at: https://www.ncbi.nlm.nih.gov/books/NBK589663/ [Accessed 20 Mar. 2026].Shen, C.-H. (2019). Restriction Digest - an overview | ScienceDirect Topics. [online] www.sciencedirect.com. Available at: https://www.sciencedirect.com/topics/biochemistry-genetics-and-molecular-biology/restriction-digest [Accessed 20 Mar. 2026].QIAGEN (2013). Restriction endonuclease digestion of DNA. [online] www.qiagen.com. Available at: https://www.qiagen.com/us/knowledge-and-support/knowledge-hub/bench-guide/dna/handling-dna/restriction-endonuclease-digestion-of-dna [Accessed 20 Mar. 2026].4. How can you ensure that the DNA sequences that you have digested and PCR-ed will be appropriate for Gibson cloning?
To ensure that the DNA sequences will be appropriate, several factors need to be considered. Firstly, Gibson Assembly are designed to be 20 to 40 nucleotides long (Yu, 2021); it is essential for successful annealing that the primers are designed within that range of base pairs. In addition to this, Gibson Assembly’s optimal temperature melting point typically ranges between 60°C and 70°C, so making sure the primers from PCR also have a similar temperature melting point will ensure a more successful assembly. Additionally, using high-fidelity PCR instead of regular PCR will reduce the risks of mutations during amplification (Eggert et al., 2005). Furthermore, purifying your DNA post-PCR to eliminate any unnecessary byproducts will increase the accuracy of Gibson cloning.
References:
Yu, C. (2021). What is the optimal Tm temperature for overlaps in Gibson Assembly? | ResearchGate. [online] ResearchGate. Available at: https://www.researchgate.net/post/What_is_the_optimal_Tm_temperature_for_overlaps_in_Gibson_Assembly [Accessed 20 Mar. 2026].Eggert, T., Funke, S.A., Rao, N.M., Acharya, P., Krumm, H., Reetz, M.T. and Jaeger, K.-E. (2005). Multiplex-PCR-Based Recombination as a Novel High-Fidelity Method for Directed Evolution. ChemBioChem, 6(6), pp.1062–1067. doi:https://doi.org/10.1002/cbic.200400417.5. How does the plasmid DNA enter the E. coli cells during transformation?
There are two seperate methods of how the plasmid DAN enters the E. Coli cells during transformation:
Chemical Competence method: This method consists of growing E. coli to log phase, which is then chilled on ice and further treated with ice-cold CaCl₂. This binds cell membrane lipids and DNA phosphates, causing the neutralisation of charges, making the negatively charged plasmid DNA less repelled by the membrane. The mix is then heat-shocked at 42°C for 30-90 seconds to temporarily open membrane pores for DNA entry. Lastly, the cell is iced again shortly to reseal the membrane (Froger and Hall, 2007)
Electroporation: The cells ells are mixed with DNA and zapped with a high-voltage pulse (e.g., 2.5 kV). This creates transient membrane holes through dielectric breakdown. Ultimately, the electric field drives charged DNA into the cell before the pores close (Biology LibreTexts, 2021)
References:
Froger, A., & Hall, J. E. (2007). Transformation of plasmid DNA into E. coli using the heat shock method. Journal of visualized experiments : JoVE, (6), 253. https://doi.org/10.3791/253Biology LibreTexts (2021) Transforming E. coli. Available at: https://bio.libretexts.org/Bookshelves/Biotechnology/Lab_Manual:_Introduction_to_Biotechnology/01:_Techniques/1.13:_Transformation6. Describe another assembly method in detail (such as Golden Gate Assembly)
Part 1: Explain the other method in 5 - 7 sentences plus diagrams (either handmade or online).
Golden Gate Assembly is a DNA cloning technique that allows multiple fragments to be combined in a defined order without leaving extra sequences at the junctions. It uses Type IIS restriction enzymes, such as BsaI, which cut DNA outside of their recognition sites to create specific 4-base overhangs. These overhangs are designed so that each fragment connects only to its correct neighbours.
The process takes place in a single reaction mixture that alternates between temperatures, favouring enzyme cutting (37°C) and ligation by T4 DNA ligase (16°C), which aids in correcting incorrect assemblies automatically. To avoid unintended cleavage, DNA sequences are “domesticated” by removing internal enzyme recognition sites. Overall, this method is more efficient than traditional cloning techniques and is especially useful for building complex genetic systems (Bird et al., 2022).
Figure 1: Golden Gate Assembly vs. Traditional Cloning
Image reference: New England Biolabs (2024). Getting Started with Golden Gate Assembly. [online] Neb.com. Available at: https://www.neb.com/en-gb/nebinspired-blog/getting-started-with-golden-gate?srsltid=AfmBOoo7ZCRSqsQ1DI7fCFa1awuGBgToH2L_sRbw7czYQBuj4tPjf8lh [Accessed 20 Mar. 2026].
References:
Bird, J.E., Marles-Wright, J. and Giachino, A. (2022). A User’s Guide to Golden Gate Cloning Methods and Standards. ACS Synthetic Biology, 11(11). doi:https://doi.org/10.1021/acssynbio.2c00355.Part 2: Model this assembly method with Benchling or Asimov Kernel!
To start, I selected the pGGA backbone (a common domesticated vector used for Golden Gate assembly) to use in my construct. After obtaining the sequence and adding it to Benchling as a circular plasmid map, I inserted the PsiH codon-optimised enzyme sequence from a previous assignment. This enzyme, involved in psilocybin biosynthesis, aligns with the concept for my Project 1 final design.
PGGAselect: 2020bp DNA
Full sequence and details of the cloning vector are available at:
https://www.neb.com/en/-/media/nebus/page-images/tools-and-resources/interactive-tools/dna-sequences-and-maps/text-documents/pggaselectgbk.txt?rev=5882d0f571c34713a43c67e6d64c25ea&hash=5C1F89A2A1E3E98F94CA667DE322DDF0

Week 7 HW: Genetic Circuits Part II: Neuromorphic Circuits
Assignment Part 1: Intracellular Artificial Neural Networks (IANNs)
1) What advantages do IANNs have over traditional genetic circuits, whose input/output behaviours are Boolean functions?
The main limitation of traditional genetic circuits is their restricted control over the strength, timing, and cellular context of therapeutic effects, as their input–output behaviour is usually constrained to simple Boolean logic. In contrast, IANNs can provide finer control of gene expression and cellular behaviour by tuning promoters, repressors, and other genetic components, and they can also sum and weight multiple inputs within a single network, rather than relying on many individually wired logic gates, which become complex and error‑prone.
References:
Carneiro, D.C., Rocha, C., Patrícia K. F. Damasceno, Barbosa, V. and Soares, P. (2024). Therapeutic applications of synthetic gene/genetic circuits: a patent review. Frontiers in Bioengineering and Biotechnology, 12. doi:https://doi.org/10.3389/fbioe.2024.1425529.Seak, L.C.U., Lo, O.L.I., Suen, W.C.-W. and Wu, M.-T. (2021). Next-generation biocomputing: mimicking artificial neural network with genetic circuits. bioRxiv. doi:https://doi.org/10.1101/2021.03.12.435120.2) Describe a useful application for an IANN; include a detailed description of input/output behaviour, as well as any limitations an IANN might face to achieve your goal.
Exploring my Final Project 2 idea (Bacillus subtilis biofilm paint), creating IANN genetic circuits in this spore-forming bacterium could transform the living paint into a smarter system for VOC pollution mapping. Rather than only showing simple surface darkening, the paint could form responsive biofilm patterns that record not just the presence of pollution, but also the environmental context in which exposure occurred. This would create a richer “pollution memory map,” making the project more useful for long-term environmental monitoring, scientific analysis, and the development of an educational and artistic installation.
I was inspired by the paper “A single-layer artificial neural network with engineered bacteria” by Sarkar et al. (2020). In this work, the authors designed an IANN in which extracellular chemical signals acted as inputs. These signals were linearly combined and processed through a nonlinear log-sigmoid activation function generated by synthetic genetic circuits. Each artificial neuron was encoded in bacteria, with weight and bias values tuned by engineering molecular interactions within the cell. This allowed bacterial “neurons” to perform specific logical functions, effectively creating an ANN using living cells.
For this living paint system, the IANN would integrate several environmental inputs. Different VOCs could be detected through separate promoter pathways, where each promoter contributes a weighted signal to the final pigmentation response. For example, VOC1 and VOC2 could each activate distinct sensory promoters whose outputs are combined inside the circuit. Additional contextual inputs could include airflow-sensitive signalling, which would influence spore germination and help distinguish stagnant pollution from outdoor exposure, and UV exposure, which would act as an indicator of spore viability and environmental stress. The output would be a nonlinear pigmentation or biofilm pattern response, where the intensity, shape, or layering of darkening reflects the combined history of the environment’s pollution.
A major limitation of this system is the dormant nature of Bacillus subtilis spores. Because IANN function depends on active transcription and translation, the circuit can only “compute” during short wet periods when spores germinate. During prolonged dry weather or strong UV exposure, the spores remain dormant, meaning the paint cannot continuously process inputs. This could lead to inconsistent activation and patchy or faded biofilm patterns over time. Another important challenge is mutation and genetic instability. A multi-gene IANN containing several nodes alongside VOC sensing pathways would be at high metabolic risk of creating mutations, especially over months of outdoor exposure. There are also regulatory and public acceptance limitations. Engineered spore-forming bacteria with multi-input decision-making circuits would likely face greater biosafety scrutiny than simpler VOC-degrading systems. Finally, the nonlinear nature of the IANN output may make interpretation difficult. Because the visible darkening is based on multiple weighted inputs, complex shade gradients may be hard for scientists to decode into a clear pollution history. While this complexity improves sensing, it may reduce interpretability unless paired with computational decoding tools.
References:
Sarkar, K., Bonnerjee, D. and Bagh, S. (2020). A single-layer artificial neural network with engineered bacteria. doi:https://doi.org/10.48550/arXiv.2001.00792.3) Draw a diagram for an intracellular multilayer perceptron where layer 1 outputs an endoribonuclease that regulates a fluorescent protein output in layer 2.

Assignment Part 2: Fungal Materials
1. What are some examples of existing fungal materials, and what are they used for? What are their advantages and disadvantages over traditional counterparts?
Fungal materials have been an increasing area of development for the production of sustainable alternative materials, in particular, mycelium, the underground root network of fungi. There are two primary ways of processing mycelium into a material.
Composite mycelium materials (CMM): An organic growing material (medium) like woodchips or agricultural waste is pressed inside a mould and is then inoculated with mycelium, allowing for the network to grow and fill in the cracks of the medium, creating a solid material. In order to stop the mycelium from growing, the material is dehydrated through heat or compression. The outcome is a solid foam-like material whose characteristics can be manipulated depending on the selected growing medium. All the parts of this processing technique are composed of biological elements, resulting in its biodegradability over a short period of time.
Pure mycelium materials (PMM): The mycelium is grown without a medium, but still has a nutrient source to allow its growth. Once grown. It can be combined with additional processing techniques, allowing greater control over the final product. For example, the material can be coated with a chemical to increase its durability, but this can make it non-biodegradable. In comparison to CMM, it allows the production of flexible materials and its characteristics can be altered depending on the strain of fungi used.
Examples of mycelium materials:
Mycelium has been expanding in the world of textiles thanks to a 2018 patent that enabled the use of fungi in textile production. Mycelium leather has been tested and proven to, in some cases, have a higher tensile strength and resistance to tearing than synthetic leather (Gandia et al., 2021). However, it may degrade more quickly unless treated with a plastic coating. The company MycoWorks has been exploring how to replicate woven fabric with mycelium by directing the orientation of its growth. In theory, this would allow to create a stronger woven material. Another example of mycelial materials includes packaging such as CMM styrofoam (hemp and mycelium), developed by a company called Ecoactive (Ecoactive LLC, 2026). Furthermore, it is being explored as a paper-like material produced by growing pellets of fungi that are agitated into a pulp. This is a fast process, but it is prone to contamination. In construction, mycelium has a lot of advantageous properties, such as high thermal insulation and fire resistance, whilst still allowing for good airflow. Currently, it is being used for carbon-neutral architecture (Myceen, 2024) in a variety of materials (bricks, insulation, pastes…).
Lastly, I discovered that mushroom material can be grown to carry out an electrical wiring pattern. In Ecovative’s patent, a fungus is grown on a nutrient medium (like potato dextrose agar or broth) that contains metal salts such as copper sulfate or copper chloride. As the mycelium spreads, it absorbs these metals and forms thin sheets that grow in the shape of the desired circuit pattern, creating conductive “wires” within the fungal material (Cerimi et al., 2019). In comparison to standard circuits, it would essentially be compostable as well as having a lower carbon footprint in production. In terms of functionality, being a ‘living electronic’ could, in theory, be combined with self-regenerative repair and sensing (Adamatzky, 2022).
Advantages
They are biodegradable, so they can break down over a relatively short time.Their processing generally involves lower toxicity and energy use than that produced from petrochemicals.By changing the fungal strain, substrate and post-processing, the same system can be adapted for many different applications.They can have a lower overall carbon footprint than conventional materials, especially when they replace high‑impact products.In some cases, they can be designed as “living” or responsive materials (e.g. self‑healing structures or conductive), which is not possible with inert synthetic materials.Disadvantages
When kept fully bio-based and uncoated, they are often less durable and more sensitive to biological decay than traditional materials.Their mechanical properties can be more variable and sometimes lower than standardised industrial materials, which makes engineering and certification more challenging.They usually take longer to produce because the mycelium needs time to grow and colonise the substrate, slowing down manufacturing cycles.At the moment, they are often more expensive than mass‑produced plastics, foams or leathers.They can be sensitive to environmental conditions such as humidity and contamination, which can limit certain outdoor or high‑load uses unless additional protection is added.Being a relatively new class of materials, they come with more uncertainty around regulation, long‑term performance and risk assessment compared to well‑known conventional materials.References:
Adamatzky, A., Ayres, P., Beasley, A.E., Chiolerio, A., Dehshibi, M.M., Gandia, A., Albergati, E., Mayne, R., Nikolaidou, A., Roberts, N., Tegelaar, M., Tsompanas, M.-A., Phillips, N. and Wösten, H.A.B. (2022). Fungal electronics. Biosystems, 212, p.104588. doi:https://doi.org/10.1016/j.biosystems.2021.104588.Ecoactive (n.d.). Ecovative. [online] Ecovative. Available at: https://ecovative.com [Accessed 26 Mar. 2026].Cerimi, K., Akkaya, K.C., Pohl, C., Schmidt, B. and Neubauer, P. (2019). Fungi as source for new bio-based materials: a patent review. Fungal Biology and Biotechnology, 6(1). doi:https://doi.org/10.1186/s40694-019-0080-y.Gandia, A., van den Brandhof, J.G., Appels, F.V.W. and Jones, M.P. (2021). Flexible Fungal Materials: Shaping the Future. Trends in Biotechnology, 39(12). doi:https://doi.org/10.1016/j.tibtech.2021.03.002.Myceen (2024). Materials for carbon-neutral architecture. [online] Myceen.com. Available at: https://myceen.com/about [Accessed 26 Mar. 2026].Pale Blue dots (2026). Fungi-based materials. [online] Notion. Available at: https://pbdvc-research.notion.site/Fungi-based-materials-3b088667784f416e90169be831fb6105 [Accessed 26 Mar. 2026].1. What might you want to genetically engineer fungi to do and why? What are the advantages of doing synthetic biology in fungi as opposed to bacteria?
For my project idea of developing an IBD treatment via psilocybin's effects on gut microbiota and serotonin signalling, a key challenge is confirming whether microbial reshaping is a direct effect of psilocybin (Fung et al., 2019),(Robinson, et al., 2023). I would therefore propose to start with the engineering of one psilocybin-producing fungal strain (Psilocybe cubensis) and genetically engineer variants that differ by only one feature, for example: one with active PsiH enzyme (IPsiH catalyses the 4-hydroxylation of tryptamine to 4-hydroxytryptamine, which is an essential and unique part of psilocybin biosynthesis that allows for the production of psilocin) (Huang et al., 2025), one with PsiH knocked out, one with tuned production levels (via promoter strength), and one producing an inactive analogue. These variants could be applied to ex vivo gut models to compare microbiome composition and immune markers, isolating whether psilocybin itself and not other mushroom components drives changes. Additionally, a reporter circuit with fluorescence under a gut condition-responsive promoter would track fungal activity timing and location. This genetic engineering approach would provide a controlled platform to test the hypothesis of psilocybin's gut therapeutic potential.
References:
Dragos Ciocan and Eran Elinav (2023). Engineering bacteria to modulate host metabolism. Acta physiologica, 238(3). doi:https://doi.org/10.1111/apha.14001.Fung, T.C., Vuong, H.E., Luna, C.D.G., Pronovost, G.N., Aleksandrova, A.A., Riley, N.G., Vavilina, A., McGinn, J., Rendon, T., Forrest, L.R. and Hsiao, E.Y. (2019). Intestinal serotonin and fluoxetine exposure modulate bacterial colonization in the gut. Nature Microbiology. doi:https://doi.org/10.1038/s41564-019-0540-4.Gregory Ian Robinson, Li, D., Wang, B., Rahman, T., Gerasymchuk, M., Hudson, D., Kovalchuk, O. and Kovalchuk, I. (2023). Psilocybin and Eugenol Reduce Inflammation in Human 3D EpiIntestinal Tissue. Life, 13(12), pp.2345–2345. doi:https://doi.org/10.3390/life13122345.Huang, Z., Yao, Y., Di, R., Zhang, J., Pan, Y. and Liu, G. (2025). De Novo Biosynthesis of Antidepressant Psilocybin in Escherichia coli. Microbial biotechnology, [online] 18(4), p.e70135. doi:https://doi.org/10.1111/1751-7915.70135.Week 9 HW: Cell - free systems
Homework Part A: General and Lecturer-Specific Questions
General homework questions
1. Explain the main advantages of cell-free protein synthesis over traditional in vivo methods, specifically in terms of flexibility and control over experimental variables. Name at least two cases where cell-free expression is more beneficial than cell production.
Cell-free protein synthesis provides greater flexibility and control compared to traditional in vivo expression, as the reaction occurs outside living cells, allowing factors such as DNA concentration and energy components to be adjusted without affecting cell viability. In in vivo expression methods, amplifying the bacterial plasmid, sequencing it, expressing the strain, and purifying the protein must be done separately for every single protein in the pathway. In contrast, in cell-free protein expression using a synthetic cell, proteins can be expressed directly from linear PCR fragments without the need for plasmid construction. This makes the process much more time‑efficient and allows results to be obtained within hours. Furthermore, all proteins can be expressed in one pot by adjusting the concentrations of the DNA, unlike in vivo systems that require different promoters for each protein and expression level.
Answer based on Kate Adamala, 2/03/2026 lecture
Cases where cell-free expression is more beneficial than cell production:1. In the production of toxic or membrane proteins, cell-free systems are advantageous because the reaction occurs outside a living cell. Internal metabolism does not need to be maintained, and there are no cellular barriers limiting translation control (Zemella et al., 2015). Traditionally, genetic circuit prototyping using plasmid DNA is time‑consuming, taking several days per validation cycle. In contrast, linear DNA can complete the same cycles within 4–8 hours, making it possible to validate large circuits rapidly and to study molecules that were previously considered too toxic for in vivo work (Brookwell et al, 2021).
2. Cell-free systems also benefit other prototyping efforts, such as drug discovery, by shortening the time from compound identification to validation. Moreover, reactions can be monitored in real time, providing insights into the underlying mechanisms and improving the design and optimisation of experimental pathways.
References
Brookwell, A., Oza, J.P. and Caschera, F. (2021). Biotechnology Applications of Cell-Free Expression Systems.Life , 11(12), p.1367. doi:https://doi.org/10.3390/life11121367.Zemella, A., Thoring, L., Hoffmeister, C. and Kubick, S. (2015). Cell-Free Protein Synthesis: Pros and Cons of Prokaryotic and Eukaryotic Systems. ChemBioChem, [online] 16(17), pp.2420–2431. doi:https://doi.org/10.1002/cbic.201500340.2.Describe the main components of a cell-free expression system and explain the role of each component.
A cell-free expression system contains all the molecular machinery needed for transcription and translation without using living cells. Its main components include:
Lipid membrane: Composed of phospholipids for structural stability and cholesterol additives that increase membrane fluidity and durability.Cell extract: Provides ribosomes, enzymes, and other necessary cofactors for transcription and translation.Cytoplasmic components: Small molecules such as salts, metabolites, and energy substrates that support biochemical reactions.tRNAs and nucleotides: Supply the genetic code and building blocks for protein synthesis.Membrane channels: Allow controlled exchange of small molecules, enabling communication and regulation.Genetic template: A minimal genome in the form of a plasmid or linear DNA containing the genes to be expressed.Answer based on Kate Adamala, 2/03/2026 lecture
3. Why is energy provision regeneration critical in cell-free systems? Describe a method you could use to ensure continuous ATP supply in your cell-free experiment.
Energy regeneration is critical in cell-free systems because protein synthesis requires a continuous supply of ATP. Translation is highly energy‑intensive, using roughly 4–5 ATP equivalents per peptide bond. As the reaction proceeds, phosphate by‑products accumulate and deplete the energy pool, limiting protein yield. To maintain ATP levels, energy regeneration schemes can be introduced.
One effective example is the Protein Synthesis Using Recombinant Elements (PURE) system, a minimal biochemical setup capable of carrying out cell‑free protein synthesis using defined enzymatic components. In this system, pyruvate oxidase is added to catalyse the conversion of pyruvate and inorganic phosphate into acetyl phosphate. This intermediate can then regenerate ATP through its conversion to acetate, catalysed by acetate kinase naturally present in the extract(Yadav et al., 2025).
references
Yadav, S., Perkins, A.J.P., Liyanagedera, S.B.W., Bougas, A. and Laohakunakorn, N. (2025). ATP Regeneration from Pyruvate in the PURE System. ACS Synthetic Biology, 14(1), pp.247–256. doi:https://doi.org/10.1021/acssynbio.4c00697.4. Compare prokaryotic versus eukaryotic cell-free expression systems. Choose a protein to produce in each system and explain why.
Prokaryotic cell-free systems typically use E. coli extracts and provide high protein yields at low cost. They are rapid to prepare and are easily genetically engineered, which makes them suitable for large-scale production and the incorporation of non-canonical amino acids. However, they have limited capacity for complex protein folding and lack most post-translational modifications. In contrast, eukaryotic cell-free systems use extracts from sources such as wheat germ, insect cells, or protozoa. They are better suited for proteins that require proper folding and post-translational modifications, allowing the production of more complex and functional proteins. However, they are generally slower, more expensive, and often yield less protein than prokaryotic systems (Zemella et al., 2015).
For eukaryotic systems: Monoclonal Antibodies (mAbs)
These are complex, large glycoproteins that require proper folding and post-translational modifications, which eukaryotic systems handle better. Heavily used in drug manufacturing, their higher solubility in eukaryotic extracts makes the final drug more effective (Ding and Huang, 2024).
For prokaryotic systems: Human Serum Albumin (HSA)
This small protein needs no complex glycosylation. Commonly used as a drug carrier where production time matters, prokaryotic systems provide the high yields needed for scalable manufacturing (Raoufinia et al., 2016).
references
Creative Biostructure (2025). Protein Synthesis in Prokaryotes vs. Eukaryotes: What’s the Difference? [online] Creative-biostructure.com. Available at: https://www.creative-biostructure.com/protein-synthesis-in-prokaryotes-vs-eukaryotes.htm [Accessed 4 Apr. 2026].Ding, Z. and Huang, Y. (2024). Production of Monoclonal Antibodies for Therapeutic Purposes: Applications, Techniques, and Improvement. doi:https://doi.org/10.20944/preprints202405.1854.v1.Zemella, A., Thoring, L., Hoffmeister, C. and Kubick, S. (2015). Cell-Free Protein Synthesis: Pros and Cons of Prokaryotic and Eukaryotic Systems. ChemBioChem, [online] 16(17), pp.2420–2431. doi:https://doi.org/10.1002/cbic.201500340.Raoufinia, R., Mota, A., Keyhanvar, N., Safari, F., Shamekhi, S. and Abdolalizadeh, J. (2016). Overview of Albumin and Its Purification Methods. Advanced Pharmaceutical Bulletin, 6(4), pp.495–507. doi:https://doi.org/10.15171/apb.2016.063.5. How would you design a cell-free experiment to optimise the expression of a membrane protein? Discuss the challenges and how you would address them in your setup.
What needs to be added:
Insoluble expression:If lipids or detergents are not included, membrane proteins tend to aggregate due to their hydrophobic regions. Although aggregation can sometimes allow refolding into functional proteins, it more often results in misfolded or inactive proteins.Detergents: These are commonly added during expression to increase solubility by surrounding hydrophobic regions of the protein. However, they can sometimes destabilise the protein or interfere with downstream applications.MSP-Nanodiscs:These provide a stabilising lipid environment for membrane proteins. They enable proteins to remain in a near-native state and can be used in a wide range of assays without requiring detergents.Liposomes and lipids:These create a membrane-like environment that supports proper folding and insertion of membrane proteins, particularly when used with insect cell lysates.There are two main methods of cell-free membrane protein expression: batch and Continuous exchange cell-free (CECF) / dialysis systems. I would choose CECF/dialysis methods over batch systems because they allow longer reaction times and generally result in higher protein yields.(Cube Biotech, 2014)
Steps:
Follow a medium-scale protein production approach, which results in a high protein yield whilst optimising reaction conditions for maximum purity:
Preparation of plasmid template: Use circular DNA templates, as they are more stable and optimised for high-yield protein expression.Reaction setup: Use a large reaction volume within a continuous exchange system to support sustained protein synthesis.Reaction conditions optimisation: Maximise yield by adjusting key variables such as the composition of the feeding solution.Protein analysis: Assess the yield and purity of the expressed protein and prepare it for downstream applications.(Synthego, 2026)
Challenges and how to address them:
Membrane proteins are difficult to study because their hydrophobic surfaces require detergents or lipid environments for extraction and stabilisation. This creates challenges not only during expression, but also throughout purification and downstream analysis.
In addition, membrane proteins are often flexible and structurally unstable, which reduces the likelihood of obtaining a functional or well-behaved protein. This issue can be addressed by providing a more native-like environment during expression, such as incorporating lipids or nanodiscs to improve stability and folding. These inherent properties impact multiple stages of research, including expression, solubilisation, purification, crystallisation, and structure determination. As a result, it is often necessary to adopt broader screening strategies.
One way to overcome these challenges is to test a range of targets or homologues, increasing the chances of identifying a protein that behaves well under experimental conditions. This approach has been particularly successful in the study of G-protein-coupled receptors, where reducing protein flexibility has been essential for structural studies. For example, stabilising flexible regions through engineered modifications or binding partners has enabled successful structure determination.
(Carpenter et al., 2008).references
Carpenter, E.P., Beis, K., Cameron, A.D. and Iwata, S. (2008). Overcoming the challenges of membrane protein crystallography. Current Opinion in Structural Biology, 18(5), pp.581–586. doi:https://doi.org/10.1016/j.sbi.2008.07.001.Cube Biotech (2014). Why use cell-free protein expression? [online] Cube Biotech. Available at: https://cube-biotech.com/our-science/cell-free-lysates/cell-free-expression/ [Accessed 4 Apr. 2026].Synthego (2026). Cell-Free Protein Expression: A Guide - Synthego. [online] Synthego. Available at: https://www.synthego.com/cell-free-protein-expression-guide/ [Accessed 4 Apr. 2026].6.Imagine you observe a low yield of your target protein in a cell-free system. Describe three possible reasons for this and suggest a troubleshooting strategy for each.
Reasons for low yield of target protein:
1)Low ATP Level: Use Protein Synthesis Using Recombinant Elements (PURE) for ATP regeneration. (Yadav et al., 2025)
Poor DNA template: Use circular DNA to increase stability. (Synthego, 2026)
Poor protein folding: Add protein folding factors such as Chaperones to the in vitro extract. (Anantha, 2025)
references
Anantha, S. (2025). Solved: Low Yields in Cell-Free Protein Synthesis. [online] Bitesize Bio. Available at: https://bitesizebio.com/10234/solvedlow-yields-in-cell-free-protein-synthesis/ [Accessed 4 Apr. 2026].Synthego (2026). Cell-Free Protein Expression: A Guide - Synthego. [online] Synthego. Available at: https://www.synthego.com/cell-free-protein-expression-guide/ [Accessed 4 Apr. 2026].Yadav, S., Perkins, A.J.P., Liyanagedera, S.B.W., Bougas, A. and Laohakunakorn, N. (2025). ATP Regeneration from Pyruvate in the PURE System. ACS Synthetic Biology, 14(1), pp.247–256. doi:https://doi.org/10.1021/acssynbio.4c00697.Homework question from Kate Adamala
Design an example of a useful synthetic minimal cell as follows:
Pick a function and describe it.
a) What would your synthetic cell do? What is the input, and what is the output?
Based on my Final Project proposal: A synthetic minimal cell for PAH paint bioremediation.
Function: Continuously degrades PAH carcinogenic pollutants produced by incomplete combustion (eg. traffic emissions and industrial processes) when deposited on the paint surfaces.Input: Gaseous PAHs from air pollutionOutput: Carbon dioxide and water —> PAH-free surfacesb) Could this function be realized by cell-free Tx/Tl alone, without encapsulation?
No, this function could not be realised by cell-free Tx/Tl alone and would require encapsulation. Encapsulation would protect the synthetic minimal cells within the paint from harsh outdoor environmental conditions, allowing the PAH-degrading enzymes to function effectively.
c) Could this function be realized by genetically modified natural cell?
A genetically natural cell could realise PAH degradation, but would face serious challenges. PAHs like benzo[a]pyrene (BaP) form toxic reactive diol epoxides (BPDE) that bind to DNA adducts that halt replication and enzyme production (Wang et al., 2023). Additionally, living cells also need complex nutrients for long-term survival, plus growth control to prevent overgrowth.
However, in my project proposal examining B. subtilis's PAH-degrading properties, its spore-forming ability and environmental robustness. In relation to food source, a study conducted showed strain BMT4i (MTCC 9447) utilising BaP as a sole carbon and energy source with 84.66% degradation efficiency (Lily et al, 2010). Additionally, the degradation capacity of microbes may be induced by exposing them to higher PAH concentrations, resulting in genetic adaptation or changes responsible for high-efficiency removal/degradation. Overall, this keeps it as a strong candidate for the development of a PAH-degrading paint (Sakshi and Haritash, 2020).
references
Hazan, A., Lee, H.Y., Tiong, V. and AbuBakar, S. (2025). Bacillus subtilis Spores as a Vaccine Delivery Platform: A Tool for Resilient Health Defense in Low- and Middle-Income Countries. Vaccines, [online] 13(10), pp.995–995. doi:https://doi.org/10.3390/vaccines13100995.Lily, M.K., Ashutosh Bahuguna, Koushalya Dangwal and Garg, V. (2010). Optimization of an inducible, chromosomally encoded benzo [a] pyrene (BaP) degradation pathway in Bacillus subtilis BMT4i (MTCC 9447). Annals of Microbiology, 60(1), pp.51–58. doi:https://doi.org/10.1007/s13213-009-0010-y.Sakshi and Haritash, A.K. (2020). A comprehensive review of metabolic and genomic aspects of PAH-degradation. Archives of Microbiology, 202(8), pp.2033–2058. doi:https://doi.org/10.1007/s00203-020-01929-5.Wang, H., Liu, B., Cui, H., Xu, P., Xue, H. and Yuan, J. (2023). Dynamic changes of DNA methylation induced by benzo(a)pyrene in cancer. Genes and Environment, 45(1). doi:https://doi.org/10.1186/s41021-023-00278-1.d) Describe the desired outcome of your synthetic cell operation.
PAH-contaminated paint surfaces become PAH-free, preventing re-volatilisation and runoffs.
2. Design all components that would need to be part of your synthetic cell.
a) What would be the membrane made of?
Phospholipid bilayer with additive cholesterol (increases stability and fluidity) for paint surface adhesion and environmental stability. Additionally, PEG-lipids (polyethene glycol-lipids) as it acts as an interface between hydrophobic polymer cores and aqueous environments (Esteve, 2025).
References:
Esteve, M.Á. (2025). Beyond Lipids and Polymers: Understanding Hybrid Nanoparticles. [online] Curapath.com. Available at: https://blog.curapath.com/beyond-lipids-and-polymers-understanding-hybrid-nanoparticles [Accessed 4 Apr. 2026].b) What would you encapsulate inside? Enzymes, small molecules.
Catabolic enzymes such as dehydrogenase, monooxygenase, catechol dioxygenase, and aromatic-ring-hydroxylating dioxygenase (found in bacterias gene that degrade PAHs, key components of the pathway). (Tesfaye et al., 2025)Linear DNA encoding for the catabolic enzymes.E. Coli cell extract containing tRNAs, ribosomes and polymerases.Cholesterol and phospholipids.references
Tesfaye, E.L., Bogale, F.M. and Aragaw, T.A. (2025). Biodegradation of polycyclic aromatic hydrocarbons: The role of ligninolytic enzymes and advances of biosensors for in-situ monitoring. Emerging Contaminants, [online] 11(1), p.100424. doi:https://doi.org/10.1016/j.emcon.2024.100424.c) Which organism your Tx/Tl system will come from? Is bacterial OK, or do you need a mammalian system for some reason? (hint: for example, if you want to use small molecule modulated promoters, like Tet-ON, you need mammalian)
I would use E. Coli as my Tx/TI bacterial system, as PAH degradation is present in prokaryotic enzymes and pathways; therefore, no mammalian systems would be needed. Additionally, they also give a higher yield production and have a lower cost compared to eukaryotic and mammalian systems.
d) How will your synthetic cell communicate with the environment? (hint: are substrates permeable? or do you need to express the membrane channel?)
The catabolic PAH-degrading enzymes are hydrophilic proteins that remain inside the cell extract. The PAHs (small, hydrophobic) are what will passively diffuse through the phospholipid membrane and due to their high permeability, no specific channels need to be expressed (Cao et al., 2021).
references
Cao, Y., Zhang, L., Geng, Y., Li, Y., Zhao, Q., Huang, J., Ning, P. and Tian, S. (2021). Evaluation of the permeability and potential toxicity of polycyclic aromatic hydrocarbons to pulmonary surfactant membrane by the parallel artificial membrane permeability assay model. Chemosphere, p.132485. doi:https://doi.org/10.1016/j.chemosphere.2021.132485.3. Experimental details
a) List all lipids and genes. (bonus: find the specific genes; for example, instead of just saying “small molecule membrane channel” pick the actual gene.)
Lipids:
CholesterolPolymer coatingSpecific genes:
For ATP regeneration: pyruvate oxidase and acetate kinaseCatabolitic PAH-degrading enzymes: dehydrogenase, monooxygenase, catechol dioxygenase, and aromatic-ring-hydroxylating dioxygenaseb) How will you measure the function of your system?
Use Mass spectrometry coupled to gas chromatography (GC-MS) quantification to determine the exact levels of volatile compounds (VOCs), specifically the PAHs (Dimitrios,2024), present before and after application of paint.
referencesDimitrios Tsikas (2024). Perspectives of Quantitative GC-MS, LC-MS, and ICP-MS in the Clinical Medicine Science—The Role of Analytical Chemistry. Journal of Clinical Medicine, [online] 13(23), pp.7276–7276. doi:https://doi.org/10.3390/jcm13237276.Homework question from Peter Nguyen
Freeze-dried cell-free systems can be incorporated into all kinds of materials as biological sensors or as inducible enzymes to modify the material itself or the surrounding environment. Choose one application field — Architecture, Textiles/Fashion, or Robotics — and propose an application using cell-free systems that are functionally integrated into the material. Answer each of these key questions for your proposal pitch:
Write a one-sentence summary pitch sentence describing your concept.
I will continue to apply my final project proposal from above to explore my development options.
Self-monitoring, self-remediating anti-pollution paint that continuously degrades volatile Polycyclic Aromatic Hydrocarbons (PAH) using freeze-dried Bacillus subtilis BMT4i expressing PAH-degrading enzymes.
What societal challenge or market need will this address?
PAHs are associated with urban pollution sources such as vehicle exhaust and industrial emissions, which persist in the air and soil and pose significant health risks via inhalation, ingestion, and dermal contact (Wu et al., 2025). This freeze-dried cell-free system addresses these issues and could be a promising tool for environmental and human health.
references
Wu, Y., Meng, Y., Zhang, H., Hao, L., Zeng, T., Shi, Y., Chen, Y., Qiao, N. and Ren, Y. (2025). Ecological and Health Risks of Polycyclic Aromatic Hydrocarbons in Particulate Matter in Chinese Cities. GeoHealth, [online] 9(6). doi:https://doi.org/10.1029/2024gh001126.How do you envision addressing the limitations of cell-free reactions (e.g., activation with water, stability, one-time use)?
Activation: Uses outdoor rainwater and humidity to hydrate the freeze-dried system, so no manual activation is needed.One-time use: Bacillus subtilis can use PAHs as a carbon source, so the system is designed to keep breaking down fresh pollutant deposits over repeated exposure.Stability: The encapsulation layer in the paint matrix helps protect the enzymes from environmental exposures.Homework question from Ally Huang
1. Provide background information that describes the space biology question or challenge you propose to address. Explain why this topic is significant for humanity, relevant for space exploration, and scientifically interesting. (Maximum 100 words)
Galactic cosmic radiation poses a major health risk to astronauts, causing DNA damage, cancer risk, neurocognitive decline, and cardiovascular disease (Chancellor et al., 2014). This challenge is critical as long-duration missions to Mars exceed current radiation safety limits. Addressing it is vital for enabling deep-space exploration and protecting crew health. Scientifically, it is compelling to investigate whether BioBits cell-free systems can both detect DNA-damaged biomarkers and respond by producing DNA repair enzymes on demand, offering a real-time response.
references
Chancellor, J., Scott, G. and Sutton, J. (2014). Space Radiation: The Number One Risk to Astronaut Health beyond Low Earth Orbit. Life, [online] 4(3), pp.491–510. doi:https://doi.org/10.3390/life4030491.2. Name the molecular or genetic target that you propose to study. Examples of molecular targets include individual genes and proteins, DNA and RNA sequences, or broader -omics approaches. (Maximum 30 words)
γ-H2AX (phosphorylated histone H2AX) as a biomarker for cosmic radiation (Sethu et al., 2017) and the PARP-1 gene for DNA repair enzyme production (Ko and Ren, 2012).references
Ko, H.L. and Ren, E.C. (2012). Functional Aspects of PARP1 in DNA Repair and Transcription. Biomolecules, 2(4), pp.524–548. doi:https://doi.org/10.3390/biom2040524.Hande, Mp., Zeegers, D., Venkatesan, S., Koh, S., Low, G.M., Srivastava, P., Sundaram, N., Sethu, S., Banerjee, B., Jayapal, M., Belyakov, O., Baskar, R. and Balajee, A. (2017). Biomarkers of ionizing radiation exposure: A multiparametric approach. Genome Integrity, 8(1), p.6. doi:https://doi.org/10.4103/2041-9414.198911.3. Describe how your molecular or genetic target relates to the space biology question or challenge your proposal addresses. (Maximum 100 words)
Galactic cosmic rays can cause severe DNA damage in astronauts’ immune cells, creating clustered double-strand breaks. Within about 30 minutes, this damage leads to the formation of γ-H2AX foci, which act as an early signal that DNA repair may be needed. By detecting γ-H2AX, a BioBits® cell-free system can be triggered to produce the repair enzyme PARP-1. PARP-1 helps coordinate the repair process, allowing the cell to fix DNA breaks before mutations start to build up. This kind of self-activating biosensor-and-response system is especially useful in deep space, where long resupply delays make it difficult to rely on traditional medicines. Instead, it provides on-demand DNA repair exactly when astronauts need it.
references
Redon, C.E., Dickey, J.S., Bonner, W.M. and Sedelnikova, O.A. (2009). γ-H2AX as a biomarker of DNA damage induced by ionizing radiation in human peripheral blood lymphocytes and artificial skin. Advances in Space Research, 43(8), pp.1171–1178. doi:https://doi.org/10.1016/j.asr.2008.10.011.Wang, Y., Luo, W. and Wang, Y. (2019). PARP-1 and its associated nucleases in DNA damage response. YDNA Repair, 81, p.102651. doi:https://doi.org/10.1016/j.dnarep.2019.102651.4. Clearly state your hypothesis or research goal and explain the reasoning behind it. (Maximum 150 words)
Goal:
Create a toehold switch that senses γ-H2AX and turns on PARP-1 production in a BioBits® reaction. Then test whether the PARP-1 made this way can actually repair damaged DNA, as a proof-of-concept for a wearable radiation-response patch for astronauts.
Reasoning:
γ-H2AX appears quickly at sites where radiation causes DNA double-strand breaks, making it a reliable early warning signal. PARP-1 is one of the first proteins to respond to this damage, helping recruit the cell’s repair machinery. BioBits® cell-free systems have already been shown to work on the ISS, so combining them with toehold switches could create a self-contained system that detects DNA damage and immediately produces a repair protein without needing outside intervention. If small fragments of γ-H2AX can activate the toehold switch, this would demonstrate a detect and respond system for space radiation damage.
references
Podhorecka, M., Skladanowski, A. and Bozko, P. (2010). H2AX Phosphorylation: Its Role in DNA Damage Response and Cancer Therapy. Journal of Nucleic Acids, 2010, pp.1–9. doi:https://doi.org/10.4061/2010/920161.Kinner, A., Wu, W., Staudt, C. and Iliakis, G. (2008). -H2AX in recognition and signaling of DNA double-strand breaks in the context of chromatin. Nucleic Acids Research, 36(17), pp.5678–5694. doi:https://doi.org/10.1093/nar/gkn550.Outline your experimental plan - identify the sample(s) you will test in your experiment, including any necessary controls, the type of data or measurements that will be collected, etc. (Maximum 100 words)
Samples: Human lymphocytes exposed to simulated cosmic radiation and non-exposed controls.>Methods: Use miniPCR to amplify PARP-1 DNA. Measure protein production and γ-H2AX signals with a P51 Fluorescence Viewer.Data: Track fluorescence (protein expression), percentage of repaired DNA, and number of γ-H2AX foci before and after treatment.Controls: Non-irradiated cells and a scrambled toehold switch.Week 10 HW: Advanced Imaging & Measurment Technology
Homework: Final Project
Please identify at least one (ideally many) aspect(s) of your project that you will measure. It could be the mass or sequence of a protein, the presence, absence, or quantity of a biomarker, etc.
For my project, I want to measure the activity of the CotA laccase from Bacillus subtilis (whose gene is inserted into my pETite‑based plasmid) against PAH compounds using ABTS as a colour‑indicating substrate. I will measure the initial reaction rate of CotA, expressed as the change in absorbance over time, under different conditions, such as pH and temperature, to determine the enzyme’s optimal working range.
I would also like to measure the half‑life and stability of CotA in a paint‑like formula by embedding cultured CotA‑expressing cells in a water‑based gel or coating to then sample at regular intervals and quantify any residual activity through ABTS assays. Finally, I will record colour‑change metrics of the ABTS PAH‑surrogate solution over time, using percentage colour loss as a quantitative readout of degradative activity. This will help characterise the functional performance of the system in conditions relevant to a PAH‑degrading paint.
Please describe all of the elements you would like to measure, and furthermore describe how you will perform these measurements.
Considering PAH are not LBS1 safe, I will be using ABTS, a synthetic water‑soluble compound that laccases (like CotA) oxidise to a coloured radical as a PAH substitute. After mixing my pETite‑CotA plasmid with BL21(DE3) competent cells, performing heat shock and plating on LB kanamycin to grow a colony. I can then proceed to scaling up the culture to have enough E.coli to proceed with the following tests:
Reaction Rate of CotA Laccase - Change in absorbance over time:
1) Centrifuge the culture to pellet the cells and pipette away the surplus (LB and kanamycin) to obtain only the bacterial cells.2) Prepare the solution: dissolve ABTS in 100 mM potassium phosphate buffer (achieved by mixing Potassium phosphate monobasic and dibasic in water).
3) Centrifuge the culture to pellet the cells and pipette away the surplus (LB and kanamycin) to obtain only the bacterial cells.
4) From the obtained cells, I will then resuspend the pellet in a smaller volume of buffer to create a cell suspension.
5) I will then measure the ABTS assay of the cell suspensions of my prepared bacterial samples at different temperatures and pHs by using a 96-well plate reader or UV-Vis spectrophotometer, depending on lab availability.
6) I will measure the change in absorbance at 420nm over time during the ABTS oxidation assay.
7) I will calculate the initial rate as ∆A₄₂₀·min⁻¹, which serves as a quantitative measure of CotA activity under each condition.
8) Additionally, adding copper to the reaction will enhance the activity of CotA Laccase.
Colour‑change metrics of the ABTS PAH‑surrogate solution over time:At each time point t, take or prepare a small sample and measure Aλ at 420 nm (Aₜ)
Keep the unreacted ABTS at time 0 as your reference (A₀).For each timepoint, I will calculate the % colour loss: % colour loss t = 100 x A₀ - Aₜ / A₀If Aₜ is significantly smaller than A₀, there is a high % colour loss.Half‑life and stability of CotA‑expressing cells embedded in a paint‑like gel
Use some of the suspended cells and mix them with a water‑based gel or coating (e.g., agar hydrogel) as a conceptual paint.Spread a layer of the mixture on tiles or other surfaces and let it solidify.At different selected times (over a couple of days at least), scrape off a defined amount of the surface.Resuspend the sample in the buffer, centrifuge to get a CotA sample.Rerun an ABTS assay at 420nm on each sample, calculate the absorbance at the start and the initial rate: ∆A₄₂₀/minI can then express the activity and determine the time it takes for the enzyme’s activity to drop below 50% of its initial value: activity remaining at t= 100 x rateₜ/rate₀References
Ardila-Leal, L.D., Monterey-Gutiérrez, P.A., Poutou-Piñales, R.A., Quevedo-Hidalgo, B.E., Galindo, J.F. and Pedroza-Rodríguez, A.M. (2021). Recombinant laccase rPOXA 1B real-time, accelerated and molecular dynamics stability study. BMC Biotechnology, 21(1). doi:https://doi.org/10.1186/s12896-021-00698-3.BenchChem Technical Support Team (2026). Application Notes: ABTS Assay for Laccase Activity. [online] Available at: https://pdf.benchchem.com/7949/Application_Notes_ABTS_Assay_for_Laccase_Activity.pdf [Accessed 12 Apr. 2026].Dias, A.A., António J.S. Matos, Fraga, I., Sampaio, A. and Rui (2017). An Easy Method for Screening and Detection of Laccase Activity. The Open Biotechnology Journal, 11(1), pp.89–93. doi:https://doi.org/10.2174/1874070701711010089.JenaBios (n.d.). 4 benzenediol + O 2 4 benzosemiquinone + 2 H 2 O Activity. [online] Available at: https://www.jenabios.de/wp-content/uploads/2016/03/Datenblatt_Laccase_Cu.pdf [Accessed 13 Apr. 2026].Margot, J., Bennati-Granier, C., Maillard, J., Blánquez, P., Barry and Holliger, C. (2013). Supporting Information: Bacterial versus fungal laccase: Potential for micropollutant degradation. AMB Express.MartinsL.O., Soares, C.M., Pereira, M.M., Teixeira, M., Costa, T., Jones, G.H. and Henriques, A.O. (2002). Molecular and Biochemical Characterization of a Highly Stable Bacterial Laccase That Occurs as a Structural Component of the Bacillus subtilis Endospore Coat. Journal of Biological Chemistry, 277(21), pp.18849–18859. doi:https://doi.org/10.1074/jbc.m200827200.Shin-ichi Sakasegawa, Ishikawa, H., Imamura, S., Haruhiko Sakuraba, Goda, S. and Toshihisa Ohshima (2006). Bilirubin Oxidase Activity of Bacillus subtilis CotA. Applied and Environmental Microbiology, 72(1), pp.972–975. doi:https://doi.org/10.1128/aem.72.1.972-975.2006.JenaBios (n.d.). 4 benzenediol + O 2 4 benzosemiquinone + 2 H 2 O Activity. [online] Available at: https://www.jenabios.de/wp-content/uploads/2016/03/Datenblatt_Laccase_Cu.pdf [Accessed 13 Apr. 2026].What are the technologies you will use (e.g., gel electrophoresis, DNA sequencing, mass spectrometry, etc.)? Describe in detail.
For all three protocols, the core instrument is a UV‑Vis spectrophotometer (or a microplate reader with absorbance capabilities) to measure A₄₂₀ during ABTS‑based laccase assays. For the settings, I will set the wavelength at 420 nm (ABTS has the highest absorbance point). I will then create a ‘blank’ cuvette to calibrate it (buffer + ABTS with no CotA cells) and place it first in the spectrometer. Place each sample in a cuvette and place those in the spectrophotometer one by one. Calculation details are explained in the bullet point above.
Additionally, to relate to the lecture, my aim 3 for my project aims to develop a field‑ready bioremediation paint by evaluating CotA performance in a real‑world‑like coating matrix and validating its safety profile. Toxicity validation would focus on verifying that degradation yields safe byproducts rather than more hazardous compounds, a concern highlighted by recent studies showing that PAH transformation products could be toxic (Huizenga et al., 2025). In this context, mass spectrometry (e.g., LC‑MS/MS) would be used to identify and monitor the formation of degradation products (Nakken et al., 2025).
References
Agilent Technologies (2025). The Basics of UV-Vis Spectrophotometry. [online] Agilent. Available at: https://www.agilent.com/cs/library/primers/public/primer-uv-vis-basics-5980-1397en-agilent.pdf?srsltid=AfmBOopupFXxpTMDA1XLxTrhBD3EBGXNJ9ukjBHqXIJSZoU1jsVGuLr2 [Accessed 12 Apr. 2026].Huizenga, J.M., Semprini, L. and Garcia-Jaramillo, M. (2025). Identification of Potentially Toxic Transformation Products Produced in Polycyclic Aromatic Hydrocarbon Bioremediation Using Suspect and Non-Target Screening Approaches. Environmental Science & Technology, 59(15), pp.7561–7573. doi:https://doi.org/10.1021/acs.est.4c13093.Nakken, C.L., Sørhus, E., Holmelid, B., Meier, S., Mjøs, S.A. and Donald, C.E. (2025). Transformative knowledge of polar polycyclic aromatic hydrocarbons via high-resolution mass spectrometry. Science of The Total Environment, 960, p.178349. doi:https://doi.org/10.1016/j.scitotenv.2024.178349.