Zander — HTGAA Spring 2026

cover image cover image

Seeming to model phenomena found frequently throughout nature, the Reaction-Diffusion equations give rise to an endless assortment of intriguing interactions. I whipped up the one above courtesy of @pmneila via Reaction-Diffusion System (Grey-Scott model).

About me

Hey there, I’m Zander!

I’m a recent industrial design graduate with a growing fascination for how synthetic biology may be harnessed as a novel manufacturing technique. I was fortunate enough to receive a taster of the world of biology through the completion of a minor in Biotechnology, having gained not only some time in the wetlab, but an unquenchable desire to culture, plate, and microscopically observe every microorganism in a 10 kilometre radius. This time around, I’m pretty thrilled to be diving into the world of synthetic biology. Lets leave no stone unturned!

Contact

LinkedIn

Homework

Labs

Projects

Subsections of Zander — HTGAA Spring 2026

Subsections of Homework

Week 1 HW: Principles and Practices

cover image cover image

The morphogenetic process in the Drosophila dorsal thorax is explored above: Confocal Image of the dorsal thorax epithelium (left), tracking of cell division (middle) and expression patterns predicted by spatial transcriptomics (right).

First, describe a biological engineering application or tool you want to develop and why.

I’m fascinated by morphogenesis, in which the complex, three-dimensional organisation of biological entities can emerge! The possibilities of synthetically harbouring this process feel endless, however I am most intrigued by the potential of its application in radically-sustainable manufacturing.

Conventional manufacturing techniques (i.e formative plastic methods, subtractive techniques) in their archetypal incarnation contain inherent inefficacies at all stages, from the pre-production through to disposal of the manufactured artefact by the end-user. The plastic spork is an exemplar for how contemporary industrial logic constrains a product from its inception. Compatibility with standard machinery immediately informs material selection. Injection moulding techniques demand a particular design rationale, where minor features like undercuts can cause costs to balloon through added mould complexity. Instead of being repossessed wholly by organic processes at the atomic level when disposed, conventional thermoplastics degrade and colonize such exotic destinations as the mariana trench, our water supply, the human brain etc.

Organisms in the natural world appear unshackled by these constraints. I think immediately of the functionally-graded characteristics of bone, in which matter distribution is varied spatially, optimised to provide lightweight strength and support.

  • Consistency of use: Fair dissemination of information is necessary to permit strong, equitable discussions between distinct stakeholders (public, policymakers, technicians etc), ensuring the ethical concerns of all groups are platformed to guide accurate policymaking.

  • Transparency in research and application: ‘Growth’ processes should likely be strongly standardized (at least initially) to establish a safe, repeatable precedent of procedure.

  • Prevention of environmental contamination: Current biowaste disposal processes should be adapted to fit novel requirements as they emerge.

  • Prevention of utilisation in scenarios considered universally unethical: This one feels pretty clear: no application should cause innate suffering or distress to involved biological materials (or societies in which they are used).

    • a. Sound disposal practices of biomass bioproducts as industrial waste: Current biowaste disposal processes should be adapted to fit novel requirements as they emerge.
Next, describe at least three different potential governance “actions”.

1. Implement sound genetic safeguards.

Purpose: emergency termination of cellular processes in the event of dangerous circumstances, whereby the health of a population or ecosystem is suddenly threatened.

Induced auxotrophy (metabolic or via xenonucleic biochemistry), toxin gene expression cassettes, and engineered addiction are all contemporary methods of halting cellular activity outside of a specific environment (lab). A mosaic of approaches is recommended as the most robust containment method, especially when accounting for the spontaneous occurrence of safeguard-negating mutations.

Design: continued research, testing, and update of existing safeguards.
Assumptions: established safeguard techniques identified are inadequate or incompatible for this use case.

Risk of failure (or success!):

a. Strategies prove ineffective and difficult to consistently implement. Environmental contamination occurs frequently; consequences are unpredictable and catastrophic.

b. Genetic safeguards utilised are too complex or specific to permit equitable access.

2. Compose a universal convention outlining acceptable and unacceptable applications in manufacturing (and beyond), requiring strict agreement and adherence at a national level prior to industrial use.

Purpose: establish legal basis for international enforcement of manufacturing practices collectively identified as ethical and sound. Dissuade bad actors from potential misuse, encourage international collaboration, and mandate stewardship at the national scale. Unlike traditional manufacturing where inanimate materials are used, it feels like a necessity to require international collaboration where unethical practices could give rise to unpredictable, dangerous outcomes.

Design: The Biological Weapons Convention (BWC) serves as a strong example for the potential of a multilateral agreement to address and prevent misuse of biological practices at a global scale, reaching practically universal agreement across UN member states. I propose the creation of a convention following a similar framework, however with emphasis placed on proactively identifying and sanctioning acceptable and unacceptable use cases.

Assumptions: success hinges heavily upon members reaching a collective understanding. Additionally, it assumes that national governance voices are strongly aligned with the private interests of the manufacturing sector in each member state.

Risk of failure (or success!):

a. Private interests commence manufacturing via synthetic morphogenesis despite absent participation at UN level.

b. An unanimous decision is made, however coverage of convention proves inadequate, requiring frequent amendments.

3. Establish an independent watchdog for regulation of industrial activity.

Purpose: provide additional layer of security to possible corruption at both national and international scales, serving to conduct random audits of the industrial activity within member states.

Design: using universal convention as a basis, establish key measurable indicators as a baseline for monitoring and governance. Indicators must be specific and comprehensive, defining actions by industry which must be precisely met to guarantee compliance.

Assumptions:

a. The watchdog is sufficiently capable of avoiding corruption. Handlings are fair, just, and do not impose on the national security or privacy of member states.

b. Member states (and industry) corporate sufficiently with requests of watchdog.

Risk of failure (or success!):

a. Undetected convention violations threaten international biosecurity or violate agreements on ethical practices.

4. Implement traceability protocols as..

a. Chain Of Custody (CoC) for synthesis and distribution of manufacturing precursors.

b. Integrated genetic barcodes.

Purpose: ensure all stakeholders are accountable for their contribution to the design, production, and distribution of biological materials prior, during, and post manufacture. In the event that a ‘leak’ occurs whereby an ecosystem is threatened, traceability permits rapid response and containment, as well as identification of processes violating the convention.

Design: a. Define ‘custody’ objects, assigning each with a distinct identifier, lifecycle, and scope of use. Establish assignable states (created, modified, transported, archived, destroyed etc.). Design custody record around requirements identified in convention. b. Establish composition, length, and ideal location of barcode locus.

Assumptions: a. Stakeholders adhere stringently to CoC protocol.

Risk of failure (or success!): CoC processes are too authoritative to strongly discourage prospective stakeholders, with sound intent, from entering the market. The approach cannot achieve the required market leverage to replace the use of conventional manufacturing techniques.

Next score (from 1-3 with, 1 as the best, or n/a) each of your governance actions against your rubric of governance goals.

“Synthetic morphogenesis as a manufacturing technique”

Does the option:Option 1Option 2Option 3Option 4
Enhance Biosecurity
• By preventing incidents1111
• By helping respond3321
Foster Lab Safety
• By preventing incident1113
• By helping respond3321
Protect the environment
• By preventing incidents1113
• By helping respond3331
Other considerations
• Minimizing costs and burdens to stakeholders2332
• Feasibility?1332
• Not impede research2221
• Promote constructive applications2113
Total (lower > higher)19211918
Last, drawing upon this scoring, describe which governance option, or combination of options, you would prioritize, and why. Outline any trade-offs you considered as well as assumptions and uncertainties.

Although likely the most politically challenging, the convention outlined in Option 2 appears central to establishing the precedent required for ethical use of the novel technology. Option 1 is almost mandatory as a guardrail in the event of industrial pollution, as built-in failsafes provide the greatest degree of protection where incidental mishaps could have dire consequences for public health.

I propose the initial priorisation of Options 1 and 2 consecutively, whereby research seeks to first establish integrated safeguards appropriate for the requirements unique to manufacturing, whilst national policymakers work collaboratively to negotiate terms of the convention. Due to somewhat uncertain ramifications of misuse, initial collaboration directly with the United Nations Office of the Secretary General will provide the expertise necessary to move forward at a national scale, where progress is likely to be more fruitful.

Principles & Practices - Weekly Homework

Reflecting on what you learnt and did in class this week, outline any ethical concerns that arose, especially any that were new to you.

The ability to control the growth of organic matter in a way that is so precise feels unprecedented in the sense that it is not dissimilar to ‘playing god’; it is clear that this gives rise to countless ethical concerns:

Concern: Could the process inadvertently give rise to conscious organisms under the right (or more so wrong) circumstances?

  • Action 2.

Concern: What happens if the technique is abused to grow human flesh for use in applications considered universally immoral?

  • Action 2, Action 3.

Concern: Could self-propagating organisms be inadvertently created and released into ecosystems?

  • Action 1.

Concern: Could the process create significant social upheaval amongst religious societies?

  • Action 2.

I will continue to strongly consider this topic throughout the coming weeks, aiming to further philosophise and describe ideas as they arise!

Preparatory Questions (Week 2)
Professor Jacobsen

Nature’s machinery for copying DNA is called polymerase. What is the error rate of polymerase? How does this compare to the length of the human genome. How does biology deal with that discrepancy?

  • 1:10^6
  • Human genome = ~ 3.2 gbp; approximately one incorrect base is inserted per 1 million correct insertions.
  • Human biology employs several DNA proofreading and correction techniques; sense of correct geometry of base pairings, slowing catalysis when mismatch detected, and separation of mismatched primer to exonuclease site for removal etc.

How many different ways are there to code (DNA nucleotide code) for an average human protein? In practice what are some of the reasons that all of these different codes don’t work to code for the protein of interest?

  • Substantial degree of combinations possible mathematically (approx 10^190)
  • Degeneracy of genetic code (multiple distinct codons encode amino acid) means a small fraction of total combinations actually encode for protein.
Dr. LeProust

What’s the most commonly used method for oligo synthesis currently?

  • Phosphoramidite oligo synthesis

Why is it difficult to make oligos longer than 200nt via direct synthesis?

  • Small discrepancies in coupling efficiency (95% - 99%) compound exponentially beyond 200nt, reducing sequence accuracy beyond.

Why can’t you make a 2000bp gene via direct oligo synthesis?

  • As described above, synthesis becomes incredibly imprecise beyond about 200nt. Accumulation of errors will yield strand coding for dysfunctional characteristics.

Fidelity of DNA replication-a matter of proofreading

George Church

[Using Google & Prof. Church’s slide #4] What are the 10 essential amino acids in all animals and how does this affect your view of the “Lysine Contingency”?

  • Arginine, histidine, methionine, isoleucine, leucine, lysine, phenylalanine, threonine, tryptophan, and valine compose the 10 essential amino acids, where arginine is described as ‘conditionally essential’ in some mammalian species, such as humans (Rychen et al., 2018).
  • The ‘Lysine Contingency’ describes the knock-out mutation performed in Jurassic Park (1993), in which the dinosaurs were rendered incapable of producing the essential aa Lysine (synthetic auxotrophy), requiring constant supplementation.
  • The organic abundance of Lysine as an essential aa effectively renders this method useless, as they would be capable of receiving it through their conventional diet. This is especially interesting when considering prospective use of synthetic auxotrophy to control the growth of a manufactured organism; how might nature creatively find a method to circumvent the integrated safeguard?

Biochemistry, Essential Amino Acids.

Safety and efficacy of l-arginine produced by fermentation with Escherichia coli NITE BP-02186 for all animal species.

Scratchpad

Synthetic biology as…

  • A manufacturing technique
  • A bioremediation technique
  • A fertilization technique
  • A method to arrange bacterial growth (magnetotaxis)

Manufacturing technique (key concepts / words):

  • Morphogenesis
  • Developmental biology
  • Tissue patterning
  • Hox genes
  • Homeoboxes
  • Positional information (morphogen gradient)
Assorted references

Huigang, L., Menghui, L., Xiaoli, Z., Cui, H. and Zhiming, Y. (2022). Development of and prospects for the biological weapons convention. Journal of Biosafety and Biosecurity, [online] 4(1), pp.50–53. doi:https://doi.org/10.1016/j.jobb.2021.11.003.

Johnson, M.S., Venkataram, S. and Kryazhimskiy, S. (2023). Best Practices in Designing, Sequencing, and Identifying Random DNA Barcodes. Journal of Molecular Evolution, [online] 91(3), pp.263–280. doi:https://doi.org/10.1007/s00239-022-10083-z.

Teufel, J., López Hernández, V., Greiter, A., Kampffmeyer, N., Hilbert, I., Eckerstorfer, M., Narendja, F., Heissenberger, A. and Simon, S. (2024). Strategies for Traceability to Prevent Unauthorised GMOs (Including NGTs) in the EU: State of the Art and Possible Alternative Approaches. Foods, [online] 13(3), p.369. doi:https://doi.org/10.3390/foods13030369.

Trotsyuk, A.A., Waeiss, Q., Bhatia, R.T., Aponte, B.J., Heffernan, I.M.L., Madgavkar, D., Felder, R.M., Lehmann, L.S., Palmer, M.J., Greely, H., Wald, R., Goetz, L., Trengove, M., Vandersluis, R., Lin, H., Cho, M.K., Altman, R.B., Endy, D., Relman, D.A. and Levi, M. (2024). Toward a framework for risk mitigation of potential misuse of artificial intelligence in biomedical research. Nature Machine Intelligence, [online] 6(12), pp.1435–1442. doi:https://doi.org/10.1038/s42256-024-00926-3.

Wang, F. and Zhang, W. (2019). Synthetic biology: Recent progress, biosafety and biosecurity concerns, and possible solutions. Journal of Biosafety and Biosecurity, [online] 1(1), pp.22–30. doi:https://doi.org/10.1016/j.jobb.2018.12.003.

Week 2 HW: DNA Read / Write / Edit

Part 1: Benchling & In-silico Gel Art

Left: practice in-silico digestion featuring base sequence (Lambda) and restriction enzymes as specified.

Right: in-silico digestion representing a three-subunit amino polypeptide! To achieve the central grouping of bands I desired, I first asked chatGPT to recommend a short base sequence, then referenced the linear map in Benchling to iterate with endonucleases which produced cuts at the positions needed to make bp lengths which would match an outline I sketched.

Part 2: Gel Art - Restriction Digests and Gel Electrophoresis

No wetlab access. Given access in the coming months, I’ll definitely attempt to create my artwork given the required endonucleases + base sequence are accessible!

Part 3: DNA Design Challenge

3.1. Choose your protein.

From Magnetospirillum magneticum (AMB-1): HtrA/DegP family protease MamE

Magnetotaxic bacteria possess the highly curious ability to form organised, magnetic ‘inclusions’ in structures composed of ‘magnetosomes’. Magnetotaxis as a behavioural characteristic is particularly advantageous in aquatic environments, where detection of the earth’s magnetic field permits spatial sense in the water column as a method to locate nutrient-rich microenvironments. In the model organism Magnetospirillum magneticum AMB-1 (AMB-1), MamE protease is central to the biomineralization process required for magnetic crystal formation within magnetosomes.

I selected this protein as I find this behaviour incredibly enchanting! Its role in both magnetoreception and the biomineralization process is fascinating, and promises many intriguing applications if harnessed.

MamE protease AA sequence:

sp|Q2W8Q8|MAME_PARM1 Magnetosome formation protease MamE OS=Paramagnetospirillum magneticum (strain ATCC 700264 / AMB-1) OX=342108 GN=mamE PE=1 SV=1

>MAMFNGDVEDGGRGDASCGKDLKRYLMLMGVVALVVLFGAFIYRQSSGGLRLGAMLEQMGRGTGPAVNVPVQQGGPSAAVNPAMSVPAGARVAPPSAAGAIATMPPMVDFGPAPIGAGGPFSSVVTLLRNSVVAVTASSANGQAMPDPLGLANPDGLPHFANPATRSVENIGTGVIVRNDGFIVTNYHVVRGANSVFVTVQDDVGSTRYSAEIIKMDEALDLALLKVAPKTPLTAAVLGDSDGVQVADEVIAIGTPFGLDMTVSRGIISAKRKSMVIEGVTHSNLLQTDAAINQGNSGGPLVISNGTVVGINTAIYTPNGAFAGIGFAVPSNQARLFILDEVGWLPTSTAEGASMGLVAMQRPMGGGVGAAGPAIFAGTRAPHTDGRQNMDCTTCHDLIPAGNGRPAPMMPIAAPIPPPPIPMGAVSPHTDGRQNMNCANCHQMLGGAAPIAAPGLGGGAYRFAQPPGSLAINIQGPRGGQSTAAGTGRVTLLGAALTPMSQRLGAQTGVPVGRGVFISGVTPNTPAATAGLRPGDVLLKVDGRPVRLPEEVSAIMVEMHAGRSVRLGVLRDGDVRNMTLVAGPAGLAAAAVQAPAIADMAQPPMGGMAPTAPGMVAVPGGPAVMPKPPTEFNWLGMEIETFQAPRPITGVPGAVPVPGAKGAQVAEVLVGSRAAVAGLQANDLILEVNNRPVAGPARLDAAIKGATNAGQQILLKVNRNGQEFWIVL

3.2. Reverse Translate: Protein (amino acid) sequence to DNA (nucleotide) sequence.

MamE protease gene, reverse-translated via Genecorner:

>reverse translation of sp|Q2W8Q8|MAME_PARM1 Magnetosome formation protease MamE OS=Paramagnetospirillum magneticum (strain ATCC 700264 / AMB-1) OX=342108 GN=mamE PE=1 SV=1 to a 2184 base sequence of most likely codons.

atggcgatgtttaacggcgatgtggaagatggcggccgcggcgatgcgagctgcggcaaagatctgaaacgctatctgatgctgatgggcgtggtggcgctggtggtgctgtttggcgcgtttatttatcgccagagcagcggcggcctgcgcctgggcgcgatgctggaacagatgggccgcggcaccggcccggcggtgaacgtgccggtgcagcagggcggcccgagcgcggcggtgaacccggcgatgagcgtgccggcgggcgcgcgcgtggcgccgccgagcgcggcgggcgcgattgcgaccatgccgccgatggtggattttggcccggcgccgattggcgcgggcggcccgtttagcagcgtggtgaccctgctgcgcaacagcgtggtggcggtgaccgcgagcagcgcgaacggccaggcgatgccggatccgctgggcctggcgaacccggatggcctgccgcattttgcgaacccggcgacccgcagcgtggaaaacattggcaccggcgtgattgtgcgcaacgatggctttattgtgaccaactatcatgtggtgcgcggcgcgaacagcgtgtttgtgaccgtgcaggatgatgtgggcagcacccgctatagcgcggaaattattaaaatggatgaagcgctggatctggcgctgctgaaagtggcgccgaaaaccccgctgaccgcggcggtgctgggcgatagcgatggcgtgcaggtggcggatgaagtgattgcgattggcaccccgtttggcctggatatgaccgtgagccgcggcattattagcgcgaaacgcaaaagcatggtgattgaaggcgtgacccatagcaacctgctgcagaccgatgcggcgattaaccagggcaacagcggcggcccgctggtgattagcaacggcaccgtggtgggcattaacaccgcgatttataccccgaacggcgcgtttgcgggcattggctttgcggtgccgagcaaccaggcgcgcctgtttattctggatgaagtgggctggctgccgaccagcaccgcggaaggcgcgagcatgggcctggtggcgatgcagcgcccgatgggcggcggcgtgggcgcggcgggcccggcgatttttgcgggcacccgcgcgccgcataccgatggccgccagaacatggattgcaccacctgccatgatctgattccggcgggcaacggccgcccggcgccgatgatgccgattgcggcgccgattccgccgccgccgattccgatgggcgcggtgagcccgcataccgatggccgccagaacatgaactgcgcgaactgccatcagatgctgggcggcgcggcgccgattgcggcgccgggcctgggcggcggcgcgtatcgctttgcgcagccgccgggcagcctggcgattaacattcagggcccgcgcggcggccagagcaccgcggcgggcaccggccgcgtgaccctgctgggcgcggcgctgaccccgatgagccagcgcctgggcgcgcagaccggcgtgccggtgggccgcggcgtgtttattagcggcgtgaccccgaacaccccggcggcgaccgcgggcctgcgcccgggcgatgtgctgctgaaagtggatggccgcccggtgcgcctgccggaagaagtgagcgcgattatggtggaaatgcatgcgggccgcagcgtgcgcctgggcgtgctgcgcgatggcgatgtgcgcaacatgaccctggtggcgggcccggcgggcctggcggcggcggcggtgcaggcgccggcgattgcggatatggcgcagccgccgatgggcggcatggcgccgaccgcgccgggcatggtggcggtgccgggcggcccggcggtgatgccgaaaccgccgaccgaatttaactggctgggcatggaaattgaaacctttcaggcgccgcgcccgattaccggcgtgccgggcgcggtgccggtgccgggcgcgaaaggcgcgcaggtggcggaagtgctggtgggcagccgcgcggcggtggcgggcctgcaggcgaacgatctgattctggaagtgaacaaccgcccggtggcgggcccggcgcgcctggatgcggcgattaaaggcgcgaccaacgcgggccagcagattctgctgaaagtgaaccgcaacggccaggaattttggattgtgctg

3.3. Codon optimization.

When engineering the genome of one organism to express a foreign gene, codon optimization increases the chances that the translated sequence will result in the correct, functional protein product. Although the central dogma is inherent to all lifeforms, each class of organism will possess a ‘codon bias’ by which the particular codon (of which there may be multiple for one amino acid; degeneracy) will be represented by a comparatively higher concentration of tRNA molecules.

Vibrio natriegens is a gram negative, marine-dwelling prokaryote understood to possess one of the highest growth rates of any organism. This trait would be highly desirable for manufacturing applications where controlled biomineralisation could be harnessed to create solid artefacts in a short timeframe.

As V. Natriegens is not so forth a model organism and was thus omitted as an option from available online tools, I retrieved the codon usage table for Vibrio natriegens from the Codon Usage Database and used the BioInfromatics Reverse Translation Tool to directly generate the most likely to succeed, non-degenerate DNA sequence. This instead required the input of the AA sequence previously retrieved in 3.1.

>reverse translation of sp|Q2W8Q8|MAME_PARM1 Magnetosome formation protease MamE OS=Paramagnetospirillum magneticum (strain ATCC 700264 / AMB-1) OX=342108 GN=mamE PE=1 SV=1 to a 2184 base sequence of most likely codons.

atggcaatgtttaacggtgatgtagaagatggtggtcgtggtgatgcatcatgtggtaaagatcttaaacgttaccttatgcttatgggtgtagtagcacttgtagtactttttggtgcatttatctaccgtcaatcatcaggtggtcttcgtcttggtgcaatgcttgaacaaatgggtcgtggtacaggtccagcagtaaacgtaccagtacaacaaggtggtccatcagcagcagtaaacccagcaatgtcagtaccagcaggtgcacgtgtagcaccaccatcagcagcaggtgcaatcgcaacaatgccaccaatggtagattttggtccagcaccaatcggtgcaggtggtccattttcatcagtagtaacacttcttcgtaactcagtagtagcagtaacagcatcatcagcaaacggtcaagcaatgccagatccacttggtcttgcaaacccagatggtcttccacattttgcaaacccagcaacacgttcagtagaaaacatcggtacaggtgtaatcgtacgtaacgatggttttatcgtaacaaactaccatgtagtacgtggtgcaaactcagtatttgtaacagtacaagatgatgtaggttcaacacgttactcagcagaaatcatcaaaatggatgaagcacttgatcttgcacttcttaaagtagcaccaaaaacaccacttacagcagcagtacttggtgattcagatggtgtacaagtagcagatgaagtaatcgcaatcggtacaccatttggtcttgatatgacagtatcacgtggtatcatctcagcaaaacgtaaatcaatggtaatcgaaggtgtaacacattcaaaccttcttcaaacagatgcagcaatcaaccaaggtaactcaggtggtccacttgtaatctcaaacggtacagtagtaggtatcaacacagcaatctacacaccaaacggtgcatttgcaggtatcggttttgcagtaccatcaaaccaagcacgtctttttatccttgatgaagtaggttggcttccaacatcaacagcagaaggtgcatcaatgggtcttgtagcaatgcaacgtccaatgggtggtggtgtaggtgcagcaggtccagcaatctttgcaggtacacgtgcaccacatacagatggtcgtcaaaacatggattgtacaacatgtcatgatcttatcccagcaggtaacggtcgtccagcaccaatgatgccaatcgcagcaccaatcccaccaccaccaatcccaatgggtgcagtatcaccacatacagatggtcgtcaaaacatgaactgtgcaaactgtcatcaaatgcttggtggtgcagcaccaatcgcagcaccaggtcttggtggtggtgcataccgttttgcacaaccaccaggttcacttgcaatcaacatccaaggtccacgtggtggtcaatcaacagcagcaggtacaggtcgtgtaacacttcttggtgcagcacttacaccaatgtcacaacgtcttggtgcacaaacaggtgtaccagtaggtcgtggtgtatttatctcaggtgtaacaccaaacacaccagcagcaacagcaggtcttcgtccaggtgatgtacttcttaaagtagatggtcgtccagtacgtcttccagaagaagtatcagcaatcatggtagaaatgcatgcaggtcgttcagtacgtcttggtgtacttcgtgatggtgatgtacgtaacatgacacttgtacaggtccagcaggtcttgcagcagcagcagtacaagcaccagcaatcgcagatatggcacaaccaccaatgggtggtatggcaccaacagcaccaggtatggtagcagtaccaggtggtccagcagtaatgccaaaaccaccaacagaatttaactggcttggtatggaaatcgaaacatttcaagcaccacgtccaatcacaggtgtaccaggtgcagtaccagtaccaggtgcaaaaggtgcacaagtagcagaagtacttgtaggttcacgtgcagcagtagcaggtcttcaagcaaacgatcttatccttgaagtaaacaaccgtccagtagcaggtccagcacgtcttgatgcagcaatcaaaggtgcaacaaacgcaggtcaacaaatccttcttaaagtaaaccgtaacggtcaagaattttggatcgtactt

For HW consistency, I additionally optimised the ‘most likely codon’ MamE sequence for expression in Escherichia Coli (E. Coli) via the tool at Vectorbuilder:

MamE gene optimised for expression in E. Coli MamE gene optimised for expression in E. Coli

The resulting Codon Adaption Index (CAI) for E. Coli fortunately still appeared to fall within a suitable range of <0.8.

3.4. You have a sequence! Now what?

The mamE sequence could first be integrated in-vitro into a recombinant plasmid, then transformed into V. Natriegen cells via electroporation, by which plasmids enter the cell through the creation of temporary pores in its membrane. The recombinant plasmids should then be transcribed and translated as normal by the appropriate enzymes within the cytosol. Electroporation is advantageous as is both efficient and relatively fast.

3.5. Bonus: How does it work in nature/biological systems?

In prokaryotes such as V. Natriegen, polycistronic transcription for a single mRNA molecule to be translated into different proteins. This is achieved via the inclusion of separate, distinct START and STOP codons, creating separate regions along the same mRNA molecule which are thus translated into distinct proteins.

I attempted to align the flow of information from DNA through polypeptide via the stacking of my three produced Benchling sequences manually. Unfortunately begins to break down in alignment, but accurate for the first three codons:

MamE Central Dogma MamE Central Dogma

ChatGPT used to format correct alignment:

ChatGPT Aligned Image ChatGPT Aligned Image
Part 4: Prepare a Twist DNA Synthesis Order

This process was really fascinating! I would love to learn more about the different inclusions along the way, and of their functionality + importance.

Review my plasmid here.

Part 5: DNA Read/Write/Edit

5.1 DNA Read (i) What DNA would you want to sequence (e.g., read) and why?

It would be intriguing to sequence morphogen-encoding genes (such as the HOX group) in species which have so far not been included. I would love to compare the base composition of different morphogen-encoding genes across distinct species to better understand the mechanisms behind different types of tissue patterning, and perhaps discover more conserved sequences.

(ii) In lecture, a variety of sequencing technologies were mentioned. What technology or technologies would you use to perform sequencing on your DNA and why?

I would select some combination of Single Molecule (third-generation; sequences individual bases directly without prior amplification) and Fluorescent 3D (other; distant in characteristics) to map genes of interest. Fluorescent 3D could be used to first determine the spatial concentration of suspected morphogen-encoding genes in a given tissue, then use Single Molecule to determine finer structural details. The capacity for Single Molecule to produce very long reads is desirable where developmental regions may be large and complex.

Fluorescent 3D:

  • Input: Hybridization target probe pre-prepared based on theorized gene of interest (GOI). Tissue segment of interest pre-prepared on slide.

  • Base calling method: iterative cycles of images detect each base as ‘spot in tissue’; corresponds to a particular gene identity.

  • Output: spatial coordinates for gene expression + cells it is expressed by.

Single Molecule:

  • Input: cells identified to contain GOI previous step are isolated and cultured, then lysed before separating and purifying DNA. Construction (oligonucleotide synthesis) and addition of nanopore sequencing adapters. Segments loaded into flow cell.

  • Base calling method: individual DNA molecules pulled through nanopore by sequencing adapter, disrupting baseline ionic current of the detector which is detected and recorded by device. ‘Trace signals’ correspond to a particular base.

  • Output: Long-read sequences (LRS)

5.2 DNA Write (i) What DNA would you want to synthesize (e.g., write) and why?

Artificially constructing a cluster of genes which allows for relatively precise, customisable control of tissue patterning would be highly appealing. Think programmable osteablasts, where the functional grading of bone-like structures can be controlled to produce an organic car chassis.

(ii) What technology or technologies would you use to perform this DNA synthesis and why?

Multiplexed Oligo Synthesis followed by Gibson Assembly appears a feasible option for the construction of large genes. Many sub-80 bp oligos can first be synthesised in parallel as a means of error-rate limitation, then combined via Gibson Assembly to construct the entire gene. In regards to morphogenic genes, both high precision and accuracy are required as the resulting phenotype is likely highly sensitive to accurate sequence composition.

Multiplexed Oligo Synthesis (in situ, via inkjet printing)

  • Essential steps: computational design of desired DNA sequence, followed by division into 60-80 bp oligos with overlapping regions. Microarray pre-prepared with contact primer at each oligo synthesis site; single added sequentially in cycles at each site adhering to phosphamidite chemistry process. At termination length, cleaved, retrieved, and amplified.

  • Limitations: error rate increases as sequence grows larger. Overlapping requirements of gibson assembly requires more cycles to be performed.

Gibson assembly

  • Essential steps: Addition of exonuclease to prepared oligonucleotides to create sticky overhangs. Segments hybridise at designed complementary overlapping regions. DNA polymerase introduced to integrate missing nt’s at gaps. Ligase seals gaps throughout the incomplete backbone.

  • Limitations: efficiency decreases as quantity of oligo fragments increases. Precise design of overlapping regions essential.

5.3 DNA Edit (i) What DNA would you want to edit and why?

It would be ideal to edit the genomes of more agricultural crops to introduce robust autofertilisation traits via symbiosis with microorganisms, for use in both developed and developing nations alike. In industrialised agriculture, external fertilisation is responsible for damaging pollution events throughout adjacent ecosystems. In developing regions where soil viability is poor, endemic low-yields resulting from limited fertilizer access drive perilous consequences. As the most abundant atmospheric gas, fixation of nitrogen directly from where directly where it is required makes significant sense.

(ii) What technology or technologies would you use to perform these DNA edits and why?

CRISPR/Cas9 currently appears the most desirable method for performing DNA modifications within plant embryo cells, particularly due to its efficiency and precision.

  • Preparation + inputs: Design gRNA to match target sequence within plant embryo; incorporate into suitable vector alongside Cas9 gene.

  • Limitations: Cuts at sequences similar to DNA target in embryo possible, requiring careful planning and validation post-edit. Editing efficiency dependent on accessibility of target gene.

References

Jerlie Mhay Matres, H., Hilscher, J., Datta, A., Armario-Nájera, V., Baysal, C., He, W., Huang, X., Zhu, C., Valizadeh-Kamran, R., Trijatmiko, K.R., Capell, T., Christou, P., Stoger, E. and Slamet-Loedin, I.H. (2021). Genome editing in cereal crops: an overview. Transgenic Research, 30(4), pp.461–498. https://doi.org/10.1007/s11248-021-00259-6

Lee, H.H., Ostrov, N., Wong, B.G., Gold, M.A., Khalil, A.S. and Church, G.M. (2016). Vibrio natriegens, a new genomic powerhouse. https://doi.org/10.1101/058487

Quinlan, A., Murat, D., Vali, H. and Komeili, A. (2011). The HtrA/DegP family protease MamE is a bifunctional protein with roles in magnetosome protein localization and magnetite biomineralization. Molecular Microbiology, 80(4), pp.1075–1087. https://doi.org/10.1111/j.1365-2958.2011.07631.x

Wan, J., Ji, R., Liu, J., Ma, K., Pan, Y. and Lin, W. (2024). Biomineralization in magnetotactic bacteria: From diversity to molecular discovery-based applications. Cell Reports, 43(12), p.114995. https://doi.org/10.1016/j.celrep.2024.114995

Weinstock, M.T., Hesek, E.D., Wilson, C.M. and Gibson, D.G. (2016). Vibrio natriegens as a fast-growing host for molecular biology. Nature Methods, 13(10), pp.849–851. https://doi.org/10.1038/nmeth.3970

Week 3 HW: Lab Automation

Part 1: OT-2 Automation

This week involved the plating and growth of a design created via the controlled aspiration of fluorescent Escherichia Coli (E. coli) through the OT-2 virtual platform.

Left: final result! P(eace)CR seemed a fitting name.

Right: initial reference image. source: peaceful pixel dove.

After some brief iteration, I settled on an 8-bit interpretation of the classic ‘dove with olive branch’, a relatively universal symbol of peace. As DesignerCells only has access to sfGFP and mRFP1 fluorescent recombinants, this design provided a suitable compromise to the design limitations. I will include the final outcome pending the lab is successful (fingers crossed!).

ChatGPT was used in conjunction with the CoLabs reference code to create the final script for the OT-2 liquid handling platform:

from opentrons import types

metadata = {    # see https://docs.opentrons.com/v2/tutorial.html#tutorial-metadata
    'author': 'Zander Morris',
    'protocolName': 'P(eace)CR',
    'description': 'Prints the P(eace)CR graphic via the set of data points.',
    'source': 'HTGAA 2026 Opentrons Lab',
    'apiLevel': '2.20'
}

##############################################################################
###   Robot deck setup constants - don't change these
##############################################################################

TIP_RACK_DECK_SLOT = 9
COLORS_DECK_SLOT = 6
AGAR_DECK_SLOT = 5
PIPETTE_STARTING_TIP_WELL = 'A1'

well_colors = {
    'A1' : 'Red',
    'B1' : 'Green',
    'C1' : 'Orange'
}

sfgfp_points = [(-26.4, 21.6),(-26.4, 19.2),(-24, 19.2),(-26.4, 16.8),(-24, 16.8),(-21.6, 16.8),(-14.4, 16.8),(-26.4, 14.4),(-24, 14.4),(-21.6, 14.4),(-16.8, 14.4),(-14.4, 14.4),(-21.6, 12),(-19.2, 12),(-16.8, 12),(-14.4, 12),(-24, 9.6),(-21.6, 9.6),(-19.2, 9.6),(-16.8, 9.6),(-26.4, 7.2),(-24, 7.2),(-19.2, 7.2),(-16.8, 4.8),(-21.6, 2.4),(-19.2, 2.4),(-16.8, 2.4),(-19.2, 0),(-16.8, 0),(-14.4, 0),(-14.4, -2.4),(-14.4, -4.8),(-14.4, -7.2),(-12, -9.6),(-12, -12),(-9.6, -14.4),(-9.6, -16.8),(-9.6, -19.2)]
mrfp1_points = [(12, 26.4),(7.2, 24),(9.6, 24),(12, 24),(26.4, 24),(4.8, 21.6),(7.2, 21.6),(9.6, 21.6),(12, 21.6),(24, 21.6),(26.4, 21.6),(2.4, 19.2),(4.8, 19.2),(9.6, 19.2),(12, 19.2),(24, 19.2),(26.4, 19.2),(0, 16.8),(9.6, 16.8),(12, 16.8),(21.6, 16.8),(24, 16.8),(26.4, 16.8),(0, 14.4),(9.6, 14.4),(12, 14.4),(19.2, 14.4),(21.6, 14.4),(24, 14.4),(26.4, 14.4),(0, 12),(9.6, 12),(12, 12),(14.4, 12),(16.8, 12),(21.6, 12),(24, 12),(26.4, 12),(0, 9.6),(7.2, 9.6),(9.6, 9.6),(12, 9.6),(21.6, 9.6),(24, 9.6),(26.4, 9.6),(-9.6, 7.2),(-7.2, 7.2),(0, 7.2),(7.2, 7.2),(9.6, 7.2),(21.6, 7.2),(24, 7.2),(-12, 4.8),(-9.6, 4.8),(-4.8, 4.8),(-2.4, 4.8),(0, 4.8),(4.8, 4.8),(21.6, 4.8),(24, 4.8),(-14.4, 2.4),(-12, 2.4),(-2.4, 2.4),(0, 2.4),(2.4, 2.4),(21.6, 2.4),(-9.6, 0),(-7.2, 0),(16.8, 0),(19.2, 0),(21.6, 0),(-7.2, -2.4),(-4.8, -2.4),(14.4, -2.4),(16.8, -2.4),(19.2, -2.4),(-7.2, -4.8),(-4.8, -4.8),(7.2, -4.8),(9.6, -4.8),(12, -4.8),(14.4, -4.8),(16.8, -4.8),(-4.8, -7.2),(-2.4, -7.2),(12, -7.2),(-4.8, -9.6),(-2.4, -9.6),(0, -9.6),(14.4, -9.6),(16.8, -9.6),(19.2, -9.6),(-2.4, -12),(0, -12),(2.4, -12),(4.8, -12),(19.2, -12),(21.6, -12),(0, -14.4),(2.4, -14.4),(4.8, -14.4),(7.2, -14.4),(9.6, -14.4),(12, -14.4),(19.2, -14.4),(21.6, -14.4),(24, -14.4),(26.4, -14.4),(28.8, -14.4),(14.4, -16.8),(19.2, -16.8),(21.6, -16.8),(24, -16.8),(26.4, -16.8),(28.8, -16.8),(16.8, -19.2),(19.2, -19.2),(21.6, -19.2),(24, -19.2),(19.2, -21.6),(21.6, -21.6),(19.2, -24)]

def run(protocol):
  ##############################################################################
  ###   Load labware, modules and pipettes
  ##############################################################################

  # Tips
  tips_20ul = protocol.load_labware('opentrons_96_tiprack_20ul', TIP_RACK_DECK_SLOT, 'Opentrons 20uL Tips')

  # Pipettes
  pipette_20ul = protocol.load_instrument("p20_single_gen2", "right", [tips_20ul])

  # Modules
  temperature_module = protocol.load_module('temperature module gen2', COLORS_DECK_SLOT)

  # Temperature Module Plate
  temperature_plate = temperature_module.load_labware('opentrons_96_aluminumblock_generic_pcr_strip_200ul',
                                                      'Cold Plate')
  # Choose where to take the colors from
  color_plate = temperature_plate

  # Agar Plate
  agar_plate = protocol.load_labware('htgaa_agar_plate', AGAR_DECK_SLOT, 'Agar Plate')  ## TA MUST CALIBRATE EACH PLATE!
  # Get the top-center of the plate, make sure the plate was calibrated before running this
  center_location = agar_plate['A1'].top()

  pipette_20ul.starting_tip = tips_20ul.well(PIPETTE_STARTING_TIP_WELL)

  ##############################################################################
  ###   Patterning
  ##############################################################################

  ###
  ### Helper functions for this lab
  ###

  # pass this e.g. 'Red' and get back a Location which can be passed to aspirate()
  def location_of_color(color_string):
    for well,color in well_colors.items():
      if color.lower() == color_string.lower():
        return color_plate[well]
    raise ValueError(f"No well found with color {color_string}")

  # For this lab, instead of calling pipette.dispense(1, loc) use this: dispense_and_detach(pipette, 1, loc)
  def dispense_and_detach(pipette, volume, location):
      """
      Move laterally 5mm above the plate (to avoid smearing a drop); then drop down to the plate,
      dispense, move back up 5mm to detach drop, and stay high to be ready for next lateral move.
      5mm because a 4uL drop is 2mm diameter; and a 2deg tilt in the agar pour is >3mm difference across a plate.
      """
      assert(isinstance(volume, (int, float)))
      above_location = location.move(types.Point(z=location.point.z + 5))  # 5mm above
      pipette.move_to(above_location)       # Go to 5mm above the dispensing location
      pipette.dispense(volume, location)    # Go straight downwards and dispense
      pipette.move_to(above_location)       # Go straight up to detach drop and stay high

  ###
  ### YOUR CODE HERE to create your design
  ###

  # -----------------------------
  # Printing parameters
  # -----------------------------
  VOL_PER_DOT = 0.75

  # Keep aspirates comfortably below 20uL for accuracy/safety
  MAX_ASPIRATE_UL = 18.0
  MAX_BATCH_DOTS = int(MAX_ASPIRATE_UL // VOL_PER_DOT)  # 18.0 // 0.75 = 24

  # Choose where on Z you actually want to dispense.
  # Start conservative: 0 means "at agar_plate['A1'].top() plane".
  # If your drops need to touch the agar more, try -0.5 or -1.0 after testing.
  DISPENSE_DZ = 2

  def point_location_from_center(dx, dy, dz=DISPENSE_DZ):
      # Offsets are in mm
      return center_location.move(types.Point(x=dx, y=dy, z=dz))

  def print_points(points, color_name):
      pipette_20ul.pick_up_tip()

      i = 0
      while i < len(points):
          batch = points[i:i + MAX_BATCH_DOTS]
          batch_volume = len(batch) * VOL_PER_DOT

          # Pull enough dye for this batch
          pipette_20ul.aspirate(batch_volume, location_of_color(color_name))

          # Dispense each dot
          for (dx, dy) in batch:
              loc = point_location_from_center(dx, dy)
              dispense_and_detach(pipette_20ul, VOL_PER_DOT, loc)

          i += MAX_BATCH_DOTS

      pipette_20ul.drop_tip()

  # -----------------------------
  # Print your two datasets
  # -----------------------------
  print_points(sfgfp_points, "Green")
  print_points(mrfp1_points, "Red")

Prompts were primarily as follows. Small corrections and confirmations were omitted for clarity:

I want to take a set of predefined x and y points and allow my them to be interpreted by Opentrons OT-2 via a Python script, so that the actuator can move to each point and dispense a set volume of liquid. How do setup a grid and allow the OT-2 to move to each location?

[code inserted]. This is the code I have so far. How can I get it to navigate between the points?

0.75 ul per dot please.

I can’t get my points list to correctly define?

Okay starting over. this is my code so far. what should I add precisely to get it to work?

[series of errors corrected via feedback and correction loop with ChatGPT]

Part 2: Find and describe a published paper that utilizes the Opentrons or an automation tool to achieve novel biological applications.

“An Automation Workflow for High‐Throughput Manufacturing and Analysis of Scaffold‐Supported 3D Tissue Arrays”

This paper details the use of the OT-2 as a means to automate various processes and assays associated with the culture of patient-derived organoids. Conventionally a process performed manually, the scaffold-supported platform for orgonoid-based tissue (SPOT) method is automated as a possible solution to improve throughput and scalability. This approach provides a possible solution to the limited conventional scalability of SPOT. As SPOT is a method of drug discovery, specifically in the development of therapeutics personalized to the specific tissue of a patient’s tumour, increasing the speed and volume at which this process may be performed via OT-2 lab automation is a highly promising application of lab-automation technology.

Part 3: Write a description about what you intend to do with automation tools for your final project.

I want to investigate the process of controlling the spatial formation of a biofilm (or the biomineralisation process) across the surface of a morphable 3D artifact (scaffold).

Although I am still unsure of the precise outcome, I would like to explore the possibility of integrating automation at two points:

  1. At culture production: to perform the recombinant process, automating the transformation process to facilitate the uptake of engineered plasmids via a desirable prokaryotic host.

  2. Within physical tool: to serve as a repeatbly programmable platform for biofilm formation.

I will work to crystalise the precise integration of automation methods in the coming weeks.

References

Cao, R., Li, N.T., Latour, S., Cadavid, J.L., Tan, C.M., Forman, A., Jackson, H.W. and McGuigan, A.P. (2023). An Automation Workflow for High‐Throughput Manufacturing and Analysis of Scaffold‐Supported 3D Tissue Arrays. Advanced Healthcare Materials, [online] 12(19). doi:https://doi.org/10.1002/adhm.202202422.

Reference image source: StockCake (2026). Peaceful Pixel Dove. [online] StockCake. Available at: https://stockcake.com/i/peaceful-pixel-dove_3822449_1791913 [Accessed 27 Feb. 2026].

Week 4 HW: Protein Design Part i

Part A: Conceptual Questions

1. How many molecules of amino acids do you take with a piece of 500 grams of meat? (on average an amino acid is ~100 Daltons)

500 grams of beef contains ≈ 112 g of protein. 1 dalton corresponds to ≈ 1.6605×10-24 gram. 1 amino acid (100 daltons) ≈ 1.6605×10−22 grams. 112 grams / 1.6605×10−22 grams ≈ 6.745×1023.

The 112 g of protein contained with 500 g of beef represents ≈ 6.745×1023 molecules of amino acid!

2. Why do humans eat beef but do not become a cow, eat fish but do not become fish?

One aspect of “cowness” could be considered the successful construction, transcription, and translation of genetic material which results in a heritable bovine phenotype; physical traits and behaviours which can be associated with cows. For this to be possible, genetic material must not only be incorporated inherently across all tissues at the cellular level, but be intact, and complemented by the correct composition of functional enzymes.

When humans consume tissues from another organism, a series of digestive enzymes effectively destroy the structure of foreign macromolecules (such as DNA), into recyclable constituent molecules (like nucleotides). Even if foreign genetic material were to remain intact inside the stomach, it would be impossible for it to be integrated across all cells at the scale required to produce observable “cowness”.

3. Why are there only 20 natural amino acids?

This is a very intriguing question by which the precise answer still remains elusive. One theory suggests that the canonical 20 were selected gradually on the basis of parsimony (extreme stinginess, or frugality), where monomers favoured provided the widest spectrum of utility for the lowest investment of energy and material (Bywater, 2018).

4. Can you make other non-natural amino acids? Design some new amino acids.

The synthesis of non-canonical amino acids (NcAA’s) is possible, with large advancements in the field made in the early 21st century. For my attempt at NcAA design, see 4.1 below.

5. Where did amino acids come from before enzymes that make them, and before life started?

Amino acids are theorised to have emerged abiotically via complex chemical processes that were facilitated by the proximity of specific chemical reactants and physical parameters (heat, electricity). It is also possible that amino acids arrived on earth from extraterrestrial sources via a meteorite(s), where other methods of synthesis may have first preluded their arrival (Kirsching, 2022).

6. If you make an α-helix using D-amino acids, what handedness (right or left) would you expect?

Amino acids can be characterised by their chirality, with pairs known as enantiomers representing their two possible configurations (L and D). This pair represents the ‘mirror-image’, non-superimposable structural quality central to chirality. In living organisms, L-aminos are almost exclusively used to construct proteins. In α-helices, this curiously results in the final protein having a D-configuration. Using D-aminos to form α-helices instead yields an overall L-configuration in the resultant structure.

7. Can you discover additional helices in proteins?

Additional types of helices have been discovered and researched, such as the 3-10 helices and Pi helices!

8. Why are most molecular helices right-handed?

Most molecular helices appear right-handed as a resultant property of their left-handed amino monomers. This D configuration is said to improve steric hindrance (reduce reactivity of specific molecules in macromolecules) and energetic stability (qualities of hydrogen bonds in backbone of a-helices are optimised).

9. Why do β-sheets tend to aggregate? What is the driving force for β-sheet aggregation?

The chemical structure of the edges of β-sheets promote aggregation, as it readily encourages the formation of hydrogen bonds with adjacent β-sheets to form larger aggregates.

4.1: Design some new amino acids.

AlaninePropargyl-valineHalogenated-valine

I found and followed a short tutorial online detailing how Pymol can be used to quickly design NcAA models, starting with Valine as per the demonstration. With an extremely limited background in chemistry, I used chatGPT to aid me in selecting and positioning functional groups. I decided to create 2 NcAA’s, one optimised for click-reactivity (Acetylene) and photo-reactivity (Bromine), via the separate addition of acetylene and bromine to the methyl group of each Alanine respectively. ChatGPT prompts:

I would like to take this valine and make a non-canonical amino acid by adding different functional groups. which spots should I add groups onto, and what can they be?

I would like to edit for photo-reactivity and click-reactivity. What groups do I add, and what part of the image represents the methyl group?

[Showed me a series of possible combinations.]

These are the only fragments I have available.

[I provided the options available under the ‘fragment’ tab in PyMOL, which then allowed it to narrow down the additions.]

[Two final options I narrowed down two based on desirable characteristics]
[Propargyl-valine = Acetylene addition = click-reactivity]
[Halogenated-valine = bromine addition = photo-reactivity]
Part B: Protein Analysis and Visualization

Many bacterial species are capable of cell-to-cell communication as a response to changes in localised population density. This system is known as quorum sensing.

1. Briefly describe the protein you selected and why you selected it.

I have selected LuxR, a transcriptional activator, and one of the first proteins (alongside LuxL) to be identified as integral in a QS pathway. LuxR works with LuxL in the prokaryote Vibrio fischeri to permit bioluminescence once a particular population density is reached. QS pathways are particularly interesting to me as I would like to understand how they may be leveraged to enable external control of a bacterial population’s behaviour at the cellular level via controlled chemical signalling.

2. Identify the amino acid sequence of your protein.

LuxR has a length of 250 amino acids. The most common is Isoleucine (I), which appears 28 times. Using BLAST, LuxR appeared to have approx. 186 possible homologs after using the E value to cull the total pool of 250 matches. The protein is also the title for the family it is contained within: LuxR-type transcriptional regulators.

>sp|P35327|LUXS_ALIF1 Transcriptional activator protein LuxR OS=Aliivibrio fischeri (strain ATCC 700601 / ES114) OX=312309 GN=luxR PE=3 SV=1

MNIKNINANEKIIDKIKTCNNNKDINQCLSEIAKIIHCEYYLFAIIYPHSIIKPDVSIIDNYPEKWRKYYDDAGLLEYDPVVDYSKSHHSPINWNVFEKKTIKKESPNVIKEAQESGLITGFSFPIHTASNGFGMLSFAHSDKDIYTDSLFLHASTNVPLMLPSLVDNYQKINTTRKKSDSILTKREKECLAWASEGKSTWDISKILGCSERTVTFHLTNTQMKLNTTNRCQSISKAILTGAINCPYLKN

3. Identify the structure page of your protein in RCSB.

I couldn’t find a non ML-resolved match for the LuxR sequence from V. fischeri, which was intriguing considering it was supposedly the first organism where this protein was identified.

I did however find a direct match that was resolved with Alphafold DB on 2021-12-09 with a global model confidence of 88.61%. I couldn’t determine if there were additional molecules in the structure. There was also a homolog called 7AMT resolved though x-ray diffraction, which was deposited on 2020-10-09 with an excellent resolution of 2.60 Å. Both proteins belong to the transcription classification family.

4. Open the structure of your protein in any 3D molecule visualization software.

I continued with 7AMT as it was readily available in the correct format from PDB.

Cartoon
Cylinder
Stick + Sphere
7AMT surface structure visualised with hydrophilic (blue) and hydrophobic (orange) regions highlighted. Several pits can be observed, particularly on the side opposite to the DNA binding region.
Part C: Using ML-Based Protein Design Tools
C.1: Protein Language Modeling

I decided to progress with 7AMT as its prior resolvement through x-ray diffraction makes it a good candidate for comparison and testing with the provided ML tools!

1. Deep Mutational Scans

7AMT: Unsupervised Deep Mutational Scan.

2. Latent Space Analysis

7AMT: Latent Space Analysis.
C.2: Protein Folding
7AMT ESMfold 2: rainbow for clarity.7AMT ESMfold 1: colour gradient representing confidence of resolution.
C.3: Protein Generation
Original structure (confidence)Predicted structure (confidence)
References

Foodstandards.gov.au. (2026). Australian Food Composition Database. [online] Available at: https://afcd.foodstandards.gov.au/fooddetails.aspx?PFKID=F000678 [Accessed 6 Mar. 2026].

Bywater, R.P. (2018). Why twenty amino acid residue types suffice(d) to support all living systems. PLoS ONE, 13(10), pp.e0204883–e0204883. https://doi.org/10.1371/journal.pone.0204883

Cooley, R.B., Arp, D.J. and Karplus, P.A. (2010). Evolutionary origin of a secondary structure: π-helices as cryptic but widespread insertional variations of α-helices that enhance protein functionality. Journal of Molecular Biology, 404(2), pp.232–246. https://doi.org/10.1016/j.jmb.2010.09.034

Doig, A.J. (2017). Frozen, but no accident – why the 20 standard amino acids were selected. The FEBS Journal, 284(9), pp.1296–1305. https://doi.org/10.1111/febs.13982

Kirschning, A. (2022). On the evolutionary history of the twenty encoded amino acids. Chemistry – A European Journal, 28(55). https://doi.org/10.1002/chem.202201419

Munier, R. and Cohen, G.N. (1956). Incorporation d’analogues structuraux d’aminoacides dans les protéines bactériennes. Biochimica et Biophysica Acta, 21(3), pp.592–593. https://doi.org/10.1016/0006-3002(56)90207-4

Oliveira, N.M., Foster, K.R. and Durham, W.M. (2016). Single-cell twitching chemotaxis in developing biofilms. Proceedings of the National Academy of Sciences, 113(23), pp.6532–6537. https://doi.org/10.1073/pnas.1600760113

Soto-Aceves, M.P., Diggle, S.P. and Greenberg, E.P. (2023). Microbial primer: LuxR-LuxI quorum sensing. Microbiology, 169(9). https://doi.org/10.1099/mic.0.001343

Week 5 HW: Protein Design Part ii

Part 1: Generate Binders with PepMLM

Human SOD1 sequence (without A4V):

>sp|P00441|SODC_HUMAN Superoxide dismutase [Cu-Zn] OS=Homo sapiens OX=9606 GN=SOD1 PE=1 SV=2

MATKAVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFHVHEFGDNTAGCTSAGPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTLVVHEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQ

A4V represents a point mutation within the 4th codon of the human SOD1 gene. This results in the substitution of the conventional alanine for a valine amino acid.

Mutant SOD1 sequence (with A4V):

MATKVVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFHVHEFGDNTAGCTSAGPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTLVVHEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQ

Four experimental peptides were then generated using the PepMLM tool:

BinderPseudo Perplexity
HRYYPVAVRLKE11.513239315710866
HRYPVVAVAWKE14.299230176233568
KRYPPVAARWKE20.264871722997526
WLYPVAAARHKE20.749563629245603
Evaluate Binders with AlphaFold3
EntryBinderPseudo PerplexityipTMDescriptionImage
🎮FLYRWLPSRRGG0.31Known-binder
0HRYYPVAVRLKE11.5132393157108660.26HRYYPVAVRLKE image
1HRYPVVAVAWKE14.2992301762335680.26HRYPVVAVAWKE image
2KRYPPVAARWKE20.2648717229975260.34KRYPPVAARWKE image
3WLYPVAAARHKE20.7495636292456030.35WLYPVAAARHKE image
Part 3: Evaluate Properties of Generated Peptides in the PeptiVerse

Week 6 HW: Genetic Circuits Part i

DNA Assembly
  1. The primary component of Phusion High-Fidelity PCR Master Mix is the polymerase Phusion, an alternative to standard taq polymerase which provides upwards of 40x greater fidelity (the accuracy at which a template strand is replicated) comparative to taq. It also contains free-deoxynucleotides (bases for formation of replicated strand) and a reaction buffer (provides optimal pH conditions for Phusion polymerase activity).

  2. Factors may include primer melting temperature, composition of reaction buffer, characteristics of target sequence, and type of polymerase. For example, primers containing higher concentrations of guanine and cytosine bases will possess a higher melting temperature due to the strength imparted by the triple hydrogen bonds of GC.

  3. PCR involves the amplification and ‘construction’ of fragments from a provided template sequence, while restriction digests produce linear fragments from the repeated ‘truncation’ of larger fragments within a concentrated pool (via restriction enzymes).

    • Both methods are commonly used in conjunction.
    • Both methods produce linear double-stranded fragments.
    • The result of both methods may be assayed via gel electrophoresis.
  4. Primers with the appropriate 20-40bp overlaps must be precisely designed and validated for use prior to their use as the initiation point of PCR. This ensures the required 5’ to 3’ orientation and overlaps are present to permit the successful cloning via Gibson assembly.

  5. Simple diffusion permits the entry of cloned plasmids into E. coli cells permitted that heat shock or electroporation techniques have been used prior, as these methods generate pores within the plasma membrane that would otherwise be impassable.

  6. Twin-Primer assembly is an non-enzymatic, alternative technique of assembling PCR-amplified fragments into plasmids, while retaining the necessary efficacy and fidelity of conventional methodologies:

    1. Segment the overall sequence in-silco into a series of fragments with complimentary designed overlaps between adjacent fragments, with two ‘sticky ends’ required per fragment. Overlaps are characteristically 16-20 bp long, and are selected to possess a universal melting temperature of approximately 50 C.

    2. Amplification is then performed separately, twice per fragment, with each iteration using a different set of primer pairs. Per fragment, this produces: A, sequence with overlap to previous fragment, and B, sequence with overlap to next fragment.

    3. The two products (A and B) are then combined to generate two separate versions of the same fragment however featuring unique overlapping regions.

    4. Post mix., the PCR’ed products are denatured to generate ssDNA, before controlled cooling to generate hybrid ‘duplex intermediates’.

    5. All duplex intermediates are then incubated collectively, permitting the annealing of overhangs from complimentary fragments, and ultimately assembling a plasmid (albeit with nicked backbone)

    6. Upon transformation into the desired host, bacterial machinery is co-opted to phosphorylate DNA ends and ligate nicks throughout the DNA backbone, ultimately resulting in a complete plasmid.

Twin Primer Assembly diagram
Asimov Kernel

The Repressilator

'Repressilator' Construct
The Repressilator construct. Although its title is very fitting, it's still such a cool name!

Iteration 1

  • Began by referencing the Repressilator paper: source
  • Unsure of the appropriate RBS to use prior to each coding region. There’s a fair few available in ‘Charecterized Bacterial Parts’ repository! Wasn’t able to locate BBa_B0034, the original RBS utilised within the construct described in the paper. Selected S1 RBS at random; lets see how it goes!
  • I was able to locate and insert pSC101, the original low-copy backbone used by Elowitz & Liebler!
  • Stock settings were employed in the initial simulation (E. coli, 72 hours, 10 min timestep).
  • Results were not aligned with the oscillatory behaviour of the Repressilator. Protein expression appeared to peak across each gene, then remain constant.

Iteration 2

  • Realised almost immediately upon inspection of the repressilator in the bacterial samples folder that I had failed to include terminators!
  • Retained pSC101 and S1 RBS.
  • Stock settings (E. coli, 72 hours, 10 min timestep).
  • Results were much more reminiscent of reference! Oscillatory expression behaviour appeared to ‘tighten’, with the period each gene was expressed narrowing throughout the simulation.

Iteration 3

  • Finally adjusted RBS to A1 and backbone to pUC-SpecR v1 (as featured within the reference Repressilator)
  • Promoters still appear in slightly different locations to reference, however the overall architecture appears to match that of Elowitz & Liebler.
  • Results were again significantly different; actually less relevant than those obtained in iteration 2!
  • Definitely suspicious that the layout of the promoters is altering the outcome.

Iteration 4

  • Precisely copied the layout of the reference construct.
  • Results matched the reference.
Simulation 1Simulation 2Simulation 3Simulation 4
Iteration 1Iteration 2Iteration 3Iteration 4
Custom Constructs

Construct A

'Construct A
Construct A

I wanted to understand how the pTac promoter could be used in conjunction with IPTG ligands to control expression of a specific gene. In construct A, the intention is that the transcription of LacL produces a repressor which prevents the transcription of mKate and thus its translation to yield the fluorescent reporter protein.

Simulation 1

  • Settings: E. coli, 72 hours, 10 min intervals. Ligand (IPTG) introduced at max concentration at 36 hours.
  • Appeared to work first try! LacL was transcribed at a constant rate, repressing the activity of the pTac promoter. At 36 hours, the introduction of IPTG functioned as a structural analog to allolactose, inhibiting the activity of the repressor and permitting the transcription of mKate.
Simulation 1Simulation 2Simulation 3Simulation 4
Protien ConcentrationsRNA ConcentrationsRibosome FluxRNAP FLux

Week 7 HW: Genetic Circuits Part ii

Intracellular Artificial Neural Networks (IANNs)

What advantages do IANNs have over traditional genetic circuits, whose input/output behaviors are Boolean functions?

Cellular processes are inherently continuous, with a multitude of intermeshed processes with inputs and outputs that are mutually significant. It is often difficult, if not impossible, to effectively manipulate such an environment through a boolean system alone.

IANN’s allow for genetic circuits to be programmed which respond effectively to ‘analog’ signals, whereby the rules of gene expression (when, and how much) may be governed by molecular gradients across two or three separate inputs.

Describe a useful application for an IANN; include a detailed description of input/output behavior, as well as any limitations an IANN might face to achieve your goal.

Quorum sensing is the process by which bacterial cells within a shared environment may communicate as a means to regulate population-wide behaviours, typically in response to unique population thresholds. Let’s consider a theoretical case for biomaterial formation, where we have first programmed and cultured a non-pathogenic bacterial species to function as an efficient biomineralisation unit.

To permit a degree of programmability to the duration and localisation of biomineralisation behaviour across each individual bacterium, an IANN may be employed to modify and expand the capabilities of the quorum-sensing process. If we have established that the process is the most efficient across a particular population-nutriant ratio, we will seek to construct a lower and upper threshold to provide a degree of redundancy for use in environments heterogeneous to the laboratory. This is analogous to the band-pass model described in the lecture.

Inputs

  • Auto-inducer A: produced only when population values exceed the lower threshold bias.
  • Auto-repressor B: produced only when the population values exceed the upper threshold bias (determined by training IANN on culture growing within chemostat under several different ratios; the goal is to coax the population into maintaining a stationary-like phase as the only conditions under which biomineralisation occurs.

Outputs

  • Low A low B: no biomineralisation.
  • Medium A low B: biomineralisation
  • High A High B: no biomineralization

Recognising when a particular population density has been exceeded could prove difficult, and is likely to manifest at different points dependent on energy source. Several separate ligands are likely to be responsible for regulating growth under different conditions; careful construction of the host genome would need to be required prior to developing IANN to limit ‘noise’.

Draw a diagram for an intracellular multilayer perceptron where layer 1 outputs an endoribonuclease that regulates a fluorescent protein output in layer 2.

'Perceptron
Multilayer perceptron: X1 input reduces endonuclease production and thus permits translation of Y1BFP in layer 2.
Fungal Materials

What are some examples of existing fungal materials and what are they used for? What are their advantages and disadvantages over traditional counterparts?

Applications of filamentous fungal materials are quite diverse due to the varied mechanical properties of both rigid and flexible denominations; construction elements and biodegradable packaging respectively are two excellent examples. Mycelium-based materials often surpass virtually any conventional material where true biodegradability is desirable, as they may be completely decomposed and reintegrated by organic lifeforms without requiring additional processing, and often without industrial composting. Properties such as heat resilience, insulation, and rigidity are highly customisable. Resultingly, the materials are also susceptible to premature decay, moisture, and contamination. Long-term durability is also limited in applications where exposure to friction or mechanical stresses is common.

What might you want to genetically engineer fungi to do and why? What are the advantages of doing synthetic biology in fungi as opposed to bacteria?

I would be interested to explore how the structural properties of fungal growth could be controlled internally to produce highly functionally-graded materials. It would also be quite fascinating to explore the communicative-element of mycelial networks; could a network be constructed which permitted messages to be sent and received in a couple hours across, say, the local park?

The diverse structural and metabolic features of fungi make them an excellent vehicle for synthetic biology in areas such as bioremediation and, of course, biomaterials. Genetically, they share a closer resemblance to humans than plants and prokaryotic bacteria, which is intriguing in its own right. They are comparatively more rugged and resilient to environmental stressors than bacteria, which may be useful in an experimental context.

References

‌Hinneburg, H., Gu, S. and Naseri, G. (2025). Fungal Innovations—Advancing Sustainable Materials, Genetics, and Applications for Industry. Journal of Fungi, [online] 11(10), p.721. doi:10.3390/jof11100721.

Moreno-Gámez, S., Hochberg, M.E. and van Doorn, G.S. (2023). Quorum sensing as a mechanism to harness the wisdom of the crowds. Nature Communications, [online] 14(1). doi:10.1038/s41467-023-37950-7.

Week 9 HW: Cell-Free Systems

General homework questions

Explain the main advantages of cell-free protein synthesis over traditional in vivo methods, specifically in terms of flexibility and control over experimental variables.

Comparative to traditional methods, cell free systems:

  • Permit the incorporation of non-canonical amino acids (NCAA’s) for extended protein functionality not observed in nature.
  • Permit the synthesis and collection of difficult to express proteins, such as trans-membrane proteins.
  • Allow the immediate use of linear fragments in solution. This offers high flexibility for projects involving prototyping at frequent intervals; no need for plasmid construction and subsequent cloning.
  • Permit greater control of matrix composition during biochemical processes. Proceed with crude extracts for high-throughput, inexpensive studies, or calibrate its composition (such as in PURE) to examine or promote specific processes.

Cell-free is uniquely superior for use in environments which do not support cell production techniques, such as in materials! i.e a cell-free system may be inactivated by lyophilization then subsequently reactivated upon rehydration.

Describe the main components of a cell-free expression system and explain the role of each component.

  • Engineered sequences for translation; typically plasmids, but also possible with linear fragments. The core information for what is to be expressed.
  • Cellular lysate containing required molecular machinery for Tx/Tl (ribosomes, RNA polymerase etc). Permits the expression of the engineered sequences.
  • Source of energy-rich compounds and nucleosides. Provide free energy for enzymes in the reaction to do work, as well as necessary substrates for building mRNA.
  • Buffering compounds: stabilise reactions and maintain consistent pH.

Why is energy provision regeneration critical in cell-free systems? Describe a method you could use to ensure continuous ATP supply in your cell-free experiment.

In living cells, energy regeneration is typically mediated by sequential chains of catabolic and anabolic reactions, by which the cell will sustain protein-synthesis activities; these pathways can be absent in cell-free systems. Phosphoenolpyruvate (PEP) is commonly used to regenerate ADP molecules into ATP, providing a ‘continuous’ supply with the caveat that its concentration must be ample for the entire duration of the intended experiment. Glucose-6-phosphate (G6P) is a promising, affordable alternative to PEP which still functions as a reducing agent.

Compare prokaryotic versus eukaryotic cell-free expression systems. Choose a protein to produce in each system and explain why.

Each approach presents its own distinct advantages and limitations, often dependent on the specific source for which the expression system is based. While prokaryotic permits high protein yields, simplicity in genetic engineering, and the ability to synthesise in extreme conditions (Archeal extremophile extracts), post-translational modifications are limited and only chaperones native to prokaryotic species are available. Eukaryotic extracts are diverse, permit fast lysate preparation, and mirror mammalian systems, yet often are also subject to high-cultivation costs and low protein yields.

Prokaryotic: Cas12a, modified via the inclusion of NCAA’s to extend its trans-cleavage properties. Useful when expressed in high yields as the main component of a signal-amplifying biosensor for.

Eukaryotic: any protein intended for use in the human body! Likely to produce proteins with higher-quality folding, as well as overall be more compatible with human biochemistry.

How would you design a cell-free experiment to optimize the expression of a membrane protein? Discuss the challenges and how you would address them in your setup.

Membrane proteins are often flexible and structurally unstable, and are typically expressed and purified from live-culture systems in which they are first correctly folded and inserted into the cellular membrane by a specific enzyme. To mirror this process and extend the stability of synthesized membrane proteins during storage, one such approach could be to encourage their integration with artificial vesicles. Whilst this technique may improve their stability, it could prove as a hindrance when further purification is required to ‘detach’ the membrane proteins and reintegrate them into the host organism.

Imagine you observe a low yield of your target protein in a cell-free system. Describe three possible reasons for this and suggest a troubleshooting strategy for each.

Insufficient energy source: recalibrate concentration. Incompatible lysate: identify suitable alternative. Inefficient codon usage: reoptimise codons for expression in specific lysate.

Homework question from Kate Adamala

Pick a function and describe it.

1. What would your synthetic cell do? What is the input and what is the output?

Produce and excrete magnetic inclusions (magnetosomes).

Input: Isopropyl β-D-1-thiogalactopyranoside (IPTG).
Output: Magnetite vesicles.

2. Could this function be realized by cell-free Tx/Tl alone, without encapsulation?

No, membrane encapsulation is required for magnetite vesicle formation.

3. Could this function be realized by genetically modified natural cell?

Yes! However, likely not at a scale and yield which is industrially significant.

4. Describe the desired outcome of your synthetic cell operation.

Minimal cells are cultivated inside a bioreactor, with capability to support the production of various valuable compounds in response to specific signalling molecules. When IPTG is added to the system, culture responds by producing and secreting magnetite vesicles.

Design all components that would need to be part of your synthetic cell.

1. What would the membrane be made of?

Unsaturated fatty acid bilayer to enhance fluidity and facilitate vesicle formation.

2. What would you encapsulate inside?

Proteins associated with the MamAB gene cluster would constitute the most significant inclusions, specifically MamE (magnetite and vesicle maturation), MamI + MamL (magnetosome membrane formation).

3. Which organism your Tx/Tl system will come from? Is bacterial OK, or do you need a mammalian system for some reason?

Prokaryotic is preferred; Tx/Tl system can primarily be derived from the magnetotactic model organism AMB-1.

4. How will your synthetic cell communicate with the environment?

IPTG is membrane permeable. Magnetite vesicles may be freely transported via budding and shedding.

Experimental details

1. List all lipids and genes.

mamAB gene cluster: magnetite vesicle formation.
LacI repressor: receptor for IPTG.
IPTG ligand: ligand for LacL deactivation.

2. How will you measure the function of your system?

Due to the conductive nature of magnetite, I would use electrochemical conductometry to monitor the protein yield in real time in response to the presence of IPTG.

Homework question from Peter Nguyen
Homework question from Ally Huang
References

Calhoun, K.A. and Swartz, J.R. (2005). Energizing cell‐free protein synthesis with glucose metabolism. Biotechnology and Bioengineering, [online] 90(5), pp.606–613. doi:10.1002/bit.20449.

Carpenter, E.P., Beis, K., Cameron, A.D. and Iwata, S. (2008). Overcoming the challenges of membrane protein crystallography. Current Opinion in Structural Biology, [online] 18(5), pp.581–586. doi:10.1016/j.sbi.2008.07.001.

Murat, D., Quinlan, A., Vali, H. and Komeili, A. (2010). Comprehensive genetic dissection of the magnetosome gene island reveals the step-wise assembly of a prokaryotic organelle. Proceedings of the National Academy of Sciences, [online] 107(12), pp.5593–5598. doi:10.1073/pnas.0914439107.

Paul, N.L., Carpa, R., Ionescu, R.E. and Popa, C.O. (2025). The Biomedical Limitations of Magnetic Nanoparticles and a Biocompatible Alternative in the Form of Magnetotactic Bacteria. Journal of Functional Biomaterials, [online] 16(7), p.231. doi:10.3390/jfb16070231.

Pui Yan Wong, Mal, J., Sandak, A., Luo, L., Jian, J. and Pradhan, N. (2024). Advances in microbial self-healing concrete: A critical review of mechanisms, developments, and future directions. The Science of The Total Environment, 947, pp.174553–174553. doi:10.1016/j.scitotenv.2024.174553.

Zemella, A., Thoring, L., Hoffmeister, C. and Kubick, S. (2015). Cell‐Free Protein Synthesis: Pros and Cons of Prokaryotic and Eukaryotic Systems. ChemBioChem, [online] 16(17), pp.2420–2431. doi:10.1002/cbic.201500340.

Week 10 HW: Advanced Imaging & Measurement Technology

Homework: Final Project

Please identify at least one (ideally many) aspect(s) of your project that you will measure.

Two quantities are prioritesed for measurement:

  • Total fluorescence emitted by cas12a transnuclease cleavage of probes after designated runtime (approx. 15-20 minutes). This can be achieved by the use of a fluorometer.
  • Affinity of SELEX-designed aptamers and exosporium target protein. This may be measured by determining the equilibrium dissociation constant KD, through a number of unique approaches.
Homework: Waters Part I — Molecular Weight

Based on the predicted amino acid sequence of eGFP (see below) and any known modifications, what is the calculated molecular weight?

eGFP sequence (with Methionine, His-purification tag + linker):

MVSKGEELFTG VVPILVELDG DVNGHKFSVS GEGEGDATYG KLTLKFICTT GKLPVPWPTL VTTLTYGVQC FSRYPDHMKQ HDFFKSAMPE GYVQERTIFF KDDGNYKTRA EVKFEGDTLV NRIELKGIDF KEDGNILGHK LEYNYNSHNV YIMADKQKNG IKVNFKIRHN IEDGSVQLAD HYQQNTPIGD GPVLLPDNHY LSTQSALSKD PNEKRDHMVL LEFVTAAGIT LGMDELYKLE HHHHHH

Theoretical Mw: 28006.60 Da via Compute pI/Mw

Calculate the molecular weight of the eGFP using the adjacent charge state approach described in the recitation.

ESI electrospray ionisation: single proton H+ corresponds to 1.0073 DA.

z = 30.9058 (e) Mw = 27898.90 Da Accuracy: 3845.52 Ppm

Accuracy is well in excess of a suitable deviation of 0-50 PPm (calculation error?)

Can you observe the charge state for the zoomed-in peak in the mass spectrum for the intact eGFP?

The resolution of the spectrometer is not sufficient to resolve individual isotopic peaks of proteins at this size, and thus the charge cannot be determined.

Homework: Waters Part II — Secondary/Tertiary structure

Based on learnings in the lab, please explain the difference between native and denatured protein conformations.

As a protein denatures and unfolds, its ‘exposed’ surface area increases and permits a greater degree of ionisation. This will be reflected by the distribution of several large peaks at a higher mass to charge ratio, as opposed to the concentrated peaks observed in a native state protein analyte.

Zooming into the native mass spectrum of eGFP from the Waters Xevo G3 QTof MS, can you discern the charge state of the peak at ~2800?

In this instance, the resolution is satisfactory to resolve the charge state.

Homework: Waters Part III — Peptide Mapping - primary structure

How many Lysines (K) and Arginines (R) are in eGFP?

MVSKGEELFTG VVPILVELDG DVNGHKFSVS GEGEGDATYG KLTLKFICTT GKLPVPWPTL VTTLTYGVQC FSRYPDHMKQ HDFFKSAMPE GYVQERTIFF KDDGNYKTRA EVKFEGDTLV NRIELKGIDF KEDGNILGHK LEYNYNSHNV YIMADKQKNG IKVN FKIRHN IEDGSVQLAD HYQQNTPIGD GPVLLPDNHY LSTQSAL SKD PNEKRDHMVL LEFVTAAGIT LGMDELYKLE HHHHHH

How many peptides will be generated from tryptic digestion of eGFP?

Mass (Da)PositionPeptide sequence
4472.1752170–210HNIEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSALSK
2566.2931217–239DHMVLLEFVTAAGITLGMDELYK
2437.26085–27GEELFTGVVPILVELDGDVNGHK
2378.257754–74LPVPWPTLVTTLTYGVQCFSR
1973.9062142–157LEYNYNSHNVYIMADK
1503.659728–42FSVSGEGEGDATYGK
1266.578387–97SAMPEGYVQER
1083.4979240–247LEHHHHHH
1050.5214115–123FEGDTLVNR
982.4952133–141EDGNILGHK
821.394081–86QHDFFK
790.355275–80YPDHMK
769.391347–53FICTTGK
711.2944103–108DDGNYK
655.381398–102TIFFK
602.2780211–215DPNEK
579.3137128–132GIDFK
507.2925164–167VNFK
502.3235124–127IELK

19 sequences; sequences < 500 Da appear to be excluded.

Based on the LC-MS data for the Peptide Map data generated in lab how many chromatographic peaks do you see in the eGFP peptide map between 0.5 and 6 minutes?

Approximately 21, however less if dependent on how merged peaks are handled.

Assuming all the peaks are peptides, does the number of peaks match the number of peptides predicted from question 2 above? Are there more peaks in the chromatogram or fewer?

No, however the in-silico digest deviates by only 2 fewer peaks. Considering the in-silico is likely to have omitted sequences < 500 dA, it seems plausible that this accounts for the additional peaks resolved by the Total ion chromatogram.

Identify the mass-to-charge of the peptide shown in Figure 5b.

From the enhanced image, we can determine an approximate spacing between peaks of m/z 0.5, correlating to z = 5.

(M+H)sup>+ = 2623.77 Da

Identify the peptide based on comparison to expected masses in the PeptideMass tool. What is mass accuracy of measurement?

This correlates most closely with DHMVLLEFVTAAGITLGMDE LYK at 2566.2931 dA, however this appears substantially off; necessary to review the charge calculation performed above.

A calculated value of 22396.85 PPM confirms a calculation error.

What is the percentage of the sequence that is confirmed by peptide mapping?

88 percent of the sequence was confirmed by peptide mapping.

Homework: Waters Part IV — Oligomers
Polypeptide Subunit NameIntensity
7FU Decamer3.4
8FU Decamer4.013
8FU Didecamer8.33
8FU 3-Decamer12.67
8FU 4-Decamer16.683* (not included from CDMS data)
Homework: Waters Part V — Did I make GFP?
TheoreticalObserved/measured on the Intact LC-MSPPM Mass Error
Molecular weight (kDa)28006.6027898.903845.52 ppm

Week 11 HW: Bioproduction & Cloud labs

Part A: The 1,536 Pixel Artwork Canvas | Collective Artwork

My efforts were unfortunately extremely limited, having only contributed 2 pixels total. Definitely regret not having been more engaged! Given the chance to contribute again, I would have loved to coordinate my efforts with members from my node (alike to what occurred across r/place) to build a mutually-crafted design! It was fascinating seeing it evolve over time; I often wondered to what extent others were collaborating.

The scale appeared somewhat limiting in terms of achieving higher-resolution designs. For future iterations where (hopefully) a greater cohort of students are enrolled, it would be fascinating to see what would be possible with 4x the space.

Part B: Cell-Free Protein Synthesis | Cell-Free Reagents
E. coli Lysate
BL21 (DE3) Star LysateIncludes T7 RNA Polymerase.
Salts/Buffer
Potassium GlutamateMaintains ionic strength for optimal reaction conditions.
HEPES-KOH pH 7.5Buffers the system to maintain stable pH.
Magnesium GlutamateSource of Mg^2+ ions which optimise Tx + Tl rates by supporting the function of translational machinery; essential cofactor.
Potassium phosphate monobasicBuffering agent to maintain pH stability. Precursor molecule for regeneration of ATP from ADP.
Potassium phosphate dibasicBuffering agent to maintain pH stability. Additional inorganic phosphate source for ATP regeneration.
Energy / Nucleotide System
RiboseEssential precursor molecule in structure and function of nucleic acids. A core component of ATP and thus a crucial component in its regeneration.
GlucoseSignificant energy source and substrate for ATP regeneration.
AMPNucleotide precursor for RNA synthesis. Precursor for ATP regeneration.
CMPATP regeneration intermediate; precursor of CTP which is required as energy source in the transcription process.
GMPCost-effective alternative to NTP’s. Converted to GTP via intermediary energy regeneration reactions, an essential energy source for initiation (and more!) during translation.
UMPPrecursor molecule for RNA biosynthesis; provides initial structure for phosphorylation which eventually becomes UTP, a substrate in transcription.
GuanineFundamental base across both DNA and RNA, as well as GTP precursor. Constituent element in energy regeneration.
Translation Mix (Amino Acids)
17 Amino Acid MixConstitutes key amino acids essential for the formation of a protein's primary structure (sequence) during polymerisation.
TyrosineAmino acid with functional roles when integrated as a subunit into primary structure (hydrophobic and polar). May serve as a site of post-postranslational modifications.
CysteineAmino acid with reductive qualities. Facilitates translational elongation.
Additives
NicotinamideEssential precursor molecule for NAD+ regeneration, an oxidising agent central to translational processes.
Backfill
Nuclease Free WaterHighly-purified water. Shields sensitive contents of mixture from degradation.
Part C: Planning the Global Experiment | Cell-Free Master Mix Design

Describe the main differences between the 1-hour optimized PEP-NTP master mix and the 20-hour NMP-Ribose-Glucose master mix.

The 20-hour appears to include 2 additional salts/buffers (Potassium phosphate in ratio 1.6:1 dibasic to monobasic), which appears to provide higher, sustained buffering capacity (longer reaction?). The energy/nucleotide system in the 1-hour is immediately accessible; NTP’s are provided pre-synthesised alongside ATP and PEP-mono, which serves as a key intermediate in the ATP regeneration process. 20-hour is more reliant on enzymatic pathways inherent to the lysate for energy regeneration and NTP formation. 1-hour includes heavy supplementation of additives to limit requirement of intermediate metabolic process; 2-hour supplies only Nicotinamide, a NAD precursor required for energy regeneration.

How can transcription occur if GMP is not included but Guanine is?

Guanine must be converted metabolically by cellular machinery in lysate to form GMP prior to its use in transcription. There are two feasible pathways for this lysate; phosphoribosylation directly to GMP (requires that PRPP enzyme is present in lysate) or the Purine nucleoside phosphorylase pathway, in which it is converted into Guanosine, then GMP.

sfGFPOxidative damage and/or mutations to the residue Arg96 may hinder the rate at which the sfGFP chromophore is developed, potentially altering duration until readout. The production of reactive oxidative species is possible in mixes more reliant on expansive metabolic processes.
mRFP1Notably long mutation time of 60 minutes. Described to possess low acid sensitivity.
mKO2Notably long maturation time of 105 minutes. Moderate acid sensitivity.
mTurquoise2Very low acid sensitivity. High photostability.
mScarlet_IGenerally robust protein with fast maturation and moderate acid sensitivity.
Electra2N/a.

Create a hypothesis for how adjusting one or more reagents in the cell-free mastermix could improve a specific biophysical or functional property you identified above, in order to maximize fluorescence over a 36-hour incubation. Clearly state the protein, the reagent(s), and the expected effect.

TBD: waiting on email with fluorescent protein and well allocation.

Part D: Build-A-Cloud-Lab
'Cloudlab orange!
Quick attempt! The RAC's have an almost skyline-like appearance when clustered and composed nonsensically.

Subsections of Labs

Week 1 Lab: Pipetting

cover image cover image

Subsections of Projects

Individual Final Project

cover image cover image

Group Final Project

cover image cover image