Petr Filipenko — HTGAA Spring 2026

About me
Contact info
Homework
- Week 1 HW: Principles and Practices
- Week 5 HW: Protein Design Part 2
- Week 6 HW: Genetic Circuits Part I: Assembly Technologies

Week 1 HW: Principles and Practices
First, describe a biological engineering application or tool you want to develop and why. This could be inspired by an idea for your HTGAA class project and/or something for which you are already doing in your research, or something you are just curious about. I am interested in creating biosensors by using modified bacterial cultures to detect toxins and contaminants in the environment. Currently, to test for substances such as heavy metals, biotoxins, and organic pollutants, we often need expensive and often inaccessible detection tools, such as mass spectrometers and analytical chemistry kits. By creating biosensors out of bacteria, we can simplify contaminant detection.
Week 5 HW: Protein Design Part 2
Part A: SOD1 Binder Peptide Design (From Pranam) Assignees for the following sections MIT/Harvard studentsRequired Committed ListenersRequired Superoxide dismutase 1 (SOD1) is a cytosolic antioxidant enzyme that converts superoxide radicals into hydrogen peroxide and oxygen. In its native state, it forms a stable homodimer and binds copper and zinc. Mutations in SOD1 cause familial Amyotrophic Lateral Sclerosis (ALS). Among them, the A4V mutation (Alanine → Valine at residue 4) leads to one of the most aggressive forms of the disease. The mutation subtly destabilizes the N-terminus, perturbs folding energetics, and promotes toxic aggregation.
Week 6 HW: Genetic Circuits Part I: Assembly Technologies
Assignment: DNA Assembly Assignees for the following sections MIT/Harvard studentsRequired Committed ListenersRequired What are some components in the Phusion High-Fidelity PCR Master Mix and what is their purpose? as per 1 High fidelity DNA polymerase (50 times more accurate than “standard” Thermus aquaticus DNApoly) Phusion HF buffer Mg2+ as 1.5mM MgCl2 for DNApoly to function dNTPs What are some factors that determine primer annealing temperature during PCR?
I am interested in creating biosensors by using modified bacterial cultures to detect toxins and contaminants in the environment. Currently, to test for substances such as heavy metals, biotoxins, and organic pollutants, we often need expensive and often inaccessible detection tools, such as mass spectrometers and analytical chemistry kits. By creating biosensors out of bacteria, we can simplify contaminant detection.
Biological containment: built-in safety mechanisms such as kill switches, dependence on essential nutrients to control spread, and self-limiting lifespans to prevent uncontrolled survival or evolution outside intended use.
Environmental risk assessment: Require pre-deployment ecological impact studies assessing horizontal gene transfer, ecosystem disruption, and long-term persistence.
Dual-use risk mitigation: Establish oversight to prevent misuse of synthetic organisms for harmful surveillance, bioweaponization, or covert environmental manipulation.
Next, describe at least three different potential governance “actions” by considering the four aspects below (Purpose, Design, Assumptions, Risks of Failure & “Success”). Try to outline a mix of actions (e.g. a new requirement/rule, incentive, or technical strategy) pursued by different “actors” (e.g. academic researchers, companies, federal regulators, law enforcement, etc). Draw upon your existing knowledge and a little additional digging, and feel free to use analogies to other domains (e.g. 3D printing, drones, financial systems, etc.).
Purpose: What is done now and what changes are you proposing? There is a need for simple detection of various contaminants in our environment. The current state requires a lot of effort and equipment to do this job. Developing biosensing bacteria can simplify contaminant detection and expand access, especially in situations where current methods are impractical or inaccessible.
Design: What is needed to make it “work”? (including the actor(s) involved - who must opt-in, fund, approve, or implement, etc) Various industries are in constant need of rapid detection of contaminants and toxins. Pharma and the food industry are always tightly controlled for contaminants, but detection is often limited and can lead to serious problems. This method would involve funding and approval from industry and government bodies, such as the FDA in US.
Assumptions: What could you have wrong (incorrect assumptions, uncertainties)?
Risks of Failure & “Success”: How might this fail, including any unintended consequences of the “success” of your proposed actions? Require continuous monitoring, reporting, and mitigation mechanisms if harms emerge from synthetic organisms. The main risks are that engineered bacteria may reproduce uncontrollably, or evolve outside their intended environment, its genetic material could be transferred to natural organisms, alteringthe environment. As this technology becomes widely available, its oversight/monitoring becomes harder to enforce. It is difficult to predict long-term effects of any engineered organism.
Next, score (from 1-3 with, 1 as the best, or n/a) each of your governance actions against your rubric of policy goals. The following is one framework but feel free to make your own:
| Does the option: | Option 1 | Option 2 | Option 3 |
|---|---|---|---|
| Enhance Biosecurity | |||
| • By preventing incidents | 1 | 2 | 1 |
| • By helping respond | 1 | 2 | 1 |
| Foster Lab Safety | |||
| • By preventing incident | 2 | 3 | n/a |
| • By helping respond | 2 | 3 | n/a |
| Protect the environment | |||
| • By preventing incidents | 2 | 1 | 1 |
| • By helping respond | 2 | 1 | 1 |
| Other considerations | |||
| • Minimizing costs and burdens to stakeholders | 1 | 1 | 1 |
| • Feasibility? | 1 | 1 | n/a |
| • Not impede research | 1 | 3 | 2 |
| • Promote constructive applications | 1 | 1 | n/a |
I would prioritize a risk-based hybrid governance approach that integrates binding regulation, mandatory technical standards, and participatory oversight. No single governance option sufficiently addresses the biological, social, and ethical risks associated with bacterial biosensors. Binding regulation is necessary to prevent harm by mandating authorization, biosafety review, and post-deployment monitoring. Technical standards and ethics-by-design requirements mitigate risk at the design stage by embedding safety and containment mechanisms. Participatory and deliberative oversight addresses concerns related to consent, equity, and public trust, which technical regulation alone cannot resolve. The primary trade-offs include slower innovation, increased costs due to stricter oversight, reduced regulatory flexibility as technologies evolve, and greater time demands from inclusive governance processes. However, these trade-offs are justified by the potential for irreversible ecological or social harm.
There are DNA polymerases with different fidelity and mechanisms for proofreading and repair, which differ between prokaryotes and eukaryotes. For humans, it is estimated at 1:10000-100000. The human genome is 3.1Gb, meaning that without an additional proofreading or error-correction mechanism, we would accumulate many DNA errors. To help with such a problem, 3´•5´ exonucleases (Pols δ and ε) are used to remove the mismatch to allow correct DNA synthesis to proceed, increasing fidelity by 100-1000x, making the resulting new copy DNA even at a 3Gb virtually errorless. Proofreading also occurs during mRNA translation, preventing the incorporation of incorrect amino acids. Different DNA repair mechanisms are used to correct DNA (like MutS repair system); they are highly conserved, and any loss of function can lead to serious problems, including cancer.
Wang F, He Q, O’Donnell ME, Li H. The proofreading mechanism of the human leading-strand DNA polymerase ε holoenzyme. Proc Natl Acad Sci U S A. 2025 Jun 3;122(22)
Average human protein is about 300 AAs so if we have 61 codons and 20 AAs and assume that those AA equally spread, leading to an average of 3 codons per aminoacid we can estimate that there are astronomical 3^300 ways to encode a random protein in DNA. In practice, not all codons are used with the same frequency; there are limiting factors, such as the amount of tRNA available to “service” all 61 codons. Secondly, if DNA has high CG content, it would not be easily accessible for translation other factors like stability of resulting mRNA and posibility of results with secondary structures will limit coding options, For synthetic DNA an important step of codon optimization used to address these issues and improve an expresion.
Solid-phase phosphoramidite chemistry is most common method currently however it has limitation on size of product and problems with speed and reagent and product stability
Due to depurination in which the β-N-glycosidic bond is hydrolytically cleaved releasing a nucleic base (A/G) in acidic environment required for synthesis leads to incomplete product and errors in resulting DNA.
Why can’t you make a 2000bp gene via direct oligo synthesis?
Because of these limits, larger DNA is assembled from high-quality oligos (typically 60–200nt) using methods like Gibson Assembly or PCA
Yin Y, Arneson R, Yuan Y, Fang S. Long oligos: direct chemical synthesis of genes with up to 1728 nucleotides. Chem Sci. 2024 Dec 18;16(4):1966-1973. doi: 10.1039/d4sc06958g. PMID: 39759933; PMCID: PMC11694485.
Choose ONE of the following three questions to answer; and please cite AI prompts or paper citations used, if any.
There are nine essential amino acids (histidine, isoleucine, leucine, lysine, methionine, phenylalanine, threonine, tryptophan, valine) Arginine is often added to this list, but can be synthesised in animals from glutamate. Lysine contingency refers to an organism that cannot synthesise lysine and is therefore dependent on lysine-containing media. This idea was used in the Jurassic Park Hollywood movie, but it does not make sense as a contingency measure due to the availability of lysine in a food chain (lysine produced in plants->herbivores->…->T-Rex). This idea might work if organism is dependent on a non-natural amino acid, so it can survive only if provided such AA
Ostrov N, Landon M, Guell M, Kuznetsov G, Teramoto J, Cervantes N, Zhou M, Singh K, Napolitano MG, Moosburner M, Shrock E, Pruitt BW, Conway N, Goodman DB, Gardner CL, Tyree G, Gonzales A, Wanner BL, Norville JE, Lajoie MJ, and Church GM. Design, synthesis, and testing toward a 57-codon genome. Science 2016; 353:819–22
Part A: SOD1 Binder Peptide Design (From Pranam)
| MIT/Harvard students | Required |
| Committed Listeners | Required |
Superoxide dismutase 1 (SOD1) is a cytosolic antioxidant enzyme that converts superoxide radicals into hydrogen peroxide and oxygen. In its native state, it forms a stable homodimer and binds copper and zinc.
Mutations in SOD1 cause familial Amyotrophic Lateral Sclerosis (ALS). Among them, the A4V mutation (Alanine → Valine at residue 4) leads to one of the most aggressive forms of the disease. The mutation subtly destabilizes the N-terminus, perturbs folding energetics, and promotes toxic aggregation.
Your challenge:
You will use three models developed in our lab:
mutant: MATKVVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFHVHEFGDNTAGCTSAGPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTLVVHEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQ
AlphaFold’s 𝑖𝑝𝑇𝑀 (Interface Predicted Template Modelling) metric is used to assess the accuracy of structural predictions of protein-protein interactions (PPIs) and to estimate the probability that two proteins interact. Values >0.8 are considered confident, while values <0.6 suggest a potential but not guaranteed failure. While pTM (Predicted Template Modelling) measures the global accuracy of the entire complex model, with values >0.5 indicating a reasonable prediction.
Structural confidence alone is insufficient for therapeutic development. Using PeptiVerse, let’s evaluate the therapeutic properties of your peptide! For each PepMLM-generated peptide:
| Peptide | Predicted binding affinity | Solubility | Hemolysis probability | Net charge (pH 7) | Molecular weight |
|---|---|---|---|---|---|
| WHSGAVAAELKK | 5.493 (weak) | 1.0 (soluble) | 0.023 (non-hemolytic) | 0.85 | 1296.5 |
| WLSYAVAAELWE | 6.599 (weak) | 1.0 (soluble) | 0.171 (non-hemolytic) | -2.23 | 1437.6 |
| WLSGPAALEHKK | 4.864 (weak) | 1.0 (soluble) | 0.016 (non-hemolytic) | 0.85 | 1336.5 |
| FLYRWLPSRRGG | 5.968 (weak) | 1.0 (soluble) | 0.047 (non-hemolytic) | 2.76 | 1507.7 |
Scores ≥ 9 correspond to tight binders (K ≤ 10⁻⁹ M, nanomolar to picomolar range) Scores between 7 and 9 correspond to medium binders (10⁻⁷–10⁻⁹ M, nanomolar to micromolar range) Scores < 7 correspond to weak binders (K ≥ 10⁻⁶ M, micromolar and weaker) A difference of 1 unit in score corresponds to an approximately tenfold change in binding affinity.
Compare these predictions to what you observed structurally with AlphaFold3. In a short paragraph, describe what you see. Do peptides with higher ipTM also show stronger predicted affinity? Are any strong binders predicted to be hemolytic or poorly soluble? Which peptide best balances predicted binding and therapeutic properties? Choose one peptide you would advance and justify your decision briefly.
All are weak binders as per PeptiVerse and Alphafold. Peptide with least ipTM score (WLSYAVAAELWE) in AF3 reported as most promissing binder in Peptiverse. All peptides are soluble and non-hemolytic. However high scoring peptide carry a charge which mekes it difficult to cross membranes.
Now, move from sampling to controlled design. moPPIt uses Multi-Objective Guided Discrete Flow Matching (MOG-DFM) to steer peptide generation toward specific residues and optimize binding and therapeutic properties simultaneously. Unlike PepMLM, which samples plausible binders conditioned on just the target sequence, moPPIt lets you choose where you want to bind and optimize multiple objectives at once.
Assignment: DNA Assembly
| MIT/Harvard students | Required |
| Committed Listeners | Required |
What are some components in the Phusion High-Fidelity PCR Master Mix and what is their purpose?
as per 1
What are some factors that determine primer annealing temperature during PCR?
There are two methods from this class that create linear fragments of DNA: PCR, and restriction enzyme digests. Compare and contrast these two methods, both in terms of protocol as well as when one may be preferable to use over the other.
How can you ensure that the DNA sequences that you have digested and PCR-ed will be appropriate for Gibson cloning?
How does the plasmid DNA enter the E. coli cells during transformation?
Describe another assembly method in detail (such as Golden Gate Assembly)
Explain the other method in 5 - 7 sentences plus diagrams (either handmade or online).
Model this assembly method with Benchling or Asimov Kernel!