Governance 1. Bioengineering Tool Phage satellites are a diverse class of mobile genetic elements that parasitize a phage. Extracellular Prophage-Inducing Particles (EPIPs), are a novel class of phage satellite discovered by the Saha Lab that induces the extremely stable prophage, HerbertWM, in Mycolicibacterium aichiense. Due to their novelty, much of their mechanisms of action are unknown, but it is hypothesized that they contain antirepressors or partial antirepressors due to annotated genes that bear resemblance to BRO domains which have been noted to have some influence over transcription (Zemskov et al., 2000).
Because of the observed ability of EPIPs to induce a stable prophage, developing a lytic-lysogenic switch from their putative antirepressors would be valuable for many applications including medical and environmental applications. This switch could ensure that a temperate phage lyses bacterial pathogens in phage therapy or control the transcription of engineered constructs in the soil.
DNA Design Challenge Protein chosen: tape measure protein of Alyssa1, a Mycolicibacterium phage satellite called an “Extracellular Prophage-Inducing Particle” (EPIP). Tape measure proteins are rare in phage satellites, and TEM imaging has shown that EPIPs have uniquely long tails compared to the helper phage, the phage they parasitize, suggesting the tape measure protein may contribute to a unique mechanism of parasitism.
Paper Discussion Paper: AssemblyTron: flexible automation of DNA assembly with Opentrons OT-2 lab robots
This paper reports using the Opentrons robot to conduct PCR, Golden Gate assembly, and Gibson assembly. It appears that the robot was able to determine specific parameters for experiments such as annealing temperature. The paper did not go into much detail about how this occurred, but after looking at some of the supplementary materials, the script likely parses .csv files that include primer and fragement sequences.
Subsections of Homework
Week 1 HW: Principles and Practices
Governance
1. Bioengineering Tool
Phage satellites are a diverse class of mobile genetic elements that parasitize a phage. Extracellular Prophage-Inducing Particles (EPIPs), are a novel class of phage satellite discovered by the Saha Lab that induces the extremely stable prophage, HerbertWM, in Mycolicibacterium aichiense. Due to their novelty, much of their mechanisms of action are unknown, but it is hypothesized that they contain antirepressors or partial antirepressors due to annotated genes that bear resemblance to BRO domains which have been noted to have some influence over transcription (Zemskov et al., 2000). Because of the observed ability of EPIPs to induce a stable prophage, developing a lytic-lysogenic switch from their putative antirepressors would be valuable for many applications including medical and environmental applications. This switch could ensure that a temperate phage lyses bacterial pathogens in phage therapy or control the transcription of engineered constructs in the soil.
2. Governance/Policy Goals
Enhancebiosecurity: making sure this tool won’t activate genes or viruses that could harm people
Preventing incidents
Helping respond
Foster lab safety: while M. aichiense is BSL-1, safety precautions should still be used
Preventing incidents
Helping respond
Protect the environment: monitor how using this tool in the environment effects ecology
Preventing incidents
Helping respond
Other considerations
Minimizing costs/burdens to stakeholders
Feasibility
Not impede research
Promote constructive applications
3. Actions
Option 1: Make grant funding easier to obtain if the researcher is working toward using this tool in a beneficial application
Purpose: encourage beneficence
Design: funding sources such as the NIH or NSF must agree to this. These sources also must have enough money allocated by the government to give out, so the government must also agree to this.
Assumptions: researchers want money; it is already required of researchers to discuss potential applications of their work when applying for funding
Risks of failure and “success”: this could accidentally discourage basic science research—research for the sake of knowledge or a better understanding of the world that could benefit more people in the long run.
Option 2: Ensure individuals are well-trained to prevent contamination by more harmful microbes
Purpose: policies regarding BSL-1 waste disposal are already in place; however, in the past, M. aichiense cultures have become contaminated with bacteria that could have been a higher BSL level (but thankfully were not)
Design: more senior researchers should give newer researchers more hands-on focused training
Assumptions: senior researchers have the time to give newer researchers this training
Risks of failure and “success”: this could detract from the time spent researching, making the lab less productive
Option 3: Researchers must study long-term effects of using this tool in the environment
Purpose: ensure the application the tool is used for does not harm the environment
Design: researchers must have the means to track environmental changes over time. Some regulatory agency must enforce this standard.
Assumptions: researchers will do this to some extent anyway to see how durable/effective their work is in the environment
Risks of failure and “success”: researchers may not have the resources or time to allocate to study this. They may feel that their resources are better spent elsewhere
4. Scoring
Does the option:
Option 1
Option 2
Option 3
Enhance Biosecurity
2
1
1
• By preventing incidents
1
1
2
• By helping respond
3
2
1
Foster Lab Safety
n/a
1
n/a
• By preventing incidents
n/a
1
n/a
• By helping respond
n/a
1
n/a
Protect the environment
2
n/a
1
• By preventing incidents
2
n/a
2
• By helping respond
3
n/a
1
Other considerations
• Minimizing costs and burdens to stakeholders
2
2
3
• Feasibility?
2
1
3
• Not impede research
1
2
2
• Promote constructive applications
1
3
2
5. Prioritization
Options 1 and 2 should be prioritized because they are more easily enforced and are based on pre-existing regulations and practices, making them fairly feasible. Option 3 is difficult to enforce and offers researchers no incentive to comply, whereas option 1 provides incentive and works through positive reinforcement. Additionally, both options 1 and 2 focus on preventing incidents by offering incentives toward beneficence (option 1) or providing better safety training (option 2). Meanwhile, option 3 prioritizes reacting to possible incidents by tracking environmental changes, noticing damage to the environment, and responding. Lastly, option 3 most obviously hinders research by dictating what resaerchers must investigate, while option 1 does not hinder research and option 2 will help research in the long run, even if it initially takes time away from research to teach good practices.
Pre-Lecture Work
Professor Jacobson
DNA polymerase’s error rate is 1:106. The human genome is ~3 billion bp, so on average, there would be 3000 mistakes made per round of DNA replication. Biology deals with this by destroying misfolded proteins and/or killing cells with too many mutations.
Due to third-base wobble, there are many ways to code for a protein of interest. In real life, many of these don’t work because certain organisms have a bias toward particular tRNAs, so mRNA codons must match the tRNA anticodons.
Dr. LeProust
The most common method of oligo synthesis uses phosphoramidite.
Chemical errors accumulate, so making fragments longer than 200 bp is difficult with direct synthesis.
Errors have accumulated so much that there are basically no fragments with the correct sequence.
George Church
10 essential amino acids: histidine, isoleucine, leucine, lysine, methionine, phenylalanine, threonine, tryptophan, valine, and arginine. The “lysine contingnecy” suggests that humans (and other animals) must eat plants because they are our only source of lysine, but they are also the only source of all of these amino acids, not just lysine.
I would use slide 4, the NA to AA conversion to know what nucleotides to use that would result in amino acids that would chemically interact.
BoSS: It would be great to find a way to store medications/treatments at room temperature; however, this could lead to unintended consequences such as vulnerability to contamination. As someone who works in a lab with bacteria, although the bacteria I work with could survive at room temperature, we store them in a fridge to help prevent contamination. I would also be concerned about using protective molecules from organisms that have adapted to survive at extreme temperatures to store medications/treatments, as these protective molecules could harm humans if they contaminate the treatment.
HTGAA Website
I added information about me, my email, and a cover image to the home page of the website. I have also learned the basics of markdown (including how to embed links and add images) during that process and have added this homework assignment to the website as well.
Week 2 HW: DNA Read, Write, and Edit
DNA Design Challenge
Protein chosen: tape measure protein of Alyssa1, a Mycolicibacterium phage satellite called an “Extracellular Prophage-Inducing Particle” (EPIP). Tape measure proteins are rare in phage satellites, and TEM imaging has shown that EPIPs have uniquely long tails compared to the helper phage, the phage they parasitize, suggesting the tape measure protein may contribute to a unique mechanism of parasitism.
This sequence has been optimized for expression in E. coli. In this instance, it would be helpful to do this as these phage satellites were discovered in Mycolicibacterium aichiense, a non-model organism, so expressing this protein in E. coli would make it easier to work with. In general, nucleotide sequences are optimized so that the protein can be more efficiently expressed in a given organism, so that working with it is easier (such as by removing restriction enzyme cut sites), or so that synthesis is easier (such as by lowering the GC content).
Choosing a Vector in Twist: I chose pTwist Amp High Copy because the Saha Lab works with ampicillin and if I want to express this protein in E. coli, I assume I’d want a lot of protein and for every E. coli descendent to have the plasmid, so a high copy plasmid would work best.
What DNA would you want to sequence (e.g., read) and why? I would like to sequence my gut microbiome. I’ve read about how the microbiome, specifically the gut microbiome can influence a person’s behaviors and send signals to the brain. This raises some philosophical questions of whether humans are truly governed by their own thoughts or if their thoughts are being subtly influenced by bacteria in their gut. I think it would be interesting to see what species dominate my microbiome.
In lecture, a variety of sequencing technologies were mentioned. What technology or technologies would you use to perform sequencing on your DNA and why? Nanopore sequencing it can generate long reads, so it is easier to identify unique features of each species. This makes it easier to assemble genomes.
Write
What DNA would you want to synthesize (e.g., write) and why? I have always wondered if it would one day be possible to create an organism with “no evolutionary history.” Many of the synthetic biology products/parts now are from preexisting organisms, but is it possible to create an entirely new sequence of DNA that produces proteins with no orthologs? Theoretically, this may be possible, similar to the “monkey at a type-writer” idea, where by chance a random sequence of nucleotides could not only produce a novel protein, but a novel organism. I would like to do this if only to see if it is possible.
What technology or technologies would you use to perform this DNA synthesis and why? To design this sequence, I would need some kind of machine learning model because biology is extremely complex and cannot be modeled with traditional methods. To synthesize this DNA, I could use the most common method of DNA synthesis, phosphoramidite chemistry, to synthesize pieces of this DNA, then use an assembly technique like Gibson assembly or Golden Gate assembly to construct the genome. These methods are most accessible and are commonly used.
Essential steps: deprotection, add next nucleotide, cap to stop strands that didn’t get the new nucleotide added from growing, and oxidation. Limitations: this method can’t make fragments longer than ~200 bp due to errors that accumulate.
Edit
What DNA would you want to edit and why?
I would like to edit my dog’s DNA to make him age more slowly. My dog makes me very happy but I don’t get to see him much because I’m at college and he lives with my parents. I would love it if he could stick around long enough that I could take care of him once I am out of school.
What technology or technologies would you use to perform these DNA edits and why? I would use CRISPR to knock in or out genes that I’d need to change. Editing an organism to change its life span would require many edits to the genome, and those edits would likely have to be precise. Because of that, CRISPR would be the best option.
Essential steps: guide RNA recognizes DNA sequence, and the Cas 9 enzyme makes a double-stranded break in the DNA. Preparation and input: prepare the guide RNA, so I’d need to know the sequence. Inputs include cells, some sort of vector to get the CRISPR-Cas9 system into the cells, and the CRISPR-Cas9 system (enzyme and guide RNA) Limitations: cuts in off target places, requirement of a PAM sequence, delivery mechanism (finding a vector)
This paper reports using the Opentrons robot to conduct PCR, Golden Gate assembly, and Gibson assembly. It appears that the robot was able to determine specific parameters for experiments such as annealing temperature. The paper did not go into much detail about how this occurred, but after looking at some of the supplementary materials, the script likely parses .csv files that include primer and fragement sequences.
Automation in Final Project
Since my final project will likely involve assembly techniques as described above, I could use automation by writing a Python script that automatically calculates experiment parameters such as temperatures and concentrations based on information that I put in such as DNA sequences and concentrations. This part of experiment planning is tedious and often involves multiple tools/pages (such as NEB’s Tm calculator, and various protocols for each method of assembly), so having one script that can do this automatically would be helpful. Additionally, it would be interesting to see how the Opentrons robot can run this script, though I predict that that would require some amount of troubleshooting that I may not have time for.