Homework

Weekly homework submissions:

  • Week 1 HW: Principles and Practices

    Governance 1. Bioengineering Tool Phage satellites are a diverse class of mobile genetic elements that parasitize a phage. Extracellular Prophage-Inducing Particles (EPIPs), are a novel class of phage satellite discovered by the Saha Lab that induces the extremely stable prophage, HerbertWM, in Mycolicibacterium aichiense. Due to their novelty, much of their mechanisms of action are unknown, but it is hypothesized that they contain antirepressors or partial antirepressors due to annotated genes that bear resemblance to BRO domains which have been noted to have some influence over transcription (Zemskov et al., 2000). Because of the observed ability of EPIPs to induce a stable prophage, developing a lytic-lysogenic switch from their putative antirepressors would be valuable for many applications including medical and environmental applications. This switch could ensure that a temperate phage lyses bacterial pathogens in phage therapy or control the transcription of engineered constructs in the soil.

  • Week 2 HW: DNA Read, Write, and Edit

    DNA Design Challenge Protein chosen: tape measure protein of Alyssa1, a Mycolicibacterium phage satellite called an “Extracellular Prophage-Inducing Particle” (EPIP). Tape measure proteins are rare in phage satellites, and TEM imaging has shown that EPIPs have uniquely long tails compared to the helper phage, the phage they parasitize, suggesting the tape measure protein may contribute to a unique mechanism of parasitism.

  • Week 3 HW: Lab Automation

    Paper Discussion Paper: AssemblyTron: flexible automation of DNA assembly with Opentrons OT-2 lab robots This paper reports using the Opentrons robot to conduct PCR, Golden Gate assembly, and Gibson assembly. It appears that the robot was able to determine specific parameters for experiments such as annealing temperature. The paper did not go into much detail about how this occurred, but after looking at some of the supplementary materials, the script likely parses .csv files that include primer and fragement sequences.

Subsections of Homework

Week 1 HW: Principles and Practices

cover image cover image

Governance

1. Bioengineering Tool

Phage satellites are a diverse class of mobile genetic elements that parasitize a phage. Extracellular Prophage-Inducing Particles (EPIPs), are a novel class of phage satellite discovered by the Saha Lab that induces the extremely stable prophage, HerbertWM, in Mycolicibacterium aichiense. Due to their novelty, much of their mechanisms of action are unknown, but it is hypothesized that they contain antirepressors or partial antirepressors due to annotated genes that bear resemblance to BRO domains which have been noted to have some influence over transcription (Zemskov et al., 2000).
Because of the observed ability of EPIPs to induce a stable prophage, developing a lytic-lysogenic switch from their putative antirepressors would be valuable for many applications including medical and environmental applications. This switch could ensure that a temperate phage lyses bacterial pathogens in phage therapy or control the transcription of engineered constructs in the soil.

2. Governance/Policy Goals

  • Enhancebiosecurity: making sure this tool won’t activate genes or viruses that could harm people
    • Preventing incidents
    • Helping respond
  • Foster lab safety: while M. aichiense is BSL-1, safety precautions should still be used
    • Preventing incidents
    • Helping respond
  • Protect the environment: monitor how using this tool in the environment effects ecology
    • Preventing incidents
    • Helping respond
  • Other considerations
    • Minimizing costs/burdens to stakeholders
    • Feasibility
    • Not impede research
    • Promote constructive applications

3. Actions

Option 1: Make grant funding easier to obtain if the researcher is working toward using this tool in a beneficial application

  • Purpose: encourage beneficence
  • Design: funding sources such as the NIH or NSF must agree to this. These sources also must have enough money allocated by the government to give out, so the government must also agree to this.
  • Assumptions: researchers want money; it is already required of researchers to discuss potential applications of their work when applying for funding
  • Risks of failure and “success”: this could accidentally discourage basic science research—research for the sake of knowledge or a better understanding of the world that could benefit more people in the long run.

Option 2: Ensure individuals are well-trained to prevent contamination by more harmful microbes

  • Purpose: policies regarding BSL-1 waste disposal are already in place; however, in the past, M. aichiense cultures have become contaminated with bacteria that could have been a higher BSL level (but thankfully were not)
  • Design: more senior researchers should give newer researchers more hands-on focused training
  • Assumptions: senior researchers have the time to give newer researchers this training
  • Risks of failure and “success”: this could detract from the time spent researching, making the lab less productive

Option 3: Researchers must study long-term effects of using this tool in the environment

  • Purpose: ensure the application the tool is used for does not harm the environment
  • Design: researchers must have the means to track environmental changes over time. Some regulatory agency must enforce this standard.
  • Assumptions: researchers will do this to some extent anyway to see how durable/effective their work is in the environment
  • Risks of failure and “success”: researchers may not have the resources or time to allocate to study this. They may feel that their resources are better spent elsewhere

4. Scoring

Does the option:Option 1Option 2Option 3
Enhance Biosecurity211
• By preventing incidents112
• By helping respond321
Foster Lab Safetyn/a1n/a
• By preventing incidentsn/a1n/a
• By helping respondn/a1n/a
Protect the environment2n/a1
• By preventing incidents2n/a2
• By helping respond3n/a1
Other considerations
• Minimizing costs and burdens to stakeholders223
• Feasibility?213
• Not impede research122
• Promote constructive applications132

5. Prioritization

Options 1 and 2 should be prioritized because they are more easily enforced and are based on pre-existing regulations and practices, making them fairly feasible. Option 3 is difficult to enforce and offers researchers no incentive to comply, whereas option 1 provides incentive and works through positive reinforcement. Additionally, both options 1 and 2 focus on preventing incidents by offering incentives toward beneficence (option 1) or providing better safety training (option 2). Meanwhile, option 3 prioritizes reacting to possible incidents by tracking environmental changes, noticing damage to the environment, and responding. Lastly, option 3 most obviously hinders research by dictating what resaerchers must investigate, while option 1 does not hinder research and option 2 will help research in the long run, even if it initially takes time away from research to teach good practices.

Pre-Lecture Work

Professor Jacobson

  1. DNA polymerase’s error rate is 1:106. The human genome is ~3 billion bp, so on average, there would be 3000 mistakes made per round of DNA replication. Biology deals with this by destroying misfolded proteins and/or killing cells with too many mutations.
  2. Due to third-base wobble, there are many ways to code for a protein of interest. In real life, many of these don’t work because certain organisms have a bias toward particular tRNAs, so mRNA codons must match the tRNA anticodons.

Dr. LeProust

  1. The most common method of oligo synthesis uses phosphoramidite.
  2. Chemical errors accumulate, so making fragments longer than 200 bp is difficult with direct synthesis.
  3. Errors have accumulated so much that there are basically no fragments with the correct sequence.

George Church

  1. 10 essential amino acids: histidine, isoleucine, leucine, lysine, methionine, phenylalanine, threonine, tryptophan, valine, and arginine. The “lysine contingnecy” suggests that humans (and other animals) must eat plants because they are our only source of lysine, but they are also the only source of all of these amino acids, not just lysine.
  2. I would use slide 4, the NA to AA conversion to know what nucleotides to use that would result in amino acids that would chemically interact.
  3. BoSS: It would be great to find a way to store medications/treatments at room temperature; however, this could lead to unintended consequences such as vulnerability to contamination. As someone who works in a lab with bacteria, although the bacteria I work with could survive at room temperature, we store them in a fridge to help prevent contamination. I would also be concerned about using protective molecules from organisms that have adapted to survive at extreme temperatures to store medications/treatments, as these protective molecules could harm humans if they contaminate the treatment.

HTGAA Website

I added information about me, my email, and a cover image to the home page of the website. I have also learned the basics of markdown (including how to embed links and add images) during that process and have added this homework assignment to the website as well.

Week 2 HW: DNA Read, Write, and Edit

DNA Design Challenge

Protein chosen: tape measure protein of Alyssa1, a Mycolicibacterium phage satellite called an “Extracellular Prophage-Inducing Particle” (EPIP). Tape measure proteins are rare in phage satellites, and TEM imaging has shown that EPIPs have uniquely long tails compared to the helper phage, the phage they parasitize, suggesting the tape measure protein may contribute to a unique mechanism of parasitism.

TMP_info TMP_info

Amino acid sequence: MARKAGMSTGVEVARISVKVGPDTKHFRRELKKDLDRIEESMRAKIDVEPDMKGFRQEVQSKTKGMRTSVKVDADVDRKGFLGRIADSLSQIQPPSFGSGINPTGYAAIAAGIAALTPLIAGTLGAATTALMALPGLVAAVATPIAALTLGIDGLKAAATRLQGPFEDLKATMSSAVESQFGPVFDQLRSLFPVLKGALPSVTAGLADMAKSIADVIASPEGLAKIDTTIRNIGMALTTAAPGVGKFTDGLMGLVQSFTGKPLQGVAEWFSKTGDSFSAWVEKMTRPSWFTGKSPLEVAFGNLGDTLKTIADTLGDVGQKALDFFSDPEKIKSFRDELTLVSDAVKGIATGINGIATAYSSLPLSGEGLKGLMPIQAQLGVKMFDGLKESAAKAFAEVATMAGGFVSNIGSTFMSIGDTLSGIWGGITNAAATAFNSLVSAAQSAVSGVVSAVATIPGQVASALAGLAEAGASAGRNLVQGLVNGISGMIGSAVAKARELASSVASAVTGFLGIHSPSKLFTEIGEYVGQGFDNGLQSQLATLGQTAKAMAETVTEEFNGGLKFGADGFSTDSDNPLMQAGAGLANAPVDFAKATGKQFLSDLGISGNGVLSRALTEGIQYIFQISSVDEALSIKDRETSKNALSIVGR

Reverse translated nucleotide sequence:
atggcgcgcaaagcgggcatgagcaccggcgtggaagtggcgcgcattagcgtgaaagtgggcccggataccaaacattttcgccgcgaactgaaaaaagatctggatcgcattgaagaaagcatgcgcgcgaaaattgatgtggaaccggatatgaaaggctttcgccaggaagtgcagagcaaaaccaaaggcatgcgcaccagcgtgaaagtggatgcggatgtggatcgcaaaggctttctgggccgcattgcggatagcctgagccagattcagccgccgagctttggcagcggcattaacccgaccggctatgcggcgattgcggcgggcattgcggcgctgaccccgctgattgcgggcaccctgggcgcggcgaccaccgcgctgatggcgctgccgggcctggtggcggcggtggcgaccccgattgcggcgctgaccctgggcattgatggcctgaaagcggcggcgacccgcctgcagggcccgtttgaagatctgaaagcgaccatgagcagcgcggtggaaagccagtttggcccggtgtttgatcagctgcgcagcctgtttccggtgctgaaaggcgcgctgccgagcgtgaccgcgggcctggcggatatggcgaaaagcattgcggatgtgattgcgagcccggaaggcctggcgaaaattgataccaccattcgcaacattggcatggcgctgaccaccgcggcgccgggcgtgggcaaatttaccgatggcctgatgggcctggtgcagagctttaccggcaaaccgctgcagggcgtggcggaatggtttagcaaaaccggcgatagctttagcgcgtgggtggaaaaaatgacccgcccgagctggtttaccggcaaaagcccgctggaagtggcgtttggcaacctgggcgataccctgaaaaccattgcggataccctgggcgatgtgggccagaaagcgctggatttttttagcgatccggaaaaaattaaaagctttcgcgatgaactgaccctggtgagcgatgcggtgaaaggcattgcgaccggcattaacggcattgcgaccgcgtatagcagcctgccgctgagcggcgaaggcctgaaaggcctgatgccgattcaggcgcagctgggcgtgaaaatgtttgatggcctgaaagaaagcgcggcgaaagcgtttgcggaagtggcgaccatggcgggcggctttgtgagcaacattggcagcacctttatgagcattggcgataccctgagcggcatttggggcggcattaccaacgcggcggcgaccgcgtttaacagcctggtgagcgcggcgcagagcgcggtgagcggcgtggtgagcgcggtggcgaccattccgggccaggtggcgagcgcgctggcgggcctggcggaagcgggcgcgagcgcgggccgcaacctggtgcagggcctggtgaacggcattagcggcatgattggcagcgcggtggcgaaagcgcgcgaactggcgagcagcgtggcgagcgcggtgaccggctttctgggcattcatagcccgagcaaactgtttaccgaaattggcgaatatgtgggccagggctttgataacggcctgcagagccagctggcgaccctgggccagaccgcgaaagcgatggcggaaaccgtgaccgaagaatttaacggcggcctgaaatttggcgcggatggctttagcaccgatagcgataacccgctgatgcaggcgggcgcgggcctggcgaacgcgccggtggattttgcgaaagcgaccggcaaacagtttctgagcgatctgggcattagcggcaacggcgtgctgagccgcgcgctgaccgaaggcattcagtatatttttcagattagcagcgtggatgaagcgctgagcattaaagatcgcgaaaccagcaaaaacgcgctgagcattgtgggccgc

Codon optimized nucleotide sequence:
ATGGCCCGCAAAGCCGGCATGAGCACCGGCGTGGAAGTCGCGCGTATTAGCGTGAAAGTGGGCCCGGATACCAAACATTTTCGTCGCGAACTGAAAAAAGATCTGGATCGCATTGAAGAAAGCATGCGTGCCAAAATTGACGTAGAACCGGATATGAAAGGCTTTCGTCAGGAAGTGCAGAGCAAAACCAAAGGCATGCGCACCAGCGTGAAAGTGGATGCGGATGTGGATCGCAAAGGCTTCCTGGGCCGTATTGCCGATAGCCTGAGTCAGATTCAGCCGCCTAGCTTTGGCAGCGGCATTAACCCGACCGGTTACGCCGCGATTGCGGCGGGCATTGCCGCGCTGACCCCGCTGATTGCGGGCACCCTGGGCGCGGCGACCACCGCGCTGATGGCCCTGCCGGGCCTGGTGGCGGCGGTGGCGACCCCGATTGCGGCGCTGACTCTGGGCATTGATGGCCTGAAAGCCGCCGCGACCCGCCTGCAGGGTCCGTTTGAAGATCTGAAAGCCACCATGTCGAGCGCCGTGGAAAGCCAATTTGGCCCGGTGTTTGATCAGCTGCGTAGCCTGTTTCCGGTGCTGAAAGGCGCGCTGCCGAGCGTGACCGCGGGTCTTGCGGATATGGCGAAGAGCATTGCCGATGTGATTGCCAGCCCGGAAGGCCTGGCAAAAATTGATACCACCATCCGTAACATTGGCATGGCCCTGACCACCGCGGCCCCGGGCGTTGGCAAATTTACCGATGGTCTGATGGGCCTGGTACAGAGCTTTACCGGCAAACCGCTGCAGGGCGTGGCGGAATGGTTTAGTAAAACCGGTGATTCATTTAGCGCCTGGGTGGAAAAAATGACCCGCCCGAGCTGGTTTACCGGCAAAAGCCCGCTGGAAGTCGCATTCGGCAACCTGGGCGATACCCTGAAAACCATTGCCGATACGCTGGGCGATGTCGGCCAGAAAGCCCTGGATTTTTTTAGCGATCCGGAAAAAATTAAATCGTTTCGCGATGAACTGACCCTGGTGAGCGATGCCGTAAAAGGCATTGCCACCGGTATCAACGGCATTGCCACCGCCTACTCATCCCTGCCGCTGAGCGGCGAAGGCCTGAAAGGCCTGATGCCGATTCAGGCCCAGCTGGGCGTTAAAATGTTTGATGGTCTGAAAGAAAGCGCCGCGAAAGCCTTCGCGGAAGTTGCGACCATGGCAGGCGGCTTTGTGTCGAACATTGGCTCCACCTTTATGAGCATTGGCGATACCCTGAGCGGTATTTGGGGTGGCATTACCAATGCGGCGGCGACCGCGTTTAACAGCCTGGTTAGCGCGGCGCAGAGCGCGGTGAGCGGCGTGGTGAGCGCGGTGGCCACCATTCCGGGTCAGGTGGCGAGCGCCCTGGCGGGCCTGGCCGAAGCAGGCGCCAGCGCCGGCCGCAATCTGGTGCAGGGTCTGGTCAACGGCATTAGCGGCATGATCGGCAGCGCCGTAGCGAAAGCGCGCGAACTGGCGAGCAGCGTGGCGTCAGCGGTTACCGGCTTCCTGGGCATTCACAGCCCGAGCAAATTATTTACCGAAATTGGTGAATATGTGGGCCAGGGCTTCGATAACGGCCTGCAGAGCCAGCTGGCGACTCTGGGTCAGACCGCGAAAGCGATGGCGGAAACCGTGACCGAAGAATTTAACGGCGGCCTGAAATTTGGTGCGGACGGCTTTAGCACCGACAGCGATAATCCGTTAATGCAGGCGGGCGCCGGCCTGGCCAACGCGCCGGTGGATTTTGCGAAAGCCACCGGTAAACAGTTTCTGTCTGATCTGGGTATCAGCGGCAACGGCGTTCTGAGCCGTGCACTGACCGAAGGCATCCAGTATATTTTTCAGATTAGCAGCGTTGATGAAGCGCTGAGCATTAAAGATCGCGAAACCAGCAAAAACGCCCTGAGCATTGTGGGCCGT

This sequence has been optimized for expression in E. coli. In this instance, it would be helpful to do this as these phage satellites were discovered in Mycolicibacterium aichiense, a non-model organism, so expressing this protein in E. coli would make it easier to work with. In general, nucleotide sequences are optimized so that the protein can be more efficiently expressed in a given organism, so that working with it is easier (such as by removing restriction enzyme cut sites), or so that synthesis is easier (such as by lowering the GC content).

Prepare a Twist DNA Synthesis Order

Alyssa1 Tape Measure Protein Expression Cassette in Benchling
Sequence in Benchling
Benchling Benchling

Choosing a Vector in Twist: I chose pTwist Amp High Copy because the Saha Lab works with ampicillin and if I want to express this protein in E. coli, I assume I’d want a lot of protein and for every E. coli descendent to have the plasmid, so a high copy plasmid would work best.

Alyssa1 Tape Measure Protein in Plasmid
Sequence from Twist imported into Benchling
Alyssa TMP in plasmid Alyssa TMP in plasmid

DNA Read/Write/Edit

Read

What DNA would you want to sequence (e.g., read) and why?
I would like to sequence my gut microbiome. I’ve read about how the microbiome, specifically the gut microbiome can influence a person’s behaviors and send signals to the brain. This raises some philosophical questions of whether humans are truly governed by their own thoughts or if their thoughts are being subtly influenced by bacteria in their gut. I think it would be interesting to see what species dominate my microbiome.

In lecture, a variety of sequencing technologies were mentioned. What technology or technologies would you use to perform sequencing on your DNA and why?
Nanopore sequencing it can generate long reads, so it is easier to identify unique features of each species. This makes it easier to assemble genomes.

Write

What DNA would you want to synthesize (e.g., write) and why?
I have always wondered if it would one day be possible to create an organism with “no evolutionary history.” Many of the synthetic biology products/parts now are from preexisting organisms, but is it possible to create an entirely new sequence of DNA that produces proteins with no orthologs? Theoretically, this may be possible, similar to the “monkey at a type-writer” idea, where by chance a random sequence of nucleotides could not only produce a novel protein, but a novel organism. I would like to do this if only to see if it is possible.

What technology or technologies would you use to perform this DNA synthesis and why?
To design this sequence, I would need some kind of machine learning model because biology is extremely complex and cannot be modeled with traditional methods. To synthesize this DNA, I could use the most common method of DNA synthesis, phosphoramidite chemistry, to synthesize pieces of this DNA, then use an assembly technique like Gibson assembly or Golden Gate assembly to construct the genome. These methods are most accessible and are commonly used.

Essential steps: deprotection, add next nucleotide, cap to stop strands that didn’t get the new nucleotide added from growing, and oxidation.
Limitations: this method can’t make fragments longer than ~200 bp due to errors that accumulate.

Edit

What DNA would you want to edit and why? I would like to edit my dog’s DNA to make him age more slowly. My dog makes me very happy but I don’t get to see him much because I’m at college and he lives with my parents. I would love it if he could stick around long enough that I could take care of him once I am out of school.

What technology or technologies would you use to perform these DNA edits and why?
I would use CRISPR to knock in or out genes that I’d need to change. Editing an organism to change its life span would require many edits to the genome, and those edits would likely have to be precise. Because of that, CRISPR would be the best option.

Essential steps: guide RNA recognizes DNA sequence, and the Cas 9 enzyme makes a double-stranded break in the DNA.
Preparation and input: prepare the guide RNA, so I’d need to know the sequence. Inputs include cells, some sort of vector to get the CRISPR-Cas9 system into the cells, and the CRISPR-Cas9 system (enzyme and guide RNA)
Limitations: cuts in off target places, requirement of a PAM sequence, delivery mechanism (finding a vector)

Week 3 HW: Lab Automation

Paper Discussion

Paper: AssemblyTron: flexible automation of DNA assembly with Opentrons OT-2 lab robots

This paper reports using the Opentrons robot to conduct PCR, Golden Gate assembly, and Gibson assembly. It appears that the robot was able to determine specific parameters for experiments such as annealing temperature. The paper did not go into much detail about how this occurred, but after looking at some of the supplementary materials, the script likely parses .csv files that include primer and fragement sequences.

Automation in Final Project

Since my final project will likely involve assembly techniques as described above, I could use automation by writing a Python script that automatically calculates experiment parameters such as temperatures and concentrations based on information that I put in such as DNA sequences and concentrations. This part of experiment planning is tedious and often involves multiple tools/pages (such as NEB’s Tm calculator, and various protocols for each method of assembly), so having one script that can do this automatically would be helpful. Additionally, it would be interesting to see how the Opentrons robot can run this script, though I predict that that would require some amount of troubleshooting that I may not have time for.

Final Project Ideas

Slide deck