Homework

Weekly homework submissions:

  • Week 1 HW: Principles and Practices

    Describe a biological engineering application or tool you want to develop and why. One application that might be worth exploring would be in the realm of data storage for an imagined, hypothetical semi-exposed media. For this context special proteins could be designed in applications that bind to sequences that code for hidden malware or faulty sequences and or act for preserving media. I imagine that both this media design and this protein use would be niche but nonetheless fun to design for. Modalities could be as a gentle spray or as a settled solution that could be extracted.

  • Week 2 HW: DNA Read, Write, and Edit

    Part 0: Basics of Gel Electrophoresis [This was a pure watch session. Thus there’s nothing to add here.] Part 1: Benchling & In-silico Gel Art Restriction Enzymes Simulated on Lambda_NEB: EcoRI, HindIII, BamHI, KpnI, EcoRV, SacI, and SalI Simple Art produced by way of Paul Vanouse’s Latent Figure Protocol artworks through the use of RC Donovan’s Gel Art Iteration Tool (https://rcdonovan.com/gel-art): “4 corners”, using EcoRI and SalI in Lanes 1 and 10.

  • Week 3 HW: Lab Automation

    Find and describe a published paper that utilizes the Opentrons or an automation tool to achieve novel biological applications. DeRoo, J.B., Jones, A.A., Slaughter, C.K., Ahr, T.W., Stroup, S.M., Thompson, G.B. and Snow, C.D., 2025. Automation of protein crystallization scaleup via Opentrons-2 liquid handling. SLAS technology, 32, p.100268. https://doi.org/10.1016/j.slast.2025.100268 This work describes an approach by which an Opentrons-2 liquid handling robot was used for automating sitting drop protein crystallization trials. This ability also improve comparability of products produced, improving studies that depend on their proper manufacture. An important detail is how the Opentrons-2 can prove a cost-effective option for laboratory operations. For example, at the time of writing, the Opentrons-2 can be purchased for around 13.5K USD vs that of a Gryphon machine at around 65K USD.

Subsections of Homework

Week 1 HW: Principles and Practices

cover image cover image
  1. Describe a biological engineering application or tool you want to develop and why. One application that might be worth exploring would be in the realm of data storage for an imagined, hypothetical semi-exposed media. For this context special proteins could be designed in applications that bind to sequences that code for hidden malware or faulty sequences and or act for preserving media. I imagine that both this media design and this protein use would be niche but nonetheless fun to design for. Modalities could be as a gentle spray or as a settled solution that could be extracted.

  2. Describe one or more governance/policy goals related to ensuring that this application or tool contributes to an “ethical” future, like ensuring non-malfeasance (preventing harm).

A policy goal to ensure that this application contributes to an ethical future could be ensuring transparent design and standardization of these proteins (and their stats per context) so there is a way to validate their production and application prior to use. This could reduce the chance of bad actors sabotaging batches or distributing lower quality batches.

  1. Describe at least three different potential governance “actions” by considering the four aspects below (Purpose, Design, Assumptions, Risks of Failure & “Success”)
Governance ActionPurposeDesignAssumptionsFailure and Success Risks
Standardized LibrariesEducates communities on proteins available and safe designsCreate a library to hold these designs, as well as protocols for their manufacture to specThat communities will agree on these standards and that there are not unnecessary inequities holding these standardsSuccess means that manufacture paths are straightforward.

Failure means that design paths are too numerous to account for and fragmentation possibilities are higher
Blue-teaming DesignTo build a community line of protection of designs and applicationsDesign blue teaming frameworks and recruit educators to practiceThat there is sufficient interest for fundingSuccess means that it is easier for production and share. Failure means that this pipeline has an established line of people who
Red-teaming DesignTo identify means of sabotaging to proteins to protect libraries and distributorsDesign blue-teaming framework and recruit educators to practiceThat there is sufficient interest for fundingSuccess means that it is easier for production and share. Failure means that this pipeline has less visibility on sabotage routes
  1. Next, score (from 1-3 with, 1 as the best, or n/a) each of your governance actions against your rubric of policy goals.
Does the option:Option 1Option 2Option 3
Enhance Biosecurity333
• By preventing incidents333
• By helping respond333
Foster Lab Safety333
• By preventing incident333
• By helping respond333
Protect the environment333
• By preventing incidents333
• By helping respond333
Other considerations333
• Minimizing costs and burdens to stakeholders322
• Feasibility?333
• Not impede research333
• Promote constructive applications333
  1. Last, drawing upon this scoring, describe which governance option, or combination of options, you would prioritize, and why. Outline any trade-offs you considered as well as assumptions and uncertainties.

I would prioritize library standardization. All operations, from that educators to users, can be facilitated from having a common reference frame. Further, among the options, this appears the most neutral. Overall, proper security requires practitioners from both sides of the spectrum: training in protecting and disrupting systems, and being willing to document for the good of the community.


Homework

Answers for Homework Questions from Professor Jabson:

  1. Error rate - 1: 10^6. This is dwarfed by the length of the human genome. The body has numerous selectivity and repair mechanisms to deal with mitigate issues from mutations.
  2. 64 / Some codes might not work due to chemical incompatibility and structural reasons.

Answers for Homework Questions from Dr. LeProust:

  1. Phosphoramidite DNA Synthesis Cycle
  2. Increasing inefficiency of reactions with greater length
  3. Extremely high inefficiency Error rates

Answers for Homework Question from George Church:

  1. Arginine, histidine, isoleucine, leucine, lysine, methionine, phenylalanine, threonine, tryptophan, and valine

I’m still forming my thoughts on how this affects my view of the Lysine Contingency.

Week 2 HW: DNA Read, Write, and Edit

Part 0: Basics of Gel Electrophoresis

[This was a pure watch session. Thus there’s nothing to add here.]

Part 1: Benchling & In-silico Gel Art

Restriction Enzymes Simulated on Lambda_NEB: EcoRI, HindIII, BamHI, KpnI, EcoRV, SacI, and SalI

Simple Art produced by way of Paul Vanouse’s Latent Figure Protocol artworks through the use of RC Donovan’s Gel Art Iteration Tool (https://rcdonovan.com/gel-art):

“4 corners”, using EcoRI and SalI in Lanes 1 and 10.

Part 2: Gel Art - Restriction Digests and Gel Electrophoresis

[This leaned on Wetlab Access. As a virtual student, this was not required.]

Part 3: DNA Design Challenge

3.1. Choose your protein.

Regarding proteins, I chose U-box domain-containing protein 12, also known as Plant U-box protein 12 or RING-type E3 ubiquitin transferase PUB12. I wanted to start with something and keep things relatively simple.

sp|Q9ZV31|PUB12_ARATH U-box domain-containing protein 12 OS=Arabidopsis thaliana OX=3702 GN=PUB12 PE=2 SV=1 MAKSEKHKLAQTLIDSINEIASISDSVTPMKKHCANLSRRLSLLLPMLEEIRDNQESSSE VVNALLSVKQSLLHAKDLLSFVSHVSKIYLVLERDQVMVKFQKVTSLLEQALSIIPYENL EISDELKEQVELVLVQLRRSLGKRGGDVYDDELYKDVLSLYSGRGSVMESDMVRRVAEKL QLMTITDLTQESLALLDMVSSSGGDDPGESFEKMSMVLKKIKDFVQTYNPNLDDAPLRLK SSLPKSRDDDRDMLIPPEEFRCPISLELMTDPVIVSSGQTYERECIKKWLEGGHLTCPKT QETLTSDIMTPNYVLRSLIAQWCESNGIEPPKRPNISQPSSKASSSSSAPDDEHNKIEEL LLKLTSQQPEDRRSAAGEIRLLAKQNNHNRVAIAASGAIPLLVNLLTISNDSRTQEHAVT SILNLSICQENKGKIVYSSGAVPGIVHVLQKGSMEARENAAATLFSLSVIDENKVTIGAA GAIPPLVTLLSEGSQRGKKDAATALFNLCIFQGNKGKAVRAGLVPVLMRLLTEPESGMVD ESLSILAILSSHPDGKSEVGAADAVPVLVDFIRSGSPRNKENSAAVLVHLCSWNQQHLIE AQKLGIMDLLIEMAENGTDRGKRKAAQLLNRFSRFNDQQKQHSGLGLEDQISLI Site: https://rest.uniprot.org/uniprotkb/Q9ZV31.fasta Base site: https://www.uniprot.org/uniprotkb/Q9ZV31/entry

3.2. Reverse Translate: Protein (amino acid) sequence to DNA (nucleotide) sequence.

Reverse Translation Tool – BCCM – GeneCorner (https://www.genecorner.ugent.be/rev_trans.html) PUB12, a plant U-box–type E3 ubiquitin ligase DNA sequence (1) >reverse translation of sp|Q9ZV31|PUB12_ARATH U-box domain-containing protein 12 OS=Arabidopsis thaliana OX=3702 GN=PUB12 PE=2 SV=1 to a 1962 base sequence of most likely codons. atggcgaaaagcgaaaaacataaactggcgcagaccctgattgatagcattaacgaaatt gcgagcattagcgatagcgtgaccccgatgaaaaaacattgcgcgaacctgagccgccgc ctgagcctgctgctgccgatgctggaagaaattcgcgataaccaggaaagcagcagcgaa gtggtgaacgcgctgctgagcgtgaaacagagcctgctgcatgcgaaagatctgctgagc tttgtgagccatgtgagcaaaatttatctggtgctggaacgcgatcaggtgatggtgaaa tttcagaaagtgaccagcctgctggaacaggcgctgagcattattccgtatgaaaacctg gaaattagcgatgaactgaaagaacaggtggaactggtgctggtgcagctgcgccgcagc ctgggcaaacgcggcggcgatgtgtatgatgatgaactgtataaagatgtgctgagcctg tatagcggccgcggcagcgtgatggaaagcgatatggtgcgccgcgtggcggaaaaactg cagctgatgaccattaccgatctgacccaggaaagcctggcgctgctggatatggtgagc agcagcggcggcgatgatccgggcgaaagctttgaaaaaatgagcatggtgctgaaaaaa attaaagattttgtgcagacctataacccgaacctggatgatgcgccgctgcgcctgaaa agcagcctgccgaaaagccgcgatgatgatcgcgatatgctgattccgccggaagaattt cgctgcccgattagcctggaactgatgaccgatccggtgattgtgagcagcggccagacc tatgaacgcgaatgcattaaaaaatggctggaaggcggccatctgacctgcccgaaaacc caggaaaccctgaccagcgatattatgaccccgaactatgtgctgcgcagcctgattgcg cagtggtgcgaaagcaacggcattgaaccgccgaaacgcccgaacattagccagccgagc agcaaagcgagcagcagcagcagcgcgccggatgatgaacataacaaaattgaagaactg ctgctgaaactgaccagccagcagccggaagatcgccgcagcgcggcgggcgaaattcgc ctgctggcgaaacagaacaaccataaccgcgtggcgattgcggcgagcggcgcgattccg ctgctggtgaacctgctgaccattagcaacgatagccgcacccaggaacatgcggtgacc agcattctgaacctgagcatttgccaggaaaacaaaggcaaaattgtgtatagcagcggc gcggtgccgggcattgtgcatgtgctgcagaaaggcagcatggaagcgcgcgaaaacgcg gcggcgaccctgtttagcctgagcgtgattgatgaaaacaaagtgaccattggcgcggcg ggcgcgattccgccgctggtgaccctgctgagcgaaggcagccagcgcggcaaaaaagat gcggcgaccgcgctgtttaacctgtgcatttttcagggcaacaaaggcaaagcggtgcgc gcgggcctggtgccggtgctgatgcgcctgctgaccgaaccggaaagcggcatggtggat gaaagcctgagcattctggcgattctgagcagccatccggatggcaaaagcgaagtgggc gcggcggatgcggtgccggtgctggtggattttattcgcagcggcagcccgcgcaacaaa gaaaacagcgcggcggtgctggtgcatctgtgcagctggaaccagcagcatctgattgaa gcgcagaaactgggcattatggatctgctgattgaaatggcggaaaacggcaccgatcgc ggcaaacgcaaagcggcgcagctgctgaaccgctttagccgctttaacgatcagcagaaa cagcatagcggcctgggcctggaagatcagattagcctgatt

(2) >reverse translation of sp|Q9ZV31|PUB12_ARATH U-box domain-containing protein 12 OS=Arabidopsis thaliana OX=3702 GN=PUB12 PE=2 SV=1 to a 1962 base sequence of consensus codons. atggcnaarwsngaraarcayaarytngcncaracnytnathgaywsnathaaygarath gcnwsnathwsngaywsngtnacnccnatgaaraarcaytgygcnaayytnwsnmgnmgn ytnwsnytnytnytnccnatgytngargarathmgngayaaycargarwsnwsnwsngar gtngtnaaygcnytnytnwsngtnaarcarwsnytnytncaygcnaargayytnytnwsn ttygtnwsncaygtnwsnaarathtayytngtnytngarmgngaycargtnatggtnaar ttycaraargtnacnwsnytnytngarcargcnytnwsnathathccntaygaraayytn garathwsngaygarytnaargarcargtngarytngtnytngtncarytnmgnmgnwsn ytnggnaarmgnggnggngaygtntaygaygaygarytntayaargaygtnytnwsnytn taywsnggnmgnggnwsngtnatggarwsngayatggtnmgnmgngtngcngaraarytn carytnatgacnathacngayytnacncargarwsnytngcnytnytngayatggtnwsn wsnwsnggnggngaygayccnggngarwsnttygaraaratgwsnatggtnytnaaraar athaargayttygtncaracntayaayccnaayytngaygaygcnccnytnmgnytnaar wsnwsnytnccnaarwsnmgngaygaygaymgngayatgytnathccnccngargartty mgntgyccnathwsnytngarytnatgacngayccngtnathgtnwsnwsnggncaracn taygarmgngartgyathaaraartggytngarggnggncayytnacntgyccnaaracn cargaracnytnacnwsngayathatgacnccnaaytaygtnytnmgnwsnytnathgcn cartggtgygarwsnaayggnathgarccnccnaarmgnccnaayathwsncarccnwsn wsnaargcnwsnwsnwsnwsnwsngcnccngaygaygarcayaayaarathgargarytn ytnytnaarytnacnwsncarcarccngargaymgnmgnwsngcngcnggngarathmgn ytnytngcnaarcaraayaaycayaaymgngtngcnathgcngcnwsnggngcnathccn ytnytngtnaayytnytnacnathwsnaaygaywsnmgnacncargarcaygcngtnacn wsnathytnaayytnwsnathtgycargaraayaarggnaarathgtntaywsnwsnggn gcngtnccnggnathgtncaygtnytncaraarggnwsnatggargcnmgngaraaygcn gcngcnacnytnttywsnytnwsngtnathgaygaraayaargtnacnathggngcngcn ggngcnathccnccnytngtnacnytnytnwsngarggnwsncarmgnggnaaraargay gcngcnacngcnytnttyaayytntgyathttycarggnaayaarggnaargcngtnmgn gcnggnytngtnccngtnytnatgmgnytnytnacngarccngarwsnggnatggtngay garwsnytnwsnathytngcnathytnwsnwsncayccngayggnaarwsngargtnggn gcngcngaygcngtnccngtnytngtngayttyathmgnwsnggnwsnccnmgnaayaar garaaywsngcngcngtnytngtncayytntgywsntggaaycarcarcayytnathgar gcncaraarytnggnathatggayytnytnathgaratggcngaraayggnacngaymgn ggnaarmgnaargcngcncarytnytnaaymgnttywsnmgnttyaaygaycarcaraar carcaywsnggnytnggnytngargaycarathwsnytnath

3.3. Codon optimization. Once a nucleotide sequence of your protein is determined, you need to codon optimize your sequence. You may, once again, utilize google for a “codon optimization tool”. In your own words, describe why you need to optimize codon usage.

  • Optimizing codon usage can help account for host bias as well as reduce the pool of downstream errors to account for, as well as save materials and time.

Which organism have you chosen to optimize the codon sequence for and why?

  • Which: Arabidopsis thaliana

  • Why: I wanted to start somewhere, and this organism is familiar.

  • Optimization Tool Used: Vector Builder (https://en.vectorbuilder.com/tool/codon-optimization.html ) Chose to use the first sequence of the two. Enzyme recognition sites avoided included BsaI and BbsI.

Resulting Sequence is as follows: PUB12, a plant U-box–type E3 ubiquitin ligase DNA sequence ATGGCTAAATCTGAGAAGCATAAGTTGGCTCAGACTCTCATCGATTCTATAAATGAAATTGCTTCTATCTCAGATTCAGTTACTCCAATGAAGAAGCATTGTGCAAATTTGTCTAGGAGATTGTCACTTCTTCTTCCAATGTTGGAAGAGATTAGAGATAATCAAGAGTCTAGCTCTGAAGTCGTGAACGCTTTGCTCTCAGTTAAACAATCTTTATTACATGCTAAGGATCTCTTGTCTTTCGTCAGTCATGTGAGCAAGATATATCTTGTTCTTGAGAGAGATCAAGTGATGGTTAAGTTTCAAAAAGTTACTAGCCTTCTTGAGCAAGCTCTTTCTATAATCCCTTATGAGAATCTTGAAATTTCTGATGAATTGAAAGAACAAGTTGAATTGGTTCTTGTTCAACTTAGAAGATCTTTGGGTAAGAGAGGTGGTGATGTTTACGATGATGAACTTTATAAGGATGTTCTTTCACTTTACAGTGGAAGAGGATCAGTTATGGAGAGTGATATGGTTCGTCGAGTTGCCGAGAAATTGCAACTAATGACTATCACTGATTTGACACAAGAGTCTCTTGCTCTTCTTGATATGGTTTCTTCTAGTGGTGGAGATGATCCTGGAGAGTCATTCGAAAAGATGTCTATGGTTCTTAAGAAAATTAAGGATTTCGTTCAAACCTATAATCCTAACCTAGATGACGCTCCCCTTAGACTTAAATCATCATTGCCTAAATCGAGAGATGATGATCGTGATATGCTTATTCCACCTGAAGAATTCCGTTGTCCTATTTCGCTTGAGCTTATGACTGATCCTGTAATCGTTTCTTCAGGTCAAACCTATGAAAGAGAGTGTATTAAGAAGTGGCTTGAAGGAGGACATTTGACATGTCCTAAGACTCAAGAAACTTTGACATCTGATATCATGACCCCTAATTATGTTCTTAGATCTTTGATCGCTCAATGGTGTGAGTCGAATGGAATCGAGCCTCCAAAGAGGCCAAACATAAGTCAGCCTTCTAGTAAGGCTTCTTCATCATCTAGTGCTCCTGATGACGAACATAATAAGATCGAAGAATTGCTCTTGAAGTTGACTTCTCAGCAACCTGAAGATAGAAGATCCGCTGCTGGAGAGATCAGACTTTTGGCCAAACAAAACAACCATAACAGAGTTGCTATCGCTGCTTCAGGAGCTATTCCACTCTTGGTGAACCTTTTGACTATCTCAAACGATTCCAGAACACAAGAGCATGCTGTTACGTCTATCCTCAACCTTTCTATCTGCCAAGAAAATAAAGGTAAGATCGTTTATTCTAGTGGTGCAGTGCCTGGTATTGTTCATGTTTTGCAGAAGGGATCAATGGAGGCTAGAGAAAACGCTGCTGCTACTCTTTTCTCTCTTTCCGTTATAGATGAGAATAAGGTTACTATTGGAGCTGCTGGAGCAATTCCACCTTTGGTTACTCTCCTTTCTGAAGGATCACAGCGTGGAAAGAAGGATGCTGCTACTGCACTCTTCAACCTTTGTATCTTTCAGGGTAATAAAGGTAAGGCAGTTAGAGCAGGACTTGTGCCTGTGCTTATGAGGCTTTTGACTGAACCTGAATCTGGAATGGTTGATGAGAGCCTTTCTATTCTTGCTATTCTTTCTTCTCATCCAGACGGAAAGTCTGAAGTTGGAGCTGCTGATGCAGTTCCTGTTCTTGTTGATTTCATCAGATCTGGATCTCCTAGAAATAAGGAGAATTCTGCTGCAGTTCTTGTTCACTTGTGTTCATGGAATCAACAACATCTTATCGAAGCACAGAAGCTTGGAATCATGGATCTTCTCATCGAGATGGCTGAAAACGGAACTGATCGTGGTAAGAGAAAGGCCGCACAATTGCTTAATAGATTTTCTAGATTTAACGATCAGCAGAAGCAACACAGTGGTCTTGGTCTTGAAGATCAAATTTCATTGATT Below is a printout contrasting the two:

3.4. You have a sequence! Now what?

Recombinant DNA technologies could be utilized to make this protein from the DNA (Cell-free or Cell-dependent). Respectively they involve either special mix that can take place in a test tube or through using a live cell’s machinery to produce the protein.

Part 4: Prepare a Twist DNA Synthesis Order

Build Your DNA Insert Sequence

I prepared the above improved sequence as a test order. Below is the initial step through the creation of the DNA/RNA Sequence in Benchling with a Linear Topology.

It was annotated below as such within Benchling before a linear map and file was constructed that could be uploaded to Twist Bio.

Start Codon: ATG Coding Sequence: ATGGCTAAATCTGAGAAGCATAAGTTGGCTCAGACTCTCATCGATTCTATAAATGAAATTGCTTCTATCTCAGATTCAGTTACTCCAATGAAGAAGCATTGTGCAAATTTGTCTAGGAGATTGTCACTTCTTCTTCCAATGTTGGAAGAGATTAGAGATAATCAAGAGTCTAGCTCTGAAGTCGTGAACGCTTTGCTCTCAGTTAAACAATCTTTATTACATGCTAAGGATCTCTTGTCTTTCGTCAGTCATGTGAGCAAGATATATCTTGTTCTTGAGAGAGATCAAGTGATGGTTAAGTTTCAAAAAGTTACTAGCCTTCTTGAGCAAGCTCTTTCTATAATCCCTTATGAGAATCTTGAAATTTCTGATGAATTGAAAGAACAAGTTGAATTGGTTCTTGTTCAACTTAGAAGATCTTTGGGTAAGAGAGGTGGTGATGTTTACGATGATGAACTTTATAAGGATGTTCTTTCACTTTACAGTGGAAGAGGATCAGTTATGGAGAGTGATATGGTTCGTCGAGTTGCCGAGAAATTGCAACTAATGACTATCACTGATTTGACACAAGAGTCTCTTGCTCTTCTTGATATGGTTTCTTCTAGTGGTGGAGATGATCCTGGAGAGTCATTCGAAAAGATGTCTATGGTTCTTAAGAAAATTAAGGATTTCGTTCAAACCTATAATCCTAACCTAGATGACGCTCCCCTTAGACTTAAATCATCATTGCCTAAATCGAGAGATGATGATCGTGATATGCTTATTCCACCTGAAGAATTCCGTTGTCCTATTTCGCTTGAGCTTATGACTGATCCTGTAATCGTTTCTTCAGGTCAAACCTATGAAAGAGAGTGTATTAAGAAGTGGCTTGAAGGAGGACATTTGACATGTCCTAAGACTCAAGAAACTTTGACATCTGATATCATGACCCCTAATTATGTTCTTAGATCTTTGATCGCTCAATGGTGTGAGTCGAATGGAATCGAGCCTCCAAAGAGGCCAAACATAAGTCAGCCTTCTAGTAAGGCTTCTTCATCATCTAGTGCTCCTGATGACGAACATAATAAGATCGAAGAATTGCTCTTGAAGTTGACTTCTCAGCAACCTGAAGATAGAAGATCCGCTGCTGGAGAGATCAGACTTTTGGCCAAACAAAACAACCATAACAGAGTTGCTATCGCTGCTTCAGGAGCTATTCCACTCTTGGTGAACCTTTTGACTATCTCAAACGATTCCAGAACACAAGAGCATGCTGTTACGTCTATCCTCAACCTTTCTATCTGCCAAGAAAATAAAGGTAAGATCGTTTATTCTAGTGGTGCAGTGCCTGGTATTGTTCATGTTTTGCAGAAGGGATCAATGGAGGCTAGAGAAAACGCTGCTGCTACTCTTTTCTCTCTTTCCGTTATAGATGAGAATAAGGTTACTATTGGAGCTGCTGGAGCAATTCCACCTTTGGTTACTCTCCTTTCTGAAGGATCACAGCGTGGAAAGAAGGATGCTGCTACTGCACTCTTCAACCTTTGTATCTTTCAGGGTAATAAAGGTAAGGCAGTTAGAGCAGGACTTGTGCCTGTGCTTATGAGGCTTTTGACTGAACCTGAATCTGGAATGGTTGATGAGAGCCTTTCTATTCTTGCTATTCTTTCTTCTCATCCAGACGGAAAGTCTGAAGTTGGAGCTGCTGATGCAGTTCCTGTTCTTGTTGATTTCATCAGATCTGGATCTCCTAGAAATAAGGAGAATTCTGCTGCAGTTCTTGTTCACTTGTGTTCATGGAATCAACAACATCTTATCGAAGCACAGAAGCTTGGAATCATGGATCTTCTCATCGAGATGGCTGAAAACGGAACTGATCGTGGTAAGAGAAAGGCCGCACAATTGCTTAATAGATTTTCTAG Stop Codon: TAG

Linear Map:

Here’s an example of what you just annotated in Benchling: Sequence Import and Quote Obtained

The pTwist Amp High Copy: pTwist Amp Vector was chosen after the Clonal Gene Choice was pursued. The quote is to the left.. The annotated sequence page from TwistBio from which a GenBank construct file was downloaded is to the right.

The construct was imported into Benchling to yield the plasmid below.

Part 5: DNA Read/Write/Edit

5.1 DNA Read

(i) What DNA would you want to sequence (e.g., read) and why?

  • I might want to sequence large viruses, particularly out of sheer curiosity for the instructions that allow them to exist in their current forms.

Sources of interest:

  • Piacente, F., De Castro, C., Jeudy, S., Molinaro, A., Salis, A., Damonte, G., Bernardi, C., Abergel, C. and Tonetti, M.G., 2014. Giant virus Megavirus chilensis encodes the biosynthetic pathway for uncommon acetamido sugars. Journal of Biological Chemistry, 289(35), pp.24428-24439.

  • Legendre, M., Arslan, D., Abergel, C. and Claverie, J.M., 2012. Genomics of Megavirus and the elusive fourth domain of Life. Communicative & integrative biology, 5(1), pp.102-106.

  • Arslan, D., Legendre, M., Seltzer, V., Abergel, C. and Claverie, J.M., 2011. Distant Mimivirus relative with a larger genome highlights the fundamental features of Megaviridae. Proceedings of the National Academy of Sciences, 108(42), pp.17486-17491.

(ii) In lecture, a variety of sequencing technologies were mentioned. What technology or technologies would you use to perform sequencing on your DNA and why?

  • I would borrow from the methods used in the aforementioned literature, particularly “454-titanium and Illumina HiSeq approaches”. These methods appear adequate. Lack of a priori knowledge of the genome or genomic features not being required is helpful, in addition to single-nucleotide resolution, higher dynamic range, and less DNA/RNA needed.

https://www.ebi.ac.uk/training/online/courses/functional-genomics-ii-common-technologies-and-data-analysis-methods/next-generation-sequencing/

Also answer the following questions: Is your method first-, second- or third-generation or other? How so?

  • Second-generation. They engage massively parallel sequencing.

What is your input? How do you prepare your input (e.g. fragmentation, adapter ligation, PCR)? List the essential steps.

  • Library Preparation (fragmenting of DNA and adapters added to both ends of DNA for amplification)
  • Sequencing
  • Data Analysis and Cleanup

What are the essential steps of your chosen sequencing technology, how does it decode the bases of your DNA sample (base calling)?

Simplified:

  • The DNA strand gets color coded with fluorescent terminators
  • Images are taken of the flow cell after each letter is added
  • Software determines the bases based on color intensities
  • Calls are made, corrections are issued, and output is cleaned up

Source of interest: https://genohub.com/bioinformatics/10/base-calling

What is the output of your chosen sequencing technology?

  • Numerous sequence reads

5.2 DNA Write (i) What DNA would you want to synthesize (e.g., write) and why?

I am not sure yet, but I am leaning towards DNA origami art to experiment with the medium and explore versatility of applications.

Sources of interest:

  • Bush, J., Singh, S., Vargas, M., Oktay, E., Hu, C.H. and Veneziano, R., 2020. Synthesis of DNA origami scaffolds: Current and emerging strategies. Molecules, 25(15), p.3386.

  • Weck, J.M. and Heuer-Jungemann, A., 2025. Fully addressable designer superstructures assembled from one single modular DNA origami. Nature communications, 16(1), p.1556.

  • DNA origami by Paul W. K. Rothemund, California Institute of Technology, 2004. 100 nanometers in diameter. (ii) What technology or technologies would you use to perform this DNA synthesis and why?

  • For validating the structures, if cost didn’t matter, I would consider using next generation sequencing (Illumina, for both sequences of the staples and scaffold) and Atomic Force Microscopy (Visual, especially confirming folds)

Also answer the following questions:

The essential steps of the chosen sequencing methods would be:

  • Library Prep (DNA Fragmentation and Adapter ligation methods)
  • Cluster Generation via amplification
  • Sequencing and base calling

What are the limitations of your sequencing method (if any) in terms of speed, accuracy, scalability?

  • The major limitations would be cost, error rates, and short read lengths. This would not be efficient to scale as is.

Improvements would involve: -Hand design of patterns (for ideating improvements) -Computer design and optimization of material usage -Production of material and strand-routing precision

5.3 DNA Edit (i) What DNA would you want to edit and why?

DNA edits that I would like to perform would be those that allow for the minimization and or elimination of metabolic disease states. The why comes down to the quality-of-life improvements for all involved.

(ii) What technology or technologies would you use to perform these DNA edits and why? Also answer the following questions:

How does your technology of choice edit DNA? What are the essential steps?

  • That, I’m still thinking about. While some gene therapies have been successfully – my mind shifts to which exact technology is the “right” choice, along with the ethical hurdles needed to investigate such properly.

What preparation do you need to do (e.g. design steps) and what is the input (e.g. DNA template, enzymes, plasmids, primers, guides, cells) for the editing?-

  • TBA

What are the limitations of your editing methods (if any) in terms of efficiency or precision?

  • TBA

Week 3 HW: Lab Automation

  1. Find and describe a published paper that utilizes the Opentrons or an automation tool to achieve novel biological applications.

DeRoo, J.B., Jones, A.A., Slaughter, C.K., Ahr, T.W., Stroup, S.M., Thompson, G.B. and Snow, C.D., 2025. Automation of protein crystallization scaleup via Opentrons-2 liquid handling. SLAS technology, 32, p.100268.

https://doi.org/10.1016/j.slast.2025.100268

This work describes an approach by which an Opentrons-2 liquid handling robot was used for automating sitting drop protein crystallization trials. This ability also improve comparability of products produced, improving studies that depend on their proper manufacture. An important detail is how the Opentrons-2 can prove a cost-effective option for laboratory operations. For example, at the time of writing, the Opentrons-2 can be purchased for around 13.5K USD vs that of a Gryphon machine at around 65K USD.

  1. Write a description about what you intend to do with automation tools for your final project.

I’m still forming my thoughts about how I want to effectively use automation tools for my final project.

So far, I am interested in branching off from example #2 given in the Homework and the above example, regarding screening an array of designed biosensor constructs.

One idea had in mind was towards a digital tracing project that revolves around said constructs used to track known entities.

Simply, products are given a unique ID with stored parameters. These are linked within a automation run so that each product is trackable as they are processed. One application that is probably already in use but would be fun to adapt towards something not already applied would be swappable combined wearable crystallized biosensors that are traded in daily for workers that are liable to be exposed to a particular organism and pollutant pairs.

I could use an Echo for transfer of nano-scale components. The Bravo or Opentrons-2 could be used for precise, automated pipetting ,especially of the crystals. The multiflow would be used to dispense the larger scale volume components. The PlateLoc would be helful for sealing the plates. The inheco could be used for controlled incubation. The Xpeel would be used for careful desealing of the plates. Finally, the PHERAstar could be used for reading fluorescence outputs.

Still developing this out from this branch.