Homework

Weekly homework submissions:

Week 1 HW: Principles and Practices
Describe a biological engineering application or tool you want to develop and why. One application that might be worth exploring would be in the realm of data storage for an imagined, hypothetical semi-exposed media. For this context special proteins could be designed in applications that bind to sequences that code for hidden malware or faulty sequences and or act for preserving media. I imagine that both this media design and this protein use would be niche but nonetheless fun to design for. Modalities could be as a gentle spray or as a settled solution that could be extracted.
Week 2 HW: DNA Read, Write, and Edit
Homework Part 0: Basics of Gel Electrophoresis [This was a pure watch session. Thus there’s nothing to add here.] Part 1: Benchling & In-silico Gel Art Restriction Enzymes Simulated on Lambda_NEB: EcoRI, HindIII, BamHI, KpnI, EcoRV, SacI, and SalI Simple Art produced by way of Paul Vanouse’s Latent Figure Protocol artworks through the use of RC Donovan’s Gel Art Iteration Tool (https://rcdonovan.com/gel-art):
Week 3 HW: Lab Automation
This lab, we were tasked with creating a design that could be generated by an OpenTrons Liquid Handling Robot. Assignment: Python Script for Opentrons Artwork Review this week’s recitation and this week’s lab for details on the Opentrons and programming it. Done. Generate an artistic design using the GUI at opentrons-art.rcdonovan.com. As a remote participant, I prototyped a design using the GUI at opentrons-art.rcdonovan.com.
Week 4 HW: Protein Design Part I
Part A: Conceptual Questions We were allowed answer 9 out of 11 of the following questions: How many molecules of amino acids do you take with a piece of 500 grams of meat? (on average an amino acid is ~100 Daltons) I expect answers to vary since meat composition vary. Water composition of meat is estimated to be about 65-80% meat. Some estimates have proteins make up around 1/5 to less than 1/3 of overall muscle tissue.
Week 5 HW: Protein Design Part II
This week’s homework was divided into 3 parts. Part A involved SOD1 Binder Peptide Design. That was broken into 3 parts: Part 1: Part 1: Generate Binders with PepMLM A) Begin by retrieving the human SOD1 sequence from UniProt (P00441) and introducing the A4V mutation. Done B) Using the PepMLM Colab linked from the HuggingFace PepMLM-650M model card: Done C) Generate four peptides of length 12 amino acids conditioned on the mutant SOD1 sequence.
Week 6 HW: Genetic Circuits Part I: Assembly Technologies
Part 1: DNA Assembly What are some components in the Phusion High-Fidelity PCR Master Mix and what is their purpose? Some of the components include the Phusion DNA Polymerease (key to PCR), dNTPs (these are used to synthesize new DNA), and buffering materials to stabilize the environmental conditions during the raction. What are some factors that determine primer annealing temperature during PCR? Some of these include buffer conditions, primer melting temperature, primer length, and GC content.
Week 7 HW: Genetic Circuits Part II: Neuromorphic Circuits
Assignment Part 1: Intracellular Artificial Neural Networks (IANNs) What advantages do IANNs have over traditional genetic circuits, whose input/output behaviors are Boolean functions? IANNs are great for analog computation, while also being scalable in deployment and useful for approximating functions. Describe a useful application for an IANN; include a detailed description of input/output behavior, as well as any limitations an IANN might face to achieve your goal. Disease diagnostics. Inputs would be expressions levels of biomarkers and signals deemed harmful. Outputs could be in the form of a gene with a strong signal or that is therapetic in nature. Strong limitations could come from unintended interactions with the output, as well as cross-talk generated. Noise within the biological system could affected outputs as well.
Week 8 HW: Break Week
This was a free week and was not obligatory to mark. This week was used to exploration of course materials and Final Project development.
Week 9 HW: Week 9 — Cell-Free Systems
General homework questions Explain the main advantages of cell-free protein synthesis over traditional in vivo methods, specifically in terms of flexibility and control over experimental variables. Name at least two cases where cell-free expression is more beneficial than cell production. Cell free expression allows teams to conduct biomanufacturing without living cells and operate beyond the constraints of productions in living systems.
Week 10 HW: Week 10 — Advanced Imaging & Measurement Technology
Homework: Final Project Measurement Draft The main aspect of my project that I will measure is the functional activity of a mutated InaZ construct, specifically whether it increases ice nucleation efficiency relative to a control InaZ construct. I will perform this measurement using a controlled freezing assay in which replicate samples are cooled gradually and monitored for the onset of ice formation. The primary data collected will be the temperature at which freezing begins in each sample. Ice formation will be detected through optical observation of crystal formation and through temperature sensors that record the freezing point. In addition, I would verify the identity of the mutated construct using PCR, gel electrophoresis, and DNA sequencing. PCR and gel electrophoresis would be used to confirm the presence and approximate size of the inserted DNA, while DNA sequencing would confirm that the engineered inaZ mutation is correct. Together, these measurements allow me to confirm both that the construct was built properly and that it produces the intended increase in ice nucleation activity.
Week 11 HW: Bioproduction & Cloud Labs
Part A: The 1,536 Pixel Artwork Canvas | Collective Artwork Contribute at least one pixel to this global artwork experiment before the editing ends on Sunday 4/19 at 11:59 PM EST! A personalized URL was sent to the email address associated with your Discourse account, and you can discuss the artwork on the Discourse! If you did not have a chance to contribute, it’s okay, just make sure you become a TA this fall! 😉
Week 12 HW: Building Genomes
Homework: This homework was a reference to A) the acknowledhing the updated Week 11 and B) direction to make progress on our Final Project. This is expressed in the image below. Progress: Done. A) Contributed pixels to the Global Artwork Experiment and Followed up with mastermix concentrations as per part 3 of Homework 11C. B) The project was refined through constructive class critique and further reflection.
Week 13 HW: AI, SynBio, and Scaling Health Innovation (ARPA-H)
Homework: Work on your Final Project Present it May 12 (MIT/Harvard) or May 13 (Committed Listeners) Progress: Done. I further developed and practiced my presentation with my study group, classmates, and via some solo practice.
Week 14 HW: Bio Design & Bio Fabrication
Homework: Homework: Finish your Final Project Present it May 12 (MIT/Harvard) or May 13 (Committed Listeners Progress: Done. Final preparations were had during this week by which I practiced my presentation with my study group, classmates, and via some solo practice. As an additional note: my study group held multiple sessions to ensure that each of our projects were further sharpened. My Final Project, Project Z Freeze was successfully presented. My timeslot was 8:10PM on May 13.

Week 1 HW: Principles and Practices

Describe a biological engineering application or tool you want to develop and why. One application that might be worth exploring would be in the realm of data storage for an imagined, hypothetical semi-exposed media. For this context special proteins could be designed in applications that bind to sequences that code for hidden malware or faulty sequences and or act for preserving media. I imagine that both this media design and this protein use would be niche but nonetheless fun to design for. Modalities could be as a gentle spray or as a settled solution that could be extracted.
Describe one or more governance/policy goals related to ensuring that this application or tool contributes to an “ethical” future, like ensuring non-malfeasance (preventing harm).

A policy goal to ensure that this application contributes to an ethical future could be ensuring transparent design and standardization of these proteins (and their stats per context) so there is a way to validate their production and application prior to use. This could reduce the chance of bad actors sabotaging batches or distributing lower quality batches.

Describe at least three different potential governance “actions” by considering the four aspects below (Purpose, Design, Assumptions, Risks of Failure & “Success”)

Governance Action	Purpose	Design	Assumptions	Failure and Success Risks
Standardized Libraries	Educates communities on proteins available and safe designs	Create a library to hold these designs, as well as protocols for their manufacture to spec	That communities will agree on these standards and that there are not unnecessary inequities holding these standards	Success means that manufacture paths are straightforward. Failure means that design paths are too numerous to account for and fragmentation possibilities are higher
Blue-teaming Design	To build a community line of protection of designs and applications	Design blue teaming frameworks and recruit educators to practice	That there is sufficient interest for funding	Success means that it is easier for production and share. Failure means that this pipeline has an established line of people who
Red-teaming Design	To identify means of sabotaging to proteins to protect libraries and distributors	Design blue-teaming framework and recruit educators to practice	That there is sufficient interest for funding	Success means that it is easier for production and share. Failure means that this pipeline has less visibility on sabotage routes

Next, score (from 1-3 with, 1 as the best, or n/a) each of your governance actions against your rubric of policy goals.

Does the option:	Option 1	Option 2	Option 3
Enhance Biosecurity	3	3	3
• By preventing incidents	3	3	3
• By helping respond	3	3	3
Foster Lab Safety	3	3	3
• By preventing incident	3	3	3
• By helping respond	3	3	3
Protect the environment	3	3	3
• By preventing incidents	3	3	3
• By helping respond	3	3	3
Other considerations	3	3	3
• Minimizing costs and burdens to stakeholders	3	2	2
• Feasibility?	3	3	3
• Not impede research	3	3	3
• Promote constructive applications	3	3	3

Last, drawing upon this scoring, describe which governance option, or combination of options, you would prioritize, and why. Outline any trade-offs you considered as well as assumptions and uncertainties.

I would prioritize library standardization. All operations, from that educators to users, can be facilitated from having a common reference frame. Further, among the options, this appears the most neutral. Overall, proper security requires practitioners from both sides of the spectrum: training in protecting and disrupting systems, and being willing to document for the good of the community.

Homework

Answers for Homework Questions from Professor Jabson:

Error rate - 1: 10^6. This is dwarfed by the length of the human genome. The body has numerous selectivity and repair mechanisms to deal with mitigate issues from mutations.
64 / Some codes might not work due to chemical incompatibility and structural reasons.

Answers for Homework Questions from Dr. LeProust:

Phosphoramidite DNA Synthesis Cycle
Increasing inefficiency of reactions with greater length
Extremely high inefficiency Error rates

Answers for Homework Question from George Church:

Arginine, histidine, isoleucine, leucine, lysine, methionine, phenylalanine, threonine, tryptophan, and valine

I’m still forming my thoughts on how this affects my view of the Lysine Contingency.

Week 2 HW: DNA Read, Write, and Edit

Homework

Part 0: Basics of Gel Electrophoresis

[This was a pure watch session. Thus there’s nothing to add here.]

Part 1: Benchling & In-silico Gel Art

Restriction Enzymes Simulated on Lambda_NEB: EcoRI, HindIII, BamHI, KpnI, EcoRV, SacI, and SalI

Simple Art produced by way of Paul Vanouse’s Latent Figure Protocol artworks through the use of RC Donovan’s Gel Art Iteration Tool (https://rcdonovan.com/gel-art):

“4 corners”, using EcoRI and SalI in Lanes 1 and 10.

Part 2: Gel Art - Restriction Digests and Gel Electrophoresis

[This leaned on Wetlab Access. As a virtual student, this was not required.]

Part 3: DNA Design Challenge

3.1. Choose your protein.

Regarding proteins, I chose U-box domain-containing protein 12, also known as Plant U-box protein 12 or RING-type E3 ubiquitin transferase PUB12. I wanted to start with something and keep things relatively simple.

sp|Q9ZV31|PUB12_ARATH U-box domain-containing protein 12 OS=Arabidopsis thaliana OX=3702 GN=PUB12 PE=2 SV=1 MAKSEKHKLAQTLIDSINEIASISDSVTPMKKHCANLSRRLSLLLPMLEEIRDNQESSSE VVNALLSVKQSLLHAKDLLSFVSHVSKIYLVLERDQVMVKFQKVTSLLEQALSIIPYENL EISDELKEQVELVLVQLRRSLGKRGGDVYDDELYKDVLSLYSGRGSVMESDMVRRVAEKL QLMTITDLTQESLALLDMVSSSGGDDPGESFEKMSMVLKKIKDFVQTYNPNLDDAPLRLK SSLPKSRDDDRDMLIPPEEFRCPISLELMTDPVIVSSGQTYERECIKKWLEGGHLTCPKT QETLTSDIMTPNYVLRSLIAQWCESNGIEPPKRPNISQPSSKASSSSSAPDDEHNKIEEL LLKLTSQQPEDRRSAAGEIRLLAKQNNHNRVAIAASGAIPLLVNLLTISNDSRTQEHAVT SILNLSICQENKGKIVYSSGAVPGIVHVLQKGSMEARENAAATLFSLSVIDENKVTIGAA GAIPPLVTLLSEGSQRGKKDAATALFNLCIFQGNKGKAVRAGLVPVLMRLLTEPESGMVD ESLSILAILSSHPDGKSEVGAADAVPVLVDFIRSGSPRNKENSAAVLVHLCSWNQQHLIE AQKLGIMDLLIEMAENGTDRGKRKAAQLLNRFSRFNDQQKQHSGLGLEDQISLI Site: https://rest.uniprot.org/uniprotkb/Q9ZV31.fasta Base site: https://www.uniprot.org/uniprotkb/Q9ZV31/entry

3.2. Reverse Translate: Protein (amino acid) sequence to DNA (nucleotide) sequence.

Reverse Translation Tool – BCCM – GeneCorner (https://www.genecorner.ugent.be/rev_trans.html) PUB12, a plant U-box–type E3 ubiquitin ligase DNA sequence (1) >reverse translation of sp|Q9ZV31|PUB12_ARATH U-box domain-containing protein 12 OS=Arabidopsis thaliana OX=3702 GN=PUB12 PE=2 SV=1 to a 1962 base sequence of most likely codons. atggcgaaaagcgaaaaacataaactggcgcagaccctgattgatagcattaacgaaatt gcgagcattagcgatagcgtgaccccgatgaaaaaacattgcgcgaacctgagccgccgc ctgagcctgctgctgccgatgctggaagaaattcgcgataaccaggaaagcagcagcgaa gtggtgaacgcgctgctgagcgtgaaacagagcctgctgcatgcgaaagatctgctgagc tttgtgagccatgtgagcaaaatttatctggtgctggaacgcgatcaggtgatggtgaaa tttcagaaagtgaccagcctgctggaacaggcgctgagcattattccgtatgaaaacctg gaaattagcgatgaactgaaagaacaggtggaactggtgctggtgcagctgcgccgcagc ctgggcaaacgcggcggcgatgtgtatgatgatgaactgtataaagatgtgctgagcctg tatagcggccgcggcagcgtgatggaaagcgatatggtgcgccgcgtggcggaaaaactg cagctgatgaccattaccgatctgacccaggaaagcctggcgctgctggatatggtgagc agcagcggcggcgatgatccgggcgaaagctttgaaaaaatgagcatggtgctgaaaaaa attaaagattttgtgcagacctataacccgaacctggatgatgcgccgctgcgcctgaaa agcagcctgccgaaaagccgcgatgatgatcgcgatatgctgattccgccggaagaattt cgctgcccgattagcctggaactgatgaccgatccggtgattgtgagcagcggccagacc tatgaacgcgaatgcattaaaaaatggctggaaggcggccatctgacctgcccgaaaacc caggaaaccctgaccagcgatattatgaccccgaactatgtgctgcgcagcctgattgcg cagtggtgcgaaagcaacggcattgaaccgccgaaacgcccgaacattagccagccgagc agcaaagcgagcagcagcagcagcgcgccggatgatgaacataacaaaattgaagaactg ctgctgaaactgaccagccagcagccggaagatcgccgcagcgcggcgggcgaaattcgc ctgctggcgaaacagaacaaccataaccgcgtggcgattgcggcgagcggcgcgattccg ctgctggtgaacctgctgaccattagcaacgatagccgcacccaggaacatgcggtgacc agcattctgaacctgagcatttgccaggaaaacaaaggcaaaattgtgtatagcagcggc gcggtgccgggcattgtgcatgtgctgcagaaaggcagcatggaagcgcgcgaaaacgcg gcggcgaccctgtttagcctgagcgtgattgatgaaaacaaagtgaccattggcgcggcg ggcgcgattccgccgctggtgaccctgctgagcgaaggcagccagcgcggcaaaaaagat gcggcgaccgcgctgtttaacctgtgcatttttcagggcaacaaaggcaaagcggtgcgc gcgggcctggtgccggtgctgatgcgcctgctgaccgaaccggaaagcggcatggtggat gaaagcctgagcattctggcgattctgagcagccatccggatggcaaaagcgaagtgggc gcggcggatgcggtgccggtgctggtggattttattcgcagcggcagcccgcgcaacaaa gaaaacagcgcggcggtgctggtgcatctgtgcagctggaaccagcagcatctgattgaa gcgcagaaactgggcattatggatctgctgattgaaatggcggaaaacggcaccgatcgc ggcaaacgcaaagcggcgcagctgctgaaccgctttagccgctttaacgatcagcagaaa cagcatagcggcctgggcctggaagatcagattagcctgatt

(2) >reverse translation of sp|Q9ZV31|PUB12_ARATH U-box domain-containing protein 12 OS=Arabidopsis thaliana OX=3702 GN=PUB12 PE=2 SV=1 to a 1962 base sequence of consensus codons. atggcnaarwsngaraarcayaarytngcncaracnytnathgaywsnathaaygarath gcnwsnathwsngaywsngtnacnccnatgaaraarcaytgygcnaayytnwsnmgnmgn ytnwsnytnytnytnccnatgytngargarathmgngayaaycargarwsnwsnwsngar gtngtnaaygcnytnytnwsngtnaarcarwsnytnytncaygcnaargayytnytnwsn ttygtnwsncaygtnwsnaarathtayytngtnytngarmgngaycargtnatggtnaar ttycaraargtnacnwsnytnytngarcargcnytnwsnathathccntaygaraayytn garathwsngaygarytnaargarcargtngarytngtnytngtncarytnmgnmgnwsn ytnggnaarmgnggnggngaygtntaygaygaygarytntayaargaygtnytnwsnytn taywsnggnmgnggnwsngtnatggarwsngayatggtnmgnmgngtngcngaraarytn carytnatgacnathacngayytnacncargarwsnytngcnytnytngayatggtnwsn wsnwsnggnggngaygayccnggngarwsnttygaraaratgwsnatggtnytnaaraar athaargayttygtncaracntayaayccnaayytngaygaygcnccnytnmgnytnaar wsnwsnytnccnaarwsnmgngaygaygaymgngayatgytnathccnccngargartty mgntgyccnathwsnytngarytnatgacngayccngtnathgtnwsnwsnggncaracn taygarmgngartgyathaaraartggytngarggnggncayytnacntgyccnaaracn cargaracnytnacnwsngayathatgacnccnaaytaygtnytnmgnwsnytnathgcn cartggtgygarwsnaayggnathgarccnccnaarmgnccnaayathwsncarccnwsn wsnaargcnwsnwsnwsnwsnwsngcnccngaygaygarcayaayaarathgargarytn ytnytnaarytnacnwsncarcarccngargaymgnmgnwsngcngcnggngarathmgn ytnytngcnaarcaraayaaycayaaymgngtngcnathgcngcnwsnggngcnathccn ytnytngtnaayytnytnacnathwsnaaygaywsnmgnacncargarcaygcngtnacn wsnathytnaayytnwsnathtgycargaraayaarggnaarathgtntaywsnwsnggn gcngtnccnggnathgtncaygtnytncaraarggnwsnatggargcnmgngaraaygcn gcngcnacnytnttywsnytnwsngtnathgaygaraayaargtnacnathggngcngcn ggngcnathccnccnytngtnacnytnytnwsngarggnwsncarmgnggnaaraargay gcngcnacngcnytnttyaayytntgyathttycarggnaayaarggnaargcngtnmgn gcnggnytngtnccngtnytnatgmgnytnytnacngarccngarwsnggnatggtngay garwsnytnwsnathytngcnathytnwsnwsncayccngayggnaarwsngargtnggn gcngcngaygcngtnccngtnytngtngayttyathmgnwsnggnwsnccnmgnaayaar garaaywsngcngcngtnytngtncayytntgywsntggaaycarcarcayytnathgar gcncaraarytnggnathatggayytnytnathgaratggcngaraayggnacngaymgn ggnaarmgnaargcngcncarytnytnaaymgnttywsnmgnttyaaygaycarcaraar carcaywsnggnytnggnytngargaycarathwsnytnath

3.3. Codon optimization. Once a nucleotide sequence of your protein is determined, you need to codon optimize your sequence. You may, once again, utilize google for a “codon optimization tool”. In your own words, describe why you need to optimize codon usage.

Optimizing codon usage can help account for host bias as well as reduce the pool of downstream errors to account for, as well as save materials and time.

Which organism have you chosen to optimize the codon sequence for and why?

Which: Arabidopsis thaliana
Why: I wanted to start somewhere, and this organism is familiar.
Optimization Tool Used: Vector Builder (https://en.vectorbuilder.com/tool/codon-optimization.html ) Chose to use the first sequence of the two. Enzyme recognition sites avoided included BsaI and BbsI.

Resulting Sequence is as follows: PUB12, a plant U-box–type E3 ubiquitin ligase DNA sequence ATGGCTAAATCTGAGAAGCATAAGTTGGCTCAGACTCTCATCGATTCTATAAATGAAATTGCTTCTATCTCAGATTCAGTTACTCCAATGAAGAAGCATTGTGCAAATTTGTCTAGGAGATTGTCACTTCTTCTTCCAATGTTGGAAGAGATTAGAGATAATCAAGAGTCTAGCTCTGAAGTCGTGAACGCTTTGCTCTCAGTTAAACAATCTTTATTACATGCTAAGGATCTCTTGTCTTTCGTCAGTCATGTGAGCAAGATATATCTTGTTCTTGAGAGAGATCAAGTGATGGTTAAGTTTCAAAAAGTTACTAGCCTTCTTGAGCAAGCTCTTTCTATAATCCCTTATGAGAATCTTGAAATTTCTGATGAATTGAAAGAACAAGTTGAATTGGTTCTTGTTCAACTTAGAAGATCTTTGGGTAAGAGAGGTGGTGATGTTTACGATGATGAACTTTATAAGGATGTTCTTTCACTTTACAGTGGAAGAGGATCAGTTATGGAGAGTGATATGGTTCGTCGAGTTGCCGAGAAATTGCAACTAATGACTATCACTGATTTGACACAAGAGTCTCTTGCTCTTCTTGATATGGTTTCTTCTAGTGGTGGAGATGATCCTGGAGAGTCATTCGAAAAGATGTCTATGGTTCTTAAGAAAATTAAGGATTTCGTTCAAACCTATAATCCTAACCTAGATGACGCTCCCCTTAGACTTAAATCATCATTGCCTAAATCGAGAGATGATGATCGTGATATGCTTATTCCACCTGAAGAATTCCGTTGTCCTATTTCGCTTGAGCTTATGACTGATCCTGTAATCGTTTCTTCAGGTCAAACCTATGAAAGAGAGTGTATTAAGAAGTGGCTTGAAGGAGGACATTTGACATGTCCTAAGACTCAAGAAACTTTGACATCTGATATCATGACCCCTAATTATGTTCTTAGATCTTTGATCGCTCAATGGTGTGAGTCGAATGGAATCGAGCCTCCAAAGAGGCCAAACATAAGTCAGCCTTCTAGTAAGGCTTCTTCATCATCTAGTGCTCCTGATGACGAACATAATAAGATCGAAGAATTGCTCTTGAAGTTGACTTCTCAGCAACCTGAAGATAGAAGATCCGCTGCTGGAGAGATCAGACTTTTGGCCAAACAAAACAACCATAACAGAGTTGCTATCGCTGCTTCAGGAGCTATTCCACTCTTGGTGAACCTTTTGACTATCTCAAACGATTCCAGAACACAAGAGCATGCTGTTACGTCTATCCTCAACCTTTCTATCTGCCAAGAAAATAAAGGTAAGATCGTTTATTCTAGTGGTGCAGTGCCTGGTATTGTTCATGTTTTGCAGAAGGGATCAATGGAGGCTAGAGAAAACGCTGCTGCTACTCTTTTCTCTCTTTCCGTTATAGATGAGAATAAGGTTACTATTGGAGCTGCTGGAGCAATTCCACCTTTGGTTACTCTCCTTTCTGAAGGATCACAGCGTGGAAAGAAGGATGCTGCTACTGCACTCTTCAACCTTTGTATCTTTCAGGGTAATAAAGGTAAGGCAGTTAGAGCAGGACTTGTGCCTGTGCTTATGAGGCTTTTGACTGAACCTGAATCTGGAATGGTTGATGAGAGCCTTTCTATTCTTGCTATTCTTTCTTCTCATCCAGACGGAAAGTCTGAAGTTGGAGCTGCTGATGCAGTTCCTGTTCTTGTTGATTTCATCAGATCTGGATCTCCTAGAAATAAGGAGAATTCTGCTGCAGTTCTTGTTCACTTGTGTTCATGGAATCAACAACATCTTATCGAAGCACAGAAGCTTGGAATCATGGATCTTCTCATCGAGATGGCTGAAAACGGAACTGATCGTGGTAAGAGAAAGGCCGCACAATTGCTTAATAGATTTTCTAGATTTAACGATCAGCAGAAGCAACACAGTGGTCTTGGTCTTGAAGATCAAATTTCATTGATT Below is a printout contrasting the two:

3.4. You have a sequence! Now what?

Recombinant DNA technologies could be utilized to make this protein from the DNA (Cell-free or Cell-dependent). Respectively they involve either special mix that can take place in a test tube or through using a live cell’s machinery to produce the protein.

Part 4: Prepare a Twist DNA Synthesis Order

Build Your DNA Insert Sequence

I prepared the above improved sequence as a test order. Below is the initial step through the creation of the DNA/RNA Sequence in Benchling with a Linear Topology.

It was annotated below as such within Benchling before a linear map and file was constructed that could be uploaded to Twist Bio.

Start Codon: ATG Coding Sequence: ATGGCTAAATCTGAGAAGCATAAGTTGGCTCAGACTCTCATCGATTCTATAAATGAAATTGCTTCTATCTCAGATTCAGTTACTCCAATGAAGAAGCATTGTGCAAATTTGTCTAGGAGATTGTCACTTCTTCTTCCAATGTTGGAAGAGATTAGAGATAATCAAGAGTCTAGCTCTGAAGTCGTGAACGCTTTGCTCTCAGTTAAACAATCTTTATTACATGCTAAGGATCTCTTGTCTTTCGTCAGTCATGTGAGCAAGATATATCTTGTTCTTGAGAGAGATCAAGTGATGGTTAAGTTTCAAAAAGTTACTAGCCTTCTTGAGCAAGCTCTTTCTATAATCCCTTATGAGAATCTTGAAATTTCTGATGAATTGAAAGAACAAGTTGAATTGGTTCTTGTTCAACTTAGAAGATCTTTGGGTAAGAGAGGTGGTGATGTTTACGATGATGAACTTTATAAGGATGTTCTTTCACTTTACAGTGGAAGAGGATCAGTTATGGAGAGTGATATGGTTCGTCGAGTTGCCGAGAAATTGCAACTAATGACTATCACTGATTTGACACAAGAGTCTCTTGCTCTTCTTGATATGGTTTCTTCTAGTGGTGGAGATGATCCTGGAGAGTCATTCGAAAAGATGTCTATGGTTCTTAAGAAAATTAAGGATTTCGTTCAAACCTATAATCCTAACCTAGATGACGCTCCCCTTAGACTTAAATCATCATTGCCTAAATCGAGAGATGATGATCGTGATATGCTTATTCCACCTGAAGAATTCCGTTGTCCTATTTCGCTTGAGCTTATGACTGATCCTGTAATCGTTTCTTCAGGTCAAACCTATGAAAGAGAGTGTATTAAGAAGTGGCTTGAAGGAGGACATTTGACATGTCCTAAGACTCAAGAAACTTTGACATCTGATATCATGACCCCTAATTATGTTCTTAGATCTTTGATCGCTCAATGGTGTGAGTCGAATGGAATCGAGCCTCCAAAGAGGCCAAACATAAGTCAGCCTTCTAGTAAGGCTTCTTCATCATCTAGTGCTCCTGATGACGAACATAATAAGATCGAAGAATTGCTCTTGAAGTTGACTTCTCAGCAACCTGAAGATAGAAGATCCGCTGCTGGAGAGATCAGACTTTTGGCCAAACAAAACAACCATAACAGAGTTGCTATCGCTGCTTCAGGAGCTATTCCACTCTTGGTGAACCTTTTGACTATCTCAAACGATTCCAGAACACAAGAGCATGCTGTTACGTCTATCCTCAACCTTTCTATCTGCCAAGAAAATAAAGGTAAGATCGTTTATTCTAGTGGTGCAGTGCCTGGTATTGTTCATGTTTTGCAGAAGGGATCAATGGAGGCTAGAGAAAACGCTGCTGCTACTCTTTTCTCTCTTTCCGTTATAGATGAGAATAAGGTTACTATTGGAGCTGCTGGAGCAATTCCACCTTTGGTTACTCTCCTTTCTGAAGGATCACAGCGTGGAAAGAAGGATGCTGCTACTGCACTCTTCAACCTTTGTATCTTTCAGGGTAATAAAGGTAAGGCAGTTAGAGCAGGACTTGTGCCTGTGCTTATGAGGCTTTTGACTGAACCTGAATCTGGAATGGTTGATGAGAGCCTTTCTATTCTTGCTATTCTTTCTTCTCATCCAGACGGAAAGTCTGAAGTTGGAGCTGCTGATGCAGTTCCTGTTCTTGTTGATTTCATCAGATCTGGATCTCCTAGAAATAAGGAGAATTCTGCTGCAGTTCTTGTTCACTTGTGTTCATGGAATCAACAACATCTTATCGAAGCACAGAAGCTTGGAATCATGGATCTTCTCATCGAGATGGCTGAAAACGGAACTGATCGTGGTAAGAGAAAGGCCGCACAATTGCTTAATAGATTTTCTAG Stop Codon: TAG

Linear Map:

Here’s an example of what you just annotated in Benchling: Sequence Import and Quote Obtained

The pTwist Amp High Copy: pTwist Amp Vector was chosen after the Clonal Gene Choice was pursued. The quote is to the left.. The annotated sequence page from TwistBio from which a GenBank construct file was downloaded is to the right.

The construct was imported into Benchling to yield the plasmid below.

Part 5: DNA Read/Write/Edit

5.1 DNA Read

(i) What DNA would you want to sequence (e.g., read) and why?

I might want to sequence large viruses, particularly out of sheer curiosity for the instructions that allow them to exist in their current forms.

Sources of interest:

Piacente, F., De Castro, C., Jeudy, S., Molinaro, A., Salis, A., Damonte, G., Bernardi, C., Abergel, C. and Tonetti, M.G., 2014. Giant virus Megavirus chilensis encodes the biosynthetic pathway for uncommon acetamido sugars. Journal of Biological Chemistry, 289(35), pp.24428-24439.
Legendre, M., Arslan, D., Abergel, C. and Claverie, J.M., 2012. Genomics of Megavirus and the elusive fourth domain of Life. Communicative & integrative biology, 5(1), pp.102-106.
Arslan, D., Legendre, M., Seltzer, V., Abergel, C. and Claverie, J.M., 2011. Distant Mimivirus relative with a larger genome highlights the fundamental features of Megaviridae. Proceedings of the National Academy of Sciences, 108(42), pp.17486-17491.

(ii) In lecture, a variety of sequencing technologies were mentioned. What technology or technologies would you use to perform sequencing on your DNA and why?

I would borrow from the methods used in the aforementioned literature, particularly “454-titanium and Illumina HiSeq approaches”. These methods appear adequate. Lack of a priori knowledge of the genome or genomic features not being required is helpful, in addition to single-nucleotide resolution, higher dynamic range, and less DNA/RNA needed.

https://www.ebi.ac.uk/training/online/courses/functional-genomics-ii-common-technologies-and-data-analysis-methods/next-generation-sequencing/

Also answer the following questions: Is your method first-, second- or third-generation or other? How so?

Second-generation. They engage massively parallel sequencing.

What is your input? How do you prepare your input (e.g. fragmentation, adapter ligation, PCR)? List the essential steps.

Library Preparation (fragmenting of DNA and adapters added to both ends of DNA for amplification)
Sequencing
Data Analysis and Cleanup

What are the essential steps of your chosen sequencing technology, how does it decode the bases of your DNA sample (base calling)?

Simplified:

The DNA strand gets color coded with fluorescent terminators
Images are taken of the flow cell after each letter is added
Software determines the bases based on color intensities
Calls are made, corrections are issued, and output is cleaned up

Source of interest: https://genohub.com/bioinformatics/10/base-calling

What is the output of your chosen sequencing technology?

Numerous sequence reads

5.2 DNA Write (i) What DNA would you want to synthesize (e.g., write) and why?

I am not sure yet, but I am leaning towards DNA origami art to experiment with the medium and explore versatility of applications.

Sources of interest:

Bush, J., Singh, S., Vargas, M., Oktay, E., Hu, C.H. and Veneziano, R., 2020. Synthesis of DNA origami scaffolds: Current and emerging strategies. Molecules, 25(15), p.3386.
Weck, J.M. and Heuer-Jungemann, A., 2025. Fully addressable designer superstructures assembled from one single modular DNA origami. Nature communications, 16(1), p.1556.
DNA origami by Paul W. K. Rothemund, California Institute of Technology, 2004. 100 nanometers in diameter. (ii) What technology or technologies would you use to perform this DNA synthesis and why?
For validating the structures, if cost didn’t matter, I would consider using next generation sequencing (Illumina, for both sequences of the staples and scaffold) and Atomic Force Microscopy (Visual, especially confirming folds)

Also answer the following questions:

The essential steps of the chosen sequencing methods would be:

Library Prep (DNA Fragmentation and Adapter ligation methods)
Cluster Generation via amplification
Sequencing and base calling

What are the limitations of your sequencing method (if any) in terms of speed, accuracy, scalability?

The major limitations would be cost, error rates, and short read lengths. This would not be efficient to scale as is.

Improvements would involve: -Hand design of patterns (for ideating improvements) -Computer design and optimization of material usage -Production of material and strand-routing precision

5.3 DNA Edit (i) What DNA would you want to edit and why?

DNA edits that I would like to perform would be those that allow for the minimization and or elimination of metabolic disease states. The why comes down to the quality-of-life improvements for all involved.

(ii) What technology or technologies would you use to perform these DNA edits and why?

I can see myself engaging CRISPR-Cas9 and base editing for their precision. Of the two, especially the latter, in order to reduce off-target based effects.

Also answer the following questions:

How does your technology of choice edit DNA? What are the essential steps?

It uses guide RNA to direct modified Cas proteins to DNA sequences and convert nucleotides.

The major steps woud involve (i) Designing guide RNA that compliment the target gene(s); (ii) deliver said RNA and its editor protein to the cells of interest (iii) waiting as the guide RNA mediated editor binds to the target DNA site (iv) waiting as the deaminase converts targeted base paits (v) waiting as the edits are incoporated (vi) assaying for confirmation via suquencing and functional assay applications

Ethics meditations no doubt should accompany each step.

What preparation do you need to do (e.g. design steps) and what is the input (e.g. DNA template, enzymes, plasmids, primers, guides, cells) for the editing?

Preparation involves (i) identifying seuquences associated with disease states in select organisms, (ii) computationally designing guide RNAs to target them, (iii) computationally optimizing said guide RNAs and selecting a properly paired base editing system (iv) modeling the application of guide RNAs and base editor asystem applyed to DNA sequences highly associated with disease states in the bodies of select organisms (v) validating said results and modifying as needed (vi) developing in vitro cellular models capable of testing editing efficiency, safety, and functional outcomes (vii) Organoid and higher models may follow

What are the limitations of your editing methods (if any) in terms of efficiency or precision?

Avoiding off-target edits is not a given. Validation and optimization steps remain necessary.

Week 3 HW: Lab Automation

This lab, we were tasked with creating a design that could be generated by an OpenTrons Liquid Handling Robot.

Assignment: Python Script for Opentrons Artwork

Review this week’s recitation and this week’s lab for details on the Opentrons and programming it.

Done.

Generate an artistic design using the GUI at opentrons-art.rcdonovan.com.

As a remote participant, I prototyped a design using the GUI at opentrons-art.rcdonovan.com.

This resulted in a layered plus symbol shown below.

Using the coordinates from the GUI, follow the instructions in the HTGAA26 Opentrons Colab to write your own Python script which draws your design using the Opentrons.

You may use AI assistance for this coding — Google Gemini is integrated into Colab (see the stylized star bottom center); it will do a good job writing functional Python, while you probably need to take charge of the art concept.
If you’re a proficient programmer and you’d rather code something mathematical or algorithmic instead of using your GUI coordinates, you may do that instead.

Acknowledged

The coordinates for generating such can be found here, courtesy of RC Donovan’s tool:

sfgfp_points = [(-2.2, 6.6),(0, 6.6),(2.2, 6.6),(-2.2, 4.4),(2.2, 4.4),(-6.6, 2.2),(-4.4, 2.2),(-2.2, 2.2),(2.2, 2.2),(4.4, 2.2),(6.6, 2.2),(-6.6, 0),(6.6, 0),(-6.6, -2.2),(-4.4, -2.2),(-2.2, -2.2),(2.2, -2.2),(4.4, -2.2),(6.6, -2.2),(-2.2, -4.4),(2.2, -4.4),(-2.2, -6.6),(0, -6.6),(2.2, -6.6)] electra2_points = [(0, 4.4),(0, 2.2),(-4.4, 0),(-2.2, 0),(0, 0),(2.2, 0),(4.4, 0),(0, -2.2),(0, -4.4)] mrfp1_points = [(-4.4, 8.8),(-2.2, 8.8),(0, 8.8),(2.2, 8.8),(4.4, 8.8),(-4.4, 6.6),(4.4, 6.6),(-8.8, 4.4),(-6.6, 4.4),(-4.4, 4.4),(4.4, 4.4),(6.6, 4.4),(8.8, 4.4),(-8.8, 2.2),(8.8, 2.2),(-8.8, 0),(8.8, 0),(-8.8, -2.2),(8.8, -2.2),(-8.8, -4.4),(-6.6, -4.4),(-4.4, -4.4),(4.4, -4.4),(6.6, -4.4),(8.8, -4.4),(-4.4, -6.6),(4.4, -6.6),(-4.4, -8.8),(-2.2, -8.8),(0, -8.8),(2.2, -8.8),(4.4, -8.8)]

If the Python component is proving too problematic even with AI and human assistance, download the full Python script from the GUI website and submit that:

Not needed, but appreciated.

If you use AI to help complete this homework or lab, document how you used AI and which models made contributions.

For my node, in order to work with their printer which had two colors, a modified version was created. Gemini within was tested and employed to deliver the following result.

Sign up for a robot time slot if you are at MIT/Harvard/Wellesley or at a Node offering Opentrons automation. The Python script you created will be run on the robot to produce your work of art!

At MIT/Harvard? Lab times are on Thursday Feb.19 between 10AM and 6PM.
At other Nodes? Please coordinate with your Node.

I was added to the William and Mary Node. I coordinated with Margaret and Kate for OpenTrons code submission. My code was submitted to Kate and who was then able to faciliate the printing of my design. Please see below.

Submit your Python file via this form.

DONE.

Post-LAb Questions:

Find and describe a published paper that utilizes the Opentrons or an automation tool to achieve novel biological applications.

DeRoo, J.B., Jones, A.A., Slaughter, C.K., Ahr, T.W., Stroup, S.M., Thompson, G.B. and Snow, C.D., 2025. Automation of protein crystallization scaleup via Opentrons-2 liquid handling. SLAS technology, 32, p.100268.

https://doi.org/10.1016/j.slast.2025.100268

This work describes an approach by which an Opentrons-2 liquid handling robot was used for automating sitting drop protein crystallization trials. This ability also improve comparability of products produced, improving studies that depend on their proper manufacture. An important detail is how the Opentrons-2 can prove a cost-effective option for laboratory operations. For example, at the time of writing, the Opentrons-2 can be purchased for around 13.5K USD vs that of a Gryphon machine at around 65K USD.

Write a description about what you intend to do with automation tools for your final project.

I’m still forming my thoughts about how I want to effectively use automation tools for my final project.

So far, I am interested in branching off from example #2 given in the Homework and the above example, regarding screening an array of designed biosensor constructs.

One idea had in mind was towards a digital tracing project that revolves around said constructs used to track known entities.

Simply, products are given a unique ID with stored parameters. These are linked within a automation run so that each product is trackable as they are processed. One application that is probably already in use but would be fun to adapt towards something not already applied would be swappable combined wearable crystallized biosensors that are traded in daily for workers that are liable to be exposed to a particular organism and pollutant pairs.

I could use an Echo for transfer of nano-scale components. The Bravo or Opentrons-2 could be used for precise, automated pipetting ,especially of the crystals. The multiflow would be used to dispense the larger scale volume components. The PlateLoc would be helful for sealing the plates. The inheco could be used for controlled incubation. The Xpeel would be used for careful desealing of the plates. Finally, the PHERAstar could be used for reading fluorescence outputs.

Still developing this out from this branch.

Final Project Ideas

Done. My intitial Project Ideas were added:

The initial candidates were:

Project Name: Aptly Aptamer-Based Sensor for Endocrine Disruptors

Problem: Endocrine-disrupting chemicals can persist in water at concentrations that are difficult to monitor in real time. Hypothesis: If we engineer high-affinity aptamers that selectively bind hormone-mimicking pollutants, we can detect these contaminants at environmentally relevant levels. Solution: Develop a portable biosensor that binds engineered aptamers to a measurable fluorescent or electrochemical signal for field-based detection.

Real-world Literature and Examples of Problem:

Bertram, M.G., Gore, A.C., Tyler, C.R. and Brodin, T., 2022. Endocrine-disrupting chemicals. Current Biology, 32(13), pp.R727-R730.

Pironti, C., Ricciardi, M., Proto, A., Bianco, P.M., Montano, L. and Motta, O., 2021. Endocrine-disrupting compounds: An overview on their occurrence in the aquatic environment and human exposure. Water, 13(10), p.1347.

https://www.usgs.gov/programs/environmental-health-program/science/long-term-study-finds-endocrine-disrupting-chemicals

Project Name: NF-Lamp Lighter Deployable LAMP-Based Sensor for Naegleria fowleri

Problem: Testing for Naegleria fowleri currently requires sending water samples to a lab, which delays results and limits real-time monitoring. Hypothesis: If we combine on-site water filtration with LAMP DNA amplification, we can detect N. fowleri quickly without a full laboratory. Solution: Build a deployable mini-station that filters water, runs a LAMP test in a sealed cartridge, and sends a simple detection alert to public health officials.

Real-world Literature and Examples of Problem:

Grace, E., Asbill, S. and Virga, K., 2015. Naegleria fowleri: pathogenesis, diagnosis, and treatment options. Antimicrobial agents and chemotherapy, 59(11), pp.6677-6681.

Maciver, S.K., Piñero, J.E. and Lorenzo-Morales, J., 2020. Is Naegleria fowleri an emerging parasite?. Trends in parasitology, 36(1), pp.19-28.

https://www.cdc.gov/naegleria/about/index.html

Project Name: LP Alert Deployable Biosensor for Legionella pneumophila in Cooling Towers

Problem: Cooling towers can grow Legionella pneumophila, and detection could be made more quickly. Hypothesis: If we combine on-site DNA amplification with CRISPR detection and an electrochemical readout, we can quickly and accurately detect L. pneumophila and avoid having to send samples to a lab. Solution: Build a deployable unit that samples cooling tower water, amplifies L. pneumophila DNA, converts detection into an electrical signal, and sends an alert to facility managers.

Real-world Literature and Examples of Problem:

Wéry, N., Bru-Adan, V., Minervini, C., Delgénes, J.P., Garrelly, L. and Godon, J.J., 2008. Dynamics of Legionella spp. and bacterial populations during the proliferation of L. pneumophila in a cooling tower facility. Applied and environmental microbiology, 74(10), pp.3030-3037.

Brigmon, R.L., Turick, C.E., Knox, A.S. and Burckhalter, C.E., 2020. The impact of storms on Legionella pneumophila in cooling tower water, implications for human health. Frontiers in Microbiology, 11, p.543589.

https://www.cdc.gov/control-legionella/php/toolkit/cooling-towers-module.html

Week 4 HW: Protein Design Part I

Part A: Conceptual Questions

We were allowed answer 9 out of 11 of the following questions:

How many molecules of amino acids do you take with a piece of 500 grams of meat? (on average an amino acid is ~100 Daltons)

I expect answers to vary since meat composition vary.

Water composition of meat is estimated to be about 65-80% meat. Some estimates have proteins make up around 1/5 to less than 1/3 of overall muscle tissue.

We’ll work with 20% and 30% for upper and lower bounds for simplicity.

Assuming that 100 Daltons roughly equals 100 g/mo

Lower: 500 g × 0.2 = 100 g protein = 1 mol. 6.02 × 10²³ molecules

Upper: 500 g × 0.3 = 150 g protein = 1.5 mol. ~9 × 10²³ molecules

Quick Sources: https://redtablemeats.com/fresh-meat/beef/what-percentage-of-beef-is-protein/ https://agriculture.institute/fresh-meat-technology/water-content-in-meat-importance/

For simplicity, let’s work with Taking the average, 500 grams of meat may yield

Why do humans eat beef but do not become a cow, eat fish but do not become fish?

These products are broken down through digestion and recycled. Our genetic material largely informs the building process of new material.

Why are there only 20 natural amino acids?

I’m not convinced that those are the only ones, in so far of what we have observed on Earth it is likely that the developmental “cost” as per our development over time was too high. 20 reflects optimization, no?

Can you make other non-natural amino acids? Design some new amino acids.

Yes. These can be designed through new side chains, although stability and inclusion may be a problem.

Where did amino acids come from before enzymes that make them, and before life started?

Abiotic processes. Some sources list thermal vents and delivery through meteorites.

Ehrenfreund, P., Bernstein, M.P., Dworkin, J.P., Sandford, S.A. and Allamandola, L.J., 2001. The photostability of amino acids in space. The Astrophysical Journal Letters, 550(1), pp.L95-L99.

Kobayashi, K., Mita, H., Kebukawa, Y., Nakagawa, K., Kaneko, T., Obayashi, Y., Sato, T., Yokoo, T., Minematsu, S., Fukuda, H. and Oguri, Y., 2021. Space exposure of amino acids and their precursors during the Tanpopo Mission. Astrobiology, 21(12), pp.1479-1493.

If you make an α-helix using D-amino acids, what handedness (right or left) would you expect?

Left-handed

Can you discover additional helices in proteins?

Yes

Why are most molecular helices right-handed?

The stereochemistry of L-amino acids constrains backbone geometry in a way that right-handed helices are lower in energy. Right-handed chirality is favored and chirality from the bottom influences chirality at higher structures.

Why do β-sheets tend to aggregate? What is the driving force for β-sheet aggregation?

[Skipped]

Why do many amyloid diseases form β-sheets?

β-Sheet structures are stable and can easy to produce from misfolding.

Can you use amyloid β-sheets as materials? Design a β-sheet motif that forms a well-ordered structure.

[Skipped]

Part B: Protein Analysis and Visualization

The protein that I selected is U-box domain-containing protein 12.Possesses E3 ubiquitin-protein ligase in vitro. I was not too particular on its selection and wanted to start somewhere.

Source: https://www.uniprot.org/uniprotkb/Q5VRH9/entry

Oryza sativa subsp. japonica (Rice)

Amino Acid sequence:

sp|Q5VRH9|PUB12_ORYSJ U-box domain-containing protein 12 OS=Oryza sativa subsp. japonica OX=39947 GN=PUB12 PE=1 SV=1 MPKRVADEIAALPEPRGPLRRPCADLSRRVRLLAPLLDHLPASSSSSSSTPLADALGAAR DLLRKTRDGSKIDQAMRGDAFLDEFAGVNRQIHLALDALPYNTFHMPQEVQEQVALVHSQ FQRASTRTDPPDTQLSMDLAWALTDNPSDPALLTRISHKLQLHTMADMKNESIALHNMVI STAGEPDGCVDQMSSLLKKLKDCVVTEDHANDALTTRSASIKHRSPIIPDEFRCPISLEL MQDPVIVSSGQTYERSCIQKWLDSGHKTCPKTQQPLSHTSLTPNFVLKSLISQWCEANGI ELPKNKQNSRDKKAAKSSDYDHAGLVSLMNRLRSGNQDEQRAAAGEIRLLAKRNVNNRIC IAEAGAIPLLVNLLSSSDPRTQEHAVTALLNLSIHENNKASIVDSHAIPKIVEVLKTGSM ETRENAAATLFSLSVVDENKVTIGAAGAIPPLINLLCDGSPRGKKDAATAIFNLCIYQGN KVRAVKAGIVIHLMNFLVDPTGGMIDEALSLLSILAGNPEGKIVIARSEPIPPLVEVIKT GSPRNRENAAAILWLLCSADTEQTLAAKAAGVEDALKELSETGTDRAKRKASSILELMHQ ANEDSLKGNGH

The length of the protein is: 611 aminoacids. The most common amino acid is: L, which appears 72 times.

This protein does not share 100% homology with any other although there are many with 50-90% homology, per UniProt. Via the Blast Tool, a lot of homologs were found.

Query= EMBOSS_001

Length=611 Score E Sequences producing significant alignments: (Bits) Value

SP:Q5VRH9 PUB12_ORYSJ U-box domain-containing protein 12 OS=Oryza … 1199 0.0 TR:I1PYP8 I1PYP8_ORYGL U-box domain-containing protein 12 OS=Oryza… 1194 0.0 TR:A0A0D3GBR7 A0A0D3GBR7_9ORYZ RING-type E3 ubiquitin transferase … 1193 0.0 TR:A0A0E0PSC1 A0A0E0PSC1_ORYRU U-box domain-containing protein 12 … 1193 0.0 TR:A0A0E0HJJ2 A0A0E0HJJ2_ORYNI U-box domain-containing protein 12 … 1192 0.0 TR:A0A0E0A3S8 A0A0E0A3S8_9ORYZ U-box domain-containing protein 12 … 1182 0.0 SP:Q5VRH9-2 PUB12_ORYSJ Isoform 2 of U-box domain-containing prote… 1182 0.0 TR:A0A0D3GBR8 A0A0D3GBR8_9ORYZ RING-type E3 ubiquitin transferase … 1176 0.0 TR:A0A0E0PSC3 A0A0E0PSC3_ORYRU U-box domain-containing protein 12 … 1176 0.0 TR:A0A0E0HJJ3 A0A0E0HJJ3_ORYNI U-box domain-containing protein 12 … 1175 0.0 TR:A2Y893 A2Y893_ORYSI U-box domain-containing protein 12 OS=Oryza… 1167 0.0 TR:A0A0E0A3T5 A0A0E0A3T5_9ORYZ U-box domain-containing protein 12 … 1165 0.0 TR:A0A0E0DVD9 A0A0E0DVD9_9ORYZ U-box domain-containing protein 12 … 1165 0.0 TR:A0A0E0L6Q0 A0A0E0L6Q0_ORYPU U-box domain-containing protein 12 … 1152 0.0 TR:A0A6G1DGM2 A0A6G1DGM2_9ORYZ RING-type E3 ubiquitin transferase … 1128 0.0 TR:A0A0D9WKQ8 A0A0D9WKQ8_9ORYZ RING-type E3 ubiquitin transferase … 1102 0.0 TR:A0A0E0A3T4 A0A0E0A3T4_9ORYZ RING-type E3 ubiquitin transferase … 1009 0.0 TR:A0A5J9SK41 A0A5J9SK41_9POAL RING-type E3 ubiquitin transferase … 1005 0.0 TR:A0A2T7E463 A0A2T7E463_9POAL RING-type E3 ubiquitin transferase … 1000 0.0 TR:A0ABC9E206 A0ABC9E206_9POAL RING-type E3 ubiquitin transferase … 999 0.0 TR:A0A3L6RWE2 A0A3L6RWE2_PANMI RING-type E3 ubiquitin transferase … 999 0.0 TR:A0ABC9DBY8 A0ABC9DBY8_9POAL RING-type E3 ubiquitin transferase … 993 0.0 TR:A0A8T0T5H3 A0A8T0T5H3_PANVG RING-type E3 ubiquitin transferase … 993 0.0 TR:A0ABC9DW43 A0ABC9DW43_9POAL RING-type E3 ubiquitin transferase … 991 0.0 TR:A0A1E5WM86 A0A1E5WM86_9POAL RING-type E3 ubiquitin transferase … 991 0.0 TR:A0A811SD54 A0A811SD54_9POAL RING-type E3 ubiquitin transferase … 989 0.0 TR:A0A8T0TI26 A0A8T0TI26_PANVG RING-type E3 ubiquitin transferase … 988 0.0 TR:A0A811S773 A0A811S773_9POAL RING-type E3 ubiquitin transferase … 987 0.0 TR:A0A835KU84 A0A835KU84_9POAL RING-type E3 ubiquitin transferase … 986 0.0 TR:A0AAV5CS17 A0AAV5CS17_ELECO U-box domain-containing protein 12 … 986 0.0 TR:J3MAH2 J3MAH2_ORYBR U-box domain-containing protein 12 OS=Oryza… 985 0.0 TR:C5Z221 C5Z221_SORBI RING-type E3 ubiquitin transferase OS=Sorgh… 984 0.0 TR:A0A835FWE8 A0A835FWE8_9POAL RING-type E3 ubiquitin transferase … 983 0.0 TR:I1H245 I1H245_BRADI U-box domain-containing protein 12 OS=Brach… 983 0.0 TR:A0A3L6PH60 A0A3L6PH60_PANMI RING-type E3 ubiquitin transferase … 982 0.0 TR:A0A4U6UXZ9 A0A4U6UXZ9_SETVI U-box domain-containing protein 12 … 981 0.0 TR:B6SXY9 B6SXY9_MAIZE RING-type E3 ubiquitin transferase OS=Zea m… 964 0.0 TR:A0ACD5Y0V4 A0ACD5Y0V4_AVESA Uncharacterized protein OS=Avena sa… 957 0.0 TR:A0A3B6THH5 A0A3B6THH5_WHEAT RING-type E3 ubiquitin transferase … 956 0.0 TR:A0A8R7RDC8 A0A8R7RDC8_TRIUA RING-type E3 ubiquitin transferase … 955 0.0 TR:A0ACD6A429 A0ACD6A429_AVESA Uncharacterized protein OS=Avena sa… 950 0.0 TR:A0A8I6Y9M9 A0A8I6Y9M9_HORVV RING-type E3 ubiquitin transferase … 942 0.0 TR:A0A3B6I3Q2 A0A3B6I3Q2_WHEAT U-box domain-containing protein 12 … 917 0.0 TR:A0AAV5F2R6 A0AAV5F2R6_ELECO U-box domain-containing protein 12 … 914 0.0 TR:K3XW51 K3XW51_SETIT U-box domain-containing protein 12 OS=Setar… 905 0.0 TR:A0A0E0L6Q1 A0A0E0L6Q1_ORYPU U-box domain-containing protein 12 … 900 0.0 TR:A0A452XGN3 A0A452XGN3_AEGTS RING-type E3 ubiquitin transferase … 888 0.0 TR:A0A3B6R6B6 A0A3B6R6B6_WHEAT RING-type E3 ubiquitin transferase … 868 0.0 TR:A0A9R0Z145 A0A9R0Z145_TRITD RING-type E3 ubiquitin transferase … 866 0.0 TR:A0ACD5ZS62 A0ACD5ZS62_AVESA Uncharacterized protein OS=Avena sa… 843 0.0 TR:A0A452XGJ2 A0A452XGJ2_AEGTS RING-type E3 ubiquitin transferase … 836 0.0 TR:A0A0E0DVD6 A0A0E0DVD6_9ORYZ RING-type E3 ubiquitin transferase … 815 0.0 TR:A0A6I9S1S1 A0A6I9S1S1_ELAGV RING-type E3 ubiquitin transferase … 795 0.0 TR:A0A8B7C1E5 A0A8B7C1E5_PHODC RING-type E3 ubiquitin transferase … 786 0.0 TR:A0AAV8GY30 A0AAV8GY30_9POAL RING-type E3 ubiquitin transferase … 785 0.0 TR:A0AAV8EMN4 A0AAV8EMN4_9POAL RING-type E3 ubiquitin transferase … 783 0.0 TR:A0A8K0IKK4 A0A8K0IKK4_COCNU RING-type E3 ubiquitin transferase … 782 0.0 TR:A0AAV8F803 A0AAV8F803_9POAL RING-type E3 ubiquitin transferase … 781 0.0 TR:A0A8K0N659 A0A8K0N659_COCNU RING-type E3 ubiquitin transferase … 780 0.0 TR:A0A6I9SJB7 A0A6I9SJB7_ELAGV RING-type E3 ubiquitin transferase … 779 0.0 TR:A0A8B8Z9X0 A0A8B8Z9X0_PHODC U-box domain-containing protein 12 … 778 0.0 TR:A0A9Q0CB42 A0A9Q0CB42_9POAL RING-type E3 ubiquitin transferase … 777 0.0 TR:A0AAD5WDJ8 A0AAD5WDJ8_9POAL RING-type E3 ubiquitin transferase … 776 0.0 TR:A0A452XGQ0 A0A452XGQ0_AEGTS RING-type E3 ubiquitin transferase … 773 0.0 TR:A0A2I0BCA5 A0A2I0BCA5_9ASPA RING-type E3 ubiquitin transferase … 757 0.0 TR:A0A3B6TG51 A0A3B6TG51_WHEAT RING-type E3 ubiquitin transferase … 756 0.0 TR:A0A835RLE0 A0A835RLE0_VANPL RING-type E3 ubiquitin transferase … 755 0.0 TR:A0A9R0SRI3 A0A9R0SRI3_TRITD U-box domain-containing protein 12 … 754 0.0 TR:A0A6I9S1Y4 A0A6I9S1Y4_ELAGV RING-type E3 ubiquitin transferase … 753 0.0 TR:A0A833VES7 A0A833VES7_9POAL RING-type E3 ubiquitin transferase … 751 0.0 TR:A0A9R1BGT5 A0A9R1BGT5_TRITD RING-type E3 ubiquitin transferase … 751 0.0 TR:A0A9D5BZQ6 A0A9D5BZQ6_9LILI RING-type E3 ubiquitin transferase … 750 0.0 TR:A0ABD0UM27 A0ABD0UM27_DENTH RING-type E3 ubiquitin transferase … 746 0.0 TR:A0A8B8ZB12 A0A8B8ZB12_PHODC RING-type E3 ubiquitin transferase … 745 0.0 TR:A0A8T3AWN2 A0A8T3AWN2_DENNO RING-type E3 ubiquitin transferase … 744 0.0 TR:A0A2I0WEQ6 A0A2I0WEQ6_9ASPA RING-type E3 ubiquitin transferase … 743 0.0 TR:A0A6P5GVN1 A0A6P5GVN1_ANACO RING-type E3 ubiquitin transferase … 741 0.0 TR:A0A5P1FQ11 A0A5P1FQ11_ASPOF RING-type E3 ubiquitin transferase … 734 0.0 TR:A0AAP0G926 A0AAP0G926_9ASPA RING-type E3 ubiquitin transferase … 729 0.0 TR:A0AAP0WZL6 A0AAP0WZL6_LIQFO RING-type E3 ubiquitin transferase … 728 0.0 TR:A0A843WFY6 A0A843WFY6_COLES RING-type E3 ubiquitin transferase … 723 0.0 TR:A0ACB7WUI3 A0ACB7WUI3_DIOAL Zinc finger RING/FYVE/PHD-type prot… 723 0.0 TR:A0AB40BCZ8 A0AB40BCZ8_DIOCR RING-type E3 ubiquitin transferase … 723 0.0 TR:A0AAQ3K0H0 A0AAQ3K0H0_9LILI RING-type E3 ubiquitin transferase … 720 0.0 TR:F6HND8 F6HND8_VITVI RING-type E3 ubiquitin transferase OS=Vitis… 718 0.0 TR:A0ABY9D2S6 A0ABY9D2S6_VITVI RING-type E3 ubiquitin transferase … 718 0.0 TR:A0AAV7E209 A0AAV7E209_ARIFI RING-type E3 ubiquitin transferase … 717 0.0 TR:A0AA39DJS7 A0AA39DJS7_VITRO RING-type E3 ubiquitin transferase … 715 0.0 TR:A0ABS8SCS2 A0ABS8SCS2_DATST RING-type E3 ubiquitin transferase … 712 0.0 TR:A0AAV5LAD4 A0AAV5LAD4_9ROSI RING-type E3 ubiquitin transferase … 712 0.0 TR:V4TIZ6 V4TIZ6_CITCL RING-type E3 ubiquitin transferase OS=Citru… 710 0.0 TR:A0A804I6U7 A0A804I6U7_MUSAM RING-type E3 ubiquitin transferase … 710 0.0 TR:A0A200Q9A7 A0A200Q9A7_MACCD RING-type E3 ubiquitin transferase … 709 0.0 TR:A0A7J6W5Z9 A0A7J6W5Z9_THATH RING-type E3 ubiquitin transferase … 709 0.0 TR:A0ACB8L513 A0ACB8L513_CITSI U-box domain-containing protein 14 … 709 0.0 TR:A0A067E149 A0A067E149_CITSI RING-type E3 ubiquitin transferase … 709 0.0 TR:A0ACB9PUD9 A0ACB9PUD9_BAUVA Uncharacterized protein OS=Bauhinia… 707 0.0 TR:A0A9Q1MDB3 A0A9Q1MDB3_9SOLA RING-type E3 ubiquitin transferase … 707 0.0 TR:A0AAV8RJ11 A0AAV8RJ11_ENSVE RING-type E3 ubiquitin transferase … 707 0.0 TR:A0AAN8VNY8 A0AAN8VNY8_9MAGN RING-type E3 ubiquitin transferase … 706 0.0 TR:A0ABR2QF73 A0ABR2QF73_9ROSI RING-type E3 ubiquitin transferase … 706 0.0 TR:A0A061FPF5 A0A061FPF5_THECC RING-type E3 ubiquitin transferase … 706 0.0 TR:A0A6P5YZR2 A0A6P5YZR2_DURZI RING-type E3 ubiquitin transferase … 705 0.0 TR:A0A6A4QEE0 A0A6A4QEE0_LUPAL RING-type E3 ubiquitin transferase … 705 0.0 TR:A0A6I9TG78 A0A6I9TG78_SESIN RING-type E3 ubiquitin transferase … 705 0.0 TR:A0A835LLS2 A0A835LLS2_9MAGN RING-type E3 ubiquitin transferase … 705 0.0 TR:A0AAD4NZN1 A0AAD4NZN1_PERFH RING-type E3 ubiquitin transferase … 704 0.0 TR:A0ABR2S2R6 A0ABR2S2R6_9ROSI RING-type E3 ubiquitin transferase … 704 0.0 TR:A0AAE1RYG1 A0AAE1RYG1_9SOLA RING-type E3 ubiquitin transferase … 704 0.0 TR:A0A1J6IL89 A0A1J6IL89_NICAT RING-type E3 ubiquitin transferase … 704 0.0 TR:A0A9Q1M5W7 A0A9Q1M5W7_9SOLA RING-type E3 ubiquitin transferase … 704 0.0 TR:A0A6J1B7W8 A0A6J1B7W8_9ROSI RING-type E3 ubiquitin transferase … 704 0.0 TR:A0A9W7HPH7 A0A9W7HPH7_HIBTR RING-type E3 ubiquitin transferase … 703 0.0 TR:A0A9W7HNE6 A0A9W7HNE6_HIBTR RING-type E3 ubiquitin transferase … 703 0.0 TR:A0A2P5ANF8 A0A2P5ANF8_TREOI RING-type E3 ubiquitin transferase … 702 0.0 TR:A0A1S2XZS7 A0A1S2XZS7_CICAR RING-type E3 ubiquitin transferase … 702 0.0 TR:A0AAE1SAH3 A0AAE1SAH3_9SOLA RING-type E3 ubiquitin transferase … 702 0.0 TR:A0A1J7FNJ6 A0A1J7FNJ6_LUPAN RING-type E3 ubiquitin transferase … 702 0.0 TR:A0A2P5AJ14 A0A2P5AJ14_PARAD RING-type E3 ubiquitin transferase … 702 0.0 TR:A0A1U7VEA5 A0A1U7VEA5_NICSY RING-type E3 ubiquitin transferase … 702 0.0 TR:M1ABT1 M1ABT1_SOLTU RING-type E3 ubiquitin transferase OS=Solan… 702 0.0 TR:A0ABQ7W6L0 A0ABQ7W6L0_SOLTU RING-type E3 ubiquitin transferase … 702 0.0 TR:A0AAV9K3G9 A0AAV9K3G9_9SOLN RING-type E3 ubiquitin transferase … 702 0.0 TR:A0A1S4BCS6 A0A1S4BCS6_TOBAC RING-type E3 ubiquitin transferase … 702 0.0 TR:A0A6J5U2T1 A0A6J5U2T1_PRUAR RING-type E3 ubiquitin transferase … 702 0.0 TR:A0ACB8L4W7 A0ACB8L4W7_CITSI U-box domain-containing protein 14 … 702 0.0 TR:A0A2G2VCI1 A0A2G2VCI1_CAPBA RING-type E3 ubiquitin transferase … 702 0.0 TR:A0AAE1YU04 A0AAE1YU04_9LAMI RING-type E3 ubiquitin transferase … 701 0.0 TR:A0AAV1X6J7 A0AAV1X6J7_LUPLU RING-type E3 ubiquitin transferase … 701 0.0 TR:A0A1U8EX65 A0A1U8EX65_CAPAN RING-type E3 ubiquitin transferase … 701 0.0 TR:A0A5E4ECC4 A0A5E4ECC4_PRUDU RING-type E3 ubiquitin transferase … 701 0.0 TR:A0ABD1WYG0 A0ABD1WYG0_9LAMI RING-type E3 ubiquitin transferase … 701 0.0 TR:A0AAD7M5U3 A0AAD7M5U3_QUISA RING-type E3 ubiquitin transferase … 701 0.0 TR:A0AAD7Q5P3 A0AAD7Q5P3_QUISA RING-type E3 ubiquitin transferase … 700 0.0 TR:A0A6P8CYA9 A0A6P8CYA9_PUNGR RING-type E3 ubiquitin transferase … 700 0.0 TR:A0A1S3ZKQ3 A0A1S3ZKQ3_TOBAC RING-type E3 ubiquitin transferase … 700 0.0 TR:A0A2G5D164 A0A2G5D164_AQUCA RING-type E3 ubiquitin transferase … 700 0.0 TR:A0A6P5U3J9 A0A6P5U3J9_PRUAV RING-type E3 ubiquitin transferase … 700 0.0 TR:A0ABQ9M5E9 A0ABQ9M5E9_HEVBR RING-type E3 ubiquitin transferase … 699 0.0 TR:A0ACC0ZAK8 A0ACC0ZAK8_9ROSI Uncharacterized protein OS=Pistacia… 699 0.0 TR:A0ABM0P219 A0ABM0P219_PRUMU RING-type E3 ubiquitin transferase … 699 0.0 TR:A0A3Q7IQ73 A0A3Q7IQ73_SOLLC RING-type E3 ubiquitin transferase … 699 0.0 TR:A0AA39SE18 A0AA39SE18_ACESA RING-type E3 ubiquitin transferase … 699 0.0 TR:A0A5J5AW04 A0A5J5AW04_9ASTE RING-type E3 ubiquitin transferase … 699 0.0 TR:A0A314Y6E1 A0A314Y6E1_PRUYE RING-type E3 ubiquitin transferase … 699 0.0 TR:A0A251QMB1 A0A251QMB1_PRUPE RING-type E3 ubiquitin transferase … 699 0.0 TR:A0A7J7M6E3 A0A7J7M6E3_9MAGN RING-type E3 ubiquitin transferase … 699 0.0 TR:A0AAV1QYQ7 A0AAV1QYQ7_9ROSI RING-type E3 ubiquitin transferase … 699 0.0 TR:A0A835H9Y2 A0A835H9Y2_9MAGN RING-type E3 ubiquitin transferase … 699 0.0 TR:A0AAD4XDG0 A0AAD4XDG0_9MAGN RING-type E3 ubiquitin transferase … 698 0.0 TR:A0AAE1ILH4 A0AAE1ILH4_9FABA RING-type E3 ubiquitin transferase … 698 0.0 TR:A0A8J6CUL5 A0A8J6CUL5_9ROSI RING-type E3 ubiquitin transferase … 698 0.0 TR:A0ABD2QVW4 A0ABD2QVW4_9SOLN RING-type E3 ubiquitin transferase … 697 0.0 TR:A0A5N5FI06 A0A5N5FI06_9ROSA RING-type E3 ubiquitin transferase … 697 0.0 TR:A0A9Q0UCA7 A0A9Q0UCA7_SALPP RING-type E3 ubiquitin transferase … 697 0.0 TR:A0AAE0BAR6 A0AAE0BAR6_9ROSI RING-type E3 ubiquitin transferase … 697 0.0 TR:A0ABM1FMC2 A0ABM1FMC2_SOLPN RING-type E3 ubiquitin transferase … 697 0.0 TR:A0A6P6AKQ0 A0A6P6AKQ0_DURZI RING-type E3 ubiquitin transferase … 697 0.0 TR:A0AAP0LDP2 A0AAP0LDP2_9MAGN RING-type E3 ubiquitin transferase … 697 0.0 TR:A0AAP0QEX9 A0AAP0QEX9_9ROSI RING-type E3 ubiquitin transferase … 697 0.0 TR:A0A9J5WGK4 A0A9J5WGK4_SOLCO RING-type E3 ubiquitin transferase … 697 0.0 TR:A0ACB8L597 A0ACB8L597_CITSI U-box domain-containing protein 14 … 697 0.0 TR:A0A5C7GYA8 A0A5C7GYA8_9ROSI RING-type E3 ubiquitin transferase … 697 0.0 TR:A0A5D2XPV8 A0A5D2XPV8_GOSMU RING-type E3 ubiquitin transferase … 697 0.0 TR:A0A8T2Y9A9 A0A8T2Y9A9_POPDE RING-type E3 ubiquitin transferase … 697 0.0 TR:A0ABM2YU16 A0ABM2YU16_GOSHI RING-type E3 ubiquitin transferase … 697 0.0 TR:A0AAN8SZP9 A0AAN8SZP9_SOLBU RING-type E3 ubiquitin transferase … 697 0.0 TR:A0AAD5IAS4 A0AAD5IAS4_ACENE RING-type E3 ubiquitin transferase … 697 0.0 TR:A0A9Q0WKY8 A0A9Q0WKY8_9ROSI RING-type E3 ubiquitin transferase … 696 0.0 TR:A0A5D2NX47 A0A5D2NX47_GOSTO RING-type E3 ubiquitin transferase … 696 0.0 TR:A0ABD3TPL4 A0ABD3TPL4_9LAMI RING-type E3 ubiquitin transferase … 696 0.0 TR:A0A3S4PK96 A0A3S4PK96_9MAGN RING-type E3 ubiquitin transferase … 696 0.0 TR:A0AAD6PFY3 A0AAD6PFY3_9ROSI RING-type E3 ubiquitin transferase … 696 0.0 TR:A0A5J5B9Y0 A0A5J5B9Y0_9ASTE RING-type E3 ubiquitin transferase … 696 0.0 TR:A0ABC8UN55 A0ABC8UN55_9AQUA RING-type E3 ubiquitin transferase … 696 0.0 TR:B9HLH5 B9HLH5_POPTR RING-type E3 ubiquitin transferase OS=Popul… 696 0.0 TR:A0A5D2JAA3 A0A5D2JAA3_GOSTO RING-type E3 ubiquitin transferase … 696 0.0 TR:A0A8K0GRB9 A0A8K0GRB9_9ROSA RING-type E3 ubiquitin transferase … 695 0.0 TR:A0A5D2TDZ1 A0A5D2TDZ1_GOSMU RING-type E3 ubiquitin transferase … 695 0.0 TR:A0A5D2B520 A0A5D2B520_GOSDA RING-type E3 ubiquitin transferase … 695 0.0 TR:A0AAE1X2D8 A0AAE1X2D8_9LAMI RING-type E3 ubiquitin transferase … 695 0.0 TR:A0ABR2BSQ0 A0ABR2BSQ0_9ROSI RING-type E3 ubiquitin transferase … 695 0.0 TR:A0ABR2NSZ0 A0ABR2NSZ0_9ROSI RING-type E3 ubiquitin transferase … 695 0.0 TR:A0ABM3AXC1 A0ABM3AXC1_GOSHI RING-type E3 ubiquitin transferase … 695 0.0 TR:A0AAF0UY34 A0AAF0UY34_SOLVR RING-type E3 ubiquitin transferase … 695 0.0 TR:A0ABC8QLK3 A0ABC8QLK3_9AQUA RING-type E3 ubiquitin transferase … 695 0.0 TR:A0ABC8SRL1 A0ABC8SRL1_9AQUA RING-type E3 ubiquitin transferase … 695 0.0 TR:A0A9Q0SV54 A0A9Q0SV54_SALVM RING-type E3 ubiquitin transferase … 694 0.0 TR:A0ABU6RFH6 A0ABU6RFH6_9FABA RING-type E3 ubiquitin transferase … 694 0.0 TR:A0A8X7Z8K1 A0A8X7Z8K1_POPTO RING-type E3 ubiquitin transferase … 694 0.0 TR:A0ACC4BRY9 A0ACC4BRY9_POPAL Uncharacterized protein OS=Populus … 694 0.0 TR:A0AAD6MJN4 A0AAD6MJN4_9ROSI RING-type E3 ubiquitin transferase … 694 0.0 TR:A0ABD2TIP0 A0ABD2TIP0_9SOLN RING-type E3 ubiquitin transferase … 694 0.0 TR:A0ACC2KS43 A0ACC2KS43_PERAE Uncharacterized protein OS=Persea a… 694 0.0 TR:A0ABD3E7B2 A0ABD3E7B2_9LAMI RING-type E3 ubiquitin transferase … 694 0.0 TR:A0AAP0NZD1 A0AAP0NZD1_9MAGN RING-type E3 ubiquitin transferase … 694 0.0 TR:A0A834W7W7 A0A834W7W7_9FABA RING-type E3 ubiquitin transferase … 694 0.0 TR:A0A5J5PUV7 A0A5J5PUV7_GOSBA RING-type E3 ubiquitin transferase … 694 0.0 TR:A0A0B0NXG6 A0A0B0NXG6_GOSAR RING-type E3 ubiquitin transferase … 694 0.0 TR:A0ABR0NKE6 A0ABR0NKE6_GOSAR Uncharacterized protein OS=Gossypiu… 694 0.0 TR:A0A9D3UIB5 A0A9D3UIB5_9ROSI RING-type E3 ubiquitin transferase … 694 0.0 TR:A0A1R3H5H3 A0A1R3H5H3_COCAP RING-type E3 ubiquitin transferase … 694 0.0 TR:A0ACC2KS90 A0ACC2KS90_PERAE Uncharacterized protein OS=Persea a… 694 0.0 TR:A0AAJ6UKJ1 A0AAJ6UKJ1_POPEU RING-type E3 ubiquitin transferase … 694 0.0 TR:A0ABD1GH90 A0ABD1GH90_SALDI RING-type E3 ubiquitin transferase … 694 0.0 TR:A0A1R3JE18 A0A1R3JE18_9ROSI RING-type E3 ubiquitin transferase … 694 0.0 TR:A0AA88D9K5 A0AA88D9K5_FICCA RING-type E3 ubiquitin transferase … 694 0.0 TR:A0A7J9IC62 A0A7J9IC62_9ROSI RING-type E3 ubiquitin transferase … 693 0.0 TR:A0A7N2R9R8 A0A7N2R9R8_QUELO RING-type E3 ubiquitin transferase … 693 0.0 TR:A0A835JZV4 A0A835JZV4_9ROSI RING-type E3 ubiquitin transferase … 693 0.0 TR:A0A8J5FTW6 A0A8J5FTW6_ZINOF RING-type E3 ubiquitin transferase … 693 0.0 TR:A0A4Y7KVD5 A0A4Y7KVD5_PAPSO RING-type E3 ubiquitin transferase … 693 0.0 TR:A0A5D2F4H4 A0A5D2F4H4_GOSDA RING-type E3 ubiquitin transferase … 693 0.0 TR:A0A8J4VKI5 A0A8J4VKI5_9ROSI RING-type E3 ubiquitin transferase … 693 0.0 TR:A0AAN9F206 A0AAN9F206_CROPI RING-type E3 ubiquitin transferase … 693 0.0 TR:A0A5J5U6C5 A0A5J5U6C5_GOSBA RING-type E3 ubiquitin transferase … 693 0.0 TR:A0A6A2Y6Q4 A0A6A2Y6Q4_HIBSY RING-type E3 ubiquitin transferase … 693 0.0 TR:A0A5B6WJW5 A0A5B6WJW5_9ROSI RING-type E3 ubiquitin transferase … 692 0.0 TR:A0A498JGF4 A0A498JGF4_MALDO RING-type E3 ubiquitin transferase … 692 0.0 TR:A0A7N2MEE7 A0A7N2MEE7_QUELO RING-type E3 ubiquitin transferase … 692 0.0 TR:A0AAV6Y5W4 A0AAV6Y5W4_9LAMI RING-type E3 ubiquitin transferase … 692 0.0 TR:A0AAP0KTP2 A0AAP0KTP2_9MAGN RING-type E3 ubiquitin transferase … 692 0.0 TR:A0A7J6WEB3 A0A7J6WEB3_THATH RING-type E3 ubiquitin transferase … 692 0.0 TR:A0A2G5CD20 A0A2G5CD20_AQUCA RING-type E3 ubiquitin transferase … 692 0.0 TR:W9SKF5 W9SKF5_9ROSA RING-type E3 ubiquitin transferase OS=Morus… 692 0.0 TR:A0A072VQ10 A0A072VQ10_MEDTR U-box domain-containing protein 12 … 692 0.0 TR:A0A9W7I114 A0A9W7I114_HIBTR RING-type E3 ubiquitin transferase … 691 0.0 TR:A0A9W7M2C2 A0A9W7M2C2_HIBTR RING-type E3 ubiquitin transferase … 691 0.0 TR:A0A371F3Y5 A0A371F3Y5_MUCPR RING-type E3 ubiquitin transferase … 691 0.0 TR:A0A8J5FNQ2 A0A8J5FNQ2_ZINOF RING-type E3 ubiquitin transferase … 691 0.0 TR:A0A5N5FBR6 A0A5N5FBR6_9ROSA RING-type E3 ubiquitin transferase … 691 0.0 TR:A0ABC8QLK2 A0ABC8QLK2_9AQUA RING-type E3 ubiquitin transferase … 691 0.0 TR:A0ACB8L5G0 A0ACB8L5G0_CITSI U-box domain-containing protein 14 … 691 0.0 TR:A0ABR0CTF8 A0ABR0CTF8_9LAMI RING-type E3 ubiquitin transferase … 691 0.0 TR:A0A1U7ZJ54 A0A1U7ZJ54_NELNU RING-type E3 ubiquitin transferase … 691 0.0 TR:A0A540L773 A0A540L773_MALBA RING-type E3 ubiquitin transferase … 691 0.0 TR:A0ACB9ZMC7 A0ACB9ZMC7_CATRO Uncharacterized protein OS=Catharan… 691 0.0 TR:A0A1U7Z4U1 A0A1U7Z4U1_NELNU RING-type E3 ubiquitin transferase … 691 0.0 TR:A0A8B8MC51 A0A8B8MC51_ABRPR RING-type E3 ubiquitin transferase … 691 0.0 TR:A0A822YGE3 A0A822YGE3_NELNU RING-type E3 ubiquitin transferase … 691 0.0 TR:A0A7J7CI66 A0A7J7CI66_TRIWF RING-type E3 ubiquitin transferase … 690 0.0 TR:A0A8X8ZET3 A0A8X8ZET3_SALSN U-box domain-containing protein 12 … 690 0.0 TR:A0ACB7HJV7 A0ACB7HJV7_MANES Uncharacterized protein OS=Manihot … 690 0.0 TR:A0A2P6SJA1 A0A2P6SJA1_ROSCH RING-type E3 ubiquitin transferase … 690 0.0 TR:A0A2Z7BEP1 A0A2Z7BEP1_9LAMI RING-type E3 ubiquitin transferase … 689 0.0 TR:A0A8X8ZH43 A0A8X8ZH43_SALSN RING-type E3 ubiquitin transferase … 689 0.0 TR:A0A8S0SA15 A0A8S0SA15_OLEEU RING-type E3 ubiquitin transferase … 689 0.0 TR:A0AAW2CXI6 A0AAW2CXI6_9ROSI RING-type E3 ubiquitin transferase … 689 0.0 TR:A0ACC1YBU1 A0ACC1YBU1_MELAZ RING-type E3 ubiquitin transferase … 689 0.0 TR:A0AAV1C6C9 A0AAV1C6C9_OLDCO RING-type E3 ubiquitin transferase … 689 0.0

This belongs to the U-box E3 ubiquitin ligase family.

1. [Structure page for protein via RCSB]

Name: U-box domain-containing protein 12 Organism: Oryza sativa Japonica Group Experimental Method: X-ray Resolution: 2.5 Å Molecular Weight: 8.952 kDa

The resolution is 2.5 Å indicating good quality.
There are no other molecules in the solved structure
The structure contains only the protein and no additional bound molecules.
It belongs to the U-box structural family.

I used 3D Protein Imager, located at https://3dproteinimaging.com/. I found it via https://bcrf.biochem.wisc.edu/2024/02/06/3d-protein-imager-a-pymol-qutemol-web-alternative/. https://doi.org/10.1093/bioinformatics/btaa009

This can be accessed online, and free. They also provide an illustration service, which is pretty cool. It can be found below. This program did not have the “cartoon”, “ribbon”, and “ball and stick”, but it did have the “Real”, “Outline”, and “Goodsell” (cartoon-like) view modes where are shown from a “Sphere” mode of viewing. See image below.

To visualize it differently, I looked at it from a “tube” viewing mode. It has more helices than sheets. See image below.

Hydrophobic residues are inside the protein core, and hydrophilic residues are on the surface. See image below. Labeling by residue it has many charged and hydrophobic residues, with the latter being more focused inward. See image below.

Visualizing the surface, this structure appears compact, without deep binding cavities. See image below.

Part C

To be added with images

Part D

This part involved several of us students brainstorming for a lab according to the following instructions:

1. Find a group of ~3–4 students
1. Read through the Phage Reading material.
1. Review the Bacteriophage Final Project Goals for engineering the L Protein:
Choose one or two main goals from the list that you think you can address computationally (e.g., “We’ll try to stabilize the lysis protein,” or “We’ll attempt to disrupt its interaction with E. coli DnaJ.”).
Write a 1-page proposal (bullet points or short paragraphs) describing:
Each individually put your plan on your HTGAA website

Our initial proposal can be found below:

Week 5 HW: Protein Design Part II

This week’s homework was divided into 3 parts.

Part A involved SOD1 Binder Peptide Design. That was broken into 3 parts: Part 1: Part 1: Generate Binders with PepMLM

A) Begin by retrieving the human SOD1 sequence from UniProt (P00441) and introducing the A4V mutation. Done B) Using the PepMLM Colab linked from the HuggingFace PepMLM-650M model card: Done C) Generate four peptides of length 12 amino acids conditioned on the mutant SOD1 sequence.

AHWGVYTGVKKAKRX 15.649067 AWVPPYAVVYALRAX 18.835777 SRWPPYAARVEWAKA 19.332172 SRYDEVVGVKKLRKX 14.812749

D) To your generated list, add the known SOD1-binding peptide FLYRWLPSRRGG for comparison.

Done

Record the perplexity scores that indicate PepMLM’s confidence in the binders.

Recorded

Part 2: Evaluate Binders with AlphaFold3

A) Navigate to the AlphaFold Server: alphafoldserver.com

Done

B) For each peptide, submit the mutant SOD1 sequence followed by the peptide sequence as separate chains to model the protein-peptide complex.

Submitted:

Original: MATKAVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFHVHEFGDNTAGCTSAGPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTLVVHEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQ

1st: MATKAVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFHVHEFGDNTAGCTSAGPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTLVVHEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQAHWGVYTGVKKAKRA

2nd: MATKAVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFHVHEFGDNTAGCTSAGPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTLVVHEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQAWVPPYAVVYALRAA

3rd: MATKAVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFHVHEFGDNTAGCTSAGPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTLVVHEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQSRWPPYAARVEWAKA

4th: MATKAVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFHVHEFGDNTAGCTSAGPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTLVVHEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQSRYDEVVGVKKLRKA

5th (with mutant): MATKAVCVLKGDGPVQGIINFEQKESNGPVKVWGSIKGLTEGLHGFHVHEFGDNTAGCTSAGPHFNPLSRKHGGPKDEERHVGDLGNVTADKDGVADVSIEDSVISLSGDHCIIGRTLVVHEKADDLGKGGNEESTKTGNAGSRLACGVIGIAQFLYRWLPSRRGG

All Xs were converted to As

C/D) Record the ipTM score and briefly describe where the peptide appears to bind. Does it localize near the N-terminus where A4V sits? Does it engage the β-barrel region or approach the dimer interface? Does it appear surface-bound or partially buried? D) In a short paragraph, describe the ipTM values you observe and whether any PepMLM-generated peptide matches or exceeds the known binder.

The IpTM score is .18. This does not appear to indicate high confidence in binding. The lack of clear localiztion, strong β-barrel engagement, meaningful Dimer interfacing, combined with surface bounding of the peptides does not suggest strong binding interactions.

Part 3: Evaluate Properties of Generated Peptides in the PeptiVerse

Paste the peptide sequence: Paste the A4V mutant SOD1 sequence in the target field. Check the boxes Predicted binding affinity Solubility Hemolysis probability Net charge (pH 7) Molecular weight

Data for such were pasted as pictures below, in this order:

Original 1st Sequence: AHWGVYTGVKKAKRX 2nd Sequence: AWVPPYAVVYALRAX 3rd Sequence: SRWPPYAARVEWAKA 4th Sequence: SRYDEVVGVKKLRKX 5th Sequence: Mutant

Part 4: Generate Optimized Peptides with moPPIt

Open the moPPit Colab linked from the HuggingFace moPPIt model card

Done

Make a copy and switch to a GPU runtime.

Done

In the notebook: Paste your A4V mutant SOD1 sequence. Choose specific residue indices on SOD1 that you want your peptide to bind (for example, residues near position 4, the dimer interface, or another surface patch). Set peptide length to 12 amino acids. Enable motif and affinity guidance (and solubility/hemolysis guidance if available). Generate peptides.

Done

After generation, briefly describe how these moPPit peptides differ from your PepMLM peptides. How would you evaluate these peptides before advancing them to clinical studies?

Erros were received and will be debugged.

Update:

This readout was given after fixing.

Binder Hemolysis Non-Fouling Solubility Half-Life Motif Specificity TEVEEQEDRQHH 0.932494111 0.910815179 0.916666687 1.714286923 0.001987276 0.934523821 LAAGQALGITTA 0.918551639 0.573852718 0.416666627 6.734929085 0.006364056 0.886904776 ARKLTPEDQKQG 0.961578194 0.903518379 0.75 2.08972311 0.080991976 0.803571403

moPPit peptides appear better to develop. I would evaluate these peptides further computationally before advancing them to clinical studies. This would involve ranking by various qualities and further optimization. If successful, in vitro studies would follow.

Part B: BRD4 Drug Discovery Platform Tutorial (Gabriele)

This assignment was optional. For the sake of priorities, it will not be posted here.

Part C: Final Project: L-Protein Mutants

This was held between of 5 members, 3 of whom were able to provide their results jointly. We persued the Option 1: Mutagenesis.

To save space, given the large volume of images, a link to the inputs can be found in this google doc:

https://docs.google.com/document/d/1676c1tgFUlGaP-Bwp9_vDexbk3VsOJeuQeylNfvz76o/edit?usp=sharing.

Some of the results will be shown below:

Week 6 HW: Genetic Circuits Part I: Assembly Technologies

Part 1: DNA Assembly

What are some components in the Phusion High-Fidelity PCR Master Mix and what is their purpose?

Some of the components include the Phusion DNA Polymerease (key to PCR), dNTPs (these are used to synthesize new DNA), and buffering materials to stabilize the environmental conditions during the raction.

What are some factors that determine primer annealing temperature during PCR?

Some of these include buffer conditions, primer melting temperature, primer length, and GC content.

There are two methods from this class that create linear fragments of DNA: PCR, and restriction enzyme digests. Compare and contrast these two methods, both in terms of protocol as well as when one may be preferable to use over the other.

PCR uses primers, dNTPs, buffers and polymerase to amplify DNA. Restriction enzyme digests are very precise in that they cut genomic material at precise locations, but they require restriction sites.

How can you ensure that the DNA sequences that you have digested and PCR-ed will be appropriate for Gibson cloning?

They need to be optimally designed for the system. This requires special attention primer design and their parameters.

How does the plasmid DNA enter the E. coli cells during transformation?

After a shock is induced, plasmid DNA can enter E. Coli cells through pores that emerge in their membrane.

Describe another assembly method in detail (such as Golden Gate Assembly)

6.1 Explain the other method in 5 - 7 sentences plus diagrams (either handmade or online).

Golden gate assembly is a method for efficient, seamless assembly that uses Type IIS restriction endonucleases to cleave DNA that exists outside of recognition sequences. Designs for inserts and cloning vectors place the Type IIS recognition sites distal to the cleavage site, and this allows efficient removal. NEB states 3 main advantages for GGA. One is that no scar sequence is introduced. Another is that the overhangs can allow for multiple simulaneous assemblies. Further, ligation and digestion can be done at the same time. A truncated example of a successful case is diagrammed below, with inspiration from a figure on NEB’s website.

Source/Reference: https://www.neb.com/en/applications/cloning-and-synthetic-biology/dna-assembly-and-cloning/golden-gate-assembly

6.2 Model this assembly method with Benchling or Asimov Kernel!

Route chosen: Asimov Kernel

Owing to the brilliance of the Kernal from Asimov, redesigning the wheel was unnecessary.

I found the “pPD005 pcDNA Golden Gate” under the Addgene Plasmid Repository detailing a construct that was already created It is shown below in circular form.