Make sure to document every step of the in-silico and lab experiments. Make sketches, screenshots, notes, drawings - anything that helps you - and others understand the experiment.
Your Documentation should help you - and others - to understand the topic. Don’t be afraid to add things that don’t work. Show your failures - and how you overcame them. Your Documentation should be a description of the amazing journey you are on!
Overview
Ethics, safety, and security are essential considerations throughout (and beyond!) this class. We have therefore designed the Class Assignment this week to give you a strong foundation, and then will ask you to reflect each week and in the design of your final project.
First, describe a biological engineering application or tool you want to develop and why. This could be inspired by an idea for your HTGAA class project and/or something for which you are already doing in your research, or something you are just curious about.
Next, describe one or more governance/policy goals related to ensuring that this application or tool contributes to an “ethical” future, like ensuring non-malfeasance (preventing harm). Break big goals down into two or more specific sub-goals. Below is one example framework (developed in the context of synthetic genomics) you can choose to use or adapt, or you can develop your own. The example was developed to consider policy goals of ensuring safety and security, alongside other goals, like promoting constructive uses, but you could propose other goals for example, those relating to equity or autonomy.
Next, describe at least three different potential governance “actions” by considering the four aspects below (Purpose, Design, Assumptions, Risks of Failure & “Success”). Try to outline a mix of actions (e.g. a new requirement/rule, incentive, or technical strategy) pursued by different “actors” (e.g. academic researchers, companies, federal regulators, law enforcement, etc). Draw upon your existing knowledge and a little additional digging, and feel free to use analogies to other domains (e.g. 3D printing, drones, financial systems, etc.).
Purpose: What is done now and what changes are you proposing?
Design: What is needed to make it “work”? (including the actor(s) involved - who must opt-in, fund, approve, or implement, etc)
Assumptions: What could you have wrong (incorrect assumptions, uncertainties)?
Risks of Failure & “Success”: How might this fail, including any unintended consequences of the “success” of your proposed actions?
Next, score (from 1-3 with, 1 as the best, or n/a) each of your governance actions against your rubric of policy goals. The following is one framework but feel free to make your own:
Does the option:
Option 1
Option 2
Option 3
Enhance Biosecurity
• By preventing incidents
• By helping respond
Foster Lab Safety
• By preventing incident
• By helping respond
Protect the environment
• By preventing incidents
• By helping respond
Other considerations
• Minimizing costs and burdens to stakeholders
• Feasibility?
• Not impede research
• Promote constructive applications
Last, drawing upon this scoring, describe which governance option, or combination of options, you would prioritize, and why. Outline any trade-offs you considered as well as assumptions and uncertainties. For this, you can choose one or more relevant audiences for your recommendation, which could range from the very local (e.g. to MIT leadership or Cambridge Mayoral Office) to the national (e.g. to President Biden or the head of a Federal Agency) to the international (e.g. to the United Nations Office of the Secretary-General, or the leadership of a multinational firm or industry consortia). These could also be one of the “actor” groups in your matrix.
Reflecting on what you learned and did in class this week, outline any ethical concerns that arose, especially any that were new to you. Then propose any governance actions you think might be appropriate to address those issues. This should be included on your class page for this week.
Assignment (Final Project) – Due as part of your Final Project presentation (not Feb 10)
Assignees for this section
MIT/Harvard students
Required
Committed Listeners
Required
As part of your final project, design one or more strategies to ensure that your project, and what it enables, contributes to growing an ethical biological future.
Assignment (Lab Preparation) — DUE BY START OF FEB 10 LECTURE
Assignees for this section
MIT/Harvard students
Required
Committed Listeners
(Not Applicable)
Lab Training (failure to complete this will jeopardize your acceptance into the course)
Complete Lab Specific Training in Person.
Complete Safety Training in Atlas
Navigate to atlas.mit.edu and on the right-hand side, click “Learning Center”
Head to the Course Catalog and find the following two courses:
General Biosafety for Researchers (EHS00260w)
Managing Hazardous Waste (EHS00501w)
Assignment (Week 2 Lecture Prep) — DUE BY START OF FEB 10 LECTURE
Assignees for this section
MIT/Harvard students
Required
Committed Listeners
Required
In preparation for Week 2’s lecture on “DNA Read, Write, and Edit," please review these materials:
Lecture 2 slides as posted below.
The associated papers that are referenced in those slides.
In addition, answer these questions in each faculty member’s section:
Nature’s machinery for copying DNA is called polymerase. What is the error rate of polymerase? How does this compare to the
length of the human genome. How does biology deal with that discrepancy?
How many different ways are there to code (DNA nucleotide code) for an average human protein? In practice what are some of the
reasons that all of these different codes don’t work to code for the protein of interest?
Choose ONE of the following three questions to answer; and please cite AI prompts or paper citations used, if any.
[Using Google & Prof. Church’s slide #4] What are the 10 essential amino acids in all animals
and how does this affect your view of the “Lysine Contingency”?
[Given slides #2 & 4 (AA:NA and NA:NA codes)] What code would you suggest for AA:AA interactions?
[(Advanced students)] Given the one paragraph abstracts for these real 2026 grant programs
sketch a response to one of them or devise one of your own:
Assignment (Your HTGAA Website) — DUE BY START OF FEB 10 LECTURE
Assignees for this section
MIT/Harvard students
Required
Committed Listeners
Required
Begin personalizing your HTGAA website in in https://edit.htgaa.org/, starting with your homepage —
fill in the template with information about yourself, or remove what’s there and make it your own. Be creative!
As with all assignments in HTGAA, be sure to write up every part of this Homework on your HTGAA website in order to receive credit.
Important
In order to continue in this course you need two things:
This homework completed and written up on your HTGAA website on pages.htgaa.org — make sure you’ve checked your
published website on pages.htgaa.org and are happy with how it shows up there; if your homework is not visible on
your pages.htgaa.org website course staff won’t see it and you will not be selected to continue in the course!
For this week only, AFTER your homework is complete and published on pages.htgaa.org, fill out the
Homework 1 Completion form which David emailed out just after
Lecture 1. This Google form expresses your interest in continuing with the course.
Without both of these you will not be accepted into HTGAA!
Synthetic Genomics: Options for Governance This is an older but useful report for thinking about a variety of options for the governance of biotechnology that inspired this week’s homework
National Security Commission on Emerging Biotechnology: This U.S. Congressional Commission will produce its first “comprehensive” report at the end of 2024 but has an “interim” 2023 report posted now, and they are currently soliciting input to guide national policy regulating biotech
iGEM 2020 Safety Hub: This page includes links to many useful resources including the WHO biosafety manual, the NIH guidelines and the CDC Biosafety in Microbial and Biomedical Laboratories Guide; additional information is available on the iGEM 2023 Responsibility page
Handbook for Community Biology Spaces: A handbook co-developed by community biolobabs, designed as a living document that can be updated and expanded by the community over time
DIYBio Ask a biosafety expert This page includes a portal where you can get your biosafety questions answered by professionals
Rooftop Solar and the Four Levers of Social Change: A blog post from Ethan Zuckerman considering different types of ways of regulating behavior, adopted in part from Lawrence Lessig’s book: Code 2.0, and explored in the context of energy consumption and production
Subsections of Week 1 (Feb 3)
Lab (Week 1) — Introduction to Pipetting and Dilutions
Overview
Objective
Welcome to HTGAA! This is our very first lab, and in this lab we will introduce students to the foundational techniques of pipetting and serial dilutions, critical for precise liquid handling and solution preparation in biological and chemical experiments.
This is a one-day lab with two protocols covered on mixing colors and dilution. By the end of the lab, students will confidently use pipettes, prepare solutions with desired concentrations, and troubleshoot common errors in pipetting.
Concepts Learned & Skills Gained
Students will:
Understand Units and Conversions: moles (mol), molarity (M), and conversions between µL, mL, and L.
Perform Serial Dilutions: Learn the stepwise dilution process to achieve specific solution concentrations.
Gain Pipetting Proficiency: Operate P20, P200, and P1000 pipettes accurately for volume transfers.
Visualize Mixing Outcomes: Use colors and absorbance measurements to observe concentration gradients.
Pre-Lab
Reading
Key Definitions
Here are some key definitions we’d like you to know before you get started.
Moles (mol): A unit representing $6.022 \times 10^{23}$ particles (atoms, molecules, etc.).
Molarity (M): Concentration defined as moles of solute per liter of solution (mol/L).
Conversions:
1 L = 1000 mL = 1,000,000 μL
1 M = 1000 mM = 1,000,000 μM
Planning Your Experiments
To calculate the volume of water needed for a dilution, use the formula: $$C_1 V_1 = C_2 V_2$$
$C_2$ : Final concentration (desired concentration).
$V_2$ : Final volume (total volume of the diluted solution).
Steps:
Rearrange the formula to calculate $V_1$: $$ V_1 = \frac{C_2 V_2}{C_1} $$
Calculate the volume of water (let’s call it $V_Water$) to add: $$ V_Water = V_2 - V_1 $$
Practice
Dilution Practice 1
Scenario: The stock concentration of a mystery substance (MS) is 5 M. Calculate how to dilute to 100 µM (0.1 mM):
Use sequential 1:499 and 1:99 dilution steps for accurate preparation.
Step 1: Dilute 5 M (5,000,000 µM) to 10,000 µM (500x dilution).
Step 2: Dilute 10,000 µM to 100 µM (100x dilution).
Dilution Practice 2
The stock concentration of a mystery substance (MS) is 5 M.
If the molar mass of MS is 532 g/mol, what’s the concentration of the stock concentration in g/mL? To make your life easier, you can use one of many online calculators.
You will perform a serial dilution to get 100 uM of MS. Devise a plan to dilute a 5 M MS solution to 100 uM. How many dilution steps will we need? Which tubes should we use? Which pipettes?
Fill out the following chart to prepare a final reaction with 60 uL reaction volume. Why did we make 100 uM MS if we actually need 40 uM MS? Why not prepare 40 uM in serial dilutions?
Reagent
Stock concentration
Desired concentration
Volume
Loading dye
6X
1X
MS
100 uM
40 uM
dH2O
n/a
n/a
Note
Please fill this out before coming to lab.
Additional resources
You must watch or be able to understand the following videos:
Mysterious substance (food coloring with water), henceforth: MS
Red, Blue and Yellow food coloring solutions
Gel loading dye (commonly used reagents for loading gels, strong purple color)
Part 1: Mixing Color
Prepare tubes with red, yellow, and blue food coloring solutions OR watercolor
Take ten tubes and mark them with numbers 1 to 6
Tube 1, 2 and 3: add 500 uL each red, yellow, and blue solution to the tube.
Tube 4: add 220 uL red solution to the tube, and add 220 uL yellow solution.
Try adding this in 2 steps: add 200 uL first, and then 20 uL. Discard your tips after you add one color!
Tube 5: add 525 uL yellow solution to the tube, and add 525 uL blue solution.
Tube 6: add 155 uL red solution to the tube, and add 155 uL blue solution.
Now you have a rainbow! You can try mixing other colors with the solutions.
Try plating different volumes (e.g. 1uL, 2uL, 5uL, 10uL) on a petri plate to make some designs and build your intuitive understanding of these volumes.
Part 2: Performing Serial Dilution
Perform serial dilutions to get 100 uM (0.1 mM) of MS.
Every time you mix in liquid, pipette up and down three or four times to ensure the two liquids are mixed thoroughly.
Mark each tube with its respective concentration using a pen.
Prepare a final reaction of 60 uL based on your table in the pre-lab.
Bonus: Take 20 uL from the final reaction and pipette it to a pre-prepared gel well. Wells are a bit trickier because they are thin and your pipette tip will puncture the gel if you’re not careful. Be gentle!
This week explores the read–write–edit toolkit: sequencing and synthesis workflows, restriction digests and gel electrophoresis, and early genome-editing frameworks.
Make sure to document every step of the in-silico and lab experiments. Make sketches, screenshots, notes, drawings… anything that helps you - and others - understand the experiment.
Your documentation should help you - and others - to understand the topic. Don’t be afraid to add things that don’t work. Show your failures - and how you overcame them. Your Documentation should be a description of the amazing journey you are on!
Part 0: Basics of Gel Electrophoresis
Assignees for this section
MIT/Harvard students
Required
Committed Listeners
Required
Attend or watch all lecture and recitation videos. Optionally watch bootcamp.
In recitation, we discussed that you will pick a protein for your homework that you find interesting. Which protein have you chosen and why? Using one of the tools described in recitation (NCBI, UniProt, google), obtain the protein sequence for the protein you chose.
[Example from our group homework, you may notice the particular format — The example below came from UniProt]
3.2. Reverse Translate: Protein (amino acid) sequence to DNA (nucleotide) sequence.
The Central Dogma discussed in class and recitation describes the process in which DNA sequence becomes transcribed and translated into protein. The Central Dogma gives us the framework to work backwards from a given protein sequence and infer the DNA sequence that the protein is derived from. Using one of the tools discussed in class, NCBI or online tools (google “reverse translation tools”), determine the nucleotide sequence that corresponds to the protein sequence you chose above.
Lysis protein DNA sequence atggaaacccgattccctcagcaatcgcagcaaactccggcatctactaatagacgccggccattcaaacatgaggattacccatgtcgaagacaacaaagaagttcaactctttatgtattgatcttcctcgcgatctttctctcgaaatttaccaatcaattgcttctgtcgctactggaagcggtgatccgcacagtgacgactttacagcaattgcttacttaa
3.3. Codon optimization.
Once a nucleotide sequence of your protein is determined, you need to codon optimize your sequence. You may, once again, utilize google for a “codon optimization tool”. In your own words, describe why you need to optimize codon usage. Which organism have you chosen to optimize the codon sequence for and why?
Lysis protein DNA sequence with Codon-Optimization ATGGAAACCCGCTTTCCGCAGCAGAGCCAGCAGACCCCGGCGAGCACCAACCGCCGCCGCCCGTTCAAACATGAAGATTATCCGTGCCGTCGTCAGCAGCGCAGCAGCACCCTGTATGTGCTGATTTTTCTGGCGATTTTTCTGAGCAAATTCACCAACCAGCTGCTGCTGAGCCTGCTGGAAGCGGTGATTCGCACAGTGACGACCCTGCAGCAGCTGCTGACCTAA
3.4. You have a sequence! Now what?
What technologies could be used to produce this protein from your DNA? Describe in your words the DNA sequence can be transcribed and translated into your protein. You may describe either cell-dependent or cell-free methods, or both.
3.5. [Optional] How does it work in nature/biological systems?
Describe how a single gene codes for multiple proteins at the transcriptional level.
Try aligning the DNA sequence, the transcribed RNA, and also the resulting translated Protein!!! See example below.
[Example shows the biomolecular flow in central dogma from DNA to RNA to Protein] Special note that all “T” were transcribed into “U” and that the 3-nt codon represents 1-AA.
Rearranged snapshot of MS2 L-protein information flow from DNA to RNA to Protein. Captured from Ice’s Benchling and stitched together in a ppt
Part 4: Prepare a Twist DNA Synthesis Order
Assignees for this section
MIT/Harvard students
Required
Committed Listeners
Required
This is a practice exercise, not necessarily your real Twist order!
For example, let’s make a sequence that will make E. coli glow fluorescent green under UV light by constitutively (always) expressing sfGFP (a green fluorescent protein):
In Benchling, selectNew DNA/RNA sequence
Give your insert sequence a name and select DNA with a Linear topology (this is a linear sequence that will be inserted into a circular backbone vector of our choosing).
Go through each piece of the given DNA sequences highlighted below (Promoter, RBS, Start Codon, Coding Sequence, His Tag, Stop Codon, Terminator) and paste the sequences into the Benchling file one after the other (replacing the coding sequence with your codon optimized DNA sequence of interest!). Each time you add a new piece of the sequence, make sure to annotate by right clicking over the sequence and creating an annotation that describes what each piece (e.g., Promoter, RBS, etc.) is (see image below).
RBS (e.g. BBa_B0034 with spacers for optimal expression): CATTAAAGAGGAGAAAGGTACC
Start Codon: ATG
Coding Sequence (your codon optimized DNA for a protein of interest, sfGFP for example): AGCAAAGGAGAAGAACTTTTCACTGGAGTTGTCCCAATTCTTGTTGAATTAGATGGTGATGTTAATGGGCACAAATTTTCTGTCCGTGGAGAGGGTGAAGGTGATGCTACAAACGGAAAACTCACCCTTAAATTTATTTGCACTACTGGAAAACTACCTGTTCCGTGGCCAACACTTGTCACTACTCTGACCTATGGTGTTCAATGCTTTTCCCGTTATCCGGATCACATGAAACGGCATGACTTTTTCAAGAGTGCCATGCCCGAAGGTTATGTACAGGAACGCACTATATCTTTCAAAGATGACGGGACCTACAAGACGCGTGCTGAAGTCAAGTTTGAAGGTGATACCCTTGTTAATCGTATCGAGTTAAAGGGTATTGATTTTAAAGAAGATGGAAACATTCTTGGACACAAACTCGAGTACAACTTTAACTCACACAATGTATACATCACGGCAGACAAACAAAAGAATGGAATCAAAGCTAACTTCAAAATTCGCCACAACGTTGAAGATGGTTCCGTTCAACTAGCAGACCATTATCAACAAAATACTCCAATTGGCGATGGCCCTGTCCTTTTACCAGACAACCATTACCTGTCGACACAATCTGTCCTTTCGAAAGATCCCAACGAAAAGCGTGACCACATGGTCCTTCTTGAGTTTGTAACTGCTGCTGGGATTACACATGGCATGGATGAGCTCTACAAA
7x His Tag (Let’s add a 7×His tag at the C-terminus of the protein to enable protein purification from E. coli): CATCACCATCACCATCATCAC
Once you’ve completed this, click on Linear Map to preview the entire sequence. If you intend to have a TA review a sequence in the future, this is a good way to verify that all sections are annotated!
This is not required for this exercise, but to share your design with others, please ensure that link sharing is turned on!
(Optional) Share your final sequence link with a TA for review!
This insert sequence you built is commonly referred to as an expression cassette in molecular biology (a sequence you can drop into any vector and it’ll perform its function). Go ahead and download the FASTA file for the sequence you made.
It’s helpful to visualize DNA designs using SBOL Canvas (Synthetic Biology Open Language) to convey your designs. Here’s an example of what you just annotated in Benchling:
4.3. On Twist, Select The “Genes” Option
4.4. Select “Clonal Genes” option
For this demonstration, we’ll choose Clonal Genes. You’ll select clonal genes or gene fragments depending on your final project.
Historically, HTGAA projects using clonal genes (circular DNA) have reached experimental results 1-2 weeks quicker because they can be transformed directly into E. coli without additional assembly.
Gene fragments (linear DNA) offer greater design flexibility but typically require an assembly or cloning step prior to transformation. An advantage is If designed with the appropriate exonuclease protection, gene fragments can be used directly in cell-free expression.
4.5. Import your sequence
You just took an amino acid sequence of interest and converted it into DNA, codon optimized it, and built an expression cassette around it! Choose the Nucleotide Sequence option and Upload Sequence File to upload your FASTA file.
4.6. Choose Your Vector
Since we’re ordering a clonal gene, you will need to refer to Twist’s Vector Catalog to choose your circular backbone. You can think of this as taking your linear expression cassette for your protein of interest, and completing the rest of the circle!
The backbone confers many special properties like antibiotic resistance, an origin of replication, and more. Discuss with your node to decide on appropriate antibiotic options. At MIT/Harvard, you can use Ampicillin, Chloramphenicol, or Kanamycin resistance.
Twist vectors do not contain restriction sites near the insert fragment, so make sure to flank your design with cut sites if you are intending to extract this DNA insert fragment later.
For this demonstration, choose a Twist cloning vectors like pTwist Amp High Copy.
Click into your sequence and select download construct (GenBank) to get the full plasmid sequence:
Go back to your Benchling account. Inside of a folder, click the import DNA/RNA sequence button and upload the GenBank file you just downloaded.
This is the plasmid you just built with your expression cassette included. Congratulations on building your first plasmid!
Important
For your final projects, remember to include:
Fully annotated Benchling insert fragment
Desired Twist cloning vector
Part 5: DNA Read/Write/Edit
Assignees for this section
MIT/Harvard students
Required
Committed Listeners
Required
5.1 DNA Read
(i) What DNA would you want to sequence (e.g., read) and why? This could be DNA related to human health (e.g. genes related to disease research), environmental monitoring (e.g., sewage waste water, biodiversity analysis), and beyond (e.g. DNA data storage, biobank).
(ii) In lecture, a variety of sequencing technologies were mentioned. What technology or technologies would you use to perform sequencing on your DNA and why? Also answer the following questions:
Is your method first-, second- or third-generation or other? How so?
What is your input? How do you prepare your input (e.g. fragmentation, adapter ligation, PCR)? List the essential steps.
What are the essential steps of your chosen sequencing technology, how does it decode the bases of your DNA sample (base calling)?
What is the output of your chosen sequencing technology?
5.2 DNA Write
(i) What DNA would you want to synthesize (e.g., write) and why? These could be individual genes, clusters of genes or genetic circuits, whole genomes, and beyond. As described in class thus far, applications could range from therapeutics and drug discovery (e.g., mRNA vaccines and therapies) to novel biomaterials (e.g. structural proteins), to sensors (e.g., genetic circuits for sensing and responding to inflammation, environmental stimuli, etc.), to art (DNA origamis). If possible, include the specific genetic sequence(s) of what you would like to synthesize! You will have the opportunity to actually have Twist synthesize these DNA constructs! :)
See some famous examples of DNA design
DNA origami by Paul W. K. Rothemund, California Institute of Technology, 2004. 100 nanometers in diameter.
(ii) What technology or technologies would you use to perform this DNA synthesis and why? Also answer the following questions:
What are the essential steps of your chosen sequencing methods?
What are the limitations of your sequencing method (if any) in terms of speed, accuracy, scalability?
5.3 DNA Edit
(i) What DNA would you want to edit and why? In class, George shared a variety of ways to edit the genes and genomes of humans and other organisms. Such DNA editing technologies have profound implications for human health, development, and even human longevity and human augmentation. DNA editing is also already commonly leveraged for flora and fauna, for example in nature conservation efforts, (animal/plant restoration, de-extinction), or in agriculture (e.g. plant breeding, nitrogen fixation). What kinds of edits might you want to make to DNA (e.g., human genomes and beyond) and why?
Colossal Biosciences Inc., a biotechnology company using genetic engineering to de-extinct various historic animals such as the woolly mammoth, dodo, and dire wolf.
(ii) What technology or technologies would you use to perform these DNA edits and why? Also answer the following questions:
How does your technology of choice edit DNA? What are the essential steps?
What preparation do you need to do (e.g. design steps) and what is the input (e.g. DNA template, enzymes, plasmids, primers, guides, cells) for the editing?
What are the limitations of your editing methods (if any) in terms of efficiency or precision?
Reading & Resources (click to expand)
Resources
DNA Sequencing at 40: Past, Present, and Future (2017) Shendure, J., Balasubramanian, S., Church, G. et al.https://doi.org/10.1038/nature24286
Base editors contain a nicking or dead Cas9 enzyme fused to a deaminase.
a.) PAM requirement: Base editors contain a nicking or dead Cas9 enzyme fused to a deaminase. For designing your guide RNA for base editing you will therefore have a PAM requirement like you would have for any Cas9 experiment.
b.) Deamination window: An additional design constraint is that the sequence window in which deamination occurs is only a few base pairs long. You can find information on the deamination windows in the review below (even though some new editors are not included).
BE4 and ABE7.10 are good starting points and both use SpCas9 with NGG Pam requirement. Base editors with other PAM sites have been constructed too.
TALEN
For TALENs, you can assume no sequence restrictions – One of the technology’s previous restrictions was a T starting base, but this has since been overcome. In contrast to the CRISPR/Cas technologies above, your DNA sequence is recognized through interactions between the DNA and the TALEN: each TAL in the array recognizes one base.
(Note: In order to introduce a double strand break, you will need to design to TALENs targeting the opposing strands.)
Gel Purification of DNA: after DNA gel electrophoresis, cutting a band of DNA out of the agarose gel allows isolation and purification of a specific DNA fragment:
Using the coordinates from the GUI, follow the instructions in the HTGAA26 Opentrons Colab to write your own Python script which draws your design using the Opentrons.
You may use AI assistance for this coding — Google Gemini is integrated into Colab (see the stylized star bottom center); it will do a good job writing functional Python, while you probably need to take charge of the art concept.
If you’re a proficient programmer and you’d rather code something mathematical or algorithmic instead of using your GUI coordinates, you may do that instead.
Ask for help early!
If you are having any trouble with scripting, contact your TAs as soon as possible for help. Do not wait until your scheduled robot time slot or you may not be able to complete this assignment!
If the Python component is proving too problematic even with AI and human assistance, download the full Python script from the GUI website and submit that:
Use the download icon pointed to by the red arrow in this diagram.
If you use AI to help complete this homework or lab, document how you used AI and which models made contributions.
Sign up for a robot time slot if you are at MIT/Harvard/Wellesley or at a Node offering Opentrons automation. The Python script you created will be run on the robot to produce your work of art!
At MIT/Harvard? Lab times are on Thursday Feb.19 between 10AM and 6PM.
Post-Lab Questions — DUE BY START OF FEB 24 LECTURE
Assignees for this section
MIT/Harvard students
Required
Committed Listeners
Required
One of the great parts about having an automated robot is being able to precisely mix, deposit, and run reactions without much intervention, and design and deploy experiments remotely.
For this week, we’d like for you to do the following:
Find and describe a published paper that utilizes the Opentrons or an automation tool to achieve novel biological applications.
Write a description about what you intend to do with automation tools for your final project. You may include example pseudocode, Python scripts, 3D printed holders, a plan for how to use Ginkgo Nebula, and more. You may reference this week’s recitation slide deck for lab automation details.
While your description/project idea doesn’t need to be set in stone, we would like to see core details of what you would automate. This is due at the start of lecture and does not need to be tested on the Opentrons yet.
Example 1: You are creating a custom fabric, and want to deposit art onto specific parts that need to be intertwined in odd ways. You can design a 3D printed holder to attach this fabric to it, and be able to deposit bio art on top. Check out the Opentrons 3D Printing Directory.
Example 2: You are using the cloud laboratory to screen an array of biosensor constructs that you design, synthesize, and express using cell-free protein synthesis.
Echo transfer biosensor constructs and any required cofactors into specified wells.
Bravo stamp in CPFS reagent master mix into all wells of a 96-well / 384-well plate.
Multiflo dispense the CFPS lysate to all wells to start protein expression.
PlateLoc seal the plate.
Inheco incubate the plate at 37°C while the biosensor proteins are synthesized.
XPeel remove the seal.
PHERAstar measure fluorescence to compare biosensor responses.
Final Project Ideas — DUE BY START OF FEB 24 LECTURE
Assignees for this section
MIT/Harvard students
Required
Committed Listeners
Required
As explained in this week’s recitation, add 1-3 slides with 3 ideas you have for an Individual Final Project in the appropriate
slide deck
for MIT/Harvard/Wellesley students
or for Commited Listeners. Be sure to put your name on your slide(s); for CLs, also put your city and country on your slide(s) and be sure you’re putting your slide(s) in your Node’s section of the deck.
Lab work this week is contained within the homework assignment below.
Homework: Protein Design I — DUE BY START OF MAR 3 LECTURE
Objective:
Learn basic concepts:
amino acid structure
3D protein visualization
the variety of ML-based design tools
Brainstorm as a group how to apply these tools to engineer a better bacteriophage (setting the stage for the final project).
Part A. Conceptual Questions
Assignees for this section
MIT/Harvard students
Required
Committed Listeners
Required
Answer any NINE of the following questions from Shuguang Zhang: (i.e. you can select two to skip)
How many molecules of amino acids do you take with a piece of 500 grams of meat? (on average an amino acid is ~100 Daltons)
Why do humans eat beef but do not become a cow, eat fish but do not become fish?
Why are there only 20 natural amino acids?
Can you make other non-natural amino acids? Design some new amino acids.
Where did amino acids come from before enzymes that make them, and before life started?
If you make an α-helix using D-amino acids, what handedness (right or left) would you expect?
Can you discover additional helices in proteins?
Why are most molecular helices right-handed?
Why do β-sheets tend to aggregate?
What is the driving force for β-sheet aggregation?
Why do many amyloid diseases form β-sheets?
Can you use amyloid β-sheets as materials?
Design a β-sheet motif that forms a well-ordered structure.
Part B: Protein Analysis and Visualization
Assignees for this section
MIT/Harvard students
Required
Committed Listeners
Required
In this part of the homework, you will be using online resources and 3D visualization software to answer questions about proteins. Pick any protein (from any organism) of your interest that has a 3D structure and answer the following questions:
Briefly describe the protein you selected and why you selected it.
Identify the amino acid sequence of your protein.
How long is it? What is the most frequent amino acid? You can use this Colab notebook to count the frequency of amino acids.
How many protein sequence homologs are there for your protein? Hint: Use Uniprot’s BLAST tool to search for homologs.
Does your protein belong to any protein family?
Identify the structure page of your protein in RCSB
When was the structure solved? Is it a good quality structure? Good quality structure is the one with good resolution. Smaller the better (Resolution: 2.70 Å)
Are there any other molecules in the solved structure apart from protein?
We will now try multiple things in the three sections below; report each of these results in your homework writeup on your HTGAA website:
C1. Protein Language Modeling
Picture Source: Bordin, Nicola et al (2023). Novel machine learning approaches revolutionize protein knowledge. Trends in Biochemical Sciences, Volume 48, Issue 4, 345 - 359
Deep Mutational Scans
Use ESM2 to generate an unsupervised deep mutational scan of your protein based on language model likelihoods.
Can you explain any particular pattern? (choose a residue and a mutation that stands out)
(Bonus) Find sequences for which we have experimental scans, and compare the prediction of the language model to experiment.
Latent Space Analysis
Use the provided sequence dataset to embed proteins in reduced dimensionality.
Analyze the different formed neighborhoods: do they approximate similar proteins?
Place your protein in the resulting map and explain its position and similarity to its neighbors.
C2. Protein Folding
Picture Source: Lin et al (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model.
Folding a protein
Fold your protein with ESMFold. Do the predicted coordinates match your original structure?
Try changing the sequence, first try some mutations, then large segments. Is your protein structure resilient to mutations?
C3. Protein Generation
Picture Source: 1. Post from Sergey Ovchinnikov 2. Roney, Ovchinnikov et al (2022). State-of-the-art estimation of protein model accuracy using AlphaFold. Phys. Rev. Lett. 129, 238101
Inverse-Folding a protein: Let’s now use the backbone of your chosen PDB to propose sequence candidates via ProteinMPNN
Analyze the predicted sequence probabilities and compare the predicted sequence vs the original one.
Input this sequence into ESMFold and compare the predicted structure to your original.
Part D. Group Brainstorm on Bacteriophage Engineering
Assignees for this section
MIT/Harvard students
Optional
Committed Listeners
Required
Find a group of ~3–4 students
Read through the Phage Reading material listed under “Reading & Resources” below.
Review the Bacteriophage Final Project Goals for engineering the L Protein:
Increased stability (easiest)
Higher titers (medium)
Higher toxicity of lysis protein (hard)
Brainstorm Session
Choose one or two main goals from the list that you think you can address computationally (e.g., “We’ll try to stabilize the lysis protein,” or “We’ll attempt to disrupt its interaction with E. coli DnaJ.”).
Write a 1-page proposal (bullet points or short paragraphs) describing:
Which tools/approaches from recitation you propose using (e.g., “Use Protein Language Models to do in silico mutagenesis, then AlphaFold-Multimer to check complexes.”).
Why do you think those tools might help solve your chosen sub-problem?
Name one or two potential pitfalls (e.g., “We lack enough training data on phage–bacteria interactions.”).
NGLViewer: NGL Viewer is a collection of tools for web-based molecular graphics. WebGL is employed to display molecules like proteins and DNA/RNA with a variety of representations.
Chimera: A highly extensible program for interactive visualization and analysis of molecular structures and related data, including density maps, supramolecular assemblies, sequence alignments, docking results, trajectories, and conformational ensembles.
Week 7 — Genetic Circuits Part II: Neuromorphic Circuits
This week covers neuromorphic genetic circuits, showing how engineered gene networks can implement neural-network
“perceptron”-like computation and learning.
Lecture (Tues, Mar 17)
Genetic Circuits Part II: Neuromorphic Circuits Ron Weiss, Evan Holbrook
(The recording will be posted here when available)
This lecture presents a range of advanced technologies to do precision
measurement of proteins at atomic scales, characterizing chemical
composition, and detecting protein sequence and structure.
This week examines how modern bioproduction pipelines, from strain engineering to fermentation and downstream processing, are increasingly designed, executed, and optimized through cloud lab platforms and automation — enabling remote, high-throughput, and reproducible synthetic biology at industrial scale.
Lecture (Tues, Apr 14)
Bioproduction & Cloud Labs Reshma Shetty
(The recording will be posted here when available)
Recitation (Wed, Apr 15)
Cloud laboratories Ronan Donovan
(The recording and slides will be posted here when available)
This week covers designing, programming, and fabricating engineered living materials — such as self-healing concretes,
adaptive biofilms, and responsive biomaterials — by integrating genetic circuit design, materials science, and bioprocess
engineering.
Lecture (Tues, Apr 28)
Biodesign & Engineered Living Materials; Frugal Science Manu Prakash, David Kong
(The recording will be posted here when available)
Recitation (Wed, Apr 29)
Printing mycelium in Bambu Labs X1 Carbon 3D printer Ren Ramlan
(The recording and slides will be posted here when available)