Homework

Class Assignments:

  • Week 1 HW: Principles and Practices

    Week 1: Principles and Practices Class Assignment 1. Application or Tool to Develop & Why Application: “The Bio-Puzzle” – A hardware-agnostic DNA assembly toolkit using split-protein reporters (e.g., Split-GFP) as physical “checksums” for long-sequence construction.

  • Week 2 HW: DNA Read, Write, and Edit

    Week 2: DNA Read, Write, and Edit Part 1 & 2: Gel Art Describe your process of creating Gel Art using Benchling and restriction digests.

  • Week 3 HW: Lab Automation

    Week 3: Lab Automation Lab Automation assignment This week, we explored lab automation and its applications in synthetic biology.

Subsections of Homework

Week 1 HW: Principles and Practices

cover cover

Week 1: Principles and Practices

Class Assignment

1. Application or Tool to Develop & Why

Application: “The Bio-Puzzle” – A hardware-agnostic DNA assembly toolkit using split-protein reporters (e.g., Split-GFP) as physical “checksums” for long-sequence construction.

Why: For students and DIY biologists, the most immediate “biosecurity” threat is unintentional human error. Assembling long DNA (2,000bp+) from short, affordable oligo pools is difficult and error-prone. Currently, verifying these assemblies requires expensive and slow sequencing. The “Bio-Puzzle” transforms biosecurity from a restrictive policy into a helpful engineering tool. By engineering DNA fragments to produce a visual signal only when assembled in the correct order, we provide a real-time, “at-the-bench” verification system.

2. Governance/Policy Goals

Goal: To foster a culture of “Integrity by Design” in decentralized research environments.

  • Sub-goal A: Minimize human error in DNA synthesis by providing immediate physical feedback.
  • Sub-goal B: Establish a community norm where long-sequence assembly is inherently linked to transparency and verification.

3. Potential Governance Actions

  • Option 1: The Split-Reporter Standard (Technical): Standardize “Verification Tags” at DNA junctions.
  • Option 2: Orthogonal Overhang Library (Policy): Create open-source library of “puzzle teeth” for safe connection.
  • Option 3: The Bio-Cookbook Ledger (Social): Peer-verified platform sharing success outcomes.

4. Scoring Governance Actions

Does the option:Option 1Option 2Option 3
Enhance Biosecurity
• By preventing incidents112
• By helping respond321
Foster Lab Safety
• By preventing incident112
Protect environment
• By preventing incidents112

(1 = Best / 3 = Worst)

5. Prioritization, Trade-offs, and Ethics

Prioritization: I prioritize Option 1 (Split-Reporter) because it addresses the core technical challenge.

Ethical Reflection: This project shifts the focus from “top-down censorship” to “bottom-up empowerment.”


Assignment (Week 2 Lecture Prep)

Homework Questions from Professor Jacobson

    • The error rate of polymerase is $10^{-6}$.
    • The length of human genome is $3.2$ Gbp.
    • Biology utilize proofreading and mismatch repair to deal with this discrepancy.
    • The average length of a human gene is about 1,036 bp.
    • Total possible DNA sequences: $3^{345}$.

Homework Questions from Dr. LeProust

  1. Phosphoramidite method: Deprotection → Coupling → Capping → Oxidation
  2. Limit: Once it exceeds 200 nt, it becomes difficult to separate and purify.
  3. Yield: Decreases exponentially with length.

Homework Question from George Church

  • Essential amino acids: Lysine, Histidine, Isoleucine, Leucine, Methionine, Phenylalanine, Threonine, Tryptophan, Valine, Arginine.
  • Lysine constraint: Lysine can be supplied externally, so the constraint isn’t absolute and could undergo reversion mutations.

Week 2 HW: DNA Read, Write, and Edit

cover cover

Week 2: DNA Read, Write, and Edit

Part 1 & 2: Gel Art

Describe your process of creating Gel Art using Benchling and restriction digests.

1.1 In-silico Design (Benchling)

  • Enzymes used: HindIII, SacI, KpnI

  • Design Concept: For this project, I designed a “Gel Art” pattern inspired by a European circular arch bridge. I wanted to use the different migration speeds of DNA fragments to recreate the structural elegance of an ancient stone bridge.

  • Inspiration: Stone Bridge Stone Bridge

  • Simulated Gel Pattern:

Gel Art Bridge Gel Art Bridge

1.2 Lab Execution

  • Protocol followed: “Gel Art: Restriction Digests and Gel Electrophoresis”
  • Results: (Awaiting wet-lab execution results)

Part 3: DNA Design Challenge

3.1 Protein Choice

  • Chosen Protein: LuxR (Aliivibrio fischeri)
  • Why: I chose LuxR because it is a fundamental component for building genetic circuits. LuxR/LuxI system is one of the most well-characterized Quorum Sensing systems, used to create sophisticated biological logic gates and population-controlled behaviors in synthetic biology.
  • Mechanism (LuxR/LuxI Quorum Sensing):

LuxR Quorum Sensing LuxR Quorum Sensing Image source: MDPI - International Journal of Molecular Sciences, 2020

  • Protein Sequence:
    >sp|P12746|LUXR_ALIF1 Transcriptional activator protein luxR OS=Aliivibrio fischeri (strain ATCC 7744 / MJ11) OX=312301 GN=luxR PE=1 SV=1
    MKNNIKNYAFLLFLFIIFINPKNNSAKLDKIKAYNTIVEKVEGNEFDLALFAYIHLALLL
    NKINNKLLIKGDKISLVGFPCVDNGLCSTGIIFSHVNDLVVNDYIFNIDNKENESIKLID
    LFEKSVEEVKAIYNYYKKINEKNYLILDSKISFYKLHDSYKKLYKLSLNIIPLSFEKKEL
    CILKKLIHETLSKFKIEKSYVNLDKLIDKNIQLIKIEQNDFNDSIYSYKKLISIILLPLT
    YFE

3.2 Reverse Translation

  • Reverse Translation Process: Starting from the LuxR protein sequence, I determined the original nucleotide sequence of the luxR gene from Aliivibrio fischeri. I used the Gene Corner Reverse Translate Tool for this process. In synthetic biology, this “reverse” process allows us to understand how nature codes for the protein, providing a baseline for synthesis.
  • Wild-type DNA Sequence (luxR):
    atgaaaaacaatattaaaaattatgcgtttcttttgttatttttcatcatatttattaat
    ccgaaaaataatagcgcaaaattagataaaatcaaagcgtacaatacaattgtagagaaa
    gtagaaggtaatgaatttgatttggcgctatttgcatatattcatttggccttactttta
    aataaaatcaataataagttatttattaaaggtgataaaatcagtttagttggtttcccg
    tgtgtagataacggattatgttcaactggaattattttttctcatgttaatgatttagtt
    gttaatgattatattttttacattgataataaagaaaatgaatcaattaaattgattgat
    ttatttgaaaagagtgtagaagaggtaaaagcgatttataattattataaaaaaattaat
    gagaaaaattatctaattttagattcaaaaatcagtttttataaattacatgatagttat
    aaaaaattatatatattgagtttaaatattatccctttaagttttgaaaaaaaagaactt
    tgtattttaaaaaaactaattcatgagacattaagtaaattcaaaattgagaagagttat
    gttaatttagataaattaattgataaaaatattcaattaattaaaattgagcaaaatgat
    tttaatgattcgatttatagttacaaaaaattaattagtattattctattaccactaact
    tattttgaataa

3.3 Codon Optimization

  • Optimized for Organism: Escherichia coli (K-12)

  • Optimization Tool: I utilized the IDT Codon Optimization Tool to adapt the sequence for high expression in E. coli.

  • Why Optimize? Codon optimization is crucial because different organisms prefer different “synonymous” codons to represent the same amino acid. This is known as codon usage bias. In E. coli, rare codons (like those found in A. fischeri) can lead to:

    1. Low Expression: Ribosomes stalling at rare codons, reducing protein yield.
    2. Truncated Proteins: Ribosomes falling off the mRNA before finishing.
    3. Misfolding: The timing of translation speed affects how the protein folds. Additionally, optimization helps remove internal restriction sites (like BsaI, BbsI) and strong secondary structures that might interfere with DNA synthesis or translation.
  • Codon-Optimized DNA Sequence for E. coli:

    ATG AAG AAC AAT ATT AAA AAC TAC GCA TTT CTG CTG CTG TTT TTT ATC ATT
    TTC ATC AAC CCG AAA AAT AAC TCA GCC AAG CTG GAT AAA ATT AAA GCG TAT
    AAT ACA ATT GTC GAA AAA GTG GAG GGC AAC GAA TTT GAT TTG GCG CTT TTT
    GCC TAC ATC CAC TTG GCG CTG TTG CTG AAT AAA ATT AAT AAT AAA TTG TTT
    ATT AAA GGC GAC AAG ATT TCG CTG GTC GGT TTC CCG TGC GTG GAC AAC GGC
    CTG TGC TCA ACT GGT ATT ATC TTT TCA CAT GTC AAT GAT CTT GTA GTG AAT
    GAT TAT ATC TTT TAT ATT GAC AAT AAA GAA AAT GAG AGT ATT AAG CTG ATT
    GAC CTT TTC GAG AAG TCC GTA GAG GAA GTG AAG GCC ATT TAT AAT TAT TAC
    AAG AAA ATC AAC GAA AAG AAT TAT CTG ATT TTG GAC TCA AAA ATC TCG TTC
    TAT AAA TTA CAC GAT TCT TAT AAA AAG CTT TAC ATC CTG TCG CTG AAC ATC
    ATC CCG TTG TCT TTT GAA AAA AAG GAA CTG TGT ATT CTG AAA AAA CTG ATC
    CAC GAA ACC CTG AGT AAA TTT AAA ATT GAG AAG TCT TAC GTT AAC CTG GAT
    AAA TTA ATT GAT AAA AAC ATT CAG TTG ATC AAG ATT GAA CAA AAC GAT TTC
    AAC GAT AGC ATT TAT TCG TAC AAA AAG TTA ATT AGC ATC ATT CTG CTG CCC
    TTG ACA TAT TTT GAA TAA

3.4 Production Technology

  • Process (Cell-Free Protein Synthesis - CFPS): I am particularly interested in producing this protein using cell-free methods, such as the PURE (Protein synthesis Using Recombinant Elements) system or TX-TL (Transcription-Translation) cell extracts (e.g., from E. coli).

  • How it works: In a cell-free system, instead of transforming the DNA into a living cell, we simply mix the DNA template with a cocktail of “biological machinery” in a test tube. This cocktail includes:

    1. RNA Polymerase: To transcribe the DNA into mRNA.
    2. Ribosomes: To translate the mRNA into a protein.
    3. tRNAs & Amino Acids: The building blocks and adapters for protein assembly.
    4. Energy Sources: ATP and regeneration systems to fuel the process.
  • Why Cell-Free?

    • Speed: Protein can be produced in hours rather than days, as there is no need for cell growth or transformation.
    • Direct Prototyping: We can use linear DNA (like the codon-optimized sequence I designed) directly without cloning it into a plasmid.
    • Safety & Control: Since there are no living cells, it is easier to study proteins that might be toxic to a host cell, and we have precise control over the reaction environment.
  • Dual-Method Compatibility: Ultimately, this design is highly versatile: it is possible to use both cell-free (in vitro) and cell-dependent (in vivo) methods. While cell-free systems offer rapid prototyping and a controlled environment, the same optimized sequence can be cloned into a plasmid and transformed into E. coli for stable, large-scale production. This flexibility allows us to choose the most appropriate method depending on the experimental goals.

3.5 [Optional] Central Dogma in Nature

  • Transcription and Translation: In nature, a single gene can sometimes code for multiple proteins through mechanisms like alternative splicing (in eukaryotes) or overlapping open reading frames (ORFs) (in viruses and some bacteria like our lux operon). For example, in the lux system, the organization of genes allows for coordinated expression of the entire bioluminescence machinery from a single promoter.

Part 4: Twist DNA Synthesis Order

4.1 DNA Insert Construction

  • Name: LuxR_Cassette_v1

  • Full Insert Sequence (FASTA):

    >LuxR_Cassette_v1
    TTTACGGCTAGCTCAGTCCTAGGTATAGTGCTAGCCATTAAAGAGGAGAAAGGTACCATG
    ATGAAGAACAATATTAAAAACTACGCATTTCTGCTGCTGTTTTTTATCATTTTCATCAAC
    CCGAAAAATAACTCAGCCAAGCTGGATAAAATTAAAGCGTATAATACAATTGTCGAAAAA
    GTGGAGGGCAACGAATTTGATTTGGCGCTTTTTGCCTACATCCACTTGGCGCTGTTGCTG
    AATAAAATTAATAATAAATTGTTTATTAAAGGCGACAAGATTTCGCTGGTCGGTTTCCCG
    TGCGTGGACAACGGCCTGTGCTCAACTGGTATTATCTTTTCACATGTCAATGATCTTGTA
    GTGAATGATTATATCTTTTATATTGACAATAAAGAAAATGAGAGTATTAAGCTGATTGAC
    CTTTTCGAGAAGTCCGTAGAGGAAGTGAAGGCCATTTATAATTATTACAAGAAAATCAAC
    GAAAAGAATTATCTGATTTTGGACTCAAAAATCTCGTTCTATAAATTACACGATTCTTAT
    AAAAAGCTTTACATCCTGTCGCTGAACATCATCCCGTTGTCTTTTGAAAAAAAGGAACTG
    TGTATTCTGAAAAAACTGATCCACGAAACCCTGAGTAAATTTAAAATTGAGAAGTCTTAC
    GTTAACCTGGATAAATTAATTGATAAAAACATTCAGTTGATCAAGATTGAACAAAACGAT
    TTCAACGATAGCATTTATTCGTACAAAAAGTTAATTAGCATCATTCTGCTGCCCTTGACA
    TATTTTGAACATCACCATCACCATCATCACTAACCAGGCATCAAATAAAACGAAAGGCTC
    AGTCGAAAGACTGGGCCTTTCGTTTTATCTGTTGTTTGTCGGTGAACGCTCTCTACTAGA
    GTCACACTGGCTCACCTTCGGGTGGGCCTTTCTGCGTTTATA
  • Components Breakdown:

    • Promoter (J23106): TTTACGGCTAGCTCAGTCCTAGGTATAGTGCTAGC (Constitutive promoter)
    • RBS (B0034): CATTAAAGAGGAGAAAGGTACC (Strong RBS with spacers)
    • Start Codon: ATG
    • Coding Sequence: Optimized luxR (Stop codon removed for C-terminal tagging)
    • 7x His Tag: CATCACCATCACCATCATCAC (For protein purification)
    • Stop Codon: TAA
    • Terminator (B0015): CCAGGCATCAAATAAAACGAAAGGCTCAGTCGAAAGACTGGGCCTTTCGTTTTATCTGTTGTTTGTCGGTGAACGCTCTCTACTAGAGTCACACTGGCTCACCTTCGGGTGGGCCTTTCTGCGTTTATA

4.2 Vector Selection

  • Cloning Vector: pTwist Amp High Copy

  • Design Preview (Benchling Linear Map): LuxR Benchling LuxR Benchling

  • Strategy: This vector was chosen for its high copy number in E. coli, ensuring high yields of the plasmid for downstream applications. The cassette is flanked by cloning sites for easy extraction if needed. You can view the full annotated sequence on Benchling here.

  • Final Plasmid Map: LuxR Plasmid Map LuxR Plasmid Map Visual representation of the LuxR_Expression_Cassette inserted into the pTwist Amp High Copy vector.


Part 5: DNA Read/Write/Edit

5.1 DNA Read (Sequencing)

  • (i) Target: I want to sequence environmental DNA and synthesized constructs in real-time at the “bench” or in the field. This is crucial for early detection of unintended mutations or environmental contamination.
  • (ii) Technology: Oxford Nanopore Technologies (ONT)
    • Generation: 3rd Generation (Single-molecule sequencing).
    • Input & Preparation: Long-read DNA. Minimal preparation is key; using ONT’s Rapid Sequencing Kits, we can perform transposase-based fragmentation and adapter ligation in under 10 minutes.
    • Essential Steps: DNA molecules pass through a protein nanopore embedded in a membrane. As they pass, they disrupt the electrical current.
    • Base Calling: The changes in current are decoded using neural networks (base-callers like Guppy or Dorado) into a sequence of A, T, C, and G.
    • Output: FASTQ files of “long reads,” allowing me to see entire genetic circuits in a single piece.
    • Why ONT? It is highly portable (MinION) and provides data in real-time, which is essential for the “Biosecurity by Design” concept below.

5.2 DNA Write (Synthesis)

  • (i) Target: Synthetic genetic circuits based on quorum sensing regulators like LuxR (as designed in Part 3 and 4). These would be distributed as part of “safe-to-use” biological toolkits for decentralized research.
  • (ii) Technology: Phosphoramidite Synthesis (Modern Array-based)
    • Essential Steps: Controlled coupling of A, T, C, and G nucleotides onto a substrate (like a silicon chip for Twist Bioscience), followed by deprotection and oxidation cycles.
    • Limitations: While accurate and scalable for fragments, constructing multi-kb circuits still requires hierarchical assembly (Gibson or Golden Gate). Errors increase with length, requiring the biosecurity measures described in Part 5.3.

5.3 DNA Edit (Biosecurity Kill Switch)

  • (i) Target: I want to edit the Host Genome (E. coli or cell-free chassis) to include an autonomously triggered “Biosecurity Kill Switch.”
  • (ii) Technology: CRISPR-Cas9 System
    • The Concept: If the ONT sequencer (Read) detects a specific error, unintended mutation, or the presence of a hazardous sequence, it triggers the expression of a specialized CRISPR-Cas system.

    • Mechanism: I would design the system where a guide RNA (gRNA) targets essential genomic sequences or the synthetic circuit itself. Once triggered, Cas9 creates double-strand breaks in the genome, effectively “self-destructing” the cell or the DNA pool to prevent the spread of a dangerous or dysfunctional agent. CRISPR Cas9 Mechanism CRISPR Cas9 Mechanism

    • Preparation & Input: Requires a plasmid carrying the Cas9 gene under a conditional promoter and specialized gRNAs designed for high precision.

    • Limitations: The primary limitation is “escaper” mutants—cells that survive by mutating the CRISPR target site. To mitigate this, multiple essential sites must be targeted simultaneously (multiplexing).


Week 3 HW: Lab Automation

cover cover

Week 3: Lab Automation

Lab Automation assignment

This week, we explored lab automation and its applications in synthetic biology.

1. Assignment: Python Script for Opentrons Artwork

I designed a “Pac-Man” themed piece of Bio-Art for the Opentrons OT-2 robot. The design features Pac-Man and several ghosts, mapped onto a circular canvas representing the output of the lab’s liquid handling.

Google Colab Link: HTGAA Week 3 - Pac-Man Opentrons Art

Art Preview: Pac-Man Opentrons Art Pac-Man Opentrons Art

Python Script Logic: The script iterates through a matrix of coordinates, assigning specific colored liquids (Yellow for Pac-Man, Red/Blue/Cyan/Orange for ghosts) to designated wells. I used the Opentrons Python API (v2) to handle the aspirate and dispense operations with the P300 single-channel pipette.


2. Post-Lab Questions

1. Published Paper Review

Paper: Direct SARS-CoV-2 detection using a portable, open-source robotic platform (Yue et al., Matter, 2021) Summary: This research describes the development of a fully automated, portable diagnostic platform for SARS-CoV-2 using the Opentrons OT-2 robot. It integrates CRISPR-Cas12a based detection (a “novel biological application”) with robotic liquid handling to perform sensitive and rapid virus screening. Novelty: The platform automates the entire process from sample preparation to fluorescence readout. By using an open-source robot like the OT-2, the authors created a system that is significantly more affordable and flexible than traditional diagnostic workstations. This demonstrates how automation can be deployed in “the field” or in resource-limited settings for high-stakes biological monitoring.

2. Automation for Final Project

For my final project, I am interested in optimizing LuxR-based biosensors. I intend to use automation to:

  1. High-Throughput Screening: Characterize dozens of LuxR variants against a library of AHL (Acyl-homoserine lactone) analogs.
  2. Serial Dilutions: Automatically perform precise serial dilutions of AHLs to calculate dose-response curves (EC50) for each variant.
  3. Combinatorial Mixing: Mix different signaling components to study crosstalk in multi-channel quorum sensing circuits.

Pseudocode Idea:

# Pseudo-code for LuxR sensitivity screening
concentrations = [0, 1e-9, 1e-8, 1e-7, 1e-6, 1e-5] # Molar
variants = ['LuxR_WT', 'LuxR_V1', 'LuxR_V2']

for variant in variants:
    for conc in concentrations:
        mix_variant_with_ahl(variant, conc)
        measure_fluorescence() # Integrated with reader

3. Final Project Ideas

As part of the assignment, I have proposed the following three directions for my individual final project:

  1. AI-Guided Evolution of LuxR Biosensors:
    • Using the LLR (Log-Likelihood Ratio) and sequence analysis I’ve been working on to predict mutations.
    • Using automation to screen these variants for shifted specificity.
  2. Automated Cell-Free Prototyping of Quorum Sensing Circuits:
    • Bypassing the cell transformation loop by testing LuxR-responsive promoters in a cell-free expression system.
    • Leveraging Opentrons for setup.
  3. Biological “Log” Memory via DNA Synthesis:
    • Designing a DNA synthesis cassette (like the one I designed for LuxR) that “records” chemical exposures into a DNA sequence that can be read out later.