Week 2 HW: DNA Read, Write, & Edit

🧬 Week 2 Homework Components

DNA Read, Write, & Edit: sequencing and synthesis workflows, restriction digests and gel electrophoresis, genome-editing frameworks.

📋 Overview

This week covers:

Content to be added as you complete each part.

Subsections of Week 2 HW: DNA Read, Write, & Edit

Part 1: Benchling & In-silico Gel Art


Simulated restriction enzyme digestion with the seven enzymes specified in this week’s lab protocol: SalI, SacI, EcoRV, KpnI, BamHI, HindIII, and EcoRI. Used both the DNA Gel Art Interface (λ DNA) and Benchling (lambda phage genome NC_001416) to visualize digest patterns and verify cut-site predictions.

Lab protocol: Gel Art: Restriction Digests and Gel Electrophoresis


Benchling Digest: NC_001416 (Lambda Phage Genome)

Sequence: NC_001416, Escherichia phage lambda, 48,502 bp (linear).

Benchling digest link: NC_001416 Digest (Benchling)


Proof of Work: Screenshots

1. DNA Gel Art Interface: λ DNA Restriction Digests

Simulated gel electrophoresis using the DNA Gel Art tool. λ DNA was digested with various enzyme combinations (EcoRV + SacI, HindIII + PvuII, NdeI + SalI, etc.) across lanes 2–10. The table documents water, CutSmart buffer, λ DNA, and enzyme volumes per lane.

DNA Gel Art Interface: simulated restriction digests of λ DNA with multiple enzyme combinations; lanes 2–10 show fragment patterns; restriction digest table documents reagents per lane

2. Benchling: NC_001416 Sequence Map with Restriction Sites

Linear map of NC_001416 in Benchling showing the raw sequence, annotated genetic features (e.g., genes such as xis and nu1, plus lambdap-prefixed locus tags), and restriction enzyme cut sites (PciI, AscI, PmeI, BsaI, KpnI, SacI, SalI, and others) along the 48.5 kb genome.

Benchling NC_001416: sequence map and linear map with restriction enzyme cut sites and genetic features

3. Virtual Digest Gel: NC_001416 with All Seven Required Enzymes

Simulated gel (Life 1 kb Plus ladder) showing digest results for NC_001416 with each of the seven required enzymes:

| Lane | Enzyme | Fragment pattern |
| --- | --- | --- |
| 1 | HindIII | 3 bands (~11 kb, ~6.5 kb, ~2.1 kb) |
| 2 | BamHI | 3 bands (~11.5 kb, ~7 kb, ~5.8 kb) |
| 3 | KpnI | 2 bands (~12 kb, ~1.7 kb) |
| 4 | EcoRV | Multiple bands (many cut sites) |
| 5 | SacI | 2 bands (~11.5 kb, ~1 kb) |
| 6 | SalI | 2 bands (~11.5 kb, ~550 bp) |
| 7 | EcoRI | Multiple bands (~12 kb, ~9.5 kb, ~8.5 kb, ~7.5 kb, ~6 kb, ~3.5 kb) |

Virtual digest gel: NC_001416 digested with HindIII, BamHI, KpnI, EcoRV, SacI, SalI, EcoRI; Life 1 kb Plus ladder

Enzymes Simulated

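| Enzyme | Recognition site | Notes |
| --- | --- | --- |
| SalI | G^TCGAC | 6-cutter |
| SacI | GAGCT^C | 6-cutter |
| EcoRV | GAT^ATC | 6-cutter, blunt |
| KpnI | GGTAC^C | 6-cutter |
| BamHI | G^GATCC | 6-cutter |
| HindIII | A^AGCTT | 6-cutter |
| EcoRI | G^AATTC | 6-cutter |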
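
The recognition sites and `^` cut positions in the table are enough to predict fragment sizes in code. The following is a minimal Python sketch, not the method Benchling or the Gel Art tool uses: it scans the top strand for each palindromic site and cuts at the `^` offset. The function names and the 30-bp toy sequence are illustrative assumptions; to compare against the virtual gel, the NC_001416 sequence would have to be loaded in place of the toy string.

```python
import re

# Recognition sites and top-strand cut offsets, taken from the table above.
# The offset is the number of bases that precede "^" in the recognition site.
ENZYMES = {
    "SalI": ("GTCGAC", 1),
    "SacI": ("GAGCTC", 5),
    "EcoRV": ("GATATC", 3),
    "KpnI": ("GGTACC", 5),
    "BamHI": ("GGATCC", 1),
    "HindIII": ("AAGCTT", 1),
    "EcoRI": ("GAATTC", 1),
}


def cut_positions(seq, enzyme):
    """Return top-strand cut positions (index of the first base after each cut)."""
    site, offset = ENZYMES[enzyme]
    # A lookahead is used so that overlapping sites would also be reported.
    return [m.start() + offset for m in re.finditer(f"(?={site})", seq.upper())]


def digest_linear(seq, enzyme):
    """Return fragment lengths, largest first, for a linear molecule."""
    edges = [0] + cut_positions(seq, enzyme) + [len(seq)]
    return sorted((b - a for a, b in zip(edges, edges[1:])), reverse=True)


# Hypothetical 30-bp toy sequence with one EcoRI site and one BamHI site.
toy = "AAAAGAATTCAAAAAAAAGGATCCAAAAAA"
print(digest_linear(toy, "EcoRI"))  # [25, 5]
print(digest_linear(toy, "BamHI"))  # [19, 11]
```

Because all seven sites are palindromic, searching one strand finds every site. An enzyme with no site simply returns the full-length molecule as a single fragment.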

Part 3: DNA Design Challenge


3.1 Choose Your Protein

Protein chosen: Superfolder Green Fluorescent Protein (sfGFP)

Why: sfGFP is a robust, rapidly maturing fluorescent protein derived from Aequorea victoria (Pédelacq et al., 2005). It is widely used in synthetic biology as a reporter: when expressed in cells, it fluoresces bright green under blue/UV light, enabling real-time visualization of gene expression, protein localization, and cell tracking. Its “superfolder” mutations improve folding efficiency in diverse hosts (including E. coli), making it ideal for expression experiments. It also connects directly to Part 4, where we build an expression cassette to make E. coli glow green.

Source: FPbase (Superfolder GFP) | UniProt | GenBank: ASL68970

Protein sequence (amino acids):

MSKGEELFTGVVPILVELDGDVNGHKFSVRGEGEGDATNGKLTLKFICTTGKLPVPWPTLVTTLTYGVQCFSRYPDHMKRHDFFKSAMPEGYVQERTISFKDDGTYKTRAEVKFEGDTLVNRIELKGIDFKEDGNILGHKLEYNFNSHNVYITADKQKNGIKANFKIRHNVEDGSVQLADHYQQNTPIGDGPVLLPDNHYLSTQSVLSKDPNEKRDHMVLLEFVTAAGITHGMDELYK

(238 amino acids, ~26.8 kDa)


3.2 Reverse Translate: Protein → DNA

Using the Central Dogma in reverse: given a protein sequence, we infer a possible DNA sequence that could encode it. Because the genetic code is degenerate (multiple codons encode the same amino acid), many DNA sequences can produce the same protein. A simple reverse translation uses one valid codon per amino acid; here, E. coli preferred codons (those most frequently used in highly expressed genes).

Tool used: Reverse translation with E. coli codon preferences (e.g., EMBOSS Backtranseq or a similar tool; it can also be done manually with a codon usage table). Note that ExPASy Translate works in the other direction, from nucleotide to protein.

Reverse-translated DNA sequence (one possible encoding):

ATGTCAAAAGGTGAAGAACTGTTTACCGGTGTGGTGCCGATTCTGGTGGAACTGGATGGTGATGTGAACGGTCACAAATTTTCAGTGCGTGGTGAAGGTGAAGGTGATGCTACCAACGGTAAACTGACCCTGAAATTTATTTGCACCACCGGTAAACTGCCGGTGCCGTGGCCGACCCTGGTGACCACCCTGACCTACGGTGTGCAGTGCTTTTCACGTTACCCGGATCACATGAAACGTCACGATTTTTTTAAATCAGCTATGCCGGAAGGTTACGTGCAGGAACGTACCATTTCATTTAAAGATGATGGTACCTACAAAACCCGTGCTGAAGTGAAATTTGAAGGTGATACCCTGGTGAACCGTATTGAACTGAAAGGTATTGATTTTAAAGAAGATGGTAACATTCTGGGTCACAAACTGGAATACAACTTTAACTCACACAACGTGTACATTACCGCTGATAAACAGAAAAACGGTATTAAAGCTAACTTTAAAATTCGTCACAACGTGGAAGATGGTTCAGTGCAGCTGGCTGATCACTACCAGCAGAACACCCCGATTGGTGATGGTCCGGTGCTGCTGCCGGATAACCACTACCTGTCAACCCAGTCAGTGCTGTCAAAAGATCCGAACGAAAAACGTGATCACATGGTGCTGCTGGAATTTGTGACCGCTGCTGGTATTACCCACGGTATGGATGAACTGTACAAA

(714 bp)
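
The one-codon-per-amino-acid approach can be sketched in a few lines of Python. The codon choices below were read off the reverse-translated sequence above (for example S → TCA, L → CTG), so they mirror this page rather than any official E. coli usage table; the function name is an illustrative assumption.

```python
# One codon per amino acid, read off the reverse-translated sequence above.
CODON_FOR = {
    "A": "GCT", "C": "TGC", "D": "GAT", "E": "GAA", "F": "TTT",
    "G": "GGT", "H": "CAC", "I": "ATT", "K": "AAA", "L": "CTG",
    "M": "ATG", "N": "AAC", "P": "CCG", "Q": "CAG", "R": "CGT",
    "S": "TCA", "T": "ACC", "V": "GTG", "W": "TGG", "Y": "TAC",
}


def reverse_translate(protein):
    """Map each residue to its single chosen codon and join the codons."""
    try:
        return "".join(CODON_FOR[aa] for aa in protein.upper())
    except KeyError as err:
        raise ValueError(f"unsupported residue: {err.args[0]}") from None


# First ten residues of sfGFP.
print(reverse_translate("MSKGEELFTG"))  # ATGTCAAAAGGTGAAGAACTGTTTACCGGT
```

The output matches the first 30 bases of the sequence above, and a 238-residue protein gives 238 × 3 = 714 bp.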


3.3 Codon Optimization

Why optimize codon usage? Different organisms prefer different codons for the same amino acid, based on tRNA abundance and other factors. Using rare codons can slow translation, cause ribosome stalling, and reduce protein yield. Codon optimization replaces codons with those most frequently used in the target organism, improving expression levels and folding. It also allows us to avoid restriction enzyme recognition sites (e.g., BsaI, BsmBI, BbsI) that would interfere with Golden Gate or other assembly methods.

Organism chosen: Escherichia coli (K-12)

Why E. coli? It is the standard workhorse for recombinant protein expression: well-characterized genetics, fast growth, simple culture, and widely available vectors and protocols. The HTGAA Part 4 exercise uses E. coli for the sfGFP expression cassette, so optimizing for E. coli keeps the workflow consistent.

Tool used: Twist Bioscience Codon Optimization Tool, avoiding the Type IIS sites BsaI, BsmBI, and BbsI as recommended.

Codon-optimized DNA sequence (for E. coli):

ATGAGCAAAGGAGAAGAACTTTTCACTGGAGTTGTCCCAATTCTTGTTGAATTAGATGGTGATGTTAATGGGCACAAATTTTCTGTCCGTGGAGAGGGTGAAGGTGATGCTACAAACGGAAAACTCACCCTTAAATTTATTTGCACTACTGGAAAACTACCTGTTCCGTGGCCAACACTTGTCACTACTCTGACCTATGGTGTTCAATGCTTTTCCCGTTATCCGGATCACATGAAACGGCATGACTTTTTCAAGAGTGCCATGCCCGAAGGTTATGTACAGGAACGCACTATATCTTTCAAAGATGACGGGACCTACAAGACGCGTGCTGAAGTCAAGTTTGAAGGTGATACCCTTGTTAATCGTATCGAGTTAAAGGGTATTGATTTTAAAGAAGATGGAAACATTCTTGGACACAAACTCGAGTACAACTTTAACTCACACAATGTATACATCACGGCAGACAAACAAAAGAATGGAATCAAAGCTAACTTCAAAATTCGCCACAACGTTGAAGATGGTTCCGTTCAACTAGCAGACCATTATCAACAAAATACTCCAATTGGCGATGGCCCTGTCCTTTTACCAGACAACCATTACCTGTCGACACAATCTGTCCTTTCGAAAGATCCCAACGAAAAGCGTGACCACATGGTCCTTCTTGAGTTTGTAACTGCTGCTGGGATTACACATGGCATGGATGAGCTCTACAAA

(714 bp as listed, or 717 bp once a stop codon is appended; optimized for E. coli expression and free of BsaI, BsmBI, and BbsI sites; same sequence used in the Part 4 expression cassette)
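
The "no Type IIS sites" constraint is easy to check by script. The sketch below is a minimal Python check, not the algorithm the Twist tool uses: it looks for each recognition sequence and its reverse complement, since these sites are not palindromic and can sit on either strand. Function names are illustrative assumptions.

```python
# Recognition sequences (5'->3') for the three Type IIS enzymes named above.
TYPE_IIS_SITES = {"BsaI": "GGTCTC", "BsmBI": "CGTCTC", "BbsI": "GAAGAC"}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_type_iis_sites(seq):
    """Return {enzyme: [(position, strand), ...]} for every site found."""
    seq = seq.upper()
    hits = {}
    for name, site in TYPE_IIS_SITES.items():
        # "+" means the site reads 5'->3' on the given strand, "-" on the other.
        for strand, pattern in (("+", site), ("-", reverse_complement(site))):
            start = seq.find(pattern)
            while start != -1:
                hits.setdefault(name, []).append((start, strand))
                start = seq.find(pattern, start + 1)
    return hits


print(find_type_iis_sites("AAGGTCTCAA"))  # {'BsaI': [(2, '+')]}
print(find_type_iis_sites("ATGAGCAAAGGAGAAGAACTT"))  # {}
```

An empty dictionary for the full codon-optimized sequence would confirm that it is safe for Golden Gate assembly with these three enzymes.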


3.4 You Have a Sequence! Now What?

Technologies to produce sfGFP from this DNA:

  1. Cell-dependent (recombinant expression in E. coli):

    • Clone the codon-optimized gene into an expression vector (e.g., pTwist Amp High Copy) with a constitutive or inducible promoter (e.g., BBa_J23106), RBS (e.g., BBa_B0034), and terminator (e.g., BBa_B0015).
    • Transform the plasmid into E. coli (e.g., DH5α, BL21).
    • Grow cells; the host RNA polymerase transcribes the DNA into mRNA, and ribosomes translate the mRNA into sfGFP.
    • The protein folds and forms its chromophore; cells fluoresce green under blue light (~488 nm excitation, ~510 nm emission).
  2. Cell-free (in vitro transcription–translation):

    • Use a cell-free system (e.g., E. coli lysate, PURE system) with the DNA template.
    • Add NTPs, amino acids, and energy sources; the system transcribes and translates the gene without living cells.
    • Useful for rapid prototyping, toxic proteins, or when cell growth is impractical.
  3. DNA synthesis (Twist, IDT, etc.):

    • Order the gene as a clonal or linear fragment from a synthesis provider.
    • Use it directly for cloning or cell-free expression, avoiding PCR or cloning from natural sources.

Flow: DNA → (RNA polymerase) → mRNA → (ribosomes + tRNAs + amino acids) → polypeptide → (folding + chromophore formation) → fluorescent sfGFP.
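
The first two arrows of this flow (DNA → mRNA → polypeptide) can be mimicked in Python. This is a simplified sketch using the standard genetic code: it assumes the input is the coding strand starting at the start codon, and it ignores promoters, ribosome binding, and folding. The function names are illustrative assumptions.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def transcribe(dna):
    """Coding-strand DNA to mRNA: same sequence with T replaced by U."""
    return dna.upper().replace("T", "U")


def translate(mrna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = mrna.upper().replace("U", "T")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the codon-optimized sfGFP sequence, plus a TAA stop.
cds = "ATGAGCAAAGGAGAAGAACTTTTCACTGGATAA"
print(transcribe(cds)[:9])        # AUGAGCAAA
print(translate(transcribe(cds)))  # MSKGEELFTG
```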


3.5 [Optional] How Does It Work in Nature?

Alignment of DNA, RNA, and protein: In the Central Dogma, DNA is transcribed to RNA (T→U), and RNA is translated to protein (3 nt → 1 aa). Tools like Benchling or Ronan’s gel art site can visualize this alignment.

Single gene → multiple proteins: Alternative splicing (eukaryotes) or alternative start codons/ribosomal frameshifting can produce multiple proteins from one gene. sfGFP is a single open reading frame, but in general, one gene can yield multiple isoforms through these mechanisms.

Part 4: Prepare a Twist DNA Synthesis Order


Practice exercise: building an sfGFP expression cassette in Benchling, preparing a mock Twist order, and annotating the plasmid.


4.1–4.2 Accounts & Build Your DNA Insert Sequence

Created Twist and Benchling accounts. Built the sfGFP expression cassette in Benchling with annotated parts:

  • Promoter (BBa_J23106)
  • RBS (BBa_B0034)
  • Start codon (ATG)
  • Coding sequence (codon-optimized sfGFP from Part 3)
  • 7× His tag
  • Stop codon (TAA)
  • Terminator (BBa_B0015)
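
The same part order can be assembled programmatically. The Python sketch below concatenates the parts and records 1-based coordinates for each, which is the information entered by hand as Benchling annotations. All part sequences here are short hypothetical placeholders, not the real registry or sfGFP sequences; the function name is also an assumption.

```python
# Part sequences are short HYPOTHETICAL placeholders, not the real registry
# sequences; substitute the sequences copied from Benchling before use.
PARTS = [
    ("promoter (BBa_J23106)", "TTGACAGCTAGC"),
    ("RBS (BBa_B0034)", "AAAGAGGAGAAA"),
    ("start codon", "ATG"),
    ("sfGFP CDS (placeholder, without ATG)", "AGCAAAGGAGAA"),
    ("7x His tag", "CAT" * 7),
    ("stop codon", "TAA"),
    ("terminator (BBa_B0015)", "CCAGGCATCAAA"),
]


def assemble(parts):
    """Concatenate parts in order; return (sequence, [(name, start, end), ...])."""
    pieces, annotations, position = [], [], 1
    for name, seq in parts:
        # Coordinates are 1-based and inclusive, as in a sequence-map view.
        annotations.append((name, position, position + len(seq) - 1))
        pieces.append(seq)
        position += len(seq)
    return "".join(pieces), annotations


cassette, features = assemble(PARTS)
for name, start, end in features:
    print(f"{start:>3}-{end:<3} {name}")
print(len(cassette), "bp")  # 75 bp for these placeholders
```

A useful sanity check on any real cassette is that the span from the start codon through the stop codon is a multiple of 3, so the His tag and stop codon stay in frame.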

Proof of Annotation in Benchling

Benchling sequence link: sfGFP_expression_cassette · Benchling

Screenshot: Annotated Sequence Map in Benchling

The sequence map shows the sfGFP expression cassette (924 bp) with promoter, RBS, and sfGFP CDS annotated, plus restriction enzyme cut sites.

Benchling sfGFP expression cassette: sequence map and linear map with annotated promoter (BBa_J23106), RBS (BBa_B0034), sfGFP CDS, and restriction enzyme sites

Screenshot: Circular Plasmid Map (sfGFP in pTwist Amp High Copy)

The full construct (3145 bp) in pTwist Amp High Copy, with insert, source, AmpR promoter, and vector backbone annotated.

Note: The color choices for the plasmid annotations are a reflection of my cringe-worthy color skills; consider yourself warned.

Circular plasmid map: sfGFP_expression_cassette in pTwist Amp High Copy with annotated regions and restriction enzyme sites

4.3–4.6 Twist Order Flow

  • Selected Genes → Clonal Genes on Twist
  • Uploaded FASTA (sfGFP expression cassette)
  • Chose vector: pTwist Amp High Copy from Twist Vector Catalog
  • Downloaded GenBank construct and imported into Benchling

Screenshot: Sequence Upload to Twist

Twist Genes: HTGAA-Wk-2 upload interface showing sfgfp_expression_cassette successfully uploaded

Design Notes: Manual vs. Programmatic

Efficiency: Designing expression cassettes and plasmids can be far more efficient with Python and/or R. Tools like DNA Chisel, PyDNA, or SynBioHub enable scripted design, validation, and export. Batch operations, automated codon optimization, and constraint checking become straightforward.

Learning value: Building the construct manually in Benchling (clicking through each part, copying sequences, and annotating by hand) offers a different kind of learning. You develop intuition for how promoters, RBSs, and CDSs fit together, where restriction sites fall, and what the plasmid “looks like” at each step. That tactile understanding is harder to get from a script. For a first expression cassette, the manual approach is worth the extra time.

    MANUAL (Benchling)               PROGRAMMATIC (Python/R)
    ─────────────────                ─────────────────────
    Click, paste, annotate           Script → design → export
    Slow, one construct at a time    Fast, many constructs
    Deep, tactile understanding      Scalable, reproducible
    "I built this"                   "I designed 50 of these"
    
    Both have their place. Start manual; scale with code.

Documented Deliverables

| Item | Status |
| --- | --- |
| Desired Twist cloning vector | pTwist Amp High Copy |
| Fully annotated Benchling insert fragment | sfGFP_expression_cassette |
| GenBank construct imported | ✓ |

Part 5: DNA Read, Write, & Edit


Answers framed around the BioVolt DIY electroporation pipeline: plasmid amplification → transformation → PCR verification → gel electrophoresis. What DNA would we read, write, and edit to make this frugal pipeline sing?

     ╔═══════════════════════════════════════════════════════════════╗
     ║  🧬 THE CENTRAL DOGMA MEETS BIOVOLT 🧬                         ║
     ║                                                               ║
     ║     READ          WRITE         EDIT                          ║
     ║       │              │             │                          ║
     ║       ▼              ▼             ▼                          ║
     ║   [Sequence]   [Synthesize]   [CRISPR]                        ║
     ║       │              │             │                          ║
     ║       └──────────────┼─────────────┘                          ║
     ║                      │                                        ║
     ║                      ▼                                        ║
     ║            ⚡ BIOVOLT ZAPS IT IN ⚡                             ║
     ║                 (E. coli glows green)                         ║
     ╚═══════════════════════════════════════════════════════════════╝

5.1 DNA Read

(i) What DNA would you want to sequence and why?

In the BioVolt pipeline: We transform E. coli with plasmids (e.g., the sfGFP expression cassette) by electroporation, then run post-transformation PCR and gel electrophoresis to infer success. A band of the expected size still doesn’t tell us the exact sequence. Sequencing the plasmid (or PCR amplicon) confirms that:

  • The insert is correct (no truncations, no wrong gene)
  • Electroporation didn’t introduce mutations (high voltage can stress DNA)
  • The expression cassette is intact for downstream experiments

Broader applications (aligned with BioVolt’s democratization goals):

  • Environmental monitoring: e.g., sewage/wastewater DNA for microbiome analysis in Panama; biodiversity surveys
  • Human health: disease-associated genes, pharmacogenomics
  • DNA data storage: archival sequences in synthetic DNA
  • Biobank validation: verifying stored samples

    ┌─────────────────────────────────────────────────────────────┐
    │  BIOVOLT PIPELINE: WHERE SEQUENCING FITS                    │
    │                                                             │
    │   Plasmid ──► PCR amp ──► BioVolt zap ──► Plate ──► Colonies│
    │      │                         │                    │       │
    │      │                         │                    │       │
    │      └─────────────┬───────────┴────────────────────┘       │
    │                    │                                        │
    │                    ▼                                        │
    │              "Did it work?"  ──►  SEQUENCE IT! 🔬           │
    │              (gel = maybe)       (sequence = certainty)     │
    └─────────────────────────────────────────────────────────────┘

(ii) What technology would you use and why?

Technology chosen: Oxford Nanopore (MinION), a third-generation sequencing platform

Why Nanopore for BioVolt / frugal labs:

  • Portable: USB-sized device; runs on a laptop; fits in a backpack. Ideal for Panama, field sites, or home labs.
  • Real-time: base calling as reads stream; no batch wait.
  • Long reads: can span full plasmids; fewer assembly gaps.
  • Low capital: compared to Illumina, much cheaper to get started.
  • No PCR required for some workflows: direct sequencing of native DNA is possible.

| Question | Answer |
| --- | --- |
| Output? | The raw ionic-current signal is base-called (optionally in real time) into FASTQ files (reads + quality scores); reads can also be stored as BAM or converted to FASTA. |
| Essential steps & base calling? | (1) DNA passes through a nanopore; (2) the bases inside the pore disrupt the ionic current in a characteristic way; (3) a base caller (e.g., Guppy) converts current traces → A/T/G/C; (4) reads are assembled or compared to a reference. |
| Input & preparation? | Option A (PCR amplicon): PCR product → end-prep → adapter ligation → load onto flow cell. Option B (native): fragment DNA (e.g., g-TUBE or sonication) → repair ends → adapter ligation → load. Key: adapters enable the motor protein to thread DNA through the pore. |
| First-, second-, or third-generation? | Third-generation. Single-molecule, real-time; no amplification required for some library preps; long reads; portable form factor. |
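
Since the output is FASTQ, a first look at a run can be done with a few lines of Python. This is a minimal sketch that assumes the common four-lines-per-record layout and Phred+33 quality encoding; the two example reads are made up, and averaging Phred scores arithmetically is a simplification that dedicated QC tools refine.

```python
import io


def read_fastq(handle):
    """Yield (read_id, sequence, quality_string) from a 4-line-per-record FASTQ."""
    while True:
        header = handle.readline().rstrip()
        if not header:
            return
        sequence = handle.readline().rstrip()
        handle.readline()  # the '+' separator line
        quality = handle.readline().rstrip()
        yield header[1:], sequence, quality


def mean_phred(quality):
    """Mean Phred score of a read, assuming Phred+33 ASCII encoding."""
    return sum(ord(ch) - 33 for ch in quality) / len(quality)


# Two made-up reads: 'I' encodes Q40 and '5' encodes Q20.
example = io.StringIO("@read1\nATGAGCAAAG\n+\nIIIIIIIIII\n@read2\nGGAGA\n+\n55555\n")
records = list(read_fastq(example))
for read_id, sequence, quality in records:
    print(read_id, len(sequence), mean_phred(quality))
```

For a real run, `open("reads.fastq")` (a hypothetical filename) would replace the `io.StringIO` example.
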

         NANOPORE SEQUENCING (simplified)
         
              ╭────╮
    DNA ────► │ ▓▓ │  ← pore in membrane
              │ ▓▓ │     (ionic current changes per base)
              ╰────╯
                 │
                 ▼
           ╔═══════════╗
           ║  A T G C  ║  ← base caller (Guppy, etc.)
           ║  ▓ ▓ ▓ ▓  ║     converts squiggle → sequence
           ╚═══════════╝

5.2 DNA Write

(i) What DNA would you want to synthesize and why?

For BioVolt: The expression cassettes we electroporate! Specifically:

  • sfGFP plasmid: promoter + RBS + sfGFP CDS + terminator (e.g., BBa_J23106, BBa_B0034, sfGFP, BBa_B0015). This is the “make E. coli glow green” construct we build in Part 4.
  • Custom reporters: e.g., biosensors that fluoresce in response to environmental cues (pH, metals, toxins) for citizen-science monitoring.
  • Validation controls: known sequences for PCR/gel positive controls in the frugal pipeline.

Broader: Therapeutics (mRNA vaccines), genetic circuits, DNA origami, gene clusters for metabolic engineering.

    WHAT WE SYNTHESIZE FOR BIOVOLT:
    
    ┌────────────────────────────────────────────────────────────┐
    │  [Promoter]─[RBS]─[ATG]─[sfGFP]─[His]─[TAA]─[Terminator]   │
    │       │                    │                               │
    │       └── always on        └── glows green under UV        │
    │                                                            │
    │  Twist / IDT makes this. BioVolt zaps it in. Done. 🟢      │
    └────────────────────────────────────────────────────────────┘

(ii) What technology would you use and why?

Technology: Phosphoramidite oligo synthesis followed by gene assembly (e.g., Twist Bioscience, IDT), the industry standard for gene synthesis. Classic synthesizers run this chemistry on columns; Twist runs it on a silicon chip array.

Why: High fidelity, scalable, cost-effective for genes and gene fragments. Twist can deliver clonal genes (circular) ready for transformation, which is perfect for BioVolt.

| Question | Answer |
| --- | --- |
| Limitations? | Speed: days to weeks. Accuracy: ~1 error per 1–3 kb; may need sequencing to confirm. Scalability: great for genes; whole genomes get expensive. Length: very long constructs may need assembly. |
| Essential steps? | (1) Design sequence (e.g., codon-optimized); (2) split into overlapping oligos; (3) synthesize oligos (phosphoramidite chemistry, base-by-base); (4) assemble oligos (PCR, Gibson, or enzymatic); (5) clone into vector; (6) sequence to verify. |
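
Step (2), splitting a gene into overlapping oligos, can be sketched in Python as follows. This is a simplified illustration and not how Twist or IDT design their oligos: it keeps every oligo on the top strand, uses a fixed hypothetical length and overlap (60 nt and 20 nt), and ignores melting temperature and secondary structure.

```python
def split_into_oligos(gene, oligo_len=60, overlap=20):
    """Tile a gene with top-strand oligos that share `overlap` bases with the next."""
    if not 0 < overlap < oligo_len:
        raise ValueError("overlap must be between 1 and oligo_len - 1")
    step = oligo_len - overlap
    oligos, start = [], 0
    while True:
        oligos.append(gene[start:start + oligo_len])
        if start + oligo_len >= len(gene):
            break  # this oligo reached the end of the gene
        start += step
    return oligos


def reassemble(oligos, overlap=20):
    """Rebuild the gene by dropping the shared overlap from each later oligo."""
    return oligos[0] + "".join(oligo[overlap:] for oligo in oligos[1:])


gene = "ACGT" * 25  # hypothetical 100-bp stand-in for a real gene
oligos = split_into_oligos(gene)
print(len(oligos), [len(o) for o in oligos])
print(reassemble(oligos) == gene)
```

With these settings a 714-bp gene tiles into 18 oligos. Real assembly designs usually alternate strands so that neighboring oligos anneal to each other through the overlaps.
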

    PHOSPHORAMIDITE SYNTHESIS (cartoon)
    
    Base + Base + Base + ...  →  oligo  →  assemble  →  gene
    
        A   T   G   C   A   T   ...
        │   │   │   │   │   │
        ▼   ▼   ▼   ▼   ▼   ▼
    ┌───┴───┴───┴───┴───┴───┴───────────┐
    │  ████ ████ ████ ████ ████         │  ← solid support (column)
    │  deblock → couple → cap → oxidize │  (one cycle per added base)
    └───────────────────────────────────┘

5.3 DNA Edit

(i) What DNA would you want to edit and why?

For BioVolt:

  • Improve electroporation efficiency: edit E. coli to knock out or modify genes that affect membrane composition, cell wall, or DNA repair (e.g., recA, mutS) to get more transformants per zap.
  • Biosensor chassis: edit strains to express reporter circuits (e.g., GFP under a metal-responsive promoter) for environmental sensing in the DIY pipeline.
  • Safety: auxotrophic markers, kill switches, or containment edits for responsible DIYbio.

Broader: Human therapeutics (e.g., sickle cell), agriculture (nitrogen fixation, disease resistance), conservation (genetic rescue), longevity research.

    EDIT E. coli FOR BETTER BIOVOLT TRANSFORMATION?
    
         Wild-type E. coli              Edited E. coli
              │                              │
              │  "Membrane too tough"        │  "Softer membrane?"
              │  "DNA repair too good?"      │  "Fewer repair enzymes?"
              │                              │
              ▼                              ▼
         ⚡ BioVolt ⚡                  ⚡ BioVolt ⚡
              │                              │
              ▼                              ▼
         10³ CFU/µg                    10⁵ CFU/µg?  🎯
              │                              │
            "Meh"                      "Now we're talking!"

(ii) What technology would you use and why?

Technology: CRISPR/Cas9 (with HDR for precise edits), or base editors for single-nucleotide changes without double-strand breaks.

Why: Programmable, precise, widely adopted. gRNA design is straightforward; many tools (Benchling, etc.) support it.

| Question | Answer |
| --- | --- |
| Limitations? | Efficiency: not 100%; mixed populations. Precision: off-target cuts possible; PAM requirement constrains target sites. Delivery: need to get Cas9 + gRNA into cells (electroporation works!). |
| Preparation & input? | Design: gRNA(s) targeting locus; donor template (ssODN or plasmid) for HDR. Input: DNA template, Cas9 nuclease, gRNA (or plasmid expressing both), cells. Optional: base editor (e.g., ABE, CBE) for point mutations. |
| Essential steps? | (1) Design gRNA (avoid off-targets; check PAM, e.g., NGG for SpCas9); (2) deliver Cas9 + gRNA + donor (electroporation, conjugation, etc.); (3) Cas9 cuts DNA; (4) cell repairs via NHEJ or HDR; (5) screen for edits (PCR, sequencing). |

    CRISPR/Cas9 IN ACTION (simplified)
    
    gRNA:  "Find this sequence"  ───┐
                                    ├──►  Cas9  ──►  CUT! ✂️
    DNA:   ...TARGET...PAM...    ──┘
    
    Before:  ────[TARGET]────
    After:   ────╲     ╱────   (cell repairs: NHEJ or HDR)
                  ╲   ╱
                   gap
    
    BioVolt could deliver Cas9 RNP + donor via electroporation! ⚡
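
The "check PAM" part of gRNA design can be sketched in Python. The function below lists candidate SpCas9 targets, meaning a 20-nt protospacer immediately followed by an NGG PAM. It is a minimal illustration only: it scans just the given strand (run it on the reverse complement for the other strand) and does no off-target scoring, which design tools such as Benchling add on top. The function name is an assumption.

```python
import re


def find_spcas9_targets(seq, spacer_len=20):
    """Return (start, protospacer, pam) for every protospacer followed by NGG."""
    seq = seq.upper()
    # Lookahead so that overlapping candidate sites are all reported.
    pattern = re.compile(rf"(?=([ACGT]{{{spacer_len}}})([ACGT]GG))")
    return [(m.start(), m.group(1), m.group(2)) for m in pattern.finditer(seq)]


# Made-up 25-nt sequence with a single valid target.
demo = "A" * 20 + "TGG" + "CC"
for start, protospacer, pam in find_spcas9_targets(demo):
    # SpCas9 typically cuts about 3 bp upstream of the PAM.
    cut_index = start + len(protospacer) - 3
    print(start, protospacer, pam, cut_index)
```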

Summary: Read, Write, Edit → BioVolt

    ╔════════════════════════════════════════════════════════════════╗
    ║                     BIOVOLT + DNA TOOLKIT                      ║
    ║                                                                ║
    ║   WRITE (Twist)     ──►  plasmid with sfGFP                    ║
    ║         │                                                      ║
    ║         ▼                                                      ║
    ║   EDIT (optional)   ──►  tune E. coli for better zapping       ║
    ║         │                                                      ║
    ║         ▼                                                      ║
    ║   ⚡ BIOVOLT ⚡     ──►  transform cells                         ║
    ║         │                                                      ║
    ║         ▼                                                      ║
    ║   READ (Nanopore)   ──►  confirm plasmid sequence              ║
    ║                                                                ║
    ║   Result: Frugal, validated, democratized synthetic biology.   ║
    ╚════════════════════════════════════════════════════════════════╝