Homework

Weekly homework submissions:

  • Week 1 HW: Principles and Practices

    AI Cite Prompts were directly based on the homework questions provided. Homework Questions from Professor Jacobson 1. What is the error rate of polymerase? How does this compare to the human genome length, and how does biology address the discrepancy? Polymerase error rate: ~10⁻⁵ per base without proofreading; ~10⁻⁷–10⁻⁸ with proofreading; ~10⁻⁹–10⁻¹⁰ with mismatch repair.

  • Week 2 HW: DNA Read, Write, & Edit

    3.1 mCherry I was thinking about observation under UV illumination — red fluorescence seems easier to detect compared to blue or green light. It also has stronger visual impact. Protein Sequence MVSKGEEDNMAIIKEFMRFKVHMEGSVNGHEFEIEGEGEGRPYEGTQTAKLKVTKGGPLPFAWDILSPQFMYGSKAYVKHPADIPDYLKLSFPEGFKWERVMNFEDGGVVTVTQDSSLQDGEFIYKVKLRGTNFPSDGPVMQKKTMGWEASSERMYPEDGALKGEIKQRLKLKDGGHYDAEVKTTYKAKKPVQLPGAYNVNIKLDITSHNEDYTIVEQYERAEGRHSTGGMDELYK Source: https://www.fpbase.org/protein/mcherry/ 3.2 Original DNA Sequence atggtgagcaaaggcgaagaagataacatggcgattattaaagaatttatgcgctttaaa gtgcatatggaaggcagcgtgaacggccatgaatttgaaattgaaggcgaaggcgaaggc cgcccgtatgaaggcacccagaccgcgaaactgaaagtgaccaaaggcggcccgctgccg tttgcgtgggatattctgagcccgcagtttatgtatggcagcaaagcgtatgtgaaacat ccggcggatattccggattatctgaaactgagctttccggaaggctttaaatgggaacgc gtgatgaactttgaagatggcggcgtggtgaccgtgacccaggatagcagcctgcaggat ggcgaatttatttataaagtgaaactgcgcggcaccaactttccgagcgatggcccggtg atgcagaaaaaaaccatgggctgggaagcgagcagcgaacgcatgtatccggaagatggc gcgctgaaaggcgaaattaaacagcgcctgaaactgaaagatggcggccattatgatgcg gaagtgaaaaccacctataaagcgaaaaaaccggtgcagctgccgggcgcgtataacgtg aacattaaactggatattaccagccataacgaagattataccattgtggaacagtatgaa cgcgcggaaggccgccatagcaccggcggcatggatgaactgtataaa 3.3 Codon Optimization Why Optimize? The same amino acid can be encoded by multiple codons, but different organisms have different codon usage preferences (tRNA abundance, translation efficiency, mRNA structure, etc.). To allow the host to express the protein more efficiently, codon optimization is necessary.

  • Week 3 HW: AUTOMATION

    1. Published paper Villanueva-Cañas et al., PLOS ONE (2021) built a multi-station SARS-CoV-2 RT-qPCR testing workflow using Opentrons OT-2 robots. The core novelty is a reusable software + station architecture that makes a complex diagnostic pipeline programmable, modular, and reproducible across setups.
  1. Final project automation plan Project: “Living Ice Cream” A temperature-responsive dessert system with:
  • Week 4 HW: Protine Design

    Protein & Amino Acid Questions 1) How many molecules of amino acids are in 500 g of meat? (Assume average amino acid ≈ 100 Da ≈ 100 g/mol) Upper-bound estimate: treat the 500 g as entirely amino acids. Moles of amino acids ≈ 500 g / (100 g/mol) = 5 mol Number of molecules ≈ 5 mol × 6.02 × 10²³ molecules/mol ≈ 3.0 × 10²⁴ amino acid molecules 2) Why do humans eat beef but do not become a cow? Dietary proteins are not incorporated intact into our bodies.

  • Week 9 HW: Cell-Free Systems

    General homework questions 1. Explain the main advantages of cell-free protein synthesis over traditional in vivo methods, specifically in terms of flexibility and control over experimental variables. Name at least two cases where cell-free expression is more beneficial than cell production. Cell-free protein synthesis has two major advantages over traditional in vivo expression: flexibility and experimental control.

Subsections of Homework

Week 1 HW: Principles and Practices

cover cover cover cover cover cover

AI Cite

Prompts were directly based on the homework questions provided.


Homework Questions from Professor Jacobson

1. What is the error rate of polymerase? How does this compare to the human genome length, and how does biology address the discrepancy?

Polymerase error rate:
~10⁻⁵ per base without proofreading; ~10⁻⁷–10⁻⁸ with proofreading; ~10⁻⁹–10⁻¹⁰ with mismatch repair.

Human genome size:
~3 × 10⁹ base pairs.

Biological solutions:
Proofreading, mismatch repair, diploidy, and natural selection.


2. How many different DNA codes can encode an average human protein? Why don’t all of them work in practice?

Theoretical number of encodings:
Due to codon degeneracy, typically 10³–10⁶+ possible sequences.

Practical constraints:

  • Codon bias and tRNA availability
  • mRNA secondary structure
  • GC content and sequence stability
  • Regulatory motifs (e.g., splicing, translation signals)
  • Error accumulation during synthesis and replication

Homework Questions from Dr. LeProust

3. What is the most commonly used method for oligo synthesis today?

Phosphoramidite solid-phase synthesis.


4. Why is it difficult to synthesize oligos longer than ~200 nt directly?

  • Each coupling step is less than 100% efficient
  • Errors accumulate linearly with length
  • Yield and purity drop exponentially

5. Why can’t a 2000 bp gene be made by direct oligo synthesis?

  • Error rates become prohibitive
  • Full-length product yield approaches zero
  • Long genes must be assembled from shorter oligos (e.g., Gibson assembly, PCA)

Homework Question from George Church

What are the 10 essential amino acids in all animals?

The ten essential amino acids that animals cannot synthesize de novo and must obtain from diet are:

Histidine, Isoleucine, Leucine, Lysine, Methionine, Phenylalanine, Threonine, Tryptophan, Valine, and Arginine
(Arginine is conditionally essential in adults but universally essential during growth.)


How does this affect the “Lysine Contingency”?

Lysine’s essentiality reflects a deep evolutionary constraint: animals universally lost lysine biosynthesis pathways, making them metabolically dependent on external sources. This supports the “lysine contingency” as a system-level lock-in rather than an arbitrary biochemical choice. Once lysine synthesis was abandoned, translational machinery, diet, and ecological dependencies co-evolved around its availability, making reversal highly unlikely. Thus, lysine exemplifies how early metabolic decisions constrain future evolutionary trajectories.

Week 2 HW: DNA Read, Write, & Edit


3.1 mCherry

I was thinking about observation under UV illumination — red fluorescence seems easier to detect compared to blue or green light. It also has stronger visual impact.

Protein Sequence

MVSKGEEDNMAIIKEFMRFKVHMEGSVNGHEFEIEGEGEGRPYEGTQTAKLKVTKGGPLPFAWDILSPQFMYGSKAYVKHPADIPDYLKLSFPEGFKWERVMNFEDGGVVTVTQDSSLQDGEFIYKVKLRGTNFPSDGPVMQKKTMGWEASSERMYPEDGALKGEIKQRLKLKDGGHYDAEVKTTYKAKKPVQLPGAYNVNIKLDITSHNEDYTIVEQYERAEGRHSTGGMDELYK

Source: https://www.fpbase.org/protein/mcherry/


3.2 Original DNA Sequence

atggtgagcaaaggcgaagaagataacatggcgattattaaagaatttatgcgctttaaa
gtgcatatggaaggcagcgtgaacggccatgaatttgaaattgaaggcgaaggcgaaggc
cgcccgtatgaaggcacccagaccgcgaaactgaaagtgaccaaaggcggcccgctgccg
tttgcgtgggatattctgagcccgcagtttatgtatggcagcaaagcgtatgtgaaacat
ccggcggatattccggattatctgaaactgagctttccggaaggctttaaatgggaacgc
gtgatgaactttgaagatggcggcgtggtgaccgtgacccaggatagcagcctgcaggat
ggcgaatttatttataaagtgaaactgcgcggcaccaactttccgagcgatggcccggtg
atgcagaaaaaaaccatgggctgggaagcgagcagcgaacgcatgtatccggaagatggc
gcgctgaaaggcgaaattaaacagcgcctgaaactgaaagatggcggccattatgatgcg
gaagtgaaaaccacctataaagcgaaaaaaccggtgcagctgccgggcgcgtataacgtg
aacattaaactggatattaccagccataacgaagattataccattgtggaacagtatgaa
cgcgcggaaggccgccatagcaccggcggcatggatgaactgtataaa

3.3 Codon Optimization

Why Optimize?

The same amino acid can be encoded by multiple codons, but different organisms have different codon usage preferences (tRNA abundance, translation efficiency, mRNA structure, etc.). To allow the host to express the protein more efficiently, codon optimization is necessary.

Organism Selected

E. coli K-12.

It is one of the most commonly used host strains. The technical maturity and widespread adoption of this system make it highly suitable for experimental work.


Optimized DNA Sequence

ATGGTGAGCAAAGGCGAAGAAGATAACATGGCGATTATTAAAGAATTTATGCGTTTTAAA
GTGCATATGGAAGGCAGCGTGAACGGCCATGAATTTGAAATTGAAGGCGAAGGCGAAGGC
CGTCCGTATGAAGGCACCCAGACCGCGAAACTGAAAGTGACCAAAGGCGGCCCGCTGCCG
TTTGCGTGGGATATTCTGAGCCCGCAGTTTATGTATGGCAAAGCGTATGTGAAACATCCG
GCGGATATTCCGGATTATCTGAAACTGAGCTTTCCGGAAGGCTTTAAATGGGAACGCGTG
ATGAACTTTGAAGATGGCGGCGTGGTGACCGTGACCCAGGATAGCAGCCTGCAGGATGGC
GAATTTATTTATAAAGTGAAACTGCGCGGCACCAACTTTCCGAGCGATGGCCCGGTGATG
CAGAAAAAAACCATGGGCTGGGAAGCGAGCAGCGAAACGCGTGTATCCGGAAGATGGCGCG
CTGAAAGGCGAAATTAAACAGCGCCTGAAACTGAAAGATGGCGGCCATTATGATGCGGAAG
TGAAAACCCACCTATAAAGCGAAAAAACCGGTGCAGCTGCCGGGCGCGTATAACGTGAACA
TTAAACTGGATATTACCAGCCATAACGAAGATTATACCATTGTGGAACAGTATGAACGTGC
GGAAGGCCGTCATAGCACCGGCGGCATGGATGAACTGTATAAATAA

Source: https://www.kazusa.or.jp/codon/cgi-bin/showcodon.cgi?aa=1&species=83333&style=N&utm


3.4 Cell-Dependent Expression

Goal

To have E. coli transcribe this DNA into mRNA and translate it into a fluorescent protein.

Method

Insert the optimized CDS into an expression plasmid including:

  • Promoter
  • RBS
  • Terminator

Transform the plasmid into E. coli.

In E. coli:

  • RNA polymerase recognizes the promoter
  • DNA is locally unwound
  • Complementary mRNA is synthesized

The ribosome binds via the Shine–Dalgarno sequence and begins translation at the start codon.

  • One codon (3 nucleotides) is read at a time
  • tRNAs deliver corresponding amino acids
  • Amino acids are linked into a polypeptide chain
  • Final product: fluorescent protein

4.1

/


4.2

https://benchling.com/s/seq-7VHbcY2Zp8vixGlBz1td?m=slm-7HxrEb6QjOZ58RtT8p96

(There’s unknown error for the optimization version,,,) cover cover cover cover


5.1 DNA Read

5.1(i) What DNA Would I Sequence?

A synthetic DNA library used for digital DNA data storage.

Artificially designed DNA fragments encoding digital information (text or images).

Why?

DNA here serves as an information storage medium rather than biological genetic material.

Sequencing verifies:

  • Whether the written digital information is preserved
  • Whether errors occurred during storage or amplification
  • The error rate (substitutions, insertions, deletions)

This is effectively a biotechnology-based data integrity check.


5.1(ii) Sequencing Technology

First-generation sequencing: Sanger sequencing.

Why?

  • High accuracy
  • Suitable for validating single fragments

Characteristics

  • Reads one DNA template at a time
  • Read length ~700–900 bp
  • Very high accuracy

Inputs

  • Template DNA
  • Primer
  • DNA polymerase
  • dNTPs
  • Fluorescently labeled ddNTPs

Core Principle

DNA synthesis is terminated at random positions using ddNTPs.

Process:

  1. Polymerase copies the template
  2. ddNTP incorporation stops elongation
  3. Fragments of different lengths are produced
  4. Fragments separated by size
  5. Fluorescent signal read to reconstruct sequence

Output

  • DNA sequence
  • Chromatogram

Limitations

  • Cannot sequence thousands of fragments simultaneously

5.2 DNA Write

5.2(i) What DNA Would I Synthesize?

An expression cassette expressing mCherry in E. coli.

Includes:

  • Promoter
  • RBS
  • Codon-optimized mCherry CDS
  • His tag
  • Terminator

Reason:

  • Produces visible red fluorescence
  • Strong contrast under blue light

5.2(ii) Synthesis Technology

Solid-phase chemical DNA synthesis (phosphoramidite method) + gene assembly.

Steps

  1. Design sequence computationally
  2. Split into short oligos
  3. Chemically synthesize oligos
  4. Assemble via PCR or Gibson Assembly
  5. Clone into vector
  6. Sequence verify

Limitations

  • Error rate increases with length
  • Assembly required
  • Sequencing verification required
  • Cost scales with length

5.3 DNA Edit

5.3(i) What DNA Would I Edit?

A single-base mutation (e.g., disease-causing point mutation).

Reason:

  • Represents the most precise editing scenario
  • Relevant for therapeutic research

5.3(ii) Editing Technology

CRISPR-Cas9 + HDR repair template.

Principle

  1. Design gRNA
  2. Cas9 creates double-strand break
  3. Provide donor DNA
  4. HDR replaces base precisely

Required Inputs

  • gRNA
  • Cas9 protein or plasmid
  • Donor DNA
  • Target cells

Limitations

  • Low HDR efficiency
  • Possible off-target effects
  • Complex delivery
  • Cell-type dependent precision

Week 3 HW: AUTOMATION

1) Published paper

Villanueva-Cañas et al., PLOS ONE (2021) built a multi-station SARS-CoV-2 RT-qPCR testing workflow using Opentrons OT-2 robots. The core novelty is a reusable software + station architecture that makes a complex diagnostic pipeline programmable, modular, and reproducible across setups.


2) Final project automation plan

Project: “Living Ice Cream”

A temperature-responsive dessert system with:

  • Slow “breathing” surface behavior (controlled micro-gas generation)
  • Visual shift (color / glow) near melt-adjacent temperatures

Why Ginkgo automation

I’m using Ginkgo’s autonomous / cloud-lab framing as an iteration engine for high-throughput DOE: stable automation backbone, fast experimental loops, and standardized readouts for repeated screening rounds.

cover cover

What I will automate

A) “Breathing” kinetics screening (high-throughput DOE)

Goal: Find enzyme/substrate + formulation conditions that yield slow, non-violent micro-gas behavior around ~15–25°C.

DOE axes (example)

  • enzyme concentration
  • substrate concentration
  • buffer / pH
  • capsule matrix composition
  • temperature + time

Minimal pseudocode

for cond in DOE_grid:
    dispense(cond.reagents, well)
    incubate(temp=cond.temp, time=cond.time)
    readout = measure_optical_bubble_proxy_or_pressure(well)
    log(readout, cond)

Week 4 HW: Protine Design

Protein & Amino Acid Questions


1) How many molecules of amino acids are in 500 g of meat?

(Assume average amino acid ≈ 100 Da ≈ 100 g/mol)

Upper-bound estimate: treat the 500 g as entirely amino acids.

  • Moles of amino acids ≈ 500 g / (100 g/mol) = 5 mol
  • Number of molecules ≈ 5 mol × 6.02 × 10²³ molecules/mol
  • ≈ 3.0 × 10²⁴ amino acid molecules

2) Why do humans eat beef but do not become a cow?

Dietary proteins are not incorporated intact into our bodies.

During digestion:

  • Proteins are broken down into amino acids (and small peptides).
  • Cells use those amino acids as generic building blocks.
  • Human proteins are synthesized according to human DNA instructions.

Food provides raw materials.
The blueprint comes from our genome and physiology.


3) Why are there only 20 natural amino acids?

Evolution selected 20 genetically encoded amino acids because they balance:

  • Chemical diversity
  • System simplicity

They provide:

  • Hydrophobic residues → core formation
  • Polar/charged residues → solubility & catalysis
  • Special residues:
    • Glycine → flexibility
    • Proline → rigidity
    • Cysteine → disulfide bonds

Adding more amino acids would require:

  • More complex tRNAs
  • More aminoacyl-tRNA synthetases
  • Increased ribosomal complexity
  • Higher error rates
  • Greater biological cost

4) Can you make non-natural amino acids?

Yes.

Non-natural amino acids (nnAAs) can be incorporated via:

  • Engineered tRNA/synthetase systems
  • Chemical synthesis + feeding strategies

They expand protein functionality beyond the natural 20.

Example conceptual designs:

Fluorinated hydrophobic amino acid

  • Leucine-like residue with fluorination
  • Tunes hydrophobicity & packing
  • Can increase stability
  • Can modify binding interfaces

5) If you build an α-helix using D-amino acids, what handedness results?

Natural proteins use L-amino acids → right-handed α-helices.

Using D-amino acids:

  • Produces the mirror image
  • Expected structure: left-handed α-helix

Reason: Switching chirality flips backbone angle preferences and hydrogen-bond geometry.


6) Why are most molecular helices right-handed?

Because biology overwhelmingly uses L-amino acids.

Given L-amino acids:

  • Steric constraints favor right-handed helices
  • Hydrogen-bond geometry stabilizes this orientation

This reflects:

  • Stereochemical constraints
  • Evolutionary fixation of chirality

7) Why do β-sheets aggregate?

β-sheet hydrogen bonding is inherently extendable.

Driving forces:

  • Backbone hydrogen bonding
  • Hydrophobic effect
  • Shape complementarity / stacking

If strand edges are exposed:

  • Additional strands dock
  • Sheets extend
  • Fibrils grow

8) Why do amyloid diseases involve β-sheets?

Misfolded proteins expose backbone segments that form β-strands.

These assemble into:

  • Extended β-sheet-rich fibrils
  • Highly stable, ordered aggregates

Consequences:

  • Cellular toxicity
  • Sequestration of functional proteins
  • Stress on quality-control systems

9) Can amyloid β-sheets be used as materials?

Yes.

Amyloid assemblies are:

  • Mechanically robust
  • Highly ordered
  • Self-assembling

Applications:

  • Nanofiber scaffolds
  • Hydrogels
  • Templates for nanomaterials
  • Functional biomaterials

Selected Protein Analysis


1️⃣ Protein Selection

Protein: NanoLuc luciferase
Derived from: Oplophorus gracilirostris

Function:

  • Catalyzes chemiluminescent reaction
  • Produces bright blue light

Why selected:

  • Core engine for “living / glowing ice cream” concept
  • Converts chemical energy → light
  • Small, bright, structurally characterized
  • Ideal computational design test case

2️⃣ Amino Acid Sequence

(Chain A, PDB 5IBO)

  • SDNMVFTLEDFVGDWRQTAGYNLDQVLEQGQQNLWDLNTEKQQLQKSLQDLKNEEVDMVLNNKSSN GQWFDVVKQKGGFVDGRTKFVGTNQGSLLGYYKDSDQLKTTNIKQVVSTLQGQKIDGTTLTQLNKE VLDNLVTTNPKLREKFQVVHQLLEDTQGTMNQKDTNRTV

3️⃣ Length & Composition

  • Length: 174 amino acids
  • Most frequent residue: Glycine (G)

5️⃣ Protein Family

NanoLuc belongs to:

  • Luciferase family
  • Oxidoreductases

Function:

  • Catalyzes oxidative reactions
  • Chemical energy → photon emission

6️⃣ Structure Page

PDB ID: 5IBO
(RCSB structure page referenced)

cover cover

7️⃣ Structure Quality

  • Method: X-ray crystallography
  • Resolution: 1.95 Å
  • Released: 2016

Resolution < 2.5 Å = high-quality structure
1.95 Å is considered reliable.


8️⃣ Other Molecules in Structure

Yes.
Bound ligand: Decanoic acid

Why this matters:

  • Indicates defined binding pocket
  • Active site accommodates hydrophobic molecules

Design implications for ice cream:

  • Does luciferin need micro-encapsulation?
  • Does fat content affect substrate diffusion?
  • Could dairy fat interact with hydrophobic pocket?

Structural data → formulation constraints.


🔟 PyMOL Visualization

Observation:

  • Beta-sheet dominant structure
  • Consistent with beta-barrel fold

Interpretation:

  • Beta-rich cores = tightly packed
  • Likely structurally stable

Implication: May retain activity under:

  • Slight warming
  • Partial melting
  • Oxygen exposure

1️⃣1️⃣ Residue Surface Analysis

Hydrophobic residues:

  • Cluster in interior

Polar/charged residues:

  • Dominant on surface

Interpretation:

  • Hydrophobic core → structural stability
  • Hydrophilic surface → water compatibility

From ice cream perspective:

  • Ice cream = water + fat + sugar
  • Hydrophilic exterior = favorable
  • Hydrophobic core remains shielded

Encouraging for enzyme stability in food matrix.

Week 9 HW: Cell-Free Systems

General homework questions

1. Explain the main advantages of cell-free protein synthesis over traditional in vivo methods, specifically in terms of flexibility and control over experimental variables. Name at least two cases where cell-free expression is more beneficial than cell production.

Cell-free protein synthesis has two major advantages over traditional in vivo expression: flexibility and experimental control.

First, it is more flexible because it does not require maintaining living cells. That means researchers can rapidly test different DNA templates, reaction conditions, cofactors, salts, temperatures, and additives without worrying about whether the host cell survives. This is especially useful when the protein of interest is toxic, unstable, or difficult for living cells to produce.

Second, it offers much tighter control over experimental variables. In a cell-free system, the researcher can directly define the reaction environment, including magnesium concentration, pH, energy source, template concentration, and molecular chaperones. In contrast, in vivo systems are influenced by the cell’s own metabolism, stress responses, membrane transport, and growth state, which makes it harder to isolate specific causes of success or failure.

At least two cases where cell-free expression is more beneficial than cell production are:

  1. When expressing toxic proteins that would harm or kill living cells.
  2. When doing rapid prototyping or screening, because many conditions can be tested quickly without cloning and culturing cells.

Additional useful cases include membrane protein optimization, incorporation of noncanonical amino acids, and applications where living cells are undesirable for safety or regulatory reasons.


2. Describe the main components of a cell-free expression system and explain the role of each component.

A cell-free expression system typically contains the following major components:

1. Cell lysate or extract
This is the core biological machinery taken from broken cells. It contains ribosomes, translation factors, tRNAs, aminoacyl-tRNA synthetases, and many enzymes needed for transcription and translation.

2. Genetic template (DNA or RNA)
This provides the instructions for the protein to be produced. In many systems, plasmid DNA or linear DNA is used as the template for transcription and translation.

3. Amino acids
These are the building blocks of proteins. They are required for translation.

4. Nucleotides
ATP, GTP, CTP, and UTP are needed for transcription, and ATP/GTP are also consumed during translation and other reaction steps.

5. Energy regeneration system
Because protein synthesis uses large amounts of ATP and GTP, the reaction needs a system to regenerate usable energy and keep the reaction going for longer.

6. Salts and buffer
These maintain the chemical environment needed for the reaction. Magnesium and potassium are especially important because they affect ribosome function and overall expression efficiency.

7. Optional additives
Depending on the application, the system may also include chaperones, disulfide bond helpers, detergents, liposomes, nanodiscs, crowding agents, or protease inhibitors. These help support difficult proteins or special reaction goals.

Together, these parts recreate the minimum biochemical environment needed for gene expression outside a living cell.


3. Why is energy provision regeneration critical in cell-free systems? Describe a method you could use to ensure continuous ATP supply in your cell-free experiment.

Energy regeneration is critical in cell-free systems because transcription and translation consume large amounts of ATP and GTP. Unlike living cells, cell-free systems do not have a full self-sustaining metabolism, so the reaction can quickly stall if energy molecules are depleted. Without continuous energy supply, protein yield drops, reaction time shortens, and the system becomes inefficient.

One method to ensure continuous ATP supply is to use an energy regeneration substrate such as phosphoenolpyruvate (PEP). In this setup, PEP donates phosphate groups that help recycle ATP during the reaction. This extends the active lifetime of the cell-free system and improves protein yield.

Other energy systems, such as creatine phosphate or 3-phosphoglycerate, can also be used, but PEP is a standard and effective example.


4. Compare prokaryotic versus eukaryotic cell-free expression systems. Choose a protein to produce in each system and explain why.

Prokaryotic cell-free systems, such as E. coli-based lysates, are generally faster, cheaper, and easier to optimize. They often provide high yields and are excellent for bacterial proteins or simple reporters. However, they are less suitable for proteins that require complex post-translational modifications or advanced folding machinery.

Eukaryotic cell-free systems, such as wheat germ, rabbit reticulocyte, insect, or mammalian extracts, are better for proteins that need more complex folding environments or eukaryotic-specific processing. These systems are usually more expensive and can be harder to optimize, but they are better suited for many eukaryotic proteins.

A good example for a prokaryotic system would be GFP. GFP is relatively straightforward to express, does not require complicated post-translational modification, and works well in E. coli-based cell-free systems.

A good example for a eukaryotic system would be a mammalian membrane receptor, such as a GPCR fragment or another receptor protein. This type of protein often requires a more eukaryotic-like folding environment and is harder to produce correctly in bacterial systems.

In short, prokaryotic systems are better for speed and simplicity, while eukaryotic systems are better for proteins that need more sophisticated cellular processing.


5. How would you design a cell-free experiment to optimize the expression of a membrane protein? Discuss the challenges and how you would address them in your setup.

To optimize the expression of a membrane protein in a cell-free system, I would design the experiment so that the protein has access to a membrane-like environment during synthesis. I would not express it in a plain aqueous reaction alone, because membrane proteins contain hydrophobic regions that tend to misfold or aggregate in water.

My setup would include:

  • A cell-free expression system
  • The DNA template encoding the membrane protein
  • A membrane-mimicking support such as liposomes, nanodiscs, or suitable detergents
  • A series of optimization conditions, such as different magnesium concentrations, temperatures, template concentrations, and lipid support levels

The main challenges are:

1. Aggregation
Hydrophobic regions may stick together instead of inserting properly into a membrane-like structure.

Response: add liposomes or nanodiscs so the protein has a compatible environment while it is being synthesized.

2. Misfolding
Even if the protein is produced, it may not fold into a functional form.

Response: optimize temperature, reaction rate, and possibly include chaperones.

3. Low functionality despite expression
A membrane protein may be made, but still not behave properly.

Response: evaluate both yield and functional activity, not just total protein amount.

I would screen multiple conditions in parallel and compare both expression level and functionality to identify the best setup.


6. Imagine you observe a low yield of your target protein in a cell-free system. Describe three possible reasons for this and suggest a troubleshooting strategy for each.

A low protein yield in a cell-free system could result from several factors:

Reason 1: Poor template quality or ineffective construct design
The DNA may be degraded, the promoter may be weak, or the sequence may not be optimized for the system.

Troubleshooting strategy:
Verify the DNA sequence, check DNA purity, compare plasmid versus linear DNA, and test different promoter or untranslated region designs.

Reason 2: Suboptimal reaction conditions
The magnesium concentration, potassium concentration, temperature, or pH may not be suitable.

Troubleshooting strategy:
Run a condition matrix varying temperature, Mg²⁺, K⁺, and reaction time to identify a better expression window.

Reason 3: The target protein is intrinsically difficult to express
The protein may aggregate, degrade, or require folding support.

Troubleshooting strategy:
Lower the temperature, add chaperones or stabilizers, shorten the construct if appropriate, or add membrane mimics if the protein is membrane-associated.

These troubleshooting approaches help distinguish whether the problem comes from the template, the reaction environment, or the protein itself.


Homework question from Kate Adamala

7. Design an example of a useful synthetic minimal cell as follows:

Pick a function and describe it.

I would design a synthetic minimal cell that makes an ice cream surface emit a temporary visible glow when it warms up. The goal is to create a dessert that appears to become active only during melting, producing a controlled “living-like” response without using living cells.

What would your synthetic cell do? What is the input and what is the output?

The synthetic cell would detect a warming event near the ice cream surface.

  • Input: an increase in temperature and local melting at the surface
  • Output: a visible luminescent signal

Could this function be realized by cell-free Tx/Tl alone, without encapsulation?

Only partially. A bulk cell-free Tx/Tl reaction could generate a signal, but without encapsulation the system would be more diluted, less protected, and much harder to localize spatially. Encapsulation is useful because it protects the reaction components and keeps the signal confined to specific regions of the dessert.

Could this function be realized by genetically modified natural cell?

In principle, yes. A genetically modified natural cell could potentially be engineered to respond to temperature and generate light. However, this would be less desirable here because using living engineered cells in an edible dessert raises stronger concerns about safety, regulation, storage stability, and user acceptance. A synthetic minimal cell is more appropriate for a controlled design application like this.

Describe the desired outcome of your synthetic cell operation.

The desired outcome is that the ice cream remains visually inactive while frozen, but begins to emit a faint and localized glow when it starts warming. The signal should be short-lived, noticeable, and limited mainly to the exposed surface.

Design all components that would need to be part of your synthetic cell.

The synthetic minimal cell would need:

  • A membrane compartment
  • A cell-free transcription/translation system
  • A reporter module for light production
  • A regulatory mechanism that links warming to activation
  • An energy regeneration module
  • Protective components to improve freeze-thaw stability

What would be the membrane made of?

The membrane could be made from phospholipids plus cholesterol. For example, a POPC and cholesterol membrane would be a reasonable prototype system because it provides a stable artificial membrane structure.

What would you encapsulate inside? Enzymes, small molecules.

Inside the synthetic cell, I would encapsulate:

  • A bacterial cell-free Tx/Tl lysate
  • Amino acids
  • Nucleotides
  • Buffer salts
  • An ATP regeneration system
  • DNA encoding a luminescent reporter
  • Any required small-molecule cofactors or substrates
  • Cryoprotective additives to improve survival during freezing

Which organism your Tx/Tl system will come from? Is bacterial OK, or do you need a mammalian system for some reason?

I would start with a bacterial cell-free system, most likely E. coli-based. A bacterial system is appropriate because it is easier to use, faster to optimize, and sufficient for a proof-of-concept visible reporter system. I would only switch to a mammalian system if the regulatory logic specifically required mammalian transcription factors or eukaryotic response elements.

How will your synthetic cell communicate with the environment? (hint: are substrates permeable? or do you need to express the membrane channel?)

The synthetic cell would communicate with the environment through a warming-triggered change. Ideally, the system would be designed so that increased temperature or local phase change activates the internal reaction. Depending on the final design, this could happen either through passive thermal sensitivity of the encapsulated system or through a membrane property that changes permeability during warming. If needed, a membrane channel or pore could be included in a more advanced design.


Experimental details

8. List all lipids and genes. (bonus: find the specific genes; for example, instead of just saying “small molecule membrane channel” pick the actual gene.)

Lipids:

  • POPC
  • Cholesterol

Genes:

  • A reporter gene for visible output, such as a luciferase-related construct
  • If needed, a membrane pore gene such as alpha-hemolysin (aHL) for controlled transport in a more advanced version
  • If a temperature-sensitive regulatory element is used, the corresponding control sequence would also be included in the construct design

For a concept-stage design, the exact reporter gene can be chosen based on brightness, ease of detection, and compatibility with the cell-free system. For an early proof of concept, a robust reporter is more important than final food compatibility.


9. How will you measure the function of your system?

I would measure the system in stages.

First, I would test the reporter in a standard cell-free reaction and quantify signal output using fluorescence or luminescence measurements.
Second, I would test the same system after encapsulation to see whether the synthetic cells still function.
Third, I would test the encapsulated system after freeze-thaw exposure to evaluate whether activity is preserved.
Finally, I would place the synthetic cells into a model ice cream matrix and monitor signal intensity as the temperature increases.

The key measurements would be:

  • Signal onset temperature
  • Signal brightness
  • Signal duration
  • Spatial localization of the signal
  • Retention of function after freezing

Homework question from Peter Nguyen

10. Freeze-dried cell-free systems can be incorporated into all kinds of materials as biological sensors or as inducible enzymes to modify the material itself or the surrounding environment. Choose one application field — Architecture, Textiles/Fashion, or Robotics — and propose an application using cell-free systems that are functionally integrated into the material. Answer each of these key questions for your proposal pitch:

Write a one-sentence summary pitch sentence describing your concept.

I propose a freeze-dried cell-free architectural coating that becomes visibly activated when water intrusion occurs, allowing walls or building surfaces to function as passive biological leak indicators.

How will the idea work, in more detail? Write 3-4 sentences or more.

The material would contain freeze-dried cell-free reactions embedded in a coating layer or sensing patch. Under normal dry conditions, the system remains inactive and stable. When water enters the material due to leakage or condensation, the reaction becomes hydrated and activates a reporter output, such as a visible color or fluorescence signal. This would allow building managers to detect hidden moisture problems earlier, before severe mold growth or structural damage occurs.

What societal challenge or market need will this address?

This concept addresses the need for early detection of water damage in buildings. Moisture problems are costly, often hidden, and can lead to mold, indoor air quality issues, and expensive repairs. A passive biological leak indicator could reduce inspection delays and improve maintenance efficiency.

How do you envision addressing the limitation of cell-free reactions (e.g., activation with water, stability, one-time use)?

I would address these limitations by designing the system as a replaceable sensing layer rather than a permanent active material. The freeze-dried reactions would remain stable until hydration, and the sensing patches could be modular and periodically replaced during maintenance cycles. I would also explore protective matrices and packaging strategies to improve shelf life and reduce accidental activation from ambient humidity.


Homework question from Ally Huang

11. Provide background information that describes the space biology question or challenge you propose to address. Explain why this topic is significant for humanity, relevant for space exploration, and scientifically interesting. (Maximum 100 words)

Long-duration spaceflight exposes biological systems to radiation, microgravity, confinement, and limited medical infrastructure. One major challenge is the ability to detect and respond to biological stress quickly without relying on large lab equipment. Developing compact cell-free diagnostic tools for space is significant because it supports crew health, enables on-site biological measurement, and advances low-resource biotechnology relevant both in space and on Earth. Scientifically, it is interesting because it tests how simplified biochemical systems perform under extreme conditions and whether portable synthetic biology can serve as a reliable research and health-monitoring platform.

Name the molecular or genetic target that you propose to study. Examples of molecular targets include individual genes and proteins, DNA and RNA sequences, or broader -omics approaches. (Maximum 30 words)

A DNA reporter construct responsive to oxidative stress-associated molecular signals, measured through cell-free reporter output.

Describe how your molecular or genetic target relates to the space biology question or challenge your proposal addresses. (Maximum 100 words)

Oxidative stress is one of the most important biological effects associated with radiation exposure and altered physiological conditions in space. A reporter construct linked to oxidative stress sensing can serve as a simplified way to detect biologically meaningful environmental or sample-related changes. By monitoring reporter output in a cell-free system, the experiment would test whether portable gene expression platforms can be used to detect molecular stress signatures in space-compatible settings. This directly relates to the challenge of performing biological monitoring in space with limited equipment, low mass, and limited crew time.

Clearly state your hypothesis or research goal and explain the reasoning behind it. (Maximum 150 words)

My hypothesis is that a freeze-dried cell-free expression system can be used as a compact and reliable platform to detect oxidative stress-related molecular signals in a constrained spaceflight-like environment. The reasoning is that cell-free systems remove the need to culture living cells, which makes them safer, faster, and easier to deploy in resource-limited settings. If a stress-responsive reporter can be activated reproducibly after rehydration, then this approach could support future low-resource biological monitoring during long-duration missions. The broader goal is not only to detect a signal, but to evaluate whether simplified gene expression tools can function as practical diagnostic or experimental platforms in space, where traditional wet-lab workflows are difficult to implement.

Outline your experimental plan - identify the sample(s) you will test in your experiment, including any necessary controls, the type of data or measurements that will be collected, etc. (Maximum 100 words)

I would test freeze-dried BioBits cell-free reactions containing a stress-responsive reporter construct. Experimental samples would include reactions exposed to oxidative stress-related input conditions and matched negative controls without the activating condition. Additional controls would include a positive-expression control to confirm the cell-free system is functioning properly. After rehydration and incubation, I would measure reporter output using fluorescence readout with the P51 viewer. The main data would be signal intensity, time to detectable signal, and comparison between activated, negative-control, and positive-control reactions.