Week 1 HW: Principles and Practices

1. Modular Biological Sensing and Response Platform

I want to create a modular biological sensing and response platform that detects environmental or industrial chemicals and produces a programmable biological output. The idea is to create interchangeable modules in a biological system that include:

A sensing module
A processing module
A response module

These modules would be engineered so they can be interchanged based on:

The target chemical
The desired response output

This idea excites me because it moves beyond creating one-off biosensors and instead focuses on programmable and engineered biological systems that can be used for real-life applications. My interest lies in combining biological networks with engineering logic to build systems that are predictable, controlled, and reusable.

2. Goals

Goal 1: Ensure Safety and Prevent Harm

Sub-goal 1:

The model should require in-built safety mechanisms in case of issues such as contamination.

Sub-goal 2:

Clear safety benchmarks must be met before any scaling is allowed.

Goal 2: Prevent Misuse

Sub-goal 1:

Resist use cases that could enable environmental manipulation or toxic/harmful chemical amplification.

3. Actions

Action 1: Mandatory Safety-by-Design Criteria

Proposed:
Introduce mandatory safety-by-design requirements where every project must demonstrate built-in contamination control and fail-safe mechanisms.

Design:
Institutional Biosafety Committees (IBCs) should evaluate system behavior, not just the organism.

Assumptions:
Incorporating safety at the early design stage reduces downstream risks.

Risk of Failure and Success:

Risk: Some designs may not fit strict safety rules.
Success: Safety measures become a mandatory checklist in biological engineering projects.

Proposed:
Share circuits and protocols openly, while identifying sensitive components that could be misused.

Design:
Review and categorize components based on their risk and sensitivity.

Assumptions:
Partial transparency supports research progress while maintaining safety.

Risk of Failure and Success:

Risk: Excessive restriction could slow research.
Success: Increased engagement, awareness, and interest in addressing complex challenges.

Action 3: Collaborations

Proposed:
Introduce public recognition for practical applications of this platform.

Design:
Provide grants, innovation challenges, and encourage strong industry–academia collaborations.

Assumptions:
Ethical goals can align with long-term innovation when supported institutionally.

5. Combined Strategy

A combined strategy that enforces mandatory safety-by-design standards along with controlled transparency is recommended. This analysis helped me understand that ethical governance is a core factor in shaping how such ideas can responsibly come to life.

Does the option:	Option 1	Option 2	Option 3
Enhance Biosecurity
• By preventing incidents	1	1	3
• By helping respond	2	2	1
Foster Lab Safety
• By preventing incident	1	2	3
• By helping respond	1	2	2
Protect the environment
• By preventing incidents	1	2	2
• By helping respond	2	2	1
Other considerations
• Minimizing costs and burdens to stakeholders	2	3	1
• Feasibility?	1	2	2
• Not impede research	2	3	1
• Promote constructive applications	2	2	1

Pre-Lecture Questions

Homework Questions from Dr. LeProust

1.## Error Rate of DNA Polymerase

Replicative DNA polymerases make an intrinsic mistake approximately once every
10⁴ to 10⁵ nucleotides (10,000 to 100,000).

The human genome is roughly 3 billion base pairs (3 × 10⁹).

Without correction, an error rate of 10⁻⁵ would result in hundreds of thousands of mistakes per cell division, which is unsustainable for life.

High-Fidelity DNA Replication System

Biology employs a multi-layered, high-fidelity error-correction system:

1. Proofreading

DNA polymerases possess 3′ → 5′ exonuclease activity
Incorrect nucleotides are removed immediately during replication
Improves accuracy by approximately 100-fold

2. Mismatch Repair (MMR)

Post-replication repair machinery
Scans newly synthesized DNA
Removes errors missed by proofreading

Degeneracy of the Genetic Code and Protein Encoding

An average human protein (approximately 400–500 amino acids) can, in theory, be encoded by an astronomically large number of different DNA sequences, potentially exceeding 10¹⁰⁰ possibilities, due to the degeneracy of the genetic code.

However, most of these alternative sequences fail in practice because they may:

Disrupt proper protein folding
Introduce premature stop codons
Contain suboptimal codons that slow translation
Reduce overall translation efficiency and accuracy

Homework Questions from Dr. LeProust

Oligonucleotide Synthesis

The most commonly used method for oligonucleotide synthesis is solid-phase phosphoramidite chemistry.

Limitations of Chemical Oligonucleotide Synthesis

Producing oligonucleotides longer than ~200 nucleotides (nt) by direct chemical synthesis
(typically using phosphoramidite technology) is challenging.

The primary reason is the cumulative, step-wise inefficiency of the synthesis process.
With each nucleotide addition having a small failure rate, the overall yield of the full-length product drops dramatically as length increases.

As a result:

The desired full-length oligo is produced at very low yield
The reaction mixture becomes dominated by truncated failure sequences
Purification of the correct product becomes extremely difficult, if not impractical

Direct, single-step chemical synthesis of a 2000 base pair (bp) DNA molecule is generally not feasible because the phosphoramidite chemistry used for oligonucleotide synthesis suffers from accumulating efficiency losses and errors as the chain length increases.

Homework Question from George Church

Essential Amino Acids

The 10 essential amino acids that must be consumed in the diet because animals cannot synthesize them in sufficient amounts are:

Lysine
Threonine
Isoleucine
Leucine
Methionine
Tryptophan
Valine
Histidine
Phenylalanine
Arginine (often considered essential in young or growing animals)

These amino acids are sometimes referred to as essential or indispensable amino acids.

The “Lysine Contingency” and Its Scientific Flaws

The “Lysine Contingency” from Jurassic Park is a fictional control mechanism in which engineered dinosaurs are designed to require dietary lysine to survive. This idea, while compelling narratively, is scientifically flawed.

Why the Lysine Contingency Fails

1. Ubiquity in Diet

Lysine is a common and widely available amino acid found in most protein-rich foods, including:

Plants
Animals
Microorganisms

An organism engineered to require lysine could easily obtain it from natural food sources, making this control mechanism ineffective.

2. Biological Reality vs. Fiction

The concept treats lysine as if it were a rare, artificial, or synthetic compound, whereas in reality it is a fundamental nutrient present throughout the food chain.

3. Flawed Control Method

As implied even within the movie, the dinosaurs could theoretically survive by:

Eating prey containing lysine-rich proteins
Consuming plants that naturally supply lysine

Thus, lysine dependence alone cannot function as a reliable biological fail-safe.

Conclusion

While the Lysine Contingency works as a dramatic plot device, it does not align with real biological principles. Effective biological control systems would require far more stringent, multi-layered, and biologically realistic safeguards than dependence on a universally available essential amino acid.

NA:NA → Base-Pairing Rules

Discrete, low-alphabet
Geometry + hydrogen-bond constrained
- A = T
- C ≡ G
Expanded alphabets possible later

AA:NA → Genetic Code / Recognition Code

Many-to-one mapping
- 64 codons → 20 amino acids
Implemented physically via aminoacyl-tRNA synthetase recognition
- Shape
- Charge
- Chemistry

Shared Properties (NA:NA and AA:NA)

Symbolic → physical
Error-tolerant
Composable
Readable / writable by biology

Goal for AA:AA

We want something analogous:

A low-dimensional, physically grounded, interaction-predictive code

Suggested AA:AA Interaction Code

Side-Chain Interaction Alphabet (SCIA)

Instead of 20 × 20 explicit pairwise rules, compress AA–AA interactions into interaction classes, defined by side-chain physics, not amino-acid identity.

Step 1: Reduce Amino Acids to Interaction Primitives

Each amino acid is encoded as a vector of interaction features:

Feature	Examples
Charge	+ / − / 0
Polarity	polar / nonpolar
H-bond role	donor / acceptor / both / none
Aromaticity	yes / no
Size class	small / medium / bulky
Flexibility	rigid / flexible

Example:
Lysine ≠ just K, but:

Step 2: Define an Interaction Grammar (Not Pairs)

Instead of explicit pairings like:

Define rules:

( + ↔ − ) → electrostatic attraction
( donor ↔ acceptor ) → hydrogen bond
( aromatic ↔ aromatic ) → π–π stacking
( hydrophobic ↔ hydrophobic ) → packing
( bulky ↔ bulky ) → steric exclusion (negative rule)

This generalizes Watson–Crick pairing logic beyond nucleic acids.

Step 3: Encode AA:AA Interactions Symbolically

Interaction Symbols

Code	Interaction
E	Electrostatic (+/−)
H	Hydrogen bonding
Φ	Hydrophobic packing
π	Aromatic stacking
S	Steric clash
Ø	Neutral / weak

Week 1 HW: Principles and Practices

1. Modular Biological Sensing and Response Platform

2. Goals

Goal 1: Ensure Safety and Prevent Harm

Sub-goal 1:

Sub-goal 2:

Goal 2: Prevent Misuse

Sub-goal 1:

3. Actions

Action 1: Mandatory Safety-by-Design Criteria

Action 2: Knowledge Sharing and Transparency

Action 3: Collaborations

5. Combined Strategy

Pre-Lecture Questions

Homework Questions from Dr. LeProust

High-Fidelity DNA Replication System

1. Proofreading

2. Mismatch Repair (MMR)

Degeneracy of the Genetic Code and Protein Encoding

Homework Questions from Dr. LeProust

Oligonucleotide Synthesis

The most commonly used method for oligonucleotide synthesis is solid-phase phosphoramidite chemistry.

Limitations of Chemical Oligonucleotide Synthesis

Homework Question from George Church

Essential Amino Acids

The “Lysine Contingency” and Its Scientific Flaws

Why the Lysine Contingency Fails

1. Ubiquity in Diet

2. Biological Reality vs. Fiction

3. Flawed Control Method

Conclusion

NA:NA → Base-Pairing Rules

AA:NA → Genetic Code / Recognition Code

Shared Properties (NA:NA and AA:NA)

Goal for AA:AA

Suggested AA:AA Interaction Code

Side-Chain Interaction Alphabet (SCIA)

Step 1: Reduce Amino Acids to Interaction Primitives

Step 2: Define an Interaction Grammar (Not Pairs)

Step 3: Encode AA:AA Interactions Symbolically

Interaction Symbols