Week 1 HW: Principles and Practices

Bioindicator for Microplastic Contamination in Agricultural Soils

1. Biological Engineering Application

Project Description

The proposed biological engineering application is the development of a bioindicator for detecting contamination by microplastics and their chemical additives in agricultural soils.

The system would be based on a genetically modified soil bacterium (Pseudomonas spp.), a microorganism naturally associated with the rhizosphere of crops such as:

Maize
Wheat

Functional Concept

The engineered bacterium would be designed to:

Detect stress caused by microplastic additives present in soil
Produce a visible signal (e.g., fluorescence) when exposed to such stress

This mechanism would enable:

Early detection of soil contamination
Identification of potential stress conditions affecting crops
On-site monitoring without exclusive reliance on laboratory-based analytical methods

Motivation

Microplastics are increasingly present in agricultural soils due to multiple sources, including agricultural plastics and contaminated inputs. Current detection methods are:

Technically complex
Costly
Primarily limited to laboratory analysis

This project aims to provide a field-deployable, cost-effective biological monitoring tool to complement existing analytical techniques.

Governance and Policy Framework

2. Governance and Policy Goals

The following goals guide the evaluation of governance actions for this project.

Biosecurity

Ensure that the use of a genetically modified microorganism as a bioindicator:

Does not create biological or environmental risks
Protects sustainable soil use
Avoids negative impacts on:
- Soil ecosystems
- Crops (maize and wheat)
- Non-target organisms

Ethics

Ensure responsible and socially acceptable use of the technology by:

Protecting local communities
Promoting transparency
Limiting the application strictly to environmental monitoring purposes

Other Considerations

Ensure technical feasibility
Consider economic costs
Minimize burdens to stakeholders
Avoid unnecessary impediments to scientific research
Promote constructive and responsible applications of biotechnology

3. Potential Governance Actions

Action 1: Environmental Biosecurity Assessment Before Field Use

Type: Regulatory requirement
Actors: Academic researchers, environmental regulators

Purpose

Require a formal environmental biosecurity assessment prior to field application in order to prevent biological and ecological risks in agricultural soils.

Design

Researchers prepare risk assessment protocols
Regulatory or institutional bodies review and approve these protocols before field testing

Assumptions

Laboratory experiments can reasonably predict environmental behavior and associated risks

Risks of Failure

Long-term environmental effects may be underestimated

Risks of Over-Strict Implementation

Excessively strict requirements may slow down research and innovation

Action 2: Technical Biological Containment Strategies

Type: Technical strategy
Actors: Researchers, technology developers

Purpose

Reduce environmental risks by limiting the survival or activity of the bioindicator microorganism outside controlled soil conditions.

Design

Genetic containment mechanisms are incorporated during the design stage
Containment is built directly into the microorganism’s genetic architecture

Assumptions

Containment systems are stable and effective
Containment does not significantly reduce bioindicator performance

Risks of Failure

Containment mechanisms fail under real environmental conditions

Risks of Over-Strict Implementation

Excessive biological control reduces detection sensitivity or functionality

Action 3: Transparency and Engagement with Local Farming Communities

Type: Ethical governance / incentive mechanism
Actors: Researchers, institutions, farmers

Purpose

Ensure ethical and socially responsible implementation by informing and engaging farming communities.

Design

Provide clear information regarding:
- Purpose of the bioindicator
- Operational limits
- Potential risks
Involve local communities in decision-making processes

Assumptions

Transparency increases trust and acceptance

Risks of Failure

Communication efforts may increase resistance or skepticism

Risks of Over-Expansion

Broad acceptance may lead to unintended or expanded uses

4. Comparative Evaluation

Scoring table to be inserted here.

5. Prioritization of Governance Options

Based on the updated scoring table, a combination of:

Option 1: Environmental Biosecurity Assessment
Option 2: Biological Containment Strategies

is prioritized.

These two options are essential for project viability because they:

Perform best in preventing biological and environmental risks
Address core ethical concerns
Strengthen long-term sustainability of the project

Although they may involve:

Higher implementation costs
Potential constraints on research

they are necessary to ensure responsible development.

Role of Option 3

Transparency and community engagement is not critical for technical viability but is essential for:

Implementation in agricultural plantations
Social acceptance
Practical adoption by farming communities

Conclusion

A combined governance strategy integrating:

Environmental biosecurity assessment
Technical biological containment
Community engagement

provides a balanced framework that aligns:

Safety
Technical feasibility
Ethical responsibility
Real-world applicability

This integrated approach supports the responsible development and deployment of a bioindicator system for detecting microplastic contamination in agricultural soils.

DNA Replication, Oligo Synthesis, and Molecular Coding Concepts

Questions from Professor Jacobson

DNA Replication Fidelity

Nature’s machinery for copying DNA is DNA polymerase, which has an error rate of approximately 1 in 10⁶ bases due to its 3’→5’ proofreading activity.

The human genome contains approximately 3 × 10⁹ base pairs. In principle, thousands of errors could occur during a complete replication cycle. However, biological systems maintain genomic integrity through multiple mechanisms:

Polymerase proofreading activity
Exonuclease activity
DNA repair systems

Together, these systems significantly reduce the final mutation rate and preserve genomic stability.

Combinatorial Space of Protein-Coding Sequences

There are approximately:

3^{400 ≈ 10}190

possible DNA sequences that could encode an average human protein of 400 amino acids.

In practice, many theoretical sequences are non-functional due to constraints such as:

Codon bias
mRNA secondary structure
Translation efficiency
Splicing motifs
Protein folding constraints

Thus, while the combinatorial sequence space is vast, functional protein-coding sequences represent only a small subset.

Questions from Dr. LeProust

Most Common Method for Oligonucleotide Synthesis

The most widely used method for oligonucleotide synthesis today is solid-phase phosphoramidite chemistry (β-cyanoethyl phosphoramidite method).

It is performed on a solid support, typically:

Controlled pore glass (CPG)
Functionalized silica

The synthesis proceeds through iterative cycles consisting of:

Coupling of a protected nucleoside phosphoramidite
Capping of unreacted hydroxyl groups
Oxidation of the phosphite triester to a phosphate triester
DMT deprotection (deblocking)

This cycle is repeated until the desired sequence is assembled.

Why Is It Difficult to Synthesize Oligos Longer Than 200 nt?

Direct solid-phase phosphoramidite synthesis becomes inefficient beyond approximately 200 nucleotides because each nucleotide addition step is not 100% efficient (typically ~99%).

Since each step introduces a small loss, the total yield decreases exponentially with length.

Consequences include:

Cumulative yield loss
Exponential decrease in full-length product
Accumulation of truncated sequences
Increased error rates
Increasingly difficult purification

For example, if each step is 99% efficient, after 200 cycles the overall yield is:

0.99^200

which results in a significant reduction of full-length product.

Therefore, beyond ~200 nt, direct synthesis becomes impractical.

Why Can’t a 2000 bp Gene Be Synthesized Directly?

A 2000 base pair gene cannot be synthesized by direct chemical oligo synthesis for the same reason: cumulative inefficiency.

Attempting 2000 consecutive synthesis cycles would result in:

Negligible yield of full-length product
Massive accumulation of truncated fragments
Increased error rates
Impractical purification requirements

Instead, modern approaches use high-throughput synthesis platforms (e.g., chip-based synthesis) to generate millions of short oligos (typically 60–150 nt) in parallel.

These shorter fragments are then assembled into full-length genes using methods such as:

PCR-based assembly
Gibson Assembly

This strategy allows efficient production of thousands of genes (e.g., 9,600 genes) in a practical and scalable way.

In summary, long genes are not synthesized directly because the chemistry is inefficient at that scale; instead, short oligos are synthesized and subsequently assembled.

Question from George Church

The 10 Essential Amino Acids in Animals

The essential amino acids (those that must be obtained from the diet) are:

Histidine
Isoleucine
Leucine
Lysine
Methionine
Phenylalanine
Threonine
Tryptophan
Valine
Arginine (essential in many animals, especially during growth)

Implications for the “Lysine Contingency”

Since lysine is already an essential amino acid in animals, metabolic dependence on lysine is not unusual in biology.

However, lysine is naturally present in many environments and food sources. Therefore, engineering an organism to depend on lysine as a containment strategy may offer limited biosafety, because the amino acid is neither rare nor synthetic.

This suggests that dependence on a non-natural amino acid would provide a stronger and more reliable biocontainment strategy.

Coding Relationships in Molecular Biology

AA:NA and NA:NA Codes

Definitions:

AA = Amino Acid
NA = Nucleic Acid (DNA or RNA)

NA:NA Code

Refers to nucleic acid–nucleic acid interactions, specifically base-pairing rules:

A–T (or A–U in RNA)
C–G

These interactions govern information storage and replication.

AA:NA Code

Refers to the genetic code: codons in nucleic acids specify which amino acids are incorporated into a protein.

This code determines the primary amino acid sequence of a protein.

What Code Would Describe AA:AA Interactions?

AA:AA interactions refer to amino acid–amino acid interactions within or between proteins, including:

Hydrophobic interactions
Hydrogen bonds
Ionic interactions
Disulfide bonds
Van der Waals forces

These interactions determine:

Protein folding
Three-dimensional structure
Stability
Function

Unlike the genetic code, AA:AA interactions do not form a simple digital code. Instead, they constitute a physicochemical interaction framework that governs protein behavior.

Using the AA:NA code, we can program the amino acid sequence of a protein. By selecting specific amino acids, we indirectly influence AA:AA interactions and therefore affect structure and function.

However, although protein sequences can be designed, final folding and functionality cannot always be predicted with certainty. Protein structure emerges from complex and sometimes unpredict