Francisco Martínez

Francisco M.C

About me

I’m Francisco, have a MSc in Biological Sciences in CINV, UV (Chile) with wet lab expertise in microbiology and energy metabolism, specifically studying C. elegans behavioral aversion to P. aeruginosa PA14. As an Electronic Engineer of UTFSM (Chile), I complement this by designing open-source hardware and microfluidics to bridge biological research with advanced instrumentation.

Contact info

LinkedIn GitHub YouTube WhatsApp

Homework

Labs

Projects

Subsections of Francisco Martínez

Homework

Weekly homework submissions:

  • Week 1 HW: Principles and Practices

    Oncological Bacteriotherapy: Iron Sequestration in the TME via Controlled Release of Stealth Siderophores under NAND Logic Gates and Biocontainment Circuits.

  • Week 2 HW: DNA-read-write-and-edit

    Nature’s Machinery for Copying DNA - oligo synthesis

  • Week 03 HW: Lab-Automation

    1. Python Script for Opentrons Artwork This artwork was generated using the HTGAA26 Opentrons Colab environment. opentron_code The design was implemented programmatically using geometric constructions and multi-color pipetting logic. To properly render Devanagari text (e.g., “चित्”) using PIL in Google Colab, system-level fonts must be installed before executing the Opentrons script. The Noto Sans Devanagari font was installed using the following commands in a separate Colab cell:

Subsections of Homework

Week 1 HW: Principles and Practices

“Oncological Bacteriotherapy: Engineered Siderophore Secretion and Safety Kill-Switch via NAND Logic Gates”

My project focuses on engineering a bacterial strain capable of sensing the tumor microenvironment and responding through the synthesis of siderophores (Salmochelin), integrated with a robust safety mechanism to prevent off-target effects.

1. General Objective

To induce the death of tumor cells through iron sequestration (Salmochelin), integrating spatial control logic circuits (NAND gate) so that the bacteria act exclusively under hypoxia and ultrasound conditions, and a biocontainment system (Kill-Switch) to guarantee host safety.

Abstract Draw Abstract Draw

2. Experimental Design

2.1.- The effector agent: E. coli Nissle 1917 (Locus iroBCDE, iroN)

  • Siderophore Selection: Release Salmochelin (glycosylated) instead of the siderophore Enterobactin (ENT), which is known to be neutralized by host Lipocalin-2 (Lcn2) (Saha et al. 2019). The use of Pyoverdine is discarded after technical analysis, as its complex biosynthetic pathway represents excessive metabolic stress for the bacteria.
  • Competitive Advantage: Based on the findings of Raffatellu et al. 2010, salmochelin is a siderophore that can survive in an environment with high levels of Lcn2.
  • Cytotoxic Effect (hypothetical): By depleting the labile iron pool (LIP), ferritinophagy and HIF-1α are promoted, but given the high affinity of the siderophore, mitochondrial function collapses, triggering apoptosis of cancer cells.

2.2.- Spatial Control Circuit (NAND Gate)

To avoid systemic toxicity (Pita-Grisanti et al., 2022), Salmochelin production is subject to a double de-repression. In the normal state, two repressors (LacI and TetR) block the operon. Synthesis occurs when the following conditions are met:

  • Condition A (Hypoxia): The P_vgb promoter turns off → The bacteria stop producing LacI-LVA → The LacI lock degrades.
  • Condition B (Ultrasound): The stimulus inactivates the P_tlpA repression system → The bacteria no longer have functional TetR-LVA → The TetR lock degrades.
  • Result: The Salmochelin genes (iroBCDE, iroN) must be under a promoter with binding sites for LacI and TetR. Only when there is NO LacI (Hypoxia) AND NO TetR (Ultrasound) can the polymerase pass and produce Salmochelin and its receptor. Basal expression (leakiness) in healthy tissues is eliminated, protecting the patient’s iron homeostasis. The bacteria only release siderophores inside the tumor and when the physician decides.

2.3.- Biocontainment System (Biosafety Kill-Switch)

Implementation of a cascade activation mechanism. This design minimizes metabolic stress by not producing the drug sensor while the bacteria are colonizing the tumor.

  • Oxygen Sensor (Normoxia): P_cyo promoter → araC gene. In the tumor (Hypoxia), the P_cyo promoter is turned off, so the AraC protein is not produced. In healthy tissue or the bloodstream (Normoxia), the P_cyo promoter is activated and synthesizes the AraC protein, which acts as the “key” for the drug.
  • Drug Sensor: P_bad promoter → ccdB gene (Toxin). The P_bad promoter has an AND activation logic: it is only activated if the AraC protein is present (only in normoxia) AND Arabinose (the drug) is administered.
  • Result: As long as the bacteria remain in the hypoxic TME, they are immune to the drug, as they lack the AraC protein. If the bacteria escape to oxygenated tissues or if the tumor is significantly reduced, the presence of oxygen allows the synthesis of AraC, making the bacteria sensitive to the drug for their total elimination.

2.4.- Note

Lipocalin-2 (Lcn2) can deplete intracellular iron in macrophages, generating an inflammatory response and promoting an attack on bacteria (Manfred Nairz et al., 2016); however, in cancer, Tumor-Associated Macrophages (TAMs) have an altered transcriptional program, where they are instead immunosuppressed. When the tumor size is significantly reduced, it is likely that macrophages will reprogram toward an M1 phenotype, assisting in the elimination of the bacteria alongside the kill-switch.

3. Governance and Policy Goals

To ensure that this bacteriotherapy contributes to an ethical and safe future, I have defined the following goals:

  • Goal 1: Ensure Safety & Eficcacy
    • Sub-goal 1a: Implement multi-layered biocontainment (NAND gate + Kill-switch) to prevent systemic iron depletion in the host.
    • Sub-goal 1b: Prevent bacterial environmental persistence through strictly controlled clinical waste protocols (analogous to radioguided surgery) to ensure no bacterial escape into public sewage.
  • Goal 2: Promote Constructive Applications & Equity
    • Sub-goal 2a: Develop the platform using probiotic E. coli Nissle 1917 to keep production costs low and accessible for developing regions.

4. Governance Actions Matrix

To manage the specific risks of genetic drift and metabolic stress associated with the AraC/CcdB cascade design (Kill-Switch only), I propose the following interconnected actions:

AspectAction 1: Technical (Biocontainment)Action 2: Academic (Transparency)Action 3: Regulatory (Standards)
PurposeImplementation of a genetic circuit that prevents activation of the kill-switch when the synthetic bacteria is administered and at the same time minimizes metabolic stress in a hypoxic environment (TEM).Establishment of a shared database focusing on the leakiness and metabolic burden of hypoxia-responsive sensors like P_cyo.Development of standardized certification to ensure clinical reliability.
DesignImplementation of a cascade activation mechanism, where P_cyo promoter acts as a gatekeeper for AraC synthesis. P_bad then requires both AraC AND Arabinose to express the CcdB toxin.Peer-reviewed publication of the “Stress-Safety Curve” of the AraC/CcdB cascade to define at what point mutation frequency increases.Technical standards (e.g., ISO/TC 276) that define the Mean Time To Failure (MTTF) of the cascade before plasmid loss.
AssumptionsAssumes the AraC and ccdB genes remains functional. Risk of plasmid loss is acknowledged. Assumes the P_cyo promoter remains tightly repressed in hypoxia to prevent metabolic burden.Assumes that labs will transparently share data when the P_cyo sensor leaks and kills the bacteria prematurely.Assumes regulatory bodies (like ISP/FDA) have the expertise to audit compliance with established rules.
RisksGenetic Drift: Loss-of-function mutations in araC or ccdB it generates bacteria that are immune to the drug. The overexpression of araC due to promoter mutations it generates Arabinose-sensitive bacteria even within the tumor.Dual-Use: Detailed performance maps of the P_cyo sensor could be exploited to design oxygen-evading pathogens.Innovation Lag: Excessive bureaucracy in certifying may delay new targeted gene therapies.

Note on Scope: While my policy goals include the NAND gate for spatial control, the following Governance Matrix focuses specifically on the Kill-switch (AraC/CcdB cascade). I have prioritized this component because it represents the highest risk for environmental escape and is the “weakest link” in terms of biocontainment due to potential genetic drift.

5. Scoring and Prioritization

I have scored my proposed governance actions against my specific Policy Goals (1=Best, 3=Worst):

Policy GoalAction 1: Technical (Cascade Stability)Action 2: Academic (Transparency)Action 3: Regulatory (Waste & Standards)
Goal 1: Safety (Non-malfeasance)222
Goal 2: Equity (Low-cost Access)213
Feasibility (Implementation)122

Technical Note: In this assessment, Action 1 is scored based on the current plasmid-based design. However, to minimize the probability of genetic drift, I propose that the final implementation should transition to genomic integration of the AraC/CcdB cascade. This would ensure that the safety circuits are permanently embedded in the bacterial DNA, significantly reducing the risk of mutants compared to episomal (plasmid) expression.

6. Final Recommendation and Prioritization

Based on the scoring, I prioritize a combination of Action 1 (Technical) and Action 3 (Regulatory).

  • Priority and Audience: My recommendation is directed to the authorities responsible for verifying the safety and efficacy of new gene therapies. The technical design alone is insufficient without a clear regulatory framework.

  • Trade-offs: I have chosen to prioritize these over Action 2 (Academic Transparency) to mitigate the Dual-Use risk. While sharing detailed performance data of the P_cyo and P_bad sensors would promote global equity, the risk of this information being exploited to design pathogens that evade oxygen-based immune barriers is a trade-off I consider necessary for public safety.

  • Assumptions and Uncertainties: One assumption and uncertainty regarding what is being proposed is that the safety and efficacy criteria defined by the respective authorities could have undetected safety biases due to a lack of evidence.

7. Ethical Reflection

The most significant ethical concern that arose for me is the Dual-Use Dilemma in the context of biocontainment. I realized that the very mechanisms I am designing to ensure a therapy is safe (like high-precision oxygen sensors) are the same tools that could be used to engineer biological threats that are harder to detect or neutralize.

8. References

  1. Saha P., et al. (2019). “Enterobactin, an iron chelating bacterial siderophore, arrests cancer cell proliferation” Biochemical Pharmacology.
  2. Raffatellu M., et al. (2010). “Lipocalin-2 resistance of Salmonella enterica serotype Typhimurium confers an advantage during life in the inflamed intestine” Cell Host & Microbe.
  3. Pita-Grisanti V., et al. (2022). “Understanding the Potential and Risk of Bacterial Siderophores in Cancer” Frontiers in Oncology.
  4. Nairz M., et al. (2015). “Lipocalin-2 ensures host defense against Salmonella Typhimurium by controlling macrophage iron homeostasis and immune response” Journal of Immunology.

9. AI Prompts

In compliance with HTGAA 2026 guidelines, I certify that this homework was developed with the assistance of Gemini (Google AI).

  • Image Generation: “Used Nano Banana to generate image “Abstract_Draw.png” from the detailed description of my project.”
  • Assessment: “Create a table in markdown format that allows me to compare the design of my logic circuit with a standard design.”
  • Troubleshooting: “Technical troubleshooting for personal profile configuration in the repository and helps to transfer the project in markdown format”

Week 2 HW: DNA-read-write-and-edit

1. Benchling & In-silico Gel Art

1.1 Extraction of Restriction Site Data

Restriction site positions for 10 enzymes were extracted from Benchling for the λ phage genome.
Based on these cut positions, fragment sizes were calculated for each individual enzyme digestion.

(Image 1: Benchling screenshot showing restriction sites)


1.2. Construction of the Combinatorial Space

All possible enzyme combinations were generated from the 10 enzymes
(2¹⁰ − 1 = 1023 combinations).

For each combination:

  • Fragment sizes were computed.
  • A discrete size axis was built from all unique fragment lengths.
  • A binary matrix (combination × fragment sizes) was constructed, indicating presence/absence of each fragment.

This forms the complete “puzzle space” of available molecular weight distributions.


1.3. Definition of the Target Pattern

A desired visual pattern was manually designed using an interactive executable that allows activation/deactivation of bands on the real fragment size axis.

The output is a binary target matrix (lanes × fragment sizes).

(Image 2: manually designed target pattern)

The notebook used to manually design the target band pattern can be accessed here:

➡️ Manual Pattern Generator Notebook


1.4. Similarity Comparison and Optimization

For each lane of the target pattern:

  • It was compared against all 1023 possible enzyme combinations.
  • A contrast-based similarity metric was computed (rewarding matches and penalizing undesired bands).
  • A score was assigned.
  • The highest-scoring combination was selected.

The resulting set of enzyme combinations per lane represents the optimal reconstruction of the desired pattern.

(Image 3: algorithm output + selected enzyme combinations)

The notebook implementing the combinatorial search and similarity scoring can be accessed here:

➡️ Optimization Algorithm Notebook


1.5 Result

After optimization and manual refinement, the following enzyme combinations were selected per lane:

Manually Forced Lanes

  • Lane 4 → (‘XhoI’, ‘KpnI’)
  • Lane 7 → (‘PvuII’, ‘XhoI’)

Best-Scoring Lanes (Algorithm Output)

  • Lane 03 → (‘EcoRI’, ‘KpnI’)
  • Lane 6 → (‘EcoRI’, ‘KpnI’)
  • Lane 8 → (‘EcoRI’, ‘KpnI’)
  • Lane 10 → (‘EcoRI’, ‘KpnI’)
  • Lane 12 → (‘BamHI’,)
  • Lane 13 → (‘EcoRI’, ‘KpnI’)
  • Lane 14 → (‘BamHI’,)
  • Lane 16 → (‘EcoRI’, ‘KpnI’)
  • Lane 17 → (‘BamHI’,)
  • Lane 19 → (‘EcoRI’, ‘KpnI’)
  • Lane 20 → (‘BamHI’,)
  • Lane 21 → (‘EcoRI’, ‘KpnI’)

The final configuration, combining algorithm-selected and manually adjusted lanes, produces a readable macroscopic pattern that spells:

CHITRA

This demonstrates that complex visual structures can be reconstructed using only physically valid restriction enzyme digestion combinations drawn from the complete 1023-piece combinatorial space.

3. DNA Design Challenge

3.1. Gene Selection: E. coli IroB

For my final project, I am designing a therapeutic bacterium for solid tumor treatment using E. coli Nissle 1917 as the chassis. My goal is to synthesize the iroBCDEN operon, which is responsible for Salmochelin production, allowing the bacteria to scavenge iron more efficiently in the tumor microenvironment.

For this sequence design exercise, I am focusing on the key enzyme: IroB (C-glycosyltransferase). To ensure optimal gene expression, proper protein folding, and to minimize metabolic burden, I selected the native IroB sequence from Escherichia coli (Accession: WP_016242764.1). Utilizing a sequence native to the chassis species is a biologically superior approach compared to importing foreign variants.

3.2. Protein Sequence Input

By starting directly with the pure amino acid sequence, I utilized a Reverse Translation approach to build a pristine DNA sequence tailored for my chassis.

Protein Sequence (WP_016242764.1):

MRILFVGPPLYGLLYPVLSLAQAFRVNGHEVLIASGGQFAQKAAEAGLVVFDAAPGLDSEAGYRHHEAQR KKSNIGTQMGNFSFFSEEMADHLVEFAGHWRPDLIIYPPLGVIGPLIAAKYDIPVVMQTVGFGHTPWHIK GVTRSLTDAYRRHNVGTTPRDMAWIDVTPPSMSILENDGEPIIPMQYVPYNGGAVWEPWWERRPERKRLL VSLGTVKPMVDGLDLIAWVMDSASEVDAEIILHISANARSDLRSLPSNVRLVDWIPMGVFLNGADGFIHH GGAGNTLTALHAGIPQIVFGQGADRPVNARVVAERGCGIIPGDVGLSSNMINAFLNNRSLRKASEEVAAE MAAQPCPGEVAKSLITMVQKG

3.3. Codon Optimization

Using the Twist Bioscience Expression Optimization tool, I reverse-translated the E. coli protein sequence into an optimized DNA sequence. (https://www.idtdna.com/CodonOpt)

Optimization Parameters & Constraints:

  • Chassis: Escherichia coli.
  • Genetic Logic Compatibility: I explicitly removed internal restriction sites (GGTCTC for BsaI and GAAGAC for BbsI). This is a critical engineering step to ensure the synthesized gene is fully compatible with the Golden Gate Assembly method, which is required for assembling my NAND logic gate circuit.

Optimized IroB DNA Sequence:

ATG AGA ATT TTA TTT GTT GGA CCG CCG CTC TAC GGC CTG CTG TAT CCG GTG CTG AGC CTG GCG CAG GCG TTC CGC GTC AAC GGC CAC GAG GTG CTG ATT GCC TCC GGC GGG CAG TTT > GCG CAG AAA GCG GCG GAA GCC GGT CTG GTG GTG TTT GAT GCC GCG CCG GGC CTG GAC TCT GAA GCG GGT TAC CGC CAT CAC GAA GCG CAG CGC AAA AAA AGC AAC ATT GGC ACC CAG > ATG GGT AAC TTC AGC TTC TTC TCT GAA GAA ATG GCC GAT CAC CTG GTT GAG TTT GCC GGT CAC TGG CGT CCG GAC CTG ATT ATC TAT CCG CCG CTG GGT GTG ATT GGT CCG CTG ATT > GCG GCA AAA TAT GAC ATC CCG GTG GTT ATG CAG ACC GTC GGC TTT GGT CAC ACG CCG TGG CAC ATC AAA GGC GTG ACC CGC AGC CTG ACC GAT GCC TAT CGC CGT CAC AAC GTT GGC > ACC ACA CCG CGT GAT ATG GCG TGG ATC GAC GTC ACA CCG CCA AGC ATG AGC ATC CTG GAA AAC GAC GGT GAG CCG ATC ATT CCG ATG CAG TAT GTG CCG TAC AAC GGT GGT GCG GTG > TGG GAG CCG TGG TGG GAG CGT CGT CCG GAG CGC AAG CGC CTG CTG GTG AGC CTG GGT ACG GTG AAA CCG ATG GTG GAC GGT CTG GAT CTG ATT GCC TGG GTG ATG GAC AGC GCC AGC > GAA GTT GAT GCG GAG ATC ATC CTG CAC ATC TCT GCC AAC GCG CGC AGC GAC CTG CGC TCG CTG CCG AGC AAC GTG CGC CTG GTT GAT TGG ATT CCG ATG GGT GTG TTC CTG AAC GGT > GCG GAC GGC TTT ATC CAC CAC GGT GGT GCG GGT AAC ACC CTG ACT GCG CTG CAT GCC GGT ATT CCG CAG ATT GTC TTT GGT CAG GGT GCT GAC CGC CCG GTT AAT GCG CGT GTG GTG > GCG GAG CGT GGC TGT GGG ATC ATC CCG GGT GAT GTC GGC CTG TCC AGC AAC ATG ATC AAC GCC TTC CTG AAC AAC CGC TCG CTG CGT AAA GCC TCT GAA GAG GTT GCG GCA GAA ATG > GCG GCG CAG CCG TGC CCG GGT GAG GTG GCC AAA TCG CTG ATC ACC ATG GTT CAG AAA GGG

3.4. Protein Production Technologies

To produce the IroB protein from the newly designed and optimized DNA sequence, two main technological approaches can be employed: cell-dependent (in vivo) and cell-free (in vitro) systems.

1. Cell-Dependent Method (In vivo expression)

This is the traditional recombinant protein production method. The optimized iroB DNA sequence would be cloned into an expression plasmid containing a strong promoter and a Ribosome Binding Site (RBS). This plasmid is then transformed into a bacterial host, such as Escherichia coli BL21(DE3) for massive lab-scale production, or directly into our therapeutic chassis, E. coli Nissle 1917. The living bacteria will act as bio-factories, using their native cellular machinery to express the protein during their growth phase.

2. Cell-Free Protein Synthesis (CFPS) (In vitro expression)

Alternatively, a cell-free system (such as the PURE system or an E. coli cell extract) can be used. This technology strips away the living cell and uses only the essential biological machinery (RNA polymerases, ribosomes, tRNAs, amino acids, and energy molecules) mixed in a tube. By adding our linear or plasmid DNA directly into this mixture, the IroB protein can be synthesized in a few hours. This method is highly advantageous for rapid prototyping and testing of genetic circuits, as it bypasses the need for cell transformation and culturing.

DNA sequence to Protein

  1. Transcription: Under the control of a hypoxia-sensitive promoter (part of my NAND gate logic), the bacterial RNA Polymerase enzyme recognizes and binds to the specific promoter sequence located just upstream of our iroB gene. The enzyme unwinds the double-stranded DNA and uses the template strand to synthesize a single-stranded messenger RNA (mRNA). It reads through our optimized sequence, creating an exact RNA copy, and stops when it reaches a terminator sequence.
  2. Translation: Once the mRNA is transcribed, the bacterial ribosome recognizes and binds to the Ribosome Binding Site (RBS) on the mRNA. The ribosome scans the mRNA until it finds the start codon (ATG). Because we performed codon optimization via reverse translation, the sequence is perfectly calibrated for E. coli. Transfer RNAs (tRNAs) carrying specific amino acids will efficiently recognize the optimized mRNA codons (three-letter nucleotide sequences) without stalling. The ribosome links these amino acids together through peptide bonds, moving along the mRNA until it reaches the stop codon (TAA, TAG, or TGA). At this point, the newly synthesized IroB C-glycosyltransferase protein folds into its 3D structure and is released to perform its catalytic function.
  3. Function: Once folded, IroB will begin glycosylating enterobactin within the cytoplasm to produce the therapeutic Salmochelin.

3.5 How does it work in nature/biological systems?

Historically, the rule in biology was “one gene, one protein.” However, we now know that a single gene can produce multiple different protein variants (isoforms) through mechanisms that alter the mRNA transcript before it is translated. At the transcriptional (and early post-transcriptional) level, there are two primary mechanisms for this:

1. Alternative Splicing

In eukaryotic cells, genes are composed of coding regions (exons) and non-coding regions (introns). When RNA polymerase transcribes the gene, it creates a precursor mRNA (pre-mRNA) that contains both. During a process called alternative splicing, a cellular complex called the spliceosome removes the introns and joins the exons together. However, the spliceosome can choose to include or skip certain exons. Depending on which combination of exons is spliced together to form the mature mRNA, the ribosome will translate completely different protein isoforms, each potentially having different structural domains or functions, all originating from the exact same DNA sequence.

2. Alternative Promoters (Alternative Transcription Start Sites)

A single gene can possess multiple promoters (the DNA sequence where RNA polymerase binds to initiate transcription). Depending on which promoter the cell activates—often influenced by environmental signals or tissue type—transcription will start at different points along the gene. If transcription starts at a downstream promoter, the resulting mRNA will be shorter and will lack the initial genetic instructions. When translated, this produces a truncated version of the protein, often missing specific signaling sequences or regulatory domains present in the full-length version.


4. Plasmid Construction and In Silico Validation (Phase I: iroB)

This section documents the construction of the initial expression vector in Benchling, starting from the optimized iroB gene and culminating in a verified plasmid assembly ready for future expansion.

4.1. Genetic Cassette Design and Optimization

The expression cassette was built systematically in Benchling, starting with the codon-optimized sequence for the iroB gene. To ensure functionality, modularity, and future purifiability, specific genetic parts were integrated:

  • Components: The cassette includes a Promoter (Pro), Ribosome Binding Site (RBS), the iroB CDS, and a strong transcriptional Terminator (Ter).
  • C-Terminal tag: A 7xHis-tag was added immediately upstream of the STOP codon to allow for future protein purification via affinity chromatography.
  • Modular Restriction Sites: To create a standardized “BioBrick-like” part, the entire cassette was flanked with unique restriction enzyme sites: NotI (GCGGCCGC) at the 5’ end and XbaI (TCTAGA) at the 3’ end.

SBOL Cassette Design SBOL Cassette Design Figure 1: SBOL Diagram of the iroB Expression Cassette. The symbols denote (from left to right): NotI RERS, Promoter, RBS, iroB CDS, 7xHis-Tag, Terminator, and XbaI RERS.

Technical Note: The XbaI site was placed immediately following the terminator to encompass the entire modular cassette. It is important to note that since the selected terminator sequence ends in ‘TA’ (and not ‘GA’), the formation of a Dam methylation site (GATCTAGA) is avoided. This serendipitous sequence alignment ensures that the enzyme will not be blocked by methylation, allowing for efficient cleavage during laboratory procedures.


4.2. Vector Selection and In Silico Assembly

The commercial vector pTwist Amp High Copy (2221 bp) was selected as the backbone for this phase. (https://www.twistbioscience.com/products/genes/vectors?tab=catalog-vectors)

Benchling Assembly Process:

  1. Backbone Preparation: The pTwist map was imported, identifying the Multiple Cloning Site (MCS:region between coordinates ~73 and ~245) as the optimal insertion point. Specifically, coordinate 200 was selected to ensure that critical elements like the origin of replication (ori) and the ampicillin resistance gene (AmpR) remained undisturbed.
  2. Insert Preparation: The 1326 bp iroB modular cassette was defined using the flanking NotI (5’ end) and XbaI (3’ end) recognition sites. This allows any researcher to infer the intended Forward direction of the gene by identifying the positions of these specific landmarks on the plasmid map. Modularity is also ensured, allowing the entire expression cassette to be excised and transferred to different vectors in future iterations of the project.
  3. Assembly Simulation: Using Benchling’s molecular biology tools, a Gibson Assembly was simulated to insert the designed iroB modular cassette into the pTwist MCS, resulting in a final circular plasmid of exactly 3561 bp.

Benchling Plasmid Map Benchling Plasmid Map Figure 2: Circular map of the assembled pTwist-iroB-cassette plasmid (3561 bp).


4.3. Results and Validation: Virtual Digest

To validate the structural integrity of the design, a Virtual Enzymatic Digest was performed using NotI and XbaI. The simulation results account for the redistribution of nucleotides at the restriction sites following the cleavage:

  • Fragment 1 (Vector Backbone - pTwist): 2228 bp. This represents the original 2221 bp pTwist sequence plus 7 bp derived from the flanking restriction site architecture.
  • Fragment 2 (iroB Expression Cassette): 1333 bp. This comprises the 1326 bp optimized cassette plus 7 bp from the remaining restriction site sequences.

The sum of these fragments confirms a total plasmid length of 3561 bp.

Virtual Digest Gel Virtual Digest Gel Figure 3: Virtual agarose gel electrophoresis (1% agarose). Lane 1: DNA Ladder. Lane 2: pTwist-iroB digested with NotI/XbaI, yielding two distinct and sharp bands at 2228 bp and 1333 bp. This result confirms successful in silico assembly and validates that the iroB optimized sequence is free of internal restriction sites for the selected enzymes.


4.4. Future Work: Iterative Design

This validated plasmid serves as the foundational “chassis” for the project. The next engineering phases involve:

  • Promoter Re-engineering: Replacing the current constitutive promoter with a Boolean Logic (e.g., NAND gate) promoter designed to respond to hypoxia and ultrasound-linked stimuli.

  • Operon Completion: Sequentially assembling the remaining genes (iroC, iroD, iroE, and iroN) into the cassette to generate a single polycistronic iroBCDEN operon.

  • Clinical-Grade Vector Redesign: The current backbone includes an antibiotic-resistance marker, which is not ideal for therapeutic applications due to biosafety and regulatory concerns. Future versions of the construct should transition to a non-antibiotic plasmid maintenance system appropriate for clinical use.

  • Biocontainment Strategy (Kill Switch Evaluation): A toxin–antitoxin-based containment module is a candidate approach. In such systems, continuous expression of an antitoxin neutralizes a stable toxin; loss or inhibition of the antitoxin can result in growth arrest or cell death. The stability, leakiness, and escape frequency of this strategy must be experimentally evaluated.

  • Expression Burden Mitigation: Full expression of the iroBCDEN complex may impose significant metabolic and translational burden. Strategies such as orthogonal translation systems (e.g., orthogonal ribosomes) or alternative burden-mitigation approaches should be assessed to improve stability and performance.

5. Theoretical Questions: DNA Read, Write, & Edit

5.1 DNA Read

(i) What DNA would you want to sequence (e.g., read) and why?

  • Target: I would sequence the genome of the Panther Chameleon (Furcifer pardalis) and the Common Octopus (Octopus vulgaris), focusing on the gene families of reflectins and the regulatory proteins of iridophores/chromatophores.
  • Why: Current biological reporters, such as Fluorescent Proteins (FPs), have a fundamental limitation: high stability leads to poor temporal resolution. Once an FP is expressed or activated, it remains fluorescent for hours, “smearing” the signal and masking real-time dynamics. This makes it impossible to observe rapid “on/off” pulses in neural circuits (like serotonergic vs. octopaminergic crosstalk) or the precise timing of a synthetic logic gate.

By sequencing these organisms, I aim to discover the genetic basis of reversible structural color. Unlike fluorescence, which requires high-energy lasers that cause phototoxicity and photobleaching, reflectins change their optical properties through rapid conformational shifts. In the context of my cancer-targeting project, these proteins could serve as “dynamic reporters” for my NAND logic gate. They would allow me to observe, in vitro, the exact moment the bacteria detect ultrasound or hypoxia and—crucially—see the signal vanish the instant the stimulus stops. This would provide a level of kinetic resolution and biophysical feedback that is currently unattainable with standard fluorescence, enabling the study of fast enzymatic transitions and synaptic-like communications without “staining” the entire experimental field.

(ii) In lecture, a variety of sequencing technologies were mentioned. What technology or technologies would you use to perform sequencing on your DNA and why?

  • Selected Technology: PacBio HiFi Sequencing combined with Iso-Seq.
  • Rationale: Since the goal is to find functional proteins with specific structural kinetics, I need to resolve not just the genome, but the full-length isoforms of the proteins being expressed in the skin cells. PacBio HiFi provides the extreme accuracy (99.9%) and long reads necessary to assemble these complex, repetitive protein domains without the errors of short-read platforms.

Detailed Technical Questions:

  • Is your method first-, second- or third-generation or other? How so?
    • It is a third-generation technology (Single Molecule, Real-Time). It sequences individual DNA molecules as they are synthesized by a polymerase in a Zero-Mode Waveguide (ZMW), allowing for real-time observation of base incorporation.
  • What is your input? How do you prepare your input (e.g. fragmentation, adapter ligation, PCR)? List the essential steps.
    • Input: High Molecular Weight (HMW) genomic DNA and full-length mRNA (for Iso-Seq) from dermal tissue.
    • Preparation steps: 1. Extraction: Specialized lysis to maintain long-strand integrity. 2. SMRTbell Library Prep: Ligation of hairpin adapters to create circular DNA templates. 3. Size Selection: Ensuring only long fragments (>10kb) are loaded to maximize information per read.
  • What are the essential steps of your chosen sequencing technology, how does it decode the bases of your DNA sample (base calling)?
    • Process: A polymerase at the bottom of a ZMW incorporates fluorescently labeled nucleotides. As each base is added, it emits a light pulse.
    • Base Calling: The system records the color and duration of these pulses. Because the template is circular, the polymerase reads it multiple times (Circular Consensus Sequencing), which allows the software to “correct” any random errors and produce a HiFi read of extremely high quality.
  • What is your output of your chosen sequencing technology?
    • The output is a BAM file containing highly accurate, long-read sequences, ready for de novo assembly of the structural color gene clusters.

5.2 DNA Write

(i) What DNA would you want to synthesize (e.g., write) and why?

  • Target: I want to synthesize a modular genetic circuit containing a NAND logic gate that integrates two environmental sensores (ultrasound-responsive promoters and hypoxia-inducible factors) to drive the expression of the iroB cluster for salmochelin production, including a safety kill-switch.
  • Why: This construct is the core of my final project: a targeted cancer therapy. The goal is to engineer bacteria that only produce potent iron-sequestering siderophores (salmochelins) within the tumor microenvironment (hypoxia) and under external activation (ultrasound). This ensures the therapy is localized, minimizing systemic toxicity. Synthesizing this specific construct via Twist would allow me to perform the first in vitro validations of the logic gate’s precision.
  • Sequence: I don’t have the complete cassette sequence yet; I need to redesign the promoter, add the Kill-Switch and add the iroBCDEN complex. Preliminary sequence: TTTACGGCTAGCTCAGTCCTAGGTATAGTGCTAGCCATTAAAGAGGAGAAAGGTACCATGATGAGAATTTTATTTGTTGGACCGCCGCTCTACGGCCTGCTGTATCCGGTGCTGAGCCTGGCGCAGGCGTTCCGCGTCAACGGCCACGAGGTGCTGATTGCCTCCGGCGGGCAGTTTGCGCAGAAAGCGGCGGAAGCCGGTCTGGTGGTGTTTGATGCCGCGCCGGGCCTGGACTCTGAAGCGGGTTACCGCCATCACGAAGCGCAGCGCAAAAAAAGCAACATTGGCACCCAGATGGGTAACTTCAGCTTCTTCTCTGAAGAAATGGCCGATCACCTGGTTGAGTTTGCCGGTCACTGGCGTCCGGACCTGATTATCTATCCGCCGCTGGGTGTGATTGGTCCGCTGATTGCGGCAAAATATGACATCCCGGTGGTTATGCAGACCGTCGGCTTTGGTCACACGCCGTGGCACATCAAAGGCGTGACCCGCAGCCTGACCGATGCCTATCGCCGTCACAACGTTGGCACCACACCGCGTGATATGGCGTGGATCGACGTCACACCGCCAAGCATGAGCATCCTGGAAAACGACGGTGAGCCGATCATTCCGATGCAGTATGTGCCGTACAACGGTGGTGCGGTGTGGGAGCCGTGGTGGGAGCGTCGTCCGGAGCGCAAGCGCCTGCTGGTGAGCCTGGGTACGGTGAAACCGATGGTGGACGGTCTGGATCTGATTGCCTGGGTGATGGACAGCGCCAGCGAAGTTGATGCGGAGATCATCCTGCACATCTCTGCCAACGCGCGCAGCGACCTGCGCTCGCTGCCGAGCAACGTGCGCCTGGTTGATTGGATTCCGATGGGTGTGTTCCTGAACGGTGCGGACGGCTTTATCCACCACGGTGGTGCGGGTAACACCCTGACTGCGCTGCATGCCGGTATTCCGCAGATTGTCTTTGGTCAGGGTGCTGACCGCCCGGTTAATGCGCGTGTGGTGGCGGAGCGTGGCTGTGGGATCATCCCGGGTGATGTCGGCCTGTCCAGCAACATGATCAACGCCTTCCTGAACAACCGCTCGCTGCGTAAAGCCTCTGAAGAGGTTGCGGCAGAAATGGCGGCGCAGCCGTGCCCGGGTGAGGTGGCCAAATCGCTGATCACCATGGTTCAGAAAGGGCATCACCATCACCATCATCACTAACCAGGCATCAAATAAAACGAAAGGCTCAGTCGAAAGACTGGGCCTTTCGTTTTATCTGTTGTTTGTCGGTGAACGCTCTCTACTAGAGTCACACTGGCTCACCTTCGGGTGGGCCTTTCTGCGTTTATA

(ii) What technology or technologies would you use to perform this DNA synthesis and why?

  • Selected Technology: Silicon-based high-throughput chemical synthesis (Twist Bioscience platform).
  • Rationale: For a complex circuit involving multiple promoters, a CDS (iroB), and a kill-switch, I need extreme precision and the ability to synthesize large quantities of different variations of the circuit. Twist’s technology uses a silicon platform that miniaturizes the traditional phosphoramidite chemistry, allowing for the synthesis of thousands of genes simultaneously with high fidelity and low cost, which is ideal for prototyping complex logic gates like mine.

Detailed Technical Questions:

  • What are the essential steps of your chosen synthesis method?
    • Phosphoramidite Cycle: The DNA is built base-by-base (A, T, C, G) on a silicon chip. Each addition follows a cycle of 4 steps: De-protection (preparing the strand), Coupling (adding the base), Capping (preventing errors), and Oxidation (stabilizing the bond).
    • Assembly: Since the chemical process can only print short pieces (oligos), these pieces are harvested from the chip and assembled into the full 2.5 kb circuit.
    • Error Correction: The final DNA is “polished” using enzymes to ensure there are no mutations, delivering a 100% accurate sequence.
  • What are the limitations of your synthesis method (if any) in terms of speed, accuracy, scalability?
    • Speed: The chemical synthesis is fast, but the complete process (assembly, quality control, and shipping) takes about 2 to 3 weeks.
    • Accuracy: As the DNA strand gets longer, the chance of errors increases, which is why we must assemble smaller, verified fragments to build a large circuit.
    • Scalability: It is highly scalable (thousands of genes at once), but it still depends on traditional chemicals, unlike newer enzymatic methods.

5.3 DNA Edit

(i) What DNA would you want to edit and why?

  • Target DNA: The genome of a therapeutic strain of Escherichia coli (like E. coli Nissle 1917).
  • Why: I need to integrate my synthetic NAND-iroB circuit into the bacterial chromosome. If the circuit stays on a plasmid, it could be lost or vary in copy number. By editing the bacterial genome to insert the circuit into a “safe harbor” locus, I ensure the therapy is stable and the kill-switch works perfectly every time.

(ii) What technology or technologies would you use to perform these DNA edits and why?

  • Selected Technology: CRISPR-Cas9 mediated Recombineering.
    • Rationale: CRISPR-Cas9 is extremely precise at “cutting” the DNA at a specific site in the bacterial genome, and Recombineering allows us to “paste” my large 2.5 kb synthetic circuit into that gap with high efficiency.

Detailed Technical Questions:

  • How does your technology of choice edit DNA? What are the essential steps?
    • Targeting: A Guide RNA (gRNA) leads the Cas9 protein to the exact spot in the bacterial genome.
    • Cutting: Cas9 creates a double-strand break (DSB) in that spot.
    • Insertion: Using the cell’s repair machinery (and a template I provide), the synthetic NAND-iroB circuit is “pasted” into the cut, becoming a permanent part of the bacteria’s DNA.
  • What preparation do you need to do (e.g. design steps) and what is the input (e.g. DNA template, enzymes, plasmids, primers, guides, cells) for the editing?
    • Design steps: I need to design the gRNA (to target the genome) and the Homology Arms (flanking sequences that match the insertion site).
    • Input: The Cas9 enzyme, the gRNA, and my synthetic DNA construct (the circuit synthesized by Twist).
  • What are the limitations of your editing methods (if any) in terms of efficiency or precision?
    • Efficiency: In bacteria, the rate of successful “pasting” (HDR) can be low, often requiring selection markers (like antibiotic resistance) to find the edited cells.
    • Precision: There is a small risk of off-target effects, where Cas9 might cut in the wrong place, though this is rare in simple bacterial genomes.

Week 03 HW: Lab-Automation

1. Python Script for Opentrons Artwork

This artwork was generated using the HTGAA26 Opentrons Colab environment. opentron_code The design was implemented programmatically using geometric constructions and multi-color pipetting logic.

opentron_design opentron_design

To properly render Devanagari text (e.g., “चित्”) using PIL in Google Colab, system-level fonts must be installed before executing the Opentrons script. The Noto Sans Devanagari font was installed using the following commands in a separate Colab cell:

  1. !sudo apt-get update -qq
  2. !sudo apt-get install -y -qq fonts-noto-core fonts-noto-unhinted
  3. !fc-list | grep -i “Devanagari” | head -n 20

2. Post-Lab Questions

2.1 Find and describe a published paper that utilizes the Opentrons or an automation tool to achieve novel biological applications.

A relevant example of automation enabling novel biological applications is Nielsen et al., 2016. “Genetic circuit design automation.” Science

In this study, the authors developed an automated design–build–test workflow to systematically engineer genetic logic circuits in living cells. Rather than constructing gene circuits through manual trial-and-error, they integrated computational modeling, standardized genetic parts, and high-throughput experimental validation to design functional Boolean logic gates such as AND, OR, NOR, and more complex multi-layer circuits.

The key innovation was the automation of circuit design and screening. Computational tools were used to predict circuit behavior based on promoter strength, repressor activity, and regulatory architecture. These predictions were then experimentally validated using high-throughput plate-based assays, allowing many circuit variants to be constructed and tested in parallel. Automated measurement of reporter outputs (e.g., fluorescence) enabled quantitative evaluation of logic performance, signal thresholds, and leakiness.

Automation significantly reduced the combinatorial complexity inherent in multi-input genetic circuit design. Instead of manually constructing and testing a few variants, the workflow enabled systematic exploration of many possible architectures, improving robustness and reproducibility. This approach demonstrated that genetic logic circuits can be engineered in a scalable and programmable manner, similar to electronic circuit design.

This paper shows how automation transforms synthetic biology from artisanal genetic assembly into an engineering discipline with predictive modeling and systematic validation.

2.2 Write a description about what you intend to do with automation tools for your final project. You may include example pseudocode, Python scripts, 3D printed holders, a plan for how to use Ginkgo Nebula, and more. You may reference this week’s recitation slide deck for lab automation details.

For my final project, I intend to engineer a bacterial therapeutic system for oncology applications. The core design involves a plasmid encoding a NAND logic gate integrating multiple tumor-associated inputs, including hypoxia, elevated lactate levels, and ultrasound stimulation. Only when specific tumor microenvironment conditions are satisfied would the circuit activate expression of a therapeutic cassette (e.g., the eriBCDEN complex).

Automation would be essential to systematically design and validate this multi-input logic system. I propose implementing an automated design–build–test workflow focused on high-throughput screening of circuit variants.

  1. Combinatorial plasmid assembly Promoter variants responsive to hypoxia, lactate, and ultrasound would be modularly assembled using combinatorial DNA assembly methods (e.g., Golden Gate). An acoustic liquid handler (e.g., Echo) could transfer defined promoter and RBS fragments into specified wells to systematically generate circuit variants.

  2. RBS strength tuning Ribosome binding site variants of defined translation strengths would be introduced to modulate repressor expression levels. This step allows fine control of expression thresholds and minimization of basal leakiness, which is critical for achieving accurate NAND logic behavior.

  3. High-throughput culture setup Following transformation and colony selection, bacterial variants would be distributed into 96-well plates using liquid handling robotics. This enables parallel testing of multiple architectures under standardized growth conditions.

  4. Controlled environmental testing Each well would be exposed to defined combinations of normoxia/hypoxia conditions, graded lactate concentrations, and ultrasound stimulation (applied externally). This systematic input matrix allows evaluation of all Boolean input states.

  5. Automated reporter quantification A fluorescent reporter would be used during circuit prototyping prior to therapeutic deployment. Fluorescence measurements using a plate reader (e.g., PHERAstar) would quantify output across all input combinations, enabling assessment of Boolean fidelity, dynamic range, activation thresholds, and leakiness.

Because multi-input genetic logic circuits require careful balancing of transcriptional and translational parameters, manual testing would be slow and prone to variability. Automation enables parallelized combinatorial screening and quantitative validation before integrating the therapeutic cassette.

By integrating automation into the circuit development pipeline, this approach would accelerate optimization of tumor-specific logic and improve safety and precision in engineered bacterial cancer therapies.

3. Final Project Ideas

  1. Programmable Tumor-Specific Bacterial NAND Therapeutic with Siderophore-Mediated Iron Depletion: Engineering a multi-input bacterial NAND logic circuit (hypoxia, lactate, ultrasound) to selectively activate a siderophore-expression cassette, enabling tumor-specific iron depletion as a therapeutic strategy.
  2. Lactate-Sensor–Triggered Neuroprotection in C. elegans TRNs (mec-4d): In mec-4d degenerative TRNs, use a lactate-responsive promoter (activated when intracellular lactate is depleted) to drive expression of a blocking effector (e.g., nanobody-based inhibition) that reduces mec-4d channel opening and limits Ca²⁺ influx as a dynamic negative-feedback protection mechanism.
  3. Chitra Neural: Ultrasound-Responsive Biological Transducers for Neural Signal Interface: Exploring engineered ultrasound-responsive biological systems (e.g., gas vesicles or mechanosensitive proteins) as a foundation for noninvasive neural signal modulation and potential future decoding applications

Subsections of Labs

Week 1 Lab: Pipetting

cover image cover image

Subsections of Projects

Individual Final Project

cover image cover image

Group Final Project

cover image cover image