Zarias Sacrati Aelius (Arman Saadatkhah) — HTGAA Spring 2026
About me
To serve society by researching and manufacturing resilient systems that secure essential infrastructure and enlighten the next generation through standards, education, and responsible innovation.
Assignment: Post-Lecture 1 First, describe a biological engineering application or tool you want to develop and why. This could be inspired by an idea for your HTGAA class project and/or something for which you are already doing in your research, or something you are just curious about.
Assignment Pre-Lecture 2 Professor Jacobson
Nature’s machinery for copying DNA is called polymerase. What is the error rate of polymerase? How does this compare to the length of the human genome. How does biology deal with that discrepancy?
The error rate for polymerase is 1:106. The human genome is ~3.2 billion base pairs (3.2 x 109). Biology deals with this discrepancy by bridging the gap using a multi-step quality control system to lower the error rate to 1:10^9. How many different ways are there to code (DNA nucleotide code) for an average human protein? In practice what are some of the reasons that all of these different codes don’t work to code for the protein of interest?
Subsections of Homework
Week 1 Post-Lecture HW: Principles and Practices
Assignment: Post-Lecture 1
First, describe a biological engineering application or tool you want to develop and why. This could be inspired by an idea for your HTGAA class project and/or something for which you are already doing in your research, or something you are just curious about.
Proposal:
A Modular Engineer Living Material (ELM) Ecosystem for Deep Space Habitation: The Multi-Trophic Myco-Foundry is a bio-industrial habitat architecture designed for Class IVc planetary mission (Mars surface operations). Unlike a singular/monolithic biological design, this system will utilize a distributed, multi-organism ecosystem to decouple structural integrity from metabolic function.
Technical Description:
The system architecture consists of four integrated biological layers, functioning analogous to organ systems:
The Structural Shell (Protective Layer)
The Vascular System (Transport Layer)
The Metabolic Hub (Atmosphere Layer)
The Pharmaceutical Payload (Functional Layer)
Rationale:
Current ISRU strategies rely on abiotic chemical processing that requires heavy & failure-prone hardware. This proposal shifts to Biological ISRU, where habitat is a regenerative asset that grows it’s own shielding, recycles waste, and manufactures critical medical supplies.
Next, describe one or more governance/policy goals related to ensuring that this application or tool contributes to an “ethical” future, like ensuring non-malfeasance (preventing harm). Break big goals down into two or more specific sub-goals. Below is one example framework (developed in the context of synthetic genomics) you can choose to use or adapt, or you can develop your own. The example was developed to consider policy goals of ensuring safety and security, alongside other goals, like promoting constructive uses, but you could propose other goals for example, those relating to equity or autonomy.
Planetary Protection: Ensure zero contamination of the planetary biosphere by preventing the escape and proliferation of engineered extremophiles.
Operational Integrity: Prevent the degradation of the pharmaceutical payload, ensuring that radiation does not induce mutations that alter drug efficacy or toxicity.
Next, describe at least three different potential governance “actions” by considering the four aspects below (Purpose, Design, Assumptions, Risks of Failure & “Success”). Try to outline a mix of actions (e.g. a new requirement/rule, incentive, or technical strategy) pursued by different “actors” (e.g. academic researchers, companies, federal regulators, law enforcement, etc). Draw upon your existing knowledge and a little additional digging, and feel free to use analogies to other domains (e.g. 3D printing, drones, financial systems, etc.).
Purpose: What is done now and what changes are you proposing?
Design: What is needed to make it “work”? (including the actor(s) involved - who must opt-in, fund, approve, or implement, etc)
Assumptions: What could you have wrong (incorrect assumptions, uncertainties)?
Risks of Failure & “Success”: How might this fail, including any unintended consequences of the “success” of your proposed actions?
Action 1: Synthetic Auxotrophy
Purpose: Enforce a biological “kill switch” that physically prevents organism survival outside the habitat.
Design: Phosphite dependence will be required for the vascular system so that the organism can replicate DNA.
Assumptions: Assume that the organism cannot evolve a pathway to utilize naturally occurring phosphate or from waste.
Risks of Failure: Horizontal Gene Transfer from native or crew-associated microbes could restore phosphate metabolism.
Action 2: ASTM for ELMs
Purpose: Establish global quality assurance baseline for the export and use of ELMs in aerospace.
Design: All aerospace ELMs must demonstrate <0.01% genomic variance over 500 generations under simulated galactic cosmic ray (GCR) exposure certifying stability.
Assumptions: Earth-based accelerated aging tests accurately predict biological behavior in the deep space radiation environment.
Risks: Excessive regulation creates a barrier to entry stifling innovation and forcing reliance on inferior abiotic materials.
Action 3: Metagenomic Sentinel
Purpose: Detect genomic drift and pathogenic mutations in real-time before crew’s health get impacted
Design: Automated sequencing loop featuring a robotic liquid handling system that samples the vascular fluid daily, running it through a sequencer to verify genetic integrity of bacteria payload against a digital reference.
Assumptions: Onboard computational power is sufficient for real-time assembly and analysis of metagenomics data without Earth downlink.
Failure Mode: False positives could trigger an automated sterilization protocol which would destroy crucial infrastructure during mission emergency.
Next, score (from 1-3 with, 1 as the best, or n/a) each of your governance actions against your rubric of policy goals. The following is one framework but feel free to make your own:
Does the option:
Synthetic Auxotrophy
ASTM for ELMs
Metagenomic Sentinel
Enhance Biosecurity
1
2
1
• Does it physically prevent the organism from escaping or mutating into a threat?
Intrinsic physical barrier, best for compliance
Administrative barrier, rules can be bypassed
Fastest response to a breach, catches mutations early
Foster Lab Safety
1
3
1
• Does it protect the crew from the organism inside the habitat?
Prevents overgrowth into crew quarters
Documentation heavy; does not stop an active bio-hazard
Proactive threat detection
Mass Efficiency (ISRU)
1
2
3
• Does it reduce the launch weight required for safety systems?
Zero mass, code is weightless
Low mass
High mass, requires heavy equipment & reagents
Energy Autonomy
2
1
3
• Does it function without draining the habitat’s power grid?
Requires energy to synthesize the specific nutrient
Zero energy cost
High energy cost; continuous computing & sequencing
Psychological Impact
2
3
1
• Does it make the crew feel safer living inside a “monster”?
Invisible safety, crew can’t see it working
Bureaucratic, offers no peace of mind
High reassurance, green light on dashboard
Operational Resilience
1
2
3
• Does it still work if the main computer fails?
Yes, biologic
Yes
No
Commercial Viability
3
1
2
• Does it encourage private companies to adopt it?
Hard, high barrier to entry
Standardizes the product
Adds cost but also adds value
Last, drawing upon this scoring, describe which governance option, or combination of options, you would prioritize, and why. Outline any trade-offs you considered as well as assumptions and uncertainties. For this, you can choose one or more relevant audiences for your recommendation, which could range from the very local (e.g. to MIT leadership or Cambridge Mayoral Office) to the national (e.g. to President Biden or the head of a Federal Agency) to the international (e.g. to the United Nations Office of the Secretary-General, or the leadership of a multinational firm or industry consortia). These could also be one of the “actor” groups in your matrix.
Committee on Space Research (COSPAR)
I propose a ‘Defense in Depth’ governance strategy that prioritizes Synthetic Auxotrophy as mandatory ‘hard constraint’ for launch approval, with runner-up as Active Surveillance. In deep space exploration, active safety systems are prone to failure, and a biological habitat must possess intrinsic safety that functions by metabolic law rather than by software code. This option resembles a nuclear reactor’s passive control rod, if the system loses power or control, the organism defaults to a safe state (dead) so it cannot scavenge Phosphite from the Martian environment.
Active Surveillance prevents mutations within the habitat while auxotrophy prevents escape, and this is crucial since cosmic radiation can cause production of toxins instead of medicine, so it is a secondary ‘soft constraint.’
Trade-offs:
Resilience vs Fragility: Engineering a dependency on a specific nutrient introduce a supply chain risk yet we accept this since the alternative presents an unacceptable existential risk to planetary science.
Disclaimer: Artificial Intelligence was used in this assignment to assist with conceptual brainstorming, technical copywriting, and formatting of the governance rubrics. The core scientific concept and final submission were curated by the student.
Week 2 Pre-lecture & Post-lecture HW: DNA Read, Write, and Edit
Assignment Pre-Lecture 2
Professor Jacobson
Nature’s machinery for copying DNA is called polymerase. What is the error rate of polymerase? How does this compare to the length of the human genome. How does biology deal with that discrepancy?
The error rate for polymerase is 1:106. The human genome is ~3.2 billion base pairs (3.2 x 109). Biology deals with this discrepancy by bridging the gap using a multi-step quality control system to lower the error rate to 1:10^9.
How many different ways are there to code (DNA nucleotide code) for an average human protein? In practice what are some of the reasons that all of these different codes don’t work to code for the protein of interest?
The average human protein is ~1036 base pairs long, which is around 345 amino acids. Since 61 codons encode 20 amino acids, there are approximately 3 codons per amino acid on average. The number of ways to code would be 3^345. Some of the reasons these different codes don’t work include codon bias (different organisms differ in their abundance of specific tRNAs) and GC content (extreme GC content can make DNA difficult to synthesize chemically or unstable for the cell to maintain).
Dr. LeProust
What’s the most commonly used method for oligo synthesis currently?
The industry standard is phosphoramidite chemistry.
Why is it difficult to make oligos longer than 200nt via direct synthesis?
For a 200nt oligo, the yield is significantly reduced because synthesis is a cyclic process and the efficiency drops exponentially per step. The result would be a mixture dominated by truncated failure sequences.
Why can’t you make a 2000bp gene via direct oligo synthesis?
A 2000bp strand is impossible based on the yield equation (Yield = EfficiencyN). It would effectively be zero since 0.9952000 = 0.004%.
Dr. Church
[Using Google & Prof. Church’s slide #4] What are the 10 essential amino acids in all animals and how does this affect your view of the “Lysine Contingency”?
The 10 Essential Amino Acids:
Arginine
Histidine
Isoleucine
Leucine
Lysine
Methionine
Phenylalanine
Threonine
Tryptophan
Valine
The Lysine Contingency is a concept from Jurassic Park where the dinosaurs were genetically engineered to be “Lysine dependent,” assuming if they escaped they would die. However, there is a crucial caveat: lysine is an essential amino acid. This means that animals cannot synthesize it naturally and must obtain it from their diet. Since lysine is abundant in nature, an escaped dinosaur could easily survive by eating standard food in the wild.
3.1. Choose your protein.
In recitation, we discussed that you will pick a protein for your homework that you find interesting. Which protein have you chosen and why? Using one of the tools described in recitation (NCBI, UniProt, google), obtain the protein sequence for the protein you chose.
Phosphite Dehydrogenase (ptxD) from Pseudomonas stutzeri; This protein is crucial to my proposal for deep space habitation since it catalyzes the oxidation of phosphite (PO33-) into phosphate (PO43-). By deleting native phosphate transporters and inserting ptxD gene, it is possible to achieve synthetic auxotrophy. Ensuring the engineered organisms can only survive if fed an artificial phosphite preventing planetary contamination.
Once a nucleotide sequence of your protein is determined, you need to codon optimize your sequence. You may, once again, utilize google for a “codon optimization tool”. In your own words, describe why you need to optimize codon usage. Which organism have you chosen to optimize the codon sequence for and why?
Optimization Organism:Bacillus subtilis
Rationale:
I have chosen to optimize the sequence for Bacillus subtilis because it serves as the primary “Bio-Pharmacy” chassis in my habitat proposal. Since the habitat’s nutrient system will be standardized on a phosphite-based supply, all biological components (including the B. subtilis pharmaceutical production units) must efficiently express the ptxD protein to survive.
What technologies could be used to produce this protein from your DNA? Describe in your words the DNA sequence can be transcribed and translated into your protein. You may describe either cell-dependent or cell-free methods, or both.
In-vivo (Cell-Dependent Expression): I would clone the DNA into an expression vector (plasmid) containing a strong B. subtilis promoter and transform it into the living cells. The cell’s natural machinery would use RNA polymerase for transcription, and ribosomes for translation that would produce the enzyme as the cells grow.
Cell-Free Protein Synthesis (CFPS): I could use a cell-free “extract” containing all the necessary molecular machinery (ribosomes, tRNAs, polymerases) in a test tube, we can produce in a specific protein without overhead of maintaining a living culture.
Part 4: Prepare a Twist DNA Synthesis Order
Part 5: DNA Read/Write/Edit Assignees for the following
(i) What DNA would you want to sequence (e.g., read) and why? This could be DNA related to human health (e.g. genes related to disease research), environmental monitoring (e.g., sewage waste water, biodiversity analysis), and beyond (e.g. DNA data storage, biobank)
I would want to sequence the metagenome of the Martian regolith after inoculation with the ecosystem or if on another exoplanet sequence the environmental with the goal of understanding the metabolic pathways that can be “harvested” and integrated into the ecosystem. This allows for the modular system to adapt to planetary conditions.
(ii) In lecture, a variety of sequencing technologies were mentioned. What technology or technologies would you use to perform sequencing on your DNA and why?
Oxford Nanopore Technologies (ONT) MinION
(iii) Is your method first-, second- or third-generation or other? How so?
Third-generation since it performs single-molecule, real-time sequencing without the need for PCR amplification.
(iv) What is your input? How do you prepare your input (e.g. fragmentation, adapter ligation, PCR)? List the essential steps.
Input: Pure sample extracted from exoplanet (soil, ice, or liquid)
Analysis: Design Primers for DNA that is best optimizied for sample data. Produce with Baseline (ELM strain) and customize Living Material in accordance to exoplant sample analysis.
Fragmentation: Minimal shearing to maintain “Long Reads”.
End-Repair/A-Tailing: Preparing DNA ends for adapter attachment.
Adapter Ligation: Attaching motor proteins and sequencing adapters to guide DNA into the pore.
(v) What are the essential steps of your chosen sequencing technology, how does it decode the bases of your DNA sample (base calling)?
DNA is taken through a microscopic protein nanopore embedded in an electrically resistant membrane, and when each nucleotide passes through, it causes a specific measurable disruption in the ionic current, and the algorithms translates these info into DNA bases.
(vi) What is the output of your chosen sequencing technology?
FastQ files containing long-read sequences which allow for the assembly of unknown microbial genomes with no reference template.
5.2 DNA Write
(i) What DNA would you want to synthesize (e.g., write) and why? These could be individual genes, clusters of genes or genetic circuits, whole genomes, and beyond. As described in class thus far, applications could range from therapeutics and drug discovery (e.g., mRNA vaccines and therapies) to novel biomaterials (e.g. structural proteins), to sensors (e.g., genetic circuits for sensing and responding to inflammation, environmental stimuli, etc.), to art (DNA origamis). If possible, include the specific genetic sequence(s) of what you would like to synthesize! You will have the opportunity to actually have Twist synthesize these DNA constructs! :)
I want to synthesize a customized ptxD expression cassette designed with modular “adapter regions.” Allowing the ptxD gene (the phosphite-based biocontainment lock) to be rapidly swapped into different host organisms depending on the destination’s gravity and radiation profile.
(ii) What technology or technologies would you use to perform this DNA synthesis and why?
Phosphoramidite Synthesis (Si-based)
(iii) What are the essential steps of your chosen sequencing methods?
De-blocking: Removing the protective DMT group from the 5’ hydroxyl of the first nucleotide.
Coupling: Activating and adding the next phosphoramidite nucleotide to the chain.
Capping: Acetylating any unreacted chains to prevent “deletion” errors.
Oxidation: Converting the unstable phosphite triester into a stable phosphate triester.
(iv) What are the limitations of your sequencing method (if any) in terms of speed, accuracy, scalability?
Speed: Chemical cycles take minutes per base, making long-gene synthesis time-consuming.
Accuracy: Error rates increase with length; chemical synthesis typically plateaus around 200bp, requiring enzymatic assembly for larger cassettes.
Scalability: While high-throughput on silicon chips, the cost scales linearly with length, making whole-genome “writing” prohibitively expensive for remote deployment.
5.3 DNA Edit
(i) What DNA would you want to edit and why? In class, George shared a variety of ways to edit the genes and genomes of humans and other organisms. Such DNA editing technologies have profound implications for human health, development, and even human longevity and human augmentation. DNA editing is also already commonly leveraged for flora and fauna, for example in nature conservation efforts, (animal/plant restoration, de-extinction), or in agriculture (e.g. plant breeding, nitrogen fixation). What kinds of edits might you want to make to DNA (e.g., human genomes and beyond) and why?Colossal, Biosciences Inc., biotechnology company that leverages genetic engineering to working to de-extinct various historic animals, such as the woolly mammoth.
I would edit the chaperone protein genes in the foundry’s microbial population, and by replacing the native promoters with environmentally responsive promoters, it is possible to achieve “self-tuning” meaning the ecosystem would increase metabolic activity or thicken walls of habitat in response to specific atmospheric stressors of exoplanet.
(ii) What technology or technologies would you use to perform these DNA edits and why?
Targeting: A Guide RNA (gRNA) is designed to match the specific DNA locus of the chaperone promoter.
Binding: The gRNA directs the Cas9 nuclease to the target site.
Cleavage: Cas9 induces a Double-Strand Break (DSB).
Repair: The cell uses Homology-Directed Repair (HDR) to incorporate a provided donor DNA template (the new responsive promoter).
(iii) What preparation do you need to do (e.g. design steps) and what is the input (e.g. DNA template, enzymes, plasmids, primers, guides, cells) for the editing?
Preparation & Input:
Design: Computational modeling of gRNAs to maximize binding efficiency.
Input: Cas9 protein (or mRNA), gRNA, and the donor DNA template delivered via plasmid or viral vector.
(iv) What are the limitations of your editing methods (if any) in terms of efficiency or precision?
Limitations:
Efficiency: HDR is naturally less frequent than the error-prone NHEJ pathway, leading to low success rates in certain fungi.
Precision: Risk of Off-target effects, where the Cas9 cuts at unintended locations, potentially compromising the “Kill Switch” integrity.
Disclaimer: Artificial Intelligence was used in this assignment to assist with conceptual brainstorming, technical copywriting, and formatting. The core scientific concept and final submission were curated by the student.