Week 2 HW: DNA Read Write and Edit

Part 1: Benchling & In-silico Gel Art:

Simulate Restriction Enzyme Digestion with the following Enzymes:

Create a pattern/image in the style of Paul Vanouse’s Latent Figure Protocol artworks.

I tried to make a pattern that looked like a staircase going down by using logic but couldn’t quite seem to get it right…

Part 3: DNA Design Challenge:

Protein Choice: AFP III (Antifreeze Protein Type III) I chose AFP III because it has an incredible practical application: allowing organisms to survive in sub-zero temperatures. In biotechnology, it is used to create smoother ice cream textures (by preventing large ice crystals) and in the preservation of organs for transplant. It is a fascinating example of how nature solves extreme physical challenges through molecular engineering.

Protein Sequence (UniProt P05140): sp|P05140|ANP3_MACAM Antifreeze protein type 3 NQASVVANQLIPINTALTLVMMKAEVVTPMGIPAEEIPNLVGMQVNRAVPLGTTLMPDMV KNY

Reverse Translation aatcaagcctctgtagtagccaaccagctgatccccatcaacaccgccctcaccctggtcatgatgaaggccgaggtcgtcacccccatgggcatccccgccgaggagatccccaacctggtcggcatgcaggtcaaccgcgccgtccccctcggcaccaccctgatgcccgacatggtcaagaactac

Codon Optimization In your own words, describe why you need to optimize codon usage. Which organism have you chosen to optimize the codon sequence for and why? This answer takes back to last homework, due to codon bias, using a sequence that’s not ideal for your chosen organism, you may have some problems with codons used tipically by the species that produces said protein naturally. I picked E.Coli because it is the standart for easy cultivation. I ran VectorBuilder’s tool with some uncertainty but it seemed good because it was the only one that said ‘free’ on the google search. This is the optimized sequence: AACCAGGCGAGCGTGGTGGCGAATCAGCTGATTCCGATTAATACCGCCCTGACCCTGGTGATGATGAAAGCCGAAGTGGTGACCCCGATGGGCATTCCGGCGGAAGAAATTCCGAACCTGGTGGGCATGCAGGTGAATCGCGCGGTGCCGCTGGGCACCACCCTGATGCCGGATATGGTGAAAAATTAT

You have a sequence, now what? Cell-Dependent Expression (In Vivo): The most common method is to transform E. coli cells with the plasmid. When an inducer (like IPTG) is added to the growth media, the bacteria’s internal machinery (RNA Polymerase and Ribosomes) begins to transcribe the DNA into mRNA and translate that mRNA into the physical AFP III protein.

Cell-Free Protein Synthesis (In Vitro): Alternatively, one could use a “cell-free” extract. This involves mixing the DNA directly with a “molecular soup” containing ribosomes, enzymes, and amino acids in a test tube.

Part 4: Prepare a Twist DNA Synthesis Order:

I made an account and twist doesn’t seem to allow me to use the website, saying I need to contact a distributor… I checked with a fellow brazilian colleague and he seems to be having the same issue, maybe it’s a regional block?

05/25/2026 - I just did this part because of the final project, so I’ll just post the cronological order of processess and hope for the best… First I optimized the sequence using idtDNA Codon Optimization tool to optimize the sequence.

MSNKLTLKFIAALLMNVSASAAEQKISQAELDESFREQIRDALKQLEDEMKASRDQLAQLYNEIQKIRDAMLAQKRELAEQAKKLQDELQKLQAELEALKQHREQIQKLQKDLQTLKEQLEQLKNERSRLQAELAQLRAQHDAEMKASREELQKLQEELEAAKQELQKLQEQLEQLKNERSRLQAELAQMRAQHGGGGSGGGGSWSFFSKVHC

Here’s the optimized sequence: (I had to use E. Coli K12 as the organism because there was no B. Fragilis option)

ATG AGC AAC AAA CTG ACC CTG AAA TTT ATC GCC GCG CTG CTG ATG AAC GTC TCA GCC TCA GCG GCT GAG CAG AAA ATC AGC CAG GCG GAG CTG GAT GAG AGC TTC CGT GAG CAG ATC CGC GAT GCG CTG AAA CAG CTG GAA GAT GAG ATG AAA GCC AGC CGC GAC CAG CTG GCG CAG CTG TAC AAC GAG ATC CAG AAA ATC CGC GAT GCG ATG CTG GCG CAG AAA CGT GAA CTG GCG GAA CAG GCC AAA AAA CTG CAG GAT GAA CTG CAG AAA CTG CAG GCG GAA CTG GAA GCG CTG AAA CAG CAC CGC GAA CAG ATT CAG AAA CTG CAG AAA GAC CTG CAG ACG CTG AAA GAG CAG CTG GAA CAG CTG AAA AAC GAA CGC AGC CGT CTG CAG GCG GAA CTG GCG CAG CTG CGT GCG CAG CAC GAT GCC GAA ATG AAA GCC AGC CGT GAA GAA CTG CAG AAA CTG CAG GAA GAG CTG GAA GCG GCG AAA CAG GAG CTG CAG AAA CTG CAG GAA CAG CTG GAG CAG CTG AAA AAC GAA CGC AGC CGC CTG CAG GCG GAA CTG GCG CAG ATG CGT GCA CAG CAC GGC GGC GGC GGC AGC GGT GGT GGT GGC AGC TGG AGC TTC TTC AGC AAA GTT CAC TGC

Then, I pasted the whole cassete on Benchling and annotated every part, it can be acessed through this link: https://benchling.com/s/seq-WmMVF12BOPLO97arwjjJ?m=slm-kk91qiIeeQRFhGaAtZQS

I also ran digestion to see the restriction sites which showed 0 cuts for the parts that seemed possibly problematic.

Then, I exported as FASTA, and proceeded to try to use twist once again.

I uploaded the FASTA to twist and there were a LOT of warnings and one error, proceeded to try and optimize the sequence.

I had to have BT1311 promoter replaced with synthetic Bacteroides P1 promoter (Whitaker et al. 2017) for synthesis compatibility, retaining equivalent constitutive expression activity. Changed it in benchling and redid everything…

Still with issues… to finish the homework will change the ClyA portion to sfGFP + a HisTag…

For the Twist ordering exercise, sfGFP was used as a placeholder CDS to demonstrate cassette design workflow. The actual therapeutic construct (ClyA-P2) presents synthesis challenges due to coiled-coil repeat regions in ClyA, which would require either: (1) gene fragmentation and assembly, (2) use of a truncated ClyA transmembrane domain only, or (3) a custom synthesis provider tolerant of complex sequences.

Here’s the final order screen (I hope this constitutes as a done homework…)

Part 5: DNA R/W/E:

In graduation I learned a ton about mangroves and their importance, so, to sequence the mangrove microbiome, I’d target the DNA in the sediment and root zones of these coastal forests. Mangroves are massive “blue carbon” sinks, and by reading the genetic material of the microbes living there, we can find hidden enzymes that fix nutrients or even break down plastics in salty water. This helps us understand how these forests fight climate change and gives us new “bio-tools” for cleaning up the environment. I would use Illumina Sequencing for this because it’s the best for handling the complex mix of species found in soil. This is a second-generation technology that uses “sequencing by synthesis” to read millions of DNA fragments at once. First, I’d extract the DNA, break it into tiny pieces, and attach “adapters” so they can stick to a glass flow cell. Inside the machine, the DNA is copied into clusters, and as fluorescently labeled bases are added, they flash a specific color for each letter (A,T,C, or G). A camera captures these flashes, and the software translates them into digital FASTQ files that we can piece together like a giant puzzle.

For the “DNA Write” part, I’d synthesize a caffeine biosynthetic pathway to put into yeast. This would let us “brew” caffeine in a lab without needing huge coffee or tea plantations, saving a lot of land and water. I’d use silicon-based synthesis (like Twist Bioscience), which uses a silicon chip to build thousands of DNA strands at once using a chemical process called phosphoramidite chemistry. We’d print short pieces of the caffeine genes and then stitch them together into a full circuit. The main catch is that it’s hard to write very long or repetitive sequences, but it’s incredibly fast and scalable for these kinds of metabolic projects.

Finally, I’d use CRISPR-Cas9 to edit a banana or tomato so it grows with the caffeine pathway built right in. The idea is to create a “Caffeine Fruit” for a natural, healthy morning energy boost. CRISPR works like a molecular GPS and scissors; a guide RNA leads the Cas9 enzyme to a specific spot in the plant’s DNA to make a cut. By providing a “donor template” with our caffeine genes, the plant’s own repair system pastes the new instructions into its genome. We’d use a “shuttle” like Agrobacterium to get the CRISPR tools into the plant cells. It’s a powerful method, though it can sometimes be slow to grow a full plant from an edited cell, and we have to be careful that the enzyme doesn’t cut the DNA in the wrong place.