Homework

Weekly homework submissions:

  • Week 1 HW: Principles and Practices

    Living Modified Organism(LMO) Cotton Textiles That Maintain Hygiene and Emit Pleasant Scents This application envisions a cotton-based textile integrated with genetically engineered microorganisms that are designed to function stably on the fabric surface. These microorganisms would metabolize organic compounds derived from human sweat and skin debrisโ€”such as fatty acids, ammonia, and keratin degradation byproductsโ€”thereby maintaining hygiene and reducing odor formation. Unlike conventional approaches that rely on antimicrobial chemicals or synthetic fragrances, this system would operate as an engineered living material, in which biological activity is deliberately programmed to occur only within a non-natural environment (the textile itself). Conventional perfumes and detergents repeatedly introduce chemically stable, exogenous compounds that may unintentionally interfere with human endocrine systems or disrupt the skin microbiome through persistent and non-selective exposure.

  • Week 2 HW: DNA Read, Write and Edit

    Part 0: Basics of Gel Electrophoresis #S.S: ๋ถ€ํŠธ์บ ํ”„์™€ Wk2 ์ˆ˜์—…์„ ๋“ค์€ ํ›„, ๊ณผ์ œ๋ฅผ ํ™•์ธํ–ˆ์ง€๋งŒ ์ •ํ™•ํ•œ ๊ฐœ๋… ์ •๋ฆฌ๊ฐ€ ์•ˆ๋˜์–ด ์žˆ์–ด์„œ ๋ญ˜ ์ง„ํ–‰ํ•˜๋Š”์ง€ ์ดํ•ดํ•˜๊ธฐ ์–ด๋ ค์› ๋‹ค. ๋”ฐ๋ผ์„œ ChatGPT๋ฅผ ํ™œ์šฉ, it became clearer once I understood the concept of restriction enzyme(molecular scissors) and gel electrophoresis(DNA size filter). Prompt: "restriction enzyme biology ์™€ gel electrophoresis์— ๋Œ€ํ•ด์„œ ์•„์ฃผ ๊ฐ„๋‹จํžˆ ์„ค๋ช…ํ•ด์ค˜" ๋ฅผ ํ†ตํ•ด ๋‹ค์Œ๊ณผ ๊ฐ™์€ ๋‹ต๋ณ€์„ ์–ป์—ˆ๋‹ค. 1. Restriction enzyme (์ œํ•œํšจ์†Œ) biology ์ œํ•œํšจ์†Œ๋Š” ํŠน์ •ํ•œ DNA ์—ผ๊ธฐ์„œ์—ด์„ ์ธ์‹ํ•ด์„œ ๊ทธ ์ง€์ ์„ ์ž๋ฅด๋Š” ํšจ์†Œ์ž…๋‹ˆ๋‹ค. ์›๋ž˜๋Š” ๋ฐ•ํ…Œ๋ฆฌ์•„์˜ ๋ฐฉ์–ด ์‹œ์Šคํ…œ์œผ๋กœ, ์นจ์ž…ํ•œ ๋ฐ”์ด๋Ÿฌ์Šค DNA๋ฅผ ์ž˜๋ผ ํŒŒ๊ดดํ•ฉ๋‹ˆ๋‹ค. ๊ฐ ํšจ์†Œ๋Š” ๊ณ ์œ ํ•œ ์ธ์‹ ์„œ์—ด(์˜ˆ: GAATTC)์„ ๊ฐ€์ง€๋ฉฐ, ๊ทธ ์œ„์น˜์—์„œ DNA๋ฅผ ์ •ํ™•ํžˆ ์ ˆ๋‹จํ•ฉ๋‹ˆ๋‹ค. ์ด ํŠน์„ฑ ๋•๋ถ„์— ๋ถ„์ž์ƒ๋ฌผํ•™์—์„œ๋Š” DNA๋ฅผ ์›ํ•˜๋Š” ์กฐ๊ฐ์œผ๋กœ ์ž๋ฅด๋Š” ๋„๊ตฌ๋กœ ์‚ฌ์šฉ๋ฉ๋‹ˆ๋‹ค. 2. Gel electrophoresis (๊ฒ” ์ „๊ธฐ์˜๋™) ๊ฒ” ์ „๊ธฐ์˜๋™์€ DNA ์กฐ๊ฐ์„ ํฌ๊ธฐ๋ณ„๋กœ ๋ถ„๋ฆฌํ•˜๋Š” ์‹คํ—˜ ๋ฐฉ๋ฒ•์ž…๋‹ˆ๋‹ค. DNA๋Š” ์Œ์ „ํ•˜๋ฅผ ๋ ๋ฏ€๋กœ ์ „๊ธฐ๋ฅผ ๊ฑธ๋ฉด ์–‘๊ทน ๋ฐฉํ–ฅ์œผ๋กœ ์ด๋™ํ•ฉ๋‹ˆ๋‹ค. ์ž‘์€ DNA ์กฐ๊ฐ์ผ์ˆ˜๋ก ๊ฒ” ์†์„ ๋” ๋นจ๋ฆฌ, ๋” ๋ฉ€๋ฆฌ ์ด๋™ํ•ฉ๋‹ˆ๋‹ค. ๊ฒฐ๊ณผ์ ์œผ๋กœ DNA ์กฐ๊ฐ๋“ค์ด ๋ฐด๋“œ ํ˜•ํƒœ๋กœ ๋‚˜ํƒ€๋‚˜๋ฉฐ, ํฌ๊ธฐ๋ฅผ ๋น„๊ตยทํ™•์ธํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. One-line summary -Restriction enzymes: molecular scissors that cut DNA at specific sequences -Gel electrophoresis: a method to separate DNA fragments by size Part 1: Benchling & In-silico Gel Art #S.S: Benchling is such a cool platform that we can edit DNA in-silico. I uploaded โ€˜Lambda DNA.txtโ€™ and managed digest by choosing each enzyme that cuts DNA strand. My initial thought was to see the ladder that each enzyme makes to decide which figure/shape Iโ€™d like on my artwork, but the shown result were not a single bar but of multiple layers. โ€” at this stage, I have no idea how to compose an gel art.

  • Week 3 HW: Lab Automation

    A group HW: Design cell-free RNA biosensors RNA toehold switch๊ฐ€ ์ž‘๋™ํ•˜๋Š” ์›๋ฆฌ๋ฅผ recitation์—์„œ Ana๊ฐ€ ์„ค๋ช…ํ•ด ์ฃผ์—ˆ๋Š”๋ฐ, ๋‚˜๋Š” ์šฐ์„  mRNA๊ฐ€ ๋‹จ๋ฐฑ์งˆ๋กœ translation๋˜๋Š” ๊ธฐ์ „์„ ์ดํ•ดํ•˜๊ณ ๋‚˜์„œ ์ด๋Ÿฐ biosensor์˜ ๊ธฐ๋Šฅ์„ ์ดํ•ดํ•  ์ˆ˜ ์žˆ์—ˆ๋‹ค. mRNA์— ribosome์ด ์™€์„œ ๊ฒฐํ•ฉํ•œ๋‹ค. ์ฐพ์•„๋‹ค๋‹Œ๋‹ค๊ธฐ ๋ณด๋‹ค๋Š” cell๋‚ด์—์„œ ์ž์œ ๋กญ๊ฒŒ ๋– ๋‹ค๋‹ˆ๋˜ ribosome์ด ์ถฉ๋Œํ•˜์—ฌ ๊ฒฐํ•ฉํ•˜๋Š” ๊ฒƒ์— ๊ฐ€๊น๋‹ค๊ณ . Eukaryotes์˜ ๊ฒฝ์šฐ๋Š” rough endoplasmic reticulum(RER, ์„ธํฌ์†Œ๊ธฐ๊ด€organelle)์— ๊ฑฐ์น ์–ด ๋ณด์ด๋„๋ก ribosome๋“ค์ด ๋ถ™์–ด์žˆ์–ด์„œ ๋‹จ๋ฐฑ์งˆ์„ ์ƒ์‚ฐ, ๋ฐ˜๋ฉด์— Prokaryotes, ์ฆ‰, bacteria์˜ ๊ฒฝ์šฐ ribosome์€ free-floating ์ƒํƒœ๋กœ๋งŒ cytoplasm(์„ธํฌ์งˆ)์— ์กด์žฌ.

  • Week 4 HW: Protein Design Part I

    Part A. Conceptual Questions Answer any NINE of the following questions from Shuguang Zhang: (i.e. you can select two to skip) How many molecules of amino acids do you take with a piece of 500 grams of meat? (on average an amino acid is ~100 Daltons) Why do humans eat beef but do not become a cow, eat fish but do not become fish? Why are there only 20 natural amino acids? Can you make other non-natural amino acids? Design some new amino acids. Where did amino acids come from before enzymes that make them, and before life started? If you make an ฮฑ-helix using D-amino acids, what handedness (right or left) would you expect? Can you discover additional helices in proteins? Why are most molecular helices right-handed? Why do ฮฒ-sheets tend to aggregate? What is the driving force for ฮฒ-sheet aggregation? Why do many amyloid diseases form ฮฒ-sheets? Can you use amyloid ฮฒ-sheets as materials? Design a ฮฒ-sheet motif that forms a well-ordered structure. Part B: Protein Analysis and Visualization

  • Week 5 HW: Protein Design Part II

    Part A: SOD1 Binder Peptide Design (From Pranam) Superoxide dismutase 1 (SOD1) is a cytosolic antioxidant enzyme that converts superoxide radicals into hydrogen peroxide and oxygen. In its native state, it forms a stable homodimer and binds copper and zinc. Mutations in SOD1 cause familial Amyotrophic Lateral Sclerosis (ALS). Among them, the A4V mutation (Alanine โ†’ Valine at residue 4) leads to one of the most aggressive forms of the disease. The mutation subtly destabilizes the N-terminus, perturbs folding energetics, and promotes toxic aggregation.

  • Week 6 HW: Genetic Circuits - Part I

    DNA Assembly Answer these questions about the protocol in this weekโ€™s lab: What are some components in the Phusion High-Fidelity PCR Master Mix and what is their purpose? What are some factors that determine primer annealing temperature during PCR? There are two methods from this class that create linear fragments of DNA: PCR, and restriction enzyme digests. Compare and contrast these two methods, both in terms of protocol as well as when one may be preferable to use over the other. How can you ensure that the DNA sequences that you have digested and PCR-ed will be appropriate for Gibson cloning? How does the plasmid DNA enter the E. coli cells during transformation? Describe another assembly method in detail (such as Golden Gate Assembly) Explain the other method in 5 - 7 sentences plus diagrams (either handmade or online). Model this assembly method with Benchling or Asimov Kernel! Asimov Kernel

  • Week 7 HW: Genetic Circuits - Part II

Subsections of Homework

Week 1 HW: Principles and Practices


  • Living Modified Organism(LMO) Cotton Textiles That Maintain Hygiene and Emit Pleasant Scents

    This application envisions a cotton-based textile integrated with genetically engineered microorganisms that are designed to function stably on the fabric surface. These microorganisms would metabolize organic compounds derived from human sweat and skin debrisโ€”such as fatty acids, ammonia, and keratin degradation byproductsโ€”thereby maintaining hygiene and reducing odor formation.

    Unlike conventional approaches that rely on antimicrobial chemicals or synthetic fragrances, this system would operate as an engineered living material, in which biological activity is deliberately programmed to occur only within a non-natural environment (the textile itself). Conventional perfumes and detergents repeatedly introduce chemically stable, exogenous compounds that may unintentionally interfere with human endocrine systems or disrupt the skin microbiome through persistent and non-selective exposure.

    In contrast, genetically engineered microorganisms can be designed to activate only in specific physiological contexts and to simultaneously regulate both the production and degradation of fragrance-related compounds, thereby reducing the structural risks associated with continuous, indiscriminate chemical exposure. However, this approach also introduces ethical, safety, and governance challenges related to long-term human contact, environmental release, and biological containment.


  • Governance and Policy Goals for Living Cotton Textiles in KR :: “To Ensure Biosafety and Biosecurity”

    a) Prevent disruption of the human skin microbiome

    b) Minimize risks of unintended environmental dissemination

    c) Ensure users are fully informed that the product contains living, engineered organisms

{ref.} *ref1. https://www.nifs.go.kr/portal/pcon0000260/systa/actionConts.do

{ref.} *ref2. https://www.kdca.go.kr/kdca/3469/subview.do


option 1) Technical Containment via Textile-Dependent Microbial Design

Purpose:

Biosafety guidance tends to be general (prevent acute harm, prevent release), without microbiome-specific containment requirements for long-duration skin contact consumer products. Thus, require microbiome-protective technical containment for living textiles. Engineered microbes must be textile-restricted in survival, localization, and activity, and demonstrably non-colonizing to skin.

Design:

Self-limiting activity circuit

  1. Fail-safe kill switch triggered off-textile
  2. Localization constraint (stay on fiber, not on skin)

Actors:

  1. Developers (design + testing)
  2. Independent labs/IRB-like review bodies (protocol audit)
  3. Regulators/standards bodies

Risks of Failure and โ€œSuccessโ€:

  1. Failure could result in unintended persistence or adaptation
  2. Even successful containment may create long-term, poorly understood humanโ€“microbe interactions

{ref.}The Skin Microbiome: Current Techniques, Challenges, and Future Directions *https://pmc.ncbi.nlm.nih.gov/articles/PMC10223452/

option 2) Mandatory Environmental Impact Assessment for Synthetic Biology Applications (LMOs)

Purpose:

In addition to general biosafety guidelines(ํ•ฉ์„ฑ์ƒ๋ฌผํ•™ ์œก์„ฑ ๋ฐฉ์•ˆ, 2025. 3. 11.), require application-specific assessment of impacts on the realease of living cotton textiles products intended for prolonged skin contact.

Introduce a mandatory, application-specific Environmental Impact Assessment (EIA) for LMOs, to proactively evaluate environmental risks unique to engineered biological systems, in addition to compliance with existing chemical regulations.

{ref.} ํ•ฉ์„ฑ์ƒ๋ฌผํ•™ ์œก์„ฑ ๋ฐฉ์•ˆ (Ministry of Science and ICT ํšŒ๋ถ€ 2024. 9. 11. ์ƒ์ • 2024. 11. 20. ์ฒ˜๋ฆฌ 2025. 3. 11.) *https://opinion.lawmaking.go.kr/gcom/nsmLmSts/out/2203884/detailRP

Design:

Require an Environmental Impact Assessment prior to deployment for synthetic biology applications with potential environmental exposure

  1. Environmental persistence and degradation pathways
  2. Potential spread beyond intended containment
  3. Interaction with natural ecosystems and microbial communities

Actors:

  1. Developers in Academia/Industry (conduct and submit EIA)
  2. Korea Disease Control and Prevention Agency (KDCA) / Ministry of Science and ICT (MSIT)
  3. Environmental and chemical safety authorities

Risks of Failure and โ€œSuccessโ€:

  1. EIA may become a procedural formality without capturing dynamic biological risks
  2. Increased compliance burden may disadvantage smaller research groups or startups
  3. Insufficient data to assess long-term ecological effects

option 3) User-Centered Governance Through Labeling and Disclosure

Purpose:

Consumers are often unaware of the biological mechanisms underlying novel products (esp. GMO/LMO). This option prioritizes transparency and informed consent.

Design:

Clear labeling would indicate the presence of engineered microorganisms, describe their function, and provide guidance for use and disposal.

Actors:

  1. Manufacturers
  2. Consumer protection authorities

Assumptions:

Users will engage with and understand disclosed information

Risks of Failure and โ€œSuccessโ€:

  1. Information overload may lead to disregard
  2. Transparency may unintentionally provoke public anxiety or resistance

Does the option:Option 1Option 2Option 3
Enhance Biosecurity
โ€ข By preventing incidents123
โ€ข By helping respond321
Foster Lab Safety
โ€ข By preventing incident12n/a
โ€ข By helping respond33n/a
Protect the environment
โ€ข By preventing incidents113
โ€ข By helping respond322
Other considerations
โ€ข Minimizing costs and burdens to stakeholders132
โ€ข Feasibility?321
โ€ข Not impede research332
โ€ข Promote constructive applications122

Drawing on the scoring results, Option 1โ€”technical containment through precisely engineered microbial circuitsโ€”emerges as the most feasible and effective governance priority.

It consistently scores highest in preventing biosecurity, laboratory, and environmental incidents while minimizing costs and avoiding unnecessary constraints on research. By embedding safety directly into genetic and circuit-level design, this approach addresses risks at their source rather than relying on external oversight or post hoc responses.

Although it offers limited capacity for incident response, this reflects a deliberate and ethically preferable trade-off; systems designed to fail safely reduce the need for downstream intervention. While this strategy assumes long-term stability of engineered circuits and involves uncertainty around evolutionary dynamics, it provides a practical foundation that can later be complemented by regulatory review and transparency measures, making it the most realistic starting point for governing synthetic biology applications with close human and environmental interfaces.

…ethical concerns that arose: GMO๋ผ๋Š” ์šฉ์–ด๋‚˜ ๊ฐœ๋…์€ ์Œ์‹๊ณผ ๊ด€๋ จ์ด ์žˆ์–ด์„œ์ธ์ง€ ๋งค์šฐ ์นœ์ˆ™ํ•˜๊ฒŒ ๋А๊ปด์กŒ๋Š”๋ฐ, ๋ฏธ์ƒ๋ฌผ์„ ์œ ์ „์ ์œผ๋กœ ๋ณ€ํ˜•ํ•œ LMO๋ผ๋Š” ๊ฐœ๋…์€ ์ฒ˜์Œ ์•Œ๊ฒŒ ๋˜์—ˆ๋‹ค. ๋˜ํ•œ, ์œ ์ „์ž๋ณ€ํ˜•๋ฏธ์ƒ๋ฌผ์˜ ํ™˜๊ฒฝ์— ๋Œ€ํ•œ ์˜ํ–ฅ์€ ๋”์šฑ ํŒ๋‹จํ•˜๊ธฐ ์–ด๋ ค์šธ ๊ฒƒ์œผ๋กœ ๋ณด์—ฌ์ ธ์„œ, ์•ž์œผ๋กœ ํ•ฉ์„ฑ์ƒ๋ฌผํ•™ ์ œํ’ˆ๋“ค์ด ๊ฐœ๋ฐœ๋จ์— ๋”ฐ๋ผ ๋งŽ์€ ๊ทœ์ œ์™€ ์ด์Šˆ๋“ค์ด ์žˆ์„ ๊ฒƒ์œผ๋กœ ์ƒ๊ฐ๋œ๋‹ค.

Week 2 HW: DNA Read, Write and Edit

Part 0: Basics of Gel Electrophoresis

#S.S: ๋ถ€ํŠธ์บ ํ”„์™€ Wk2 ์ˆ˜์—…์„ ๋“ค์€ ํ›„, ๊ณผ์ œ๋ฅผ ํ™•์ธํ–ˆ์ง€๋งŒ ์ •ํ™•ํ•œ ๊ฐœ๋… ์ •๋ฆฌ๊ฐ€ ์•ˆ๋˜์–ด ์žˆ์–ด์„œ ๋ญ˜ ์ง„ํ–‰ํ•˜๋Š”์ง€ ์ดํ•ดํ•˜๊ธฐ ์–ด๋ ค์› ๋‹ค. ๋”ฐ๋ผ์„œ ChatGPT๋ฅผ ํ™œ์šฉ, it became clearer once I understood the concept of restriction enzyme(molecular scissors) and gel electrophoresis(DNA size filter).

  Prompt: "restriction enzyme biology ์™€ gel electrophoresis์— ๋Œ€ํ•ด์„œ ์•„์ฃผ ๊ฐ„๋‹จํžˆ ์„ค๋ช…ํ•ด์ค˜"
  ๋ฅผ ํ†ตํ•ด ๋‹ค์Œ๊ณผ ๊ฐ™์€ ๋‹ต๋ณ€์„ ์–ป์—ˆ๋‹ค.
  
  1. Restriction enzyme (์ œํ•œํšจ์†Œ) biology
  
  ์ œํ•œํšจ์†Œ๋Š” ํŠน์ •ํ•œ DNA ์—ผ๊ธฐ์„œ์—ด์„ ์ธ์‹ํ•ด์„œ ๊ทธ ์ง€์ ์„ ์ž๋ฅด๋Š” ํšจ์†Œ์ž…๋‹ˆ๋‹ค.
  ์›๋ž˜๋Š” ๋ฐ•ํ…Œ๋ฆฌ์•„์˜ ๋ฐฉ์–ด ์‹œ์Šคํ…œ์œผ๋กœ, ์นจ์ž…ํ•œ ๋ฐ”์ด๋Ÿฌ์Šค DNA๋ฅผ ์ž˜๋ผ ํŒŒ๊ดดํ•ฉ๋‹ˆ๋‹ค.
  ๊ฐ ํšจ์†Œ๋Š” ๊ณ ์œ ํ•œ ์ธ์‹ ์„œ์—ด(์˜ˆ: GAATTC)์„ ๊ฐ€์ง€๋ฉฐ, ๊ทธ ์œ„์น˜์—์„œ DNA๋ฅผ ์ •ํ™•ํžˆ ์ ˆ๋‹จํ•ฉ๋‹ˆ๋‹ค.
  ์ด ํŠน์„ฑ ๋•๋ถ„์— ๋ถ„์ž์ƒ๋ฌผํ•™์—์„œ๋Š” DNA๋ฅผ ์›ํ•˜๋Š” ์กฐ๊ฐ์œผ๋กœ ์ž๋ฅด๋Š” ๋„๊ตฌ๋กœ ์‚ฌ์šฉ๋ฉ๋‹ˆ๋‹ค.
  
  2. Gel electrophoresis (๊ฒ” ์ „๊ธฐ์˜๋™)
  ๊ฒ” ์ „๊ธฐ์˜๋™์€ DNA ์กฐ๊ฐ์„ ํฌ๊ธฐ๋ณ„๋กœ ๋ถ„๋ฆฌํ•˜๋Š” ์‹คํ—˜ ๋ฐฉ๋ฒ•์ž…๋‹ˆ๋‹ค.
  DNA๋Š” ์Œ์ „ํ•˜๋ฅผ ๋ ๋ฏ€๋กœ ์ „๊ธฐ๋ฅผ ๊ฑธ๋ฉด ์–‘๊ทน ๋ฐฉํ–ฅ์œผ๋กœ ์ด๋™ํ•ฉ๋‹ˆ๋‹ค.
  ์ž‘์€ DNA ์กฐ๊ฐ์ผ์ˆ˜๋ก ๊ฒ” ์†์„ ๋” ๋นจ๋ฆฌ, ๋” ๋ฉ€๋ฆฌ ์ด๋™ํ•ฉ๋‹ˆ๋‹ค.
  ๊ฒฐ๊ณผ์ ์œผ๋กœ DNA ์กฐ๊ฐ๋“ค์ด ๋ฐด๋“œ ํ˜•ํƒœ๋กœ ๋‚˜ํƒ€๋‚˜๋ฉฐ, ํฌ๊ธฐ๋ฅผ ๋น„๊ตยทํ™•์ธํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
  
  One-line summary
  -Restriction enzymes: molecular scissors that cut DNA at specific sequences
  -Gel electrophoresis: a method to separate DNA fragments by size

Part 1: Benchling & In-silico Gel Art

#S.S: Benchling is such a cool platform that we can edit DNA in-silico. I uploaded ‘Lambda DNA.txt’ and managed digest by choosing each enzyme that cuts DNA strand. My initial thought was to see the ladder that each enzyme makes to decide which figure/shape I’d like on my artwork, but the shown result were not a single bar but of multiple layers. — at this stage, I have no idea how to compose an gel art.

===

#S.S: Little update on my Gel Art, after attending a session from Designer Cells lab in Songdo, S. Korea.

Ana, one of our TAs, printed out the Gel Electrophoresis result for us, using an instant photo printer. The photo on the right bottom. It also is a sticker !!!

So, my first ever wetlab experience started with how to use pipettes. Next, we moved on to the main dish, how to perform gel electrophoresis.

  1. mix TAE and agarose(powder) and the solution would be microwaved until it is clear, which means it is completely dissolved.
  2. I missed the reasons- to add 1uL of SYBR Safe DNA stain, and ChatGPT explains as below:
  3. Place the comb(or we can put them later-it depends your way of doing it-just avoid forming bubbles) and pour the agarose gel.
  4. When the gel is solidified enough, remove the comb(which generated wells) and start loading the wells with your DNA stands.

There were 4 different sized DNA strands (200bp, 700bp, 2.3kb, 5kb) ready for us, thanks to one of our TAs, Hyunseo.

I designed my gel art saying “Hi”, but the result is shown rather like “HI”, which still, luckily conveys the intended meaning ;D Hyunseo guessed 5kb and 2.3kb might gone bad? and failed to show on our Gel.

I am presenting my artwork again below just to show off.

The most challenging part of wet lab was actually doing the math and figuring out volumn units. I would refer the summary from ChatGPT for my future references.


Basic Units and Calculations in Biology

1๏ธโƒฃ Volume Units (Metric System)

All volume units are based on powers of 10.

  • 1 L (liter) = base unit
  • 1 mL = 10โปยณ L
  • 1 ยตL = 10โปโถ L
  • 1 nL = 10โปโน L

Key rule: Each step (milli โ†’ micro โ†’ nano) differs by 1000ร—.

Examples:

  • 1 mL = 1000 ยตL
  • 1 ยตL = 1000 nL

2๏ธโƒฃ Mole (mol)

A mole measures the amount of substance.

1 mol = 6.022 ร— 10ยฒยณ molecules (Avogadroโ€™s number)

It is simply a very large counting unit for atoms or molecules.


3๏ธโƒฃ Molarity (M)

Molarity is concentration.

[ M = \frac{\text{moles}}{\text{liters}} ]

Example: 1 M solution = 1 mole dissolved in 1 liter.

Important:

  • mol = amount
  • M = concentration

4๏ธโƒฃ Dilution Formula

[ C_1 V_1 = C_2 V_2 ]

Used to prepare diluted solutions from a concentrated stock.

Example: To make 100 mL of 1 M solution from 10 M stock:

[ (10M)V_1 = (1M)(100mL) ]

Vโ‚ = 10 mL


5๏ธโƒฃ Key Takeaways

  • milli, micro, nano differ by 1000ร— each step
  • M means mol/L
  • Most lab work uses ยตL volumes
  • DNA amounts are often measured in ng (10โปโน g)

Part 3: DNA Design Challenge

#S.S: now, this is going to be a fun exercise!

  3.1. Choose your protein : Fibronectin type III domain-containing protein 10
  3.2. Reverse Translate: Protein (amino acid) sequence to DNA (nucleotide) sequence.
  '>sp|F2Z333|FND10_HUMAN Fibronectin type III domain-containing protein 10 OS=Homo sapiens OX=9606 GN=FNDC10 PE=1 SV=1 MRAPPLLLLLAACAPPPCAAAAPTPPGWEPTPDAPWCPYKVLPEGPEAGGGRLCFRSPARGFRCQAPGCVLHAPAGRSLRASVLRNRSVLLQWRLAPAAARRVRAFALNCSWRGAYTRFPCERVLLGASCRDYLLPDVHDSVLYRLCLQPLPLRAGPAAAAPETPEPAECVEFTAEPAGMQDIVVAMTAVGGSICVMLVVICLLVAYITENLMRPALARPGLRRHP'

  '>reverse translation of sp|F2Z333|FND10_HUMAN Fibronectin type III domain-containing protein 10 OS=Homo sapiens OX=9606 GN=FNDC10 PE=1 SV=1  to a 678 base sequence of consensus codons.
    atgmgngcnccnccnytnytnytnytnytngcngcntgygcnccnccnccntgygcngcn
  gcngcnccnacnccnccnggntgggarccnacnccngaygcnccntggtgyccntayaar
  gtnytnccngarggnccngargcnggnggnggnmgnytntgyttymgnwsnccngcnmgn
  ggnttymgntgycargcnccnggntgygtnytncaygcnccngcnggnmgnwsnytnmgn
  gcnwsngtnytnmgnaaymgnwsngtnytnytncartggmgnytngcnccngcngcngcn
  mgnmgngtnmgngcnttygcnytnaaytgywsntggmgnggngcntayacnmgnttyccn
  tgygarmgngtnytnytnggngcnwsntgymgngaytayytnytnccngaygtncaygay
  wsngtnytntaymgnytntgyytncarccnytnccnytnmgngcnggnccngcngcngcn
  gcnccngaracnccngarccngcngartgygtngarttyacngcngarccngcnggnatg
  cargayathgtngtngcnatgacngcngtnggnggnwsnathtgygtnatgytngtngtn
  athtgyytnytngtngcntayathacngaraayytnatgmgnccngcnytngcnmgnccn
  ggnytnmgnmgncayccn'

  3.3. Codon optimization.
  The purpose of codon optimisation is to improve expression efficiency. It would not alter the encoded protein but it will redesign the DNA sequence to match a host organism.

  '>Improved DNA:
  ATGGGCCCCCTGTTCGCCCTGGCCCCCCCCGGCGCCCGCCCCCCCCCCGT     
  GGGCCCCCCCGAGCCCTGGTGCCTGAAGTTCCGCGGCCGCGCCGGCGGCC     
  TGCTGCCCCGCGTGGTGCAGCCCGGCGGCAGCGCCCGCGGCTGGCGCTAA     
  CGCTTCCACGGCTGCCCCCGCGCCGGCGGCCTGCTGATGTGGGGCGCCAA     
  CGTGCCCGGCGGCCTGGGCTGGGACATCCCCAGCCAGAGCCTGGTGGTGA     
  CCAGCTGGCGCCCCCGCGCCCCCGAGCCCGACCGCGACGGCGACTACGCC     
  ACCGCCGACGCCGAGTGCGTGCACGACGCCTGGGGCCTGGTGTGCGTGTA     
  CTGCTGCGCCAACACCAAGTACGGCCGCTGCGCCGGCGGCACC'

  {ref.} https://www.jcat.de/ --- codon optimisation tool, free online access

  
  3.4. You have a sequence! Now what?
  Using plasmid? The DNA could be used to produce protein.

  3.5. How does it work in nature/biological systems?

#S.S(3.1.): Firstly, I had to recognise the differences between ‘gene vs. protein’. From my understanding, the central dogma goes like this : DNA(gene) -> RNA(transcription) -> Protein(translation) The problem is I am not familiar with names of proteins, thus come up with a gene that I’d encountered in the past! – such as, PCSK9

* PCSK9 - proprotein convertase subtilisin/kexin type 9
{ref. NCBI} https://www.ncbi.nlm.nih.gov/datasets/gene/255738/#transcripts-and-proteins

There are PCSK9 inhibitors in the market that are precsribed to regulated high LDL-C level of Dyslipidemia or FH(Familial Hyperlipidemia). But, I wasn’t happy with the NCBI lists of protein accession (NP_777596.2, etc.) since I don’t understand what they mean. In UniProtKB, however, I came across an interesting name, ‘Adnectin’ - I was curious, why does it have a special name on it? It is a protein therapeutics, according to below resources, and naturally I decided to look into Human Fibronectin deeper.

{UniProt} {google} {PubMed. NCBI}

Part 5: DNA Read/Write/Edit

5.1 DNA Read

(i) What DNA would you want to sequence (e.g., read) and why? This could be DNA related to human health (e.g. genes related to disease research), environmental monitoring (e.g., sewage waste water, biodiversity analysis), and beyond (e.g. DNA data storage, biobank). I would like to sequence my own DNA, and especially check on the currently available cancer gene test.

(ii) In lecture, a variety of sequencing technologies were mentioned. What technology or technologies would you use to perform sequencing on your DNA and why?
Also answer the following questions:

  1. Is your method first-, second- or third-generation or other? How so?

     NGS(second) method would fit for my own DNA read, since  it's fast enough and relatively cheaper that the third-gen method.
    

5.2 DNA Write

(i) What DNA would you want to synthesize (e.g., write) and why? These could be individual genes, clusters of genes or genetic circuits, whole genomes, and beyond. As described in class thus far, applications could range from therapeutics and drug discovery (e.g., mRNA vaccines and therapies) to novel biomaterials (e.g. structural proteins), to sensors (e.g., genetic circuits for sensing and responding to inflammation, environmental stimuli, etc.), to art (DNA origamis). If possible, include the specific genetic sequence(s) of what you would like to synthesize! You will have the opportunity to actually have Twist synthesize these DNA constructs! :)

I remember writing DNA of a bateriophage to used it as a therapeutic.

(ii) What technology or technologies would you use to perform this DNA synthesis and why?
Also answer the following questions:

  1. What are the essential steps of your chosen sequencing methods?
  2. What are the limitations of your sequencing method (if any) in terms of speed, accuracy, scalability?

5.3 DNA Edit

(i) What DNA would you want to edit and why? In class, George shared a variety of ways to edit the genes and genomes of humans and other organisms. Such DNA editing technologies have profound implications for human health, development, and even human longevity and human augmentation. DNA editing is also already commonly leveraged for flora and fauna, for example in nature conservation efforts, (animal/plant restoration, de-extinction), or in agriculture (e.g. plant breeding, nitrogen fixation). What kinds of edits might you want to make to DNA (e.g., human genomes and beyond) and why?

  I'm more interested in utilizing RNA than editing DNA form.

(ii) What technology or technologies would you use to perform these DNA edits and why?
Also answer the following questions:

  1. How does your technology of choice edit DNA? What are the essential steps?
  2. What preparation do you need to do (e.g. design steps) and what is the input (e.g. DNA template, enzymes, plasmids, primers, guides, cells) for the editing?
  3. What are the limitations of your editing methods (if any) in terms of efficiency or precision?

Week 3 HW: Lab Automation

A group HW: Design cell-free RNA biosensors

RNA toehold switch๊ฐ€ ์ž‘๋™ํ•˜๋Š” ์›๋ฆฌ๋ฅผ recitation์—์„œ Ana๊ฐ€ ์„ค๋ช…ํ•ด ์ฃผ์—ˆ๋Š”๋ฐ, ๋‚˜๋Š” ์šฐ์„  mRNA๊ฐ€ ๋‹จ๋ฐฑ์งˆ๋กœ translation๋˜๋Š” ๊ธฐ์ „์„ ์ดํ•ดํ•˜๊ณ ๋‚˜์„œ ์ด๋Ÿฐ biosensor์˜ ๊ธฐ๋Šฅ์„ ์ดํ•ดํ•  ์ˆ˜ ์žˆ์—ˆ๋‹ค.

  1. mRNA์— ribosome์ด ์™€์„œ ๊ฒฐํ•ฉํ•œ๋‹ค. ์ฐพ์•„๋‹ค๋‹Œ๋‹ค๊ธฐ ๋ณด๋‹ค๋Š” cell๋‚ด์—์„œ ์ž์œ ๋กญ๊ฒŒ ๋– ๋‹ค๋‹ˆ๋˜ ribosome์ด ์ถฉ๋Œํ•˜์—ฌ ๊ฒฐํ•ฉํ•˜๋Š” ๊ฒƒ์— ๊ฐ€๊น๋‹ค๊ณ . Eukaryotes์˜ ๊ฒฝ์šฐ๋Š” rough endoplasmic reticulum(RER, ์„ธํฌ์†Œ๊ธฐ๊ด€organelle)์— ๊ฑฐ์น ์–ด ๋ณด์ด๋„๋ก ribosome๋“ค์ด ๋ถ™์–ด์žˆ์–ด์„œ ๋‹จ๋ฐฑ์งˆ์„ ์ƒ์‚ฐ, ๋ฐ˜๋ฉด์— Prokaryotes, ์ฆ‰, bacteria์˜ ๊ฒฝ์šฐ ribosome์€ free-floating ์ƒํƒœ๋กœ๋งŒ cytoplasm(์„ธํฌ์งˆ)์— ์กด์žฌ.

    • ์š”์ƒˆ ๊ธฐ๋ณธ๊ฐœ๋… ํ•™์Šต์— ๋„์›€์„ ๋ฐ›๊ณ  ์žˆ๋Š” biology podcast(https://podcasts.apple.com/kr/podcast/teach-me-biology/id1525760514?l=en-GB&i=1000486622621)์˜ ๊ธฐ์–ต์„ ๋– ์˜ฌ๋ฆฌ๋ฉด, Eukaryotes/animal cell์—์„œ RER์€ protein์„ ์ƒ์‚ฐํ•˜๊ณ , smooth ER(w/o ribosome)์€ lipid๋ฅผ ์ƒ์‚ฐํ•œ๋‹ค. free ribosome์€ ์„ธํฌ ๋‚ด๋ถ€์—์„œ ์‚ฌ์šฉํ•  ๋‹จ๋ฐฑ์งˆ์„ ์ƒ์‚ฐํ•œ๋‹ค๊ณ  ํ•œ๋‹ค. ์ถ”๊ฐ€๋กœ, mitochondria๋Š” release energy & produce ATP. ๊ทธ๋ฆฌ๊ณ  risosome์€ pathogen์ด๋‚˜ dead cell, etc๋ฅผ ์—†์• ๋Š” ์—ญํ• ์„ ํ•˜๋Š” ๊ฒƒ ๊ฐ™๋‹ค.
  2. mRNA์—๋Š” RBS(ribosome binding site)๊ฐ€ ์žˆ์–ด์„œ ribosome์ด ๋ถ€์ฐฉ๋˜๊ณ  ๋ฒˆ์—ญ์„ ์‹œ์ž‘ํ•  ์ˆ˜ ์žˆ๋‹ค. ์ด RBS๋Š” bacteria์—์„œ๋Š” Shineโ€“Dalgarno sequence๋ผ๊ณ  ๋ถˆ๋ฆฐ๋‹ค๊ณ . ์‹œ์ž‘์ฝ”๋ˆ AUG ์•ž์— ์œ„์น˜ํ•œ๋‹ค.

  3. ํ•˜๋‚˜ ๋” ๋ชฐ๋ž๋˜ ๊ฐœ๋…, RNA๋Š” folding landscape, ๋™์ ์œผ๋กœ ์ ‘ํ˜”๋‹ค ํ’€๋ ธ๋‹ค ํ•˜๋Š” ๊ตฌ์กฐ์ง‘ํ•ฉ์„ ๋ณด์ด๋Š”๋ฐ, toehold switch์— ๋‚˜์˜จ hairpin ๊ตฌ์กฐ๋Š” rna ์—ผ๊ธฐ ๊ฐ€๋‹ฅ์ด ์„œ๋กœ ์—ฐ๊ฒฐ๋˜์ง€ ์•Š์€ loop, ๊ทธ๋ฆฌ๊ณ  ์ผ๋ถ€๊ฐ€ ์ƒ๋ณด์ ์œผ๋กœ ์—ฐ๊ฒฐ๋œ stem์œผ๋กœ ๊ตฌ์„ฑ๋œ loop-stem ๊ตฌ์กฐ์ด๋‹ค. ๋ชจ๋“  RNA๊ฐ€ ์ด๋ ‡๊ฒŒ ๋ชจ์–‘์ด ์ƒ๊ธด๊ฑด ์•„๋‹ˆ๊ณ  ์—ผ๊ธฐ์„œ์—ด์— ๋”ฐ๋ผ์„œ ๋™์ ์ธ๋“ฏ ํ•˜๋‹ค. ๊ทธ๋ ‡์ง€๋งŒ ๋ฐ•ํ…Œ๋ฆฌ์•„๋Š” ๋น„๊ต์  mRNA ๊ตฌ์กฐ๊ฐ€ ๋‹จ์ˆœํ•˜๋ฏ€๋กœ ์˜๋„์ ์œผ๋กœ hairpin๊ตฌ์กฐ๋กœ ๋งŒ๋“ค์–ด์„œ ํ™œ์šฉํ•˜๋Š” ๋“ฏ ํ•˜๋‹ค.

  4. Finally, toehold switch์—์„œ hairpin๊ตฌ์กฐ์˜ RNA switch๋Š” ํ‰์†Œ(off status) RBS(is hidden within stem)์™€ AUG๋ฅผ ๊ฐ€๋ฆผ์œผ๋กœ์จ ribosome์˜ ์ ‘๊ทผ์„ ๋ง‰์•„์„œ protein translation์„ ๋ง‰๋Š”๋‹ค. When switched-on(target RNA๊ฐ€ ์ ‘๊ทผ ์‹œ, it will bind and zip-open the stem followed by strand displacement(ํ—ค์–ดํ•€ ๋ถ•๊ดด), in the end, ribosome can start translation process.

    ์•ฝ๊ฐ„ ์ดํ•ด๊ฐ€ ์•ˆ๋˜์—ˆ๋˜ ๋ถ€๋ถ„์€ RNA๊ฐ€ ์ง€๊ธˆ 2์ข…์ด ๋‚˜์˜จ๋‹ค๋Š” ๊ฑฐ์˜€๋Š”๋ฐ, chatGPT์—๊ฒŒ ์งˆ๋ฌธํ•˜๋ ค๊ณ  ๊ฐœ๋…์„ ์จ๋ณด๋‹ค ๋ณด๋‹ˆ ์–ผ์ถ” ์•„๋ž˜์™€ ๊ฐ™์ด ์ดํ•ด๊ฐ€ ๋˜์—ˆ๋‹ค.

    toehold switch๋Š” bacteria์˜ RNA๊ฐ€ ํ‘œํ˜„๋˜๊ธฐ๋ฅผ hairpin ๊ตฌ์กฐ๋ฅผ ์ทจํ•˜๋„๋ก re-engineered it. hairpin ์ƒํƒœ์— ์žˆ๋‹ค๊ฐ€, target RNA(์™ธ๋ถ€์˜ phage๊ฐ™์€ ๊ฒƒ๋“ค์˜ input ๋“ฑ)๊ฐ€ ๋“ค์–ด์˜ค๋ฉด protein์„ ๋งŒ๋“ค๋„๋ก ์„ค๊ณ„.

    ChapGPT์˜ ๋„์›€์„ ๋ฐ›์•„ ์žฌ์ •๋ฆฌํ•ด๋ณด๋ฉด, Bacterial mRNA์˜ 5’UTR (AUG๊ฐ€ ๋ถ™์€ ์•ž์ชฝ ๋ถ€๋ถ„์œผ๋กœ ์˜ˆ์ƒ.)์„ hairpin ๊ตฌ์กฐ๋กœ ํ•ฉ์„ฑํ•œ ๊ฒƒ! (์ž์—ฐ์ ์œผ๋กœ ํ˜•์„ฑ๋œ switch๋กœ Riboswitch๋ผ๋Š”๊ฒŒ ์กด์žฌํ•จ)

#s.s: At this stage, I wondered what it means to be ‘cell-free’ and why it matters. Below is an examplary table from ChatGPT to help my understanding. It seems cell-free system could partially avoid GMO-regulatory risks or make things easier! (Which helps me narrow down the desired setting for my final project, i guess)

Category (ํ•ญ๋ชฉ)Cell-based (์„ธํฌ ๊ธฐ๋ฐ˜)Cell-free (์„ธํฌ ๋น„์˜์กด)
Viability (์ƒ์กด)Required (ํ•„์š”)Not required (๋ถˆํ•„์š”)
Proliferation (์ฆ์‹)Present (์žˆ์Œ)Absent (์—†์Œ)
Regulation (๊ทœ์ œ)GMO-regulated (GMO ๊ทœ์ œ)Relatively relaxed (์ƒ๋Œ€์  ์™„ํ™”)
Response Speed (๋ฐ˜์‘ ์†๋„)Slow (๋А๋ฆผ)Fast (๋น ๋ฆ„)
Circuit Complexity (ํšŒ๋กœ ๋ณต์žก์„ฑ)High (๋†’์Œ)Low (๋‚ฎ์Œ)
Field Deployability (ํ˜„์žฅ ์ ์šฉ์„ฑ)Low (๋‚ฎ์Œ)High (๋†’์Œ)

Brainstorming on RNA Biosensor, assisted by ChatGPT

  1. Target AMR(Antimicrobial Resistance) gene

Development of resistance for antibiotics seemed like a major issue in the medical field. It is routinely tested in hospital already with AST (antimicrobial susceptibility testing). Also, molecular testing for AMR gene is performed as an early guidance. Thus, RNA biosensor have potential to be the fastest detection tool replacing the current molecular test in urgent scenarios, although it might lack precision compared to AST.

  1. Microneedle patch for Vaccination

Could a biosensor detect humoral immunity after vaccine innoculation?
It is possible to check interstitial fluid (ISF) on one’s skin, but immunity is proved mostly by protein(as an antibody) and RNA sensor is not the best option to detect a vaccination result… so, I would dump this idea.

  1. Virus RNA biosensor

    Back to the familiar idea that I was introduced from the class, which is the detection of RNA viruses. RNA is THE genetic material of a pathogene and the sensor have potential to complement or partially replace PCR. In addition, it would have significant societal value - high impact on public health.

So, I’m thinking โ€œbroad coronavirus biosensorโ€ as a front-line detector.

#GPT: “Detecting viral RNA directly is important because The presence of viral RNA typically indicates an active infection, rather than past exposure or residual immunity.

In regarding Coronaviruses, related viruses such as SARS-CoV (2003) and MERS-CoV (2012) had already caused outbreaks and the coronavirus family was already known. However, SARS-CoV-2(COVID-19) was a newly emerged zoonotic virus that humans had not previously encountered at population scale. So COVID-19 was not created โ€œfrom nothing,โ€ but it represented a new spillover event of a genetically distinct virus within a known viral family.

Coronaviruses belong to the family Coronaviridae.

Within it are multiple genera:

  • Alphacoronavirus
  • Betacoronavirus (includes SARS-CoV, MERS-CoV, SARS-CoV-2)

If you design a biosensor targeting highly conserved regions (e.g., RdRp gene)

You could potentially detect:

  • SARS-like coronaviruses
  • MERS-like coronaviruses
  • Related zoonotic betacoronaviruses

This is because the RNA-dependent RNA polymerase (RdRp) region is relatively conserved across coronaviruses.

So in principle:

A conserved-region RNA biosensor could detect the presence of a coronavirus even before it is classified as a new human pathogen.


Would it detect a completely new zoonotic spillover?

Scenario 1: Spillover from a closely related SARS-like bat CoV

  • Likely detectable with a conserved-region sensor.

Scenario 2: Spillover from a distantly related coronavirus lineage

  • Detection depends on how conserved the target region is.

Scenario 3: Spillover from a completely different viral family

  • โŒ A coronavirus-targeted sensor would not detect it.

Technical Considerations to design a broad CoV biosensor,

Multi-sequence alignment across:

  • Bat SARS-like CoVs
  • Pangolin CoVs
  • Human SARS-CoV
  • SARS-CoV-2
  • MERS-CoV

Identification of:

  • Highly conserved 25โ€“35 nt regions
  • Structurally accessible regions
  • Minimal mutation hotspots

Possibly:

  • Degenerate base tolerance in design
  • Multiplexed switches covering multiple conserved motifs

   โ€” A biosensor targeting conserved coronavirus regions could potentially detect certain future zoonotic spillovers within the CoV family.

But:

  It would not detect unrelated viral families.
  It would not replace sequencing.
  It must be part of an integrated surveillance system.

Week 4 HW: Protein Design Part I

Part A. Conceptual Questions

Answer any NINE of the following questions from Shuguang Zhang: (i.e. you can select two to skip)

  How many molecules of amino acids do you take with a piece of 500 grams of meat? (on average an amino acid is ~100 Daltons)
  Why do humans eat beef but do not become a cow, eat fish but do not become fish?
  Why are there only 20 natural amino acids?
  Can you make other non-natural amino acids? Design some new amino acids.
  Where did amino acids come from before enzymes that make them, and before life started?
  If you make an ฮฑ-helix using D-amino acids, what handedness (right or left) would you expect?
  Can you discover additional helices in proteins?
  Why are most molecular helices right-handed?
  Why do ฮฒ-sheets tend to aggregate?
  What is the driving force for ฮฒ-sheet aggregation?
  Why do many amyloid diseases form ฮฒ-sheets?
  Can you use amyloid ฮฒ-sheets as materials?
  Design a ฮฒ-sheet motif that forms a well-ordered structure.

Part B: Protein Analysis and Visualization

In this part of the homework, you will be using online resources and 3D visualization software to answer questions about proteins. Pick any protein (from any organism) of your interest that has a 3D structure and answer the following questions:

  Briefly describe the protein you selected and why you selected it.
  Identify the amino acid sequence of your protein.
  How long is it? What is the most frequent amino acid? You can use this Colab notebook to count the frequency of amino acids.
  How many protein sequence homologs are there for your protein? Hint: Use Uniprotโ€™s BLAST tool to search for homologs.
  Does your protein belong to any protein family?
  Identify the structure page of your protein in RCSB
  When was the structure solved? Is it a good quality structure? Good quality structure is the one with good resolution. Smaller the better (Resolution: 2.70 ร…)
  Are there any other molecules in the solved structure apart from protein?
  Does your protein belong to any structure classification family?
  Open the structure of your protein in any 3D molecule visualization software:
  PyMol Tutorial Here (hint: ChatGPT is good at PyMol commands)
  Visualize the protein as โ€œcartoonโ€, โ€œribbonโ€ and โ€œball and stickโ€.
  Color the protein by secondary structure. Does it have more helices or sheets?
  Color the protein by residue type. What can you tell about the distribution of hydrophobic vs hydrophilic residues?
  Visualize the surface of the protein. Does it have any โ€œholesโ€ (aka binding pockets)?

Part C. Using ML-Based Protein Design Tools

Week 5 HW: Protein Design Part II

Part A: SOD1 Binder Peptide Design (From Pranam) Superoxide dismutase 1 (SOD1) is a cytosolic antioxidant enzyme that converts superoxide radicals into hydrogen peroxide and oxygen. In its native state, it forms a stable homodimer and binds copper and zinc.

Mutations in SOD1 cause familial Amyotrophic Lateral Sclerosis (ALS). Among them, the A4V mutation (Alanine โ†’ Valine at residue 4) leads to one of the most aggressive forms of the disease. The mutation subtly destabilizes the N-terminus, perturbs folding energetics, and promotes toxic aggregation.

  Your challenge:
  
  Design short peptides that bind mutant SOD1.
  Then decide which ones are worth advancing toward therapy.
  You will use three models developed in our lab:
  
  PepMLM: target sequence-conditioned peptide generation via masked language modeling
  PeptiVerse: therapeutic property prediction
  moPPIt: motif-specific multi-objective peptide design using Multi-Objective Guided Discrete Flow Matching (MOG-DFM)
  Part 1: Generate Binders with PepMLM
  Begin by retrieving the human SOD1 sequence from UniProt (P00441) and introducing the A4V mutation.
  Using the PepMLM Colab linked from the HuggingFace PepMLM-650M model card:
  Generate four peptides of length 12 amino acids conditioned on the mutant SOD1 sequence.
  To your generated list, add the known SOD1-binding peptide FLYRWLPSRRGG for comparison.
  Record the perplexity scores that indicate PepMLMโ€™s confidence in the binders.
  Part 2: Evaluate Binders with AlphaFold3
  Navigate to the AlphaFold Server: alphafoldserver.com
  For each peptide, submit the mutant SOD1 sequence followed by the peptide sequence as separate chains to model the protein-peptide complex.
  Record the ipTM score and briefly describe where the peptide appears to bind. Does it localize near the N-terminus where A4V sits? Does it engage the ฮฒ-barrel region or approach the dimer interface? Does it appear surface-bound or partially buried?
  In a short paragraph, describe the ipTM values you observe and whether any PepMLM-generated peptide matches or exceeds the known binder.
  Part 3: Evaluate Properties of Generated Peptides in the PeptiVerse
  Structural confidence alone is insufficient for therapeutic development. Using PeptiVerse, letโ€™s evaluate the therapeutic properties of your peptide! For each PepMLM-generated peptide:
  
  Paste the peptide sequence.
  Paste the A4V mutant SOD1 sequence in the target field.
  Check the boxes
  Predicted binding affinity
  Solubility
  Hemolysis probability
  Net charge (pH 7)
  Molecular weight
  Compare these predictions to what you observed structurally with AlphaFold3. In a short paragraph, describe what you see. Do peptides with higher ipTM also show stronger predicted affinity? Are any strong binders predicted to be hemolytic or poorly soluble? Which peptide best balances predicted binding and therapeutic properties?
  
  Choose one peptide you would advance and justify your decision briefly.
  
  Part 4: Generate Optimized Peptides with moPPIt
  Now, move from sampling to controlled design. moPPIt uses Multi-Objective Guided Discrete Flow Matching (MOG-DFM) to steer peptide generation toward specific residues and optimize binding and therapeutic properties simultaneously. Unlike PepMLM, which samples plausible binders conditioned on just the target sequence, moPPIt lets you choose where you want to bind and optimize multiple objectives at once.
  
  Open the moPPit Colab linked from the HuggingFace moPPIt model card
  Make a copy and switch to a GPU runtime.
  In the notebook:
  Paste your A4V mutant SOD1 sequence.
  Choose specific residue indices on SOD1 that you want your peptide to bind (for example, residues near position 4, the dimer interface, or another surface patch).
  Set peptide length to 12 amino acids.
  Enable motif and affinity guidance (and solubility/hemolysis guidance if available). Generate peptides.
  After generation, briefly describe how these moPPit peptides differ from your PepMLM peptides. How would you evaluate these peptides before advancing them to clinical studies?

Part C: Final Project: L-Protein Mutants High level summary: The objective of this assignment is to improve the stability and auto-folding of the lysis protein of a MS2-phage. This mechanism is key to the understanding of how phages can potentially solve antibiotic-resistance.

Week 6 HW: Genetic Circuits - Part I

DNA Assembly Answer these questions about the protocol in this weekโ€™s lab:

  What are some components in the Phusion High-Fidelity PCR Master Mix and what is their purpose?
  What are some factors that determine primer annealing temperature during PCR?
  There are two methods from this class that create linear fragments of DNA: PCR, and restriction enzyme digests. Compare and contrast these two methods, both in terms of protocol as well as when one may be preferable to use over the other.
  How can you ensure that the DNA sequences that you have digested and PCR-ed will be appropriate for Gibson cloning?
  How does the plasmid DNA enter the E. coli cells during transformation?
  Describe another assembly method in detail (such as Golden Gate Assembly)
  Explain the other method in 5 - 7 sentences plus diagrams (either handmade or online).
  Model this assembly method with Benchling or Asimov Kernel!

Asimov Kernel

  Create a Repository for your work
  Create a blank Notebook entry to document the homework and save it to that Repository
  Explore the devices in the Bacterial Demos Repo to understand how the parts work together by running the Simulator on various examples, following the instructions for the simulator found in the โ€œInfoโ€ panel (click the โ€œiโ€ icon on the right to open the Info panel)
  Create a blank Construct and save it to your Repository
  Recreate the Repressilator in that empty Construct by using parts from the Characterized Bacterial Parts repository
  Search the parts using the Search function in the right menu
  Drag and drop the parts into the Construct
  Confirm it works as expected by running the Simulator (โ€œplayโ€ button) and compare your results with the Repressilator Construct found in the Bacterial Demos repository
  Document all of this work in your Notebook entry - you can copy the glyph image and the simulator graphs, and paste them into your Notebook
  Build three of your own Constructs using the parts in the Characterized Bacterials Parts Repo
  Explain in the Notebook Entry how you think each of the Constructs should function
  Run the simulator and share your results in the Notebook Entry
  If the results donโ€™t match your expectations, speculate on why and see if you can adjust the simulator settings to get the expected outcome

Week 7 HW: Genetic Circuits - Part II