Week 4 HW: Protein Design Part I
Part A. Conceptual Questions
Why do humans eat beef but do not become a cow, eat fish but do not become fish?
When you eat beef or fish, your body does not keep the meat intact and turn it into “cow tissue” or “fish tissue.” Instead, your digestive system breaks everything down into basic molecules, like proteins into amino acids, fats into fatty acids + glycerol, carbohydrates into simple sugars and DNA into nucleotides.
Why are there only 20 natural amino acids?
Life could have used more (and sometimes does), but 20 appears to be a near-optimal balance. But why is that so? The Genetic Code has limits. Proteins are built using codons — 3-letter sequences in DNA/RNA. ➜ 4 bases (A, U/T, C, G), 3 positions per codon, 4³ = 64 possible codons. But 3 are stop signals and many codons are redundant (multiple codons for the same amino acids). Therefore, The code settled on 20 standard amino acids early in evolution and became highly conserved. Changing the code would break nearly all exiting proteins and would be catastrophically disruptive!
Can you make other non-natural amino acids? Design some new amino acids.
Synthetic biology and medicinal chemistry routinely create non-natural amino acids, and some are even genetically encoded in engineered organisms. Some are chemically synthesized and incorporated during peptide synthesis, while others are genetically encoded using engineered tRNA/synthetase systems.
One amino acid would be Alkyne-Lysine (Bioorthogonal Handle). The lysine’s side chain is modified to include a terminal alkyne. Alkynes allow click chemistry (azide–alkyne cycloaddition), site-specific labeling and fluorescent tagging. This amino acid could be used in protein imagin, drug conjugationg and synthetic protein networks.
Where did amino acids come from before enzymes that make them, and before life started?
Amino acids are within all living things on Earth, being the building blocks of proteins. Proteins are essential for many processes within living organisms, including catalysing reactions (enzymes), replicating genetic material (ribosomes), transporting molecules (transport proteins) and providing a structure to cells and organisms (e.g. collagen). Therefore, amino acids would have been needed in significant amounts within the region where life began on Earth. The Miller–Urey Experiment (1953) showed, that organic molecules can spontaneously form under plausible early-Earth conditions. Chemists simulated early Earth’s atmosphere and within days, the flask contained amino acids like Glycine, Alanine and Aspartate. Without enzymes or cells, just chemistry. That means, enzymes didn´t invent amino acids. Instead, Geochemistry made amino acids, those amino acids accumulated, some began forming short peptides, eventually, self-replicating systems emerged and only later did enzyme-based metabolism evolve.
If you make an α-helix using D-amino acids, what handedness (right or left) would you expect?
If you build an α-helix entirely from D-amino acids, it will form a left-handed helix. The reason why is that natural proteins use L-amino acids. In Biology, almost all amino acids are L and standard α-helices in proteins are right-handed. D-amino acids are mirror images of L-amino acids. So if you build A peptide from L residues ➜ right-handed α-helix and the exact mirror molecule (all D residues) must adopt the mirror conformation. Therefore, the entire structure inverts and the mirror image of a right-handed helix is a left-handed helix.
Can you discover additional helices in proteins?
Yes, and in fact, there already has been discovered additional helices beyond the standard α-helix. But whether new helices can exist is a deeper structural question. Other helices are: 3₁₀ Helix and π-Helix. There can be more, but it is very constrained.
Why are most molecular helices right-handed?
Most molecular helices in biology are right-handed because life uses L-amino acids, and L stereochemistry makes the right-handed α-helix energetically favored.
Why do β-sheets tend to aggregate?
β-sheets aggregate because their backbone hydrogen bonding is unsatisfied at the edges, and the easiest way to satisfy it is by binding to another β-sheet.
8.1 What is the driving force for β-sheet aggregation?
The driving force for β-sheet aggregation is driven by a combination of backbone hydrogen bonding, hydrophobic interactions, and water-mediated entropy effects, with cooperativity making it autocatalytic.
Why do many amyloid diseases form β-sheets?
Amyloid diseases form β-sheets because β-sheets have exposed hydrogen-bonding edges at misfolded regions, β-strands are geometrically compatible with stacking and fibril formation, hydrophobic and polar side chains stabilize sheet stacking, cross-β fibrils represent a low-energy, highly stable state and misfolding exposes β-prone sequences that nucleate aggregation.
9.1 Can you use amyloid β-sheets as materials?
Yes, amyloid β-sheets are not just pathological; their structural properties make them ideal building blocks for engineered materials. Some amyloid-based materials are:
- Hidrogels ➜ Short amyloidogenic peptides form cross-β networks in water and creates soft, viscoelastic gels that can be used in tissue engineering scaffolds, drug delivery systems and 3D cell culture matrices.
- Nanofibers and Films ➜ Amyloid fibrils can be aligned to make strong, thin fibers and they can be embedded in composites for e.g. biocompatible electronics
- Functionalized Materials ➜ Side chains can be chemically modified to bind metals, fluorophores, or enzymes and enables catalytic amyloid materials, light-responsive materials, and sensing platforms
Part B: Protein Analysis and Visualization
I selected the human hemoglobin because it is a crucial and very well-known protein that transports oxygen from the lungs to tissues and carbon dioxide back to the lungs.

The structure of hemoglobin. Source: https://chemistwizards.com/wp-content/uploads/2026/01/hemoglobin-structure-1024x687.webp
- 🩸Sequence:
sp|P69905|HBA_HUMAN Hemoglobin subunit alpha OS=Homo sapiens OX=9606 GN=HBA1 PE=1 SV=2 MVLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHGSAQVKGHG KKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLSHCLLVTLAAHLPAEFTP AVHASLDKFLASVSTVLTSKYR
🩸This is the frequency of amino acids from Google Colab:

🩸On UniProt´s Blast tool, it showed that there are 113 protein sequence homologs.

🩸Hemoglobin belongs to the globin superfamily, which is a large group of proteins that bind heme and transport or store oxygen. Some common features of globins are globin fold, heme-binding pocket and conserved residues. Also, within the globin superfamily, hemoglobin has a subfamily ➜ Alpha-globin and Beta-globin.
- 🩸The structure from RCSB was released in 1998-04-29. The resolution is 1.80 Å

🩸There is a molecule in the structure. A ligand called “PROTOPORPHYRIN IX CONTAINING FE”
References
- https://pmc.ncbi.nlm.nih.gov/articles/PMC10105836/
- https://www.chemistryworld.com/features/why-are-there-20-amino-acids/3009378.article
- https://www.pittwire.pitt.edu/pittwire/features-articles/liu-chemistry-proteins-synthesis#:~:text=to%20Pittwire%20Today-,A%20new%20chemical%20process%20makes%20it%20easier%20to%20craft%20amino,proteins%20or%20their%20smaller%20cousins.
- https://astrobiology.com/2023/04/how-were-amino-acids-formed-before-the-origin-of-life-on-earth.html#:~:text=After%20several%20millions%20of%20years,other%2C%20similar%20to%20human%20hands.
- https://pmc.ncbi.nlm.nih.gov/articles/PMC8508955/#:~:text=Abstract,conductive%20materials%2C%20and%20catalytic%20materials.