Week 4 HW: Protein Design Part I
Part B: Protein Analysis and Visualization
In this part of the homework, you will be using online resources and 3D visualization software to answer questions about proteins.
1. Pick any protein (from any organism) of your interest that has a 3D structure and answer the following questions. Briefly describe the protein you selected and why you selected it.
I chose the p53 protein, which triggers programmed cell death when ailments like cancer cause extensive DNA damage from oxidative stress like UV light, oxygen radicals or chemicals. In a cancerous cell, the p53 protein will travel to the nucleus and signal the mitochondria to release reactive oxygen species or increase calcium levels. Other death factors released include cytochrome c, which activates caspases and SMAC which blocks survival proteins (Fogg et al., 2011). I selected this protein as mutations in this protein can cause cancer and it is vital to protect the human genome from damage .
- Identify the amino acid sequence of your protein. How long is it? What is the most frequent amino acid? You can use this notebook to count most frequent amino acid - https://colab.research.google.com/drive/1vlAU_Y84lb04e4Nnaf1axU8nQA6_QBP1?usp=sharing
p53 is 393 amino acids long.
The most common amino acid is: P (Proline), which appears 45 times.
How many protein sequence homologs are there for your protein? Hint: Use the pBLAST tool to search for homologs and ClustalOmega to align and visualize them.
I found 175 total homologs using the p53 human version (https://www.uniprot.org/uniprotkb/P04637/entry).
My Cluster alignment sequences are below:
Does your protein belong to any protein family?
The protein belongs to the p53 family.
Identify the structure page of your protein in RCSB When was the structure solved? Is it a good quality structure? Good quality structure is the one with good resolution. Smaller the better (Resolution: 2.70 Å)
The structure for 1TUP (https://www.rcsb.org/structure/1TUP) was solved and made public in 1995. It is 2.20 Å, higher resolution, so it is a high quality structure.
Are there any other molecules in the solved structure apart from protein?
DNA which is bound and complexed with p53.
Does your protein belong to any structure classification family?**
Open the structure of your protein in any 3D molecule visualization software: PyMol Tutorial Here (hint: ChatGPT is good at PyMol commands) Visualize the protein as “cartoon”, “ribbon” and “ball and stick”.
Color the protein by secondary structure. Does it have more helices or sheets?
Color the protein by residue type. What can you tell about the distribution of hydrophobic vs hydrophilic residues?
Visualize the surface of the protein. Does it have any “holes” (aka binding pockets)?
References
Fogg, V. C., Lanning, N. J., & MacKeigan, J. P. (2011). Mitochondria in cancer: At the crossroads of life and death. Chinese Journal of Cancer, 30(8), 526–539. https://doi.org/10.5732/cjc.011.10018