BIOINFORMATICS
Project – Analysis
Two genomic sequences and one mRNA sequence are available for analysis. One genomic sequence is the gene sequence and the other is the promoter sequence for the gene. You have cloned and sequenced this novel (never sequenced or identified before) gene and promoter from human mammary tissue. You have also characterized the mRNA of the gene by identifying the exon/intron boundaries. The exons are located at base pairs 1-510, 1401-1640, 2299-2538, 2941-3081, and 3672-5121 of the genomic sequence. After some initial database searches, you suspect that the gene arose by exon shuffling between several different genes. Your job is to identify and characterize the promoter, gene, mRNA, and protein, and to decide whether your exon shuffling hypothesis is correct.
Summarize your findings and your proposed function of this gene in the mammary gland. Include how you arrived at your conclusions, i.e., the databases used and the information gathered. Do not copy and paste raw results into your final draft. If you would like to include this information, add it as a second attachment or an appendix.
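As a starting point, the exon coordinates given above can be turned into exon lengths, intron intervals, and a spliced sequence to compare against the supplied mRNA. The following is a minimal sketch, not a required method: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the genomic sequence has been loaded as a plain string.

```python
# Exon coordinates from the assignment (assumed 1-based, inclusive).
EXONS = [(1, 510), (1401, 1640), (2299, 2538), (2941, 3081), (3672, 5121)]


def exon_lengths(exons):
    """Length in base pairs of each exon."""
    return [end - start + 1 for start, end in exons]


def intron_intervals(exons):
    """1-based inclusive intervals lying between consecutive exons."""
    return [
        (prev_end + 1, next_start - 1)
        for (_, prev_end), (next_start, _) in zip(exons, exons[1:])
    ]


def splice(genomic, exons):
    """Concatenate the exon segments of a genomic sequence string."""
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return "".join(genomic[start - 1:end] for start, end in exons)


print("Exon lengths:", exon_lengths(EXONS))
print("Total exonic length:", sum(exon_lengths(EXONS)))
print("Introns:", intron_intervals(EXONS))
```

With these coordinates the exons are 510, 240, 240, 141, and 1450 bp long, 2581 bp in total. Comparing `splice(genomic, EXONS)` with the supplied mRNA is a quick check that the boundaries were read correctly; any extra bases at the 3' end of the mRNA (for example a poly(A) tail) would be expected to differ.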
1. Promoter Sequence Analysis (300 words)
– Conduct all appropriate database searches to identify all homology and all significant promoter regions. Explain the results and conclusions and the reasoning for the conclusions, e.g. statistical significance.
2. Genomic Sequence Analysis (300 words)
– Conduct all appropriate database searches to identify homology in all regions of the gene. Explain the results and conclusions and the reasoning for the conclusions, e.g. statistical significance.
3. mRNA Sequence Analysis (300 words)
– Conduct all appropriate database searches to identify homology in all regions of the mRNA. Explain the results and conclusions and the reasoning for the conclusions, e.g. statistical significance.
4. Protein Sequence Analysis (300 words)
– Translate the mRNA into the correct protein sequence (i.e., in the correct reading frame). Conduct all appropriate database searches to identify homology in all regions of the protein sequence and significant protein regions. Explain the results and conclusions and the reasoning for the conclusions, e.g. statistical significance.
5. Conclusion (300 words)
– Use the homology information gained to identify all possible functions of the protein and expression patterns and/or locations of the protein. Determine if exon shuffling is possible and explain why or why not.
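For the translation step in section 4, one common way to choose the reading frame is to take the longest open reading frame that begins with ATG and ends at an in-frame stop codon. The sketch below illustrates that heuristic only: the function name `longest_orf` is hypothetical, it assumes the standard genetic code and a sense-strand mRNA, and the longest ORF is not guaranteed to be the true coding sequence, so the result should be confirmed against the homology searches.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def longest_orf(mrna):
    """Return (1-based start position, protein) of the longest ATG-to-stop ORF.

    Returns (0, "") if no complete ORF is found.
    """
    seq = mrna.upper().replace("U", "T")
    best = (0, "")
    for start in range(len(seq) - 2):
        if seq[start:start + 3] != "ATG":
            continue
        protein = []
        for i in range(start, len(seq) - 2, 3):
            aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks ambiguous codons
            if aa == "*":
                break
            protein.append(aa)
        else:
            continue  # ran off the end without a stop codon: skip this ORF
        if len(protein) > len(best[1]):
            best = (start + 1, "".join(protein))
    return best


# Toy example: ATG GCT TGA starting at position 3 encodes Met-Ala.
print(longest_orf("CCATGGCTTGAAA"))
```

The reported start position also shows which exon contains the start codon, which helps when mapping protein regions back to individual exons for the exon shuffling question.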