研究生: |
廖信睿 Sin-ruei Liao |
---|---|
論文名稱: |
改良BLAST之功能以搜尋異種脊椎動物之啟動子/加強子 Improve BLAST for searching different bertebrate promoter/enhancer |
指導教授: |
呂永和
Yung-Ho Lu |
口試委員: |
羅乃維
Nai-Wei Lo 鮑興國 Hsing-Kuo Kenneth Pao |
學位類別: |
碩士 Master |
系所名稱: |
管理學院 - 資訊管理系 Department of Information Management |
論文出版年: | 2005 |
畢業學年度: | 93 |
語文別: | 中文 |
論文頁數: | 57 |
中文關鍵詞: | BLAST 、啟動子 、轉錄因子 |
外文關鍵詞: | BLAST, transcriptional factor, promoter |
相關次數: | 點閱:315 下載:3 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
人類基因的完全定序是生物醫學上的重大成就,然而要進一步分析基因的功能調控則是後基因体時代更大的挑戰。轉錄因子與啟動子的相互作用可以影響很多重要的生理功能,而其缺陷可以引起各種疾病。然而截至目前只有少數啟動子已被確認。
BLAST是是目前做生物資訊研究的人,最常使用到的工具。BLAST的演算法主要根據一定的核酸長度(11mers)完全配對才能做比較,並不適合用來做短基因序列之間的alignment,加上人類和斑馬魚之間演化上的差距比較大,因此BLAST無法找出人類和斑馬魚之間有意義的alignment。本研究希望能夠透過修改BLAST的演算法,開發一個工具,藉由比較自由的參數設定,放寬alignment的條件限制,改良BLAST之功能,以達到搜尋異種脊椎動物之啟動子序列之目的。
The analysis of novel gene functions poses the major challenge in the post-genomic era. Especially, the identification of transcription factors and their binding sites are deem to be very useful. Binding a transcription factor to its binding site may cause many physiological functions; defects in the binding may cause diseases. However, only a few transcription binding sites are identified.
BLAST is one of the most frequently used tools in comparison of sequences. However, the algorithm is based on fixed length (11 mers). Therefore, it is not suitable for alignment of short stretch of DNA sequences, such as the transcription factor bind site. Moreover, BLAST is also not suitable for comparison between two genomic sequences with long evolutional distance, such as zebrafish and human genomic sequences. In this study, we modified the BLAST algorithm by changing its seeding procedure; make it possible to search for the promoter/enhancer sequences.
1 Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403-410.
2 Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389-402.
3 Bulyk M L. Computational prediction of transcription-factor binding site locations
Genome Biol 2003, 5:201-211.
4 Detrich HW, Westerfield M and Zon LI, Overview of the zebrafish system. Methods Cell Biol 1999;59:3–10.
5 Gish W, States DJ. Identification of protein coding regions by database similarity search. Nat Genet. 1993;3:266-272.
6 Gottgens B, Barton LM, Chapman MA, Sinclair AM, Knudsen B, Grafham D, Gilbert JG, Rogers J, Bentley DR, Green AR. Transcriptional regulation of the stem cell leukemia gene (SCL)--comparative analysis of five vertebrate SCL loci. Genome Res. 2002;12:749-759.
7 Gottgens B, Barton LM, Gilbert JG, Bench AJ, Sanchez MJ, Bahn S, Mistry S, Grafham D, McMurray A, Vaudin M, Amaya E, Bentley DR, Green AR, and Sinclair AM. Analysis of vertebrate SCL loci identifies conserved enhancers. Nat Biotechnol. 2000;18:181-186.
8 Gottgens B., Nastos A, Kinston S, Piltz S, Delabesse EC, Stanley M, Sanchez MJ, Ciau-Uitz A, Patient R, and Green AR. 2002. Establishing the transcriptional programme for blood: the SCL stem cell enhancer is regulated by a multiprotein complex containing Ets and GATA factors. EMBO J. 21:3039–3050.
9 Heicklen-Klein A, McReynolds LJ, Evans T. Using the zebrafish model to study GATA transcription factors. Semin Cell Dev Biol. 2005;16:95-106.
10 Higgins D, Taylor W “Bioinformatics: Sequence, structure, and databanks: A Practical Approach” Oxford University Press, 2000
11 Korf I, Yandell M and Bedell J. “BLAST” O’Reilly & Associates, Inc. 2003
12 Loots GG and Ovcharenko I. rVISTA 2.0: evolutionary analysis of transcription factor binding sites. Nucleic Acids Res. 2004;32:W217-21.
13 Loots GG, Ovcharenko I, Pachter L, Dubchak I, Rubin EM. rVista for comparative sequence-based discovery of functional transcription factor binding sites. Genome Res. 2002;12:832-839.
14 Mayor C, Brudno M, Schwartz JR, Poliakov A, Rubin EM, Frazer KA, Pachter LS, Dubchak I. VISTA : visualizing global DNA sequence alignments of arbitrary length. Bioinformatics. 2000;16:1046-1047.
15 Morgenstern B, Frech K, Dress A, Werner T. DIALIGN: finding local similarities by multiple sequence alignment. Bioinformatics. 1998;14:290-294.
16 Ovcharenko I, Loots GG, Hardison RC, Miller W, Stubbs L. zPicture: dynamic alignment and visualization tool for analyzing conservation profiles. Genome Res. 2004;14:472-477.
17 Schug J, Schuller WP, Kappen C, Salbaum JM, Bucan M, Stoeckert CJ Jr. Promoter features related to tissue specificity as measured by Shannon entropy. Genome Biol. 2005;6:R33.
18 Schwartz S, Zhang Z, Frazer KA, Smit A, Riemer C, Bouck J, Gibbs R, Hardison R, Miller W. PipMaker--a web server for aligning two genomic DNA sequences. Genome Res. 2000;10:577-586.
19 Stern HM, Zon LI. Cancer genetics and drug discovery in the zebrafish. Nat Rev Cancer. 2003;3:533-539.
20 Suzuki Y, Yamashita R, Shirota M, Sakakibara Y, Chiba J, Mizushima-Sugano J, Nakai K, Sugano S. Sequence comparison of human and mouse genes reveals a homologous block structure in the promoter regions. Genome Res. 2004;14:1711-1718.