
Author: 廖信睿
Sin-ruei Liao
Thesis title: 改良BLAST之功能以搜尋異種脊椎動物之啟動子/加強子
Improve BLAST for searching different vertebrate promoter/enhancer
Advisor: 呂永和
Yung-Ho Lu
Oral examination committee: 羅乃維
Nai-Wei Lo
鮑興國
Hsing-Kuo Kenneth Pao
Degree: Master
Department: College of Management - Department of Information Management
Publication year: 2005
Academic year of graduation: 93 (ROC calendar)
Language: Chinese
Number of pages: 57
Keywords (Chinese): BLAST, 啟動子 (promoter), 轉錄因子 (transcription factor)
Keywords (English): BLAST, transcription factor, promoter
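  • The complete sequencing of the human genome is a major achievement in biomedicine, but going further to analyze how gene function is regulated is an even greater challenge of the post-genomic era. Interactions between transcription factors and promoters can affect many important physiological functions, and defects in these interactions can cause a variety of diseases. To date, however, only a few promoters have been identified.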

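      BLAST is currently the tool most commonly used by bioinformatics researchers. The BLAST algorithm requires an exact match over a fixed nucleotide length (11-mers) before it compares two sequences, so it is poorly suited to aligning short gene sequences. In addition, the evolutionary distance between human and zebrafish is relatively large, so BLAST cannot find meaningful alignments between human and zebrafish sequences. This study aims to modify the BLAST algorithm and develop a tool that, through more flexible parameter settings, relaxes the alignment constraints and improves BLAST so that it can search for promoter sequences across different vertebrate species.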


    The analysis of novel gene functions poses the major challenge of the post-genomic era. In particular, the identification of transcription factors and their binding sites is deemed very useful. The binding of a transcription factor to its binding site can affect many physiological functions, and defects in this binding can cause disease. However, only a few transcription factor binding sites have been identified.

    BLAST is one of the most frequently used tools for sequence comparison. However, its algorithm is based on exact matches of a fixed length (11-mers). It is therefore not suitable for aligning short stretches of DNA sequence, such as transcription factor binding sites. Moreover, BLAST is not suitable for comparing two genomic sequences separated by a long evolutionary distance, such as the zebrafish and human genomic sequences. In this study, we modified the BLAST algorithm by changing its seeding procedure, making it possible to search for promoter/enhancer sequences.
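    The difference between exact seeding and a relaxed seeding step can be sketched as follows. This is only an illustrative sketch, not the thesis's implementation: the function names `exact_seeds` and `relaxed_seeds` are hypothetical, and reading T as "the minimum number of matching positions inside a window of W bases" is an assumption suggested by the W=11, T=11/10/9 settings listed in the table of contents.

```python
def exact_seeds(query, subject, w=11):
    """Return (i, j) pairs where query[i:i+w] equals subject[j:j+w] exactly."""
    # Index every w-mer of the subject by its start position.
    index = {}
    for j in range(len(subject) - w + 1):
        index.setdefault(subject[j:j + w], []).append(j)
    # Look up every w-mer of the query in that index.
    hits = []
    for i in range(len(query) - w + 1):
        for j in index.get(query[i:i + w], []):
            hits.append((i, j))
    return hits


def relaxed_seeds(query, subject, w=11, t=9):
    """Return (i, j) pairs whose w-long windows agree in at least t positions.

    ASSUMPTION: t is interpreted as a minimum match count per window;
    the thesis abstract does not define it. Brute force, for clarity only.
    """
    hits = []
    for i in range(len(query) - w + 1):
        q = query[i:i + w]
        for j in range(len(subject) - w + 1):
            s = subject[j:j + w]
            matches = sum(a == b for a, b in zip(q, s))
            if matches >= t:
                hits.append((i, j))
    return hits
```

    With an 11-base query and a subject that contains the same 11 bases with a single substitution, `exact_seeds` finds nothing, while `relaxed_seeds` with `t=10` reports the window. Setting `t` equal to `w` reduces the relaxed version to exact matching, which is one way to see why lowering the threshold helps with short or diverged sequences.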

    Chapter 1 Introduction
      1.1 Overview of bioinformatics tools
      1.2 Research motivation and objectives
    Chapter 2 Related work
      2.1 Zebrafish
      2.2 Promoters
      2.3 NCBI
      2.4 Introduction to BLAST
      2.5 The BLAST algorithm
        2.5.1 seeding
        2.5.2 extension
        2.5.3 evaluation
      2.6 SCL
    Chapter 3 Research methods
      3.1 BLASTN
        3.1.1 Human
        3.1.2 Zebrafish
        3.1.3 Summary
      3.2 BL2SEQ: sequences containing the SCL gene
        3.2.1 Human 168 bp vs. human GI:6911928
        3.2.2 Zebrafish 173 bp vs. zebrafish GI:14669430
        3.2.3 Human GI:6911928 vs. zebrafish GI:14669430
      3.3 BL2SEQ: pairwise comparisons
        3.3.1 Human vs. Mouse
        3.3.2 Human vs. Chicken
        3.3.3 Human vs. Pufferfish
        3.3.4 Human vs. Zebrafish
        3.3.5 Mouse vs. Chicken
        3.3.6 Mouse vs. Pufferfish
        3.3.7 Mouse vs. Zebrafish
        3.3.8 Chicken vs. Pufferfish
        3.3.9 Chicken vs. Zebrafish
        3.3.10 Pufferfish vs. Zebrafish
        3.3.11 Summary
      3.4 Biological evolution
      3.5 Transcription factor binding sites
      3.6 Algorithm
        3.6.1 seeding
        3.6.2 extension
        3.6.3 evaluation
    Chapter 4 Experimental results
      4.1 Description
      4.2 Gottgens
        4.2.1 W=11, T=11
        4.2.2 W=11, T=10
        4.2.3 W=11, T=9
        4.2.4 Summary
      4.3 Sequences containing the SCL gene
        4.3.1 Human 168 bp vs. human GI:6911928
        4.3.2 Zebrafish 173 bp vs. zebrafish GI:14669430
        4.3.3 Human 168 bp vs. zebrafish GI:14669430
        4.3.4 Zebrafish 173 bp vs. human GI:6911928
        4.3.5 Human GI:6911928 vs. zebrafish GI:14669430
        4.3.6 Summary
    Chapter 5 Conclusions
      5.1 Research contributions
      5.2 Future work
    References

    1 Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403-410.

    2 Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389-402.

    3 Bulyk ML. Computational prediction of transcription-factor binding site locations. Genome Biol. 2003;5:201-211.

    4 Detrich HW, Westerfield M, Zon LI. Overview of the zebrafish system. Methods Cell Biol. 1999;59:3-10.

    5 Gish W, States DJ. Identification of protein coding regions by database similarity search. Nat Genet. 1993;3:266-272.

    6 Gottgens B, Barton LM, Chapman MA, Sinclair AM, Knudsen B, Grafham D, Gilbert JG, Rogers J, Bentley DR, Green AR. Transcriptional regulation of the stem cell leukemia gene (SCL)--comparative analysis of five vertebrate SCL loci. Genome Res. 2002;12:749-759.

    7 Gottgens B, Barton LM, Gilbert JG, Bench AJ, Sanchez MJ, Bahn S, Mistry S, Grafham D, McMurray A, Vaudin M, Amaya E, Bentley DR, Green AR, Sinclair AM. Analysis of vertebrate SCL loci identifies conserved enhancers. Nat Biotechnol. 2000;18:181-186.

    8 Gottgens B, Nastos A, Kinston S, Piltz S, Delabesse EC, Stanley M, Sanchez MJ, Ciau-Uitz A, Patient R, Green AR. Establishing the transcriptional programme for blood: the SCL stem cell enhancer is regulated by a multiprotein complex containing Ets and GATA factors. EMBO J. 2002;21:3039-3050.

    9 Heicklen-Klein A, McReynolds LJ, Evans T. Using the zebrafish model to study GATA transcription factors. Semin Cell Dev Biol. 2005;16:95-106.

    10 Higgins D, Taylor W. Bioinformatics: Sequence, Structure, and Databanks: A Practical Approach. Oxford University Press; 2000.

    11 Korf I, Yandell M, Bedell J. BLAST. O'Reilly & Associates, Inc.; 2003.

    12 Loots GG, Ovcharenko I. rVISTA 2.0: evolutionary analysis of transcription factor binding sites. Nucleic Acids Res. 2004;32:W217-21.

    13 Loots GG, Ovcharenko I, Pachter L, Dubchak I, Rubin EM. rVista for comparative sequence-based discovery of functional transcription factor binding sites. Genome Res. 2002;12:832-839.

    14 Mayor C, Brudno M, Schwartz JR, Poliakov A, Rubin EM, Frazer KA, Pachter LS, Dubchak I. VISTA: visualizing global DNA sequence alignments of arbitrary length. Bioinformatics. 2000;16:1046-1047.

    15 Morgenstern B, Frech K, Dress A, Werner T. DIALIGN: finding local similarities by multiple sequence alignment. Bioinformatics. 1998;14:290-294.

    16 Ovcharenko I, Loots GG, Hardison RC, Miller W, Stubbs L. zPicture: dynamic alignment and visualization tool for analyzing conservation profiles. Genome Res. 2004;14:472-477.

    17 Schug J, Schuller WP, Kappen C, Salbaum JM, Bucan M, Stoeckert CJ Jr. Promoter features related to tissue specificity as measured by Shannon entropy. Genome Biol. 2005;6:R33.

    18 Schwartz S, Zhang Z, Frazer KA, Smit A, Riemer C, Bouck J, Gibbs R, Hardison R, Miller W. PipMaker--a web server for aligning two genomic DNA sequences. Genome Res. 2000;10:577-586.

    19 Stern HM, Zon LI. Cancer genetics and drug discovery in the zebrafish. Nat Rev Cancer. 2003;3:533-539.

    20 Suzuki Y, Yamashita R, Shirota M, Sakakibara Y, Chiba J, Mizushima-Sugano J, Nakai K, Sugano S. Sequence comparison of human and mouse genes reveals a homologous block structure in the promoter regions. Genome Res. 2004;14:1711-1718.
