簡易檢索 / 詳目顯示

研究生: 楊勝源
Sheng-Yuan Yang
論文名稱: 新一代智慧型網路資訊系統FAQ-master
Development of FAQ-master as a New Intelligent Web Information System
指導教授: 何正信
Cheng-Seen Ho
李漢銘
Hahn-Ming Lee
口試委員: 許清琦
Ching-Chi Hsu
蘇豐文
Von-Wen Soo
李錫智
Shie-Jue Lee
許鈞南
Chun-Nan Hsu
陳錫明
Shyi-Ming Chen
學位類別: 博士
Doctor
系所名稱: 電資學院 - 電子工程系
Department of Electronic and Computer Engineering
論文出版年: 2005
畢業學年度: 94
語文別: 英文
論文頁數: 121
中文關鍵詞: 資訊整合網路資訊系統代取代理人答覆代理人搜尋代理人介面代理人網站塑模使用者塑模本體論
外文關鍵詞: Information Integration, Web Information systems, Interface Agents, Proxy Agents, Answerer Agents, Search Agents, User Modeling, Website Modeling, Ontology
相關次數: 點閱:235下載:23
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 本論文闡述我們發展智慧型網路資訊系統FAQ-master的成果。FAQ-master具有從浩瀚網際中,有效發現、擷取、過濾、代取、排序與呈現高品質資訊,來滿足使用者資訊需求的能力。所謂高品質資訊的意義就是具深度、最新且貼近於使用者問題的解答。論文中探討下列問題:如何忠實且傳神地擷取使用者的意圖、如何有效地發現與整合鬆散無特定結構的網路資訊、如何呈現給使用者相關的查詢結果、以及如何提供有效的代取機制縮短系統的回應時間。提出的技術包括:本體論、使用者模式、網站模式、以及資料整合與代取機制。本論文並勾勒出FAQ-master的四個主要組件,亦即介面代理人、代取代理人、答覆代理人與搜尋代理人的系統架構,祈能從使用者意圖、網頁文件處理與網站搜尋等三個觀點,有效地改善網際網路搜尋的成果。
    介面代理人扮演使用者與系統間的溝通者,來抓取使用者真正的查詢意圖。在使用者塑模、樣板及本體論支援下,本代理人提供增強式型樣與樣板比對之自然語言查詢、人機溝通之協助與引導、以及較佳的個人化資訊服務,亦能處理使用者針對提供解答的回饋。代取代理人則扮演介面代理人與後端答覆代理人間之兩階段中介者,引進本體論增強之智慧式代取機制,可有效降低後端伺服器資料庫的擷取負擔。答覆代理人負責清理、擷取與轉換來自不同網站的資訊,並存成本體論主導的資料庫。本代理人引進包裝器技術,將搜尋代理人收集的網頁資訊在系統後端進行本體論主導的資訊匯集。最後,搜尋代理人藉由本體論支援的網站模式,執行使用者導向與領域相關兼顧的網路資訊擷取。這種語意層次解答的做法,使得搜尋代理人能提供具使用者高滿意度之特定領域聚焦的網路資訊探索。
    本系統的成果之一為發展介面代理人的本體論支援與樣板為主之使用者塑模及查詢處理的技術。我們初步的實驗結果顯示,近八成的使用者問句可由本系統正確地辨識出使用者的查詢意圖與焦點。此外,實驗也驗證了樣式匹配技術的完整性,對於了解使用者問句的意圖與焦點相當有效。
    本系統的成果之二為發展代取代理人的查詢預測技術。本代理人的特色為(1)利用完美雜湊與資料庫分解之改良式的循序型樣採掘技術,自使用者查詢歷史紀錄中,挖掘出使用者查詢行為,進行使用者導向之快速採掘及預測;(2)利用PC本體論中的VRelationships,進行本體論主導之改良式案例式推理。實驗顯示,本系統大約可舒緩後端答覆代理人約70 %的工作負擔,很明顯的改善了整體的查詢效能。
    本系統的成果之三為發展答覆代理人的組織與處理鬆散無特定結構之網路資訊技術。本代理人引進本體論支援的包裝器技術,對來自異質環境下的FAQ資訊進行清理、擷取與轉換,並儲存在一個依知識本體結構建構的整合式資料庫;利用本體論支援去除FAQ雜訊、不一致、或互衝的情形,採用全部關鍵詞包含式或部分關鍵詞包含式的方式,擷取出更多可供回覆的FAQ。為呈現最正確有效的解答,本代理人引入豐富的排名指標技術,包括:出現率、滿意值、相容值與相似值,來強化查詢結果的排名次序,以提供使用者更佳的呈現結果。經實驗顯示,本代理人的確能提升查詢解答的精確值,並呈現較佳的排名效果。
    本系統的成果之四為發展搜尋代理人的使用者導向與領域聚焦兼顧之網路搜尋技術。引進本體論支援的網站模式提供搜尋引擎具語意層次的解答,藉以產出快速、精準、穩定與高滿意度的搜尋結果。由於網站模式與領域本體論的關聯密切,在網站模式的建構與應用上能支援包括:查詢擴展、網頁註解、網頁與網站分類、以及兼顧領域相關與使用者興趣的網路資源收集。本代理人的特色為(1)本體論支援的網站模式建構:提出將領域語意引進網路資源蒐集與儲存的觀念,重要的成果是一個能精準且穩定分類網頁,並支援正確語意註解之新的本體論分類器OntoClassifier,經實驗顯示,本分類器確能獲致滿意的網頁分類結果;(2)網站模式支援的網路資源探索:兼顧了使用者興趣與領域特殊性,成果之一為引進具使用者查詢驅動之網頁擴展、自主式網站擴展、以及深度開發查詢結果等革新策略,來有效擴展網站模式之新的Focused Crawler;(3)網站模式支援的網頁擷取:揭露出以本體論特徵值當作快速索引架構,來定位出符合使用者需求網頁的功效。


    This thesis describes the result of our research in developing FAQ-master as an intelligent Web information system. The system is developed to perform intelligent discovery, retrieval, filtering, proxy, ranking and presentation of Web information to provide high-quality FAQ solutions to meet user information request. By a high quality answer we mean an answer that is profound, up-to-date, and relevant to the user’s question. We summarized problems into: how to faithfully capture user intention, how to effectively discover and aggregate Web information, how to present the relevant result to the user, and how to provide efficient proxy mechanism to help speed up the turn around time. We propose the following techniques to tackle the above issues: ontology, user models, website models, and data aggregation and proxy mechanisms. Based upon the techniques, FAQ-master was developed to contain four agents, namely, Interface Agent, Proxy Agent, Answerer Agent, and Search Agent, which can effectively and efficiently improve the search result from the following three aspects of the Web search activity, namely, user intention, document processing, and website search.
    The Interface Agent was developed to work as an assistant between the user and FAQ system for capturing true user’s intention. Based on user modeling, template-based and ontology-supported techniques, the agent can support natural language query, enhanced by the pattern-match and template-based technique; assistance and guidance for human-machine interaction; and better personalized information services. It also handles user feedback on the suitability of the proposed responses. The Proxy Agent was developed to work as a two-tier mediator between the Interface Agent and backend Answerer Agent. It employs an ontology-enhanced intelligent proxy mechanism to effectively alleviate the overloading problem usually associated with a backend server. The Answerer Agent was developed to help clean, retrieve, and transform FAQ information collected from a heterogeneous environment, such as the Web, and stores it in an ontological database. It works as a back end process to perform ontology-directed information aggregation, supported by the wrapper technique, from the webpages collected by the Search Agent. Finally, the Search Agent was developed to work as an both user-oriented and domain-related Web information retrieval with the help of ontology-supported website models. This approach provides a semantic level solution for the Search Agent so that it can provide domain-specific, focused Web information discovery toward a high degree of user satisfaction.
    Our first contribution is on the techniques of user modeling and query processing involved in the development of Interface Agent, which features ontology-supported, template-based user modeling technique and query processing. Our preliminary experimentation demonstrates that user intention and focus of up to eighty percent of the user queries can be correctly understood by the system. In addition, from the experiments we verify the robustness of the linguistic pattern match technique by demonstrating its effectiveness in analyzing users’ query intention and focus.
    The second contribution is on the techniques of query prediction in Proxy Agent. The agent features following interesting points. First, it performs fast user-oriented mining and prediction by discovering frequent queries and predicted queries from user query history. The improved sequential pattern mining algorithm is made more efficient by the techniques of perfect hashing and database decomposition. Second, it performs ontology-directed case-based reasoning. The semantics of PC ontology, in particular the VRelationships, are used in determining similar cases, performing case adaptation, and case retaining. Our experiments show that the agent can share up to 70% of the query loading from the backend process, which helps a lot on the overall query performance.
    The third contribution is on the techniques of organizing and processing unstructured Web information in Answerer Agent. The agent employs ontology as the key technique, supported by the wrapper techniques to help clean, retrieve, and transform unstructured FAQ information collected from a heterogeneous environment, and stores it in an ontological database, which reflects the ontological structure. When it comes to the retrieval of FAQs, the agent trims irrelevant query keywords, employs either full keywords or partial keywords to retrieve FAQs, and removes conflicting FAQs before turning the final results to the user, all of which are supported by ontology. In addition, to producing a more effective presentation of the search results, the agent employs an enhanced ranking technique, which includes Appearance Probability, Satisfaction Value, Compatibility Value, and Statistic Similarity Value as four measures with proper weights to rank the FAQs. Our experiments show the Agent does improve the precision rate and produces better ranking results.
    The final contribution is on the techniques of reflecting both user-oriented and domain-focused aspects in web search in Search Agent. The agent features an ontology-supported website modeling technique to provide a semantic level solution for a search engine so that it can provide fast, precise and stable search results with a high degree of user satisfaction. The website modeling technique closely connected to the domain ontology, which supports the following functions in both website model construction and application: query expansion, webpage annotation, webpage/website classification, and focused collection of domain-related and user-interested Web resources. The agent features the following interesting characteristics. 1) Ontology-supported construction of website models. By this, we attribute domain semantics into the Web resources collected and stored in the local database. One important contribution here is the new Ontology-supported OntoClassifier which can do very accurate and stable classification on webpages to support more correct annotation of domain semantics. Our experiments show that Ontoclassifier performs very well in obtaining accurate and stable webpages classification. 2) Website models-supported web resource discovery. By this, we take into account both user interests and domain specificity. The contribution here is the new Focused Crawler which employs progressive strategies to do user query-driven webpage expansion, autonomous website expansion, and query results exploitation to effectively expand the website models. 3) Website models-supported Webpage Retrieval. By this, we leverage the power of ontology features as a fast index structure to locate most-wanted webpages for the user.

    TABLE OF CONTENTS ABSTRACT (IN CHINESE) i ABSTRACT (IN ENGLISH) v ACKNOWLEDGEMENT (IN CHINESE) ix TABLE OF CONTENTS xi LIST OF TABLES xv LIST OF FIGURES xvii CHAPTER 1 INTRODUCTION 1 1.1 BACKGROUND 1 1.2 MOTIVATION 4 1.3 PROBLEM SPECIFICATION 5 1.4 PROPOSED SOLUTIONS 6 1.5 APPLICATION DOMAIN 10 1.6 ORGANIZATION OF THE THESIS 10 CHAPTER 2 RELATED WORKS 13 2.1 RELATED WORKS ABOUT INTERFACING SYSTEMS 13 2.2 RELATED WORKS ABOUT QUERY PREDICTION AND CACHE 14 2.3 RELATED WORKS ABOUT ANSWERING SYSTEMS 17 2.4 RELATED WORKS ABOUT SEARCHING SYSTEMS 18 CHAPTER 3 SYSTEM ARCHITECTURE AND FUNDAMENTAL TECHNIQUES 21 3.1 DESIGN PHILOSOPHY 21 3.2 SYSTEM ARCHITECTURE 23 3.3 DOMAIN ONTOLOGY AS THE DOWN-TO-THE-EARTH SEMANTICS 25 3.4 QUERY TEMPLATE-BASED NATURAL LANGUAGE PROCESSING IN INTERFACE AGENT 27 3.5 ONTOLOGY FORMATION FOR SEARCH AGENT 31 CHAPTER 4 INTERFACE AGENT 35 4.1 USER MODELING 35 4.2 SYSTEM ARCHITECTURE 37 4.2.1 INTERACTION AGENT 39 4.2.2 QUERY PARSER 40 4.2.3 WEB PAGE PROCESSOR 43 4.2.4 SCORER 44 4.2.5 PERSONALIZER 44 4.2.6 USER MODEL MANAGER 44 4.2.7 RECOMMENDER 46 CHAPTER 5 PROXY AGENT 49 5.1 TWO-TIER PROXY DESIGN 49 5.2 FIRST-TIER PROXY: QUERY PREDICTION AND QUERY CACHE 51 5.2.1 FULL-SCAN-WITH-PHP ALGORITHM 52 5.2.2 CONSTRUCTION OF QUERY POOL 53 5.2.3 FIRST-TIER SOLUTION SERVICE 55 5.3 SECOND-TIER PROXY: CASE-BASED REASONING 56 5.3.1 CASE RETRIEVAL 57 5.3.2 CASE REUSE 58 5.3.3 CASE ADAPTATION 58 5.3.4 CASE MAINTENANCE 60 5.3.4.1 SOLUTION CASES 60 5.3.4.2 REFERENCE CASES 61 5.3.4.3 ADAPTED CASES 62 5.3.4.4 NEW SOLUTION CASES FROM THE BACKEND PROCESS 62 CHAPTER 6 ANSWERER AGENT 63 6.1 SYSTEM ARCHITECTURE 63 6.1.1 ONTOLOGY-DIRECTED FAQ STORAGE 64 6.1.2 ONTOLOGY-SUPPORTED WEBPAGE WRAPPING 64 6.2 ONTOLOGY-SUPPORTED FAQ RETRIEVAL 65 6.2.1 RANKING METHOD FOR FULL KEYWORDS MATCH 66 6.2.2 RANKING METHOD FOR PARTIAL KEYWORDS MATCH 68 6.3 WEIGHT SYSTEM OF MEASURES 69 CHAPTER 7 PROXY AGENT 71 7.1 WEBSITE MODEL AND CONSTRUCTION 71 7.1.1 WEBSITE MODEL 71 7.1.2 WEBSITE MODELING 74 7.2 WEBSITE MODELS APPLICATION 77 7.2.1 FOCUSED WEB CRAWLING SUPPORTED BY WEBSITE MODELS 78 7.2.1.1 USER-ORIENTED WEB SEARCH SUPPORTED BY WEBSITE MODELS 80 7.2.1.2 DOMAIN-ORIENTED WEB SEARCH SUPPORTED BY WEBSITE MODELS 82 7.2.2 WEBPAGE RETRIEVAL FROM WEBSITE MODELS 86 7.3 OPERATION OF THE SEARCH AGENT 89 CHAPTER 8 SYSTEM EVALUATION 93 8.1 EXPERIMENT SETTING 93 8.2 PERFORMANCE EVALUATION OF ONTOCLASSIFIER 94 8.3 PERFORMANCE EVALUATION OF ONTOLOGY-SUPPORTED HTML WRAPPER 97 8.4 PERFORMANCE EVALUATION OF TWO-TIER PROXY 98 8.5 PERFORMANCE EVALUATION OF QUERY PARSER 99 CHAPTER 9 CONCLUSIONS AND FUTURE RESEARCH 101 9.1 SUMMARY 101 9.2 CONTRIBUTIONS 102 9.3 DISCUSSIONS AND FUTURE RESEARCH 104 REFERENCES 109 APPENDIX A PART OF INTENTION WORD BASE 118 APPENDIX B PART OF INTENTION CLASSES AND PATTERN MATCH 119

    References
    [Abas00] J.M. Abasolo and M. Gómez, “MELISA: An Ontology-Based Agent for Information Retrieval in Medicine,” Available at http://www.ics.forth.gr/proj/isst/SemWeb/./proceedings/session3-1/paper.pdf, 2000.
    [Adam97] P. Adam, “The Search is over,” Available at http://www.zdnet.com/pccomp/features/fea1096/sub2.html, 1997.
    [Agra94] R. Agrawal and R. Srikant, “Fast Algorithm for Mining Association Rules,” Proc. of the 20th International Conference on Very Large Databases (VIDB’94), Santiago de Chile, Chile, 1994, 487-499.
    [Agra95] R. Agrawal and R. Srikant, “Mining Sequential Patterns,” Proc. of the 11th International Conference on Data Engineering, Taipei, Taiwan, 1995, 3-14.
    [Arna00] L.H. Arnaud, L.H. Philippe, W. Lauren, N. Gavin, R. Jonathan, C. Mike, and B. Steve, “Document Object Model (DOM) Level 2 Core Specification,” Available at http://www.w3.org/TR/DOM-Level-2-Core/, 2000.
    [Aroc98] G. Arocena and A. Mendelzon, “Viewing Web Information Systems as Database Application,” Communications of the ACM, 41(7), 1998, 101-102.
    [Asni97] F.A. Asnicar and C. Tasso, “ifWeb: A Prototype of User Model-Based Intelligent Agent for Document Filtering and Navigation in World Wide Web,” Proc. of the 6th International Conference on User Modeling (UM-97), Chia Laguna, Sardinia, Italy, 1997, 3-12.
    [Boni03] D. Bonino, F. Corno, and G. Squillero, “A Real-Time Evolutionary Algorithm for Web Prediction,” Proc. of the 2003 IEEE/WIC International Conference on Web Intelligence (WI’03), Halifax, Canada, 2003, 139-145.
    [Bota02] F. Bota, F. Corno, L. Farinetti, and G. Squillero, “A Transparent Search Agent for Closed Collections,” International Conference on Advances in Infrastructure for e-Business, e-Education, e-Service, and e-Medicine on the Internet, L'Aquila, Italy, 2002, 205-210.
    [Bric00] D. Brickley and R.V. Guha, “Resource Description Framework (RDF) Schema Specification,” Available at http://www.w3.org/TR/2000/CR-rdf-schema-20000327, 2000.
    [Burk97] R. Burke, K. Hammond, V. Kulyukin, S. Lytinen, N. Tomuro, and S. Schoenberg, “Natural language Processing in the FAQFinder System: Results and Prospects,” AAAI Spring Symposium on Natural Language Processing for the World Wide Web, Stanford, CA, USA, 1997, 17-26.
    [Carl98] F.E. Carlos, G.D. Joseph and K.G. Aditya, “Database Querying on the WWW: UniGuide-An Object-Relational Search Engine for Australian Universities,” Available at http://budhi.uow.edu.au/postgrad/carlos/tr98_5_101.htm, Tech. Rep. TR-1998-1, Dept. of Business Systems, University of Wollongong, Wollongong, Australia, 1998.
    [Caru94] R. Caruana and D. Freitag, “Greedy Attribute Selection,” Proc. of the 11th International Conference on Machine Learning, Rutgers University, New Brunswick, 1994, 28-36.
    [Chan98] B. Chandrasekaran, J.R. Josephson and V.R. Benjamins, “Ontology of Tasks and Methods,” Proc. of the 13th European Conference on Artificial Intelligence (ECAI’98), Brighton, England, 1998, 32-44.
    [Chan99] B. Chandrasekaran, J.R. Josephson, and V.R. Benjamins, “What Are Ontologies, and Why Do We Need Them?” IEEE Intelligent Systems, 14(1), Special Issue on Ontologies, 1999, 20-26, Also available at http://dlib.computer.org/ex/books/ex1999/pdf/x1020.pdf.
    [Chan99a] C.H. Chang and C.C. Hsu, “Enabling concept-based relevance feedback on World Wide Web,” IEEE Transactions on Knowledge and Data Engineering, Special Issue on Web Technologies, 11(4), 1999, 595-609.
    [Chen01] Y.J. Chen and V.W. Soo, “Ontology-Based Information Gathering Agents,” The 2001 International Conference on Web Intelligence (WI-01), Maebashi TERRSA, Japan, 2001, 423-427.
    [Chen98] M.S. Chen, J.S. Park, and P.S. Yu, “Efficient Data Mining for Path Traversal Patterns,” IEEE Transactions on Knowledge and Data Engineering, 10(2), 1998, 209-221.
    [Chiu03] Y.H. Chiu, An Interface Agent with Ontology-Supported User Models, Master Thesis, Dept. of Electronic Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan, 2003.
    [Chu01] Y.C. Chu, Ontology-Based Solution Integration for Internet FAQ Systems, Mater Thesis, Dept. of Electronic Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan, 2001.
    [Chua03] F.C. Chuang, Ontology-Supported and Ranking Technique-Enhanced FAQ Systems, Master Thesis, Dept. of Computer Science and Information Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan, 2003.
    [Crav98] M. Craven, D. DiPasquo, D. Freitag, A. McCallum, T. Mitchell, K. Nigram and S. Slattery, “Learning to Extract Symbolic Knowledge from the World Wide Web,” Proc. of the 15th National Conference on Artificial Intelligence (AAAI-98), Madison, Wisconsin, 1998, 509-516.
    [DAML01] DAML, http://www.daml.org/, 2001.
    [Desi97] DesingLab at University of Kansas, “ProFusion meta-search,” Available at http://www.designlab.ukans.edu/profusion/, 1997.
    [Díaz02] B. Díaz-Agudo and P.A. Gonzlez-Calero, “CBROnto: A Task/Method Ontology For CBR,” Proc. of the 15th Florida Artificial Intelligence Research Society Conference (FLAIRS’02), Pensacola, Florida, USA, 2002, 101-105.
    [DOIL01] DAML+OIL, http://www.daml.org/2001/03/daml+oil-index/, 2001.
    [DOM00] W3C, “Document Object Model (DOM) Level 2 Core Specification,” at http://www.w3.org/TR/2000/REC-DOM-Level-2-Core-20001113/, 2000.
    [Drei96] D. Dreilinger and A.E. Howe, “Experiences with Selecting Search Engines Using Meta-Search,” Available at http://www.media.mit.edu/~daniel, 1996.
    [Duin00] A.J. Duineveld, R. Stoter, M.R. Weiden, B. Kenepa and V.R. Benjamins, “WonderTools? A Comparative Study of Ontological Engineering Tools,” International Journal of Human-Computer Studies, 52(6), 2000, 1111-1133.
    [Eich98] D. Eichmann, “Automated Categorization of Web Resources,” Available at http://www.iastate.edu/~CYBERSTACKS/Aristotle.htm, 1998.
    [Fens98] D. Fensel, M. Erdmann, and R. Studer, “OntoBroker: How to Make the WWW Intelligent,” Proc. of the 11th Banff Knowledge Acquisition for Knowledge-Based Systems, Workshop (KAW-98), Banff, Canada, April 1998, Also available at http://ksi.cpsc.ucalgary.ca/KAW/KAW98/KAW98Proc.html.
    [Fern98] M. Fernandez, D. Florescu, J. Kang, A. Levy, and D. Suciu, “Catching the Boat with Strudel: Experience with a Website management System,” Proc. of the ACM SIGMOD Conference on Management of Data, Seattle, WA, 1998, 414-425.
    [Fink96] J. Fink, A. Kobsa, and A. Nill, “User-oriented Adaptivity and Adaptability in the AVANTI Project,” Proc. of the International Conference on Design for the Web: Empirical Studies, Microsoft Usability Group, Redmond, WA, 1996, Also available at http://www.microsoft.com/usability/webconf.htm.
    [Flor98] D. Florescu, A. Levy, and A. Mendelzon, “Database Techniques for the World-Wide Web: A Survey,” Sigmod Records, 27(3), 1998, 59-74.
    [Fowl99] J. Fowler, B. Perry, M. Nodine and B. Bargmeyer, “Agent-Based Semantic Interoperability in InfoSleuth,” SIGMOD Record, 28(1), 1999, 60-67, Also available at http://www.mcc.com/projects/infosleuth/publications/index.html.
    [Fraw92] W. Frawley, G. Picatetsky-Shapiro and C. Matheus, “Knowledge Discovery in Database: An Overview,” AI Magazine, 13(3), 1992, 57-70.
    [Grav94] L. Gravano, H. García-Molina, and A. Tomasic, “Precision and Recall of GIOSS Estimators for Database Discovery,” Proc. of the 3rd International Conference on Parallel and Distributed Information Systems (PDIS’94), Austin, Texas, USA, 1994, 103-106.
    [Grub93] T.R. Gruber, “A Translation Approach to Portable Ontology Specifications,” Journal of Knowledge Acquisition, 5, 1993, 199-220.
    [Grub93a] T.R. Gruber, “Toward Principles for the Design of Ontologies Used for Knowledge Sharing,” Proc. of International Workshop on Formal Ontology in Conceptual Analysis and Knowledge Representation, Padova, Italy, 1993, Also available at http://ksl-web.stanford.edu/KSL_Abstracts/KSL-93-04.html.
    [Guar98] N. Guarino, “Formal Ontology and Information System,” Proc. of FOIS'98, Trento, Italy, Amsterdam, IOS Press, 1998, 3-15, Also available at ftp://ftp.ksl.stanford.edu/pub/KSL_Reports/KSL-92-71.ps.
    [Guar99] N. Guarino, C. Masolo and G. Vetere, “OntoSeek: Content-Based Access to the Web,” IEEE Intelligent Systems, 14(3), 1999, 70-80.
    [Hefl99] J. Heflin, J. Hendler, and S. Luke, “Applying Ontology to the Web: A Case Study,” Proc. of the 5th International Work-Conference on Artificial and Natural Neural Networks (IWANN’99), Alicante, Spain, 1999, 715-724.
    [Hend01] J. Hendler, “Agents and the Semantic Web,” IEEE Intelligent Systems, 16(2), 2001, 30-37.
    [Henr01] S.T. Henry, B. David, M. Murray and M. Noah, “XML Base,” Available at http://www.w3.org/TR/2001/REC-xmlbase-20010627/, 2001.
    [Ho00] C.S. Ho, Research on FAQ-master as an Intelligent High-Quality Information Integration Agent for PCDIY, Tech. Rep. NSC-89-2213-E-011-059, National Science Council, Taipei, Taiwan, 2000.
    [Ho01] C.S. Ho, An Intelligent Web Information Integration System Based upon Intelligent Retrieval, Filtering, and Integration, Tech. Rep. NSC-89-2218-E-011-014, National Science Council, Taipei, Taiwan, 2001.
    [Hovy01] E. Hovy, L. Gerber, U. Hermjakob, C.Y. Lin, and D. Ravichandran, “Toward Semantics-Based Answer Pinpointing,” Proc. of the DARPA Human Language Technology Conference (HLT), San Diego, 2001, 69-76, Also available at http://www.isi.edu/~ravichan/papers/01hlt.pdf.
    [Hovy02] E. Hovy, U. Hermjakob, and D. Ravichandran, “A Question/Answer Typology with Surface Text Patterns,” Proc. of the DARPA Human Language Technology conference (HLT), 2002, 247-250, Also available at http://www.isi.edu/natural-language/projects/webclopedia/pubs/02hlt.pdf.
    [Hsu99] C.C. Hsu and C.S. Ho, “Acquiring Patient Data by an Intelligent Interface Agent with Medicine-Related Common Sense Reasoning,” Expert Systems with Applications: An International Journal, 17(4), 1999, 257-274.
    [Iwaz96] M. Iwazume, H. Takeda and T. Nishida, “Ontology-Based Information Gathering and Categorization from the Internet,” Proc. of the 9th International Conference on Industrial and Engineering Applications of Artificial Intelligence and Expert Systems (IEA/AIE'96), Fukuoka, Japan, 1996, 305-314.
    [Joac96] T. Joachims, “A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization,” Proc. of the 14th International Conference on Machine Learning (ICML-97), 1996, 143-151, Also available at ftp://reprots.adm.cs.cmu.edu/usr/anon/1996/CMU-CS-96-118.ps.
    [Joac97] T. Joachims, D. Freitag, and T. Mitchell, “WebWatcher: A Tour Guide for the World Wide Web,” Proc. of the 1997 International Joint Conference on Artificial Intelligence (IJCAI’97), Nagoya, Japan, 1997, 770-775.
    [Kers01] L. Kerschberg, W. Kim and A. Scime, “A Semantic Taxonomy-Based Personalizable Meta-Search Agent,” Available at http://www-dbs.cs.uni-sb.de/lehre/ws01_02/proseminarliteratur/websifterii.pdf, 2001.
    [Kers01a] L. Kerschberg, W. Kim and A. Scime, “WebSifter II: A Personalizable Meta-Search Agent Based on Weighted Semantic Taxonomy Tree,” Proc. of the International Conference on Internet Computing (IC’2001), Las Vegas, Nevada, 2001, 14-20.
    [Lau99] T. Lau and E. Horvitz, “Patterns of Search: Analyzing and Modeling Web Query Refinement,” Proc. of the 7th International Conference on User Modeling (UM’99), Banff Centre, Banff, Canada, 1999, 119-128.
    [Lawr98] S. Lawrence and L. Giles, “Context and Page Analysis for Improved Web Search,” IEEE Internet Computing, 2(4), 1998, 38-46.
    [Lee00] C.L. Lee, Intention Extraction and Semantic Matching for Internet FAQ Retrieval, Master Thesis, Dept. of Computer Science and Information Engineering, National Cheng Kung University, Tainan, Taiwan, 2000.
    [Lee00a] K.S. Lee, J.H. Oh, J.X. Huang, J.H. Kim, and K.S. Choi, “TREC-9 Experiments at KAIST: QA, CLIR and Batch Filtering,” Proc. of the TREC-9 Conference, NIST, Gaithersburg, MD, 2000, 303-316, Also available at http://trec.nist.gov/pubs/trec9/papers/kaist-trec9-qa-filtering.pdf.
    [Levy00] A. Y. Levy and D. S. Weld, “Intelligent Internet Systems,” Artificial Intelligence, 118, 2000, 1-14.
    [Liao03] P.C. Liao, An Intelligent Proxy Agent for FAQ, Master Thesis, Dept. of Electronic Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan, 2003.
    [Lieb95] H. Lieberman, “Letizia: An Agent that Assist Web Browsing,” Proc. of the 1995 International Joint Conference on Artificial Intelligence (IJCAI’95), Montreal, Canada, 1995, 924-929.
    [Lin99] C.Y. I. Lin and C.S. Ho, “A Generic-Ontology-Based Approach for Requirement Analysis and its Application in Network Management Software,” Artificial Intelligence for Engineering Design, Analysis and Manufacturing, 13(1), 1999, 37-61.
    [Liou00] J.C. Liou, An Interface Agent Empowered by Ontology-Supported Use-Models, Master Thesis, Dept. of Electronic Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan, 2000.
    [Liu01] James N.K. Liu and Tommy T.S. Leung, “A Web-based CBR Agent for Financial Forecasting,” Workshop 5 Soft Computing in Case-Based Reasoning, Vancouver, BC, Canada, 2001, Also available at http://www.aic.nrl.navy.mil/papers/2001/AIC-01-003/ws5/ws5toc11.pdf.
    [Lou02] W.W. Lou and H.J. Lu, “Efficient Prediction of Web Accesses on a Proxy Server,” Proc. of the 11th International Conference on Information and Knowledge Management (CIKM’02), McLean, Virginia, USA, 2002, 169-176.
    [Lyti02] S. Lytinen and N. Tomuro, “The Use of Question Types to Match Questions in FAQFinder,” AAAI Spring Symposium on Mining Answers from Texts and Knowledge Bases, Stanford, CA, USA, 2002, 46-53.
    [Mano98] F. Manola, “Towards a Web Object Model,” Available at http://op3.oceanpark.com/papers/wom.html, 1998.
    [Mend99] M. Mendonca and N.L. Sunderhaft, “Mining Software Engineering Data: A Survey,” A DACS (Data & Analysis Center for Software) State-of-the-Art Report, 1999.
    [Mlad99] D. Mladenic, “Machine Learning Used by Personal WebWatcher,” Proc. of ACAI-99 Workshop on Machine Learning and Intelligent Agents, Chania, Crete, 1999, Also available at http://www.cs.cmu.edu/~TextLearning/pww/pww.html.
    [Mock96] K.J. Mock, Intelligent Information Filtering via Hybrid Techniques: Hill Climbing, Case-Based Reasoning, Index Patterns, and Generic Algorithms, Ph.D. Dissertation, Dept. Computer Science, University of California, Davis, 1996.
    [Mouk96] A. Moukas, “Amalthea: Information Discovery and Filtering Using a Multiagent Evolving Ecosystem,” Proc. of the 1st International Conference on the Practical Applications of Intelligent Agents and MultiAgent Technology (PAAM), London, April 1996 (Available at http://moux.www.media.mit.edu/people/moux/papers/PAAM96/)
    [Müll99] M. Müller, “An Intelligent Multi-Agent Architecture for Information Retrieval from the Internet,” Technical report, U. of Osnabrüeck, Germany, Available at http://mir.cl-ki,uni-osnabrueck.de/~martin/index.publications.2.html, 1999.
    [Noy01] N.F. Noy and D.L. McGuinness, “Ontology Development 101: A Guide to Creating Your First Ontology,” Stanford Knowledge Systems Laboratory Tech. Rep. KSL-01-05 and Stanford Medical Informatics Tech. Rep. SMI-2001-0880, Also available at http://www.ksl.stanford.edu/people/dlm/papers/ontology-tutorial-noy-mcguinness.pdf, 2001.
    [Noy97] N.J. Noy and C.D. Hafner, “The State of the Art in Ontology Design,” AI Magazine, 18(3), 53-74.
    [OIL00] OIL, http://www.ontoknowledge.org/oil/, 2000.
    [OuYa00] Y.L. OuYang, Study and Implementation of A Dialogued-Based Query System for Telecommunication FAQ Services, Master Thesis, Dept. of Computer and Information Science, National Chiao Tung University, HsinChu, Taiwan, 2000.
    [Özel01] S.A. Özel and H.A. Güvenir, “An Algorithm for Mining Association Rules Using Perfect Hashing and Database Pruning,” Proc. of the 10th Turkish Symposium on Artificial Intelligence and Neural Networks (TAINN’01), North Cyprus, 2001, 257-264.
    [Pazz96] M. Pazzani, J. Muramatsu, and D. Billsus, “Syskill&Webert: Identifying Interesting Web Sites,” Proc. of the 13th National Conference on Artificial Intelligence and 8th Innovative Applications of Artificial Intelligence Conference (AAAI 96, IAAI 96), Portland, OR, 1996, 54-61.
    [Piro99] P. Pirolli and J. Pitkow, “Distributions of Surfers’ Paths through the World Wide Web: Empirical Characterization,” World Wide Web, 2(1-2), 1999, 29-45.
    [Plun99] T.K. Plunkett and D. Thompson, “Intelligent Web Search Agents,” Available at http://csgrad.cs.vt.edu/~tplunkett/, 1999.
    [Pohl99] W. Pohl and A. Nick, “Machine Learning and Knowledge Representation in the LaboUr Approach to User Modeling,” Proc. of the 7th International Conference on User Modeling, Banff, Canada, 1999, 179-188, Also available at http://citeseer.nj.nec.com/pohl99machine.html.
    [RDF00] D. Brickley and R. V. Guha, “Resource Description Framework (RDF) Schema Specification,” Available at http://www.w3.org/TR/2000/CR-rdf-schema-20000327, 2000.
    [Rich79] E. Rich, “User Modeling via Stereotypes,” Cognitive Science, 3, 1979, 329-354.
    [Rugg02] S. Ruggieri, “Efficient C4.5,” IEEE Transactions on Knowledge and Data Engineering, 14(2), 2002, 438-444.
    [Salt75] G., Salton, A. Wong, and C.S. Yang, “A Vector Space Model for Automatic Indexing,” Communications of ACM, 18(11), 1975, 613-620.
    [Salt83] G. Salton and M.J. McGill, Introduction to Modern Information Retrieval. McGraw-Hill Book Company, New York, USA, 1983.
    [Salt88] G. Salton and C. Buckley, “Term Weighting Approaches in Automatic Text Retrieval,” Information Processing and Management, 24(5), 1988, 513-523.
    [Selb95] E. Selberg and O. Etzioni, “Multi-service Search and Comparison Using the MetaCrawler,” Proc. of the 4th International World Wide Web Conference, Boston, MA, USA, 1995, 169-173.
    [Shoh95] B. Marko, Y. Shoham and Y. Yeogirl, “An Adaptive Agent for Automated Web Browsing,” Stanford University Digital Library Project Working Paper, SIDL-WP-1995-0023, Available at http://www-diglib.stanford.edu/cgi-bin/WP/get/SIDL-WP-1995-0023, 1995.
    [Shor95] R. Shortland and R. Scarfe, “Digging for Gold,” IEE Review, 41, 1995, 213-217.
    [Silv02] I.G.L da Silva, B.P. Amorim, P.G. Campos, and L.M. Brasil, “Integration of Data Mining and Hybrid Expert System,” Proc. of the 15th Florida Artificial Intelligence Research Society Conference (FLAIRS’02), Pensacola, Florida, USA, 2002, 267-271.
    [Simo97] J. Simons, “Using a Semantic User Model to Filter the World Wide Web Proactively,” The 6th International Conference on User Modeling (UM’97), Chia Laguna, Sardinia, Italy, 1997, 455-456.
    [Snei99] E. Sneiders, “Automated FAQ Answering: Continued Experience with Shallow Language Understanding,” AAAI Fall Symposium on Question Answering Systems, Tech. Rep. FS-99-02, North Falmouth, Massachusetts, USA, AAAI Press, 1999, 97-107, Also available at http://www.dsv.su.se/~eriks/Sneiders1999.pdf.
    [Soub01] M.M. Soubbotin and S.M. Soubbotin, “Patterns of Potential Answer Expressions as Clues to the Right Answer,” Proc. of the TREC-10 Conference, NIST, Gaithersburg, MD, 2001, 293-302, Also available at http://trec.nist.gov/pubs/trec10/papers/insight_trec10.pdf.
    [Su00] Z. Su, Q. Yang, Y. Lu, and H. Zhang, “WhatNext: A Prediction System for Web Requests Using N-gram Sequence Models,” Proc. of the 1st International Conference on Web Information System and Engineering Conference (WISE’00), Hong Kong, China, 2000, 200-207.
    [Susa99] G. Susan, H. Winston and L. Sean, “ProFusion: Intelligent Meta-Search,” Available at http://www.ittc.ku.edu/~sgauch/profusion.html, 1999.
    [Thom96] C.G. Thomas and G. Fischer, “Using Agents to Improve the Usability and Usefulness of the World-Wide Web,” Proc. of the 5th International Conference in User Modeling (UM’96), West Newton, 1996, 5-12.
    [Ting00] Y. Ting, A Search Agent with Website Models, Master Thesis, Dept. of Electronic Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan, 2000.
    [Tsai02] C.F. Tsai, H.C. Wu, and C.W. Tsai, “A new Data Clustering Approach for Data Mining in Large Databases,” International Symposium on Parallel Architectures, Algorithms, and Networks (ISPAN’02), Makati City, Metro Manila, Philippines, 2002, 278-283.
    [Tsai96] C.H. Tsai, “MMSEG: A Word Identification System for Mandarin Chinese Text Based on Two Variants of the Maximum Matching Algorithm,” Available at http://www.geocities.com/hao510/mmseg/, 1996.
    [Van02] H.L. Van and A. Trentini, “FAQshare: a frequently asked questions voting system as a collaboration and evaluation tool in teaching activities,” Proc. of the 14th International Conference on Software Engineering and Knowledge Engineering, Ischia, Italy, 2002, 557-560.
    [Wang01] B. Wang, H. Xu, Z. Yang, Y. Liu, X. Cheng, D. Bu, and S. Bai, “TREC-10 Experiments at CAS-ICT: Filtering, Web, and QA,” Proc. of the TREC-10 Conference, NIST, MD, 2001, 109-121, Also available at http://trec.nist.gov/pubs/trec10/papers/CASICT.pdf.
    [Wang03] C.M. Wang, Web Search with Ontology-Supported Technology, Master Thesis, Dept. of Computer Science and Information Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan, 2003.
    [Weib99] S. Weibel, “The State of the Dublin Core Metadata Initiative,” D-Lib Magazine, 5(4), 1999, Also Available at http://mirrored.ukoln.ac.uk/lisjournals/dlib/dlib/dlib/april99/04weibel.html.
    [Wu01] J.W. Wu, Using Rule Mining and Behavior Prediction Techniques in Webpages Query Processing, Master Thesis, Dept. of Computer Science and Information Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan, 2001.
    [XML01] W3C, “XML Base,” at http://www.w3.org/TR/2001/REC-xmlbase-20010627/, 2001.
    [Yang01] Q. Yang, H.H. Zhang, and T. Li, “Mining Web Logs for Prediction Models in WWW Caching and Prefetching,” Proc. of the 7th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD’01), San Francisco, California, USA, 2001, 473-478.
    [Yang01a] S.Y. Yang, J.W. Wu, and C.S. Ho, “Using Rule Mining and Behavior Prediction Techniques in Webpages Query Processing,” Proc. of the 6th Conference on Artificial Intelligence and Applications (TAAI'01), Kaohsiung, Taiwan, 2001, 574-579.
    [Yang01b] S.Y. Yang, Y.J. Chu, and C.S. Ho, “Ontology-Based Solution Integration for Internet FAQ Systems,” Proc. of the 6th Conference on Artificial Intelligence and Applications (TAAI'01), Kaohsiung, Taiwan, 2001, 52-57.
    [Yang03] S.Y. Yang and C.S. Ho, “A Website-Model-Supported New Search Agent,” The 2nd International Workshop on Mobile Systems, E-Commerce, and Agent Technology (MSEAT’03), Miami, Florida, USA, 2003, 563-568.
    [Yang03a] S.Y. Yang, C.M. Wang, and C.S. Ho, “How Do Ontology-Supported Website Models Help Web Search?” Submitted to International Journal on Artificial Intelligence Communications, 2003.
    [Yang04] S.Y. Yang, F.C. Chuang, and C.S. Ho, “An FAQ System Empowered by Ontology and Improved Ranking Techniques,” Submitted to International Journal of Intelligent Information Systems, 2004.
    [Yang04a] S.Y. Yang, P.C. Liao, and C.S. Ho, “An Intelligent Two-Tier Proxy Agent for FAQ Service,” Submitted to International Journal on Data and Knowledge Engineering, 2004.
    [Yang04b] S.Y. Yang, Y.H. Chiu, and C.S. Ho, “Ontology-Supported and Query Template-Based User Modeling in Interface Agents,” Submitted to International Journal of Human-Computer studies, 2004.
    [Yang99] S.Y. Yang and C.S. Ho, “Ontology-Supported User Models for Interface Agents,” Proc. of the 4th Conference on Artificial Intelligence and Applications (TAAI'99), ChangHua, Taiwan, 1999, 248-253.
    [Zhan01] H. Zhang, “Improving Performance On WWW Using Path-Based Predictive Caching And Prefetching,” Available at http://citeseer.nj.nec.com/459947.html, 2001.
    [Zuck99] I. Zuckerman, D.W. Albrecht, and A.E. Nicholson, “Predicting User’s Request on the WWW,” Proc. of the 7th International Conference on User Modeling (UM’99), Banff Centre, Banff, Canada, 1999, 275-284.

    QR CODE