簡易檢索 / 詳目顯示

研究生: 盧世凱
Shih-Kai Lu
論文名稱: 運用關係標記與子句邊界偵測作為解決機器翻譯重排序問題關鍵之研究
A Study of the Relational Marking and Clause Boundary Detection as the Keys to Solving Reordering Problems in Machine Translation Systems
指導教授: 陳献忠
Shian-Jung Chen
口試委員: 王世平
Shih-Ping Wang
高照明
Zhao-Ming Gao
學位類別: 碩士
Master
系所名稱: 人文社會學院 - 應用外語系
Department of Applied Foreign Languages
論文出版年: 2012
畢業學年度: 100
語文別: 英文
論文頁數: 224
中文關鍵詞: 機器翻譯字詞結構重排序關係標誌符號子句邊界界定自然語言剖析表關係的文法功能字名物化動詞英文造句法格位格框語法語意表達語意為本的翻譯
外文關鍵詞: machine language (MT), reordering, relational marker, clause boundary detection (CBD), language parsing, relational function word, meaning-based translation, case frame building, nominalized event, meaning representation, sentence construction principles
相關次數: 點閱:312下載:7
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報

翻譯過程中不變的東西才是兩個語言溝通的關鍵。翻譯前後,字詞格式與結構通常不該堅持沿用或延續,應該要保留的是訊息或語意。語意基本上可區分為指稱、界定與關係三種:指稱語意指的是文本所指涉的人事時地物,界定語意指的是文本所指涉的人事時地物的界定範圍,關係語意指的是文本所指涉的人事時地物之間的各種關係。
本研究證明目前機器翻譯研究與開發的主要瓶頸在於字詞結構的重排序,並且找出解決該問題的關鍵。本研究首先使用自己設計的翻譯人工評量方法,根據語意翻譯原則,利用人事時地物指稱必須一直形成一體的概念,分析重排序所造成的問題,並追究造成問題的原因。
本研究將證明字詞結構重排序的在於關係語意,指稱人事時地物之間的關係決定字詞的前後排序。英文表關係的文法功能字、名物化動詞、動詞的字尾、以及事件的格框等乃是表達關係語意的標誌符號。本研究使用擅長利用上下文義的語言剖析器找出文句中的指稱以及指稱間的關係,研究者將進一步示範本文的分析方法以及剖析器的分析結果對譯後修訂非常有幫助。本研究的最終目標在點出解決翻譯字詞結構重排序問題所需的關係標誌符號,可以用翻譯前階段工程的自然語言剖析器分析取得並支援。


Meaning-based human translation insists on the preservation of meaning rather than form and structure. Its advocates believe that meaning or message stays the same before and after translation, and there are essentially three aspects of meaning, referential meaning to stand for entities and events (together as referents), specificational meaning for the specification of referents, and relational meaning for how referents are related or linked. They believe that TL (target language) generation should base reordering on relational meaning, namely on how entities or events are related, which can only get clues from SL (Source Language) analysis, by extracting relations between referents.
This study starts with a manual translation evaluation method that is better equipped to diagnose reordering errors and show those errors are made because pieces of information buried in relational function words, case frame building, nominalized events, verbs ending with –ed and –ing, and sentence construction are ignored or missed. Further investigation would reveal how those relational markers can be found by a language parser that is capable of doing NP finding, PP attachment, Infinitival attachment, scope of coordination resolution, clause boundary detection, and other kinds of relation finding. The study would then end up with some sort of post-editing that makes use of relational marking to perform reordering.

Table of Contents List of Tables and Figures vii Abbreviation ix 中文摘要 i ABSTRACT ii Chapter One INTRODUCTION 1 1.1 BACKGROUND OF THE STUDY 2 1.2 RULE-BASED APPROACHES OR STATISTICAL APPROACHES 3 1.3 WHY SMT IS GOOD AT WSD 3 1.4 WHY SMT IS NOT GOOD AT GETTING THE WORD ORDER RIGHT 4 1.5 SL RELATIONAL MARKERS AND TL RELATIONAL MARKERS 5 1.6 MEANING-BASED TRANSLATION AND MEANING REPRESENTATION 5 1.7 USE OF LINGUISTIC KNOWLEDGE IN SMT 6 1.8 SIGNIFICANCE OF THE STUDY 6 1.9 PURPOSE OF THE STUDY 6 1.10 RESEARCH QUESTIONS 7 1.11 ORGANIZATION OF THE STUDY 7 Chapter Two LITERATURE REVIEW 9 2.1 MACHINE TRANSLATION 10 2.2 MT HISTORY AND DIFFERENT APPROACHES 10 2.3 RULE-BASED APPROACHES 11 2.4 CORPUS-BASED APPROACHES 13 2.4.1 Statistical MT 14 2.4.2 Phrase-based Translation 14 2.5 MT EVALUATION 14 2.6 AUTOMATIC MT EVALUATION 15 2.6.1 BLEU 15 2.6.2 NIST 15 2.6.3 WER and PER 16 2.7 MT PROBLEMS 16 2.7.1 New Challenges 16 2.7.2 Lexical Ambiguity and Word Sense Disambiguation (WSD) 17 2.7.3 Reordering 17 2.8 MT REORDERING RESEARCH 18 2.9 MEANING-BASED TRANSLATION AND MEANING REPRESENTATION 20 2.10 CONTENT WORDS AND FUNCTION WORDS 21 2.11 SENTENCE CONSTRUCTION PRINCIPLES 21 2.12 CASE GRAMMAR 23 2.13 CLAUSE BOUNDARY DETECTION 23 2.14 LANGUAGE PARSING 24 2.15 CONTEXT GRAMMAR PARSER 24 2.16 SUMMARY 26 Chapter Three RESEARCH METHODOLOGY 28 3.1 A LINGUISTICS APPROACH TO TRANSLATION 28 3.2 THREE ASPECTS OF MEANING 29 3.3 RELATIONAL MEANING AND CONSTITUENT ORDER 30 3.4 CASE FRAME FOR AN EVENT AND THEIR ROLE IN REORDERING 31 3.5 NOMINALIZED EVENTS AND REORDERING 32 3.6 SENTENCE CONSTRUCTION PRINCIPLES AND THE PROBLEMS OF MT REORDERING 32 3.7 DIFFERENT LEVELS OF REORDERING -- WORD SWAPPING, 32 PHRASE REORDERING AND CLAUSE REORDERING 32 3.8 CONSTITUENCY, CLAUSE BOUNDARY DETECTION, AND RELATION FINDING 37 3.9 RELATIONAL MEANING AND LANGUAGE SPECIFIC WORD ORDER 37 3.10 INTERFACE BETWEEN SYNTAX AND SEMANTICS 38 3.11 ROOTS OF MT REORDERING PROBLEMS WHEN ENGLISH IS THE SOURCE LANGUAGE 39 3.12 RELATIONAL MARKING AND REORDERING 40 3.13 BUILDING OF A LANGUAGE PARSER THAT CAN UNEARTH RELATIONAL MARKERS 42 3.14 DIFFERENT KINDS OF AMBIGUITY 45 3.14.1 POS Ambiguity 45 3.14.2 Inseparable POS Tagging is Embedded in the Parser 45 3.14.3 Words that are Categorically more Ambiguous 45 3.14.4 Between Prepositions and Particles 46 3.14.5 Between Prepositions and Subordinate Conjunctions 46 3.14.6 Between Relative Pronouns and Demonstrative Pronouns or other Similar Complementizers 47 3.14.7 Between Nouns and Verbs 47 3.14.8 Between Adjectives or Adverbs and other Parts of Speech 48 3.14.9 Between Function Words and other Parts of Speech 48 3.14.10 Words ending with –ed or –ing 48 3.14.11 Another Reminder of the Significance of POS Problems 49 3.14.12 Form, Distribution, and Function for POS Disambiguation 49 3.15 CONSTITUENCY AND DIFFERENT KINDS OF BOUNDARY DETECTION 50 3.15.1 Where Each Type of Phrase Begins and Ends 51 3.15.2 Where Each Type of Clause Begins and Ends 52 3.15.3 Case Role Assignment and Building of Case Frames for Clause Boundary Detection 52 3.16 CONSTITUENCY AND CONTEXT GRAMMAR PARSER NETWORKS 53 3.17 EVENT-CENTERED PROCESSING, CASE FRAME BUILDING AND RELATION FINDING 67 3.18 ENGLISH SENTENCE CONSTRUCTION PRINCIPLES AS THE LINKS BETWEEN STRUCTURAL RELATIONS AND CONCEPTUAL RELATIONS 68 3.19 NP FINDING, CLAUSE BOUNDARY DETECTION AND CASE FRAME BUILDING 70 3.20 USING OF RELATIONAL MARKERS TO DO POST-EDITING 75 Chapter Four REORDERING ERRORS IN MT AND THE ORIGIN OF THE PROBLEM 77 4.1 CONTENT WORDS, FUNCTION WORDS AND TRANSLATION EVALUATION 77 4.2 WORD BY WORD EVALUATION: 80 4.3 SUMMARY OF WORD-BASED EVALUATION 136 4.4 CONSTITUENT-BASED EVALUATION 137 4.5 EVENTS, ATTACHMENT, NOMINALIZATION, AND SENTENCE CONSTRUCTION 162 4.6 MT EVALUATION AND POST-EDITING 162 4.7 SUMMARY 164 Chapter Five PARSER OUTPUT AND DISCUSSION 166 5.1 PARSE RESULTS AND DISCUSSION 166 5.2 SUMMARY 198 Chapter Six CONCLUSION 199 6.1 RESEARCH QUESTIONS REVISITED 199 6.2 FINAL REMARK AND FUTURE RESEARCH 200 Reference 202   List of Tables and Figures TABLES: TABLE 3-1: CONSTRUCTION PRINCIPLE 69 TABLE 3-2: NP FINDING 71 TABLE 3-3: CLAUSE BOUNDARY DETECTION AND CASE FRAME BUILDING 74 TABLE 3-4: CLAUSE BOUNDARY DETECTION AND CASE FRAME BUILDING 75 TABLE 4-1: EVALUATION OF EXAMPLE 1 79 TABLE 4-2: EVALUATION OF EXAMPLE 2 82 TABLE 4-3: EVALUATION OF EXAMPLE 3 84 TABLE 4-4: EVALUATION OF EXAMPLE 4 86 TABLE 4-5: EVALUATION OF EXAMPLE 5 88 TABLE 4-6: EVALUATION OF EXAMPLE 6 91 TABLE 4-7: EVALUATION OF EXAMPLE 7 94 TABLE 4-8: EVALUATION OF EXAMPLE 8 96 TABLE 4-9: EVALUATION OF EXAMPLE 9 99 TABLE 4-10: EVALUATION OF EXAMPLE 10 103 TABLE 4-11: EVALUATION OF EXAMPLE 11 109 TABLE 4-12: EVALUATION OF EXAMPLE 12 111 TABLE 4-13: EVALUATION OF EXAMPLE 13 113 TABLE 4-14: EVALUATION OF EXAMPLE 14 115 TABLE 4-15: EVALUATION OF EXAMPLE 15 117 TABLE 4-16: EVALUATION OF EXAMPLE 16 119 TABLE 4-17: EVALUATION OF EXAMPLE 17 121 TABLE 4-18: EVALUATION OF EXAMPLE 18 122 TABLE 4-19: EVALUATION OF EXAMPLE 19 124 TABLE 4-20: EVALUATION OF EXAMPLE 20 126 TABLE 4-21: EVALUATION OF EXAMPLE 21 128 TABLE 4-22: EVALUATION OF EXAMPLE 22 129 TABLE 4-23: EVALUATION OF EXAMPLE 23 131 TABLE 4-24: EVALUATION OF EXAMPLE 24 133 TABLE 4-25: EVALUATION OF EXAMPLE 25 135 TABLE 4-26: EVALUATION RESULTS 136 FIGURES: FIGURE 2-1 VAUQUOIS TRIANGLE 12 FIGURE 3-1 SCREEN SHOT 61 FIGURE 5-1 PARSING EXAMPLE 1 167 FIGURE 5-2 PARSING EXAMPLE 2 168 FIGURE 5-3 PARSING EXAMPLE 3 169 FIGURE 5-4 PARSING EXAMPLE 4 170 FIGURE 5-5 PARSING EXAMPLE 5 170 FIGURE 5-6 PARSING EXAMPLE 6 171 FIGURE 5-7 PARSING EXAMPLE 7 173 FIGURE 5-8 PARSING EXAMPLE 8 174 FIGURE 5-9 PARSING EXAMPLE 9 176 FIGURE 5-10 PARSING EXAMPLE 10 177 FIGURE 5-11 PARSING EXAMPLE 11 180 FIGURE 5-12 PARSING EXAMPLE 12 183 FIGURE 5-13 PARSING EXAMPLE 13 184 FIGURE 5-14 PARSING EXAMPLE 14 185 FIGURE 5-15 PARSING EXAMPLE 15 186 FIGURE 5-16 PARSING EXAMPLE 16 188 FIGURE 5-17 PARSING EXAMPLE 17 189 FIGURE 5-18 PARSING EXAMPLE 18 190 FIGURE 5-19 PARSING EXAMPLE 19 191 FIGURE 5-20 PARSING EXAMPLE 20 192 FIGURE 5-21 PARSING EXAMPLE 21 193 FIGURE 5-22 PARSING EXAMPLE 22 194 FIGURE 5-23 PARSING EXAMPLE 23 195 FIGURE 5-24 PARSING EXAMPLE 24 196 FIGURE 5-25 PARSING EXAMPLE 25 197

Reference

Bach, Nguyen , Qin Gao, & Stephan Vogel (2009) Source-side dependency tree reordering models and subtree movements and constraints. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp. 17-24.
Badr, Ibrahim , Rabih Zbib, & James Glass (2009) Syntactic phrase reordering for English-to-Arabic statistical machine translation. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.86-93
Banerjee, Satanjeev & Alon Lavie: METEOR: an automatic metric for MT evaluation with improved correlation with human judgments. ACL-2005: Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization, University of Michigan, Ann Arbor, 29 June 2005; pp. 65-72
Birch, Alexandra & Miles Osborne (2011): Reordering metrics for MT. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland , Oregon , June 19-24, 2011; pp.1027-1035.
Braune, Fabienne, Anita Gojun, & Alexander Fraser (2012) Long-distance reordering during search for hierarchical phrase-based SMT. EAMT 2012: Proceedings of the 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, May 28-30 2012, ed. Mauro Cettolo, Marcello Federico, Lucia Specia, Andy Way; pp.177-184
Brown P, Della Pietra S, Della Pietra V & Mercer R (1993).‘The mathematics of machine translation: parameter estimation.’ Computational Linguistics 19(2), 263–312.
Cao, Hailong & Eiichiro Sumita (2010) Filtering syntactic constraints for statistical machine translation. ACL 2010: the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, July 11-16, 2010: Proceedings of the Conference Short Papers; pp.17-21.
Chang, Pi-Chuan, Huihsin Tseng, Dan Jurafsky, & Christopher D. Manning (2009) Discriminative reordering with Chinese grammatical features. Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.51-59.
Chen Shian-Jung (1996), Analysis of Chinese for Chinese-English machine translation, PhD dissertation, Georgetown University, Washington D.C., USA
Chen, Shian-Jung (1997) 翻譯課程的理論基礎與設計(Theory and Curriculum Design of the Teaching of Translation), 2nd Consotium of the Teaching of Translation and Oral Interpretation, Taipei (in Chinese)
Chen Shian-Jung (1997), 翻譯課程的理論基礎與設計(Theory and Curriculum Design of the Teaching of Translation), 2nd Consotium of the Teaching of Translation and Oral Interpretation, Taipei, 1997 (in Chinese)
Chen Shian-Jung (1998), 中英機器與人工翻譯的語意表達(Meaning Representation in Machine Translation and in Human Translation), 1st International Conference of Translation, Taipei, 1998 (in Chinese)
Chen Shian-jung (2010) , Linguistic relativity revisited, 2010年跨文化研究國際研討會論文集(Nov. 2011), 輔仁大學
Chen, Shian-jung & Chen-yin Wang (2011), How should function words such as English prepositions be learned or translated? 2011 British Association for Applied Linguistics Conference at UWE Bristol, September 2011
Chen Shian-Jung & Don Loritz (2005), Context Grammar and POS Tagging, 2nd Midwest Computational Linguistics Colloquium, 2005
Chen, Shian-jung & Lu, Shih-Kai (2012) Clause boundary detection and relational marking for MT reordering, 2012 International Conference on Applied Linguistics & Language Teaching (ALLT)
Chiang, David, Yuval Marton, & Philip Resnik (2008) Online large-margin training of syntactic and structural translation features. EMNLP 2008: Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.224-233.
Chomsky, Noam (1965). Aspects of the Theory of Syntax. MIT Press.
Cook, Walter A., SJ (1989) Case Grammar Theory. Washington, DC: Georgetown University Press.
Costa-jussà, Marta R. & José A.R. Fonollosa (2007) Analysis of statistical and morphological classes to generate weighted reordering hypotheses on a statistical machine translation system. ACL 2007: proceedings of the Second Workshop on Statistical Machine Translation, June 23, 2007, Prague, Czech Republic; pp. 171-176
Costa-jussà, Marta R. & José A.R.Fonollosa (2008): Computing multiple weighted reordering hypotheses for a statistical machine translation phrase-based system. AMTA-2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, Waikiki, Hawai’i, 21-25 October 2008; pp.82-88.
Du, Jinhua & Andy Way (2010) A discriminative latent variable-based “DE” classifier for Chinese-English SMT. Coling 2010: 23rd International Conference on Computational Linguistics. Proceedings of the conference, 23-27 August 2010, Beijing International Convention Center, Beijing, China; pp.286-294.
Elming, Jakob & Nizar Habash (2009) Syntactic reordering for English-Arabic phrase-based machine translation. Proceedings of the EACL 2009 Workhop on Computational Approaches to Semitic Languages, Athens, Greece, 31 March 2009; pp.69-77.
Federico, Marcello, Arianna Bisazza & Christian Hardmeier (2011) Morphological processing and word reordering for statistical MT of highly inflected languages. Machine Translation and Morphologically- rich Languages: Research Workshop of the Israel Science Foundation, University of Haifa, Israel, 24 January, 2011
Fillmore, Charles (1968) "The Case for Case" In Bach and Harms (Ed.): Universals in Linguistic Theory. New York: Holt, Rinehart, and Winston, 1-88.
Gale WA & Church K W (1991). ‘A program for aligning sentences in bilingual corpora.’ In Proceedings of the 29th annual meeting of the Association for Computational Linguistics (ACL). Berkeley, California.
Galley, Michel, Mark Hopkins, Kevin Knight & Daniel Marcu (2004) What’s in a translation rule? HLT-NAACL 2004: Human Language Technology conference and North American Chapter of the Association for Computational Linguistics annual meeting, May 2-7, 2004, The Park Plaza Hotel, Boston, USA; pp.273-280.
Greenberg J H (1963). ‘Some universals of grammar with particular reference to the order of meaningful elements.’ In Greenberg J H (ed.) Universals of language. Cambridge: MIT Press.
Hashimoto, Kei, Hirofumi Yamamoto, Hideo Okuma, Eiichiro Sumita, & Keiichi Tokuda (2009) Reordering model using syntactic information of a source tree for statistical machine translation. Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.69-77.
Hoang, Hieu & Philipp Koehn (2009) Improving mid-range reordering using templates of factors. EACL-2009: Proceedings of the 12th Conference of the European Chapter of the ACL, Athens, Greece, 30 March – 3 April 2009; pp.372-379.
Holmqvist, Maria, Sara Stymne & Lars Ahrenberg (2011) Experiments with word alignment, normalization and clause reordering for SMT between English and German, Proceedings of the 6th Workshop on Statistical Machine Translation, pages 393–398, Edinburgh, Scotland, UK, July 30–31, 2011.
Howlett, Susan & Mark Dras (2011) Clause restructuring for SMT not absolutely helpful . ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Short papers, Portland , Oregon , June 19-24, 2011; pp.384-388.
Isabelle, P. & G. Foster (2006). Machine Translation: Overview, Encyclopedia of Language & Linguistics, 2nd. Ed. Amsterdam: Elsevier Press. 404-422.
Jiang, Jie, Jinhua Du, & Andy Way (2010) Improved phrase-based SMT with syntactic reordering patterns learned from lattice scoring. AMTA 2010: the Ninth conference of the Association for Machine Translation in the Americas, Denver, Colorado, October 31 – November 4, 2010; 10pp.
Kashioka, Hideki, Takehiko Maruyama, & Hideki Tanaka (2003) Building a parallel corpus for monologues with clause alignment MT Summit IX, New Orleans, USA, 23-27 September 2003; pp.216-223.
Khalilov, Maxim, José A.R.Fonollosa, & Mark Dras (2009) A new subtree-transfer approach to syntax-based reordering for statistical machine translation. EAMT-2009: Proceedings of the 13th Annual Conference of the European Association for Machine Translation, ed. Lluís Màrquez and Harold Somers, 14-15 May 2009, Universitat Politècnica de Catalunya, Barcelona, Spain; pp.197-204.
Koehn P, Och F J & Marcu D (2003). ‘Statistical phrase-based translation.’ In Hovy E (ed.) Proceedings of the human language technology conference of the North American chapter of the Association for Computational Linguistics. Edmonton, Alberta, Canada. 27–133.
Liu, Zhanyi, Haifeng Wang, Hua Wu, Ting Liu, & Sheng Li (2011) Reordering with source language collocations. ACL-HLT 2011: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics, Portland, Oregon, June 19-24, 2011; pp.1036-1044.
Loritz, Don (1992) Generalized transition network parsing for language study: the GPARS systems for English, Russian, Japanese and Chinese, CALICO Journal, Volume 10 Number 1
Lytinen, Steven (1993), [Review] of Machine translation: a knowledge-based approach [by] Sergei Nirenburg et al. (Morgan Kaufmann, 1992); and of: The KBMT project: a case study in knowledge-based machine translation, [ed.by] Kenneth Goodman and Sergei Nirenburg (Morgan Kaufmann, 1991). Computational Linguistics 19 (1), pp. 207-209
Madnani, N. (2011). "iBLEU: Interactively Scoring and Debugging Statistical Machine Translation Systems" in "Proceedings of the Fifth IEEE International Conference on Semantic Computing (Demos), Palo Alto, CA" pp. 213-214
Na, Hwidong, Jin-Ji Li, Jungi Kim, & Jong-Hyeok Lee (2009) Improving fluency by reordering target constituents using MST parser in English-to-Japanese phrase-based SMT. MT Summit XII: proceedings of the twelfth Machine Translation Summit, August 26-30, 2009, Ottawa, Ontario, Canada; pp.276-283.
Nagata, Masaaki, Kuniko Saito, Kazuhide Yamamoto, & Kazuteru Ohashi (2006) A clustered global phrase reordering model for statistical machine translation. Coling-ACL 2006: Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics, Sydney, 17-21 July 2006; pp.713-720.
Nakazawa, Toshiaki & Sadao Kurohashi (2009) Statistical phrase alignment model using dependency relation probability. Proceedings of SSST-3: Third Workshop on Syntax and Structure in Statistical Translation, Boulder, Colorado, 5 June 2009; pp.10-18.
Nida, Eugene (1964), Toward a science of translating, Brill
Nida, Eugene (2004) “Principles of Correspondence”. The Translation Studies Reader. Ed. Lawrence Venuti. London: Routledge, 2004.
Niehues, Jan & Muntsin Kolss (2009) A POS-based model for long-range reorderings in SMT. Proceedings of the Fourth Workshop on Statistical Machine Translation, Athens, Greece, 30 March – 31 March 2009; pp.206-214.
Nikoulina, Vassilina & Marc Dymetman (2008) Experiments in discriminating phrase-based translations on the basis of syntactic coupling features. Second ACL Workshop on Syntax and Structure in Statistical Translation (ACL-08 SSST-2), Proceedings, 20 June 2008, Columbus, Ohio, USA; pp.55-60.
NIST (2005), NIST 2005 Machine Translation Evaluation Official Results
Och F & Ney H (2004). ‘The alignment template approach to statistical machine translation.’ Computational Linguistics 30(4).
Quirk, Randolph, Sidney Greenbaum, Geoffrey Leech, and Jan Svartvik (1985). A comprehensive grammar of the English language, Longman
Ramanathan, Ananthakrishnan & Karthik Visweswariah (2012) A study of word-classing for MT reordering. LREC 2012: Eighth international conference on Language Resources and Evaluation, 21-27 May 2012, Istanbul, Turkey; pp.3971-3976
Ramanathan, Ananthakrishnan, Pushpak Bhattacharyya, Karthik Visweswariah, Kushal Ladha, & Ankur Ga ndhe (2011) Clause-based reordering constraints to improve statistical machine translation. [IJCNLP 2011] Proceedings of the 5th International Joint Conference on Natural Language Processing, Chiang Mai , Thailand , November 8-13, 2011; pp.1351-1355.
Resnik, Philip (2008) Review of Treebanks: Building and Using Parsed Corpora, Language 83(4), 2007, pp. 876-879.
Rottmann, Kay & Stephan Vogel (2007) Word reordering in statistical machine translation with a POS-based distortion model. TMI-2007: Proceedings of the 11th International Conference on Theoretical and Methodological Issues in Machine Translation, Skövde [Sweden], 7-9 September 2007; pp.171-180
Setiawan, Hendra , Chris Dyer, & Philip Resnik (2010) Discriminative word alignment with a function word reordering model. [EMNLP 2010] Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, MIT, Massachusetts, USA, 9-11 October 2010; pp.534-544.
Setiawan, Hendra , Min-Yen Kan, & Haizhou Li (2007) Ordering phrases with function words. ACL 2007: proceedings of the 45th Annual Meeting of the Association for Computational Linguistics, Prague, Czech Republic, June 2007; pp. 712-719
Setiawan, Hendra, Min-Yen Kan, Haizhou Li, & Philip Resnik (2009) Topological ordering of function words in hierarchical phrase-based translation. [ACL-IJCNLP-2009] Proceedings of the 47th Annual Meeting of the ACL and the 4th IJCNLP, Suntec, Singapore, 2-7 August 2009; pp.324-332
Siewierska, A (2006) Word Order and Linearization. Encyclopedia of Language & Linguistics, 2nd. Ed. Amsterdam: Elsevier Press. 642-649.
Simard M, Foster G F & Isabelle P (1992). ‘Using cognates to align sentences in bilingual corpora.’ In Proceedings of the 4th conference on theoretical and methodological issues in machine translation (TMI). Montre´al, Que´bec.
Stymne, Sara (2012) Clustered word classes for preordering in statistical machine translation. EACL Joint Workshop on Unsupervised and Semi-Supervised Learning in NLP, 23-27 April 2012, Avignon, France; pp. 28-34.
Su, Keh-Yih & Jing-Shin Chang (1992) Why corpus-based statistics-oriented machine translation, Fourth International Conference on Theoretical and Methodological Issues in Machine Translation (TMI-92), Empiricist vs. rationalist methods in MT, June 25-27, 1992, Montreal, CCRIT-CWARC; pp.249-262.
Sudoh, Katsuhito, Kevin Duh, Hajime Tsukada, Tsutomu Hirao, & Masaaki Nagata (2010) Divide and translate: improving long distance reordering in statistical machine translation. ACL 2010: Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR. Proceedings of the workshop, 15-16 July 2010, Uppsala University, Uppsala, Sweden; pp. 418-427.
Vauquois B (1968). ‘A survey of formal grammars and algorithms for recognition and transformation in machine translation.’ In IFIP, Congress-68. Edinburgh. 254–260
Wasson, Mark, Don Loritz, Shian-jung Chen, et al. (2005) System and Method for Extracting Information from Text Using Text Annotation and Fact Extraction, US Patent US7912705, 19 Jan 2010
White, John S. (1995) Approaches to black box MT evaluation. MT Summit V Proceedings, Luxembourg , July 10-13, 1995; 10pp.
Woods, W. A. (1970) Transition Network Grammar for Natural Language Analysis, Communications of the ACM, Vol. 13, No. 10
Xia, Fei & Michael McCord (2004) Improving a statistical MT system with automatically learned rewrite patterns. Coling 2004: 20th International Conference on Computational Linguistics, 23-27 August 2004, University of Geneva, Switzerland, Proceedings; 7pp.
Xiang, Bing, Niyu Ge, & Abraham Ittycheriah (2011) Improving reordering for statistical machine translation with smoothed priors and syntactic features. Proceedings of SSST-5, Fifth Workshop on Syntax, Semantics and Structure in Statistical Translation, ACL HLT 2011, Portland, Oregon, USA, June 2011; pp.61-69
Xu, Peng, Jaeho Kang, Michael Ringgard, & Franz Och (2009) Using a dependency parser to improve SMT for subject-object-verb languages. NAACL HLT 2009. Human Language Technologies: the 2009 annual conference of the North American Chapter of the ACL, Boulder, Colorado, May 31 - June 5, 2009; pp.245-253.
Zhang, Dongdong, Mu Li, Chi-Ho Li, & Ming Zhou (2007) Phrase reordering model integrating syntactic knowledge for SMT. EMNLP-CoNLL-2007: Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, June 28-30, 2007, Prague, Czech Republic; pp. 533-540.
Zhang, Min & Haizhou Li (2009) Tree kernel-based SVM with structured syntactic knowledge for BTG-based phrase reordering. EMNLP-2009: proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, Singapore, 6-7 August 2009; pp.698-707.
Zhang, Yuqi, Richard Zens, & Hermann Ney (2007) Chunk-level reordering of source language sentences with automatically learned rules for statistical machine translation. SSST, NAACL-HLT-2007 AMTA Workshop on Syntax and Structure in Statistical Translation, 26 April 2007, Rochester, NY; pp.1-8
Zhao, Bing & Yaser Al-Onaizan (2008) Generalizing local and non-local word-reordering patterns for syntax-based machine translation. EMNLP 2008: Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing, 25-27 October 2008, Honolulu, Hawaii, USA; pp.572-581.
Zwarts, Simon & Mark Dras (2007) Syntax-based word reordering in phrase-based statistical machine translation: why does it work? MT Summit XI, 10-14 September 2007, Copenhagen , Denmark . Proceedings; pp.559-566
Reference

QR CODE