2
0
0
0
0
Discourse processing [electronic resource]
- 作者: Stede, Manfred, 1965-
- 其他題名:
- Synthesis lectures on human language technologies,
- 出版: [San Rafael, Calif.] : Morgan & Claypool Publishers
- 叢書名: Synthesis lectures on human language technologies,#15
- 主題: Discourse analysis , Computational linguistics , text structure , document structure , topic segmentation , coreference , anaphora resolution , coherence relation , discourse parsing
- ISBN: 9781608457359 (ebook.) 、 1608457354 (ebook.) 、 9781608457342 (pbk.) 、 1608457346 (pbk.)
- FIND@SFXID: CGU
- 資料類型: 電子書
- 內容註: Includes bibliographical references (pages 137-154). 1. Introduction. 2. Large discourse units and topics -- 2.1 Genre-induced text structure -- 2.1.1 Logical document structure -- 2.1.2 Content zones -- 2.1.3 Example: scientific papers -- 2.1.4 Example: film reviews -- 2.2 Topic-based segmentation -- 2.2.1 Introduction: "What is this all about?" -- 2.2.2 Exploiting surface cues -- 2.2.3 Lexical chains -- 2.2.4 Word distributions -- 2.2.5 Probabilistic models of segmentation and topics -- 2.2.6 Combining evidence -- 2.3 Summary. 3. Coreference resolution -- 3.1 Reference and coreference: an overview -- 3.2 Corpus annotation -- 3.3 Entity-based local coherence -- 3.4 Determining anaphoricity and familiarity status -- 3.5 Rule-based methods for resolving nominal anaphora -- 3.5.1 Matching proper names -- 3.5.2 Pronoun resolution -- 3.5.3 Resolving definite noun phrases -- 3.5.4 Web-assisted resolution of 'Other'-anaphora -- 3.6 Statistical approaches to coreference resolution -- 3.6.1 Features -- 3.6.2 Mention-pair models -- 3.6.3 Alternative models -- 3.7 Evaluation -- 3.8 Summary. 4. Small discourse units and coherence relations -- 4.1 Coherence relations -- 4.2 Segmentation: finding elementary discourse units -- 4.2.1 Defining EDUs -- 4.2.2 A subproblem: attribution -- 4.2.3 Automatic EDU segmentation -- 4.3 Recognizing coherence relations -- 4.3.1 Connectives: an introduction -- 4.3.2 Identifying connectives -- 4.3.3 Interpreting connectives -- 4.3.4 Detecting implicit coherence relations -- 4.3.5 Finding relations: the problem at large -- 4.4 Coherence-relational text structure -- 4.4.1 Trees -- 4.4.2 Parsing coherence-relational trees -- 4.4.3 Graphs -- 4.5 Summary: guessing or underspecifying? 5. Summary: text structure on multiple interacting levels. A. Sample text -- Bibliography -- Author's biography. Part of: Synthesis digital library of engineering and computer science.
- 摘要註: Discourse Processing here is framed as marking up a text with structural descriptions on several levels, which can serve to support many language-processing or text-mining tasks. We first explore some ways of assigning structure on the document level: the logical document structure as determined by the layout of the text, its genre-specific content structure, and its breakdown into topical segments. Then the focus moves to phenomena of local coherence. We introduce the problem of coreference and look at methods for building chains of coreferring entities in the text. Next, the notion of coherence relation is introduced as the second important factor of local coherence. We study the role of connectives and other means of signaling such relations in text, and then return to the level of larger textual units, where tree or graph structures can be ascribed by recursively assigning coherence relations. Taken together, these descriptions can inform text summarization, information extraction, discourse-aware sentiment analysis, question answering, and the like.
-
讀者標籤:
- 系統號: 005119128 | 機讀編目格式