Search results: 1-15 of 68 records found for "literature evaluation". Query time: 0.295 seconds.
RELPRON: A Relative Clause Evaluation Data Set for Compositional Distributional Semantics
Distributional Semantics assessment
2017/4/6
This article introduces RELPRON, a large data set of subject and object relative clauses, for
the evaluation of methods in compositional distributional semantics. RELPRON targets an
intermediate lev...
Automated essay evaluation and feedback systems: Are they useful for ESL test takers and ESL teachers?
Automated essay evaluation feedback systems ESL test takers ESL teachers
2016/2/25
Automated essay scoring (AES) has become increasingly popular in the last decade with many assessment agencies developing and promoting their automated scoring and automated feedback systems. Ware (20...
System design and evaluation methodologies receive significant attention in natural language
processing (NLP), with the systems typically being evaluated on a common task and against
shared data set...
Graph-Based Word Alignment for Clinical Language Evaluation
Graph-Based Word Alignment Clinical Language Evaluation
2016/2/23
Among the more recent applications for natural language processing algorithms has been the analysis of spoken language data for diagnostic and remedial purposes, fueled by the demand for simple, objec...
Improved Estimation of Entropy for Evaluation of Word Sense Induction
Entropy for Evaluation Word Sense Induction
2015/9/14
Information-theoretic measures are among the most standard techniques for evaluation of clustering methods including word sense induction (WSI) systems. Such measures rely on sample-based estimates of...
A Large-Scale Pseudoword-Based Evaluation Framework for State-of-the-Art Word Sense Disambiguation
Large-Scale Pseudoword State-of-the-Art Word Sense Disambiguation
2015/9/14
The evaluation of several tasks in lexical semantics is often limited by the lack of large amounts of manual annotations, not only for training purposes, but also for testing purposes. Word Sense Disa...
Learning and Evaluation of Dialogue Strategies for New Applications: Empirical Methods for Optimization from Small Data Sets
New Applications Small Data Sets
2015/9/9
We present a new data-driven methodology for simulation-based dialogue strategy learning,
which allows us to address several problems in the field of automatic optimization of dialogue
strateg...
Linguistically Annotated Reordering: Evaluation and Analysis
Linguistically Annotated Reordering:Evaluation Analysis
2015/9/8
Linguistic knowledge plays an important role in phrase movement in statistical machine translation. To efficiently incorporate linguistic knowledge into phrase reordering, we propose a new approach: L...
Constructing Corpora for the Development and Evaluation of Paraphrase Systems
Paraphrase Systems Constructing Corpora
2015/9/6
Automatic paraphrasing is an important component in many natural language processing tasks.
In this article we present a new parallel corpus with paraphrase annotations. We adopt a definition o...
Automatic Evaluation of Information Ordering: Kendall's Tau
Information Ordering Kendall's Tau
2015/9/1
This article considers the automatic evaluation of information ordering, a task underlying many text-based applications such as concept-to-text generation and multidocument summarization. We propose a...
The PARADISE Evaluation Framework: Issues and Findings
Issues and Findings Evaluation Framework
2015/9/1
There has been a great deal of interest over the past 20 years in developing metrics and
frameworks for evaluating and comparing the performance of spoken-language dialogue systems. One of the results ...
Large-Scale Induction and Evaluation of Lexical Resources from the Penn-II and Penn-III Treebanks
Large-Scale Induction Evaluation of Lexical Resources
2015/8/31
We present a methodology for extracting subcategorization frames based on an automatic lexical-functional grammar (LFG) f-structure annotation algorithm for the Penn-II and Penn-III Treebanks. We extra...
A Critique and Improvement of an Evaluation Metric for Text Segmentation
Critique and Improvement Evaluation Metric Text Segmentation
2015/8/27
The Pk evaluation metric, initially proposed by Beeferman, Berger, and Lafferty (1997), is becoming the standard measure for assessing text segmentation algorithms. However, a theoretical analysis of ...
The Need for Accurate Alignment in Natural Language System Evaluation
Natural Language System Evaluation
2015/8/26
As evaluations of computational linguistics technology progress toward higher-level interpretation tasks, the problem of determining alignments between system responses and answer key entries may beco...
A Corpus-Based Evaluation of Centering and Pronoun Resolution
Pronoun Resolution Centering
2015/8/26
In this paper we compare pronoun resolution algorithms and introduce a centering algorithm (Left-Right Centering) that adheres to the constraints and rules of centering theory and is an alternative
to ...