Search results: 1-15 of 20 records found for "computational linguistics corpus". Query time: 0.031 seconds
Large Linguistic Corpus Reduction with SCP Algorithms
Large Linguistic Corpus Reduction SCP Algorithms
2015/9/16
Linguistic corpus design is a critical concern for building rich annotated corpora useful in different domains of applications. For example, speech technologies such as ASR (Automatic Speech Recogniti...
Distributional Memory: A General Framework for Corpus-Based Semantics
Distributional Memory General Framework Corpus-Based Semantics
2015/9/8
Research into corpus-based semantics has focused on the development of ad hoc models that treat single tasks, or sets of closely related tasks, as unrelated challenges to be tackled by extracting diff...
A Flexible, Corpus-Driven Model of Regular and Inverse Selectional Preferences
Flexible Corpus-Driven Model Regular Inverse Selectional Preferences
2015/9/8
We present a vector space–based model for selectional preferences that predicts plausibility scores for argument headwords. It does not require any lexical resources (such as WordNet). It can be train...
CCGbank: A Corpus of CCG Derivations and Dependency Structures Extracted from the Penn Treebank
Dependency Structures Extracted CCG Derivations
2015/9/2
This article presents an algorithm for translating the Penn Treebank into a corpus of Combinatory Categorial Grammar (CCG) derivations augmented with local and long-range word–word dependencies. The ...
The Proposition Bank: An Annotated Corpus of Semantic Roles
Semantic Roles Proposition Bank
2015/8/31
The Proposition Bank project takes a practical approach to semantic representation, adding a layer of predicate-argument information, or semantic role labels, to the syntactic structures of the Penn...
Representing Discourse Coherence: A Corpus-Based Study
Corpus-Based Study Discourse Coherence
2015/8/31
This article aims to present a set of discourse structure relations that are easy to code and to develop criteria for an appropriate data structure for representing these relations. Discourse struct...
CorMet: A Computational, Corpus-Based Conventional Metaphor Extraction System
Corpus-Based Conventional Metaphor System
2015/8/31
CorMet is a corpus-based system for discovering metaphorical mappings between concepts. It does this by finding systematic variations in domain-specific selectional preferences, which are...
The Web, teeming as it is with language data, of all manner of varieties and languages, in vast quantity and freely available, is a fabulous linguists’ playground. This special issue of Computationa...
Parallel corpora have become an essential resource for work in multilingual natural language processing. In this article, we report on our work using the STRAND system for mining parallel text on th...
Pattern Grammar: A Corpus-Driven Approach to the Lexical Grammar of English
Pattern Grammar Corpus-Driven Approach Lexical Grammar of English
2015/8/26
In this book Hunston and Francis describe an approach to lexical and grammatical description that was used to produce two remarkable Collins COBUILD reference works, Grammar Patterns 1: Verbs and Gram...
A Corpus-Based Evaluation of Centering and Pronoun Resolution
Pronoun Resolution Centering
2015/8/26
In this paper we compare pronoun resolution algorithms and introduce a centering algorithm (Left-Right Centering) that adheres to the constraints and rules of centering theory and is an alternative to ...
Using Suffix Arrays to Compute Term Frequency and Document Frequency for All Substrings in a Corpus
Suffix Arrays Compute Term Frequency Document Frequency All Substrings
2015/8/26
Bigrams and trigrams are commonly used in statistical natural language processing; this paper will describe techniques for working with much longer n-grams. Suffix arrays (Manber and Myers 1990) were ...
Domain-Specific Ontology Mapping by Corpus-Based Semantic Similarity
Domain-Specific Ontology Mapping Corpus-Based Semantic Similarity
2015/6/30
Mapping heterogeneous ontologies is usually performed manually by domain experts, or accomplished by computer programs via comparing the structures of the ontologies and the linguistic semantics of th...
Fundamental Frequency Modeling for Corpus-Based Speech Synthesis Based on a Statistical Learning Technique
Fundamental Frequency Modeling Corpus-Based Speech Synthesis A Statistical Learning Technique
2014/11/27
This paper proposes a novel two-layer approach to fundamental frequency modeling for concatenative speech synthesis based on a statistical learning technique called additive models. We define an addit...
The Constituency of Hyperlinks in a Hypertext Corpus.