Methods for semantic role labeling of Russian texts

A. O. Shelmanov, I. V. Smirnov

Research output: Contribution to journalArticlepeer-review

24 Citations (Scopus)

Abstract

The paper introduces two methods for semantic role labeling of Russian texts. The first method is based on semantic dictionary that contains information about predicates, roles and syntaxeme features that correspond to the roles. It also uses heuristics and integer linear programming to find the best joint assignment of roles. The second method is data-driven semantic-syntactic parsing, which was implemented using MaltParser. It performs transition-based data-driven parsing simultaneously building a syntactic tree and assigning semantic roles. It was trained with various feature sets on SynTagRus Treebank, which was automatically enriched with semantic roles by the dictionary-based parser. We managed to automatically alleviate mistakes in the training corpus using output of the datadriven parser. We evaluated the performance of the parsers on the subcorpus of SynTagRus, which we manually annotated with semantic information. The dictionary-based parser and the data-driven semantic-syntactic parser showed close performance. Although the data-driven parser did not outperform the dictionary-based parser, we expect that it can be beneficial in some cases and has potentials for further improvement.

Original languageEnglish
Pages (from-to)607-619
Number of pages13
JournalKomp'juternaja Lingvistika i Intellektual'nye Tehnologii
Publication statusPublished - 2014
Externally publishedYes

Keywords

  • Datadriven dependency parsing
  • Parser
  • Semantic dictionary
  • Semantic role labeling
  • Semantic-syntactic analysis
  • Semantically annotated corpus

Fingerprint

Dive into the research topics of 'Methods for semantic role labeling of Russian texts'. Together they form a unique fingerprint.

Cite this