RST Discourse Parser for Russian: An Experimental Study of Deep Learning Models

Elena Chistova, Artem Shelmanov, Dina Pisarevskaya, Maria Kobozeva, Vadim Isakov, Alexander Panchenko, Svetlana Toldova, Ivan Smirnov

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution > peer-review

Abstract

This work presents the first fully-fledged discourse parser for Russian based on the Rhetorical Structure Theory of Mann and Thompson (1988). For the segmentation, discourse tree construction, and discourse relation classification we employ deep learning models. With the help of multiple word embedding techniques, the new state of the art for discourse segmentation of Russian texts is achieved. We found that the neural classifiers using contextual word representations outperform previously proposed feature-based models for discourse relation classification. By ensembling both methods, we are able to further improve the performance of the discourse relation classification achieving the new state of the art for Russian.

Original language: English
Title of host publication: Analysis of Images, Social Networks and Texts - 9th International Conference, AIST 2020, Revised Selected Papers
Editors: Wil M. van der Aalst, Vladimir Batagelj, Dmitry I. Ignatov, Michael Khachay, Olessia Koltsova, Andrey Kutuzov, Sergei O. Kuznetsov, Irina A. Lomazova, Natalia Loukachevitch, Amedeo Napoli, Alexander Panchenko, Panos M. Pardalos, Marcello Pelillo, Andrey V. Savchenko, Elena Tutubalina
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 105-119
Number of pages: 15
ISBN (Print): 9783030726096
Publication status: Published - 2021
Event: 9th International Conference on Analysis of Images, Social Networks and Texts, AIST 2020 - Moscow, Russian Federation
Duration: 15 Oct 2020 to 16 Oct 2020

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 12602 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 9th International Conference on Analysis of Images, Social Networks and Texts, AIST 2020
Country/Territory: Russian Federation
City: Moscow
Period: 15/10/20 to 16/10/20

Keywords

  • Deep learning
  • Discourse parsing
  • Pre-trained language models
  • Rhetorical structure theory
