RST Discourse Parser for Russian: An Experimental Study of Deep Learning Models

Elena Chistova, Artem Shelmanov, Dina Pisarevskaya, Maria Kobozeva, Vadim Isakov, Alexander Panchenko, Svetlana Toldova, Ivan Smirnov

Research output: Chapter in book, report, or conference proceedings › Conference contribution › peer-reviewed

Abstract

This work presents the first fully-fledged discourse parser for Russian based on the Rhetorical Structure Theory of Mann and Thompson (1988). For segmentation, discourse tree construction, and discourse relation classification, we employ deep learning models. Using multiple word embedding techniques, we achieve a new state of the art for discourse segmentation of Russian texts. We find that neural classifiers using contextual word representations outperform previously proposed feature-based models for discourse relation classification. By ensembling both methods, we further improve discourse relation classification performance, achieving a new state of the art for Russian.

Original language: English
Title of host publication: Analysis of Images, Social Networks and Texts - 9th International Conference, AIST 2020, Revised Selected Papers
Editors: Wil M. van der Aalst, Vladimir Batagelj, Dmitry I. Ignatov, Michael Khachay, Olessia Koltsova, Andrey Kutuzov, Sergei O. Kuznetsov, Irina A. Lomazova, Natalia Loukachevitch, Amedeo Napoli, Alexander Panchenko, Panos M. Pardalos, Marcello Pelillo, Andrey V. Savchenko, Elena Tutubalina
Publisher: Springer Science and Business Media Deutschland GmbH
Pages: 105-119
Number of pages: 15
ISBN (print): 9783030726096
DOI
Status: Published - 2021
Event: 9th International Conference on Analysis of Images, Social Networks and Texts, AIST 2020 - Moscow, Russian Federation
Duration: 15 Oct 2020 to 16 Oct 2020

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 12602 LNCS
ISSN (print): 0302-9743
ISSN (electronic): 1611-3349

Conference

Conference: 9th International Conference on Analysis of Images, Social Networks and Texts, AIST 2020
Country/Territory: Russian Federation
City: Moscow
Period: 15/10/20 to 16/10/20

