PolyNarrative: A Multilingual, Multilabel, Multi-domain Dataset for Narrative Extraction from News Articles”

The dataset described in the paper:

Nikolaos Nikolaidis, Nicolas Stefanovitch, Purificação Silvano, Dimitar Iliyanov Dimitrov, Roman Yangarber, Nuno Guimarães, Elisa Sartori, Ion Androutsopoulos, Preslav Nakov, Giovanni Da San Martino, Jakub Piskorski
PolyNarrative: A Multilingual, Multilabel, Multi-domain Dataset for Narrative Extraction from News Articles
In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, Vienna, Austria, 2025

can be accessed an used for research purposes in the following manner:

  1. Please register to the Task 10 at SemEval 2025 at:
    https://propaganda.math.unipd.it/semeval2025task10/

  2. Once registered and vetted the access will be granted to train and dev data of the shared task which corresponds to TRAIN and TEST the dataset described in the paper.

TRAIN set: The TRAIN set from the paper corresponds to the union of the files: batch 1-3 (Dec. 4) and “Additional Training data for Russian” File names after downloading: target_4_December_release.zip and training_data_RU_final_19_January_2025_release.zip respectively.

TEST set: Display name on website: Development set File name after downloading: target_4_December_release.zip

Citation

@inproceedings{nikolaidis-etal-2025-polynarrative,
    title = "{P}oly{N}arrative: A Multilingual, Multilabel, Multi-domain Dataset for Narrative Extraction from News Articles",
    author = "Nikolaidis, Nikolaos  and
      Stefanovitch, Nicolas  and
      Silvano, Purifica{\c{c}}{\~a}o  and
      Dimitrov, Dimitar Iliyanov  and
      Yangarber, Roman  and
      Guimar{\~a}es, Nuno  and
      Sartori, Elisa  and
      Androutsopoulos, Ion  and
      Nakov, Preslav  and
      Da San Martino, Giovanni  and
      Piskorski, Jakub",
    editor = "Che, Wanxiang  and
      Nabende, Joyce  and
      Shutova, Ekaterina  and
      Pilehvar, Mohammad Taher",
    booktitle = "Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
    month = jul,
    year = "2025",
    address = "Vienna, Austria",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2025.acl-long.1513/",
    pages = "31323--31345",
    ISBN = "979-8-89176-251-0"}