You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
<metaitemprop="description"content="Multi-News, consists of news articles and human-written summaries of these articles from the site newser.com. Each summary is professionally writtenby editors and includes links to the original articles cited. There are two features: - document: text of news articles seperated by special token "|||||". - summary: news summary. To use this dataset: ```python import tensorflow_datasets as tfds ds = tfds.load('multi_news', split='train') for ex in ds.take(4): print(ex) ``` See [the guide](https://www.tensorflow.org/datasets/overview) for more informations on [tensorflow_datasets](https://www.tensorflow.org/datasets). " />
6
+
<metaitemprop="description"content="# Multi-News Dataset Multi-News consists of news articles and human-written summaries of these articles from the news site `newser.com`. Each summary is professionally written by editors andincludes links to the original articles cited. This is the first large-scale dataset for multi-document summarization on news articles. Each record has two features: * `document`: Texts of news articles, separated by special token "|||||". * `summary`: Summary of the news. To use this dataset: ```python import tensorflow_datasets as tfds ds = tfds.load('multi_news', split='train') for ex in ds.take(4): print(ex) ``` See [the guide](https://www.tensorflow.org/datasets/overview) for more informations on [tensorflow_datasets](https://www.tensorflow.org/datasets). " />
<metaitemprop="citation"content="@misc{alex2019multinews, title={Multi-News: a Large-Scale Multi-Document Summarization Dataset and Abstractive Hierarchical Model}, author={Alexander R. Fabbri and Irene Li and Tianwei She and Suyi Li and Dragomir R. Radev}, year={2019}, eprint={1906.01749}, archivePrefix={arXiv}, primaryClass={cs.CL} }" />
@@ -12,14 +12,26 @@
12
12
# `multi_news`
13
13
14
14
15
+
Note: This dataset has been updated since the last stable release. The new
16
+
versions and config marked with
17
+
<spanclass="material-icons"title="Available only in the tfds-nightly package">nights_stay</span>
18
+
are only available in the `tfds-nightly` package.
19
+
15
20
***Description**:
16
21
17
-
Multi-News, consists of news articles and human-written summaries of these
18
-
articles from the site newser.com. Each summary is professionally written by
19
-
editors and includes links to the original articles cited.
22
+
# Multi-News Dataset
23
+
24
+
Multi-News consists of news articles and human-written summaries of these
25
+
articles from the news site `newser.com`. Each summary is professionally written
26
+
by editors and includes links to the original articles cited.
27
+
28
+
This is the first large-scale dataset for multi-document summarization on news
29
+
articles.
30
+
31
+
Each record has two features:
20
32
21
-
There are two features: - document: text of news articles seperated by special
22
-
token "|||||". - summary: news summary.
33
+
*`document`: Texts of news articles, separated by special token "|||||".
0 commit comments