EASY offers sustainable archiving of research data and access to thousands of datasets.
2017-01-20 Velden, T. (Technische Universität Berlin) 10.17026/dans-zzq-z4xh
This data set was generated to compare topic extraction solutions against each other without the availability of an authoritative ground truth. The topics were extracted from a corpus of bibliographic data from the Web of Science of publications in 59 journals in Astrophysics and Astronomy published between 2003-2011. The comparative analysis was undertaken to obtain insights into how topic extraction approaches construct topics differently and how the resulting topical structures differ.