This compressed folder includes the code used for scraping and building the dataset, the intermediate datasets and the (not cross-verified) exhaustive dataset. This dataset is linked to the following paper that should be cited directly instead of the data itself:
Morgane Laouenan, Palaash Bhargava, Jean-Benoît Eyméoud, Olivier Gergaud, Guillaume Plique, Etienne Wasmer (2022) A cross-verified database of notable people, 3500BC-2018AD, Scientific Data, June 2022.
Bibtex:
@article{bhht3,
author = {Laouenan, Morgane and Bhargava, Palaash and Eyméoud, Jean-Benoît and Gergaud, Olivier and Plique, Guillaume and Wasmer, Etienne},
title = {A cross-verified database of notable people, 3500BC-2018AD},
journal = {Scientific Data},
publisher = {Nature Publishing Group},
year = {2022},
month = {Jun},
day = {09},
volume = {9},
number = {1},
pages = {290},
issn = {2052-4463},
doi = {10.1038/s41597-022-01369-4},
url = {https://doi.org/10.1038/s41597-022-01369-4}
}
The intermediate files as well as the exhaustive database are not cross-verified and should not be used directly or under the full responsibility of users.
All datasets included in this folder are subject to CC-BY-SA licensing.