We obtained and integrated 40 datasets from sources such as the World Bank,
the Global Innovation Index site, and UNESCO. The datasets we examined include:
- books published per country per year
- scientific publications
- GDP
The datasets contain time series data spanning 1950 to 2020 for a variety of
countries. Many countries had sparse data, which presented an additional
challenge during pre-processing. After normalizing the data and creating an
integrated dataset, we proceeded with our analysis.
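
The integration and normalization steps described above can be illustrated with a minimal sketch. It is not the pipeline we used: the function names (`integrate`, `min_max_normalize`, `coverage`), the `(country, year)` key layout, the choice of min-max scaling, and the toy values are all assumptions made for illustration.

```python
from collections import defaultdict


def integrate(sources):
    """Merge {indicator: {(country, year): value}} into one table keyed by (country, year).

    Missing observations (None) are skipped, so sparse rows simply lack that indicator.
    """
    table = defaultdict(dict)
    for indicator, records in sources.items():
        for key, value in records.items():
            if value is not None:
                table[key][indicator] = value
    return dict(table)


def min_max_normalize(table, indicators):
    """Rescale each indicator to [0, 1] using only the observed values (assumed scheme)."""
    normalized = {key: dict(row) for key, row in table.items()}
    for ind in indicators:
        observed = [row[ind] for row in table.values() if ind in row]
        if not observed:
            continue
        lo, hi = min(observed), max(observed)
        span = hi - lo
        for row in normalized.values():
            if ind in row:
                # A constant indicator has no spread; map it to 0.0.
                row[ind] = (row[ind] - lo) / span if span else 0.0
    return normalized


def coverage(table, indicators):
    """Fraction of (country, year) rows that have a value for each indicator."""
    n = len(table)
    return {
        ind: (sum(ind in row for row in table.values()) / n if n else 0.0)
        for ind in indicators
    }


# Toy, made-up values for two hypothetical countries "A" and "B".
sources = {
    "gdp": {("A", 2000): 100.0, ("A", 2001): 200.0, ("B", 2000): 300.0},
    "books": {("A", 2000): 10.0, ("B", 2000): 30.0, ("B", 2001): None},
}
indicators = list(sources)

table = integrate(sources)
normalized = min_max_normalize(table, indicators)
sparsity_report = coverage(table, indicators)
```

Keeping missing values out of the table, rather than filling them with zeros, means sparse countries do not distort the minimum and maximum used for scaling, and the coverage report shows how much of each indicator is actually observed.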