As part of the delivery of Milestone 46, a baseline list of approximately 50 international Research Infrastructures (RIs) that support collections digitisation was compiled from existing internal knowledge. The list covered biodiversity collections as well as adjacent fields such as microbiology, chemistry and marine biology.
Next, the types of data and insights that users would most want to view and query were identified. The following categories were selected as the key dimensions on which to aggregate and query data for each research infrastructure: fields and specialities, geographic coverage, funding source(s), funding timeline, services, projects and users.
Data was then collected for each of these categories and coded into a consistent structure, relying primarily on RI websites and funding body websites such as the European Commission's CORDIS and the National Science Foundation (NSF) grants portal. This process was applied to a sample set of 35 RIs covering a range of specialities and geographies, to ensure the categorisation was applicable in a wide variety of circumstances. The final categorisation consisted of 8 service categories comprising more than 40 specific services, as well as 8 fields comprising 7 specialities.
This sample dataset was then aggregated and organised into a relational database structure, which was used to prototype a visual dashboard in Microsoft Power BI.
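As an illustration of what such a relational structure might look like, the following is a minimal sketch using Python's built-in sqlite3 module. It is not the schema used in the project: the table names, column names and the helper functions build_example_db and ris_per_service_category are assumptions made for this example, and the rows are placeholders rather than collected data. It covers only the services dimension, linking RIs to specific services that roll up into service categories.

```python
import sqlite3

# Hypothetical schema: table and column names are illustrative assumptions,
# not the structure actually used for the dashboard prototype.
SCHEMA = """
CREATE TABLE ri (
    ri_id  INTEGER PRIMARY KEY,
    name   TEXT NOT NULL,
    region TEXT                      -- geographic coverage
);
CREATE TABLE service_category (
    category_id INTEGER PRIMARY KEY,
    name        TEXT NOT NULL UNIQUE
);
CREATE TABLE service (
    service_id  INTEGER PRIMARY KEY,
    name        TEXT NOT NULL,
    category_id INTEGER NOT NULL REFERENCES service_category(category_id)
);
-- Many-to-many link: one RI offers many services, one service is offered by many RIs.
CREATE TABLE ri_service (
    ri_id      INTEGER NOT NULL REFERENCES ri(ri_id),
    service_id INTEGER NOT NULL REFERENCES service(service_id),
    PRIMARY KEY (ri_id, service_id)
);
"""


def build_example_db():
    """Create an in-memory database filled with placeholder rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany(
        "INSERT INTO ri VALUES (?, ?, ?)",
        [(1, "Example RI A", "Europe"), (2, "Example RI B", "North America")],
    )
    conn.executemany(
        "INSERT INTO service_category VALUES (?, ?)",
        [(1, "Category X"), (2, "Category Y")],
    )
    conn.executemany(
        "INSERT INTO service VALUES (?, ?, ?)",
        [(1, "Service X1", 1), (2, "Service X2", 1), (3, "Service Y1", 2)],
    )
    conn.executemany(
        "INSERT INTO ri_service VALUES (?, ?)",
        [(1, 1), (1, 2), (2, 1), (2, 3)],
    )
    conn.commit()
    return conn


def ris_per_service_category(conn):
    """Return {category name: number of distinct RIs offering a service in it}."""
    rows = conn.execute(
        """
        SELECT sc.name, COUNT(DISTINCT rs.ri_id)
        FROM service_category AS sc
        JOIN service AS s ON s.category_id = sc.category_id
        JOIN ri_service AS rs ON rs.service_id = s.service_id
        GROUP BY sc.name
        ORDER BY sc.name
        """
    ).fetchall()
    return dict(rows)
```

Keeping services in their own table, joined to RIs through a link table, means a query such as ris_per_service_category counts each RI once per category even when it offers several services in that category. The same pattern would extend to the other categories (fields and specialities, funding sources, projects, users).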
A more detailed report on the methodology used can be found in Milestone 47.