DATABASES
In order to estimate the total digitisation activity from incomplete survey results, it is necessary to compile other statistics that represent as complete a picture as possible of the total "universe" of cultural heritage institutions. The simplest database to represent that universe would be a complete list provided by each country of their domains: Audio-visual / Film Institutes; Archives / Records Offices; Museums / Galleries; and Libraries. These domains are coarse labels for the institutions concerned, as many could qualify for listing under several headings, and the organisational responsibility is often split at ministerial level differently according to each country. Hence a complete list is not yet available. Therefore, we have necessarily resorted to compiling a statistical estimate of the "universe". We have called this a "foundation database", because it forms the platform upon which the results from sample surveys can be applied to infer estimates of digitisation activity for all the relevant cultural heritage institutions. This will be updated and the statistics amended to reflect new data provided on Sheet "A", the proforma foundation quota tool distributed to experts in each country.
The foundation statistics, prior to verification by experts in each country, are reproduced in a downloadable spreadsheet. The database also covers countries outside the European Union (EU27), as those other countries may also wish to participate in the framework for generating digitisation statistics.
The database statistics are estimated based on extrapolations of information deriving from a variety of sources listed below:
- United Nations Educational, Scientific and Cultural Organisation (UNESCO)
- International Monetary Fund (IMF)
- EUROSTAT
- ERICARTS (Council of Europe: "Compendium of Cultural Policies and Trends in Europe", 8th edition 2007. )
- European Group on Museum Statistics (EGMUS) surveys
- Association des Cinémathèques Européennes (ACE)
- Conference of European National Librarians (CENL) survey
- LIBECON (EU project that compiled library statistics)
- Initial results from the Numeric survey
The data are attributed to the relevant sources in the spreadsheet.
Many of these data sources were incomplete and, where there were missing values, totals were inferred ("grossed") using averages based on values for available countries, weighting the estimate according to population size and/or GDP. The methodology is described in more detail below. These inferred values are intended to facilitate the estimation of EU totals rather and do not purport to be reliable bases for any single country.
As more data are assembled, we expect to either refine these estimates or overwrite them with actual data.
Grossing Methodology
For the purpose of grossing countries within the European Union, we defined two groups, based on their GDP at Purchasing Power Parity in 2004 (sourced from the IMF).
|
GDP$ (PPP) 2004 - Groupings
|
|
Luxembourg
|
71,405
|
Slovenia
|
20,863
|
|
Ireland
|
38,986
|
Czech Republic
|
19,432
|
|
Austria
|
32,738
|
Malta
|
18,760
|
|
Denmark
|
32,678
|
Hungary
|
16,884
|
|
United Kingdom
|
31,950
|
Slovakia
|
14,666
|
|
Netherlands
|
31,807
|
Estonia
|
14,221
|
|
Belgium
|
31,557
|
Lithuania
|
13,078
|
|
Finland
|
30,922
|
Poland
|
12,893
|
|
Sweden
|
30,674
|
Latvia
|
11,855
|
|
Italy
|
28,963
|
Romania
|
8,417
|
|
Germany
|
28,413
|
Bulgaria
|
8,188
|
|
France
|
28,252
|
|
|
|
Cyprus
|
26,609
|
|
|
|
Spain
|
25,082
|
|
|
|
Greece
|
22,891
|
|
|
|
Portugal
|
21,154
|
|
|
The missing values within the foundation database have been inferred from the known values within each group, based on per capita measures for each heading within the database. Where no data existed for all countries within the group, the EU average was used.
Where possible, other contextual data have been used to inform the estimates: Where we have been able to locate a directory, this has provided a reasonable estimate of the number of relevant institutions, and on this base the number of staff and institutional expenditure has been approximated, applying factors derived from known staffing and budgets in the other countries. An example of this approach is as follows:
- step a: Country X is in the EU >€21,000 GDP (PPP) as at 2004, and has 4 archives;
- step b: The observed average number of staff per archive is 24.4 full-time equivalents (i.e. based on known data);
- step c: Therefore, the estimated number of staff for Country X is: 24.4 x 4 = 97.6
Financial data are converted to euros using the relevant interbank exchange rates before grossing. No attempt has been made to convert the prices either to constant prices or to purchasing power parity. Therefore, the estimates represent cash expended each year.
Feed-back
The current foundation database provides an indicative first estimate of the "population" of cultural heritage institutions and the "universe" in which they exist, and will be refined as the study progresses. Please scrutinise the data for your own country, and if you know of a better source send us the details in an email to Numeric@ipf.co.uk.
In the meantime, please note that the statistics in the file remain to be validated, and that use of the data for purposes other than assisting with the Numeric Study is undertaken at the user`s own risk.
MODELS
Some survey data collected during the Numeric study may be suitable for informing estimates about the digitisation process. For instance, the average based on observed activities in a number of institutions, may provide a reasonable guide for another institution that has not yet undertaken the same digitisation process, but wishes to plan an appropriate budget.
CAUTION - THE MODELS ARE INTENDED TO PROVIDE AN INDICATION, NOT A DEFINITIVE BENCHMARK
As the Numeric study progresses, more data will become available upon which to generate estimator models. Even so, such models can only provide indicative advice, rather than definitive benchmarks. Users of the models will know that the characteristics of the analogue materials they wish to digitise may be different to those upon which the survey statistics were based. The standards to be adopted by your institution and other special requirements could require your budget to differ markedly from the model average. Furthermore, estimates based on any survey results are limited by the consistency of respondents` data and whether or not their institutions are representative of your own.
Please provide us with your feed-back about any of the models made available on the site, so that we can refine and improve them for the benefit of others. (Only one model is available at present) E-Mail you comments to Numeric@ipf.co.uk
INDICATIVE COST OF DIGITISING BOOKS
This model provides an estimate of the cost per page, according to the volume of material to be digitised. It is based on observations of planned projects varying between 14,000 and 300,000 pages. The model is in a Microsoft Excel spreadsheet, containing no macros.