Besides our main Europeana APIs for searching and retrieving metadata about objects, we also offer other methods for downloading and harvesting metadata that are better suited if you’re looking to extract large amounts of data. On this page, you can explore the two solutions available.
If you want a full discrete dataset from a single data provider, or if you want just a snapshot of our data, then we suggest downloading object metadata from our FTP server as pre-generated compressed ZIP files.
If you want to be kept up-to-date as metadata is changed or if you already use harvesting software, then we recommend using our Harvesting solution using the OAI-PMH service. OAI-PMH serves your files in XML format, which is ideal for data processing activities, especially for digital cultural heritage research. For researchers who are used to working with semantic frameworks and tools such as JENA and SPARQL, we also offer compressed zip files for download formatted in Turtle.
Our FTP server serves ZIP files containing the metadata of all objects in Europeana's repository, organised by dataset, readily available for bulk download. These files are generated on Sunday evening each week, which guarantees that the data is as up-to-date as possible.
two top-level directories, ‘XML’ and ‘TTL’, split the data in RDF-XML format and in Turtle format respectively.
Within those directories, every ZIP file has all of the metadata for each Dataset in Europeana, where the name of the file is the dataset identifier (e.g. 2021672.zip). Every ZIP file has a corresponding MD5 checksum file under the file extension .md5sum (e.g. 2021672.zip.md5sum) which can be used to validate the file upon download.
In each compressed zip file there will be a file for each Europeana metadata record where the name of the file will be the local identifier of the Record in Europeana.
The FTP server will provide you with a ZIP file with the metadata for all the objects in the dataset with the dataset number '2021672' if you request the URL ftp://download.europeana.eu/dataset/XML/2021672.zip. Unzipping the ZIP File will give you an XML file for every digital cultural heritage object. You can find the metadata for the “Girl with the Pearl Earring” in the ZIP file with the ID of that object, 'resource_document_mauritshuis_670' in the XML file named "resource_document_mauritshuis_670.xml"
The Europeana OAI-PMH Service offers a way to collect large amounts of Europeana data from our repository through a protocol named OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting, presently in v2.0). This service allows you to harvest the entirety of our database or a selection of our database. You can select which parts of the Europeana database to download by specifying which datasets you want to download, or by filtering on the date of creation or date of modification of the data.
Below you can find the available requests. The base URL for all requests is https://api.europeana.eu/oai/record/. These links and requests return XML, for which you need to use an XML-aware browser or viewing application.
List of available requests defined by the OAI-PMH protocol:
Europeana currently doesn't maintain a deleted record registry. Therefore we recommend you re-harvest or download the entire collection at least every six months to ensure your copy of the Europeana repository is up-to-date.
Roadmap and Changelog
We deploy new versions of the service primarily to fix any outstanding issues or introduce new features. The current version of the OAI-PMH Service is 0.8 Beta (2020-10). To see the changes made for this version and also all previous releases, see the API changelog in the project GitHub.