...

Europeana’s FTP server

Our FTP server provides ZIP files containing the metadata of all objects in Europeana's repository, organised by dataset and readily available for bulk download. The files are regenerated every Sunday evening, so each download reflects the repository as of the most recent weekly run.

FTP listing and file structure

All the files are available on our FTP server at ftp://download.europeana.eu/dataset/. You can connect with an FTP client such as FileZilla, by adding the server as a Shared Network Location, or from the Command Prompt. On Linux, you can mirror the XML directory with the command: wget -m ftp://download.europeana.eu/dataset/XML
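For scripted downloads, the same transfer can be sketched with Python's standard ftplib module. This is a minimal sketch, not an official client: it assumes the server accepts anonymous logins, and the helper names split_ftp_url and download are made up for illustration.

```python
from ftplib import FTP
from pathlib import Path
from urllib.parse import urlparse


def split_ftp_url(url: str) -> tuple[str, str, str]:
    """Split an ftp:// URL into (host, remote directory, file name)."""
    parts = urlparse(url)
    if parts.scheme != "ftp" or not parts.hostname:
        raise ValueError(f"not an FTP URL: {url!r}")
    directory, _, filename = parts.path.rpartition("/")
    return parts.hostname, directory or "/", filename


def download(url: str, dest_dir: str = ".") -> Path:
    """Download one file over FTP and return the local path."""
    host, directory, filename = split_ftp_url(url)
    target = Path(dest_dir) / filename
    with FTP(host) as ftp:
        ftp.login()  # assumption: anonymous login is accepted
        ftp.cwd(directory)
        with open(target, "wb") as fh:
            # Stream the remote file to disk in binary mode.
            ftp.retrbinary(f"RETR {filename}", fh.write)
    return target
```

Calling download("ftp://download.europeana.eu/dataset/XML/2021672.zip") would then fetch a single dataset archive into the current directory.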

...

  • Two top-level directories, ‘XML’ and ‘TTL’, hold the data in RDF/XML format and in Turtle format respectively.

  • Within those directories, each ZIP file contains all of the metadata for one dataset in Europeana, and the name of the file is the dataset identifier (e.g. 2021672.zip). Every ZIP file has a corresponding MD5 checksum file with the extension .md5sum (e.g. 2021672.zip.md5sum), which can be used to validate the file after download.

  • Each ZIP file contains one file per Europeana metadata record, named after the local identifier of the record in Europeana.
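The checksum validation mentioned above can be sketched as follows. The exact layout of the .md5sum files is not described here, so this sketch assumes the common md5sum convention in which the hex digest is the first whitespace-separated token; md5_of and verify_download are illustrative names.

```python
import hashlib
from pathlib import Path


def md5_of(path, chunk_size=1 << 20):
    """Return the MD5 hex digest of a file, read in chunks to bound memory use."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(zip_path, md5sum_path=None):
    """Compare a downloaded ZIP against its .md5sum companion file.

    Assumption: the digest is the first whitespace-separated token
    in the checksum file.
    """
    zip_path = Path(zip_path)
    if md5sum_path is None:
        md5sum_path = Path(str(zip_path) + ".md5sum")
    expected = Path(md5sum_path).read_text().split()[0].lower()
    return md5_of(zip_path) == expected
```

With both 2021672.zip and 2021672.zip.md5sum in the same directory, verify_download("2021672.zip") returns True when the digests match.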

Example

The data for “Girl with a Pearl Earring” from the Mauritshuis, encoded in RDF/XML, is available at ftp://download.europeana.eu/dataset/XML/2021672.zip. To find out which dataset a record belongs to, check the URL of the record: the dataset identifier is the path segment immediately after /item/ (for “Girl with a Pearl Earring”, the Europeana item URL is https://www.europeana.eu/item/2021672/resource_document_mauritshuis_670, so the dataset is 2021672). Alternatively, you can find the dataset name next to the field 'Collection Name' in the 'More Metadata' tab on the item page.
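The mapping from an item URL to its dataset archive can be sketched in a few lines. The function names are illustrative, and the sketch assumes item URLs always end in /item/{dataset}/{local identifier} as in the example above.

```python
from urllib.parse import urlparse

FTP_BASE = "ftp://download.europeana.eu/dataset"


def split_item_url(item_url):
    """Return (dataset_id, local_id) from a Europeana item URL."""
    segments = [s for s in urlparse(item_url).path.split("/") if s]
    # Expect the path to end in: item / <dataset id> / <local id>
    if len(segments) < 3 or segments[-3] != "item":
        raise ValueError(f"not a Europeana item URL: {item_url!r}")
    return segments[-2], segments[-1]


def dataset_zip_url(dataset_id, fmt="XML"):
    """Build the FTP URL of a dataset archive; fmt is 'XML' or 'TTL'."""
    if fmt not in ("XML", "TTL"):
        raise ValueError("fmt must be 'XML' or 'TTL'")
    return f"{FTP_BASE}/{fmt}/{dataset_id}.zip"
```

For the item URL above, split_item_url yields the dataset identifier 2021672, which dataset_zip_url turns back into the archive address.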

Requesting the URL ftp://download.europeana.eu/dataset/XML/2021672.zip returns a ZIP file with the metadata for all the objects in the dataset with the dataset number '2021672'. Unzipping it gives you one XML file for every digital cultural heritage object. The metadata for “Girl with a Pearl Earring”, whose local identifier is 'resource_document_mauritshuis_670', is in the XML file named "resource_document_mauritshuis_670.xml".
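A single record can also be read straight from the archive without unpacking everything. The sketch below uses Python's zipfile module; read_record is an illustrative name, and because the internal folder layout of the archives is not specified here, it matches on the file name only.

```python
import zipfile
from pathlib import PurePosixPath


def read_record(zip_path, local_id, extension=".xml"):
    """Return the raw bytes of one record from a dataset ZIP file.

    Matches on the member's file name only, since the folder layout
    inside the archive is an unknown here.
    """
    wanted = local_id + extension
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            if PurePosixPath(name).name == wanted:
                return archive.read(name)
    raise KeyError(f"{wanted} not found in archive")
```

For example, read_record("2021672.zip", "resource_document_mauritshuis_670") would return the RDF/XML bytes for that record.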

OAI-PMH

The Europeana OAI-PMH Service lets you collect large amounts of Europeana data from our repository through the OAI-PMH protocol (Open Archives Initiative Protocol for Metadata Harvesting, currently v2.0). You can harvest the entire database or a selection of it, chosen by specifying which datasets you want or by filtering on the date the data was created or modified.

You can learn more about the harvesting protocol on the Open Archives Initiative (OAI) website and also by reading the OAI for beginners tutorial from the Open Archives Forum.

Available requests

Below you can find the available requests. The base URL for all requests is https://api.europeana.eu/oai/record/. These links and requests return XML, so you need an XML-aware browser or viewing application to read the responses.

...

Structure and Format of the Data

The records in the OAI-PMH service are grouped into Datasets and are available as EDM RDF/XML. An example of a dataset ID that is accepted by the OAI-PMH service is 2022608_Ag_NO_ELocal_DiMu. The records are identified by their URIs. An example of such an identifier is http://data.europeana.eu/item/2022608/AAK_AAKS_2007_02_0206. To learn more about http://data.europeana.eu and its resources please see the EDM definitions.
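As an illustration of how these identifiers fit into requests, the sketch below builds request URLs with the standard OAI-PMH v2.0 parameters (verb, metadataPrefix, set, from, until, identifier). The base URL is copied from this page; the metadataPrefix value "edm" and the function names are assumptions, so check the service's ListMetadataFormats response before relying on them.

```python
from urllib.parse import urlencode

BASE_URL = "https://api.europeana.eu/oai/record/"  # base URL as given on this page
DEFAULT_PREFIX = "edm"  # assumption: prefix under which EDM RDF/XML is served


def list_records_url(dataset_id=None, from_date=None, until_date=None,
                     metadata_prefix=DEFAULT_PREFIX):
    """Build a ListRecords URL, optionally limited to a dataset and date range."""
    query = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if dataset_id:
        query["set"] = dataset_id      # OAI-PMH 'set' selects one dataset
    if from_date:
        query["from"] = from_date      # e.g. '2020-01-01'
    if until_date:
        query["until"] = until_date
    return BASE_URL + "?" + urlencode(query)


def get_record_url(identifier, metadata_prefix=DEFAULT_PREFIX):
    """Build a GetRecord URL for one record URI."""
    query = {"verb": "GetRecord", "metadataPrefix": metadata_prefix,
             "identifier": identifier}
    return BASE_URL + "?" + urlencode(query)
```

For instance, list_records_url(dataset_id="2022608_Ag_NO_ELocal_DiMu") targets the example dataset, and get_record_url("http://data.europeana.eu/item/2022608/AAK_AAKS_2007_02_0206") targets the example record.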

Known limitations

Europeana does not currently maintain a registry of deleted records. We therefore recommend that you re-harvest or download the entire collection at least every six months to ensure your copy of the Europeana repository is up-to-date.
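Because deletions are not announced, one way to spot them is to compare the record identifiers from two complete harvests. The helper below is a hypothetical sketch of that comparison and assumes both harvests cover the same datasets.

```python
def removed_since(previous_ids, current_ids):
    """Return identifiers present in the previous harvest but not the current one."""
    return sorted(set(previous_ids) - set(current_ids))
```

Records returned by removed_since can then be dropped from the local copy.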

Roadmap and Changelog

We deploy new versions of the service primarily to fix outstanding issues or introduce new features. The current version of the OAI-PMH Service is 0.8 Beta (2020-10). To see the changes made in this version and in all previous releases, see the API changelog in the project's GitHub repository.