Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Given the fact that the files are very big and can take many hours to download, as an alternative to download directly via the browser, you can login to the FTP server at "download.europeana.eu" with username "anonymous". This will allow you to resume if the download gets stuck.

dataset number

Metadata1

Full-text (ALTO)2

Page level full-text (EDM)3

Issue level full-text (EDM)4

9200300

download

 (229M) (MD5)

download

 (63G) (MD5)

download

 (116G) (MD5)

download

 (113G) (MD5)

9200301

download

 (37M) (MD5)

download

 (13G) (MD5)

download

 (20G) (MD5)

download

 (20G) (MD5)

9200338

download

 (213M) (MD5)

download

 (158G) (MD5)

download

 (278G) (MD5)

download

 (277G) (MD5)

9200339

download

 (39M) (MD5)

download

 (11G) (MD5)

download

 (21G) (MD5)

download

 (17G) (MD5)

9200355

download

 (212M) (MD5)

download

 (97G) (MD5)

download

 (159G) (MD5)

download

 (157G) (MD5)

9200356

download

 (137M) (MD5)

download

 (40G) (MD5)

download

 (17G) (MD5)

download

 (17G) (MD5)

9200357

download

 (23M) (MD5)

download

 (5G) (MD5)

download

 (9G) (MD5)

download

 (9G) (MD5)

9200396

download

 (4M) (MD5)

download

 (849M) (MD5)

download

 (2G) (MD5)

download

 (1G) (MD5)

Legend:

  1. The original metadata in EDM XML format before being ingested into Europeana. There are slight differences between this data and the one published. For more information see the EDM documentation page.

  2. The full-text encoded using ALTO (Analyzed Layout and Text Object) as it was delivered to Europeana. The ALTO is an open XML Schema meant to describe text coming from OCR and layout information of pages for digitized material. For more information see the official documentation page at the Library of Congress.

  3. The full-text encoded using the EDM profile for IIIF fullltext after being preprocessed for publication in Europeana. A note that as opposed to the format used by the API (ie. JSON-LD), the data is in RDF/XML as it is the format used for ingestion into Europeana.

  4. Very similar to (3) but wih the full-text represented at the Issue level. This means that the edm:FullTextResource will convey the complete transcription of the Newspaper.

...

On each compressed zip file, there will typically be a file per each item (ie. metadata or issue level full-text) or page (ie. ALTO and page level full-text) with the following structure:

Item

DATASET_ID/LOCAL_ID.xml

Page

DATASET_ID/LOCAL_ID/PAGE_ID.xml

That structure can be translated into links to the Europeana Collection portal where the item can be displayed or into the several APIs described on this page.

...

Europeana currently doesn't maintain a deleted record registry. Therefore we recommend you re-harvest or download the entire collection at least every six months to ensure your copy of the Europeana repository is up-to-date.

Console

Swc macro
urlhttps://api.europeana.eu/console/docs/v3/oai.json

Roadmap and Changelog

We deploy new versions of the service primarily to fix any outstanding issues or introduce new features. The current version of the OAI-PMH Service is 0.8 Beta (2020-10). To see the changes made for this version and also all previous releases, see the API changelog in the project GitHub.