Media policy

The Europeana Media Policy outlines the requirements for the links to the media resources that are part of the provided metadata. It complements the information provided in the Publishing Guide and the EDM Mapping Guidelines. By following these requirements, we can give audiences a better experience and a greater connection with your collections. When these requirements are met, we will:

  • Extract technical metadata from media resources (e.g. mime-type, image size, duration, colour) to allow for a richer search and browse experience and to power functionalities such as enlarging and downloading images and other media.

  • Generate thumbnail (preview) images based on media resources, and show these images as part of the search results to help users find what they are looking for.

  • Display media resources on the item pages in Europeana Collections for a user to get the best possible experience and be able to interact with the content.

Metadata and Media

We will process (links to) media resources in the following three metadata fields:

  • edm:object

  • edm:isShownBy

  • edm:hasView (each instance is processed; more than one instance of edm:hasView is possible when more than one media resource is provided)

We will also process links provided with edm:isShownAt, but we will not generate any thumbnails. Processing the edm:isShownAt involves checking that the link is resolvable and storing the mime-type of the working link. For more details about the metadata fields and their definition, please consult the EDM documentation.
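As an illustration of which links are picked up, the following minimal Python sketch collects the values of the four fields from an EDM record in RDF/XML. It assumes the links are given as rdf:resource attributes on an ore:Aggregation; the sample record, its example.org URLs and the function name collect_links are hypothetical and not part of any Europeana tooling.

```python
import xml.etree.ElementTree as ET

NS = {
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
    "edm": "http://www.europeana.eu/schemas/edm/",
    "ore": "http://www.openarchives.org/ore/terms/",
}
RDF_RESOURCE = "{%s}resource" % NS["rdf"]

# Fields whose links are processed as media resources.
MEDIA_FIELDS = ("object", "isShownBy", "hasView")

# Hypothetical sample record (all URLs are placeholders).
SAMPLE = """<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:edm="http://www.europeana.eu/schemas/edm/"
    xmlns:ore="http://www.openarchives.org/ore/terms/">
  <ore:Aggregation rdf:about="https://example.org/aggregation/1">
    <edm:isShownAt rdf:resource="https://example.org/item/1"/>
    <edm:isShownBy rdf:resource="https://example.org/media/1.jpg"/>
    <edm:object rdf:resource="https://example.org/media/1_small.jpg"/>
    <edm:hasView rdf:resource="https://example.org/media/1_back.jpg"/>
    <edm:hasView rdf:resource="https://example.org/media/1_side.jpg"/>
  </ore:Aggregation>
</rdf:RDF>"""


def collect_links(edm_xml):
    """Return media links (with their field name) and edm:isShownAt links."""
    root = ET.fromstring(edm_xml)
    links = {"media": [], "isShownAt": []}
    for aggregation in root.findall("ore:Aggregation", NS):
        for field in MEDIA_FIELDS:
            for element in aggregation.findall("edm:" + field, NS):
                links["media"].append((field, element.get(RDF_RESOURCE)))
        # edm:isShownAt is only checked for resolvability; no thumbnail is made.
        for element in aggregation.findall("edm:isShownAt", NS):
            links["isShownAt"].append(element.get(RDF_RESOURCE))
    return links


links = collect_links(SAMPLE)
```

The two lists are kept apart because the policy treats them differently: media links are downloaded and processed, while edm:isShownAt links are only resolved.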

Extract Technical Metadata

Europeana will extract metadata from the media resources provided via the above metadata fields to aid search in Europeana Collections and the APIs, and to determine how to present or consume a web resource in Europeana Collections. This is what we refer to as technical metadata (e.g. the image size, image colours, duration of audio clips). EDM was extended to fit the five media types currently supported by Europeana, namely: Sound, Video, Text, Image and 3D objects.

EDM profile for technical metadata

To provide additional useful facets and filtering of Europeana resources, technical metadata is needed at the level of the WebResource, which was previously absent from EDM. This specification applies to the five media types which Europeana currently supports, namely: Sound, Video, Text, Image and 3D objects. This profile lists the properties that apply to the WebResource class, together with one additional class, defined to support such functionality.

The working document that contains background information and that forms the basis of this listing is available here: https://docs.google.com/spreadsheets/d/17LjVpdUHsqpLhvQ0kvvDd7ja6eAPhSB9dN3NQ0kEbko/edit?pli=1&gid=0#gid=0. Most of the added properties come from the EBUCore set of properties, available at: http://www.ebu.ch/metadata/ontologies/ebucore/index.html

Approach


A minimal approach has been adopted to the addition of properties to EDM. Not all functions require the addition of a specific property. Consider for example the filtering of Europeana objects based on the presence of a usable thumbnail. This can be detected by checking for the presence of a URL in the “edm:preview” property (and the resource flagged accordingly in a search system) without the need for a specific EDM property to hold that flag in Europeana’s reference data.

Some values can be used to generate further values and, depending on the stage in the processing of data where this happens, a property may be needed to record the output. For example, the orientation of an image is generated by comparing the numeric values given in the height and width properties, giving the result “Portrait” or “Landscape”. A few properties have been created in the EDM namespace even though similar properties already exist elsewhere (e.g. Exif and EBUCore). This is because the allowable values did not match our requirements at this time (e.g. EDM needs literal values in some cases). These properties are: “edm:hasColorSpace”, “edm:componentColor”, “edm:spatialResolution” and “edm:codecName”.
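The orientation example can be sketched in a few lines of Python. The function name is hypothetical, the lowercase values follow the ebucore:orientation definition, and treating a square image as landscape is an assumption of this sketch, since the profile does not say how that case is handled.

```python
def orientation(width, height):
    """Derive an orientation value from pixel width and height."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    # Assumption: a square image (width == height) is reported as landscape.
    return "portrait" if height > width else "landscape"
```

For the example values used in this profile (width 1024, height 768), the function returns "landscape".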

Properties

All the properties defined below should be applied to edm:WebResource.

edm:codecName

URI

http://www.europeana.eu/schemas/edm/codecName

Label

Codec

Definition

The name of a device or computer program capable of encoding or decoding a digital data stream or signal (i.e. coder-decoder)

Domain

edm:WebResource

Obligation & Occurrence

Optional (Minimum: 0, Maximum: 1)

Example

mp4v

Comment

Used for video

ebucore:hasMimeType

URI

http://www.ebu.ch/metadata/ontologies/ebucore/ebucore#hasMimeType

Label

MIME Type

Definition

The main MIME types as defined by IANA: e.g. audio, video, text, application, or a container MIME type.

Subproperty of

ebucore:hasFormat

Domain

edm:WebResource

Obligation & Occurrence

Optional (Minimum: 0, Maximum: 1)

Example

video/mp4

Comment

The controlled values are defined by IANA list of internet media types. Used for Video, Image, Sound, Text, 3D.

ebucore:fileByteSize

URI

http://www.ebu.ch/metadata/ontologies/ebucore/ebucore#fileByteSize

Label

File size

Definition

The size of a Media Resource expressed in bytes.

Domain

edm:WebResource

Range

xsd:long (as defined in EBUCore)

Obligation & Occurrence

Optional (Minimum: 0, Maximum: 1)

Example

39508

Comment

To indicate the storage requirements or file size of a digital resource. The file size is expressed in bytes. Used for Video, Image, Sound, Text, 3D.

ebucore:duration

URI

http://www.ebu.ch/metadata/ontologies/ebucore/ebucore#duration

Label

Duration

Definition

The duration of a track or a signal expressed in ms.

Subproperty of

ebucore:duration

Domain

edm:WebResource

Obligation & Occurrence

Optional (Minimum: 0, Maximum: 1)

Example

270000

Comment

An implementation decision was made to use this property instead of a more normative property such as “ebucore:durationNormalPlayTime” so that a duration could be expressed as a number. Used for Video and Sound.
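Because the duration is stored as a plain number of milliseconds, turning it into a readable form is simple arithmetic. The following Python sketch shows one way to do it; the function name and the output format are illustrative choices, not part of the profile.

```python
def format_duration(milliseconds):
    """Format a duration given in milliseconds as h:mm:ss.mmm."""
    total_seconds, ms = divmod(milliseconds, 1000)
    minutes, seconds = divmod(total_seconds, 60)
    hours, minutes = divmod(minutes, 60)
    return "%d:%02d:%02d.%03d" % (hours, minutes, seconds, ms)
```

The example value 270000 corresponds to 4 minutes and 30 seconds.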

ebucore:width

URI

http://www.ebu.ch/metadata/ontologies/ebucore/ebucore#width

Label

Width

Definition

The width of e.g. a video frame typically expressed as a number of pixels.

Domain

edm:WebResource

Range

xsd:integer

Obligation & Occurrence

Optional (Minimum: 0, Maximum: 1)

Example

1024

Comment

To define different widths of a video image e.g. display, active etc. Used for Video and Image.

ebucore:height

URI

http://www.ebu.ch/metadata/ontologies/ebucore/ebucore#height

Label

Height

Definition

The height of e.g. a video frame typically expressed as a number of pixels.

Domain

edm:WebResource

Range

xsd:integer (as defined in EBUCore)

Obligation & Occurrence

Optional (Minimum: 0, Maximum: 1)

Example

768

Comment

To define different heights of a video image e.g. display, active etc. Used for Video and Image.

edm:spatialResolution

URI

http://www.europeana.eu/schemas/edm/spatialResolution

Label

Spatial resolution

Definition

The spatial resolution of a media resource expressed in DPIs.

Domain

edm:WebResource

Range

xsd:nonNegativeInteger (as defined in EBUCore)

Obligation & Occurrence

Optional (Minimum: 0, Maximum: 1)

Example

300

Comment

Used for Text (e.g. PDFs).

ebucore:sampleSize

URI

http://www.ebu.ch/metadata/ontologies/ebucore/ebucore#sampleSize

Label

Sample size

Definition

The size of an audio sample in bits. Also called bit depth.

Domain

edm:WebResource

Range

xsd:integer (as defined in EBUCore)

Obligation & Occurrence

Optional (Minimum: 0, Maximum: 1)

Example

16

Comment

Used for Sound.

ebucore:sampleRate

URI

http://www.ebu.ch/metadata/ontologies/ebucore/ebucore#sampleRate

Label

Sample rate

Definition

The frequency at which an audio is sampled per second. Also called sampling rate.

Domain

edm:WebResource

Range

xsd:integer (as defined in EBUCore)

Obligation & Occurrence

Optional (Minimum: 0, Maximum: 1)

Example

44100

Comment

Used for Sound.

ebucore:bitRate

URI

http://www.ebu.ch/metadata/ontologies/ebucore/ebucore#bitRate

Label

Bitrate

Definition

To provide the bitrate at which the Media Resource can be played in bits per second.

Domain

edm:WebResource

Range

xsd:nonNegativeInteger (as defined in EBUCore)

Obligation & Occurrence

Optional (Minimum: 0, Maximum: 1)

Example

128000

Comment

Used for Video and Sound.

ebucore:frameRate

URI

http://www.ebu.ch/metadata/ontologies/ebucore/ebucore#frameRate

Label

Frame rate

Definition

The frame rate of the video signal in frames per second.

Domain

edm:WebResource

Range

xsd:double (as defined in EBUCore)

Obligation & Occurrence

Optional (Minimum: 0, Maximum: 1)

Example

300

Comment

Used for Video.

edm:hasColorSpace

URI

http://www.europeana.eu/schemas/edm/hasColorSpace

Label

Color Space

Definition

The color space of an image resource i.e. “grayscale” or “sRGB”

Domain

edm:WebResource

Obligation & Occurrence

Optional (Minimum: 0, Maximum: 1)

Example

grayscale

Comment

Used for Image.

edm:componentColor

URI

http://www.europeana.eu/schemas/edm/componentColor

Label

Component Color

Definition

A significant color present in an image. The colors must be taken from the CSS3 standard color palette and are expressed as a hexadecimal binary value.

Domain

edm:WebResource

Range

xsd:hexBinary

Obligation & Occurrence

Optional (Minimum: 0, Maximum: 6)

Example

FF0000

Comment

Used for Image.
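A minimal Python sketch of how edm:componentColor values could be checked and decoded follows. It only verifies the six-digit hexadecimal form and the maximum of six values per web resource; it does not check membership of the CSS3 palette, and both function names are hypothetical.

```python
import re

MAX_COMPONENT_COLORS = 6  # maximum occurrence stated for edm:componentColor


def to_rgb(value):
    """Decode a six-digit hexadecimal colour such as 'FF0000' into (r, g, b)."""
    if re.fullmatch(r"[0-9A-Fa-f]{6}", value) is None:
        raise ValueError("expected six hexadecimal digits, got %r" % value)
    return tuple(int(value[i:i + 2], 16) for i in (0, 2, 4))


def decode_component_colors(values):
    """Decode all colours of one web resource, enforcing the occurrence limit."""
    if len(values) > MAX_COMPONENT_COLORS:
        raise ValueError("at most %d colours are allowed" % MAX_COMPONENT_COLORS)
    return [to_rgb(value) for value in values]
```

The example value FF0000 decodes to pure red, i.e. (255, 0, 0).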

ebucore:orientation

URI

http://www.ebu.ch/metadata/ontologies/ebucore/ebucore#orientation

Label

Orientation

Definition

The orientation of a document or an image i.e. “landscape” or “portrait”.

Domain

edm:WebResource

Range

xsd:string (as defined in EBUCore)

Obligation & Occurrence

Optional (Minimum: 0, Maximum: 1)

Example

portrait

Comment

The value is generated by comparing the values in “ebucore:height” and “ebucore:width”. Used for Image.

ebucore:audioChannelNumber

URI

http://www.ebu.ch/metadata/ontologies/ebucore/ebucore#audioChannelNumber

Label

Audio channels

Definition

The total number of audio channels contained in the Media Resource.

Domain

edm:WebResource

Range

xsd:nonNegativeInteger (as defined in EBUCore)

Obligation & Occurrence

Optional (Minimum: 0, Maximum: 1)

Example

2

Comment

Used for Sound.

rdf:type

URI

http://www.w3.org/1999/02/22-rdf-syntax-ns#type

Label

Type

Definition

To indicate the type of a resource.

Range

rdfs:Resource

Obligation & Occurrence

Optional (Minimum: 0, Maximum: 1)

Example

<rdf:type rdf:resource="http://www.europeana.eu/schemas/edm/FullTextResource"/>

Comment

This is used to indicate that a web resource contains searchable, machine readable text. In the context of this EDM extension, “rdf:type” has been introduced to be used in relation with classes relevant for this extension, especially “edm:FullTextResource”.
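To show how the properties above sit together on one web resource, here is a small Python sketch that builds an edm:WebResource element with a few technical metadata properties. The namespace URIs are those given in this profile; the example.org URL, the function name and the choice to serialise every value as element text are assumptions made for illustration only.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
EDM = "http://www.europeana.eu/schemas/edm/"
EBUCORE = "http://www.ebu.ch/metadata/ontologies/ebucore/ebucore#"

# Use readable prefixes when serialising.
ET.register_namespace("rdf", RDF)
ET.register_namespace("edm", EDM)
ET.register_namespace("ebucore", EBUCORE)


def web_resource(url, properties):
    """Build an edm:WebResource element carrying literal property values."""
    element = ET.Element("{%s}WebResource" % EDM, {"{%s}about" % RDF: url})
    for (namespace, name), value in properties.items():
        child = ET.SubElement(element, "{%s}%s" % (namespace, name))
        child.text = str(value)
    return element


# Hypothetical image resource using example values from this profile.
resource = web_resource(
    "https://example.org/media/1.jpg",
    {
        (EBUCORE, "hasMimeType"): "image/jpeg",
        (EBUCORE, "fileByteSize"): 39508,
        (EBUCORE, "width"): 1024,
        (EBUCORE, "height"): 768,
        (EBUCORE, "orientation"): "landscape",
        (EDM, "hasColorSpace"): "sRGB",
    },
)
xml_text = ET.tostring(resource, encoding="unicode")
```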

Classes

edm:FullTextResource

URI

http://www.europeana.eu/schemas/edm/FullTextResource

Label

Full-text Resource

Definition

A resource that has machine readable full text content.

Subclass of

edm:InformationResource

Example

Typically this could be the result of a newspaper that has been digitised as an image or PDF file and then OCRed to produce a separate file with the full text.

Comment

When full-text is available for a digital object it must be represented in a specific class. This is because in some cases the WebResource that provides the view of the object to end-users does not contain the full-text in machine readable form. This is taken from the Europeana Libraries report on full-text.

Document history:

Jan 20, 2015 Robina Clayphan, Hugo Manguinhas - Final version after review

Feb 17, 2015 Hugo Manguinhas, Antoine Isaac - Additional comments incorporated

An overview of all the technical metadata facets in Europeana Collections can be found in the documentation on the Europeana Search API. The content tiers, i.e. the quality of a media resource as specified in the Europeana Publishing Framework, are calculated on the basis of the technical metadata.

Retrieving the links to media resources in the above mentioned metadata fields is required in order to access and download those media resources. Downloading each resource is necessary in order to generate the technical metadata (and thumbnails, see below). Europeana will only temporarily download the media resource, which will be discarded after processing.

In order to successfully access and download a media resource, the following requirements must be met:

  • The link to the media resource must be a valid URL, i.e. it must comply with either IETF RFC 3986 (Uniform Resource Identifier (URI): Generic Syntax) or IETF RFC 3987 (Internationalized Resource Identifiers (IRIs)). We recommend that the URL uses the HTTPS protocol.

  • The link must resolve to a media resource directly, not to a webpage (i.e. where the mime-type of the resource would be "text/html").

  • The link must not return more than three redirects before ending up on the media resource.

  • It must be possible to download the media resource within 20 minutes.

  • The media resource must have a valid mime-type. We have established a list of valid mime-types that we support from ingestion to display in Europeana Collections based on the mime-types maintained by the Internet Assigned Numbers Authority.

  • Large PDF files should be optimised for Fast Web View so they take less time to download.

An exception to these requirements is made for media resources that are embedded in the Europeana Collections portal (see the section below).
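The access and download requirements can be read as a checklist. The Python sketch below applies them to facts already gathered about a link (number of redirects, reported mime-type, download time) and performs no network access itself. The function name and messages are hypothetical, and restricting the scheme to http and https is an assumption of this sketch rather than a stated rule.

```python
from urllib.parse import urlsplit

MAX_REDIRECTS = 3
MAX_DOWNLOAD_SECONDS = 20 * 60


def link_problems(url, redirects, mime_type, download_seconds, supported_mime_types):
    """Return a list of requirement violations for one media link."""
    problems = []
    try:
        parts = urlsplit(url)
        # Assumption: only http(s) links with a host are considered usable.
        well_formed = parts.scheme in ("http", "https") and bool(parts.netloc)
    except ValueError:
        well_formed = False
    if not well_formed:
        problems.append("link is not a well-formed http(s) URL")
    if redirects > MAX_REDIRECTS:
        problems.append("more than %d redirects" % MAX_REDIRECTS)
    if mime_type == "text/html":
        problems.append("link resolves to a webpage, not a media resource")
    elif mime_type not in supported_mime_types:
        problems.append("mime-type is not in the list of valid mime-types")
    if download_seconds > MAX_DOWNLOAD_SECONDS:
        problems.append("download took longer than 20 minutes")
    return problems
```

An empty list means that none of the checks covered by the sketch failed; the recommendation to use HTTPS and the advice on PDF optimisation are not checked.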

Media Formats/Mime Types

A MIME type (Multipurpose Internet Mail Extensions type, also known as a media type) is a standardised label that indicates the format of a file. It is a fundamental characteristic of a digital resource that influences its ability to be accessed and used over time.

The purpose of this section is to list the MIME types, by type of content/media, that are or are not supported by Europeana.

Media formats supported on Europeana Collections

Format Name                       Mime-type          File Extension(s)

Image
  JPEG                            image/jpeg         .jpeg
  PNG                             image/png          .png
  GIF                             image/gif          .gif
  Generic Bitmap                  image/bmp          .bmp
  Microsoft Bitmap                image/x-ms-bmp     .bmp

Text
  PDF                             application/pdf    .pdf

Video
  MPEG-4 Video                    video/mp4          .mp4
  Open Web Media Project – Video  video/webm         .webm
  M4v                             video/x-m4v        .m4v
  Quicktime video                 video/quicktime    .mov, .qt

Audio
  MP3 or other MPEG format        audio/mpeg         .mp3
  Waveform Audio                  audio/x-wav        .wav, .wave

Media formats not supported in Europeana Collections

Not all the links to media resources that Europeana receives from data providers can be displayed in the Europeana Collections portal; however, these resources are still made available for download to our users for reuse. The respective media formats (and mime-types) are presented in the table below.

Format Name                                     Mime-type                  File Extension(s)

Image
  TIFF                                          image/tiff                 .tiff
  Adobe Photoshop                               image/vnd.adobe.photoshop  .psd

Text
  Plain Text                                    text/plain                 .txt

Video
  Microsoft Windows Media Video                 video/x-ms-wmv             .wmv
  Flash Video                                   video/x-flv                .flv
  MPEG Video                                    video/mpeg                 .mpg
  Audio Video Interleave (AVI)                  video/x-msvideo            .avi
  Microsoft Advanced Systems Format (ASF)       video/x-ms-asf             .asf

Audio
  Native FLAC format (FLAC in its own container)  audio/x-flac             .flac
  Windows Media Audio                           audio/x-ms-wma             .wma
  Audio Interchange File Format                 audio/x-aiff               .aiff, .aif, .aifc
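The two tables can be turned into a simple lookup. The Python sketch below copies the mime-types from the tables above; the function name, the returned labels and the handling of parameters such as "; charset=..." are illustrative assumptions.

```python
# Mime-types that can be displayed in Europeana Collections (first table).
DISPLAYED = {
    "image/jpeg", "image/png", "image/gif", "image/bmp", "image/x-ms-bmp",
    "application/pdf",
    "video/mp4", "video/webm", "video/x-m4v", "video/quicktime",
    "audio/mpeg", "audio/x-wav",
}

# Mime-types that are only offered for download (second table).
DOWNLOAD_ONLY = {
    "image/tiff", "image/vnd.adobe.photoshop",
    "text/plain",
    "video/x-ms-wmv", "video/x-flv", "video/mpeg", "video/x-msvideo",
    "video/x-ms-asf",
    "audio/x-flac", "audio/x-ms-wma", "audio/x-aiff",
}


def support_level(mime_type):
    """Classify a mime-type according to the two tables."""
    # Drop any parameters and normalise case before the lookup.
    normalised = mime_type.split(";")[0].strip().lower()
    if normalised in DISPLAYED:
        return "displayed"
    if normalised in DOWNLOAD_ONLY:
        return "download only"
    return "not listed"
```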

Generate Thumbnail Preview Images

A thumbnail is a small image of the digital object. Thumbnails directly affect the user click-through rate. Requirements and recommendations about the quality of thumbnails are provided in the Publishing Guide.

We will generate thumbnails from the media resources provided via the above metadata fields, if the mime-type of the media resource is valid (see also above). Note that for media resources with the mime-type “application/pdf” we only generate a thumbnail when we find an image in the PDF.

We will generate the following thumbnails:

  • An image with a width of at most 200 pixels.

  • An image with a width of at most 400 pixels.

The following must be noted:

  • The height of the thumbnail will be proportional in accordance with the aspect ratio.

  • If an image is narrower than 200 or 400 pixels, the original width of the image will be used as the width of the respective thumbnail (Europeana does not recommend using images which are smaller than 400 pixels in width).
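The sizing rules above amount to a short calculation, sketched here in Python. The function name is hypothetical, and rounding the height to the nearest pixel is an assumption, since the policy only states that the height is proportional.

```python
THUMBNAIL_WIDTHS = (200, 400)


def thumbnail_size(width, height, max_width):
    """Return (width, height) of a thumbnail limited to max_width pixels."""
    if width <= max_width:
        # Images narrower than the target keep their original size.
        return width, height
    # Assumption: the proportional height is rounded to the nearest pixel.
    return max_width, max(1, round(height * max_width / width))


sizes = [thumbnail_size(1024, 768, limit) for limit in THUMBNAIL_WIDTHS]
```

For a 1024 by 768 pixel image this gives thumbnails of 200 by 150 and 400 by 300 pixels.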

All thumbnails that we generate can be retrieved via the Thumbnail API. The thumbnail displayed in the search results of Europeana Collections is recorded in a specific metadata field: edm:preview. Its value is the thumbnail of the media resource given in edm:object, if available; otherwise it is the thumbnail of whichever has the higher resolution, the edm:isShownBy resource or the first edm:hasView resource.
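The selection rule for edm:preview can be sketched as follows in Python. Each candidate is represented as a hypothetical (url, width, height) tuple, and "highest resolution" is interpreted here as the largest pixel count, which is an assumption of this sketch.

```python
def preview_source(edm_object, is_shown_by, has_views):
    """Pick the media resource whose thumbnail becomes edm:preview.

    edm_object and is_shown_by are (url, width, height) tuples or None;
    has_views is a list of such tuples in metadata order.
    """
    if edm_object is not None:
        return edm_object
    first_view = has_views[0] if has_views else None
    candidates = [c for c in (is_shown_by, first_view) if c is not None]
    if not candidates:
        return None
    # Assumption: resolution is compared as width multiplied by height.
    return max(candidates, key=lambda c: c[1] * c[2])
```

Only the first edm:hasView takes part in the comparison; later views are ignored even if they are larger.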

Display of Media Resources

CORS

In order for the Europeana Collections portal to access and display media resources coming from providers’ servers, those servers must support Cross-Origin Resource Sharing (CORS). This is because web browsers restrict a web page from requesting resources from a domain other than the one that served the page (Collections in this case). CORS allows servers to specify not only who can access their resources but also how these resources can be accessed.

This is especially important for providers who share resources via IIIF, as CORS is essential for our IIIF viewer to perform the image information requests and to obtain the presentation manifests. Without CORS, the IIIF resources will be displayed on the provider’s website but cannot be viewed in the Europeana Collections portal.
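A provider can verify CORS support by inspecting the Access-Control-Allow-Origin header that their server returns. The Python sketch below checks a set of response headers that has already been fetched. The function name is hypothetical, the origin https://www.europeana.eu is used only as an illustration, and the sketch covers simple requests only (no preflight handling).

```python
def allows_origin(response_headers, origin):
    """Check whether response headers permit cross-origin reads by origin."""
    # Header names are case-insensitive, so normalise them first.
    headers = {name.lower(): value.strip() for name, value in response_headers.items()}
    allowed = headers.get("access-control-allow-origin")
    # "*" allows any origin; otherwise the value must match the origin exactly.
    return allowed == "*" or allowed == origin
```

A server that answers with "Access-Control-Allow-Origin: *" passes this check for every origin, while a server that sends no such header fails it.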

HTTPS

Providers of IIIF resources are encouraged to deliver their resources over HTTPS so that they can be included in our web page without issue.

Embedding of media resources

In order for an item to be embedded in the Europeana Collections portal, there must be a valid oEmbed endpoint (see example from SketchFab) available where the media referred to in the item is displayed using a third party viewer or player. An exception is made for some data partners that do not support oEmbed.

On the metadata side, edm:isShownBy holds a link to a webpage where the item can be displayed and which can be mapped to an oEmbed URL. Because nothing in the metadata relates the webpage link to the oEmbed endpoint, the Europeana Collections portal has to apply additional logic to convert the URL provided in edm:isShownBy into an oEmbed URL that can be used for embedding. For this, it uses an internal registry of oEmbed endpoints, which consists of a public list plus manual additions made by the Collections development team (including the Europeana oEmbed implementations that cover the exceptions). This means that whenever a new dataset uses an oEmbed endpoint that is not yet registered, that endpoint needs to be added to the registry.
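The conversion step can be pictured with a small Python sketch. The registry below is entirely hypothetical (a single example.org entry keyed by host name) and does not reflect the actual registry used by the portal; the request parameters url and format follow the oEmbed specification.

```python
from urllib.parse import urlencode, urlsplit

# Hypothetical registry: host name of the webpage -> oEmbed endpoint.
OEMBED_REGISTRY = {
    "media.example.org": "https://media.example.org/oembed",
}


def oembed_url(is_shown_by):
    """Convert an edm:isShownBy webpage link into an oEmbed request URL."""
    host = urlsplit(is_shown_by).hostname
    endpoint = OEMBED_REGISTRY.get(host)
    if endpoint is None:
        # Unknown provider: the endpoint would first have to be registered.
        return None
    return endpoint + "?" + urlencode({"url": is_shown_by, "format": "json"})
```

A link whose host is missing from the registry yields None, which mirrors the need to add new endpoints before their items can be embedded.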