<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="/style.xsl"?>
<html>
<head>
  <meta content="text/html; charset=ISO-8859-1" http-equiv="content-type" />
  <title>Home::Standards</title>
  <dir>Standards</dir>
  <topic>Standards</topic>
</head>
<body>

<h1>Multimedia Metadata Standards</h1><br />
Metadata is the value-added information which documents the
administrative, descriptive, preservation, technical and usage history
characteristics associated with resources. It provides the underlying
foundation upon which digital resources management systems are based to
provide fast, precise access to relevant resources across networks and
between organizations.<br />
Multimedia content analysis refers to understand semantic meanings
of MM documents by means of metadata extracted using common techniques of
image and signal processing and image analysis and understanding.<br />
Metadata is an important aspect of the creation and management of
digital images and other MM files. The information contained in metadata standards can regards the following aspects:<br />
<ul style="margin-left: 40px;">
  <li>the technical format of the image file</li>
  <li>the process by which the image was created</li>
  <li>the content of the image</li>
</ul>
Following these standards helps organizations to consistently record
information about their MM documents in a way that facilitates
retrieval and sharing in a networked environment.<br />
Metadata for MM documents can be classify according to the following three
types:<br />
<ol style="margin-left: 40px;">
  <li><span style="font-weight: bold; font-style: italic;">Descriptive
or Content metadata</span>: is information about the object
captured in the document (the object's name, title, materials, dates,
physical description, etc..). Content metadata is very important, as it
is the main way by which people can search and retrieve the MM
documents from a database. There are standards available to assist in
determining what information should be recorded about the object, and
how to record it.</li>
  <li><span style="font-weight: bold; font-style: italic;">Technical
metadata</span>: is also essential to properly manage digital
images. Technical metadata is data about the MM document itself (not
about an object in the document). For example, for a digital image, it
can include information about: the technical processes used in image
capture or manipulation or colour or file formats, and some of the
technical information that is recorded about the image, such as the
image file type, must be machine-readable (following specific technical
formats) so that, a computer system will be able to properly display
the image.</li>
  <li><span style="font-weight: bold; font-style: italic;">Administrative
metadata</span> includes information related to the
management of MM documents (such as rights management).</li>
</ol>
MM Metadata can be also classified according to other criteria
considering the level of data description, the producibility and the
domain dependence <a href="./../References/References.html#ref4">[4]</a>.
<br />
They can be classified by:<br />
<ol style="margin-left: 40px;">
  <li><span style="font-weight: bold; font-style: italic;">Level</span><span
 style="font-style: italic;">: </span>we can distinguish between a
technical level in which
lower level aspects of the MM content is described and a
semantic level in which aspects of higher level of abstraction on the
MM content are taken into account.</li>
  <li><span style="font-weight: bold;">Producibility</span>: the
production of metadata can either be
automatic, which is a very desirable property from the economic point of
view, and regards more frequently the low level technical metadata; for
semantic metadata describing the information covered by MM
content it is typically required human knowledge. So in these cases the
metadata production is manually performed.</li>
  <li><span style="font-weight: bold; font-style: italic;">Dependencies</span>:
metadata can be domain-dependent, for instance the
position of a tumour can be interesting for medical applications, while
the colour distribution of an image can be useful for many application
domains. Metadata can also be media type-dependent considering for
instance the colour distribution as applicable only to visual media
while the creation date applicable to any media.</li>
</ol>
Metadata represents surely a gain in terms of benefits produced for MM
data descriptions but most of all for MM applications (content
analysis).<br />
There are also disadvantages related to metadata. Some of them are its
cost, its unreliability, its subjectivity, its lack of authentication
and its lack of interoperability with respect to syntax, semantics,
vocabularies and languages. However, there are many researchers
currently investigating strategies to overcome different aspects of
these limitations in an effort to provide more efficient means of
organizing content on the Internet.<br />
The main reference topics related to techniques and projects developed
for MM content analysis can be briefly summarised as follows
<a href="./../References/References.html#ref3">[3]</a>:<br />
<ul style="margin-left: 40px;">
  <li> Automatic document Indexing/Classification: there is a variety
of techniques to classify documents in subject categories. These
techniques include: Bayesian analysis of the patterns of words in the
document, clustering of sets of documents according to similarity
measures, neural networks, sophisticated linguistic inferences, the use
of pre-existing sets of categories and seeding categories with
keywords. The most common methods used by auto-categorization software
are based on scanning every word in a document and analysing the
frequencies of patterns of words and, based on a comparison with an
existing taxonomy, assigning the document to a particular category in
the taxonomy. Other approaches use &#8220;clustering&#8221; and &#8220;taxonomy building&#8221;
techniques searching through all combinations of words to find cluster
of documents that appear to be together. Some systems are capable of
automatically generating a summary of a document by scanning through
the document and finding important sentences using rules like: the
first sentence of the first paragraph is often important.</li>
  <li> New researches are focusing on Semantics-Sensitive Matching <a
 href="./../References/References.html#ref1">[1]</a>
and Automatic Linguistic Indexing in which the system is capable of
recognizing real-world objects or concepts <a
 href="./../References/References.html#ref2">[2]</a>. Image retrieval
research has moved on from the IBM QBIC (query by image content) system
(QBIC, 2001) which uses colours, textures, and shapes to search for
images <a href="./../References/References.html#ref7">[7]</a>.
In particular the IBM CUEVideo project <a
 href="./../References/References.html#ref50">[50]</a></li>
  <li> Speech recognition is increasingly being applied to the indexing
and retrieval of digitised speech archives. Speech recognition systems
can generate searchable text that is indexed to time code on the
recorded media, so users can both call up text and jump right to the
audio clip containing the keyword. Normally, running a speech
recogniser on audio recordings does not produce a highly accurate
transcript because speech-recognition systems have difficulty if they
haven't been trained for a particular speaker or if the speech is
continuous. However the latest speech recognition systems will work
even in noisy environments, are speaker-independent, work on continuous
speech and are able to separate two speakers talking at once.</li>
  <li> Video Indexing and retrieval: the latest video indexing systems
combine a number of indexing methods, embedded textual data, scene
change detection, visual clues and continuous-speech recognition to
convert spoken words into text. Some systems <a
 href="./../References/References.html#ref51">[51]</a> can
automatically
analyse videos and extract named entities from transcripts which can be
used to produce time and location metadata. This metadata can then be
used to explore archives dynamically using temporal and spatial
graphical user interfaces.</li>
  <li> The Annotation systems also represent an useful tool for
metadata extraction. The motivation behind annotation systems is
related to the problem of metadata trust and authentication. Users can
attach their own metadata, opinions, comments, ratings and
recommendations to particular resources or documents on the Web, which
can be read and shared with others. The basic philosophy is that we
give more probably value and trust to the opinions of people we respect
than metadata of unknown origin. The W3C's Annotea system <a
 href="./../References/References.html#ref54">[54]</a> and
DARPA's Web Annotation Service <a
 href="./../References/References.html#ref49">[49]</a> are two
web-based annotation
systems which have been developed. Other annotation tools for
film/video and MM content (IBM VideoAnnEx, 2001) <a
 href="./../References/References.html#ref52">[52]</a>, (Ricoh
MovieTool, 2002) <a href="./../References/References.html#ref53">[53]</a>,
(DSTC's FilmEd, 2003) <a href="./../References/References.html#ref56">[56]</a>
and tools to enable
the attachment of spoken annotations to digital resources (PAXit, 2003)
    <a href="./../References/References.html#ref55">[55]</a> such as
images or photographs have been developed.</li>
  <li> Metadata for Preservation : a lot of initiatives are focusing on
metadata pursuing the goal of the MM resource preservation.
Such initiatives include: Reference Model for an Open Archival
Information System (OAIS, 2002) <a
 href="./../References/References.html#ref57">[57]</a>, the CURL
Exemplars in Digital
Archives project (CEDARS, 2002) <a
 href="./../References/References.html#ref58">[58]</a>, the National
Library of Australia
(NLA) PANDORA project (PANDORA, 2002) <a
 href="./../References/References.html#ref59">[59]</a>. These
initiatives rely on
the preservation of both digital objects and associated metadata for an
easy interpretation in the future. The preservation metadata provides
sufficient technical information about the resources and can facilitate
the long-term access of the digital resources by providing a complete
description of the technical environment needed to view the work, the
applications and version numbers needed, decompression schemes as well
as any other files that need to be linked to it.</li>
</ul>
A large number of metadata standardisation initiatives has been
developed in recent years, in order to describe MM contents in
so many different domains and to grant sharing, exchanging and
interoperability across wide range networks.<br />
We can distinguish between two different standard typology, according
to what each of them represents in terms of its functionalities.<br />
The first typology is directly related to the representation of
MM content for a specific domain and each of these standards
can be referred as a standardised description scheme, while the second
one considers the possibility of integrating more metadata standards
mapped on different application domains, providing rich metadata models
for media descriptions together with languages allowing one to define
other description schemes for arbitrary domains.<br />
The table below represents a selection of several metadata standards
description schemes, which can be considered the most frequently cited
and representative for a quite wide range of different application
domains. So, in the following table is illustrated a list of descriptive
characteristics of each reference standard taken in account. In
particular, information about standardisation bodies, last version
dates, described MM data types, application domains, description
semantic levels and the way by which metadata has been produced
(manually or automatically) have been considered.<br />
<br />

<table summary="Selection of several standardised description schemes" style="border: 1px solid rgb(0, 255, 255); text-align: left; font-size: 90%; font-family: Arial Narrow, Arial, Helvetica; margin-left: auto; margin-right: auto;"
border="1" cellspacing="0">
<tbody>
<tr>
<td style="text-align: center; vertical-align: middle;"> </td>
<td style="text-align: center; vertical-align: middle;"><strong><a href="MARC/index.xml">MARC</a></strong></td>
<td style="text-align: center; vertical-align: middle;"><strong><a href="Dublin_Core/index.xml">Dublin Core</a></strong></td>
<td style="text-align: center; vertical-align: middle;"><strong><a href="CDWA/index.xml">CDWA</a></strong></td>
<td style="text-align: center; vertical-align: middle;"><strong><a href="VRA_Core/index.xml">VRA Core</a></strong></td>
<td style="text-align: center; vertical-align: middle;"><strong><a href="CSDGM/index.xml">CSDGM</a></strong></td>
<td style="text-align: center; vertical-align: middle;"><strong><a href="NISO_Z39.87/index.xml">Z39.87</a></strong></td>
<td style="text-align: center; vertical-align: middle;"><strong><a href="LOM/index.xml">LOM</a></strong></td>
<td style="text-align: center; vertical-align: middle;"><strong><a href="DIG35/index.xml">DIG35</a></strong></td>
<td style="text-align: center; vertical-align: middle;"><strong><a href="METS/index.xml">METS</a></strong></td>
<td style="text-align: center; vertical-align: middle;"><strong><a href="JPX/index.xml">JPX</a></strong></td>
<td style="text-align: center; vertical-align: middle;"><strong><a href="SMPTE/index.xml">SMPTE Metadata Dictionary</a></strong></td>
</tr>
<tr>
<td style="text-align: center; vertical-align: middle;"><strong>Standardization
Body</strong></td>
<td style="text-align: center; vertical-align: middle;">Library of Congress</td>
<td style="text-align: center; vertical-align: middle;">Dublin Core Metadata Initiative (DCMI)</td>
<td style="text-align: center; vertical-align: middle;">Art Information Task Force (AITF)</td>
<td style="text-align: center; vertical-align: middle;">Visual Resource Association</td>
<td style="text-align: center; vertical-align: middle;">Federal Geographi Data Committee (FDDC)</td>
<td style="text-align: center; vertical-align: middle;">National Information Standard Organization (NISO)</td>
<td style="text-align: center; vertical-align: middle;">IEE (LTSC)</td>
<td style="text-align: center; vertical-align: middle;">Digital Imaging Group (DIG of I3A)</td>
<td style="text-align: center; vertical-align: middle;">Digital Library Federation (DLF)</td>
<td style="text-align: center; vertical-align: middle;">Joint Photographic Experts Group (JPEG)</td>
<td style="text-align: center; vertical-align: middle;">Society of Motion Picture and Television Engineers (SMPTE)</td>
</tr>
<tr>
<td style="text-align: center; vertical-align: middle;"><strong>Year</strong></td>
<td style="text-align: center; vertical-align: middle;">Current Version MARC 21 since 1999</td>
<td style="text-align: center; vertical-align: middle;">Current Version 1.1 since 1999</td>
<td style="text-align: center; vertical-align: middle;">Current Version 2.0 since 2000</td>
<td style="text-align: center; vertical-align: middle;">Current Version 3.0 since 2002</td>
<td style="text-align: center; vertical-align: middle;">1998</td>
<td style="text-align: center; vertical-align: middle;">2002</td>
<td style="text-align: center; vertical-align: middle;">2002</td>
<td style="text-align: center; vertical-align: middle;">Current Version 1.1 April 2001</td>
<td style="text-align: center; vertical-align: middle;">Last Review 2001</td>
<td style="text-align: center; vertical-align: middle;">2000</td>
<td style="text-align: center; vertical-align: middle;">Last Review 2004</td>
</tr>
<tr>
<td style="text-align: center; vertical-align: middle;"><strong>MM Type</strong></td>
<td style="text-align: center; vertical-align: middle;">Any</td>
<td style="text-align: center; vertical-align: middle;">Any</td>
<td style="text-align: center; vertical-align: middle;">Any</td>
<td style="text-align: center; vertical-align: middle;">Images</td>
<td style="text-align: center; vertical-align: middle;">Any</td>
<td style="text-align: center; vertical-align: middle;">Images</td>
<td style="text-align: center; vertical-align: middle;">Any</td>
<td style="text-align: center; vertical-align: middle;">Images</td>
<td style="text-align: center; vertical-align: middle;">Any</td>
<td style="text-align: center; vertical-align: middle;">Images</td>
<td style="text-align: center; vertical-align: middle;">Any</td>
</tr>
<tr>
<td style="text-align: center; vertical-align: middle;"><strong>Domain</strong></td>
<td style="text-align: center; vertical-align: middle;background-color: #0000ff;">Bibliographic media description</td>
<td style="text-align: center; vertical-align: middle;background-color: #0000ff;">Bibliographic media description</td>
<td style="text-align: center; vertical-align: middle;background-color: #00aa00;">Description of Art works</td>
<td style="text-align: center; vertical-align: middle;background-color: #00ff00;">Description of images of Art works</td>
<td style="text-align: center; vertical-align: middle;background-color: #00ffff;">Description of Geographic media</td>
<td style="text-align: center; vertical-align: middle;background-color: #f000ff;">Description of still images</td>
<td style="text-align: center; vertical-align: middle;background-color: #0aa0aa;">Description of educational media</td>
<td style="text-align: center; vertical-align: middle;background-color: #dd0000;">Description of digital images</td>
<td style="text-align: center; vertical-align: middle;background-color: #ff0000;">Description of digital objects</td>
<td style="text-align: center; vertical-align: middle;background-color: #dd0000;">Description of digital images</td>
<td style="text-align: center; vertical-align: middle;background-color: #fff000;">Description of audio/video objects</td>
</tr>
<tr>
<td style="text-align: center; vertical-align: middle;"><strong>Level</strong></td>
<td style="text-align: center; vertical-align: middle;">Largely semantic</td>
<td style="text-align: center; vertical-align: middle;">Largely semantic</td>
<td style="text-align: center; vertical-align: middle;">Largely semantic</td>
<td style="text-align: center; vertical-align: middle;">Largely semantic</td>
<td style="text-align: center; vertical-align: middle;">Semantic and technical</td>
<td style="text-align: center; vertical-align: middle;">Technical</td>
<td style="text-align: center; vertical-align: middle;">Largely semantic</td>
<td style="text-align: center; vertical-align: middle;">Semantic and technical</td>
<td style="text-align: center; vertical-align: middle;">Semantic and technical</td>
<td style="text-align: center; vertical-align: middle;">Semantic and technical</td>
<td style="text-align: center; vertical-align: middle;">Semantic and technical</td>
</tr>
<tr>
<td style="text-align: center; vertical-align: middle;"><strong>Producibility</strong></td>
<td style="text-align: center; vertical-align: middle;">Mainly manual</td>
<td style="text-align: center; vertical-align: middle;">Mainly manual</td>
<td style="text-align: center; vertical-align: middle;">Mainly manual</td>
<td style="text-align: center; vertical-align: middle;">Mainly manual</td>
<td style="text-align: center; vertical-align: middle;">Manual and Authomatic</td>
<td style="text-align: center; vertical-align: middle;">Mainly automatic</td>
<td style="text-align: center; vertical-align: middle;">Mainly manual</td>
<td style="text-align: center; vertical-align: middle;">Mainly manual</td>
<td style="text-align: center; vertical-align: middle;">Mainly manual</td>
<td style="text-align: center; vertical-align: middle;">Mainly manual</td>
<td style="text-align: center; vertical-align: middle;">Manual and Authomatic</td>
</tr>
</tbody>
</table>
<p style="text-align:center">Selection of several standardised description schemes</p>
 
</body>
</html>

