Difference between revisions of "File Formats Assessments"

From wiki.dpconline.org
Jump to navigation Jump to search
Line 29: Line 29:
=== Preservation Risk Assessments by Format Type ===
=== Preservation Risk Assessments by Format Type ===
----
----
{|
{| border="0"
!colspan="3" style="text-align: left"|IMAGE FORMATS
!colspan="3" style="text-align: left"|IMAGE FORMATS
|- style="vertical-align:top;"
|- style="vertical-align:top;"
Line 48: Line 48:
|[[Media:PDF_Assessment_v1.3.pdf | '''Portable Document Format''']]
|[[Media:PDF_Assessment_v1.3.pdf | '''Portable Document Format''']]
A file format optimised for the consistent display of text and embedded images, regardless of platform.
A file format optimised for the consistent display of text and embedded images, regardless of platform.
|[[File:WhiteBorder100.jpg|10px]]
|[[File:Icon-EPUB.png|80px|link={{filepath:EPUB_Assessment_v1.2.pdf}}]]
|[[File:Icon-EPUB.png|80px|link={{filepath:EPUB_Assessment_v1.2.pdf}}]]
|[[Media:EPUB_Assessment_v1.2.pdf | '''EPUB''']]
|[[Media:EPUB_Assessment_v1.2.pdf | '''EPUB''']]
An open standard for electronic books (eBooks) and other content types published by the International Digital Publishing Forum (IDPF).
An open standard for electronic books (eBooks) and other content types published by the International Digital Publishing Forum (IDPF).
|[[File:WhiteBorder100.jpg|10px]]
|[[File:Icon-JATS.png|80px|link={{filepath:JATS NLM Assessment v1.3.pdf}}]]
|[[File:Icon-JATS.png|80px|link={{filepath:JATS NLM Assessment v1.3.pdf}}]]
|[[Media:JATS NLM Assessment v1.3.pdf | '''Journal Article Tag Suite''']]
|[[Media:JATS NLM Assessment v1.3.pdf | '''Journal Article Tag Suite''']]
An XML-based mark-up standard for e-Journal content, based on the earlier NLM Archiving and Interchange DTD.
An XML-based mark-up standard for e-Journal content, based on the earlier NLM Archiving and Interchange DTD.
|[[File:WhiteBorder100.jpg|10px]]
|[[File:Icon-ODT.png|80px|link={{filepath:ODT Assessment-v1.pdf}}]]
|[[File:Icon-ODT.png|80px|link={{filepath:ODT Assessment-v1.pdf}}]]
|[[Media:ODT Assessment-v1.pdf | '''Open Document Text''']]
|[[Media:ODT Assessment-v1.pdf | '''Open Document Text''']]
A format for editable textual documents that is part of the ISO 26300 OpenDocument Format family that is maintained by OASIS.
A format for editable textual documents that is part of the ISO 26300 OpenDocument Format family that is maintained by OASIS.
|[[File:WhiteBorder100.jpg|10px]]
|[[File:Icon-MOBI.png|80px|link={{filepath:Mobipocket_Assessment_v1.pdf}}]]
|[[File:Icon-MOBI.png|80px|link={{filepath:Mobipocket_Assessment_v1.pdf}}]]
|[[Media:Mobipocket_Assessment_v1.pdf | '''Mobipocket Format''']]
|[[Media:Mobipocket_Assessment_v1.pdf | '''Mobipocket Format''']]
A proprietary standard for electronic book (eBook) content; used by Amazon as the basis of its AZW and KF8 formats.
A proprietary standard for electronic book (eBook) content; used by Amazon as the basis of its AZW and KF8 formats.
|[[File:WhiteBorder100.jpg|10px]]
<h5 style="color:red;text-align:center;">NEW! </h5>
<h5 style="color:red;text-align:center;">NEW! </h5>
|-
|-
Line 73: Line 78:
|'''Geography Markup Language'''
|'''Geography Markup Language'''
An XML grammar for expressing geographical features used as a modelling language for GIS and cartographic products.
An XML grammar for expressing geographical features used as a modelling language for GIS and cartographic products.
<h5 style="color:red;text-align:center;">COMING SOON</h5>
<h5 style="color:red;">COMING SOON</h5>
|[[File:WhiteBorder100.jpg|10px]]
|-
|-
|&nbsp;
|&nbsp;
Line 82: Line 88:
|[[Media: WAV_Assessment_v1.0.pdf | '''Waveform Audio File Format''']]
|[[Media: WAV_Assessment_v1.0.pdf | '''Waveform Audio File Format''']]
An audio file format standard recommended by several professional bodies and memory institutions for the long-term preservation of audio files.
An audio file format standard recommended by several professional bodies and memory institutions for the long-term preservation of audio files.
|[[File:WhiteBorder100.jpg|10px]]
|[[File:Icon-FLAC.png|80px|link={{filepath:FLAC_Assessment_v1.0.pdf}}]]
|[[File:Icon-FLAC.png|80px|link={{filepath:FLAC_Assessment_v1.0.pdf}}]]
|[[Media: FLAC_Assessment_v1.0.pdf | '''FLAC (Free Lossless Audio Codec)''']]
|[[Media: FLAC_Assessment_v1.0.pdf | '''FLAC (Free Lossless Audio Codec)''']]
A non-proprietary open source lossless audio file format.
A non-proprietary open source lossless audio file format.
|[[File:WhiteBorder100.jpg|10px]]
<h5 style="color:red;text-align:center;">NEW! </h5>
<h5 style="color:red;text-align:center;">NEW! </h5>
|[[File:Icon-MP3.png|80px|link={{filepath:MP3_Assessment_v1.0.pdf}}]]
|[[File:Icon-MP3.png|80px|link={{filepath:MP3_Assessment_v1.0.pdf}}]]
|[[Media: MP3_Assessment_v1.0.pdf | '''MP3 (MPEG Audio Layer III)''']]
|[[Media: MP3_Assessment_v1.0.pdf | '''MP3 (MPEG Audio Layer III)''']]
A widely available and supported but lossy audio file format.
A widely available and supported but lossy audio file format.
|[[File:WhiteBorder100.jpg|10px]]
<h5 style="color:red;text-align:center;">NEW! </h5>
<h5 style="color:red;text-align:center;">NEW! </h5>
|-
|-
Line 103: Line 112:
|[[Media:MusicXML_Format_Assessment_v1.pdf | '''MusicXML Format''']]
|[[Media:MusicXML_Format_Assessment_v1.pdf | '''MusicXML Format''']]
An XML-based exchange format for music notation, currently developed by the W3C Music Notation Community Group.
An XML-based exchange format for music notation, currently developed by the W3C Music Notation Community Group.
|[[File:WhiteBorder100.jpg|10px]]
<h5 style="color:red;text-align:center;">NEW! </h5>
<h5 style="color:red;text-align:center;">NEW! </h5>
|-
|-
Line 110: Line 120:
|[[Media:XML_Assessment_v1.3.pdf | '''Extensible Markup Language''']]
|[[Media:XML_Assessment_v1.3.pdf | '''Extensible Markup Language''']]
A generic markup language for the encoding of text and data; specification maintained by the World Wide Web Consortium (W3C).
A generic markup language for the encoding of text and data; specification maintained by the World Wide Web Consortium (W3C).
|[[File:WhiteBorder100.jpg|10px]]
|-
|-


=== Assessment Criteria ===
=== Assessment Criteria ===
See the '''[[File Format Assessment Factors | format assessment factors]]''' covered in each assessment.
See the '''[[File Format Assessment Factors | format assessment factors]]''' covered in each assessment.

Revision as of 15:55, 9 February 2018

FileFormatsMainGrey.jpg File formats are a means of structuring information in a sensible way for storage, retrieval and use. There are a wealth of different formats supporting a range of data types, from specific instances to container formats able to store different types of data.

As discussed in their iPRES paper “Sustainability Assessments at the British Library: Formats, Frameworks and Findings”, the Digital Preservation Team at the British Library has undertaken file format assessments to capture knowledge about the gaps in current best practice, understanding and capability in working with specific file formats. The focus of each assessment is on capturing evidence-based preservation risks and the implications of institutional obsolescence which lead to problems maintaining the content over time.

The British Library’s assessments are being made available via this DPC wiki page in order to share their findings and facilitate engagement with the broader preservation community.

Feedback is always welcome. If you have any comments or suggestions, please email: DPT at the British Library

BL Logo (Big).jpg

Collaboration

The British Library, The Library of Congress, Harvard Library, NARA and the Digital Preservation Coalition are beginning a new collaboration to coordinate and make available their file format assessments. This will grow the pool of assessments available, while avoiding duplication, increasing the quality, and minimising the effort of maintenance. As a first stage, these organisations are coordinating their next assessment work here.

Preservation Risk Assessments - Summaries


Ebooks.png eBook Summary

A broad overview of formats available within the eBook sector.

Preservation Risk Assessments by Format Type


Assessment Criteria

See the format assessment factors covered in each assessment.

IMAGE FORMATS
Icon-TIFF.png Tagged Image File Format

A widely-supported raster format for images.

WhiteBorder100.jpg Icon-JP2.png JPEG 2000

A compression standard and coding system for images, created by the Joint Photographic Experts Group.

WhiteBorder100.jpg
 
DOCUMENT FORMATS
Icon-PDF.png Portable Document Format

A file format optimised for the consistent display of text and embedded images, regardless of platform.

WhiteBorder100.jpg Icon-EPUB.png EPUB

An open standard for electronic books (eBooks) and other content types published by the International Digital Publishing Forum (IDPF).

WhiteBorder100.jpg Icon-JATS.png Journal Article Tag Suite

An XML-based mark-up standard for e-Journal content, based on the earlier NLM Archiving and Interchange DTD.

WhiteBorder100.jpg Icon-ODT.png Open Document Text

A format for editable textual documents that is part of the ISO 26300 OpenDocument Format family that is maintained by OASIS.

WhiteBorder100.jpg Icon-MOBI.png Mobipocket Format

A proprietary standard for electronic book (eBook) content; used by Amazon as the basis of its AZW and KF8 formats.

WhiteBorder100.jpg
NEW!
 
GEOSPATIAL FORMATS
Icon-NTF.png National Transfer Format

A vector format standard (BS 7567) developed in the 1980s for the transfer of geospatial information, now mostly obsolete.

WhiteBorder100.jpg Icon-GML.png Geography Markup Language

An XML grammar for expressing geographical features used as a modelling language for GIS and cartographic products.

COMING SOON
WhiteBorder100.jpg
 
AUDIOVISUAL FORMATS
Icon-WAV.png Waveform Audio File Format

An audio file format standard recommended by several professional bodies and memory institutions for the long-term preservation of audio files.

WhiteBorder100.jpg Icon-FLAC.png FLAC (Free Lossless Audio Codec)

A non-proprietary open source lossless audio file format.

WhiteBorder100.jpg
NEW!
Icon-MP3.png MP3 (MPEG Audio Layer III)

A widely available and supported but lossy audio file format.

WhiteBorder100.jpg
NEW!
 
DIGITAL SHEET MUSIC FORMATS
Icon-SIB.png Sibelius Format

A proprietary format for music notation designed to be used with Avid Software's Sibelius composing and music editing software.

NEW!
WhiteBorder100.jpg Icon-MusicXML.png MusicXML Format

An XML-based exchange format for music notation, currently developed by the W3C Music Notation Community Group.

WhiteBorder100.jpg
NEW!
GENERIC FORMATS
Icon-XML.png Extensible Markup Language

A generic markup language for the encoding of text and data; specification maintained by the World Wide Web Consortium (W3C).

WhiteBorder100.jpg