About file formats

To properly archive and give access to a file in DSpace@Cambridge, we need to know what format it is, for example "PDF", "HTML", or "OpenDocument format". In order to facilitate preservation of the deposited content stored in DSpace@Cambridge we need to ascertain that the content are stored in file formats that are either open or well known and described. Below you will find a list of formats all recommended by the DSpace@Cambridge team. The list is not exhaustive, please contact us  if you have content you wish to deposit in other formats.

Name
MIME type Extensions
Description
       
PDF application/pdf
pdf
Portable document format
ODF
application/vnd.oasis.opendocument.text
application/vnd.oasis.opendocument.spreadsheet
application/vnd.oasis.opendocument.presentation
odt, ods, odp
Open document format
HTML
text/html
htm, html HyperText Markup Language
XML
application/xml
xml
Extensible Markup Language
Text text/plain
txt, asc
Plain text
WAV audio/wav
audio/wave
audio/x-wav
wav Waveform audio format, lossless audio
AIFF
audio/x-aiff
audio/aiff
aiff, aif, aifc
Audio Interchange File Format, lossless audio
VORBIS
audio/ogg
ogg, oga
Lossy audio
DNG
  dng
Digital negative
TIFF
image/tiff, image/tiff-fx
tiff, tif
Tagged Image File Format
PNG
image/png png
Portable Network Graphics
JPEG
image/jpeg
.jpeg, .jpg, .jpe jfif, .jfi, .jif
JPEG Interchange Format
MPEG video/mpeg mpeg, mpg, mpe
 
       
For videos we strongly recommend the H.264 codec with an MP4 container. Other formats will have to be discussed with the DSpace@Cambridge team before ingest.