 |
|
|
Preservation Eprints Services - Enabling long-term open access to materials in institutional repositories (IRs) |
|
See also Papers and presentations produced by the project
Preserv bibliography on digital preservation
Authoritative papers on key developments and practical applications for
digital preservation, viewed through the prism of the Preserv project,
which is concerned with preservation services for content from
institutional repositories. Chronological by section.
Sections Preservation
and institutional repositories | File formats (including Significant properties and representation) | OAIS (including Archive Ingest and Handling Test
(AIHT)) | Preservation
metadata | Packaging metadata (inc. METS)
|
OAI and preservation
harvesting | Trusted
digital repositories | Business
models, costs,
lifecycle, workflow | Users
| Tools
(inc. migration, emulation, storage management, identifiers),
including Bit-level
storage | Practical
applications/architectures
| Electronic journal
archiving | Legal issues
| General digital
preservation
Last updated 1 December 2008,
by Steve Hitchcock
All of these papers should be freely accessible, i.e. open access, from
the links provided.
Send additions, updates, corrections, comments to sh94r@ecs.soton.ac.uk
Latest additions
Caplan, Priscilla, Repository to Repository Transfer of Enriched
Archival Information Packages,
D-Lib
Magazine, Volume 14 Number 11/12, November/December 2008
http://www.dlib.org/dlib/november08/caplan/11caplan.html
On a project just launched, Towards
Interoperable Preservation Repositories (TIPR), a collaboration of The
Florida Center for Library Automation], the Cornell University Library
and the New York University Libraries, to develop a proof of concept
for the exchange of information between digital preservation
repositories.
Aschenbrenner, Andreas, Tobias Blanke, Mark Hedges, David
Flanders and Ben O'Steen, The Future of Repositories? Patterns for
(Cross-)Repository Architectures,
D-Lib
Magazine, Volume 14 Number 11/12, November/December 2008
http://www.dlib.org/dlib/november08/aschenbrenner/11aschenbrenner.html
Not strictly on preservation, but
among its broader repository architectures and services themes the
paper neatly weaves in some of the themes Preserv is exploring. A
fascinating view that demands careful reading. At times the paper gets
a little ahead of itself in suggesting that certain capabilities have
been realised, but even in its anticipation it is leading in the right
direction.
Rumsey, Sally and Ben O’Steen, OAI-ORE, PRESERV2 and Digital
Preservation,
Ariadne, Issue
57, 30-October-2008
http://www.ariadne.ac.uk/issue57/rumsey-osteen/
Beagrie, Neil, Najla Semple, Peter Williams, and Richard Wright,
Digital Preservation Policies Study, JISC, 30 October 2008
http://www.jisc.ac.uk/Home/publications/publications/jiscpolicyfinalreport.aspx
This study is well written, helpfully
structured, resourceful, and innovative. The main resource provided is
in the appendices, where existing, broader institutional strategy
documents concerning research, learning and teaching, etc., are
analysed and extended by the authors of this study with ways in which
DP policy can support and enhance these strategies.
Salo blog comment: "I can’t
recommend this report from JISC highly enough. It lays out
what needs to appear in an institutional digital preservation policy,
how to pitch it as part of the institutional mandate, and what examples
are worth following. ... excellent." (4 November 2008)
http://cavlec.yarinareth.net/2008/11/04/digital-preservation-policy-how-to/
Haber,
Stuart, Kamat, Pandurang and Kamineni, Kiran, A content integrity
service for digital repositories, HP Laboratories, HPL-2008-177,
October 21, 2008
http://www.hpl.hp.com/techreports/2008/HPL-2008-177.html?mtxs=rss-hpl-tr
Durrant, Sarah,
Long-term
Preservation: Results from a survey investigating preservation
strategies amongst ALPSP publisher members (pdf 16pp), ALPSP, 2008
(announced October)
http://www.alpsp.org/ForceDownload.asp?id=882
Another bland
survey hewn from the
same rock as the Portico/Ithaka
Survey of U.S. Library
Directors
Beagrie blog comment: "interesting reading and overall is a very
worthwhile report" (13 October 2008)
http://blog.beagrie.com/archives/2008/10/13/alpsp-survey-long-term-preservation-strategies-for-e-journals/
Notable iPres papers
Abrams, Stephen, Sheila Morrissey, Tom Cramer,
Next-Generation JHOVE2 Architecture for Format-Aware
Characterization,
iPRES
2008 Fifth International Conference on Preservation of
Digital Objects, London, 29-30th September 2008
in
http://www.bl.uk/ipres2008/ipres2008-proceedings.pdf
also at
http://confluence.ucop.edu/pages/viewpage.action?pageId=3932229
Fischer, Randall, Carol Chou, Franco Lazzarino, Updating DAITSS -
Transitioning to a web service architecture,
iPRES
2008 Fifth International Conference on Preservation of
Digital Objects, London, 29-30th September 2008
in
http://www.bl.uk/ipres2008/ipres2008-proceedings.pdf
Hitchcock,
Steve, David Tarrant, Adrian Brown, Ben O’Steen, Neil Jefferies and
Leslie Carr, Towards smart storage for repository preservation
services,
iPRES
2008 Fifth International Conference on Preservation of
Digital Objects, London, 29-30th September 2008
also in ECS EPrints, Southampton University, 13 Oct 2008
http://eprints.ecs.soton.ac.uk/16785/
Klas, Claus-Peter, Holger Brocks, Lars Muller, and Matthias Hemmje,
Embedding Legacy Environments into A Grid-Based Preservation
Infrastructure,
iPRES
2008 Fifth International Conference on Preservation of
Digital Objects, London, 29-30th September 2008
in
http://www.bl.uk/ipres2008/ipres2008-proceedings.pdf
Kosovic, Douglas, and Jane Hunter, Implementing Preservation Services
over the Storage
Resource Broker,
iPRES
2008 Fifth International Conference on Preservation of
Digital Objects, London, 29-30th September 2008
in
http://www.bl.uk/ipres2008/ipres2008-proceedings.pdf
Ramalho, Jose Carlos, Miguel Ferreira, Luis Faria, Rui Castro,
Francisco
Barbedo, and Luis Corujo, RODA and Crib: A Service-Oriented Digital
Repository,
iPRES
2008 Fifth International Conference on Preservation of
Digital Objects, London, 29-30th September 2008
in
http://www.bl.uk/ipres2008/ipres2008-proceedings.pdf
"One of the highlights for me ...
describe(s) the preservation solution developed as part of the RODA and
CRiB projects: a service oriented preservation approach to dealing with
repository information using the concept of significant properties …
ahhh, music to the ears! All up and running and being used by the
Portuguese National Archives." Neil Grindley, JISC Information
Environment Team blog, 2 October 2008
http://infteam.jiscinvolve.org/2008/10/02/ipres-2008/
Rusbridge, A. and Ross, S., Establishing a community-based approach to
electronic journal archiving: the UK LOCKSS Pilot Programme,
iPRES
2008 Fifth International Conference on Preservation of
Digital Objects, London, 29-30th September 2008
in
http://www.bl.uk/ipres2008/ipres2008-proceedings.pdf
also in
Glasgow ePRINTS
Service, 15 October 2008
http://eprints.gla.ac.uk/4635/
</selected iPres papers>
Chute, Ryan and Herbert Van de Sompel, Introducing
djatoka: A
Reuse
Friendly, Open Source JPEG 2000 Image Server,
D-Lib Magazine, Vol. 14
No. 9/10, September/October 2008
http://www.dlib.org/dlib/september08/chute/09chute.html
Dappert, Angela, and Markus Enders, Using METS, PREMIS and MODS
for
Archiving eJournals,
D-Lib Magazine,
Vol. 14 No. 9/10,
September/October 2008
http://www.dlib.org/dlib/september08/dappert/09dappert.html
Day,
Michael, Toward Distributed Infrastructures for Digital Preservation:
The Roles of Collaboration and Trust,
International
Journal of Digital Curation, Vol. 3, No. 1 (2008), announced
August 2008
http://www.ijdc.net/ijdc/article/view/60
Moore, Reagan, Towards a Theory of Digital Preservation,
International
Journal of Digital Curation, Vol. 3, No. 1 (2008), announced
August 2008
http://www.ijdc.net/ijdc/article/view/63
Striking breadth of thinking,
converging on iRODS
Pearson, David, and Colin Webb, Defining File Format Obsolescence: A
Risky Journey,
International
Journal of Digital Curation, Vol. 3, No. 1 (2008), announced
August 2008
http://www.ijdc.net/ijdc/article/view/76
About AONS II (Automatic Obsolescence
Notification System)
Frey, Jeremy, Curation of Laboratory Experimental Data as Part of the
Overall Data Lifecycle,
International
Journal of Digital Curation, Vol. 3, No. 1 (2008), announced
August 2008
http://www.ijdc.net/ijdc/article/view/62
Advances the idea that data curation
should begin in the laboratory
Preservation and institutional repositories
Sierman, Barbara, Long Term Preservation for repositories, in A DRIVER's Guide to European Repositories:
Five studies of important Digital Repository related issues and good
practices, Kasja Weenink, Leo Waaijers and Karen van
Godtsenhoven (eds), Amsterdam University Press/Surf/EU-Driver, 16
January 2008, 153-184
Agnew, Grace, and Yang Yu, The Rutgers Workflow Management System:
Migrating a Digital Object Management Utility to Open Source,
The Code4Lib Journal, issue 1,
2007-12-17
http://journal.code4lib.org/articles/25
Describes a workflow-ingest tool for
a Fedora repository
Brody, Tim, Leslie Carr, Jessie M.N. Hey, Adrian Brown, Steve
Hitchcock, PRONOM-ROAR: Adding Format Profiles to a Repository Registry
to Inform Preservation Services,
International
Journal of Digital Curation, Vol. 2, No. 2, December 2007
Carr, Les, EPrints
and the Sun Storagetek 5800 System, A Persistent, Scalable and
Interoperable Solution (pdf 9pp), Sun Microsystems White Paper,
November 2007
http://www.sun.com/storagetek/disk_systems/enterprise/5800/SunEPrintsWP.pdf
Sketches some initial architectures
and considers the practical
implementation of this Sun storage system with an EPrints repository.
"At the most superficial level, simply having easy access to that
quantity of storage can revolutionise the use to which repositories can
be put"
Grant,
Carl, Delivering
Digital Repositories with Open Solutions (pdf 21pp), Sun Microsystems
White Paper, Version 8.0, November 2007
http://www.sun.com/storagetek/disk_systems/enterprise/5800/OpenSolutions_LR.pdf
Treloar, Andrew, David Groenewegen and Cathrine Harboe-Ree
, The Data Curation Continuum:
Managing Data Objects in Institutional Repositories,
D-Lib Magazine, Vol. 13, No. 9/10,
September/October 2007
http://www.dlib.org/dlib/september07/treloar/09treloar.html
"I cannot express how blown away I am with
the strikingly intelligent pragmatism (the authors frm Monash
University) display" Patel, Manjula and Simon Coles, A study of Curation and
Preservation Issues in the eCrystals Data Repository and Proposed
Federation, JISC eBank-UK Project, Final Version (Revised), 7th
September 2007
http://www.ukoln.ac.uk/projects/ebank-uk/curation/eBank3-WP4-Report%20(Revised).pdf
An impressive report that takes a
proactive and rigorous, enquiring approach to identify the key issues.
What's important is that it engages the repository view, and real
scenarios, with the preservation view.
Smith, MacKenzie, and Reagan W. Moore, Digital Archive Policies
and
Trusted Digital Repositories,
International
Journal of Digital Curation, Vol. 2, No. 1, June 2007
http://www.ijdc.net/ijdc/article/view/27
This might have been included in the
Trusted Digital Repositories section of this bibliography, but is
included here because if details (in a quite technical way) important
aspects of the interface between a repository (here called a 'digital
archive') and a preservation service (here referred to as a
preservation system)
Knight, Gareth, and Mark Hedges, Modelling OAIS Compliance for
Disaggregated Preservation Services,
International
Journal of Digital Curation, Vol. 2, No. 1, June 2007
http://www.ijdc.net/ijdc/article/view/25
Bradley, Kevin, Junran Lei and Chris Blackall, Towards an Open
Source Archival Repository and Preservation System, UNESCO Memory of
the World Programme and Australian Partnership for Sustainable
Repositories (APSR), June 2007
Hitchcock, Steve, Tim Brody, Jessie M.N. Hey and Leslie Carr, Digital
Preservation Service Provider Models for Institutional Repositories:
Towards Distributed Services,
D-Lib Magazine, Vol. 13, No. 5/6,
May/June 2007
http://www.dlib.org/dlib/may07/hitchcock/05hitchcock.html
a fine article on preservation
vis-à-vis institutional repositories."
Knight, Gareth, Recommendations to ensure the long-term
preservation of digital objects stored by institutional repositories
(pdf 11pp) Sherpa-DP project report, 28 February 2007
http://www.sherpadp.org.uk/documents/wp65-migration_review.pdf
Hockx-Yu,
Helen, Digital preservation in the context of
institutional repositories, E-LIS, 06 October 2006, also in
Program: Electronic Library
and Information Systems, Vol. 40, No. 3, 2006, 232-243
http://eprints.rclis.org/archive/00007351/
Curtis, Joseph, AONS System Documentation, Australian
Partnership for Sustainable Repositories, The
Australian National University, Revision 169 2006-09-29, September 2006
http://www.apsr.edu.au/publications/aons_report.pdf
"AONS (Automated Obsolescence
Notification System) is a system to analyse digital repositories and
determine whether any digital objects contained therein may be in
danger of becoming obsolescent. It uses perservation information about
file formats and the software which supports these formats to determine
if the formats used by digital objects are in danger."
Wilczek, Eliot and Kevin Glick, Fedora
and the Preservation of University Records Project: Reports and
Findings, Tufts University and Yale University, Final Narrative Report
to National Historical Publications and Records Commission, September
27, 2006
http://dca.tufts.edu/features/nhprc/reports/index.html
see also
Glick, Kevin and Eliot Wilczek, Fedora and the Preservation of
University Records Project, RLG
DigiNews, Volume 10, Number 5, 15 October 2006
http://www.rlg.org/en/page.php?Page_ID=20987#article0
Strictly, this report is about preserving university records rather
than about
IRs and research papers, but Fedora has recently been adapted to act as
an IR and much of this report has relevance to its preservation
capabilities in that context. The RLG DigiNews version can be read as a
summary of the full report above.
Ferreira, Miguel, Ana Alice
Baptista and José Carlos
Ramalho, A Foundation for Automatic Digital Preservation, Ariadne, Issue 48, 30-July-2006
http://www.ariadne.ac.uk/issue48/ferreira-et-al/
Describes a preservation service
architecture that could in principle use a combination of service
providers and Web service agents. There is no evaluation here to show
how effective this approach is.
see also Lee et al., APSR
PREMIS Requirement Statement Project Report, July 2006, in Preservation
Metadata section below
Goodyear, Marilu, and Richard Fyffe,
Institutional
Repositories: An Opportunity for CIO Campus Impact, EDUCAUSE
Review, Vol. 41, No. 2, March/April 2006, 10–11
http://www.educause.edu/apps/er/erm06/erm0626.asp
"On most campuses, repository
programs already involve librarians,
archivists, records managers, and institutional administrators. It is
important that the chief information officer (CIO) see (and seize) the
opportunity presented by a
new IR program to demonstrate how the security and control provided by
central infrastructure (secure hardware, campus networks, data centers,
etc.) contribute to the long-term preservation of access to and
usability of these important campus assets."
Hitchcock, Steve, Tim Brody, Jessie M.N. Hey and
Leslie Carr, Preservation for Institutional Repositories: practical and
invisible. Ensuring Long-term Preservation and Adding Value to
Scientific and Technical data (PV 2005), Edinburgh, UK
http://preserv.eprints.org/papers/PV05/pv05-preserv-hitchcock.doc
Knight, Gareth, An OAIS compliant model for Disaggregated services,
SHERPA-DP Report, version 1.1, 5/09/2005
http://ahds.ac.uk/about/projects/sherpa-dp/sherpa-dp-oais-report.pdf
An early model-based, rather than
evidence- or experience-based analysis. The Introduction and Figs 5
(detailed view of a disaggregated OAIS-compliant model) and 6
(workflow) provide a reasonable starting point for the relationship
between IRs and preservation service providers, but experience will
bring more complexity and more clarity
Prudlo, Marion, E-Archiving: An Overview of Some Repository
Management Software Tools,
Ariadne,
Issue 43, 30-April-2005
Aschenbrenner, Andreas and Max Kaiser, White Paper on Digital
Repositories, reUSE project, March 2005 (pdf 90pp)
http://www2.uibk.ac.at/reuse/docs/reuse-d11_whitepaper_10.pdf
Bradley, Kevin, APSR Sustainability Issues Discussion Paper, Australian
Partnership for Sustainable Repositories - National Library of
Australia, 28 January 2005
http://www.apsr.edu.au/documents/APSR_Sustainability_Issues_Paper.pdf
Broad ranging agenda, more the first
word (by APSR) than the last word
Wheatley, Paul, Institutional Repositories in the Context of Digital
Preservation (pdf
19pp), Digital Preservation Coalition, Technology Watch Series
Report 04-02,
March 2004
http://www.dpconline.org/docs/DPCTWf4word.pdf
see erpaAssessment of this paper
http://www.erpanet.org/assessments/show.php?id=1096369103&t=1
James, Hamish; Ruusalepp, Raivo; Anderson, Sheila; Pinfield, Stephen,
Feasibility and Requirements Study on Preservation of E-Prints (pdf
69pp)
, JISC, October 29, 2003
http://www.jisc.ac.uk/uploaded_documents/e-prints_report_final.pdf
Major sections on file formats,
preservation metadata and cost models, and describes a core set of
non-functional requirements that would make IRs 'trusted' based on
RLG-OCLC criteria and OAIS, and recommends that preservation tasks
might be
outsourced to specialist support services
see also
Pinfield, Stephen; James, Hamish, The Digital Preservation of e-Prints,
D-Lib Magazine, Vol. 9 No. 9,
September, 2003
http://www.dlib.org/dlib/september03/pinfield/09pinfield.html
Counters the Harnad view, that
preservation should not be a priority for eprints, with its own
rhetoric that raises questions about preservation but doesn't get to
grips or answer any of its own questions
Müller, Eva, Uwe Klosa, Peter Hansson, Stefan Andersson, Archiving
Workflow between a Local Repository and the National Archive:
Experiences from the DiVA Project, ECDL conference, Trondheim, August
2003
http://epc.ub.uu.se/files/archiving_ECDL_2003.pdf
(11pp)
Wheatley, Paul, A way forward for developments in the digital
preservation functions of DSpace: options, issues and recommendations,
25th July 2003
http://dspace.org/news/articles/DpAndDSpace.pdf
Not about DSpace but a discussion
document about preservation features that could be considered for
inclusion at that time
Lynch, Clifford A., Institutional repositories: essential
infrastructure for scholarship in the digital age. ARL Bimonthly Report, No. 226,
February 2003
http://www.arl.org/resources/pubs/br/br226/br226ir.shtml
Not about preservation, but it's a
common refrain in this treatise about the institution's commitment to
stewardship, including preservation, that an IR implies. Mentions idea
of "federating" institutional repositories, e.g. swaps of storage
between institutional repositories to gain geographic and systems
diversity in pursuit of backup, preservation, and disaster recovery
Tansley, R., Bass, M. and Smith, M., DSpace as an Open Archival
Information System: Current Status and Future Directions, Proceedings of Research and Advanced
Technology for Digital Libraries: 7th European
Conference, ECDL 2003, Trondheim, Norway, August 2003, LNCS
2769 (Springer Verlag), pp. 446-460
or see ppt slides version
Bass, Michael J., David Stuve, R. Tansley, Margret Branschofsky, Peter
Breton, P. Carmichael, Bill Cattey, Dan Chudnov and J. Ng, DSpace
Internal Reference Specification, Technology & Architecture, MIT,
Hewlett-Packard, 2002-03-01
http://libraries.mit.edu/dspace-mit/technology/architecture.pdf
Day, Michael, E-print Services and Long-term Access to the Record of
Scholarly and Scientific Research, Ariadne,
Issue 28, 22-June-2001
http://www.ariadne.ac.uk/issue28/metadata/intro.html
see erpaAssessment of this article
http://www.erpanet.org/assessments/show.php?id=1037194679&t=1
see also
erpaAssessments, Project Abstract: DSpace, 17.01.2003
http://www.erpanet.org/assessments/show.php?id=1045041083&t=3
File formats
Buonora, Paolo, and Franco Liberati, A Format
for Digital Preservation of Images: A Study on JPEG 2000 File
Robustness,
D-Lib Magazine,
Vol. 14 No. 7/8, July/August 2008
http://www.dlib.org/dlib/july08/buonora/07buonora.html
Ribera Turró, Mireia, Is the PDF format accessible? E-LIS, 08
May 2008
http://eprints.rclis.org/archive/00013305/
Fanning, Betsy A., Preserving the Data Explosion: Using PDF,
Digital Preservation
Coalition, Technology Watch Report 08-02, April 2008 (pdf 27pp)
http://www.dpconline.org/docs/reports/dpctw08-02.pdf
"Using PDF/A as a standard will help
information officers ensure that key business data survives. But it
should never be viewed as the Holy Grail. It is merely a tool in the
armoury of a well thought out records management policy.“ Adrian Brown,
DPC press release
"I wish a little more
attention had been paid to the writing, which is
messy in places, but there's a lot of useful information there." Gary
McGath, File Formats Blog, April 25, 2008,
http://fileformats.blogspot.com/2008/04/preserving-data-explosion.html
Rog,
Judith and Caroline van Wijk, Evaluating File Formats for
Long-term Preservation (pdf 11pp), Koninklijke Bibliotheek, February
2008
http://www.kb.nl/hrd/dd/dd_links_en_publicaties/publicaties/KB_file_format_evaluation_method_27022008.pdf
Buckley, Robert, JPEG 2000 - a Practical Digital Preservation
Standard?
Digital Preservation
Coalition, Technology Watch Report 08-01, February 2008 (pdf 28pp)
http://www.dpconline.org/docs/reports/dpctw08-01.pdf
Abrams, Stephen,
File
Formats, instalment of DCC, Digital
Curation Manual, Version 1.0, October 2007
http://www.dcc.ac.uk/resource/curation-manual/chapters/file-formats/
Ditch, Walter, XML-based
Office Document Standards, version 1.0, JISC Techwatch report, August
2007
http://www.jisc.ac.uk/media/documents/techwatch/tsw0702pdf.pdf
Vitale, Tim, Digital Image File Formats – TIFF, JPEG, JPEG2000, RAW and
DNG (pdf 45pp), Version 20, July 2007
http://aic.stanford.edu/sg/emg/library/pdf/vitale/2007-07-vitale-digital_image_file_formats.pdf
Sefton, Peter,
An
integrated approach to
preparing, publishing, presenting and preserving theses (pdf 24pp)
, ETD 2007 added
values to e-theses: 10th International Symposium on Electronic Theses
and Dissertations, Uppsala, Sweden, 13-16 June 2007
http://epc.ub.uu.se/ETD2007/files/papers/paper-32.pdf
Describes the Integrated Content
Environment for research and scholarship (ICE-RS).
Rauch,
Carl, Harald Krottmaier and Klaus Tochtermann, File-Formats for
Preservation: Evaluating the Long-Term Stability of
File-Formats,
ELPUB2007, Proceedings
of the 11th International Conference on Electronic
Publishing, Vienna, edited by Leslie
Chan and Bob Martens, 13-15 June 2007, pp. 101-106
http://elpub.scix.net/cgi-bin/works/Show?122_elpub2007
Barnes, Ian, The Digital Scholar's Workbench,
ELPUB2007, Openness in Digital Publishing:
Awareness, Discovery and
Access - Proceedings of the 11th International Conference on Electronic
Publishing, Vienna, Austria, 13-15 June 2007, edited by Leslie
Chan and Bob Martens, pp. 285-296
http://elpub.scix.net/cgi-bin/works/Show?_id=159_elpub2007
Knight, Gareth, File format typing and format registries (pdf
10pp), Sherpa-DP project report, 2 March 2007
http://www.sherpadp.org.uk/documents/wp63-formatregistries.pdf
Among others, covers PRONOM, PRONOM
Persistent Unique Identifier and DROID. "it is recommended the AHDS
Preservation Service use JHOVE to generate technical information and
consider the implementation of a format registry when they provide
functionality, such as obsolescence monitoring that is unavailable in a
repository-level application."
Peters McLellan, Evelyn, Selecting Digital File Formats for Long-Term
Preservation (pdf 26pp), InterPARES 2, General Study 11, Final Report,
December 4, 2006 (revised March 13, 2007)
http://www.interpares.org/display_file.cfm?doc=ip2_file_formats(complete).pdf
Gladney comment, Digital Document
Quarterly, Vol. 7 No. 3, 3Q2008 http://home.pacbell.net/hgladney/ddq_7_3.htm#_edn4
Barnes, Ian
, Preservation
of
TeX/LaTeX Documents (pdf 16pp), Australian Partnership for Sustainable
Repositories, Australian National University, 21 July 2006
http://www.apsr.edu.au/publications/LaTeX-preservation.pdf
Barnes, Ian, The Preservation of Word Processing Documents (pdf
19pp), Australian Partnership for Sustainable
Repositories, Australian National University, 14 July 2006
http://www.apsr.edu.au/publications/word_processing_preservation.pdf
Rog, Judith, PDF Guidelines: Recommendations for the Creation of PDF
Files for Long-term Preservation and Access. Koninklijke Bibliotheek,
version 1.4, 14 July 2006
http://www.kb.nl/hrd/dd/dd_links_en_publicaties/PDF_Guidelines.pdf
Donnelly, Martin, JSTOR/Harvard Object Validation Environment
(JHOVE), Digital Curation Centre Case Studies and Interviews, March
2006 (pdf 21pp)
http://www.dcc.ac.uk/resource/case-studies/jhove/
Brown, Adrian, The PRONOM PUID Scheme: A scheme of persistent
unique identifiers for representation information, The National
Archives, Digital Preservation Technical Paper: 2, 10 November 2005
(pdf 9pp)
http://www.nationalarchives.gov.uk/aboutapps/pronom/pdf/pronom_unique_identifier_scheme.pdf
Phelps, Thomas A. and P.B. Watry, A No-Compromises Architecture for
Digital Document Preservation,
Proceedings
of the 9th European Conference on Research and Advanced Technology for
Digital Libraries (ECDL 2005), Vienna, September 18-23, 2005
http://multivalent.sourceforge.net/Research/Live.pdf
Brown, Adrian, Automatic Format Identification Using PRONOM and DROID,
The National Archives,
Digital Preservation Technical Paper: 1, 17 September 2005 (pdf 30pp)
http://www.nationalarchives.gov.uk/aboutapps/fileformat/pdf/automatic_format_identification.pdf
Arms, Caroline R. and Carl Fleischhauer, Digital Formats: Factors
for Sustainability, Functionality, and Quality (pdf 6pp)
2005-04-29, also presented at
Second
IS&T Archiving Conference, Washington, D.C., April 2005
http://memory.loc.gov/ammem/techdocs/digform/Formats_IST05_paper.pdf
see also the authors' ongoing resource Sustainability of Digital
Formats Planning for Library of Congress Collections
http://www.digitalpreservation.gov/formats/index.shtml
Brown, Adrian, Automating Preservation: New Developments in the
PRONOM Service,
RLG DigiNews,
Volume 9, Number 2, April 15
Stanescu, A., Assessing the Durability of Formats in a Digital
Preservation Environment: The INFORM Methodology, D-Lib Magazine, Volume 10 Number
11, November 2004
http://www.dlib.org/dlib/november04/stanescu/11stanescu.html
on the OCLC INFORM methodology
Christensen, Niels H., Towards
format repositories for web archives, 4th International Web Archiving Workshop
(IWAW04), August 2004
http://netarchive.dk/website/publications/FormatRepositories-2004.pdf
Clausen, Lars, Handling File Formats, May 2004
http://netarchive.dk/website/publications/FileFormats-2004.pdf
Holdsworth, David, and Paul Wheatley, Long-Term Stewardship of
Globally-Distributed Representation Information, 12th NASA Goddard/21st IEEE Conference on
Mass Storage Systems and Technologies, Adelphi, MD, April 13-16,
2004 (pdf 13pp) http://romulus.gsfc.nasa.gov/msst/conf2004/Papers/MSST2004-03-Holdsworth-a.
LeFurgy, William G., PDF/A:
Developing a File Format for Long-Term Preservation, RLG DigiNews, 7(6), 15 December 2003
http://www.rlg.org/preserv/diginews/diginews7-6.html#feature1
see comments : Current Cites, Dec 2003; Crawford, Cites & Insights,
July 2004
<http://cites.boisestate.edu/civ4i9.pdf>,
page 17
Darlington, Jeffrey, PRONOM - A Practical Online Compendium of File
Formats, RLG DigiNews, Vol. 7, No. 5, October 15, 2003
http://www.rlg.org/legacy/preserv/diginews/diginews7-5.html#feature2
Abrams, Stephen L. and Seaman, David, Towards a Global Digital Format
Registry, World Library and
Information Congress: 69th IFLA General
Conference and Council, 1-9 August, 2003, Berlin
http://www.ifla.org/IV/ifla69/papers/128e-Abrams_Seaman.pdf
Brown, Adrian, Graphics File Formats, The
National Archives, Digital Preservation Guidance Note: 4, 9 July 2003
(pdf 14pp)
http://www.nationalarchives.gov.uk/documents/graphic_file_formats.pdf
Brown, Adrian, Selecting File Formats for Long-Term Preservation, The
National Archives, Digital Preservation Guidance Note: 1, 19 June 2003
(pdf 8pp)
http://www.nationalarchives.gov.uk/documents/selecting_file_formats.pdf
Müller, Eva, Uwe Klosa, Peter Hansson, Stefan Andersson, Erik
Siira, Using
XML for Long-term Preservation, ETD
2003: 6th International Conference on Electronic Theses and
Dissertations,
Berlin, 21-24 May 2003
http://www.hu-berlin.de/ETD2003/userinfoviewpaper.php4?paper_id=39
Wheatley, Paul, Survey and assessment of sources of information
on file
formats and
software documentation (pdf 48pp), The Representation and Rendering
Project, Final
Report, JISC, 2003
http://www.jisc.ac.uk/uploaded_documents/FileFormatsreport.pdf
Phelps, Thomas A. and Robert Wilensky, The Multivalent Browser: A
Platform for New Ideas,
Proceedings of Document Engineering
2001, Atlanta, Georgia, November 2001
http://multivalent.sourceforge.net/Research/PlatformForNewIdeas.pdf
on the Multivalent Browser
Ockerbloom, John Mark, Archiving and Preserving PDF Files, RLG DigiNews, Vol. 5, No. 1,
February 15, 2001
http://www.rlg.org/preserv/diginews/diginews5-1.html#feature2
Lawrence, Gregory, et al., Risk
Management of Digital Information: A File Format Investigation.
Washington, D.C.: Council on Library and Information Resources, June
2000
http://www.clir.org/pubs/reports/pub93/contents.html
Ockerbloom, John, Mediating Among Diverse Data Formats: Thesis Summary
(pdf 14pp),
January 14, 1998
http://tom.library.upenn.edu/pubs/thesis/summary.pdf
on the Typed Object Model (TOM)
see also
Liu, Xiaoming, Lyudmila Balakireva, Patrick Hochstenbach and Herbert
Van
de Sompel, File-based storage of Digital Objects and constituent
datastreams: XMLtapes and Internet Archive ARC files, ArXiv, Computer
Science, cs.DL/0503016, 3 Jun 2005, presented at ECDL 2005
http://arxiv.org/abs/cs.DL/0503016
concatenating XML-based
representations of multiple Digital
Objects into a single file named an XMLtape
Burner, Mike, and Brewster Kahle, Arc File Format, Version 1.0,
September
15, 1996
http://www.archive.org/web/researcher/ArcFileFormat.php
An archival format rather than
authored file format. ARC format is used
by Internet Archive to store data. It seems to be concerned with
efficient management and storage of large volumes of data and many
files rather than more specialised preservation concerns, e.g. format
viability and migration
see also
PRONOM http://www.nationalarchives.gov.uk/pronom/
JHOVE (JSTOR/Harvard Object Validation Environment)
http://hul.harvard.edu/jhove/
DFDL - Grid Forum Data Format Description
Language
http://forge.gridforum.org/projects/dfdl-wg
to define an XML-based language for describing the
structure of binary and character encoded (ASCII/Unicode) files and
data streams so that their format, structure, and metadata can be
exposed
Significant properties and representation
information
Significant properties is not just related to formats, but properties
tend to be classified by object types and a major part of this is
formats. Similarly for representation information, which is why this
subsection is included here. Strictly, formats could be seen as a
subset of both, so this should be the major section with formats as a
subsection. As can be seen above, however, the volume of literature on
formats dominates, hence the inversion.
Brown, Adrian, Representation Information Registries, Planets project,
White Paper, 29 January 2008
Comment from Digital Curation
Blog, 14 April 2008 "contains the best discussion on representation
information I have seen"
http://digitalcuration.blogspot.com/2008/04/representation-information-from-planets.html
Knight, Gareth (contributors: Stephen Grace, Lynne Montague), Framework
for the definition of significant properties, version: V1 (pdf 49pp),
AHDS, InSPECT Project Document, 05/02/2008
http://www.significantproperties.org.uk/documents/wp33-propertiesreport-v1.pdf
Coyne, Mike, David Duce, Bob Hopgood, George Mallen
and Mike Stapleton, The Significant Properties of Vector Images,
JISC Digital Preservation Programme report, v4.3,
27.11.07 (pdf 74pp)
http://www.jisc.ac.uk/media/documents/programmes/preservation/vector_images.pdf
Examines, and recommends, WebCGM, SVG
1.1 and PDF/A be used as the archival formats for 2D vector graphics
Wilson, Andrew, Significant Properties Report, V2 (pdf 10pp), AHDS,
InSPECT Work Package 2.2, 10/04/2007
http://www.significantproperties.org.uk/documents/wp22_significant_properties.pdf
For origins of Significant Properties
see discussion of: Significant Properties workshop, Digital
Curation Blog, 26 March 2008
http://digitalcuration.blogspot.com/2008/03/significant-properties-workshop.html
OAIS
see also
Egger, Alexander, Shortcomings of the Reference Model
for an Open
Archival Information System (OAIS), TCDL
Bulletin (IEEE Technical Committee on Digital Libraries), Volume
2, Issue 2, 2006
Beedham, Hilary, Julie
Missen, Matt Palmer and Raivo Ruusalepp, Assessment of UKDA and TNA
compliance with OAIS and METS standards, JISC study, 2005
http://www.jisc.ac.uk/index.cfm?name=project_oais
Practical examples of OAIS application
Lavoie, Brian F., Introduction to OAIS, Digital
Preservation Coalition,
Technology Watch Series Report 04-01,
January 2004 (pdf 20pp)
http://www.dpconline.org/docs/lavoie_OAIS.pdf
The OAIS Reference Model, section 4B in Digital
Preservation Management: Implementing Short-Term Strategies
for Long-Term Problems, Cornell University, September 2003
http://www.library.cornell.edu/iris/tutorial/dpm/foundation/oais/index.html
Excellent tutorial
Gladney,
H., Use
and Misuse of OAIS, Digital Document Quarterly, Vol. 1, No. 3, 3Q2002
http://home.pacbell.net/hgladney/ddq_1_3.htm
DDQ is Gladney's personal newsletter
and viewpoint: "The more I considered how OAIS was used in
digital preservation articles, the more it puzzled me. I could not
firmly determine whether the
authors, who mostly were Research Library Group affiliates, viewed OAIS
as an ontology or were planning to use it as an architecture. ...
.What’s missing is an architecture."
Reference
Model for an Open Archival Information System (OAIS) (pdf
148 pp),
Consultative Committee for Space Data Systems, CCSDS 650.0-B-1, Blue
Book, Issue 1, January 2002, adopted as ISO 14721:2003
http://public.ccsds.org/publications/archive/650x0b1.pdf
Lavoie, Brian, Meeting the challenges of digital preservation: The OAIS
reference
model, originally in the OCLC
Newsletter, No. 243:26-30
(January/February
2000)
http://www.oclc.org/research/publications/newsletter/repubs/lavoie243/
Introduction to OAIS
Archive Ingest and Handling Test (AIHT)
For completeness this collection of papers from a special issue of D-Lib Magazine are listed under
this sub-head. The title of the project places the papers within the
context of OAIS, but the papers could just as easily have been
variously listed in the sections on Preservation and institutional
repositories (DiLauro et al.), File formats
(Abrams et al., Anderson et al.) and Tools (Nelson et al.).
The AIHT reveals important practical experience, although there are
some differences with anticipated preservation
service models for IRs. For example, in AIHT:
- There is no scope for interaction between creator and archive
- There is no moderated ongoing transfer process or protocol, just
a single disc of compressed data containing all files
- There is no business model (i.e. who is doing what for whom, and
why)
- The scope of the test archive may or may not reflect a typical
profile of an institutional repository
Shirky, Clay, AIHT: Conceptual Issues from Practical Tests, D-Lib Magazine,
Vol. 11, No.
12, December
2005
http://www.dlib.org/dlib/december05/shirky/12shirky.html
Practical sections (Phases I,
II, III) are useful, notably the parts on identifiers, 'small errors',
and 'multiple expressions', although the lack of a business model makes
it tricky to assess the relevance of conclusions
Abrams, Stephen, Stephen
Chapman, Dale Flecker, Sue Kreigsman, Julian Marinus, Gary McGath, and
Robin Wendler, Harvard's
Perspective on the Archive Ingest and Handling Test, D-Lib Magazine,
Vol. 11, No.
12, December
2005
http://www.dlib.org/dlib/december05/abrams/12abrams.html
By the creators of JHOVE, makes a point about versioning of format
ID tools: "'These discrepancies (between versions of JHOVE) point out
the importance of standardizing on processing tools and criteria for
format well-formedness and validity."
DiLauro, Tim, Mark Patton, David Reynolds, and
G. Sayeed Choudhury, The Archive Ingest and
Handling Test: The Johns Hopkins University Report, D-Lib Magazine,
Vol. 11, No.
12, December
2005
http://www.dlib.org/dlib/december05/choudhury/12choudhury.html
Concerned with ingest into
Fedora and DSpace-based archival stores
Nelson, Michael L., Johan Bollen, Giridhar Manepalli, and Rabia
Haq, Archive Ingest and Handling Test: The
Old Dominion University Approach, D-Lib Magazine, Vol. 11, No. 12, December 2005
http://www.dlib.org/dlib/december05/nelson/12nelson.html
The only non-library to
participate in the AIHT and 'instead, our focus was on alternative
archiving concepts'. This is chiefly notable for its use of MPEG-21
DIDL instead of METS: "If we could assume cooperation on the part of
the web site maintainer, then the easiest thing for them to do is to
install mod_oai, an Apache module that streams out web site contents
using the OAI-PMH and complex object formats such as MPEG-21 DIDL.
mod_oai essentially converts web sites into OAIS DIPs."
Anderson, Richard, Hannah Frost, Nancy Hoebelheinrich, and Keith
Johnson, The AIHT at Stanford University:
Automated Preservation Assessment of Heterogeneous Digital Collections, D-Lib Magazine,
Vol. 11, No.
12, December
2005
http://www.dlib.org/dlib/december05/johnson/12johnson.html
Concerned with file format
assessment. One comment stands out: "until it becomes common practice
to integrate digital stewardship and preservation concerns into the
entire digital content lifecycle - especially front-end content
creation - most digital preservation workflows intended to be inclusive
will be reactive instead of prescriptive."
Preservation metadata
Guenther, Rebecca
S., Battle of the Buzzwords:
Flexibility vs. Interoperability When Implementing PREMIS in METS, D-Lib Magazine, Vol. 14 No. 7/8,
July/August 2008
http://www.dlib.org/dlib/july08/guenther/07guenther.html
PREMIS (PREservation Metadata: Implementation Strategies) Preservation
Metadata Maintenance Activity,
http://www.loc.gov/premis/v2/premis-2-0.pdf
This publication includes the PREMIS Introduction, the Data Dictionary,
and Data Dictionary Entity Hierarchical Listing, which are also
available as separate
documents
Lavoie, Brian F., PREMIS With a Fresh Coat of Paint: Highlights from
the Revision of the PREMIS Data Dictionary for Preservation Metadata, D-Lib Magazine, Vol. 14, No. 5/6, May/June 2008
http://www.dlib.org/dlib/may08/lavoie/05lavoie.html
Woodyard-Robinson, Deborah, Implementing the PREMIS data
dictionary: a
survey of approaches (pdf 56pp), The PREMIS Maintenance
Activity/Library of Congress, 4 June 2007
http://www.loc.gov/standards/premis/implementation-report-woodyard.pdf
Guenther, R. and Xie, Z., Implementing PREMIS in Container Formats, Archiving
2007, Arlington, Virginia, 21-24 May 2007 (pdf 4pp)
http://www.loc.gov/standards/premis/IST-premis-containers.pdf
Coyle, Karen, Rights in the PREMIS Data Model (pdf 32pp), Report for
the Library of Congress, December 2006 http://www.loc.gov/standards/premis/Rights-in-the-PREMIS-Data-Model.pdf
"a less discovered gem ... While the
primary focus of the report is to discuss the required enhancements to
incorporate digital object rights information into the PREMIS data
model, a particular value of this report is its comprehensive overview
of the PREMIS metadata scheme. For those unfamiliar with PREMIS, this
report is a good introduction" Frank Cervone, Current Cites, June 2007
Knight, Gareth, A minimal preservation metadata element set for the
SHERPA DP Project, v1.3, 07/08/2006
http://www.sherpadp.org.uk/documents/wp44-preservation-metadata.pdf
(21pp)
Caplan, Priscilla, Preservation Metadata, DCC Digital Curation Manual, 1
August 2006
http://www.dcc.ac.uk/resource/curation-manual/chapters/preservation-metadata/
Lee, Bronwyn, Gerard Clifton and Somaya Langley,
Australian Partnership for Sustainable Repositories PREMIS Requirement
Statement Project Report (pdf 59pp), National Library of Australia,
July 2006
http://www.apsr.edu.au/publications/presta.pdf
Lavoie, Brian,
and Richard Gartner, Preservation metadata, DPC
Technology Watch Series Report 05-01, September 2005
http://www.dpconline.org/docs/reports/dpctw05-01.pdf
"comprehensive but highly readable update
on developments in preservation metadata and METS"
PREMIS (PREservation Metadata: Implementation Strategies)
Working
Group, Data Dictionary for Preservation Metadata: Final Report of the
PREMIS Working Group (May 2005)
http://www.oclc.org/research/projects/pmwg/
"The
data dictionary is a translation of the OAIS-based 2002 Framework into
a set of implementable semantic units." "represents a significant step
forward in terms of closing the gap between theory and practice in
preservation metadata, and represents the only cross-institutional,
cross-domain consensus-building activity in this area." From
Lavoie and Gartner (2005)
The authoritative reference, won the
2005 Digital Preservation Award and the 2006 Society of American
Archivists Preservation Publication Award. Superseded by
Guenther, Rebecca, PREMIS - Preservation Metadata Implementation
Strategies Update 2: Core Elements for Metadata to Support Digital
Preservation, RLG DigiNews,
Volume 8, Number 6, Dec. 2004
http://www.rlg.org/en/page.php?Page_ID=20492#article2
Caplan, Priscilla, PREMIS - Preservation Metadata - Implementation
Strategies Update 1. Implementing Preservation Repositories for Digital
Materials: Current Practice and Emerging Trends in the Cultural
Heritage Community, RLG DigiNews,
Volume 8, Number 5, Oct. 2004
http://www.rlg.org/en/page.php?Page_ID=20462#article2
a summary of
Implementing Preservation Repositories For
Digital Materials: Current
Practice And Emerging Trends In The Cultural Heritage Community, A
Report by the PREMIS Working Group, September 2004
http://www.oclc.org/research/projects/pmwg/surveyreport.pdf
results from a survey ... Nearly 50
responses were received ... institutions included libraries, archives,
and museums, among others. ... respondents were heavily skewed toward
US libraries ... responses underscored a number of issues impacting
preservation metadata, including the extent to which repository
architectures are informed by OAIS; the needs of repository
stakeholders; methods for obtaining metadata for archived digital
objects; types of metadata currently used by repositories; the nature
and use of rights management metadata; access mechanisms for archived
materials; and strategies for meeting long-term preservation objectives.
From Lavoie and Gartner (2005)
Day, Michael, Preservation metadata, 11-Dec-2003
http://www.ukoln.ac.uk/metadata/publications/iylim-2003/
also in G. E. Gorman and Daniel G. Dorner (eds.), Metadata applications
and management, International Yearbook of Library and Information
Management, 2003-2004, London: Facet Publishing, 2004
Good narrative (i.e. not especially
technical) review
Day, Michael, Preservation metadata initiatives: practicality,
sustainability, and interoperability, ERPANET
Training Seminar on
Metadata in Digital Preservation, Marburg, Germany, 3-5
September 2003
(revised)
http://www.ukoln.ac.uk/preservation/publications/erpanet-marburg/day-paper.pdf
Searle, S., and Thompson, D., Preservation metadata :
pragmatic first steps at the National Library of New Zealand. D-Lib
Magazine, 9(4), 2003
http://www.dlib.org/dlib/april03/thompson/04thompson.html
Little detail, effectively an overview of
National Library of New Zealand, Metadata Standards Framework –
Preservation Metadata, November 2002 (pdf 42pp)
http://www.natlib.govt.nz/files/4initiatives_metaschema.pdf
Contains the detailed element list
for the NLNZ preservation metadata schema 'designed to strike a balance
between the principles of preservation metadata and the practicalities
of implementing a working set of preservation metadata', but should be
cross-checked with the PREMIS Data Dictionary as a later, broader
reference
OCLC/RLG Working Group on Preservation Metadata, Preservation Metadata
and the OAIS Information Model: A Metadata Framework
to Support the
Preservation of Digital Objects (pdf 54pp), June 2002
http://www.oclc.org/research/projects/pmwg/pm_framework.pdf
"first international consensus-driven
statement on the scope of preservation metadata." From Lavoie
and Gartner (2005)
reviewed by Hans Hoffman, DigiCult Info, Issue 2, October 2002 (pp15-20)
http://www.digicult.info/downloads/digicult_info2.pdf
Cedars Guide To Preservation Metadata,
March 2002
http://www.leeds.ac.uk/cedars/guideto/metadata/guidetometadata.pdf
Preservation Metadata for Digital Objects: A Review of the State of the
Art, A White Paper by the OCLC/RLG Working Group on Preservation
Metadata,
January 31, 2001
http://www.oclc.org/research/projects/pmwg/presmeta_wp.pdf
"provided a definition of
preservation metadata, described its role in the digital preservation
process, and reviewed a number of existing preservation metadata
initiatives, with an emphasis on identifying points of convergence and
divergence among them." From Lavoie and Gartner (2005)
Lupovici, Catherine, Julien Masanès, Metadata for long
term-preservation, Nedlib Consortium, July 2000
http://www.kb.nl/coop/nedlib/results/D4.2/D4.2.htm
National Library of Australia, Preservation Metadata for Digital
Collections, 15 October 1999
http://www.nla.gov.au/preserve/pmeta.html
Day, M., Metadata for digital preservation: an update, Ariadne, 22
December 1999
http://www.ariadne.ac.uk/issue22/metadata/
Day, Michael, Metadata for
Preservation, Cedars Project Document AIW01, CEDARS, 3 August
1998 http://www.ukoln.ac.uk/metadata/cedars/AIW01.html
RLG Working Group on Preservation Issues of Metadata, Final
Report, May 1998
http://www.rlg.org/preserv/presmeta.html
see also
National Library of New Zealand Metadata Extraction Tool Version
1.0
http://www.natlib.govt.nz/en/whatsnew/4initiatives.html#extraction
Packaging metadata (inc. METS)
Smith, Joan A. and Michael L. Nelson, CRATE: A Simple Model for
Self-Describing Web Resources (pdf 12pp), 7th International Web
Archiving Workshop (part of ACM IEEE Joint Conference on Digital
Libraries), Vancouver, Canada, 23 June 2007
http://www.iwaw.net/07/IWAW2007_smith.pdf
Proposes a "simple preservation
model, a complex object consisting of undifferentiated metadata and the
resource byte stream" aimed at the "everyday, personal, departmental,
or community web site where a long-term preservation strategy does not
yet exist." Compares the approach with ARC/WARC, VEO, METS, PREMIS and
MPEG-21 DID.
Popham, Michael, An investigation of METS as a method of packaging
metadata and data (pdf 6pp), Sherpa-DP project report, 28 April 2006
http://www.sherpadp.org.uk/documents/wp42-mets.pdf
More specifically on METS for
institutional repositories, includes an "investigation of possible uses
of METS to package e-prints metadata" and an "assessment of METS as a
mechanism for transferring data from DSpace and EPrints repositories"
Bekaert, Jeroen, and Herbert Van de Sompel, Representing Digital
Assets
using MPEG-21 Digital Item Declaration, ArXiv, Computer Science,
cs.DL/0508065, 13 Aug 2005, International
Journal on
Digital Libraries, accepted for publication
http://arxiv.org/abs/cs.DL/0508065
An alternative to METS?
Metadata and Encoding
Transmission Standard: METS and Overview and Tutorial, Library
of Congress, 2004
http://www.loc.gov/standards/mets/METSOverview.v2.html
"METS has the greatest potential for
(organising and linking preservation metadata to its associated
content), as it was designed to implement the OAIS Reference Model's
abstract model of an Information Package." From Lavoie and
Gartner (2005)
see also
XML Formatted Data Units (XFDU), an XML packaging standard developed by
CCSDS at NASA/SDSCC
http://www.ccsds.org/docu/dscgi/ds.py/GetRepr/File-1912/html#Head%3E%20-11.htm
"METS schema inherited as the basis
for this effort. METS is a very
flexible structure that has been developed by the Digital Library with
some attention to the OAIS RM. The metadata and file association
methods are very flexible. However, METS is more of a conceptual model
and the Representation Data mapping is questionable. The proposed XFDU
Data Model should map directly to the OAIS Information Model Classes"
Waibel, Günter, Like Russian Dolls: Nesting Standards for
Digital
Preservation, RLG DigiNews, Vol. 7, No. 3, June 15, 2003
http://www.rlg.org/preserv/diginews/diginews7-3.html#feature2
links OAIS, METS and NISO Data
Dictionary—Technical Metadata for
Digital Still Images
see also
MPEG DIDL
"Set to be industrial standard, though 'non-free' (companies own IP in
various areas)"
NISO Z39.87 Metadata for Still Images
http://www.niso.org/committees/committee_au.html and related effort NISO Metadata for Images in XML (NISO MIX)
http://www.loc.gov/standards/mix/
OAI and preservation harvesting
Smith, Joan A. and Michael L. Nelson, Creating Preservation-Ready Web
Resources, D-Lib
Magazine,
Vol. 14,
No. 1/2, January/February
2008
http://www.dlib.org/dlib/january08/smith/01smith.html
Proposes "a simple model for everyday
web sites which takes
advantage of the web server itself to help prepare the site's resources
for preservation. In this paper we discuss modoai, the web server
module we developed to support this approach."
Lewis, Stuart David, and Jon Bell, Using OAI-PMH and METS for
exporting metadata and digital objects between repositories, CADAIR,
University of Wales Aberystwyth Institutional Repository, 2006
(announced 1 August 2006)
http://cadair.aber.ac.uk/dspace/handle/2160/203
also in
Program: Electronic Library
and Information Systems, Vol. 40, No. 3, 2006, 268-276
Santhanagopalan,
Kamini, Edward A. Fox and Gail McMillan, A
Prototype for Preservation and Harvesting of International ETDs using
LOCKSS and OAI-PMH (pdf 36pp), 9th
International Symposium on Electronic Theses and Dissertations,
Quebec City, June 7 - 10, 2006
http://www6.bibl.ulaval.ca:8080/etd2006/pages/papers/SP10_%20Kamini_Santhanagopalan.pdf
Bekaert, Jeroen, and Herbert Van de Sompel, Access Interfaces for Open
Archival Information Systems based on the OAI-PMH and the OpenURL
Framework for Context-Sensitive Services, Arxiv.org, Computer
Science, cs.DL/0509090, 28 Sep 2005
also presented at PV 2005
Conference, Ensuring Long-term
Preservation and Adding Value to Scientific and Technical Data,
Edinburgh, November 2005
http://arxiv.org/abs/cs.DL/0509090
Bekaert, Jeroen, and Herbert Van de Sompel, A Standards-based Solution
for the Accurate Transfer of Digital Assets, D-Lib Magazine, Volume 11 Number 6,
June 2005
http://dx.doi.org/10.1045/june2005-bekaert
offers real solutions to real issues,
and has particular application
to Preserv
Van de Sompel, Herbert, Jeroen Bekaert, Xiaoming Liu, Luda Balakireva
and Thorsten Schwander, aDORe: a modular, standards-based Digital
Object Repository, ArXiv, Computer Science, cs.DL/0502028, 4 Feb 2005,
to appear in Computer Journal
http://arxiv.org/abs/cs.DL/0502028
Van de Sompel, Herbert, Michael L. Nelson, Carl Lagoze and Simeon
Warner, Resource Harvesting within the OAI-PMH Framework, D-Lib Magazine, Volume 10 Number
12, December 2004
http://www.dlib.org/dlib/december04/vandesompel/12vandesompel.html
Current Cites comment December 2004
http://sunsite.berkeley.edu/CurrentCites/2004/cc04.15.12.html
Trusted digital repositories
Berman, Fran, Ardys Kozbial, Robert H. McDonald, and
Brian E. C. Schottlaender, The Need to Formalize Trust Relationships in
Digital Repositories
, EDUCAUSE Review, vol. 43, no. 3
(May/June 2008), 10–11
http://connect.educause.edu/Library/EDUCAUSE+Review/TheNeedtoFormalizeTrustRe/46608?time=1210622303
Moore, Reagan W. and
MacKenzie Smith, Automated Validation of
Trusted Digital Repository Assessment Criteria,
Journal of Digital Information, Vol
8, No 2 (2007)
http://journals.tdl.org/jodi/article/view/198/181
this paper is believed to be based on, or a later version of
Moore, R. W. and Smith, M.,
Assessment of RLG Trusted Digital
Repository Requirements. Presented at the “Digital Curation &
Trusted Repositories: Seeking Success” Workshop, in conjunction with
the
Joint Conference on Digital
Libraries, JCDL 2006, Chapel Hill, NC,
June 11-15, 2006
http://sils.unc.edu/events/2006jcdl/digitalcuration/Moore_Smith-JCDLWorkshop2006.pdf
Trustworthy Repositories Audit & Certification: Criteria and
Checklist (pdf 94pp) Center for Research Libraries/RLG-OCLC, Version
1.0,
February 2007,
http://www.crl.edu/PDF/trac.pdf
Revised and expanded version of the
RLG-NARA Audit Checklist for the Certification of Trusted
Digital Repositories
(August 2005,
below)
Kaczmarek, Joanne, Patricia Hswe, Janet Eke and Thomas G. Habing,
Using the Audit Checklist for the Certification of a Trusted Digital
Repository as a Framework for Evaluating Repository Software
Applications,
D-Lib Magazine,
Vol. 12, No. 12, December 2006
http://www.dlib.org/dlib/december06/kaczmarek/12kaczmarek.html
This ought to be an important paper
since it explores a possible fault line between the concept of a
Trusted Digital Repository and repositories, repository software and
services. Frustratingly inconclusive, which suggests there may be more
to emerge from this work
Ross, Seamus, and Andrew McHugh, The Role of Evidence in
Establishing Trust in Repositories.
D-Lib
Magazine, Vol. 12, No. 7/8, July/August 2006
Dobratz, S., Schoger, A. and Strathmann, S., The nestor Catalogue of
Criteria for Trusted Digital Repository Evaluation and Certification.
Presented at the “Digital Curation & Trusted Repositories: Seeking
Success” Workshop, in conjunction with the Joint Conference on Digital
Libraries, JCDL 2006, Chapel Hill, NC, June 11-15, 2006
http://sils.unc.edu/events/2006jcdl/digitalcuration/Dobratz-JCDLWorkshop2006.pdf
Ross, Seamus and Andrew McHugh, Audit and
Certification of Digital Repositories: Creating a Mandate for the
Digital Curation Centre (DCC), RLG DigiNews, Volume 9, Number 5, 15 October 2005
http://www.rlg.org/en/page.php?Page_ID=20793#article1
Lists a daunting range of
requirements for trusted repsoitories, and reports DCC's plan to become
mandated to manage audit and certification
Rosenthal, David S. H., Thomas Robertson, Tom Lipkis, Vicky
Reich and
Seth Morabito, Requirements for Digital Preservation Systems: A
Bottom-Up Approach, D-Lib Magazine,
Vol. 11 No. 11, November 2005
http://www.dlib.org/dlib/november05/rosenthal/11rosenthal.html
also in ArXiv, Computer Science, cs.DL/0509018, 6 Sep 2005
http://www.arxiv.org/abs/cs.DL/0509018
RLG/NARA, An Audit Checklist for the Certification of Trusted Digital
Repositories (draft for comment), August 2005
http://www.rlg.org/en/page.php?Page_ID=20769
implements a certification procedure
that builds on the RLG-OCLC report
from May 2002 (below). All the targets for audit procedure testing are
specialist preservation services
Jantz, Ronald and Michael J. Giarlo, Digital Preservation:
Architecture and Technology for Trusted Digital Libraries, D-Lib Magazine, Volume 11 Number 6,
June 2005
Trusted Digital Repositories: Attributes and Responsibilities, An
RLG-OCLC Report, May 2002
http://www.rlg.org/longterm/repositories.pdf
Definitions by RLG and OCLC working
group
Business models, costs, lifecycle, workflow
Quint, Barbara,
OCLC
Introduces High-Priced Digital Archive Service,
Newsbreaks, May 2, 2008
http://newsbreaks.infotoday.com/nbReader.asp?ArticleId=49018
Not a research paper, but a
journalist investigates the relative costs of
storage solutions, one aimed directly at digital libraries. The OCLC
archive
service that is the subject here is given a critical examination in
relation to other services, yet it presents a robust response to tough
questions. This reveals the issues that are driving awareness and
takeup of preservation-linked services. See also Chapman
(2003)
Beagrie, Neil, Julia Chruszcz, and Brian Lavoie, Keeping
Research
Data Safe: A Cost Model and Guidance for UK Universities (pdf 169pp),
Charles Beagrie Limited, Final Report to JISC, April 2008
http://www.jisc.ac.uk/publications/publications/keepingresearchdatasafe.aspx
Principally about research data, and
includes four case studies - Archaeology Data Service, Kings College
London (including what was the AHDS), Cambridge University and
Southampton University (Crystallography and Oceanography) - the last
three of which involve the respective institutional repositories
Wheatley, P., Ayris,
P., Davies, R., Mcleod, R. and Shenton,
H., The LIFE Model v1.1.
Discussion paper, LIFE
project, UCL Eprints, 23 October 2007
http://eprints.ucl.ac.uk/archive/00004831/
Currall, James, Claire Johnson, Peter McKinney, The world
is all grown
digital.... How shall a man persuade management what to do in such
times? International Journal of
Digital Curation, Vol 2, No. 1, June 2007
http://www.ijdc.net/ijdc/article/view/22
Robertson, R. John, Digital preservation in the Tertiary education sector:
management implications, Strathprints, University of Strathclyde
Institutional Repository, 03 October 2006, also in Library Review, 55 (3),