The SPECTRa Project

Jim Downing

University of Cambridge

This presentation

The need for Open Data

This is the context for the SPECTRa project.

Problems in Chemistry

* - Open Data! Prof. P. Murray-Rust. JISC Conference, June 2005

SPECTRa ...

Submission, Preservation and Exposure of Chemistry Training and Research Data

http://www.lib.cam.ac.uk/spectra/

... To The Rescue

Data is lost
Workflow integrated tools to capture data
Extra work required to publish data
User driven development of tools to minimize effort of publication to archive
Lack of exemplars to illustrate benefits
Participating scientists will use tools day to day - demonstrable success.
Lack of infrastructure to handle Open Data
Tools will be portable, customizable Open Source software components
Integration with DSpaces at Cambridge and Imperial College.

SPECTRa Phases

Crystallography from 30,000ft

Measuring diffraction patterns to determine crystal and molecular structure

Structural data + chemical context is valuable.

Problems

Crystallography Sample Manager

Crystallography 2

All the data can be present, but the structure can't be published.

Publication sensitivity

Failure to collect metadata up front coupled with this time delay has been major barrier in crystallographers doing their own structure publication

Crystallography system

  • Solve the sensitivity by storing the publishable packages in escrow
  • Introduce a new 'dark' repository
  • Escrow could be managed by the crystallographer
  • Or by a computer system
  • The development of generic Escrow services and policies is something we're hoping to develop in the future.

DSpace Escrow Repository

DSpace Core Strengths

DSpace Core Development Areas

Potential Collaborations?

SPECTRa In Summary

Thanks for listening!

Questions?

References / Extra Info