Data

Chris Lang and Barnum Brown stand on two sides of a Tyrannosaurus rex fossil skeleton laid out on a table and platform for its head. Charles Lang and Barnum Brown working in laboratory with skeleton of Tyrannosaurus rex. 
Thane L. Bierwert/© AMNH

Collecting Data 

The information that is associated with a fossil, which is known as specimen data, is almost as important as the fossil itself.

Specimen data can help confirm the identity and age of the fossil; provide information about the paleoenvironment of the site where the fossil was found; help other researchers find the site again, so that further collections can be made; guide curators, conservators, and preparators in making decisions about how best to treat the material; and provide information on the history of collecting. Without these data, the fossil is much less “valuable” to scientists than it would otherwise be. For this reason, collection managers spend almost as much time managing data as they do looking after the fossils themselves.

The Darwin Core Standard

Because so many institutions collect data about specimens, and want to share these data efficiently, there are now major efforts within the scientific community to reach agreement on the basic types of information that should be collected. This “data about data” is known as “metadata” and the list of agreed metadata for natural history specimens is called the Darwin Core standard (or DwC for short). The Darwin Core is defined as “a specification of data concepts and structure intended to support the retrieval and integration of primary data that documents the occurrence of organisms in space and time and the occurrence of organisms in biological collections.” 

Sharing data is an important form of collection access. Researchers need to get access to this information in order to make the most effective use of the collection, while members of the public rely on interpretations of the data to understand specimens that are on display. However, it’s important to remember that collections data are valuable proprietary materials. Institutions need to keep this information secure and develop policies about what information to release and under what conditions it should be shared.

Important Types of Data

Sharing Data

Databasing: Storing Data

Many large institutions have their collection information in databases that are capable of storing not only specimen data, but also different types of associated media (e.g., specimen images), and which can make these available and searchable by the academic community and general public via the internet.

The choice, design, and building of a collection database is an enormous topic, far beyond the scope of this website. But because it is such an important topic, what follows is a quick overview of some of the basic things to keep in mind for any collection.

Backing up Your Database

The data contained in a collections database are of paramount importance and so it is essential to have procedures in place to back up your data to protect against data loss or corruption. You should have a plan in place for backing up your database on a regular basis onto some media that, ideally, can be stored off-site. At its simplest level, backing up may involve copying your database to an external source.

These Collection Management resources were originally developed in 2007 with the support of the National Science Foundation (NSF).

National Science Foundation logo is of an illustration of planet earth with a gold border and text that reads NSF.