Mok, DouMing; Brooke, Matthew; Tao, Jing; Clark, Jeanette; Jones, Matthew B.; Nesbitt, Ian
Arctic Data Center (2024). doi: 10.18739/a2zg6g87q
HashStore is a server-side Python package that implements a hash-based object storage file system for storing and accessing data and metadata for DataONE services. The package is used in DataONE system components that need direct, filesystem-based access to data objects, their system metadata, and extended metadata about the objects. This package is a core component of the [DataONE federation](https://dataone.org/), and supports large-scale object storage for a variety of repositories, including the [KNB Data Repository](http://knb.ecoinformatics.org/), the [NSF Arctic Data Center](https://arcticdata.io/catalog/), the [DataONE search service](https://search.dataone.org/), and [other repositories](https://dataone.org/network).
HashStore efficiently stores large numbers of unique objects based on their contents. In addition to the objects themselves, HashStore stores metadata about those objects so that repository software can easily catalogue, find, and serve the objects and their metadata to repository users.
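
To illustrate the general idea of storing objects based on their contents, here is a minimal sketch in Python. It is not HashStore's actual API or on-disk layout: the function name `store_object`, the choice of SHA-256, and the `depth` and `width` directory-sharding parameters are all assumptions made for illustration.

```python
import hashlib
from pathlib import Path


def store_object(root: Path, data: bytes, depth: int = 3, width: int = 2) -> Path:
    """Write `data` under a path derived from its SHA-256 digest.

    Hypothetical helper for illustration only; not part of HashStore.
    """
    digest = hashlib.sha256(data).hexdigest()
    # Use the leading hex characters as nested directory names so that
    # no single directory ends up holding every object.
    parts = [digest[i * width:(i + 1) * width] for i in range(depth)]
    target = root.joinpath(*parts, digest[depth * width:])
    # Identical content always maps to the same path, so a second write
    # of the same bytes is skipped and the object is stored only once.
    if not target.exists():
        target.parent.mkdir(parents=True, exist_ok=True)
        target.write_bytes(data)
    return target
```

Because the path depends only on the bytes, storing the same content twice returns the same location, while different content lands elsewhere. A real system would also need to handle concurrent writers and record metadata alongside each object, which this sketch leaves out.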