We are a home for Earth science data and computing professionals. Our sessions bring together the community for hands-on, interdisciplinary deep dives as we explore "Innovation to Impact" this year. Learn more about ESIP: esipfed.org
Research data management systems (RDMS) are software systems like any other, and are subject to the same best practices as other software, including backup/replication and test your backups regularly. For RDMS the core goal is to preserve all datasets, all metadata and data, any backups should preserve this in a fashion that can be ingested by the same RDMS that created the backup or an instance of the RDMS somewhere else. As the saying goes, you are only as good as your last tested backup. Testing the ability of other repositories to ingest your data backups is an exercise to perform repeatedly. In this hands-on session we will present some examples of how to backup/replicate common open source RDMS, such as ERDDAP and Metacat. We may explore thoughts around automated ingest testing capabilities to allow for data facilities to test their backups on a small scale without disrupting regular activities at multiple data facilities. This is a hands-on companion session to “Data replication and repository succession: sustaining and preserving data for the long-term”.
Value to Session Participants: Hands-on experience with doing replication with their RDMS.
Recommended Ways to Prepare for this Session: For those running open source RDMS, familiarity with the backup and ingest functions of their RDMS. For those running home grown/proprietary RDMS, same as above and additional focus on documentation for others to reference in case of a database dump.
Since 1986 I have had parallel careers in Information Technology, Data Management and Analysis, and Archaeology. Since 1993 when I arrived at UNM I have worked as a Graduate Student in Anthropology, Research Scientist, Research Faculty, Applied Research Center Director, and currently... Read More →