Healers Project: Interview Collection DMP

Roles & Responsibilities

The DMP should clearly articulate how sharing of primary data is to be implemented. It should outline the rights and obligations of all parties with respect to their roles and responsibilities in the management and retention of research data. It should also consider changes to roles and responsibilities that will occur if a project director or co-project director leaves the institution or project. Any costs stemming from the management of data should be explained in the budget notes.

The Healers interview collection consists of audio clips of interviews, as well as accompanying transcripts and translations, conducted through years of fieldwork throughout the Caribbean and the Pacific Northwest. These interviews broadly address the question: What does healing look like in the context of Afro-indigenous traditions? The interview collection will be shared with the general public using the University of Oregon’s cultural heritage digital library, Oregon Digital. It will also be deposited into the Digital Library of the Caribbean at the University of Florida. We are taking a multi-institutional approach to having the collection in both digital repositories because we believe in LOCKSS (Lots of Copies Keep Stuff Safe). However, the authoritative interviews will be held by the University of Oregon and aggregated through the Digital Library of the Caribbean not the University of Oregon.

Expected Data

The DMP should describe the types of data, samples, physical collections, software, curriculum materials, or other materials to be produced during the project. It should then describe the expected types of data to be retained. Project directors should address matters such as these in the DMP:

Data types

There will be a minimum of 20 interviews in this digital collection. The interviews that will be published on the web include audio (.mp3) and transcription and translation documents (.pdfs). Data quantity: As of spring 2023, there are 20 interviews. These interviews are clips from longer ones. It is important to note that the collection will grow as more interviews are added. This means for every interview in the collection, data managers need also count the transcription and translation. So, if there are 20 interview clips then there are 20 transcriptions and 20 translations, which makes the collection contact 60 files. Here is a breakdown of collection size, work type, file format, total number of assets, average file size, and average total size for a set of file types.

Work Type File Format Total Assets What is the average file size (in MB) for each file type in this collection? What is the total size of all files (in MB) that have the same file type?
audio mp3 16 3.46 mb 55.4 mb
text srt 16 3.9 kb 62.4 kb
text pdf 32 2.48 mb 6.44 mb
text docx 32 43.38 kb 694 kb

In total the collection’s data quantity is 62.59mb as of March 2023.

Data Handling: Interviews will be captured by Alai and her research field team. They are following their research project’s human-subject research protocol to ensure data privacy and protections are in place before passing interviews on for processing and archiving. Interview clips will be made by people on Alai’s field team before handed over to the data management team. Once interviews are ready for archival processing then the data managers will apply file naming standardization, file conversions, resource descriptions, and create transcriptions and translations as web accessible PDFs and create closed caption files. All data shared with the data management team will be hosted and activity handled in a University of Oregon Dropbox Team folder. Once the interviews have been processed, they will then be made ready for upload to Oregon Digital and dLOC.

Folder structure: There will be 1 folder inside Dropbox, and it will be used to keep all interview materials that have been shared with data managers. Each level indicates the holder hierarchical order.

  1. Interview_collection a. Originals – The field team deposits interviews for processing in this folder. b. Processing_clips: This folder is for data managers to prepare clips for conversation, manipulation, and transcription and translation c. Cataloging: This folder is for data managers to apply resource descriptions to interviews d. Ready_share: Data managers should add options and metadata that are ready to share through Oregon Digital and the Digital Library of the Caribbean

File naming standard: Filenames should follow the following format: name_topic_serialnumber; name_topic_sourcetype_serialnumber Examples: daniela_nature_001; daniela_nature_transcript_001

Resource Descriptions: See Appendix A

Legal and ethical restrictions on access to non-aggregated data: This has the same treatment as aggregated data. Kate Thornhill and Rachael Lee are the managers of the Dropbox folder where the Healers interview collection data are stored. If a collaborator leaves the Healers project, they will ensure that any permissions have been re-attributed (if applicable) and revoke their access.

Aggregated and Shared data: All interviews and metadata will be available for download and reuse through the University of Oregon and the Digital Library of the Caribbean.

All Healers interview collection digital assets are original content created by Dr. Alai Reyes-Santos and community partners. Rights and reuse are governed by several frameworks in consultation with the healers themselves.

Traditional Knowledge Label: Each interview, transcript, and translated transcript has a Traditional Knowledge Label to indicate attribution, access, and use rights. The Traditional Knowledge Labels are provided by LocalContexts.org, which “supports Indigenous communities to manage their intellectual and cultural property, cultural heritage, environmental data and genetic resources within digital environments.” Traditional Knowledge Labels in use for the interview collection include the following:

Rights: Each interview, transcript, and translated transcript is covered by the same rights, which were selected from the options in RightsStatements.org. This organization provides simple and standardized rights statements that “are designed to be used by cultural heritage institutions to communicate the copyright and re-use status of digital objects to their users. These statements provide a best practice for use by both international, national and regional aggregators of cultural heritage data, and the individual institutions and organisations that contribute data to them.” The rights statement in use for the interview collection is the following:

Rights Holder: The healer who provides the interview is the rights holder.

Period of Data Retention

NEH is committed to timely and rapid data distribution. However, it recognizes that types of data can vary widely and that acceptable norms also vary by discipline. It is strongly committed, however, to the underlying principle of timely access. In their DMP applicants should address how timely access will be assured.

Interviews will be made available to the public within 12 months after grant funding ends.

Data Formats and Dissemination

The DMP should describe data formats, media, and dissemination approaches that will be used to make data and metadata available to others. Policies for public access and sharing should be described, including provisions for appropriate protection of privacy, confidentiality, security, intellectual property, or other rights or requirements. Research centers and major partnerships with industry or other user communities must also address how data are to be shared and managed with partners, center members, and other major stakeholders. Work types and their appropriate file formats will be compliant with these standards. | Object Type | Work Type | File Format | | ———- | ———- | ———- | | audio | Interview recording | .mp3 | | text | Transcription; translations | .pdf |

Metadata will be shared through Oregon Digital and the dLOC. Privacy and security will be considered and applied to the data based on a research protocol created by the field team. This includes not using healer last names or disclosing their locations in greater detail. The field team will also work with data managers to erase GPS data from recordings. Intellectual property information can be referenced in the expected data section of this DMP.

Data Storage & Preservation of Access

The DMP should describe physical and cyber resources and facilities that will be used to effectively preserve and store research data. These can include third-party facilities and repositories. Data preservation will be ensured by Oregon Digital and dLOC.

Technical Credits - CollectionBuilder

This digital collection is built with CollectionBuilder, an open source framework for creating digital collection and exhibit websites that is developed by faculty librarians at the University of Idaho Library following the Lib-Static methodology.

The site started from the CollectionBuilder-GH template which utilizes the static website generator Jekyll and GitHub Pages to build and host digital collections and exhibits.

More Information Available

Technical Specifications
IMLS Support