1. Rationale for Data Dissemination

1.1 Open Science and Reproducibility

Open science promotes transparency, accessibility, and collaboration in research. By sharing data openly, researchers enable reproducibility of results, foster innovation, and facilitate interdisciplinary studies. This approach aligns with the broader scientific community's move towards openness and accountability.

1.2 FAIR Principles

The FAIR principles provide a framework for effective data management:

Adhering to these principles ensures long-term utility and integration of datasets across platforms.


2. Infrastructure for Data Publication

2.1 PaleoMIX Open Archaeological Database (O.A.D.)

The PaleoMIX O.A.D., part of the ARHUT data platform, is built on Directus, an open-source content management system. This setup offers:

This infrastructure supports efficient data management and facilitates collaboration among researchers.

2.2 External Data Repositories

For broader dissemination and preservation, data can be deposited in established repositories:

Utilising these repositories enhances data visibility and integration into global research efforts.


3. Data Upload and Publishing Protocols

3.1 Preparation of Data

Before uploading, ensure that the datasets:

3.2 Upload Procedure

  1. Access the Platform: Log into the PaleoMIX O.A.D. portal.

  2. Select Appropriate Collection: Choose the relevant data category (e.g., Proteomics, Radiocarbon Dating).

  3. Upload Files: Use the interface to upload data files and associated metadata.

  4. Review and Validate: Ensure all information is accurate and complete.

  5. Submit for Publication: Finalise the upload, making the data available to authorised users or the public, depending on access settings.

3.3 Licensing and Access

Assign appropriate licenses (e.g., CC BY 4.0) to datasets, clearly indicating usage rights. Define access levels to balance openness with any necessary restrictions.


4. Integration with Collaborative Initiatives

4.1 API Utilisation

The Directus-based infrastructure provides APIs designed for efficient data management and enhanced interoperability:

Utilising APIS significantly expands the utility and accessibility of Paleomix datasets.

4.2 Data Usability for Artificial Intelligence

Recently, emphasis has been placed on structuring and managing data to optimise its usability for artificial intelligence (AI) models. An example of this effort is the COST Action MAIA (Managing Artificial Intelligence in Archaeology), which specifically addresses the challenges and solutions related to making archaeological datasets standardised and AI-compatible, promoting advanced analytical techniques and collaborative research.



5. Requirements for Raw Data and Quantitative Summaries

5.1 Raw Data

Raw datasets should include:

5.2 Quantitative Summaries

Processed data should provide:

Ensuring both raw and processed data are available supports transparency and facilitates further research.