Special interest group proposal

Dear honourable members of the CAA Steering Committee,

With this letter, we would like to express our intention to establish a CAA Special Interest Group for Scientific Scripting Languages in Archaeology (SIG-SSLA) under the auspices of the CAA. The idea for this SIG was developed in 2018 in a session on R as an archaeological tool. Out of this event and the resulting discussions, it was proposed to form such a Special Interest Group to create a permanent platform for the exchange and discussion of ideas and to develop the practical use of R for archaeological applications. After further discussions with colleagues, it became clear that it would be a good idea to extend the focus to all scientific script languages.

Consequently, a well-attended and constitutive round table meeting took place in Krakow at CAA 2019 (S03 Roundtable Scientific Scripting Languages in Archaeology - Limits and Opportunities of Open Research). At this meeting, a statement of purpose was presented and discussed. The SIG already exists de facto with a Google mailing list (https://groups.google.com/forum/#!forum/scientific-scripting-languages-in-archaeology), a Github organization (https://github.com/sslarch) and its active members (currently 14). This group is currently coordinated by Sophie C. Schmidt, Martin Hinz and Clemens Schmid. For the CAA 2020, we would now like to propose to formally include this SIG in the canon of the SIGs of the CAA.

Statement of purpose

The purpose of the CAA Special Interest Group Scientific Scripting Languages in Archaeology is to promote the widespread implementation of computer scripts in archaeology. We will provide an exchange platform for scientists in and around archaeology who use scripting languages to conduct major or minor parts of their research. Scripts are computer programs that automate the execution of various kinds of tasks and are typically used by scientists to process and visualize information in ways that are amenable to the principles of open research. The purpose of this SIG is to help foster the growing community of archaeologists who increasingly use scientific scripting languages to pursue archaeological interests.

This SIG is particularly supportive of students and early career researches who may be interested in developing their skills and promoting more pervasive use of scientific scripting languages in research, publication and teaching. By promoting the widespread adoption of practical technological skills, we hope to establish a robust community of practice that may serve as a foundation for further growth.

The SIG is an open format that supports discussion and the circulation of ideas drawn from various perspectives. Everyone may participate on equal terms, following CAA International’s established Ethics Policy. To ensure expedient and civil discourse, the SIG committee will remove, edit, or reject comments, commits, code, issues, and other contributions that are not aligned with this policy. To be as inclusive as possible and also for the sake of reproducibility, we strongly prefer open-source over proprietary software.

Scripting languages

A scripting language is a programming language that automates the execution of various data-driven tasks. The tendency for scripting languages to be run through an interpreter rather than as compiled from a prior human-readable state enables scripts to be implemented in a rapid, iterative and exploratory manner that is well suited for scientific research.

Examples of popular scripting languages in archaeology and other sciences include, but are not limited to:

  • R
  • Python
  • Bash
  • Netlogo
  • Stan
  • OxCal
  • JavaScript
  • SQL

Implications for open and reproducible research

The ease with which scripts can be understood by humans, as well as their tendency to be implemented in a step-wise manner, affords their capability to be well-documented. A well-documented script may potentially be used to reproduce analytical processes and verify their results. Scripting languages are therefore lauded as facilitators of open and reproducible research.

Code should be equally important as publications. That also means that code generally should be subject to a scientific reviewing process. It is, therefore, one long-term aim of this SIG to offer guidelines on how code publication and - review can be fruitfully implemented into research and publication processes (see Objectives).

The tendency for scripts to be stored in human-readable text-based file formats also makes them particularly amenable to flexible means of storage, dissemination and version control. Reproducible research ideally also documents the science production process with all inductive and deductive steps of hypothesis building. A modern and powerful way to store a commented succession of changes in text and scripted data analysis is available with tools like Git or SVN which were initially designed for software development. They provide a simple environment to keep track of even minor changes, go back to earlier working stages and allow for well-documented collaborations thanks to forking. The combination of scripting languages and version control is an important basis for a fully comprehensible research process.

These somewhat tangential yet meaningful implications relating to the use of scientific scripting languages underscore aspects of their utility that the SSLA aims to promote.

Objectives of the SIG SSLA

In summary, the SIG SSLA aims to:

  • promote the replacement of proprietary and opaque programs with open source and transparent software based on scripting languages, which encourage the advancement of open and reproducible research practices
  • help establish scripting as a readily-accessible component of the archaeologist’s toolbox
  • promote the integration of coding into the basic archaeological teaching curriculum
  • promote the publication of archaeological research data so that they may be used by others, and to facilitate reproducible research practices overall
  • discuss and critically assess the development of scripting languages and the implications that they afford
  • develop step-by-step pipelines for common archaeological tasks
  • outline best practices and guidelines to support more effective writing, sharing and publishing code
  • construct a peer review network for code in publications and software libraries
  • establish an inheritance network to arrange long term maintenance and responsibility for software written by and for archaeologists
  • stand up for diversity and fairness in the field of computational archaeology

If you approve the formation of this SIG, we would also like to ask you if and how the SIG could get a short slot in the schedule of the CAA2020 conference in Oxford to discuss the organizational affairs among new and established members.

Thank you for considering this proposal.

With best regards,

Sophie C. Schmidt, Martin Hinz, Zack Batist and Clemens Schmid

Scientific Scripting Languages in Archaeology

A special interest group of CAA International dedicated to scientific scripting languages in archaeology.