Dura-Europos Stories: Developing interactive storytelling applications using knowledge graphs for cultural heritage exploration
Abstract
We introduce Dura-Europos Stories, a multimedia application for viewing artifacts and places related to the Dura-Europos archaeological excavation. We describe the process of mapping data to the Wikidata data model as well as the process of contributing data to Wikidata. We provide an overview of the functionality of an interactive application for viewing images of the artifacts in the context of their metadata. We contextualize this project as an example of using knowledge graphs in research projects in order to leverage technologies of the Semantic Web in such a way that data related to the project can be easily combined with other data on the web. Presenting artifacts in this story-based application allows users to explore these objects visually, and provides pathways for further exploration of related information.
1.From dataset to multimedia application
People working in cultural heritage organizations often publish images on the web to make groups of artefacts browsable for people who would like to view the collection remotely. We designed the Dura-Europos Stories project to pull data from a public knowledge graph, Wikidata, that contains information from multiple institutions related to the Dura-Europos archaeological site.11 Not only does this application provide support for browsing images of artefacts, we also present them in the context of the larger site and connect them to as much supporting information as possible.
Similar projects in the domain of digital humanities include the Biography Sampo project and the other Sampo applications from Finland [5,6]. Many researchers have explored the use of semantic technologies to support historical inquiry [9,15]. Other projects have also described the use of Wikidata to support research in the domain of archaeology [12]. Some groups are exploring how to display 3D content related to collections [18]. A diverse range of topics are covered under the term ‘digital humanities’, and researchers have reused data from Wikidata in many digital humanities projects [21]. Our project contains some features of digital databases described in [14]. We based many of the design decisions for this project on our previous work creating multimedia biographies powered by Wikidata [16,19]. We describe a collaborative project between art historians, information scientists and software engineers to create an interactive application for viewing multimedia content related to the excavations at Dura-Europos.
2.Dura-Europos
The site of Dura-Europos (Syria) was founded around 300 BCE as part of the Seleucid Kingdom, a Hellenistic Greek successor state formed in Western Asia after the death of Alexander the Great. Located on the western bank of the Euphrates river, the city stood on valuable real estate between competing ancient eastern and western powers. As a result, over the course of its history, the city passed from Seleucid control through successive phases of Arsacid (Parthian) and Roman occupation, and ultimately fell to the besiegement efforts of the Sasanian Persians in the 250s CE. Attempting to shore up the city’s defenses in advance of the Persian attack, the Roman soldiers garrisoned at Dura constructed a massive earthen embankment to reinforce the city’s vulnerable western wall. In a move that would prove fateful for the site’s exceptional archaeological preservation, buildings in the vicinity of the west wall – including the oldest excavated Christian church, the most elaborately decorated ancient Jewish synagogue discovered to date, and various pagan temples that attest to the city’s ethnic and cultural diversity – were requisitioned and filled with earth and debris.
What was at the time a highly destructive process, in fact, created (together with the hot, dry climate) the conditions for the site’s extraordinary degree of preservation. Thanks to these unique circumstances, large sections of rare mural painting, and hundreds of objects made from organic materials survive at Dura-Europos. These rarely-preserved artifact types were found together with sculptures, inscriptions, arms and armor, ceramics, coins, and other objects of everyday life. The site therefore provides unparalleled glimpses into the multicultural, religiously-diverse frontier life, and the running of a military garrison with coexisting soldiers and civilians.
Dura-Europos is just one of a number of blockbuster archaeological sites that present significant challenges thanks to long and complex histories of research in multiple languages, resulting in dispersed and heterogeneous datasets. With methodological developments in many areas of the humanities that embrace the epistemological value of material culture and lend serious weight to context, perhaps more than ever before, archaeological datasets need to be able to “speak” across traditional disciplinary boundaries. Similarly, ethically-informed reflections on collecting and research practices have made evident the entrenched Eurocentric knowledge and accessibility hierarchies that frequently exist in the wake of foreign-run excavations in colonial contexts, and have soundly made the case for the need for interventions that include multilingual access. Semantic Web technologies offer possibilities for mitigating some of these challenges. Our work in the context of Dura-Europos, therefore, stands as a test-case for the development of methods that may prove valuable in responding to similar challenges in the context of analogous cultural heritage sites.
3.Dura-Europos data model
The core of Dura-Europos data set consists of metadata created by the Yale University Art Gallery. The International Digital Dura-Europos Archive (IDEA) team then created mappings from the metadata to Wikidata properties to align with the Wikidata data model. For each artefact there are pieces of information about the title, location, height, width, length, approximate age, place of excavation, materials used, and type. Currently there are more than fourteen thousand items in the Dura-Europos collection.
After aligning the existing metadata to the Wikidata data model, the IDEA team created thousands of new items to represent the artifacts, and contributed tens of thousands of statements about them. The artifact with greatest number of statements is ‘Large bowl with rounded sides (twenty-seven rim and side fragments), Yale University Art Gallery, inv. 1938.5999.490’ at 66 statements.22 The bowl consists of twenty-seven fragments. The large number of statements is due to the fact that measurements for the dimensions of each fragment are provided on this item. The artifact with the lowest number of statements is ‘Lizard, Yale Peabody Museum of Natural History, YPM ANT 015992’.33 On average, artifacts from this collection have fifteen statements.
4.Wikidata data model
Wikidata is a knowledge base of structured data that allows community members to contribute data [20]. More than 12,000 editors are active in Wikidata each month [1]. The Wikidata data model consists of items and properties. Items have identifiers, called Qids, structured as unique resource identifiers (URIs). Properties are identified with Pids. For example, in Fig. 1, we see that the property ‘made from material’ (P186), is used with three different values ‘wood’, ‘rawhide’ and ‘paint’. Similarly, property ‘depicts’ (P180), is used with four values to indicate what figures are represented on the scutum.
Fig. 1.
The Wikidata community uses this identifier structure in order to be neutral with regard to human languages. Wikidata currently supports more than 300 human languages, so labels for each language are then associated with the numeric identifiers [7]. The multilingual support available in Wikidata is relevant for Dura-Europos because people from many different language backgrounds are interested in the excavation and artifacts. Institutions in several different countries hold materials related to the Dura-Europos excavation, and the excavated objects have been described in several human languages. Due to the fact that the Dura-Europos data is now in Wikidata, people can now add information in many different languages that make this data accessible for more people.
When searching for artifacts, some people will search for the type of object in general. For example, if people use the word ‘statue’ as a search term, they will find that there are multiple statues that were excavated from the Dura-Europos site. The Wikidata item for ‘statue’ is Q179700.44 As of April, 2022 there are more than one-hundred-fourteen labels in different languages on the Wikidata item for ‘statue’ a sample of this set of labels is available in Fig. 2. In the current version of the Dura-Europos Stories application we have support for search functionality in English. In the future we plan to extend this to additional languages. The multilingual data in Wikidata will allow us to extend search functionality for different languages for words describing types of objects by adjusting the SPARQL query we use to identify these types within Wikidata. If we contrast this with the alternative of commissioning translators to create additional labels in many human languages, leveraging the multilingual data from Wikidata can conserve time and work for people building upon this data.
Fig. 2.
The Wikidata data model also includes references [11]. Editors can add references to any statement on any Wikidata item. The members of our team who contributed the Dura-Europos data to Wikidata created many references that point back to the Yale University Art Gallery website pages for the artifacts. For researchers interested in Dura-Europos, these references provide pathways to additional information about these materials.
5.Getting Dura-Europos data into Wikidata
To bring about the creation of Wikidata items for artifacts excavated from Dura-Europos, the IDEA team worked from spreadsheets of metadata provided by partner institutions. Using their own internal collection management system, each partner institution provided a CSV export of items in their collection related to Dura-Europos. Metadata pertaining to each item was listed in sheet columns. The CSV was then processed using Open Refine55 to regularize equivalent expressions and correct spelling and spacing errors.
In preparing the initial upload into Wikidata, the aim was to reflect all of the metadata provided by the institution as completely as possible. Some metadata fields recorded by partner institutions in CSV column headings were easily matched with equivalent Wikidata properties with minimal research using the SQID tool.66 For instance, the object ‘medium’ provided in institutional metadata mapped neatly to Wikidata Property ‘made from material’ (P186). Other fields supplied by the institution, like classificatory categories of artifact (ex. coin, lamp, statue), translated to 1instance of’ (P31) Wikidata statements.
A few metadata fields, like those expressing uncertainty around the time of an object’s date of creation or the object’s dimensions required parsing of a single column into multiple columns in preparation for Wikidata upload. In institutional records, object dimensions were expressed with height, width, and depth measurements in the same ‘dimensions’ column. In Wikidata, however, each dimensional angle corresponds to its own property (ex. height (P2048), width (P2049), horizontal depth (P5524)). To ease the process of item creation, each facet of measurement was parsed into its own column with a new heading corresponding to the relevant Wikidata property. The IDEA team sought the advice of the Wikidata community on how to preserve the uncertainty surrounding an object’s precise date of creation. Following the pattern established by more mature projects in the GLAM sector, for objects whose precise date of creation could not be established (ex. via externally datable references in inscripional content) the institution-supplied date column was parsed into three new columns: an ‘inception’ (P571) column whose value corresponded to ‘placeholder for some value’ (Q53569537), and two additional columns to hold values corresponding to ‘earliest date’ (P1319) and ‘latest date’ (P1326) as qualifiers on the ‘inception’ property. This modeling strategy allows one to query the data according to a span of years rather than forcing the artificial tethering of an object to a single date.
More sustained and specialized human input was required to extract Wikidata ‘depicts’ (P180) and ‘location of discovery’ (P189) statements, as well as item labels and descriptions that would allow easy disambiguation and capture metadata content not otherwise reflected in standalone statements. In the Dura datasets, the ‘title’ values provided by the host institutions contained information that pertained to discovery context (ie. ‘location of discovery’ (P189)) and/or the scholarly interpretation of content depicted. The institutional labels were thus manually assessed to create new ‘location of discovery’ (P189) and ‘depicts’ (P180) columns; values for each field were either matched with an existing item, or a new Wikidata item for the corresponding value was created (prior to running the batch upload).77 A summary of the properties used to translate the institutional CSV exports and rationale for specific modeling decisions is available on the Wikiproject IDEA page.88
The expanded and modified CSV was then used as the basis for a batch upload. Data was written from the CSV into Wikidata using QuickStatements99 and the DuraEuroposBot.1010
5.1.Getting images into Wikimedia commons
The IDEA team uploaded thousands of images of artifacts to Wikimedia Commons. Wikimedia Commons is a repository for multimedia content that is used by many projects of the Wikimedia Foundation [8].
Other organizations that hold material related to Dura-Europos may also contribute images or models to Wikimedia Commons. Currently four organizations have uploaded media related to materials from Dura-Europos: Yale University Art Gallery, Beinecke Rare Book & Manuscript Library, Bibliothèque nationale de France, and the Department of Near Eastern Antiquities of the Louvre. As additional organizations make the decision to publish media on Wikimedia Commons, additional images may become available. This means that a more holistic set of images could become available for reuse in the Dura-Europos Stories application. Rather than exclusively reusing images from a single institution, the images can be sourced from multiple collections. All of the metadata for these images is presented alongside them in the IIIF viewer we embed into the stories. In this way we are bringing images together from multiple collections, but also clearly communicating the provenance of the images and crediting the institutions that have made them available online.
5.2.Getting polygons into Wikimedia commons
Members of the IDEA team contributed data to Wikimedia Commons in the form of shapefiles, which represent multiple points on a map rather than a single set of coordinates. These shapefiles in Wikimedia Commons can be connected to Wikidata items through the use of Property 3896 ‘geoshape’. In Fig. 3, we see a dark gray polygon on a light brown background that represents the footprint of Tower 3 at the Dura-Europos site. By connecting Wikidata items to shapefiles, we can reuse the data in the shapefile within mapping applications to indicate the layout, position, and relative sizes of these features in the Dura-Europos site.
Fig. 3.
6.Semantic Web
After contributing data related to Dura-Europos to Wikidata, we can leverage additional data from the Semantic Web. An example of data that is not in Wikidata itself, but is connected to items from the Dura-Europos dataset in Wikidata is the Pleiades Gazetteer. The Pleiades Gazetteer is a web-based digital gazetteer that describes places in historical contexts [2]. Wikidata has two properties related to the Pleiades Gazetteer project: Property 1584 ‘Pleiades ID’ and Property 2938 ‘Pleiades category identifier’ which can be used to connect Wikidata items to relevant content from the Pleiades Gazetteer. Currently more than thirty Wikidata items representing parts of Dura-Europos have Pleiades identifiers. This means that for each of those items we can follow that external identifier and find out additional information about these places from the Pleiades Gazetteer. For example, the Dura-Europos synagogue has the Pleiades identifier ‘335307374’. in Fig. 4, we see the Pleiades page for the Dura-Europos synagogue.1111
Fig. 4.
Combining data from many different sources allows us to create a more detailed and complex representation of the people, places, and artifacts related to Dura-Europos.
7.Data quality
Many different stakeholders contribute data to Wikidata related to Dura-Europos. Multiple institutions have collection material related to the Dura-Europos site and excavation and independently create items in Wikidata that describe their holdings. Many student researchers are also involved in projects related to data describing Dura-Europos. With a diverse group of contributors, we employ the strategy of sharing relevant data models on-wiki. We share our data models in Wikidata’s E namespace which is dedicated to schemas [17]. The Wikidata community uses ShEx to encode schemas within the schema namespace. ShEx is a formal data modeling and data validation language for RDF graphs [3].
We compose our schemas in ShExC, the compact syntax. ShExC is concise, which makes it appropriate for human data modelers. It is also machine-actionable, thus our schemas can be used for validation of entity data from Wikidata. In this way we can share our data models with human editors who can then confidently contribute additional data in alignment with these data models. We can also test entity data for conformance with these schemas to identify items that need review or curation.
People looking for schemas in Wikidata’s E namespace use the search bar to find relevant options. By using the pattern E: in combination with a search term, people can search across the labels of the schemas in the E namespace. In Fig. 5, when searching for ‘E:coin hoard’ the schema E366 is returned.
Fig. 5.
In Fig. 6, when searching in Arabic, people will find schema E366. If people who prefer specific languages for interaction add the ‘EntitySchemaHighlighter.js’ user script to their accounts, they will be able to see labels for items and properties used in the schema when they hover.1212 Through the use of this user script, Wikidata editors have multilingual access to schemas making them an important data modeling tool that reduces barriers to communication about data structuring for people from different language contexts. When collaborating with others in different language communities, sharing data models via schemas in Wikidata’s schema namespace allows people to read schemas in the language of their choice.
Fig. 6.
8.Digital storytelling with stories services
The Stories application supports both browsing and searching for stories. The browse page consists of a set of cards representing individual artifacts excavated from Dura-Europos. The cards include an image of the artifact, if available, as well as the Qid for the item representing the artifact in Wikidata as well as a ‘learn more’ button which takes the visitor to the story for the artifact. Visitors can also use the search bar to search for terms, names, places, and more. For example, searching for the word ‘amphora’ in the search bar of the Dura-Europos Stories application will return twenty-six results. A subset of the results can be seen in Fig. 7.
Fig. 7.
We decided to highlight the images of these artifacts as the entry point into the stories. For each artifact with an image, the first moment on the story is the image and descriptive name of the object or location. These images are also used on the cards that make up the Browse page in the application as seen in Fig. 8. The list of moments is presented on the left-hand side of the screen and serves as the navigational menu for each story. Users click on a moment to select it, and then the content is rendered in the primary window within the application. The moment menu is always available so that users can select the next moment of interest. Moments are interactive, and users can explore the content presented within each moment by hovering, clicking, and scrolling.
Fig. 8.
The Sketchfab moment provides a viewer for three dimensional models that have been uploaded to Sketchfab. Sketchfab is a website that allows people to upload 3D models and share them on the web [4]. For example, in one story there is a 3D model of a scutum excavated from Dura-Europos, as seen in Fig. 9. The ability to view a 3D model of an artifact supports users in getting a sense of what the object looks like from different angles and provides a sense of depth. In future work we would like to integrate with additional sources of 3D content, such as tours of the Dura-Europos site itself, building on the work in [13].
Fig. 9.
The Artefact moment provides an overview of the dimension of the objects and the materials of which they are composed, as seen in Fig. 10. This moment allows users to get a sense of how the object may have been created and an understanding of scale.
Fig. 10.
The Map moment is a visualization of any geocoordinates related to the object, such as location of excavation and location of the institution in which it is currently held, as seen in Fig. 11. Plotting these coordinates provides the user with an understanding of how people have moved the object since the time it was excavated. For certain artifacts, such as the Dura-Europos Route Map, this moment provides detailed information of places mentioned in the artifact itself.
Fig. 11.
The Library moment contains a set of shelves on which icons are displayed that represent each article or book that describe these artifacts, as seen in 12. This is one strategy for weaving the images of the artifacts themselves into the context of the scholarly literature which describes them.
Fig. 12.
The Dura-Europos Digital Archive team annotated hundreds of images using the Wikidata Image Positions Tool.1313 This process involved using Wikidata property P2677 ‘relative position within image’ as well as property P180 ‘depicts’ to indicate a where something is and then provide an annotation of what is being depicted. This tool uses the International Image Interoperability Framework (IIIF) to display the image as a IIIF canvas with highlighted rectangles around detail areas with tool tips that provide the label for what is depicted. The ability to hover over different parts of the image and learn more about what the artifact represents provides an interactive experience to people exploring the Dura-Europos Stories application.
The Learn More moment displays cards describing information sources for further reading. As seen in 13 these cards include all external identifiers for the item in the story as well as all references used to support statements on the item. External identifiers are the identifiers for items in systems other than Wikidata. People can quickly gather information about where to go for additional information related to the subject of the story.
Fig. 13.
Many aspects of the stories are customizable. The order of moments, the text, and the colors within moments can all be used with their default settings or with user-selected configurations. We provide a Publisher Workspace for people to configure their stories and customize their moments.
9.Stories services
The Stories Services team created the Dura-Europos Stories application based on the framework we created for Science Stories [16]. These applications are both powered by Wikidata as well as the Stories Services API. We chose to use Python as the primary programming language for the backend of the application. We used a PostgreSQL database to store configuration information related to the presentation metadata of each story. We use Redis1414 to cache Wikidata SPARQL query results.
The Stories Services team created a package for working with Wikidata data in a Django application.1515 The Django-wikidata-api library is designed loosely around the core Django object-relational mapping (ORM), modified to interact with Wikibase via SPARQL queries rather than a relational database. The WikidataItemBase python class within the package has an interface for determining which statements are needed to represent a dataset. This class is also used to generate OpenAPI documentation, and to construct serializers compatible with the Django Rest Framework1616 to provide JSON responses used in the Dura-Europos Stories application.
The frontend of Stories Services provides a consistent design for content syndicated from Wikidata. We use the frontend to render and manage multimedia stories. For story rendering, we developed react-stories-api,1717 a React.js open-source component library. This library controls all visual story elements in the Dura-Europos Stories application. This allows us to decouple infrastructure from presentation so that the Dura-Europos Digital Archive team can host the stories on any domain and data is provided through the Stories Services API. We developed the react-stories-api library to minimize maintenance for people who host their own stories application as a statically-served single-page application (SPA). The library contains rendering components as well as the API client itself. The react-stories-api also provides a component for presenting all stories within a collection as well as search functionality and pagination support. The Dura-Europos Stories website leverages both components to create a full user experience including routing and navigation with no backend server or data store needed. We use Material Design1818 as our design framework and most of the components in the library are built with the Material-UI core library.1919
The Stories Services team also developed a Publisher Workspace to serve as the visual frontend of API operations. The Publisher Workspace can be used to manage the collection and story presentation. The Publisher Workspace can be used by organizations or people who which to customize story structure and styling. While the Stories Services API layer powers the data in the Dura-Europos website, the Publisher Workspace is where administrators can rearrange the ordering of moments, modify the story metadata itself, and most importantly, enhance the stories with curated content such as images, videos, and links found outside of Wikidata. The Publisher Workspace is built using React.js and has react-stories-api as a core dependency. This design allows publishers real-time previews of their story selections.
The React Stories API, as part of the Stories Services framework, provides an interactive, visually-appealing front end for subsets of data in Wikidata. Project teams that have curated specific subsets of Wikidata, and are looking for a front-end solution, may want to consider the Stories Services framework as a way to showcase their data in an engaging way.
10.The case for knowledge graphs in research projects
The data powering this collection of multimedia stories will grow in complexity and depth over time as members of the Wikidata community add additional information to Wikidata related to these artifacts. Each time a story is rendered, the Stories application takes in new information from Wikidata that matches the data model of a story. In this way, if relevant data is added to Wikidata, it will be included in a story the next time that story is viewed.
The multilingual design of Wikidata offers the potential for access points into this collection via more than three hundred human languages. Creating this project within Wikidata means that, as others learn of the project and decide to contribute additional data related to Dura-Europos, they will be able to interact with Wikidata in any of the languages Wikidata supports. The Wikidata platform enables collaboration between people even if they do not share any human languages in common.
Another type of connection that the Wikidata knowledge base stores is information about external data sources. Wikidata does this through the use of external identifiers. External identifiers are a type of Wikidata property. External identifiers are used to store identifiers for items in information systems, databases, or collections outside of Wikidata. While there are not many external identifiers on the items for the artifacts, there are many external identifiers on the items depicted by the artifacts as well as some of the places. Wikidata serves as a hub for external identifiers on the web [10]. Due to the large number of identifiers stored in Wikidata, it has become a efficient place to find many identifiers for a resource with a single search. Thus by contributing data to Wikidata, a research team will likely gain additional sources of related information. This saves time for research teams, as they no longer need to seek out these other sources, or map their data to each additional source individually.
For research projects that contain multilingual data, or those that attract users from diverse language communities, multilingual knowledge graphs offer a structure that enables the creation of multilingual applications. Many of the artifacts in the Dura-Europos collection contain representations of people, objects, places, or symbols. By using the ‘depicts’ property to connect artifacts and the items for the entities represented, we open up pathways to additional information about what is represented. For example, multiple gods and goddesses are depicted among the artifacts at Dura-Europos. Wikidata contains many external identifiers for them. These external identifiers are the entry point into other databases and systems that contain additional content about the gods and goddesses. The Wikidata item for Aphrodite has external identifiers pointing out to forty-five different databases or systems. Using these external identifiers we can quickly locate additional information about Aphrodite from the Oxford Classical Dictionary (P9106), the Getty Iconography Authority File (P5986), the Consortium of European Research Libraries Thesaurus (P1871), and forty-two other systems.2020
Connections between this set of artifact items and other types of items in the knowledge graph provides additional information that supplements the original data set. For example, people who were involved in the Dura-Europos excavation are not described in the set of metadata about the artifacts. The names of these people and information about their involvement are described in archival material and in published documents. As these documents are added to Wikidata, it will be possible to create additional connections between some of the artifacts and the items which represent people. We expect that these connections will be created in the knowledge graph in the future, an example of how the data will grow even after the time of contribution to Wikidata.
Wikidata editors have already connected a number of publications to items related to Dura-Europos. These connections are asserted through the use of ‘main subject’ (P921) as seen in Fig. 14. Connecting the items for scholarly publications to the items that they describe helps people reusing data from Wikidata find groups of publications that are related to topical areas. We anticipate that members of the Wikidata community will add more publications related to Dura-Europos to the knowledge base. We hope that surfacing these publications in the library moments of the Dura-Europos Stories application will allow people interested in this area to find additional information, or to become inspired to add more publications to Wikidata.
Fig. 14.
11.Conclusion
Powering an application with data from a knowledge base such as Wikidata provides several advantages to research teams. Advantages include the fact that Wikimedians build tools for interacting with Wikidata, that Wikidata itself serves as a platform for collaboration, that it is possible for the data to grow over time, that there are more curators looking at and contributing to the data, that it has built-in support for hundreds of human languages, and that it provides connections to the hub of external identifiers on the web.
Transforming the metadata for the artifacts related to the Dura-Europos excavation into a set of Wikidata items and statements enabled us to reuse the data within the Stories Services framework. Using Stories Services, we created a custom application for exploring the dataset. This process also enabled new connections between this dataset and other data on the web. The Stories Services framework was built with the Wikidata data model in mind. After mapping the Dura-Europos data to Wikidata and publishing it to the knowledge base, the Stories Services framework pulls this data into the Dura-Europos Stories application for browsing and display of the content. Similarly, the Dura-Europos community can also make use of a wide range of additional tools built by the Wikidata community for working with Wikidata. If the Dura-Europos team had selected a database for storing their data, not only would the data be siloed, they would not be able to leverage the tooling created by Wikimedians.
Wikidata itself serves as a platform for collaboration. Many organizations hold material related to Dura-Europos. If these organizations decide to describe their materials in Wikidata, Wikipedia or Wikimedia Commons, these platforms function as platforms for collaboration. Shared properties, categories, and links between items serve as the structures of organization that bring together different pieces of information. As more people contribute content related to Dura-Europos to Wikidata, the knowledge base itself serves as the infrastructure for collaboration, and this project will be able to reuse the content others contribute.
The Wikidata community of editors actively engage with the knowledge base every day [1]. If we consider the data in Wikidata that we present in the Dura-Europos Stories application to be the Dura-Europos subset of Wikidata, this subset will grow over time. Contributors to Wikidata may create additional items for other artefacts excavated from Dura-Europos, or add publications related to the excavation or artefacts, or they may add labels for existing items in additional human languages. This model has an advantage over creating a project-specific database that people stop maintaining at the end of a project.
The multilingual design of Wikidata, with support for more than three hundred human languages, has led to the creation of millions of items with labels in many languages. Not only does this mean that more editors from diverse language backgrounds can collaborate and potentially extend data in the domain of your research area, but it also means that applications that reuse data from Wikidata have more multilingual data to show, potentially bringing your research to additional communities of users.
Mapping a new dataset to the Wikidata data model, and contributing the data to the knowledge base replaces the work of structuring the data for a relational database or a set of spreadsheets. We benefit from using Wikidata’s graph of external identifiers to unlock pathways to additional information sources with no extra effort for the research team. Having associations between external resources and the original dataset can increase the number of questions the research team can ask about their domain. It can also make it easier to collaborate with a wider range of other researchers because they may be more comfortable thinking about the dataset in terms of the identifiers from a external resource with which they are already familiar.
We enable users of the Dura-Europos stories application to browse multimedia content related to the archaeological site and the excavated artifacts. The application is powered by data from the Wikidata knowledge base. We present images of the artifacts along with geospatial data about their excavation locations, and metadata about the artefacts. This information is presented in the context of information from the web of linked statements about Dura-Europos. The Dura-Europos Stories application provides an interactive interface for presenting this data that users can explore to learn about the site and the excavation, as well as the material culture represented through the artefacts.
Notes
1 The Dura-Europos Stories project is available at: https://dura-europos.stories.k2.services/stories.
7 For the Dura dataset, all artifacts were given a ‘location of discovery’ (P189): ‘Dura-Europos’ (Q464266). In addition, however, the IDEA team is in the process of defining and publishing location identifiers down to the building level of granularity with Pleiades (https://pleiades.stoa.org/), an online gazetteer authority for ancient Mediterranean place-related data. The Dura-Europos urban gazetteer data from Pleiades will ultimately be mirrored in Wikidata to enable on-the-fly visualizations of artifact discovery locations.
Acknowledgements
The International Digital Dura-Europos Archive (IDEA) has been made possible in part by a major grant from the National Endowment for the Humanities. We would like to thank the Wikidata community for contributing to the knowledge base that anyone can edit.
References
[1] | Active Editors, (2023) , https://stats.wikimedia.org/#/wikidata.org/contributing/active-editors/normal|line|2-year|(page_type)~content*non-content|monthly. |
[2] | E. Barker, R. Simon, L. Isaksen and P. de Soto Cañamares, The Pleiades Gazetteer and the Pelagios Project, (2016) , http://oro.open.ac.uk/48328/4/2016_Simon_Barker_etal_Gazetteers.pdf. |
[3] | I. Boneva, J.E. Labra, S. Hym, E.G. Prud’hommeaux, H. Solbrig and S. Staworko, Validating RDF with shape expressions, arXiv e-prints, (2014) . |
[4] | S. Heath, Digital creation and expression in the context of teaching roman art and archaeology, in: DATAM: Digital Approaches to Teaching the Ancient Mediterranean, (2020) , pp. 149–170. https://hcommons.org/deposits/download/hc:29192/CONTENT/datam_heath.pdf/. |
[5] | E. Hyvönen et al., “Sampo” model and semantic portals for digital humanities on the Semantic Web, in: Proceedings of the Digital Humanities in the Nordic Countries 5th Conference (DHN 2020), CEUR-WS.org, (2020) , http://hdl.handle.net/10138/334001. |
[6] | E. Hyvönen, P. Leskinen, M. Tamper, H. Rantala, E. Ikkala, J. Tuominen and K. Keravuori, BiographySampo – publishing and enriching biographies on the semantic web for digital humanities research, in: European Semantic Web Conference, Springer, (2019) , pp. 574–589. doi:10.1007/978-3-030-21348-0_37. |
[7] | L.-A. Kaffee, A. Piscopo, P. Vougiouklis, E. Simperl, L. Carr and L. Pintscher, A glimpse into babel: An analysis of multilinguality in Wikidata, in: Proceedings of the 13th International Symposium on Open Collaboration, OpenSym ’17, ACM, New York, NY, USA, (2017) , pp. 14:1–14:5. ISBN 978-1-4503-5187-4. doi:10.1145/3125433.3125465. |
[8] | E.J. Kelly, Reuse of Wikimedia commons cultural heritage images on the wider web, Evidence Based Library and Information Practice 14: (3) ((2019) ), 28–51. doi:10.18438/eblip29575. |
[9] | A. Meroño-Peñuela, A. Ashkpour, M. Van Erp, K. Mandemakers, L. Breure, A. Scharnhorst, S. Schlobach and F. Van Harmelen, Semantic technologies for historical research: A survey, Semantic Web 6: (6) ((2015) ), 539–564. doi:10.3233/SW-140158. |
[10] | J. Neubert, Wikidata as a linking hub for knowledge organization systems? Integrating an authority mapping into Wikidata and learning lessons for KOS mappings, in: NKOS@ TPDL, (2017) , pp. 14–25, http://ceur-ws.org/Vol-1937/paper2.pdf. |
[11] | A. Piscopo, L.-A. Kaffee, C. Phethean and E. Simperl, Provenance information in a collaborative knowledge graph: An evaluation of Wikidata external references, in: International Semantic Web Conference, Springer, (2017) , pp. 542–558. doi:10.1007/978-3-319-68288-4_32. |
[12] | S.C. Schmidt, F. Thiery and M. Trognitz, Practices of linked open data in archaeology and their realisation in Wikidata 2: (3), 333–364. doi:10.3390/DIGITAL2030019. |
[13] | Y. Shen, Z. Wang, Q. Sun, A. Chen and H. Rushmeier, Reconstructing Dura-Europos from sparse photo collections using deep contour extraction, in: Eurographics Workshop on Graphics and Cultural Heritage, V. Hulusic and A. Chalmers, eds, The Eurographics Association, (2021) , ISSN 2312-6124. ISBN 978-3-03868-141-0. doi:10.2312/gch.20211408. |
[14] | A. Siebold and M. Valleriani, Digital perspectives in history, Histories 2: (2) ((2022) ), 170–177, https://www.mdpi.com/2409-9252/2/2/13. doi:10.3390/histories2020013. |
[15] | G. Sugimoto, Building linked open date entities for historical research, in: Metadata and Semantic Research: 14th International Conference, MTSR 2020, Madrid, Spain, December 2–4, 2020, Revised Selected Papers, Vol. 14: , Springer, (2021) , pp. 323–335. doi:10.1007/978-3-030-71903-6_30. |
[16] | K. Thornton and K. Seals-Nutt, Science stories: Using IIIF and Wikidata to create a linked-data application, in: International Semantic Web Conference (P&D/Industry/BlueSky), (2018) , http://ceur-ws.org/Vol-2180/paper-68.pdf. |
[17] | K. Thornton, H. Solbrig, G.S. Stupp, J.E. Labra Gayo, D. Mietchen, E. Prud’hommeaux and A. Waagmeester, Using shape expressions (ShEx) to share RDF data models and to guide curation with rigorous validation, in: The Semantic Web, Springer International Publishing, (2019) , pp. C1–C1. doi:10.1007/978-3-030-21348-0_40. |
[18] | A. Učakar, A. Sterle, M. Vuga, T. Trček Pečak, D. Trček, J. Ahtik, K. Košak, D. Muck, H. Gabrijelčič Tomc and T.N. Kočevar, 3D digital preservation, presentation, and interpretation of wooden cultural heritage on the example of sculptures of the FormaViva Kostanjevica Na Krki collection, Applied Sciences 12: (17) ((2022) ), https://www.mdpi.com/2076-3417/12/17/8445. doi:10.3390/app12178445. |
[19] | M. Van Remoortel, J.M. Birkholz, P. De Potter, K. Thornton and K. Seals-Nutt, Linking women editors of periodicals to the Wikidata knowledge graph, (2021) . doi:10.3233/SW-222845. |
[20] | D. Vrandečić and M. Krötzsch, Wikidata: A free collaborative knowledgebase, Communications of the ACM 57: (10) ((2014) ), 78–85. doi:10.1145/2629489. |
[21] | F. Zhao, A systematic review of Wikidata in digital humanities projects, Digital Scholarship in the Humanities ((2022) ), fqac083. doi:10.1093/llc/fqac083. |