Reconstruction of Iberian ceramic potteries using generative adversarial networks

Navarro, Pablo; Cintas, Celia; Lucena, Manuel; Fuertes, José Manuel; Segura, Rafael; Delrieux, Claudio; González-José, Rolando

doi:10.1038/s41598-022-14910-7

Download PDF

Article
Open access
Published: 23 June 2022

Reconstruction of Iberian ceramic potteries using generative adversarial networks

Pablo Navarro^1,2,
Celia Cintas³,
Manuel Lucena^4,5,
José Manuel Fuertes^4,5,
Rafael Segura^4,5,
Claudio Delrieux⁶ &
…
Rolando González-José¹

Scientific Reports volume 12, Article number: 10644 (2022) Cite this article

2555 Accesses
3 Citations
20 Altmetric
Metrics details

Subjects

Abstract

Several aspects of past culture, including historical trends, are inferred from time-based patterns observed in archaeological artifacts belonging to different periods. The presence and variation of these objects provides important clues about the Neolithic revolution and given their relative abundance in most archaeological sites, ceramic potteries are significantly helpful in this purpose. Nonetheless, most available pottery is fragmented, leading to missing morphological information. Currently, the reassembly of fragmented objects from a collection of thousands of mixed fragments is a daunting and time-consuming task done almost exclusively by hand, which requires the physical manipulation of the fragments. To overcome the challenges of manual reconstruction and improve the quality of reconstructed samples, we present IberianGAN, a customized Generative Adversarial Network (GAN) tested on an extensive database with complete and fragmented references. We trained the model with 1072 samples corresponding to Iberian wheel-made pottery profiles belonging to archaeological sites located in the upper valley of the Guadalquivir River (Spain). Furthermore, we provide quantitative and qualitative assessments to measure the quality of the reconstructed samples, along with domain expert evaluation with archaeologists. The resulting framework is a possible way to facilitate pottery reconstruction from partial fragments of an original piece.

A proof of concept for machine learning-based virtual knapping using neural networks

Article Open access 07 October 2021

A deep-learning model for predictive archaeology and archaeological community detection

Article Open access 25 November 2021

A 36,200-year-old carving from Grotte des Gorges, Amange, Jura, France

Article Open access 09 August 2023

Introduction

Material evidence of past foraging populations is a prolific research field in archaeology. Among the many factors that inform the Neolithic transition, ceramic potteries are very informative in terms of cultural selection processes. They are one of the most frequently found archaeological artifacts, as well. Since they are usually short-lived, researchers find these artifacts useful to explore chronological and geographical, given that shape and decoration are subject to significant fashion changes over time and space¹. This gives a basis for dating the archaeological strata, and provides evidence from a large set of valuable data, such as local production, trade relations, and consumer behavior of the local population^2,3,4. Several prior studies analyze various aspects of ceramics using complete pottery profiles. Automatic profile classification^5,6,7,8,9 and feature extraction^{10,11,12,13,14,15,16,17} have been widely studied, ranging from traditional image processing techniques to deep learning approaches. Unfortunately, ceramics are fragile, and therefore most of the actual ceramics recovered from archaeological sites are broken, so the vast majority of the available material appears in fragments. The reassembly of the fragments is a daunting and time-consuming task done almost exclusively by hand, which requires the physical manipulation of the fragments. An intuitive way to understand the fragmentation process, as well as to improve the reconstruction task, is to produce large amounts of potteries imitating the procedures followed by the Iberian craftsmen, breaking them, and then analyzing the resulting sets of fragments. Unfortunately, these and similar manual processing methods for this type of incomplete material are very time-consuming and labor-intensive, even for skilled archaeologists¹⁸. Due to these factors, there is an increasing interest in automatic pottery reassembly and reconstruction^19,20,21 and fragment analysis²². Nonetheless, existing work resolves the fragments problem using comparisons between known pieces. The best match within the dataset is the best fragment for that pottery. Here we propose a deep learning approach in which the “best fragment” is artificially generated based on a set of known fragments in the model, thus creating new virtual pottery with the same features as the real ones. The main contributions of this paper are:

We present IberianGAN, a framework based on generative models that reconstruct pottery profiles from rim or base fragments (see Fig. 1A,B).
We generate artificial fragment samples using a method to partition the full pottery profiles into two parts (resp. base and rim, see Fig. 1C).
We evaluate four more approaches for comparison with our architecture. Furthermore, we validate the five methods using a study based on geometric morphometrics (see Fig. 1D and Fig. 2), a domain experts’ validation and open/closed shape classifier (Fig.S1).

Iberian pottery data

The raw data belong to binary profile images, corresponding to Iberian wheel-made pottery from various archaeological sites of the upper valley of the Guadalquivir River (Spain). The available images consist of a profile view of the pottery, where image resolutions (in pixels), corresponding to size scale, may vary according to the acquisition settings (Fig. S2). We partitioned these images into rim and base portion to simulate the fractures in the profiles. The partitioning criterion and orientation depends on the initial shape (closed or open, see Fig. 1B). The resulting dataset is composed of 1075 images, randomly divided into a training subset containing 752 images (70%), a validation set of 108 (10%), and a test set of 215 images (20% of the total dataset).

Algorithmic background

GANs have shown remarkable results in various computer vision tasks such as image generation^23,24, image translation^25,26, face image synthesis^27,28,29 and recently text^30,31 and audio generation³². A typical GAN³³ framework contains a generative (G) and a discriminative (D) neural network such that G aims to generate realistic samples, while D learns to discriminate if a sample is from the real data distribution (H₀) or not. D(x) should be high when x comes from training data and low when x comes from the generator. The variable z is a latent space vector sampled from a normal distribution. G(z) represents the generator function which maps the latent vector z to data-space of Iberian pottery profiles.

$${\text{min}}_{G}\, {\text{max}}_{D} V\left( {D,G} \right) = {\mathbb{E}}_{{x \sim p_{{\text{data }}} \left( x \right)}} \left[ {{\text{log}}D\left( x \right)} \right] + {\mathbb{E}}_{{z \sim p_{z} \left( z \right)}} \left[ {{\text{log}}\left( {1 - D\left( {G\left( x \right)} \right)} \right)} \right]$$

(1)

Multiple iterations will inform G on how to adjust the generation process to fool D. In our case, the data element x, corresponds to a binary two-dimensional array containing the pottery profile geometry. D(G(z)) is the probability that the output of the generator G is a real sample from the Iberian pottery dataset. D tries to maximize (log D(x)), which is the probability of having a correct classification of actual shapes, while G tries to minimize (log (1 − D(G(x))), which is the probability of D recognizing any of the faked outputs generated by G. Deep Convolutional Generative Adversarial Networks (DCGAN)³⁴ are among the most popular and successful networks designed for GANs. The model is mainly composed with convolution layers without max pooling or fully connected layers. It uses convolutional stride and transposed convolutions for down-sampling and up-sampling. In other works, the vector z is constructed from one or more input images, the generated sample is conditioned by the input. To this type of Autoencoding GAN (AE-GAN) is added a network of encoders that is trained to learn an $E:X \to Z$ function, mapping each real sample to a point (z) in latent space³⁵. The detailed design and implementation of our proposed generative approach is described in “Materials and methods” section.

Results

Results from IberianGAN were compared with multiple approaches based on AE-GAN³⁵. All approaches contain variations in the architecture or training process (see the “Materials and methods” section). We assess the methods across several generative metrics, a geometric morphometric analysis, a validation based on an open and closed shape classifier, and a validation test made by domain experts. Particularly, to evaluate the quality of images produced by IberianGAN, we computed the following generative metrics: Root Mean Square Error (RMSE), Frechet Inception Distance (FID)³⁶, Geometry Score (GS)³⁷, and Dice Coefficient³⁸. RMSE allows evaluating the generated results in comparison with the actual profiles. RMSE quantifies how different two images are. The smaller an RMSE value, the more similar the profiles are. The metric FID is aimed to compare the distribution of generated images with the distribution of real images. A lower FID value indicates better-quality images, and a higher score indicates a lower quality output. The GS allows comparing the topology of the underlying manifolds for two shapes (in this case, actual pottery and synthetic ones) in a stochastic manner³⁷. Low GS values indicate similar topologies between two sets of data. Finally, the Dice coefficient is used to compare two binary images (black or white pixels). The metric takes a normalized value in [0, 1], where 0 means two images are completely different, and 1 occurs when both are the same image. In Table 1 we present the performance metrics for the test set from the Iberian dataset. For RMSE, FID, and DC scores, IberianGAN has a significantly better performance when compared with the architectures presented elsewhere.

Table 1 Quantitative performance evaluation for different approaches using a dataset test.

Full size table

This means that the generated profiles have a similar geometric distribution with respect to the actual samples, and thus the resulting potteries are comparable to the actual samples. A proposed alternative with reinforcement learning (AE-GAN-LR) improves the topology similarity (GS metric). Nonetheless, the topological similarity is not the most relevant factor, and that there is indeed an overlap between synthetic topologies generated by AE-GAN-LR and IberianGAN (see Fig. 3A) we consider that the synthetic samples generated by the latter can be regarded as topologically correct as compared to the actual samples. Furthermore, we evaluated the distribution of data qualitatively. For this, we created a feature space using Principal Component Analysis (PCA) with the images from actual and generated pottery. In Fig. 3B, we observe that the distribution in this feature space of the actual images is similar to the distribution of images generated with IberianGAN. We qualitatively compare the results of all approaches. In Fig. 4, we show some results using the same input and comparing it with the original image of the dataset. As observed, IberianGAN looks at the input image and completes the fragment with convincing results (see further results in Fig. S4). Given the results mentioned above, IberianGAN can be used satisfactorily to estimate missing fragments and provide realistic, complete pottery profiles, maintaining the geometric properties of original potteries.

Shape validation

In the real profile dataset, the base shape of a profile appears in combination with only a subset of the entire rims set (and vice-versa), i.e. not all base/rim combinations are present in real profiles. This is because the entire structure of the pottery is usually designed to serve only one purpose (e.g. liquid storage, cooking, transportation, drinking, ritual, etc.). Some base/rim combinations would create useless or impractical pots (e.g., with a very small base and a large rim). A similar effect is seen when analyzing the design of the projectile point³⁹ where the variations of the designs of the stem and the blade (two parts of a projectile point) of these artifacts are studied in a modular way to determine the relation in the designs of its shapes. Thus, we evaluate the ability of IberianGAN to generate rims with a valid shape from existing bases and vice-versa. Based on³⁹, we extracted semi-landmarks to analyze the shape of the generated fragments. Using the profile dataset of actual pottery, we created a morphometric space using the semi- landmarks of the fragments as input for a PCA. We worked with four morphometric spaces, two for closed and two for open pottery shapes, each one containing its corresponding rims and bases. In order to obtain a metric that allows us to compare generated profiles, we analyze the Euclidean distance between the generated fragments and the real pottery profiles in these morphometric spaces (see a graphical description in Fig. 2). Given a pot generated from an existing fragment (e.g., a rim), we first divide the generated profile and locate the two resulting halves on their corresponding spaces, and then analyze the distance between the actual and the generated fragments (dr in Fig. 2). To evaluate the other half of the generated profile, we use the closest K fragments (K = 50) to the real one (the input fragment) in the first space, and we place its pairs in the other space (in our example, the space generated for all of the real bases). We calculate the minimum distance in this space between the generated fragment and its neighbors in the first space (dg in Fig. 2). This type of morphometric validation establishes the ability of the method to generate a fragment with an actual shape from an input fragment. In Table 2 we show the mean Euclidean distances in all the approaches tested in this work (see “Materials and methods” section). The table presents two parts, corresponding to open and closed shapes. We considered two scenarios, when the input is a rim or is a base. As IberianGAN only generated the unknown fragment, the distances between the input and the known fragment are close to zero. In the approaches where the network generated the shape for the entire profile, the distances between known and unknown fragments are similar.

Table 2 Euclidean distances in morphometric spaces for open and closed Iberian pottery shapes.

Full size table

Sample quality validation based on open and closed shape classifier

Separately from the generative modes, we trained a binary classifier. This model is capable of classifying open and closed vessel profiles. We used pre-trained weights of ResNet-18⁴⁰. This validation aims to verify that the data generated by the different models is able to imitate the real samples and that the classifier can predict the correct classes even when trained with only real data samples. Table S1 shows the classification metrics using the different datasets. In particular, it can be seen that the classifier is not affected by the generated data. Notably, the metrics improve compared to the actual test data portion in all cases. Additionally, in Fig. S1, we can see a graphic representation of the sensitivity versus the specificity of the classifier as the discrimination threshold is varied. This type of result shows that the generated new samples are similar in their distribution and shape to the real data. In addition, note that they do not affect the accuracy of the classifier.

Domain expert validation

We designed an experiment for domain archaeology experts to evaluate the capability of IberianGAN to create pottery profiles with an adequate Iberian style. For this purpose, we present in the form of an online questionnaire a set of images (see Fig. S3) to six archaeologists specialized in Iberian culture. In the survey, we display a random selection of 20 images where half of them correspond to actual Iberian pottery profiles and half IberianGAN-generated. Each image has a multiple-choice to rate it between 0 and 5 to determine the level of similarity with an Iberian style, where 0 means unrelated to the Iberian Style, and 5 means fully within Iberian Style. Overall, generated samples rated 3.88 on average with a standard deviation in 1.43 across all archaeologists and actual samples rated 3.93 ± 1.45. To conclude, the archaeologists consider that the potteries generated have on average an Iberian style similar to that found in actual potteries. This is important since IberianGAN is capable of generating an Iberian-style pottery from an incomplete fragment.

Discussion

Ceramics are one of the most frequently found archaeological artifacts and constitute the central remains usually used to investigate variations in style, materials employed, and manufacturing techniques. Exploring diachronic and geographical variation in pottery is of key importance to reconstruct the dynamics of the Neolithic transition in different regions. However, ceramics are fragile, and therefore most of the recovered material from archaeological sites is broken. Consequently, the available samples appear in fragments. The reassembly of the fragments is a daunting and time-consuming task made almost exclusively by hand, which requires the physical manipulation of the ceramic shreds. Thus, a generative approach, such as IberianGAN, that automatically processes fragments and provides a reconstruction analysis can assist archaeologists in the reassembly process.

Such an approach has a broader impact by providing a general framework for object reassembly. Our proposed framework is flexible to work on different ceramic datasets that present a variety of fractured materials (see results from Roman Pottery in Fig. S6). IberianGAN could be used beyond just ceramic pottery in order to reconstruct other archaeological (e.g. projectile points, historical buildings, etc.) and anthropological remains (e.g. crania, postcranial bones, etc).

We have evaluated the performance of IberianGAN on the basis of three different but complementary approaches: (a) classical metrics to evaluate the generative process of images (see Table 1 and Fig 2); (b) shape analysis based on pottery structure (see section “Results”: Shape validation), and (c) validation via independent examination of archaeologists specialized in Iberian heritage (“Results”: Domain expert validation).

Results obtained under the three approaches suggest that our approach is capable of generating potteries that satisfy the image, pottery morphometric structure, and expert validation criteria. While encouraging performance is achieved with IberianGAN for the prediction of fragments in the database of Iberian wheel-made pottery, some limitations need to be addressed. In general, archaeologists work with fragments belonging to the base or top of the pottery. Therefore the network was trained always using a base or rim fragment, meaning that the model will always position a fragment as a base or rim. Furthermore, our approach uses large fragments during training and evaluation. Additional studies are needed to determine the minimum accepted size of a fragment for the model to perform as expected. Nonetheless, we believe our proposed framework is the first step towards broader use of generative networks for the recognition and assembly of fragments, which will open new avenues of research related to applications on different measurements of fragments and even in 3D ceramics in particular and objects in general.

Related work

Previous research on pottery includes both classical approaches, based on the comparative analysis of shape, dimensions, decoration, technological elements, color, geometric characteristics, axis of symmetry, used materials, etc; and novel methods based on machine learning techniques in general and deep learning in particular applied towards ceramic characterization. As a whole, pottery profiles were used in the context of classification^5,6,7,8,9, and to study variations in shape and/or style attributes^{10,11,12,13,14,15,16,17}. As previously stated, not all the potteries found in the excavations are complete; that is why it is critical to improve characterization methods aimed to identify fragmented ceramics. Rasheed et al¹⁹ presented a method based on a polynomial function for reconstructing pottery from archaeological fragments. Given an image of a fragment, the edge curve is extracted and approximated by a polynomial function to obtain a coefficient vector. The best matching between pairwise pottery fragments is done according to the relationship of their coefficients.

Other authors proposed a method to generate missing pieces of an archaeological find^20,21 departing from a 3D model. In the area where the missing fragments are supposed to be, sketches are created through reverse modeling and consequently used to design the missing fragments. Finally, the digital reproduction of the missing part is achieved Additive Manufacturing technology.

GANs have shown remarkable results in various application domains. Their ability to learn complex distributions and generate semantically meaningful samples has led to multiple variations in network design and novel training techniques (GANs³³, conditional GANs⁴¹, InfoGAN⁴², BAGAN⁴³). Customized loss functions (Content loss⁴⁴, Cycle consistency loss⁴⁵), and domain adaptation approaches (ADDA⁴⁶, CycleGAN⁴⁷), etc. A more comprehensive review of the different GAN variants and training techniques can be found in^48,49,50.

Furthermore, there are multiple examples of GANs applied to cultural heritage domains. For instance, techniques such as automated image style transfer⁵¹, were used to develop a model aimed to generate Cantonese porcelain styled images departing from user-defined masks. Similar techniques were also applied to material degradation^{47,52,53,54,55,56}. Hermoza et al.⁵⁷, for instance, introduced ORGAN, a 3D reconstruction GAN to restore archaeological objects. This is based on an encoder-decoder 3D DNN on top of a GAN based on cGANs⁴¹. This network can predict the missing parts of an incomplete object. A similar approach is followed in ⁵⁸, where a Z-GAN translates a single image of a damaged object into voxels in order to reconstruct the original piece. Both studies address the problem of the prediction of missing geometry on damaged objects that have been 3D modeled and voxelized. More specifically, these studies depart from the assumption that man-made objects exhibit some kind of structure and regularity. The most common type of structure used is symmetry. Starting from a GAN they learn the structure and regularity of a collection of known objects and use it to complete and repair incomplete damaged objects. Another example of cultural heritage preservation can be found in reference⁵⁹, where an image completion approach is adapted⁶⁰ for the curation and completion of damaged artwork.

Materials and methods

We designed, trained, and evaluated five different generative networks based on AE-GAN but used multiple training procedures during the experimentation phase. In this section, we detailed each incremental strategy applied in the process and their corresponding hyperparameters and training techniques and setup. The data and source code with the hyperparameter setup and different approaches analyzed in this study are openly available in IberianGAN at https://github.com/ celiacintas/vasijas/tree/iberianGAN for extension and replication purposes.

Experimental setup and training

All the resulting networks were trained for 5000 epochs using Binary Cross Entropy as a loss function, at a learning rate 2 × 10⁻⁴ for the generative network (G) and 2 × 10⁻⁵ for the discriminator (D). To optimize the training process of all models, we scaled the images to a uniform resolution of 128 × 128 pixels and inverted the colors. We applied data augmentation, particularly a random rotation (between 0 and 45 degrees). We used ADAM optimization⁶¹ for both G and D with β₁ = 0.5 and β₂ = 0.999 and used Binary Cross Entropy as the loss function. Particularly, for the training of D, we used Label Smoothing⁶², the real set is represented with a random number between 0.7 and 1.2 and the generated set with 0.0 and 0.33.

AE-GAN incremental variations and IberianGAN

Initially, we trained a typical AE-GAN to generate a complete pottery profile. We implemented a generator (G) with an architecture that allows two images as input and a discriminator (D) with three input images (the two inputs and the generated image). During training and to speed the convergence process of G, we create different input types with the same probability and select a pair of images. The possible input types were rim/base (or base/rim), base/black image or rim/black image (see Fig. S5-A). Subsequently, aiming to obtain a translation from the input fragment to the complete pottery profile, we modify the architecture of the encoder in the AE-GAN part of the generator, called AE-GAN-MP. In this case, the generator encoder processes one input image at a time. We do this to embed the input images separately and apply a max-pooling layer to join the two representations, see Fig. S5-B. This modification allows more variability in the representation for generating the full profile.

Additionally, we define a new loss function for training the generator of the AE-GAN-MP architecture. Using the strategy of multiple types of input (rim/base, base/rim, rim/black, and base/black), we compute this new loss function only when the inputs are complete (e.g., rim/base or base/rim). For this, we use Mean Square Error (MSELoss) defined as follows:

$${\text{L}}\left( {y,\hat{y}} \right) = \frac{1}{N}\sum_{i = 0}^{N} \left( {y - \hat{y}_{i} } \right)^{2}$$

(2)

where $\hat{y}$ is the predicted pottery and ${\text{y}}$ is the real example. The goal is that the generator minimizes the MSE error between the result and the target (real pottery profile). Finally, to obtain a stronger relationship between the inputs and the generated pottery we design a strategy to modify the resulting pottery (or iterate to get a more precise result). We do this by using the input with the previous result to generate new pottery (the final result) with two iterations. The intermediate result adds to the input using image matrix operations, see Fig. S5-C. We called this approach AE-GAN with reinforcement learning (AE-GAN-RL).

IberianGAN is based on the AE-GAN, where the generator is an Autoencoding network ${\text{Encode}}\left( {\text{x}} \right) \to {\text{z }} \in {\text{R}}^{{\text{m}}} ,{\text{ Decode}}\left( {\text{Z}} \right) \to {\text{x}}^{{\prime }}$, where ${\text{x}} \in \left[ {0,1} \right]^{{{\text{m}} \times {\text{m}}}}$, is the input fragment, a binary two-dimensional array containing the fragment shape information, and x' is a missing generated part. To train the discriminator network, we use D(y) where ${\text{y }} = {\text{ x }} + {\text{ x}}^{{\prime }}$ for the examples generated. At this point, the network generates only an unknown fragment and the discriminator is trained with the complete profile. As IberianGAN only generates the missing fragment, for its training process, it is not necessary to use two images as input (see Fig. 1A). For training, we only use an image that corresponds to the base or the rim of the profile. The complete definition, implementation, training and evaluation of IberianGAN can be found here: https://github.com/celiacintas/vasijas/tree/iberianGAN.

Evaluation metrics

In this section, we show the process of evaluating the quality of the generated samples. To compare the results of the different approaches, we use two approaches. First, a set of measurements used to evaluate GANs, these metrics refer to the general distribution of all the potteries generated. Additionally, we use metrics comparing the result obtained with the actual potteries, for example, to evaluate the known fragments in the potteries generated. For the first type, we consider evaluating the distribution and shape of the generated profiles. First, we use Frechet Inception Distance (FID)³⁶, which is currently one of the most common metrics to evaluate GANs⁶³. FID allows the quantification of the differences in the density of two distributions in the high-dimensional feature space of an InceptionV3 classifier⁶⁴. In detail, FID embeds the images in a descriptor space (defined by an intermediate layer of Inception-V3) with a high level of abstraction. This feature space is used to calculate the mean and variance of the generated data and the actual data. The Fletcher distance is calculated between these distributions. FID is calculate following this equation:

$$FID\left( {r,g} \right) = ||\mu_{r} - \mu_{g}||^{2}_{2} + {\text{Tr}}\left( {\Sigma_{r} + \Sigma_{g} - 2\sqrt {\Sigma_{r} \Sigma_{g} } } \right)$$

(3)

where (µ_r, Σ_r) and (µ_g, Σ_g) are the mean and co-variance of the actual data and the generated distributions, respectively. Small distances indicate that the distribution of the data is similar, in our case, that the generated potteries have a distribution similar to the real ones. FID is based on a classifier network. It has been shown that this type of metric focuses on textures rather than shapes⁶⁵, so we decided to evaluate the approaches with a shape-based metric, Geometry Score (GS)³⁷.

GS is a metric for comparing the topological properties of two data sets. Formally GS is the l2 distance between means of the relative lifetime vectors (RLT) associated with the two groups of images. The RLT of a group of images (encoded in a feature space, for example) is an infinite vector (v₁, v₂, ..., v_i) where the i-th entry is a measure of persistent intervals that have a persistent homologous group rank equal to i. v_i is defined as follows:

$$v_{i} = \frac{1}{{d_{{n\left( {n - 1} \right)/2}} }}\sum_{j = 1}^{{n\left( {n - 1} \right)/2}} I_{j} \left( {d_{j} + 1 - d_{j} } \right)$$

(4)

where I_j = 1 is the rank of a persistent homologous group with dimension 1 in the interval [d _j, d _j + 1] is i and I_j = 0 is the opposite³⁷. Low GS values indicate similar topology between the set of images. On the other hand, for the second group of metrics, we evaluate the results against the complete potteries. It is important to clarify that we do not try that the results are the same as the actual pottery since it is generated only with a fragment, with this objective, we use two frequent metrics in image processing, Root Mean Square Error (RMSE) and DICE Coefficient³⁸.

RMSE is a metric that enables similarity comparisons between two samples (pottery profiles in this case). This is measured using the square root of the average of the squared differences between the pixels in the generated image and the actual image. The RMSE between a actual profile image, (image d) and the generated image, (image f ) is given by

$$\sqrt {\frac{1}{n}\sum_{i = 1}^{n} \left( {d_{i} - f_{i} } \right)^{2} }$$

(5)

This metric is calculated pixel by pixel, where d_i and f_i are the pixels of the image D and F respectively. In this formula, low RMSE values show a minor error. The DICE coefficient allows to evaluate the geometry between the generated profile and the real one. This metric is commonly used to evaluate results in segmentation networks⁶⁶. That is why to calculate the DICE coefficient, the images must be binary (black and white). This coefficient evaluates the images as two overlays of shapes. To do this, the region of the generated image and the region of the actual profile are calculated. Given the generated profile A and the real profile B, DICE is calculated³⁸:

$$DICE = \frac{{2\left| {A \cup B} \right|}}{\left| A \right| + \left| B \right|}$$

(6)

where A and B is the size in pixel of the profile. The maximum value of the metric is 1 when the shape is identical to the real one and 0 when the total shape does not match.

Data availability

The data and code that support the findings of this study are openly available in IberianGAN at https://github.com/celiacintas/vasijas/tree/iberianGAN for extension and replication purposes.

References

Eslami, D., Di Angelo, L., Di Stefano, P. & Pane, C. Review of computer-based methods for archaeological ceramic sherds reconstruction. Virtual Archaeol. Rev. 11, 34–49 (2020).
Article Google Scholar
Orton, C., Tyers, P. & Vinci, A. Pottery in Archaeology (Cambridge University Press, 1993).
Google Scholar
Kampel, M. & Sablatnig, R. An automated pottery archival and reconstruction system. J. Vis. Comput. Animat. 14, 111–120 (2003).
Article Google Scholar
Kashihara, K. An intelligent computer assistance system for artifact restoration based on genetic algorithms with plane image features. Int. J. Comput. Intell. Appl. 16, 1750021 (2017).
Article Google Scholar
Lucena, M., Fuertes, J. M., Martinez-Carrillo, A. L., Ruiz, A. & Carrascosa, F. Efficient classification of Iberian ceramics using simplified curves. J. Cult. Herit. 19, 538–543. https://doi.org/10.1016/j.culher.2015.10.007 (2016).
Article Google Scholar
Lucena, M., Fuertes, J. M., Martínez-Carrillo, A. L., Ruiz, A. & Carrascosa, F. Classification of archaeological pottery profiles using modal analysis. Multimed. Tools Appl. 76, 21565–21577. https://doi.org/10.1007/s11042-016-4076-9 (2017).
Article Google Scholar
Cintas, C. et al. Automatic feature extraction and classification of Iberian ceramics based on deep convolutional networks. J. Cult. Herit. 41, 106–112. https://doi.org/10.1016/j.culher.2019.06.005 (2020).
Article Google Scholar
Llamas, J., Lerones, P. M., Zalama, E. & Gómez-García-Bermejo, J. Applying deep learning techniques to cultural heritage images within the INCEPTION project. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 10059 LNCS, 25–32. https://doi.org/10.1007/978-3-319-48974-2_4 (2016).
Di Angelo, L., Di Stefano, P. & Pane, C. Automatic dimensional characterisation of pottery. J. Cult. Herit. 26, 118–128. https://doi.org/10.1016/j.culher.2017.02.003 (2017).
Article Google Scholar
Shennan, S. & Wilcock, J. Shape and style variation in central German bell beakers. Sci. Archaeol. 15, 17–31 (1975).
Google Scholar
Rice, P. M. Pottery Analysis (University of Chicago Press, 1987).
Google Scholar
Nautiyal, V. et al. Geometric modeling of indian archaeological pottery: A preliminary study. In Clark, J. & Hagemeister, E. (eds.) Exploring New Frontiers in Human Heritage. CAA2006. Computer Applications and Quantitative Methods in Archaeology (Fargo, United States, 2006).
Mom, V. SECANTO—The SECtion Analysis TOol. In Figueiredo, A. & Velho, G. L. (eds.) The world is in your eyes. CAA2005. Computer Applications and Quantitative Methods in Archaeology, 95–101 (Tomar, Portugal, 2007).
Saragusti, I., Karasik, A., Sharon, I. & Smilansky, U. Quantitative analysis of shape attributes based on contours and section profiles in artifact analysis. J. Archaeol. Sci. 32, 841–853 (2005).
Article Google Scholar
Karasik, A. & Smilansky, U. Computerized morphological classification of ceramics. J. Archaeol. Sci. 38, 2644–2657 (2011).
Article Google Scholar
Smith, N. G. et al. The pottery informatics query database: A new method for mathematic and quantitative analyses of large regional ceramic datasets. J. Archaeol. Method Theory 21, 212–250. https://doi.org/10.1007/s10816-012-9148-1 (2014).
Article ADS Google Scholar
Navarro, P. et al. Learning feature representation of iberian ceramics with automatic classification models. J. Cult. Herit. 48, 65–73. https://doi.org/10.1016/j.culher.2021.01.003 (2021).
Article Google Scholar
Di Angelo, L., Di Stefano, P. & Pane, C. An automatic method for pottery fragments analysis. Measurement 128, 138–148. https://doi.org/10.1016/j.measurement.2018.06.008 (2018).
Article ADS Google Scholar
Rasheed, N. A. & Nordin, M. J. A polynomial function in the automatic reconstruction of fragmented objects. J. Comput. Sci. 10, 2339–2348 (2014).
Article Google Scholar
Fragkos, S., Tzimtzimis, E., Tzetzis, D., Dodun, O. & Kyratsis, P. 3D laser scanning and digital restoration of an archaeological find. MATEC Web Conf. 178, 03013. https://doi.org/10.1051/matecconf/201817803013 (2018).
Article Google Scholar
Kalasarinis, I. & Koutsoudis, A. Assisting pottery restoration procedures with digital technologies. Int. J. Comput. Methods Herit. Sci. IJCMHS 3, 20–32 (2019).
Article Google Scholar
Chateau-Smith, C. A computer tool to identify best matches for pottery fragments. J. Archaeol. Sci. Rep. 37, 102891. https://doi.org/10.1016/j.jasrep.2021.102891 (2021).
Article Google Scholar
Emami, H., Dong, M., Nejad-Davarani, S. P. & Glide-Hurst, C. K. Generating synthetic CTS from magnetic resonance images using generative adversarial networks. Med. Phys. 45, 3627–3636 (2018).
Article Google Scholar
Han, C. et al. Gan-based synthetic brain MR image generation. In 2018 IEEE 15th International Symposium on Biomedical Imaging (ISBI 2018), 734–738 (2018).
Zhu, J.-Y. et al. Toward multimodal image-to-image translation. In Advances in Neural Information Processing Systems, 465–476 (2017).
Armanious, K. et al. Medgan: Medical image translation using gans. Comput. Med. Imaging Graph. 79, 101684 (2020).
Article Google Scholar
Karras, T., Aila, T., Laine, S. & Lehtinen, J. Progressive growing of gans for improved quality, stability, and variation. arXiv preprint arXiv:1710.10196 (2017).
Ye, L., Zhang, B., Yang, M. & Lian, W. Triple-translation gan with multi-layer sparse representation for face image synthesis. Neurocomputing 358, 294–308 (2019).
Article Google Scholar
Zhang, H. et al. Stackgan++: Realistic image synthesis with stacked generative adversarial networks. IEEE Trans. Pattern Anal. Mach. Intell. 41, 1947–1962 (2018).
Article Google Scholar
Chen, L. et al. Adversarial text generation via feature-mover’s distance. In NIPS, 4666–4677 (2018).
Xu, J., Ren, X., Lin, J. & Sun, X. Diversity-promoting gan: A cross-entropy based generative adversarial network for diversified text generation. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, 3940–3949 (2018).
Lorenzo-Trueba, J. et al. Can we steal your vocal identity from the internet?: Initial investigation of cloning Obama’s voice using Gan, Wavenet and low-quality found data. arXiv preprint arXiv:1803.00860 (2018).
Goodfellow, I. et al. Generative adversarial nets. Adv. Neural Inf. Process. Syst. 27, 2516 (2014).
Google Scholar
Radford, A., Metz, L. & Chintala, S. Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434 (2015).
Lazarou, C. Autoencoding generative adversarial networks. arXiv preprint arXiv:2004.05472 (2020).
Heusel, M. et al. Gans trained by a two time-scale update rule converge to a Nash equilibrium. CoRR abs/1706.08500 (2017).
Khrulkov, V. & Oseledets, I. Geometry score: A method for comparing generative adversarial networks. arXiv preprint arXiv:1802.02664 (2018).
Sorensen, T. A. A method of establishing groups of equal amplitude in plant sociology based on similarity of species content and its application to analyses of the vegetation on Danish commons. Biol. Skar. 5, 1–34 (1948).
Google Scholar
de Azevedo, S., Charlin, J. & González-José, R. Identifying design and reduction effects on lithic projectile point shapes. J. Archaeol. Sci. 41, 297–307 (2014).
Article Google Scholar
He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016).
Mirza, M. & Osindero, S. Conditional generative adversarial nets. CoRR abs/1411.1784 (2014).
Chen, X. et al. Infogan: Interpretable representation learning by information maximizing generative adversarial nets. Adv. Neural Inf. Process. Syst. 29, 1247 (2016).
Google Scholar
Mariani, G., Scheidegger, F., Istrate, R., Bekas, C. & Malossi, C. Bagan: Data augmentation with balancing gan. https://doi.org/10.48550/ARXIV.1803.09655 (2018).
Azadi, S. et al. Multi-content gan for few-shot font style transfer. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018).
Wang, L., Sindagi, V. A. & Patel, V. M. High-quality facial photo-sketch synthesis using multi-adversarial networks. 2018 13th IEEE Int. Conf. on Autom. Face & Gesture Recognit. (FG 2018) 83–90 (2018).
Tzeng, E., Hoffman, J., Saenko, K. & Darrell, T. Adversarial discriminative domain adaptation. https://doi.org/10.48550/ARXIV.1702.05464 (2017).
Zhu, J.-Y., Park, T., Isola, P. & Efros, A. A. Unpaired image-to-image translation using cycle-consistent adversarial networks. In 2017 IEEE International Conference on Computer Vision (ICCV), 2242–2251. https://doi.org/10.1109/ICCV.2017.244 (2017).
Shamsolmoali, P. et al. Image synthesis with adversarial networks: A comprehensive survey and case studies. Inf. Fusion 72, 126–146. https://doi.org/10.1016/j.inffus.2021.02.014 (2021).
Article Google Scholar
Creswell, A. et al. Generative adversarial networks: An overview. IEEE Signal Process. Mag. 35, 53–65. https://doi.org/10.1109/MSP.2017.2765202 (2018).
Article ADS Google Scholar
Wang, Z., She, Q. & Ward, T. E. Generative adversarial networks in computer vision: A survey and taxonomy. ACM Comput. Surv. 54, 2514. DOI: https://doi.org/10.1145/3439723 (2021).
Chen, S. et al. Cantonese porcelain image generation using user-guided generative adversarial networks. IEEE Comput. Graph. Appl. 40, 100–107. https://doi.org/10.1109/MCG.2020.3012079 (2020).
Article PubMed Google Scholar
Papadopoulos, S., Dimitriou, N., Drosou, A. & Tzovaras, D. Modelling spatio-temporal ageing phenomena with deep generative adversarial networks. Signal Process. Image Commun. 94, 156 (2021).
Article Google Scholar
Liu, M.-Y., Breuel, T. & Kautz, J. Unsupervised Image-to-Image Translation Networks, 700–708 (CVPR, 2017).
Isola, P., Zhu, J.-Y., Zhou, T. & Efros, A.A. Image-to-image translation with conditional adversarial networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1125–1134 (2017).
Wang, H., He, Z., Huang, Y., Chen, D. & Zhou, Z. Bodhisattva head images modeling style recognition of Dazu Rock Carvings based on deep convolutional network. J. Cult. Herit. 27, 60–71. https://doi.org/10.1016/j.culher.2017.03.006 (2017).
Article Google Scholar
Zachariou, M., Dimitriou, N. & Arandjelovic, O. Visual reconstruction of ancient coins using cycle-consistent generative adversarial networks. Science 2, 124. https://doi.org/10.3390/sci2030052 (2020).
Article Google Scholar
Hermoza, R. & Sipiran, I. 3D reconstruction of incomplete archaeological objects using a generative adversarial network. In Proceedings of Computer Graphics International 2018, 5–11 (ACM, 2018).
Kniaz, V. V., Remondino, F. & Knyaz, V. A. Generative adversarial networks for single photo 3D reconstruction. In ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, vol. 42, 403–408 (2019).
Jboor, N., Belhi, A., Al-Ali, A., Bouras, A. & Jaoua, A. Towards an inpainting framework for visual cultural heritage. In IEEE Jordan International Joint Conference on Electrical Engineering and Information Technology (JEEIT), 602–607. https://doi.org/10.1109/JEEIT.2019.8717470 (Amman, Jordan, 2019).
Yeh, R. et al. Semantic image inpainting with deep generative models. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 5485–5493 (2017).
Kingma, D. P. & Ba, J. L. Adam: a Method for Stochastic Optimization. Int. Conf. on Learn. Represent. 2015 1–15 (2015). 1412.6980.
Salimans, T. et al. Improved techniques for training gans. Adv. Neural Inf. Process. Syst. 29 (2016).
Nunn, E. J., Khadivi, P. & Samavi, S. Compound frechet inception distance for quality assessment of gan created images. arXiv preprint arXiv:2106.08575 (2021).
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J. & Wojna, Z. Rethinking the inception architecture for computer vision. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2818–2826 (2016).
Karras, T. et al. Analyzing and improving the image quality of stylegan. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 8110–8119 (2020).
Feng, Y. et al. A comparative study of automatic image segmentation algorithms for target tracking in MR-IGRT. J. Appl. Clin. Med. Phys. 17, 441–460 (2016).
Article Google Scholar

Download references

Acknowledgements

This work was supported by the European Union through the Operative Program FEDER Andalucía 2014-2020 Research Project under Grant UJA-1381115 and UJA-1265116, the Center for Advanced Studies in Information and Communication Technologies (CEATIC) and the Research University Institute for Iberian Archaeology of the University of Jaén.

Author information

Authors and Affiliations

Instituto Patagónico de Ciencias Sociales y Humanas, Centro Nacional Patagónico, CONICET, Bv. Almirante Brown 2915, 9120, Puerto Madryn, PC, Argentina
Pablo Navarro & Rolando González-José
Departamento de Informática (DIT), Facultad de Ingeniería, Universidad Nacional de La Patagonia San Juan Bosco, Mitre 665, 9100, Trelew Chubut, PC, Argentina
Pablo Navarro
IBM Research Africa, Catholic University of Eastern Africa Campus, Bogani E Rd, Nairobi, 00200, PC, Kenya
Celia Cintas
Department of Computer Science, University of Jaén, Campus Las Lagunillas s/n, 23071, Jaén, PC, Spain
Manuel Lucena, José Manuel Fuertes & Rafael Segura
Research University Institute for Iberian Archaeology, University of Jaén, Campus Las Lagunillas s/n, 23071, Jaén, PC, Spain
Manuel Lucena, José Manuel Fuertes & Rafael Segura
Departamento de Ingeniería Eléctricarica y de Computadoras, Universidad Nacional del Sur, and CONICET, San Andre´s 800, Campus Palihue, 8000, Bahía Blanca, PC, Argentina
Claudio Delrieux

Authors

Pablo Navarro
View author publications
You can also search for this author in PubMed Google Scholar
Celia Cintas
View author publications
You can also search for this author in PubMed Google Scholar
Manuel Lucena
View author publications
You can also search for this author in PubMed Google Scholar
José Manuel Fuertes
View author publications
You can also search for this author in PubMed Google Scholar
Rafael Segura
View author publications
You can also search for this author in PubMed Google Scholar
Claudio Delrieux
View author publications
You can also search for this author in PubMed Google Scholar
Rolando González-José
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

P.N., C.C., M.L., J.M.F., R.S., C.D and R.G.-J. designed research; P.N., C.C., M.L. and J.M.F. performed research; P.N., C.C., M.L., J.M.F., C.D and R.G.-J. analyzed data; and P.N., C.C., M.L., J.M.F., C.D and R.G.-J. wrote the paper.

Corresponding author

Correspondence to Rolando González-José.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Navarro, P., Cintas, C., Lucena, M. et al. Reconstruction of Iberian ceramic potteries using generative adversarial networks. Sci Rep 12, 10644 (2022). https://doi.org/10.1038/s41598-022-14910-7

Download citation

Received: 09 February 2022
Accepted: 14 June 2022
Published: 23 June 2022
DOI: https://doi.org/10.1038/s41598-022-14910-7

This article is cited by

AI-enabled Underground Water Pipe non -destructive Inspection
- Georgios-Fotios Angelis
- Dimitrios Chorozoglou
- Dimitrios Tzovaras
Multimedia Tools and Applications (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.