Abstract
As computed tomography and related technologies have become mainstream tools across a broad range of scientific applications, each new generation of instrumentation produces larger volumes of more-complex 3D data. Lagging behind are step-wise improvements in computational methods to rapidly analyze these new large, complex datasets. Here we describe novel computational methods to capture and quantify volumetric information, and to efficiently characterize and compare shape volumes. It is based on innovative theoretical and computational reformulation of volumetric computing. It consists of two theoretical constructs and their numerical implementation: the spherical wave decomposition (SWD), that provides fast, accurate automated characterization of shapes embedded within complex 3D datasets; and symplectomorphic registration with phase space regularization by entropy spectrum pathways (SYMREG), that is a non-linear volumetric registration method that allows homologous structures to be correctly warped to each other or a common template for comparison. Together, these constitute the Shape Analysis for Phenomics from Imaging Data (SAPID) method. We demonstrate its ability to automatically provide rapid quantitative segmentation and characterization of single unique datasets, and both inter-and intra-specific comparative analyses. We go beyond pairwise comparisons and analyze collections of samples from 3D data repositories, highlighting the magnified potential our method has when applied to data collections. We discuss the potential of SAPID in the broader context of generating normative morphologies required for meaningfully quantifying and comparing variations in complex 3D anatomical structures and systems.
Similar content being viewed by others
Introduction
Imaging and data complexity
Advances in modern digital imaging methods are revolutionizing a wide range of scientific disciplines by facilitating the acquisition of huge amounts of 3D volumetric data that represent objects of scientific interest with unprecedented fidelity. Such data can often replace examination of physical objects and thereby increase the quantitative rigor and computational power that can be aimed at the analysis of those objects1. This is particularly true in the fields of evolutionary biology and paleontology where 3D volume data avail both the external and internal complex geometry of museum specimens to detailed visualization and quantitative study. In addition, digitizing and virtualizing research objects such as those in museum collections allows them to be deposited in web repositories or broadly shared, thereby increasing accessibility and value of these collections for research and public benefit, while simultaneously reducing the risk of damage to fragile specimens due to handling. Methods such as MRI and CT are rapidly changing the world of evolutionary biology research and have had a profound influence on scientific discovery. These data are just the starting point for the scientific exploration that modern computational and visualization methods enable. In particular, our ability to understand phenomic effects of genetic variation is limited by availability of quantitative representations of the morphological/anatomical phenomes in this case). Therefore, the more advanced our ability to characterize morphological variation, the more we can eventually understand about genetic variation.
These advances come at a cost. While the reservoir of digitized specimen data is growing rapidly, as evidenced from the frequency of published papers using digital volume data2,3,4,5,6,7,8,9,10,11,12,13,14, the cumulative impact and power of these data remains unrealized because of the labor intensive data analysis. As dataset size, dimensionality and complexity grow along with sample sizes (number of datasets available), so does the need for increasingly efficient and accurate algorithms capable of robustly extracting quantitative parameters relevant to research questions. The difficulties are not only computational, as data sizes and dimensionalities continue to increase, but also, perhaps surprisingly, theoretical as well: The vast majority of studies still rely on surfaced based analysis methods, and thus do not fully exploit the volumetric nature of the data and the vast amount of information it contains.
In short, the great potential that these advanced digital imaging systems have for making sense of biological diversity remains untested in many ways because of the significant difficulties in analyzing the increasingly complex volumes of data.
Phenomics and computational morphology
During the past decade, biology has been revolutionized by enormous advances in molecular and cellular biology. Conspicuously, comparative morphology has been slower to evolve during this same period15, even though great technological advances in imaging can provide volumetric digital data of exquisite resolution. One reason why morphometrics has lagged behind genomics is the difficulty in characterizing and comparing complex morphological variations in 3D volumetric data16. Moreover, volumetric morphological datasets are also orders of magnitude larger than sequence data.
Broadly speaking, revitalizing the study of anatomical aspects of organismal phenome (i.e., morphology) is critical, as patterns of shape variation in biological structures form the basis for understanding important aspects of gene function17,18, developmental mechanisms19, ecological adaptation20,21, and evolutionary history22. This is also an important short-fall because as research designs in phylogenetics become increasingly multi-factorial and complex, there is an increased need for more accurate and sophisticated techniques to measure morphological variation, particularly as modern scanners provide increasing resolving power. Further—though amazing insight continues to flow from rapid and profound advances in genomics—there is growing recognition that the utility and information content of genetic data will only reach its fullest extent once data on associated phenotypes can be analyzed at equivalent rates and scales23. This is a central issue for the emerging field of phenomics24.
Geometric morphometrics has emerged as an important tool for phenomics, becoming commonly used to quantify morphology, wherein landmark points are identified in images and then are fit to a warped mesh that provides a common coordinate system in which different specimens can be compare25,26,27,28,29,30,31,32. Landmark coordinates are the basis of statistical comparisons of geometric morphometrics. These coordinates are intended to be biologically equivalent on all specimens analyzed and they are usually defined manually, though more recently new algorithmic approaches for automatically placing landmarks have begun to arise16,33,34,35,36 and statistically compare morphologies based on these points. However, current methods remain primarily surface-based37,38,39,40,41. In fact, true volumetric analysis should be much better suited to detecting and comparing subtle morphological variations and is a more efficient use of volumetric data. But the extension to volumetric analysis comes at the cost of an increased complexity of the analysis.
Shape analysis for phenomics from imaging data (SAPID)
Over the last several years, our (Dr Frank and Dr Galinsky) Center for Scientific Computation in Imaging (CSCI, http://csci.ucsd.edu) has developed a theoretical and computational methodology to address the two major technological issues involved in the application of shape analysis to 3D volumetric data to phenomics: (1) characterizing and (2) comparing shapes. Characterizing volumetric geometric properties of different tissue types within a sample involves shape analysis and image segmentation and is done on single specimens, while comparing geometric characteristics involves image registration and is done between specimens. The two algorithms we have developed for addressing these issues are the Spherical Wave Decomposition (SWD)42 for shape analysis and image segmentation, and symplectomorphic registration with phase space regularization by ESP (SYMREG)43 for registration. Together, we collectively call these the Shape Analysis for Phenomics from Imaging Data (SAPID) toolkit.
The technical details of these methods have been discussed elsewhere42,43, so here we will first present the methodology in more descriptive and intuitive form. Our main objective in this paper though is to describe the application of this methodology to different volumetric datasets from three collaborators (Drs Rowe, Boyer, and Witmer) who make heavy use of imaging in their ongoing research in evolutionary biology and paleontology. The intent is to show that this small sample of data from a broad spectrum of applications will serve as sufficient demonstration of the practical utility of our approach in solving computational morphology problems of widespread applicability. Data used in this study are shown in Table 1.
The theoretical foundations and computational implementations of the SAPID methods SWD and SYMREG have been described in detail in previous publications42,43. These papers are necessarily quite technical in order to describe the details of the algorithm implementations. Here we will focus on a more intuitive description of the methods with some illustrative examples aimed at elucidating the important features of the methods.
Results
Example 1: analysis of CT growth series data of stained Monodelphis specimens (Rowe Lab)
In the late 1980s and early 90s Dr Rowe’s team assembled a calibrated growth series of the opossum Monodelphis domestica, a popular experimental model species for early-diverging mammals. It was provided by the Southwestern Foundation for Biomedical Research, an NIH-sponsored facility that supports research on model organisms. The collection consists of \(\approx 200\) specimens some of which were cleared and double-stained, and others skeletonized. Six of the small skulls representing different levels of maturity (post-natal days 27, 48, 57, 75, 90, and a retired breeder) were the first to be CT scanned using an industrial high-resolution scanner at the University of Texas High-Resolution X-ray Computed Tomography Facility (UTCT). At that time it was a state-of-the-art instrument, producing \(512 \times 512\) pixel images representing \(200\upmu\) thick slices, and 300 Mb datasets (see DigiMorph.org). Early research on these data provided novel insights into the effects of relative growth of the brain and auditory ossicles including a novel solution to the problem of evolution of the mammalian middle ear. It took weeks to manually register images from the different datasets, to plot growth vectors for different elements of the braincase and middle ear, and to illustrate the results which were sufficiently important for publication in the journal Science44.
Applying SAPID to register pairs of these same datasets took only a few hours and required no user manual input (Fig. 1). The results not only corroborated the earlier findings of differential growth trajectories for specific points on the skull, but also provided comparisons of the entire skulls in different slice planes that reveal differential growth in the face and dentition that have not previously been studied quantitatively in 3D. It also enabled identification of artifacts resulting from drying the skulls before scanning, especially in the less mature specimens. Preparation artifacts are rarely identified as sources of uncertainty in morphometric comparisons between recent specimens. With our latest Northstar ultra high-resolution scanner, we can generate imagery up to \(4098 \times 4098\) pixels, at micron-scale resolutions, and data volumes more than an order of magnitude larger than those produced by the previous generations of scanners. The advent of iodine-staining and phase contrast scanning45 has enabled CT to image soft tissues as well as the skeleton. CT is now poised to facilitate a much fuller understanding of the complex dynamic between the developing skull, its neurosensory structures, and its musculature. Moreover, we can now successfully image 1-day-old specimens in which nearly all of the skeleton is still cartilaginous, and capture all of postnatal ontogeny using a much denser sample of specimens. The data volumes will quickly grow into hundreds of gigabytes. Quantitative analysis and visualization of such large and rich volumes of data can for the first time be accomplished using the improved, automated algorithms integrated in SAPID.
Additionally, SAPID provided the SWD quantitative geometric measures that compares skull shape between specimens. The original data is shown in Fig. 1(left) and the registered data is shown in Fig. 1(middle). We emphasize that these results are fully volumetric, as shown in Fig. 1(right). The measurements indicate rapid differentiation of shape in early ontogeny followed by slowing and then a plateau approaching maturity. These results are not surprising, but quantifying rates of morphogenesis over the course of ontogeny has been limited to surface based methods until now, which limits the ability to detect more subtle morphological changes that occur in 3D. Using fossils in a phylogenetic context, we will be able to quantify rates of evolutionary morphogenesis througout the entire image volume as well. Whole head analyses, quantitatively in 3D, will enable testing of recent hypotheses that olfactory gene duplications were major drivers in the evolution of skull and cortical organization along the mammalian stem and in many of its living clades46,47.
Example 2: posture correction and interspecies skull comparison (Boyer Lab)
It has long been recognized size variation among individuals within a species scales proportionally and is generally less than size variation between species48. Understanding of the intraspecific limits on size variation has been critical for utilizing data on size variation to address questions about evolution. A complete geometric characterization not just of size but shape necessarily provides greater quantitative sensitivity to evolutionary changes. Classic morphometric methods have proven useful in this problem but are limited by their restriction to surfaces, and not easy to generalize to full volumetric data. They are also somewhat limited in the application to specimens that have well defined similar landmarks, though even in this case they artificially assume the importance of certain landmarks chosen a priori, as discussed below in “Hypothesis testing” section.
In the case where similar landmarks do not exist, current analysis methods become even more problematic. However, the volumetric registration of SAPID is unaffected by this problem, as no landmark identification is required a priori. Even in the absence of similar landmarks, SAPID provides a general method for describing intraspecific variation in different species by iteratively permuting monospecific sample compositions to generate a distribution of best-parameters for each sample. The distribution of SWD parameters then becomes the measure of shape variation that can be compared between species. When pairs of groups have non-overlapping distributions of parameters, one could define these groups as having heterogeneous levels of variation. The same parameters that we use to demonstrate differences in intraspecific variation can also be used to assess group membership of unknown individuals or to assess shape overlap between populations.
A demonstration of the capabilities of shape characterization with SWD is shown in Fig. 2 where skulls from an anthropoid and a strepsirrhine (from MorphoSource.org) are compared by characterizing each in terms of the SWD at different accuracies which allows the quantitative characterization of similar and dissimilar local and global shape changes. A demonstration of volumetric non-linear registration using SYM-REG on volumetric CT scans of two Callicebus cupreus skulls is shown in Fig. 3. An example of intra-species and inter-species quantitative morphological comparison facilitated by SAPID is shown in Fig. 4 using volumetric CT scans of quantitative morphological comparison using CT scans of seven anthropoid skull and seven strepsirrhine skulls.
These whole body datasets present a more difficult but very critical practical problem for characterizing phenotypic differences. Variations of posture of different specimens introduce large differences in volumetric geometry unrelated to differences due to shape of individual bones. We found that SAPID is able to distinguish these, as demonstrated by the example where the same specimen scanned twice in two completely different postures can still be recognized as uniquely similar even when compared to different specimens in similar postures (Fig. 5). In addition, the SAPID derived parameters have been able to demonstrate the relative similarity of different scans. This is completely outside the realm of capability of any other shape comparison method based on either volumes or surfaces.
Example 3: shape analysis and segmentation of Tyrannosaurus rex (Witmer Lab)
Paleobiology is another field where the rapid advancement of imaging and computational capabilities is completely altering our understanding of form and function. The SAPID algorithms have the capabilities of addressing common problems in vertebrate paleontology, such as deformation, disarticulation, and missing bony elements, within a framework of rigor and repeatability, and help address key questions in dinosaur paleobiology.
Imaging in paleobiology research shares common goals with evolutionary biology of extant species, such as automatic segmentation and quantitative shape characterization of internal structures and statistical assessment of structure variations between specimens. But extending these analytical advances into deep time, however, presents challenges for paleobiological interpretation due to the added complication of diagenetic and taphonomic factors.
Ultimately, optimization of the SAPID methods for paleobiological data will require addressing the unique characteristics of CT in inhomogeneous samples (e.g., fossils in rock matrix, often with additional materials introduced by museum staff such as plaster, adhesives, and metal mounting armature) and the optimal fitting parameters for data that contain a large number of sharp edges (e.g., from fractures). We performed an initial SWD analysis on the skull of the well-known specimen of Tyrannosaurus rex known as “Sue” (FMNH PR 2081). This specimen, on display at the Field Museum of Natural History in Chicago, is among the most complete (approximately 85%) and largest specimens of T. rex collected. The results, shown in Fig. 6, demonstrate the ability to fit and segment a specimen that has these issues.
Future work will require addressing deformation and other taphonomic artifacts, including development of standardized analysis and reproducibility pathways for retrodeformation of fossils, and optimizing the tools to take into account diagenetic and/or taphonomic deformation, which hampers analytical comparison of specimens.
Just as it is able to factor out differences due to posture, the SAPID tools also have the potential to meaningfully quantify and compare specimens even when bones are out of position or missing, as is often the case in fossils. This will provide a level of objectivity and repeatability previously lacking in dinosaur studies.
The ultimate goal of SAPID in this application is the elucidation of specific paleobiological issues such as dinosaur skull function, such as sensory organization and behavior of tryannosaurs49. Analytical advances in bioengineering—such as finite element analysis (FEA) of feeding mechanics or computational fluid dynamics (CFD) of nasal airflow—are highly sensitive to structural conformation. Having powerful analytical software connected to high-fidelity 3D data for multiple specimens within different clades will allow unprecedented computational accuracy and reproducibility of functional studies. In conjunction with soft-tissue reconstructions (e.g., jaw muscles, brain), this will allow hypothesis-testing of critical functional systems (e.g., feeding, respiration, sound production, sensory and cognitive ability).
Example 4: the fossil problem: Archaeoptery × lithographica (Rowe Lab)
A good example of the challenges facing paleobiological imaging data, and the potential utility of SAPID in addressing them, is illustrated with high resolution CT data of the skull of the important London specimen of Archaeopteryx lithographica (BMNH 37001) shown in Fig. 7. This was the subject of a study that elucidated the avian nature of the brain and inner ear and its adaptation to flight50.
The braincase of BMNH 37001 was scanned twice, at low and high X-ray energies (120 kV and 180 kV, respectively) The resulting matrix data size was \(1024 \times 1024 \times 650\) voxels and the voxel size was \(20 \times 20 \times 46 \upmu {\mathrm{m}}\). (For full scanning details, see50, and additioanal imagery on http://digimorph.org/specimens/Archaeopteryx_lithographica/).
In Fig. 7A is shown the results of the original manual processing (Fig. 1b in50) using Mimics v7.3 software (www.materialise.com). This required a significant amount of hand editing and took approximate 100 h to complete. In Fig. 7B is shown the SWD analysis, which used on the high power (180 kV) data. The computation of the SWD took \(\approx 2\) min and the segmentation took \(\approx 5\) mins.
It is important to recognize that there is a fundamental difference between the traditional histogram method and the SWD method for segmentation. The most principled approach to histogram based segmentation is to estimate different statistical distributions of intensities. However, this will still include any voxels with similar intensities, even if they are spatially isolated from the large scale structures, such as the ubiquitous sand grains that are often isointense with bone in CT images and thus contaminate segmentations that use standard histogram based methods. However the SWD is first performing a quantitative estimate of the entire volumetric shape of these structures by fitting them to a set of 3D functions (as described in the “Methods” section), then using the estimated coefficients to detect boundary regions between components (i.e., rock and bone). The results (e.g. Fig. 7B) are thus distinct (i.e., segmented) 3D shapes fully numerically characterized by a set of coefficients that can be used for geometric analysis—computation of volumes, surface areas, etc, and can be compared with other shapes. It should be noted that while bird and reptile brains are smooth, some mammals, especially humans, cetaceans, proboscideans, and artiodactyls are highly folded, and the SWD can be used to quantify the degree of foliation. This analysis has been shown to be useful in the study of cerebellar foliations in elasmobranch brains, which is related to their evolutionary development, predation strategies, habitat, behavior, and cognitive capabilities51.
Discussion
Volumetric versus surface based methods
The spherical wave decomposition (SWD)52 was developed to overcome the limitations of surface-based methods by directly analyzing the entire data volume, obviating the segmentation, inflation, and surface fitting steps of surface based methods, significantly reducing the computational time and eliminating topological errors while providing a more detailed quantitative description based upon a more complete theoretical framework for volumetric data. One of its most important features is the lack of topological defects that plague surfaced based methods52.
Characterization of morphological features embedded within noisy volumetric data is the essential technical issue in quantitative descriptions of morphology, and so plays a critical role in the characterization of morphology both within and among species. Standard approaches to this problem employ surface based methods. Those that attempt to fit functional coefficients require an initial segmentation of a surface and often a subsequent inflation of this surface to satisfy the uniqueness or stability of subsequent surface fitting algorithms. These methods are inefficient and time consuming because of the need for segmentation prior to fitting and the computationally intensive inflation process, the latter being also a significant source of errors due to creation of topological defects.
It is also worth noting that most implementations of surface-based methods are of the external surfaces, not the internal surfaces, so that a significant portion of the morphology remains uncharacterized. Exceptions to this are surface renderings of internal negative spaces within skulls, such the brain and inner ear spaces and the nasal cavity49. However, this type of skull morphometric analyses to use internal anatomy along with external anatomy is uncommon. The SAPID approach presented here makes such analysis straightforward.
Another important feature of our SWD numerical implementation is its speed: it can reconstruct a volumetric data of size \(256 \times 256 \times 128\) in the order of about 10 s, as compared to a surface only reconstruction using a standard package used in medical imaging53 on the same data, which took \(\approx 14\) h. This speed opens up the possibility of automatically exploring very large databases in a practical amount of time, enabling broader intra- and inter-species comparisons.
It should be noted that the volumetric approach emphasized here does not preclude the use in cases where only surfaces, or other substructures, are of interest (and other features ignored), as these are straightforward to extract following the volumetric processing.
Scientific use cases: pivotal questions
The dominant patterns and processes of morphological change across animal lineages remain subjects of active debate. Morphological change can be modeled as primarily gradual in pattern or as saltatory and punctuated, with rapid accumulations of evolutionary change after long periods of stasis54. The processes of morphological diversification among lineages are also contentious55. Diversification may occur primarily through widespread adaptive evolution, bringing lineages closer to theoretical ecological or engineering optima. If evolution is primarily driven by different selective pressures then frequent convergence in shape among distantly related species may be observed56. Proposed examples of functional-adaptive diversity and convergence include a few different dietary ecologies in hundreds of species in bats57, similarities in the dental morphology of mammals adapted to similar diets but representing widely divergent clades58; similarities in the distal limb morphology of eutherian and metatherian large mammal predators59; and even relatively specialized structures like the “plagiaulacoid” form of premolar seen in multituberculates, marsupials, and plesiadapiforms60. Alternatively, diversification in anatomical form may occur primarily along pathways regulated by legacies of constraint61. Under this model of diversification, a more limited role of adaptive evolution as well as an extensive role for non-selective drift and historical constraints is acknowledged. This model of limited adaptation within the parameters of constrained inherited body plans was proposed62 to describe the evolution of animal life subsequent to the Cambrian. However, as Gould noted, this hypothesis is impossible to conclusively assess in the absence of quantitative methods for capturing organismal shape in the absence of homology. In other words, a key premise of the hypothesis is that differences between phyla should be larger and more environmentally correlated than differences within phyla. But since all shape comparison methods reference specific homologous structures we cannot yet actually comment on whether differences between worms and vertebrates are in fact bigger or smaller than differences within vertebrates.
One might think that finding the answer to questions about evolutionary process is now the purview of genomics since anatomy has a genetic basis and population genetics provides researchers with straightforward tools for determining whether selection has operated or is operating on a given genetic region or locus63. However, the ability of genetics to describe the selective basis of differences in complex structural anatomical traits may in fact be quite limited64,65. There is growing recognition that patterns of anatomical variation themselves may still hold the most promise, especially when quantified correctly and analyzed in the appropriate statistical frameworks66,67,68. A starting point for such analyses is recognizing cohesive species/population units in the fossil record and extant museum collections, which allows for documenting magnitudes and patterns of intraspecific variation and trait covariance69,70,71. Broadly characterizing general properties of shape variation, while critical for understanding macroevolutionary processes, has stymied evolutionary biologists and paleontologists alike. While there have been a growing number of 3D morphometric studies based on landmark datasets, these are still based on surface based methods, and only register a limited number of preselected landmark points, rather than the entire volume. With the SAPID approach to shape characterization and comparison, not only can we statistically quantify the shape in populations (such as the mean, which provides a template or atlas, and the variance which is the necessary unit of evolutionary change), but we can define shape difference without reference to subject landmarks, linear measures, or ordinations of such subjective variables. In other words, we can define metrics representing variation in overall anatomy without being limited by the presence/absence of particular features, while still quantifying the subregion variation contributing to these overall metrics.
The efficiency and automation of the SAPID algorithms will facilitate exploration of large datasets to investigate the role of natural selection in driving extant vertebrate diversity. Distinguishing epigenetic relationships from change driven by natural selection is a tenacious problem. This is currently not within the capability of existing algorithms. For example, with large virtual collections of specimens as CT data, one could investigate the hypothesis that natural selection has had a significant and primary effect on cranial and skeletal anatomy. The scan data combined with SAPID methods would allow one to evaluate the prediction that (a) those groups with greater ecological diversity should have greater cranial and skeletal diversity when estimated age of the group’s common ancestor is taken into account, (b) and that when grouped by ecological niche, different clades of primates or classes of vertebrates should show common trends relating and distinguishing those niche groups.
Development of standardized analysis and reproducibility pathways
SAPID demonstrably represents a fundamental advance in methods for volumetric shape comparison and image registration from a theoretical, mathematical, and computational perspective. However, it is not necessarily intuitive to most morphometricians for some of the same reasons. The most fundamental difference between SAPID and traditional shape comparisons is that ‘shape distance’ between two objects is not uniquely defined as it would be in comparing landmark datasets in geometric morphometrics via Procrustes distances. Instead, with SAPID, it is possible to control the ‘goodness’ of fit between two distinct shapes by modifying parameters used to generate the symplectomorphic mapping, thus producing results with different “shape distances” for the same two shapes. This could be frustrating for morphometricians because it superficially adds a new level of ambiguity to image comparison. However, such ambiguity, in truth, has always been there. In traditional geometric morphometrics, it is masked by the problematic assumption that affine transformations are biologically and geometrically sufficient, and that a researcher’s particular choice of landmarks and the between-species deformations implied by those landmarks are meaningful descriptions of the geometry. Allowing the first of these assumptions to monopolize geometric morphometrics risks misinterpreting the repeated observation of patterns due to this approach as fundamental patterns of shape variation, which could mislead the development of broad evolutionary theories. The second set potentially adds to problems of reproducibility between researchers and fundamentally prohibits comparing variation among structures that are not homologous on some level.
A non-unique distance measure leaves two possible routes for assessing degree difference between morphological samples. (1) Use a biological control (aka reference) sample to define the appropriate parameters of image registration, and apply these to experimental perturbations of the sample or comparisons with other samples. (2) Use a critical measure of shape difference (e.g., zero difference), to determine the appropriate parameters for image registration between any two objects. In this second scenario, understanding and quantifying biological variation between samples will require introduction of some kind of registration metric that can possibly incorporate the magnitude of the deformation and/or the parameters of the fit needed to perfectly match individuals to each other and/or a template representing the average.
The benefit of the first approach is that it shares more similarity with other current methods of shape comparison and we can already define shape statistics for these comparisons (e.g. parameters of image registration can even be chosen to keep the deformations uniform across the volume or across each coordinate independently, thus resulting in familiar orthogonal or affine transforms)31. The drawback is that it requires the assumption that particular control samples have been appropriately defined, it requires the preservation of these control samples for reproducibility, and it means that conclusions of studies hinge on the composition of their control samples. A further problem with this approach is that, while it should usually produce satisfying results in suggesting large quantitative differences among qualitatively different shapes, the registration of these shapes often will not make much biological sense to researchers, leading them to question the utility of the distance.
The benefit of the second approach is that forcing a complete registration between two objects leaves little ambiguity about the correctness of the mapping (researchers can easily verify that obviously homologous structures (e.g., nasal apertures, orbits, neurocrania) are mapped to each other. The drawback is that statistics for parameters describing the amount deformation used to accomplish the registration are not yet defined. Defining statistics for this mode of comparison is thus an area of important work.
Finally, one opportunity presented by the SAPID approach to shape registration is that effects of non-linear differences in position of homologous objects (like differences in postures of two articulated skeletons) can be separated from differences due to registering their volumes to one another. If implemented into an analytical workflow, scans of multiple skeletons of the same species could be retro-deformed into similar postures and then compared for whole skeleton morphological variation.
Another area of future work is the assessment of sensitivity to control samples and problems with mappings of qualitative different shapes. Possible approaches to this problem include (1) iteratively permuting composition of control samples for various bones of various species to assess the sensitivity of intraspecific mappings to these variables. While helping us understand the effect of robustness of intraspecifically (or subspecifically) defined parameters, this will allow comparison of heterogeneity in variation among species and among anatomical regions which will also address interesting biological questions.
The SAPID analysis can be used to augment existing taxonomic strategies. For example, after computing within and between species differences using parameters based on one or several intraspecific samples, the matrix of interspecific differences can be used to define the minimum spanning tree relating those samples where the length of a connection between species, which is defined as the shape differences between two species templates under the predefined image registration parameters. This graph can be used to compose (or guide) mappings between species with large qualitative differences thereby avoiding distances based on unlikely registrations but maintaining a constant set of registration parameters.
Hypothesis testing
In the end, what is of greatest importance for morphometric methods is their utility in hypothesis testing. This requires the ability to quantitatively compare complex shape variations within or between species over time. The more precise the geometric characterization of a volumetric shape and the registration between specimens, the greater the sensitivity to subtle variations hypothesized in evolutionary models. This is ultimately the greatest power of the SAPID method, which provides such precision in the very general context of volumetric imaging data, be it from MRI, CT, or any other scanner type.
A specific example that might be helpful in clarifying this point is provided by the question of the coevolution of the mammalian middle ear and neocortex examined in44 using CT data of Monodelphis domestica. At the time this paper was written (\({\sim }\, 25\) y ago), 3D analysis methods were far less developed. Studying the development and growth of the didelphid mandibular arch was done by examining how CMJ and middle ear growth relate to each other in the context of cortical plane growth. The CMJ is a fixed reference point to align the different individuals to show the growth trajectory of the fenestra vestibuli, and outlines of the ’cortical equator’ were used to represent brain growth. This required manual tracing of structures in a single slice, identifying appropriate landmarks for comparison, then merging the results onto a single image (see Fig. 1d in44). This took several weeks of work to complete.
That same CT data is shown Fig. 1(left column) where the entire dataset, or any substructure(s) of interest, can be quantitatively characterized and compared between species and over time. In Fig. 1(middle column) we have demonstrated the simplest form of such a comparison—the decreasing mean squared difference in shape between maturing specimens and a reference adult specimen, just to emphasize these capabilities. In the practical application to this problem, SAPID could be used to examine the particular substructures of interest (the CMJ and fenestra vestibuli) and provide quantitative characterizations of the shape variations in terms of whatever metrics are deemed of greatest interest, such as distances, angles, mid-points, shape complexity (i.e., number of SWD coefficients necessary to characterize the shape, etc). These would provide more precise quantitative measures that can then be used in a hypothesis-driven analyses of shape association with particular factors such as diet/locomotion, disparity measurements, and sometimes estimates of integration and/or modularity—more often than not in a comparative/phylogenetic context. Such work is beyond the scope of the current paper but will be addressed in a future study.
It is important to recognize that the SAPID methods actually broaden the scope of hypothesis driven research. The current standard methods for shape comparison that involve identification and subsequent registration of landmarks identify these landmarks a priori, and thus there is an implicit bias as to their relevance. On the contrary, SYMREG will automatically register the entire shapes. Standard landmarks (homologous structures) will automatically be registered but any additional co-registered shapes can be identified ex post facto as useful landmarks, potentially providing new hypotheses about the evolutionary trajectory. These methods might usefully inform more current research on a wide variety of topics (e.g.,72,73,74).
The important overarching point is that the ability of the SAPID approach to produce quantitative volumetric characterization and registration provides the basis for utilizing a wide range of standard statistical comparison methods with much greater sensitivity to geometric variations in specific tissue types than extant methods. This can result in an enhanced ability to more precisely quantitatively address significant specific questions in evolutionary biology.
Conclusion
In conclusion, we have presented two theoretical constructs and their numerical implementations, the spherical wave decomposition (SWD)42 that provides fast, accurate automated characterization of shapes embedded within complex 3D datasets, and symplectomorphic registration with phase space regularization by ESP (SYMREG)43, a volumetric non-linear registration method that allows homologous structures to be correctly warped to each other or a common template for comparison. Take together, these constitute an automated approach to true volumetric computational morphometrics that we call the Shape Analysis for Phenomics from Imaging Data (SAPID) method. This paper has shown its capability on several datasets of importance to evolutionary biology, paleobiology, and digital library data usage, which suggests its widespread utility in a broad scope of these and related disciplines.
Methods
Volumetric shape analysis
The goal of shape analysis is to quantitatively characterize a 3D “object”, such as a skull specimen, that has been imaged by some volumetric imaging modality, such as computed tomography (CT). Specifically, the objective is to represent the digitized version of the object by some well-defined mathematical functions to assign numbers containing the shape information that uniquely describe the specimen and can be used to compare to other specimens. It is useful to think of this process as involving two steps: (1) Choosing a set of mathematical functions appropriate to generally describe such objects, which we will call the issue representation and (2) Determining the parameters of that representation that uniquely describe a specific volumetric image of a specimen, a process called reconstruction.
The goal of quantitative morphology is to construct a numerical characterization of the spatial organization of an object. There are often many ways to do this but implementing an efficient numerical method can be greatly facilitated by a judicious choice of the representation. One way to think about this problem is as a decision about the choice of coordinate system. This is best illustrated by three examples, which will quickly take us to the motivation for the SWD.
Cartesian basis and spatial coordinates
Consider the problem of analyzing very regular shapes, such as an office building which we assume to be a perfectly rectangular cuboid. The shapes are characterized by the three numbers—the length a along one base, which will call the x-axis, the length b along the other base, which will call the y-axis, and the height c along the vertical or z-axis. Since these directions are all perpendicular, we can define a vector of unit length along each direction:
and then any point f(x, y, z) within the building can be described by
where \(0 \le \alpha ,\beta ,\gamma \le 1\). The vectors in Eq. (1) are called basis vectors of the Cartesian coordinate system. The construction in Eq. (2) is called a linear combination of basis vectors and it works because the basis vectors are mutually perpendicular, or orthogonal. This important property will be shared by the other bases we use below. A basis can be thought of a set of functions that can completely and uniquely describe all points in a space. In order to strictly qualify as a basis, certain properties are required, which go beyond the scope of this paper but can be found in any linear algebra book75.
Characterizing the shape of the building—its volume, distance between the ground floor entrance and your desk on the 10th floor, the volume of a substructure (such as a floor) can all be done using these basis vectors. We just use points along these axes and the rule for vector addition and we can compute the geometric quantities of interest—distance, area, volume, etc.
The frequency domain and plane waves
Unlike an office building, biological objects such as the CT of a skull or the MRI of a brain have exceedingly complex geometries, so a more general approach is necessary. One feature of biological structures that can be used to advantage is that they tend to exhibit extended spatial patterns, though these maybe complicated. This suggested working in a coordinate system of patterns, rather than directly in the spatial coordinates. A simple example is the very regular pattern of a sine wave in space, which in one-dimension is:
where x is the spatial coordinate. This wave is described by only three parameters: (1) its spatial frequency in the x-direction k (in units of \(1/\text{distance }\))—how rapidly it oscillates as a function of position; (2) its amplitude A—the height of the peak; and (3) the phase \(\theta\)—the offset of the starting point. In other words, the best coordinate system to describe such a wave is not the spatial domain \(\{ x,y,z \}\), but rather the coordinates \(\{ \omega ,A,\theta \}\). This is called the spatial frequency domain. The variables x and k are called conjugate variables because we can characterize the wave equation (3) in either domain.
Somewhat surprisingly, more complex shapes can be described by the combination (i.e. the sum) of waves of different amplitudes, spatial frequencies, and phases. This is the essence of Fourier’s Theorem. The remarkable property of the \(\sin\) and \(\cos\) functions is that they are mutually orthogonal and form a basis in frequency space, in the same way that the vectors equation (1) formed a basis in coordinate space. Frequency space is also called Fourier space and the trigonometic functions are called the Fourier basis functions.
Recalling that \(\sin\) and \(\cos\) are just phase shifted versions of one another, we can eliminate the explicit phase \(\theta\) and write such a combination of waves in one-dimension at a single point \(x_{i}\)
where m is the number of spatial frequencies in the shape and the phase \(\theta\) is now contained in the relative amplitudes of a and b. The last equality is a consequence of Euler’s relation \(e^{i b} = \cos b + i \sin b\) where \(i = \sqrt{-1}\) defines a complex number which can be thought of as a convenient bookkeeping device that makes the equations much easier to work with as it avoids a proliferations of terms.
This notation is particularly convenient because we can easily generalize the expression for the signal at any point \(\varvec{x}_{i}\) to 3-dimensions as
where bold-faced represents vectors, which in this case are \(\varvec{x} = \left( x,y,z \right)\) (a standard, if somewhat confusing reuse of x) and \(\varvec{k} = \left( k_{x},k_{y},k_{z} \right)\), and the dot represents the dot product, i.e. \(\varvec{k} \cdot \varvec{x}_{i} = k_{x} x_{i} + k_{y} y_{i} + k_{z} z_{i}\). In this representation the waves are perpendicular to planes in 3D space and so are called plane waves. Equation (5) is called the Fourier Transform of the coefficients c. It converts the coefficients in the frequency domain into a signal in the spatial domain. Using Eq. (5) we can write the expression for data collected at n points \(\varvec{x} = \{ x_{1}, \ldots , x_{n} \}\) represented by m plane waves in compact matrix form as
where \(\varvec{F}\) is a matrix if size \(m \times n\) and \(\varvec{c}\) is a vector m. This form will prove useful when we turn to the problem of reconstruction.
A simple example of the Cartesian Fourier (i.e. plane wave) representation is the description of the shape of a sand dune that has wind-blown ripples on its surface, as in Fig. 8A. A highly idealized mathematical representation, for illustrative purposes, is shown in Fig. 8B. The underlying dune is constructed from a low spatial frequency but high amplitude wave and the ripples are created from a high spatial frequency but low amplitude wave. The entire dune is the sum of these waves.
The representation of a generic object in terms of a set of basis functions can be thought of as the construction of a model for the data. Characterization of a particular object requires finding the parameters of that model that produces the most faithful representation of the object. This is the problem of reconstruction. It is an inverse problem that requires methods of inference, and hence probability theory. The example of Fourier reconstruction provides a nice example of the power of probability in reconstruction problems (see, for example,76) but is beyond the scope of the present paper.
From Eq. (6), given the data \(s(\varvec{x})\) the estimated coefficients \(\hat{\varvec{c}}\) can naively be estimated by using the Inverse Fourier Transform
where \(\varvec{F}^{\dagger }\) is the complex conjugate transpose of the model functions. The result of this procedure for the sand dunes is shown in Fig. 8C, which shows that the shape, characterized by the Fourier spectrum is reasonably retrieved. It is important to recognize, however, that Eq. (7) is formally not the correct answer, even though in many cases it is a reasonable approximation. But generally reconstruction is not the same thing as multiplying by the inverse of the basis functions!. For example, in standard least-squares estimation, c would be reconstructed by replacing F† with the pseudo-inverse of F in Eq. (7). This example was chosen for its simplicity and because the constituent waves are visible as such. But much more complex geometries that do not appear wavelike can be constructed by the same approach, as we shall see.
Spherical basis and spherical waves
In the sand dune example we ignored the fact that the sand dune is sitting on a curved surface—the Earth. This was reasonable because the curvature of the Earth is insignificant over the extent of the dune so that, for all intents and purposes, the dune is sitting on a flat surface. Mathematically, we say that the Earth is locally flat in the region of the dune. This problem is therefore naturally described as Fourier functions in Cartesian coordinates—just the plane waves discussed in the previous section.
But what if we wanted to characterize the entire surface of the Earth, not just a single dune? And then go even further and examine the surfaces of the major interior layers of the Earth (Fig. 9)? For this there is a more efficient, and intuitively obvious, coordinate system called the spherical coordinate system, parameterized by the three coordinates \(\{ r,\theta ,\phi \}\) which are the radius r—the distance from the center of a sphere to a point on its surface, such as the top of a mountain, the polar angle \(\theta\)—the angle from the geographical (not magnetic) North pole, and the azimuthal angle \(\phi\)—the angle along the Equator, measured from some reference point (say, the Greenwich Meridian). These are more familiar, of course, as altitude, the latitude and the longitude. (Actually the latitute is measured from the equator, so \(90^{\circ } - \theta\), but the lines of constant latitude are the same).
The spherical coordinate system does not provide any more information than the Cartesian coordinate system (indeed, one can transform from one to the other), but it is more natural, and provides more parsimonious descriptions. For example, the path of someone traveling on an arbitrary path on the surface of a sphere can be specified by only two parameter—the angles \(\{ \theta ,\phi \}\), since the radius r is a constant. In Cartesian coordinates, description of the same path would require all three parameters \(\{ x,y,z \}\).
Let’s consider first a hypothetical situation in which the Earth’s surface was entirely covered by giant sand dunes. And for simplicity let’s assume that the shape of the Earth underneath the dunes is a perfect sphere (it’s actual an oblate spheroid). How do we describe the wave in spherical coordinates so as to take into account the curved surfaces on which the waves are propagating? Well, it turns out that the Fourier plane wave functions developed in a Cartesian coordinate system can be rewritten in spherical coordinates using special functions: spherical harmonics and spherical Bessel functions. The combination of spherical harmonics (the angular variations) and spherical Bessel functions (the radial variations) are called spherical waves. These are the functions that we will use to characterize the entire shape of the digitized 3D objects within the volumetric images.
The characterization of the Earth’s volumetric features in terms of spherical waves is shown schematically in Fig. 9. No actual fitting has been performed, and the surface features are idealized and exaggerated—this is just an illustration of the ability of spherical waves to characterize both radial and angular variations volumetric shapes.
Determining the coefficients that characterize the shape of the object through these spherical wave functions is what we call the spherical wave decomposition42. An example of volumetric shape characterization of a single high resolution anatomical MRI brain scan is shown in Fig. 10.
The model order problem
There is an insidious problem lurking in the previous examples. In Eq. (5) the label m represents the number of model functions necessary to completely described the shape. This is called the model order. But how many basis functions do we need to accurately and optimally characterize the shape or, alternatively, what is the optimal model order \(\hat{m}\)? In the sand dune example, this problem was hidden because we constructed such a simple model with separate spatial frequencies that were well resolved. And there was no noise. But with noisy data that contain a wide spectrum of spatial frequencies, we will no longer be able to ignore this issue.
Every imaging process has limited resolution because of sampling restrictions and is contaminated by imperfections that we will generically call “noise”. In addition, the number of functions necessary to characterize the shape, also called the model order, is another unknown. The goal of the reconstruction process is to determine the coefficients of the data model and the correct model order from the noisy volumetric data of finite resolution. We can illustrate the problem with a very simple example of fitting a polynomial discussed in Fig. 11. An example of this applied to a volumetric human brain image using the SWD is shown in Fig. 12.
The model order has physical significance as a measure of an objects complexity, and consequently can have biological and evolutionary significance, as was demonstrated in our study to characterize of cerebellar foliations in elasmobranch brains51, which has implications in the quantification and comparison of the cerebellum in different species of elasmobranchs where variation in cerebellar foliations has been shown to be related to species habitat and predation strategies and has evolutionary significance77,78,79.
Volumetric segmentation
One important difference between the SWD and surface based methods is that the SWD provides a method for naturally and effectively performing the very complex task of volume segmentation. Segmentation is not tractable by surface based methods alone because they are themselves predicated on segmentation. That is, segmentation must be performed (either by hand or by using additional specialized semi-automated segmentation tools) before the surface based methods can even be applied to new volumetric data. In the SWD approach, the segmentation is done on the entire volume. All important features of the SWD approach (including weighted Fourier smoothing, optimal SWD order and volume morphometry/complexity) are also applicable to segmented and independently represented structures.
Because the SWD estimates fit coefficients for a set of volumetric functions, the coefficients for the analytic expansion of the derivatives is produced as well. This automatically provides an estimate of the coefficients of tissue border regions and thus allows automated segmentation of tissue types. This is shown for the human brain data in Fig. 13 where separate volumes for the two tissue types, white matter (WM) and gray matter (GM), are automatically produced.
Summary
The SWD produces a quantitative description of an entire volume of data. This output can then be used for comparison between datasets. Comparison of homologous structures between datasets requires a method for non-linear registration, which we discuss next.
Volumetric image registration
The registration problem
The general idea behind geometric morphometrics is to transform different images to the same coordinate system so that relations between homologous points can be determined. Approaches to this problem of registration, or spatial normalization, fall into three basic categories: (1) Rigid registration, which seeks a linear transformation that best aligns two objects without altering the objects’ size and shape, and thus involves only translation and rotation; (2) Affine registration, which seeks a general linear transformation that allows objects’ global size and shape to be altered and thus involves shear and scaling in addition to translation and rotation; and (3) Diffeomorphic registration, a non-linear operation that, in addition to affine registration, normalizes the objects’ size and shapes as well. Not surprisingly, it is also the most complex.
However, current geometric morphometrics80,81,82,83 is typically based on some form of Generalized Procrustes Alignment (GPA)25,31 which is based upon a minimum Procrustes distance over all affine transformations84 and then performing a subsequent statistical analysis. Unfortunately, the affine methods are only approximations (and sometimes quite poor) for matching, comparing, or combining homologous anatomical structures because affine registration is a global transformation and is incapable of capturing localized changes in shape. In many cases of interest, the localized changes are the most important part. The Procrustes method is rather simplistic in this respect as the geometric differences are essentially distributed as evenly as possible among all landmarks representing each specimen since it aligns specimens by their centroids and then rotates them to minimize the summed distance among all landmark coordinate pairs.
A review82 discussing the state of geometric morphometrics states that the 3D extension of 2D methods (namely, Procrustes superimpositions) is straightforward, but the methods discussed only represent affine transformations. These linear methods (utilized in the popular TpsRelW85) are currently the standard (e.g.,39). Correctly extending non-linear methods to volumetric data is a difficult theoretical and computational problem, encountered often in MRI where it is necessary to compare volumetric data from multiple subjects.
The SYMREG method
The ability to do accurate quantitative geometric morphometrics is predicated on the ability to address the non-linear registration problem. A conceptual way to think about this problem is that two different volumetric data sets have different non-linearly related coordinate systems. The goal of registration is the put all data into the same Cartesian coordinate system. A schematic of this problem is shown in Fig. 14.
As remarked above, diffeomorphic methods are the current standard for current non-linear registration. However, despite their ubiquity, in practice these methods have significant limitations in speed and accuracy that compromise their practical utility86,87. The SYMREG method is similar in spirit to diffeomorphic mapping, but is more general and flexible. This development was motivated not only to address the issues of speed and accuracy, but also to facilitate merging of multiple imaging modalities with different resolutions. This has implications for computational morphology where different modalities, in particular both MRI and CT, are in common use.
The SYMREG method is developed within a coordinate space that is more general than just the spatial coordinates used for diffeomorphic methods. Registration methods work by warping the coordinates of one spatial grid onto another. The target grid is called the template (the undistorted Cartesian grid in the top row of Fig. 14). Each spatial coordinate in the warped grid (the grid in the bottom row of Fig. 14) must therefore move along some non-linear path at some rate determined by the multiple steps of the numerical algorithm which iterates towards a final spatial configuration that minimizes the error between the warped grid and the template grid. Therefore each point has a “velocity” in addition to a position. Systems with both a spatial coordinate q and a velocity v (or more generally, a momentum p) can be characterized by the space (q, p) that contains both coordinates. This is called the phase space of the system. The dynamics within this space can be described a function H, called the Hamiltonian, which can be thought of as characterizing the energy of the system, and can be used to impose constraints that reduce the space of possible solutions, thereby increasing speed and accuracy. Specifically, whereas current methods employ diffeomorphic transformation in spatial coordinates, SYMREG is diffeomorphic in phase space, which is called a symplectomorphism.
There is another important and unique aspect of the SYMREG method. Image grids are characterized by the relative spatial locations of the points. Incorporating this relative information between points can be used to impose addition constraints that incorporate position dependent information into the problem. A very general and flexible way to do this is using our theory of Entropy Spectrum Pathways (ESP)88 where coupling between neighboring locations, constructed using a spatially dependent coupling density \(Q(\varvec{x},\varvec{x}')\) matrix, results in the ability to constrain, or regularize, the solutions with interactions that extend spatially away from the individual points. These are called non-local interactions. What this means is easily understood in simplest case that \(Q(\varvec{x},\varvec{x}')\) depends only on the different in positions and is a Gaussian centered at the targe location \(\varvec{x}'\) with inverse covariance matrix S, i.e., \(Q(\varvec{x},\varvec{x}') = Q(\varvec{x}-\varvec{x}') \sim N(\varvec{x}',S^{-1})\). The results in the commonly used Gaussian regularization kernel. Generally, however, more complex coupling schemes can incorporate more relevant prior information, resulting in more robust warping schemes. For more details, the reader is referred to43.
An example of the use of SYMREG registration on multiple MRI volumetric brain images from different normal human volunteers is shown in Fig. 1543,89. This demonstrates that naïvely combining (e.g., averaging) datasets (e.g., brains) from multiple subjects produces highly blurred images due to the natural geometric variations of the organs between individuals (Fig. 15B). Accurately taking into accounts the non-linear geometrical relations using SYMREG produced a combined image with little blurring Fig. 15C). The corresponding volumetric 3D warping grid used in the registration in Fig. 15 generated by SYMREG is shown in Fig. 16.
In summary, SYMREG allows for rapid and robust automated non-linear registration of multi-modality multi-subject data.
Data availability
Researchers interested in obtaining the SAPID software can contact Dr. Lawrence Frank at lfrank@ucsd.edu. Data used in this study is listed in Table 1 which includes the Digital Object Identifiers (DOI) where users can access the data.
References
Rowe, T. & Frank, L. R. The disappearing third dimension. Science 331, 712–714 (2011).
Smith, S. et al. Virtual taphonomy using synchrotron tomographic microscopy reveals cryptic features and internal structure of modern and fossil plants. Proc. Natl. Acad. Sci. U. S. A. 106(29), 12013–12018 (2009).
Ziegler, A. et al. Application of magnetic resonance imaging in zoology. Zoomorphology 130(4), 227–254 (2011).
Lautenschläger, S. Cranial myology and bite force performance of Erlikosaurus andrewsi: A novel approach for digital muscle reconstructions. J. Anat. 222, 260–272 (2013).
Lowe, S., Garwood, R., Simonsen, T., Bradley, R. & Withers, P. Metamorphosis revealed: Time-lapse three-dimensional imaging inside a living chrysalis. J. R. Soc. Interface 10, 20130304 (2013).
Bourke, J. et al. Breathing life into dinosaurs: Tackling challenges of soft-tissue restoration and nasal airflow in extinct species. Anat. Rec. 297, 2148–2186 (2014).
Cunningham, J., Rahman, I., Lautenschlager, S., Rayfield, E. & Donoghue, P. A virtual world of paleontology. Trends Ecol. Evol. 29(6), 347–357 (2014).
Akkari, N., Enghoff, H. & Metscher, B. A new dimension in documenting new species: High-detail imaging for myriapod taxonomy and first 3D cybertype of a new millipede species (Diplopoda, Julida, Julidae). PLOS ONE 10(8), e0135243 (2015).
Bright, J., Marugan-Lobon, J., Cobb, S. & Rayfield, E. The shapes of bird beaks are highly controlled by nondietary factors. Proc. Natl. Acad. Sci. U. S. A. 113(19), 5352–5357 (2016).
Goswami, A. et al. Do developmental constraints and high integration limit the evolution of the marsupial oral apparatus?. Integr. Comp. Biol. 56(3), 404–415 (2016).
Wipfler, B., Pohl, H., Yavorskaya, M. & Beutel, R. A review of methods for analysing insect structures—The role of morphology in the age of phylogenomics. Curr. Opin. Insect Sci. 18, 60–68 (2016).
Bock, C., Wermter, F. & Mintenbeck, K. MRI and MRS on preserved samples as a tool in fish ecology. Magn. Reson. Imaging 38(1), 39–46 (2017).
Herzog, H., Klein, B. & Ziegler, A. Form and function of the teleost lateral line revealed using three-dimensional imaging and computational fluid dynamics. J. R. Soc. Interface 14, 20160898 (2017).
Kohnk, K., Baudewig, J., Brandis, D. & Boretius, S. What’s in this crab? MRI providing high-resolution three-dimensional insights into recent finds and historical collections of Brachyura. Zoology 121, 1–9 (2017).
Kristensen, E., Parsons, T., Hallgrimsson, B. & Boyd, S. A novel 3-D image-based morphological method for phenotypic analysis. IEEE Trans. Biomed. Eng. 55(12), 2826–2831 (2008).
Boyer, D. M., Gunnell, G. F., Kaufman, S. & McGeary, T. M. A new fully automated approach for aligning and comparing shapes. Anat. Rec. 298, 249–276 (2016).
Morgan, T. The origin of five mutations in eye color in Drosophila and their modes of inheritance. Science 33, 534–537 (1911).
Abzhanov, A. et al. The calmodulin pathway and evolution of elongated beak morphology in Darwin’s finches. Nature 442, 563–567 (2006).
Harjunmaa, E. et al. On the difficulty of increasing dental complexity. Nature 483, 324–327 (2012).
Losos, J. The evolution of form and function: Morphology and locomotor performance in West Indian Anolis lizards. Evolution 44, 1189–1203 (1990).
Feder, M. & Mitchell-Olds, T. Evolutionary and ecological functional genomics. Nat. Rev. Genet. 4, 649–655 (2003).
Leakey, L., Tobias, P. & Napier, J. A new species of the genus Homo from Olduvai Gorge. Nature 202, 7–9 (1964).
Houle, D., Govindaraju, D. R. & Omholt, S. Phenomics: The next challenge. Nat. Rev. Genet. 11, 855–856 (2010).
Houle, D. Numbering the hairs on our heads: The shared challenge and promise of phenomics. Proc. Natl. Acad. Sci. U. S. A. 107(S1), 1793–1799 (2010).
Gower, J. Generalized procrustes analysis. Psychometrika 40, 33–51 (1975).
Bookstein, F. When one form is between two others: An application of biorthogonal analysis. Am. Zool. 20, 627–642 (1980).
Bookstein, F. Morphometric Tools for Landmark Data. Geometry and Biology (Cambridge University Press, 1991).
Bookstein, F. Can biometrical shape be a homologous character? In Homology: The Hierarchical Basis of Comparative Biology (ed. Hall, B.) 197–227 (Academic Press, 1994).
Dryden, I. & Mardia, K. Statistical Shape Analysis: With Applications in R (Wiley, 2016).
Rohlf, F. & Slice, D. Extensions of the Procrustes method for the optimal superimposition of landmarks. Syst. Zool. 39(1), 40–59 (1990).
Zelditch, M., Swiderski, D., Sheets, H. & Fink, W. Geometric Morphometrics for Biologists: A Primer (Elsevier, 2004).
Slice, D. Modern Morphometrics in Physical Anthropology (Springer, 2006).
Polly, P. & MacLeod, N. Locomotion in fossil Carnivora: An application of eigensurface analysis for morphometric comparison of 3D surfaces. Palaeontol. Electron. 11, 10–13 (2008).
Boyer, D. et al. Algorithms to automatically quantify the geometric similarity of anatomical surfaces. Proc. Natl. Acad. Sci. 108, 18221–18226 (2011).
Koehl, P. & Hass, J. Landmark-free geometric methods in biological shape analysis. J. R. Soc. Interface 12, 20150795 (2015).
Gao, T., Yapuncich, G., Daubechies, I., Mukherjee, S. & Boyer, D. Development and assessment of fully automated and globally transitive geometric morphometric methods, with application to a biological comparative dataset with high interspecific variation. Anat. Rec. 301(4), 36–658 (2017).
Loy, A., Boglione, C., Gagliardi, F., Ferrucci, L. & Cataudella, S. Geometric morphometrics and internal anatomy in sea bass shape analysis (Dicentrarchus labrax L., moronidae). Aquaculture 186(1–2), 33–44 (2000).
Pulcini, D., Costa, C., Aguzzi, J. & Cataudella, S. Light and shape: A contribution to demonstrate morphological differences in diurnal and nocturnal teleosts. J. Morphol. 269(3), 375–385 (2008).
Antonucci, F., Costa, C., Aguzzi, J. & Cataudella, S. Ecomorphology of morpho-functional relationships in the family of Sparidae: A quantitative statistic approach. J. Morphol. 270(7), 843–855 (2009).
Recasens, L., Lombarte, A. & Sanchez, P. Teleostean fish assemblages in an artificial reef and a natural rocky area in Catalonia (northwestern Mediterranean): An ecomorphological approach. Bull. Mar. Sci. 78(1), 71–82 (2006).
Vitek, N. et al. Semi-supervised determination of pseudocryptic morphotypes using observer-free characterizations of anatomical alignment and shape. Ecol. Evol. 7, 5041–5055 (2017).
Galinsky, V. L. & Frank, L. R. Automated segmentation and shape characterization of volumetric data. NeuroImage 92, 156–168 (2014).
Galinsky, V. L. & Frank, L. R. Symplectomorphic registration with phase space regularization by entropy spectrum pathways. Magn. Reson. Med. 81(2), 1225–1335 (2018).
Rowe, T. Coevolution of the mammalian middle ear and neocortex. Science 273(5275), 651–654 (1996).
Metscher, B. MicroCT for comparative morphology: Simple staining methods allow high-contrast 3D imaging of diverse non-mineralized animal tissues. BMC Physiol. 9, 11 (2009).
Rowe, T. B. & Shepherd, G. M. Role of ortho-retronasal olfaction in mammalian cortical evolution. J. Comp. Neurol. 524(3), 471–495 (2015).
Rowe, T. The emergence of mammals. In Evolution of Nervous Systems 2nd edn (ed. Kaas, J. H.) 1–52 (Elsevier, 2020).
Gingerich, P. Species in the primate fossil record. Evol. Anthropol. 23, 33–35 (2014).
Witmer, L. M. & Ridgely, R. C. New insights into the brain, braincase, and ear region of Tyrannosaurs (Dinosauria, Theropoda), with implications for sensory organization and behavior. Anat. Rec. 292(9), 1266–1296 (2009).
Alonso, P. D., Milner, A. C., Ketcham, R. A., Cookson, M. J. & Rowe, T. B. The avian nature of the brain and inner ear of Archaeopteryx. Nature 430(7000), 666–669 (2004).
Yopak, K. E., Berquist, R. M., Galinsky, V. L. & Frank, L. R. Quantitative classification of cerebellar foliation in cartilaginous fishes (Class: Chondrichthyes) using 3D shape analysis and its implications for evolutionary biology. Brain Behav. Evol. 87(4), 252–264 (2016).
Galinsky, V. L. & Frank, L. R. Automated segmentation and shape characterization of volumetric data. Neuroimage 92, 156–168 (2014b).
Dale, A., Fischl, B. & Sereno, M. Cortical surface-based analysis. I. Segmentation and surface reconstruction. Neuroimage 9(2), 179–194 (1999).
Gould, S. The Structure of Evolutionary Theory (Harvard University Press, 2002).
Powell, R. Is convergence more than an analogy? Homoplasy and its implications for macroevolutionary predictability. Biol. Philos. 22, 565–578 (2007).
Simpson, G. Tempo and Mode in Evolution (Columbia University Press, 1944).
Dumont, E. et al. Selection for mechanical advantage underlies multiple cranial optima in new world leaf-nosed bats. Evolution 68, 1436–1449 (2014).
Evans, A. & Sanson, G. The tooth of perfection: Functional and spatial constraints on mammalian tooth shape). Biol. J. Linnean Soc. 78(2), 173–191 (2003).
Ercoli, M., Prevosti, F. & Alvarez, A. Form and function within a phylogenetic framework: Locomotory habits of extant predators and some Miocene Sparassodonta (Metatheria). Zool. J. Linnean Soci. 165, 224–251 (2012).
Simpson, G. The ‘plagiaulacoid’ type of mammalian dentition. J. Mamm. 14, 97–107 (1933).
Gould, S. & Lewontin, R. The spandrels of San Marco and the Panglossian paradigm: A critique of the adaptationist programme. Proc. R. Soc. Lond. B 205, 581–598 (1979).
Gould, S. The disparity of the Burgess Shale arthropod fauna and the limits of cladistic analysis: Why we must strive to quantify morphospace. Paleobiology 17, 411–423 (1991).
Pavlidis, P. & Alachiotis, N. A survey of methods and tools to detect recent and strong positive selection. J. Biol. Res. Thessaloniki 24(7), 1–17 (2017).
Weiss, K. & Buchanan, A. Evolution: What it means and how we know. In A Companion to Biological Anthropology (ed. Larsen, C.) 41–55 (Wiley-Blackwell, 2010).
Chabris, C., Lee, J., Cesarini, D., Benjamin, D. & Laibson, D. The fourth law of behavior genetics. Curr. Dir. Psychol. Sci. 24, 304–312 (2015).
Marroig, G. & Cheverud, J. Did natural selection or genetic drift produce the cranial diversification of neotropical monkeys?. Am. Nat. 163, 417–428 (2004).
Nunn, C. The Comparative Method in Evolutionary Anthropology and Biology (University of Chicago Press, 2011).
Grabowski, M. Bigger brains led to bigger bodies?: The correlated evolution of human brain and body size. Curr. Anthropol. 57, 174–196 (2016).
Lande, R. Quantitative genetic analysis of multivariate evolution, applied to brain: Body size allometry. Evolution 33, 402–416 (1979).
Cheverud, J. M. A comparison of genetic and phenotypic correlations. Evolution 42, 958–968 (1988).
Polly, P. Developmental dynamics and g-matrices: Can morphometric spaces be used to model phenotypic evolution?. Evol. Biol. 35, 83–96 (2008).
Ramírez-Chaves, H. et al. Mammalian development does not recapitulate suspected key transformations in the evolutionary detachment of the mammalian middle ear. Proc. R. Soc. B 283, 20152606 (2016).
Urban, D. et al. A new developmental mechanism for the separation of the mammalian middle ear ossicles from the jaw. Proc. R. Soc. B. 283, 20152606 (2016).
Lautenschlager, S., Gill, P., Luo, Z.-X., Fagan, M. & Rayfield, E. The role of miniaturization in the evolution of the mammalian jaw and middle ear. Nature 561, 533–537 (2018).
Strang, G. Introduction to Linear Algebra 5 edn. (Wellesley-Cambridge, 2016). ISBN 978-09802327-7-6
Bretthorst, G. L. Bayesian Spectrum Analysis and Parameter Estimation (Springer, 1988).
Yopak, K. E., Balls, G. & Frank, L. R. Cortical surface structure analysis in sharks using magnetic resonance imaging (MRI). In Proceedings of the International Society of Magnetic Resonance in Medicine, Vol. 17, 2925 (2009).
Yopak, K. E. & Frank, L. R. Variation in cerebellar foliation in cartilaginous fishes: Ecological and behavioral considerations. Brain Behav. Evol. 70, 210 (2007).
Yopak, K. E. & Frank, L. R. Brain size and brain organization of the whale shark, Rhincodon typus, using magnetic resonance imaging. Brain Behav. Evol. 74(2), 121–142 (2009).
Rohlf, F. & Marcus, L. A revolution in morphometrics. Trends Ecol. Evol. 8(4), 129–132 (1993).
Marcus, L., Corti, M., Loy, A., Naylor, G. & Slice, D. Advances in Morphometrics (Plenum Press, 1996).
Adams, D. C., Rohlf, F. J. & Slice, D. E. Geometric morphometrics: Ten years of progress following the ‘revolution’. Ital. J. Zool. 71(1), 5–16 (2004).
Elewa, A. Morphometrics: Applications in Biology and Paleontology (Springer, 2004).
Wiley, D. et al. Evolutionary Morphing. In Proceedings of IEEE Visualization 2005, Minneapolis (2005).
Rohlf, F. TpsRelW, Relative Warp Analysis (Department of Ecology and Evolution, State University of New York Stony Brook, 1996).
Klein, A. et al. Evaluation of 14 nonlinear deformation algorithms applied to human brain MRI registration. Neuroimage 46(3), 786–802 (2009).
Ribeiro, A. S., Nutt, J., & McGonigle, D. J. Which metrics should be used in non-linear registration evaluation? In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015 388–395 (Elsevier Science, 2015).
Frank, L. R. & Galinsky, V. L. Information pathways in a disordered lattice. Phys. Rev. E 89(3), 11 (2014).
Galinsky, V. L. & Frank, L. R. A unified theory of neuro-MRI data shows scale-free nature of connectivity modes. Neural Comput. 29, 1441–1467 (2017).
Acknowledgements
LRF and VLG were supported by NSF Grants DBI-1143389, DBI-1147260, EF-0850369, PHY-1201238 and NIH Grant R01 MH096100. TBR was supported by NSF Grants EAR-1762458 and EAR-1919700. DMB was supported by NSF Grants BCS-1552848, DBI-1661386, DBI-1759839. LMW was supported by NSF Grants IOB-0517257, IOS-1050154, and IOS-1456503.
Author information
Authors and Affiliations
Contributions
L.F. wrote the main manuscript text and prepared the figures. T.R. contributed data and text in “Example 1: Analysis of CT growth series data of stained Monodelphis specimens (Rowe Lab)” and “Example 4: the fossil problem: Archaeoptery × lithographica (Rowe Lab)” sections and additional text through the paper. D.B. contributed data and text in “Example 2: posture correction and interspecies skull comparison (Boyer Lab)” section. L.W. contributed data and text in “Example 3: shape analysis and segmentation of Tyrannosaurus rex (Witmer Lab)” section. V.G. developed the numerical algorithms used to process the data.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Frank, L.R., Rowe, T.B., Boyer, D.M. et al. Unveiling the third dimension in morphometry with automated quantitative volumetric computations. Sci Rep 11, 14438 (2021). https://doi.org/10.1038/s41598-021-93490-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-021-93490-4
This article is cited by
-
A novel method for assessment of human midpalatal sutures using CBCT-based geometric morphometrics and complexity scores
Clinical Oral Investigations (2023)
-
Applications of Microct Imaging to Archaeobotanical Research
Journal of Archaeological Method and Theory (2023)
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.