Background & Summary

The history of interest in paleocurrents goes back to the middle 1800s, but serious work on them began in the 1960’s. Potter and Pettijohn’s classic work1, published first in 1963, framed the rapid expansion of interest in paleocurrents seen in the geological literature, as sedimentologists and petroleum geologists recognized the usefulness of paleocurrents for establishing provenance and defining sedimentary basins. Paleocurrents also played a significant role in establishing plate tectonics as a viable model2.

This project began as an outgrowth of an assignment in a graduate sedimentology class using Potter and Pettijohn’s book. Students were challenged to recover paleocurrent data from the published literature and to use that data to test models of basinal sedimentation. The results encouraged us to continue the study. In particular, this pursuit was stimulated by the textbook authors’ challenge: ‘An aspect of paleocurrents, still far from being exploited, is compilation of more region-wide paleocurrent and facies maps... of regions exceeding a thousand or more kilometres in length and width3’. To accomplish this objective would require the accumulation of data on a heretofore unavailable scope. This we sought to accomplish. This phase of the work continued until the mid-1980’s when one of us (A.C.) took a new position. Subsequently the class was no longer offered and from that point the work was continued sporadically as time allowed.

The study of paleocurrent structures has long provided useful information to sedimentologists and stratigraphers. Paleocurrents have contributed to our understanding of flow directions of paleorivers4, longshore currents5, mass transport flows6 and paleowinds7. They have enabled us to decipher original paleoslopes8, and in unimodal systems, have given clues about provenance of sediments9. They continue to be useful for interpreting paleoenvironments10 and depositional processes11. Paleocurrents, along with other kinds of data have contributed to defining basinal geometry12 and basinal processes13. They have directed in the exploration for placer deposits14 and petroleum reservoirs15,16. For these and other reasons, a database of paleocurrent data can be a valuable resource for geologists working in a variety of disciplines.

Over time, we have continued to extract paleocurrent data from the literature. We developed a computer program to handle the data, and display it graphically. As our work continued, by the mid 1990s we had exhausted the readily available published records, but we found a new resource in geological theses and dissertations which often contained abundant data that were not published elsewhere. We changed our focus to this new source and began to acquire these data from universities around the world. Geology departments at nearly every major university in North America, in Australia, in England, in South America, and in Spain were visited and theses and dissertations were pulled and relevant parts were copied and later analysed for data. This work was carried out whenever it could be scheduled along with our other research projects. At present there are just over 1,000 references from bachelors, masters and doctoral theses in the database.

In the late 1990s and beyond, published papers with paleocurrent data began to be available on the internet. As a result, we were able to add extensive coverage of areas such as mainland China that have not been readily accessible until recently. A graduate student, Mingmin Wang joined our project in 2011 to help us read and interpret these papers. She continued to work with us on other areas as well. We have also begun to backfill from the published literature for the period extending back to the 1990s. It is our intent that this database will be supplemented with additional datasets over time. This is an ongoing work, but we felt that the database itself would be useful to other investigators in its present and growing form.

Data were acquired principally from North America during the initial phases of the study. With time, the study expanded into other areas of the world as it became apparent that paleocurrents patterns existed on the larger scales that Potter and Pettijohn had anticipated. The geographic and stratigraphic distributions of the datasets in the database are depicted in Fig. 1.

Figure 1: Plot of the distribution of paleocurrent datasets by stratigraphic interval (abscissa) and geographic area (stacked bars).
figure 1

A total of 30,136 individual datasets are represented in the diagram.

The great abundance of paleocurrent datasets (Fig. 1) in the Proterozoic is a reflection of the greater abundance of clastic rocks found there compared with the overlying Paleozoic rocks. The decreased level of paleocurrent records for the post-Cambrian Palaeozoic likely results from the abundance of limestone at the expense of clastic rocks found there. Geologists rarely report paleocurrents from carbonate rocks, even though they are often derivable from those rocks17. The diminished coverage in epochs of the Cenozoic is in part (Paleocene, Oligocene), a reflection of the relative scarcity of these rocks in North America.

As presented here, the database contains 30,135 datasets representing over one million discrete paleocurrent determinations on outcrop (or in a few cases in well cores). The global geographic distribution of the data is represented in Fig. 2.

Figure 2: Paleocurrent datasets plotted on equirectangular projection image.
figure 2

One red dot represents a single dataset in the database. Background image modified from Strebe (Creative Commons Attribution-Share Alike 3.0 Unported license).

Methods

Initially, the research involved examining published reports in geological journals. The following data were acquired from each publication: full citation (authors, title, journal, year, volume and pages), the stratigraphic position of the beds and the name of the formation, the latitude and longitude of each recorded dataset, the direction the current was moving, and the number of actual measurements. Additional data collected for each record included the area in square kilometres from which the dataset was obtained, the degrees of dispersion for the dataset, the continent (or plate) from which the data were derived, the environment of deposition inferred by the author, the type of paleocurrent indicators used, and the lithology. Each paper contained from one to many entries in the database, depending upon how many individually reported paleocurrent datasets were included. For example, if a paper included a map displaying five separate rose diagrams summarizing the outcrop measurements in five areas, then five entries were made in the database, one for each rose diagram. Thus, as used here, dataset refers to a single depiction of paleocurrent data in a paper, including the results of one or more outcrop determinations of paleocurrent direction, and would be represented by a single entry in the database.

Because there is no standard method of reporting the results of paleocurrent measurements, the data are inherently variable. For example, one paper may report one paleocurrent measurement for a 500 square km area. Another paper may report 500 measurements from one square km. Both are providing data, and thus both would be included in the database, and the additional information on the area and the number of measurements and the dispersion of the data can be used to try to compensate for the vagaries. There is also variability related to the method in which the data were presented. Some papers displayed every measurement as a discrete data point. Other papers may have the paleocurrent data grouped into rose diagrams or simply represented by a single directional sign, generally an arrow. In some cases the paleocurrent datasets were expressed in writing without any graphic. All of these would be included in the database, but only papers where the paleocurrent data were derived by actual field measurements were considered for inclusion. As indicated above, one of our goals in assembling these data was to look for trends on large scale trends, and it is our intent to do this. However, attempts to combine data from such inhomogeneous sources remain problematic. Rao and Sengupta18 have developed statistical methods for comparing data from multiple sources that appear promising for dealing with inhomogeneity of sources, but these methods have not yet been applied to the data in this database.

Where data were displayed as rose diagrams, a single direction was recorded for each diagram. This would be the vector mean direction if given. In cases where no vector mean was recorded, for rose diagrams, an estimate of the vector mean direction was computed by balancing the petal areas about a line, and deriving the direction using a protractor. In other cases, where an arrow was used to represent the data, the protractor was employed to obtain a direction from the arrow. Information about the data dispersion was obtained by estimating the number of degrees covered by the data, ignoring outliers. Thus, in the case of rose diagrams, the original rose could be approximately reconstituted if desired. Truly bidirectional data (as opposed to linear data with no defined sense) were represented by two datasets in the same geographic position.

Every effort was made to capture the data as presented in the papers, and to exclude any speculations that were based upon criteria other than physical, measurable parameters. Papers in which the author inferred source areas based on models or other criteria which did not include empirical support were not included. No record was made as to whether the author(s) applied palinspastic corrections to the data, or whether those corrections were warranted. In most cases, such corrections were not required, because the data were obtained from sediments which were uncomplicated tectonically. However in those cases where such corrections were warranted (and often were applied), it would be the responsibility of the user of the database to determine the suitability of the data.

Data Records

The paleocurrent dataset is provided in xlsx format in Dryad (Data Citation 1). It contains the following fields for each record (Table 1).

Table 1 Descriptions of Elements in Database
Table 2 Stratigraphic assignment
Table 3 Division of globe into plates and subplates
Table 4 Environment of deposition for relevant formation or unit
Table 5 Type of paleocurrent indicator used to derive flow direction from the unit in question
Table 6 Lithology of the unit from which the paleocurrent data were derived

Technical Validation

Data were accumulated and accessioned over a period of years by a large number of individuals. Because some judgments were necessarily made, and geographic coordinates were assigned to the data, there are many places in the dataset where quality control was applied. The data were tested for geographic accuracy by checking the positional data for each continent with the boundaries of the continent in question. They were also checked visually by uploading the data to Google earth and evaluating the point distributions. The data appear to be accurate and valid geographically from these tests. Directional assignments and other data associated with each dataset were checked by at least two individuals.

Usage Notes

The data as presented are suitable for analysis of paleocurrent trends over much of the world, and should prove useful for basin analysis and for tracing other local and regional sedimentary patterns. A program written in CPP allows the data to be graphically represented on a map of the world. The program and instructions for its use are available from the authors.

Additional Information

How to cite this article: Brand, L. et al. Global database of paleocurrent trends through the Phanerozoic and Precambrian. Sci. Data 2:150025 doi: 10.1038/sdata.2015.25 (2015).