Fatigue database of additively manufactured alloys

Zhang, Zian; Xu, Zhiping

doi:10.1038/s41597-023-02150-x

Download PDF

Data Descriptor
Open access
Published: 02 May 2023

Fatigue database of additively manufactured alloys

Scientific Data volume 10, Article number: 249 (2023) Cite this article

3509 Accesses
3 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Fatigue is a process of mechanical degradation that is usually assessed based on empirical rules and experimental data obtained from standardized tests. Fatigue data of engineering materials are commonly reported in S-N (the stress-life relation), ε-N (the strain-life relation), and da/dN-ΔK (the relation between the fatigue crack growth rate and the stress intensity factor range) data. Fatigue and static mechanical properties of additively manufactured (AM) alloys, as well as the types of materials, parameters of AM, processing, and testing are collected from thousands of scientific articles till the end of 2022 using natural language processing, machine learning, and computer vision techniques. The results show that the performance of AM alloys could reach that of conventional alloys although data dispersion and system deviation are present. The database (FatigueData-AM2022) is formatted in compact structures, hosted in an open repository, and analyzed to show their patterns and statistics. The quality of data collected from the literature is measured by defining rating scores for datasets reported in individual studies and through the fill rates of data entries across all the datasets. The database also serves as a high-quality training set for data processing using machine learning models. The procedures of data extraction and analysis are outlined and the tools are publicly released. A unified language of fatigue data is suggested to regulate data reporting for the fatigue performance of materials to facilitate data sharing and the development of open science.

Fatigue dataset of high-entropy alloys

Article Open access 06 July 2022

Elastic modulus data for additively and conventionally manufactured variants of Ti-6Al-4V, IN718 and AISI 316 L

Article Open access 20 July 2023

Dataset of mechanical properties and electrical conductivity of copper-based alloys

Article Open access 29 July 2023

Background & Summary

Fatigue is a detrimental process of mechanical degradation experienced by structural materials and components under long-term service in, for example, the aerospace, nuclear power, oil, and gas industry¹. The design of structural integrity with the fatigue damage taken into account can be carried out in principles of safe life or damage tolerance. In safe-life design, flaws are not explicitly considered and products are intended to be removed from service after the design life. The philosophy of design relies on experimental data from standard specimens tested under specific loading conditions, which can be extended to structural components. In practice, arbitrary loading spectra are handled by considering cumulative damage, for example, by using the linear Miner’s rule². The effects of the size of specimens, mean stress, multiaxiality, and environment can also be included. The stress-life (S-N) data produced by stress-controlled (force-controlled) tests and strain-life (ε-N) data by strain-controlled tests are the two fundamental sets of experimental data for safe-life design, which describe the relationship between the maximum (σ_max, ε_max) or amplitude (σ_a, ε_a) of stress/strain and the number of loading cycles (N) and are commonly used for high-cycle fatigue (HCF)/low-cycle fatigue (LCF) design, respectively (Fig. 1a). In damage-tolerance design, a structural component is considered to be able to sustain flaws (e.g. cracks) safely before the next inspection point, and the component is then repaired or replaced². Fatigue crack growth (FCG) can be rationalized in the theory of fracture mechanics and experimentally assessed using compact-tension (CT) specimens. The dependence of the FCG rate (da/dN) on the stress intensity factor (SIF) range (ΔK) is thus referred to in structural health monitoring and maintenance (Fig. 1a). The S-N, ε-N and da/dN-ΔK data offer standard measures for the degradation of mechanical resistance under cyclic loads, which is a unique feature that can be exploited in data-centric research.

Compared to Young’s modulus and tensile strength, the fatigue performance of materials is susceptible to their microstructures, surface conditions as well as the loading and environmental conditions^2,3. The fatigue process involves microstructural evolution from nano-, micro- to structural scales, and theoretical prediction of the performance remains challenging⁴. Fatigue databases thus become of crucial importance for structural design. The initiation of the Aircraft Structural Integrity Program (ASIP) in the 1950s led to great success in preventing catastrophic failures and prolonging the life of structural components. However, only a few databases are publicly released, usually by authoritative research institutions for conventional alloys, and are limited in types of materials and the number of data records. For example, the Metallic Materials Properties Development and Standardization (MMPDS) handbook includes 213 S-N, 15 ε-N, and 39 da/dN-ΔK figures for 62 types of metallic materials, which are accepted for use in the Federal Aviation Administration (FAA), Department of Defense (DoD), and National Aeronautics and Space Administration (NASA)⁵. The National Institute for Materials Science (NIMS) Fatigue Data Sheet beginning in 1978 in Japan hosts 126 sheets of fatigue properties for 59 types of metallic materials⁶.

Standardized specimen preparation and testing conditions suppress most of the external sources of uncertainties in fatigue data and retain much of the correlation between the material performance and the material types as well as loading and environmental conditions. The reported fatigue data, however, still show a highly scattered nature for the variations in the microstructures of materials. From a complementary perspective, statistical analysis of this scattered nature based on a large volume of data may offer key insights into the material performance that cannot be reached by other means⁷.

Additive manufacturing (AM) is a facile technique to fabricate structural components with flexibility in structural design and benefits in the cost and lead time⁸ (Fig. 1b). Microstructural control offers an excellent route to explore the processing-microstructures-performance (PMP) relationship^9,10. In the past few decades, significant efforts have been made to explore the performance limits of AM alloys, especially on their fatigue behaviors¹¹. It is well-known that the surface conditions, internal defects, and other microstructural features strongly affect the fatigue performance of AM alloys, but the understanding of the PMP relationship remains largely qualitative^12,13. Both physics-^14,15 and machine learning (ML)-based approaches^16,17 were developed to resolve this issue, which demands reliable fatigue data for model verification and validation (V&V). Although the volume of data is much smaller than that reported for alloys produced by conventional techniques such as casting and forging, thousands of papers have been published on the fatigue performance of AM alloys, which provide a complete subset of data for analysis. Recent studies collected and analyzed AM fatigue data of selected AM alloys (e.g. Ti-6Al-4V, AlSi10Mg/AlSi7Mg, 316 L) from the literature^18,19,20,21. However, no datasets were released for follow-up data processing and analysis. Moreover, the quality of the summarized results is limited by the specific scope of the studies, and there is a need for standards or norms to report the fatigue performance of materials.

Open science, including open publication, data, and related resources, has recently become a global consensus to accelerate scientific research, promoting collaboration and benefiting the community^22,23. Digitization and open-access development offer entirely new opportunities for data-centric studies based on literature data, which can be compiled into structured databases and used in, for example, material screening and engineering design. Compared to the data released by authoritative institutions, open data has its richness in the material microstructures and the conditions of testing, which may be helpful for gaining more insights into the PMP correlation. However, data heterogeneity is expected at least in the quality of test specimens and the design of fatigue tests, which should be assessed to produce reliable records. Journal articles, conference proceedings, and technical reports form a vast and continually growing corpus of unstructured information, which can be processed by state-of-the-art natural language processing (NLP), ML, and computer vision (CV) techniques. Progress has been witnessed in this direction, where databases for material synthesis recipes²⁴ and properties^25,26,27 were released.

In this work, we collect fatigue data and related data reported for AM alloys including titanium, nickel, aluminum, and steel from 3,415 scientific articles (up to the end of 2022). Open-source and in-house codes are used for data extraction from figures, tables, and text. The description of research and reported S-N, ε-N and da/dN-ΔK data are outlined. To illustrate the usage of data, the fatigue performance of AM alloys is analyzed, offering suggestions for future research and more effective data publications.

Methods

Our workflow includes content acquisition (search and download), data extraction (from figures, tables, and text), and database construction (Fig. 2). The database contains metadata of articles and scientific data. Metadata includes information such as authors, funding agencies, and the year of publication, which outline the history of development, the state of the art, and the science of science (SciSci)²⁸. Scientific data describes the contents of research such as the types of materials, parameters of AM, processing and testing, fatigue and static mechanical properties, and their relationship. The scientific data in each article are organized into separated fatigue datasets for the S-N, ε-N or da/dN-ΔK data.

Content acquisition

Articles focusing on AM fatigue are identified in the citation databases and the full text are downloaded from the publishers. Keywords for AM fatigue are summarized and compiled into search formulas (Table 1). In materials science or mechanics of materials, ‘fatigue’ generally covers studies on the behaviors under cyclic loads and is used as the search keyword. For AM, a series of synonyms, branches, and their abbreviations are used, according to the terminology found in the AM standards^29,30 and review articles^{31,32,33,34,35}. The search is conducted in the authoritative citation database, Web of Science Core Collection (WoS), through the fields of ‘title’, ‘abstract’, and ‘author keywords’. WoS returns 3,415 records of articles and their metadata are obtained through the ‘export’ function. An NLP model is applied for the classification of articles according to their abstracts³⁶. Articles such as those on physiological ‘fatigue’ or research topics in irrelevant fields are discarded. Following NLP classification and manual examination, 2,001 candidate articles are identified.

Table 1 Keywords used for article search in the citation databases.

Full size table

The digital object identifiers (DOIs) in the metadata provide links to the full text. 104 of the 2,001 AM fatigue articles do not have DOIs in WoS records. In addition, 22 articles are not written in English, and 27 articles are from publishers with less than 10 publications. These records are discarded. 1,848 articles are downloaded for analysis and used to construct the database. Studies on the fatigue performance of AM alloys started after the year 2000, and most of the articles are published in both the portable document format (PDF) and extensible markup language (XML)/hypertext markup language (HTML) formats. PDF and XML/HTML files are more friendly to manual examination and automated code parsing, respectively. For Elsevier, 1,122 PDFs of articles are retrieved through the Application Programming Interface (API), accounting for 60% of the downloaded AM fatigue articles. PDFs from other sources are retrieved through the code article-downloader³⁷ (24%), Scopus Document Download Manager (12%) or manually from the publishers’ sites (4%). Elsevier API provides access to XML files (60% articles). HTML files, if available, are retrieved from other publishers by using the code article-downloader (37% articles).

Figure processing

The fatigue data (S-N, ε-N, and da/dN-ΔK) presented as scatter plots in figures or entries in tables are extracted and stored as data pairs. Scatter plots are more readable and concise than tables and are widely adopted in the literature, although the latter presentation provides direct numerical values. Figures are extracted from the PDF documents using PyMuPDF. Figures containing fatigue data are screened and those with multiple plots are manually segmented into single plots. Scattered data points are extracted by an in-house MATLAB code IMageEXtractor (IMEX). The code enables automatic and manual data extraction and allows subsequent manual correction. The automatic extraction function includes axis calibration, legend parsing, and data recognition by employing CV techniques.

The figures (98% published in color) are pre-processed into grayscale images and binarized by using a grayscale threshold of 80% to improve the efficiency of image processing in automatic extraction (Fig. 3a). The color, grayscale, and binarized versions of the figures are stored and selected for use in specific conditions. Clusters of connected black pixels in the binarized images are found and stored as figure components (FCs). The bounding box (BB) of an FC is defined as a rectangular region defined by its leftmost, rightmost, topmost, and bottommost pixels (Fig. 3a).

Axis calibration outputs the axis positions, axis labels, axis scales, ticks, and tick labels. The x-y coordinate system (CS) constitutes the largest FC, measured by the area under its BB. The x- and y-axis are identified as lines longer than 70% of the figure by scanning the largest FC in the vertical and horizontal directions. Lines perpendicular to axes are recognized as ticks. The labels are extracted by optical character recognition (OCR)³⁸ and assigned to the axes and ticks according to their positions. The scales of axes (linear/log) are determined according to the position and label of ticks.

The legend regions are selected manually in the current study since the positions and layouts of legends vary from figure to figure. In the selected region, symbols of data points are recognized and stored as templates, and the legend labels are marked down. Pixels containing data points in the CS are recognized according to the color codes of templates. Data reported in the binarized representation are recognized using the shapes. In 55% of the da/dN-ΔK data, the symbols are densely arranged and their shapes cannot be distinguished. Consequently, only pixels extracted using the color codes are stored. All S-N, ε-N and the rest 45% of da/dN-ΔK data are extracted according to both color and shape that are consistent. The extracted pixels are matched to the shapes of templates to detect the types of symbols. The centroids of these symbols are then extracted as data points. The method of data extraction (‘color and shape’, ‘color’, or ‘shape’) is recorded in the database. The extracted axes, legends, and data are visualized and manually corrected in IMEX. Data extracted from figures are converted from pixel units to physical units according to the position and scale of ticks. Ticks at two ends of the axes are chosen as references to minimize the error in determining the locations.

The performance of figure data extraction can be assessed by the metrics

$${\rm{precision}}=\frac{{\rm{TP}}}{{\rm{TP}}+{\rm{FP}}},$$

(1)

$${\rm{recall}}=\frac{{\rm{TP}}}{{\rm{TP}}+{\rm{FN}}},$$

(2)

$${\rm{F}}1=2{\rm{}}\frac{{\rm{precision}}\times {\rm{recall}}}{{\rm{precision}}+{\rm{recall}}},$$

(3)

where TP denotes the true positive or the number of correctly-extracted data, FP is the false positive or the number of incorrectly-extracted data, and FN is the false negative or the number of data that are not extracted. The F1 score is the harmonic mean of precision and recall. The metrics of axis calibration, legend parsing, and data recognition are summarized in Table 2. We find that data recognition underperforms axis calibration and legend parsing due to the technical difficulties in analyzing overlapped data points.

Table 2 Evaluation metrics of automated data processing.

Full size table

Table processing

Fatigue data in fewer than 5% articles are reported in tables. Tables are thus used in this work only to verify the data extracted from figures. Tables containing parameters of AM, processing, testing as well as static mechanical and fatigue properties are of interest, which can be identified from the table captions. Tables in XML/HTML files are parsed by table extractor³⁹ whereas those embedded in the PDFs are processed manually. The evaluation metrics of table data extraction are summarized in Table 2. The F1 score is 60%, which is not high since the data of non-AM alloys or data from external references are included. Combining text information in processing data in the tables could improve performance.

Text processing

Text processing includes text classification and data extraction (Fig. 3b). Structured text files in the XML/HTML format are processed using our in-house parsing codes TEXTract (adapted to the standard styles provided by the publishers) and in combination with the Python packages xml.dom.minidom for XML and BeautifulSoup for HTML. Text is extracted from PDFs by PDFDataExtractor⁴⁰ if the XML/HTML files are not available.

Text classification is conducted for abstracts and paragraphs using the NLP library Simple Transformer. The Robustly Optimized BERT Pretraining Approach (RoBERTa)³⁶, an improved model of the pre-trained Bidirectional Encoder Representation from Transformers (BERT)⁴¹, is used to transform text sequences into embedding vectors of abstract or paragraphs. The embedding vectors are passed to a fully connected neural network with one linear layer and output neurons corresponding to class labels. The RoBERTa and classification models are integrated into a classification module in Simple Transformer. The model is trained on AM fatigue articles with the AdamW⁴² optimizer using a cross-entropy loss function and a learning rate of 4 × 10⁻⁵. Abstract classification identifies AM fatigue articles from the search outputs of WoS based on a manually-labeled dataset of 500 abstracts, with class labels of ‘AM fatigue’ and ‘Non-AM fatigue’. Paragraphs are classified into ‘Method’ and ‘Non-method’ classes and passed to data extraction. ‘Method’ paragraphs include information of materials, parameters of AM, processing, and testing. The training set consisting of 3,350 paragraphs from 82 articles is constructed from sections with keywords of ‘method’, ‘fabrication’, ‘process’, ‘test’, and ‘experiment’ in their headings. Both abstract and paragraph datasets are split into training/testing/validation sets with a ratio of 0.8:0.1:0.1.

Data including the types of materials, parameters of AM, processing, testing, and static mechanical properties are extracted from text. To identify the types of materials, the chemical named entity recognition (NER) of ChemDataExtractor 2.0⁴³ is applied together with a dictionary of the trade name of alloys, prepared according to MMPDS-17⁵ and the domain knowledge. The scope of AM materials recognition contains title, abstract, and method paragraphs. For data entries of AM, processing, and testing, keywords are summarized and organized into regular expressions (REs) to extract data from the ‘Method’ paragraphs. In a specific domain such as AM fatigue, where the variants of keywords and sentence patterns for target data are limited, it is relatively easy to construct the REs. In practice, one physical quantity may be associated with several data entries. For example, ‘temperatures’ are relevant for specifications of AM procedures, heat treatment, and fatigue testing. Therefore, the extracted data are assigned to entries according to manually defined keywords in the current and previous sentences, such as ‘fabricate’ for AM procedures, ‘heat treat’ for heat treatment, and ‘test’ for fatigue testing. Static mechanical properties such as Young’s modulus, yield strength (YS), ultimate tensile strength (UTS), and elongation are identified by REs in the paragraphs of the ‘Method’ and subsequent sections. The evaluation metrics of text classification and data extraction are summarized in Table 2. Both abstract and paragraph classification gain an F1 score higher than 80%. The F1 score of data extraction is 63%, which is not high since it is difficult to effectively introduce the context information in the rule-based RE approach. The processing of figures, tables, and text thus achieves good performance in the tasks of axis calibration, legend parsing, and text classification. The performance of data extraction can be improved by refining the parsing rules, employing dependency parsing, or using advanced NLP models such as the Generative Pre-trained Transformer (GPT). GPT-3 is a large pre-trained language model with 175 billion parameters with improved performance of few-shot learning⁴⁴, which reduces the need for task-specific data and expertise in NLP. With fine-tuning, GPT-3 has the potential to extract structured data from complex scientific text with F1 score >80%⁴⁵. The capability of GPT-4 is further elevated, especially in complex tasks⁴⁶. Their applications to fatigue data remain to be explored.

Database integration and data correction

To construct the database, fatigue data extracted from figures should be correlated with data entries of materials, AM, processing, testing, and static mechanical properties extracted from text and tables. Most of the data entries do not vary in specific research reported in an article. Single values extracted for a specific data entry are assigned to all datasets related to the article. For data entries with multiple values, the assignment is made according to the legend labels.

Unlike static mechanical properties, fatigue data are more sensitive to fabrication, processing, and testing conditions, resulting in data dispersion. Consequently, although the F1 scores of data extraction can be improved by using advanced techniques, the performance may still be insufficient to establish high-quality databases for fatigue analysis in engineering. In this work, we address this issue through manual examination and correction. For fatigue data, we firstly correct data using our IMEX interface, and then print out the data for comparison with those in the source figures. For entries related to materials, AM, processing, testing, and static mechanical properties, we export the data to an EXCEL file and compare them with the PDF files. Besides data examination and correction, the manual work also involves figure selection and segmentation, and legend region selection. We extract the size and shape of specimens during the manual examination since most of them are presented in figures instead of text. Examining the text is the dominant part of manual work, and a domain expert can process 4–8 articles per hour. An automated multimodal (figures, tables, and texts) data annotation and correction system could reduce the workload. Standardized data reporting coordinated by the authors, publishers, and data users can also facilitate the construction of databases.

Fatigue data processing

In the experimental tests to measure the S-N and ε-N data, the amplitude (σ_a or ε_a) and the maximum (σ_max or ε_max) stress/strain are used, which can be related through

$${\sigma }_{{\rm{a}}}=\frac{{\sigma }_{{\rm{\max }}}-{\sigma }_{{\rm{\min }}}}{2}\quad {\rm{or}}\quad {\varepsilon }_{{\rm{a}}}=\frac{{\varepsilon }_{{\rm{\max }}}-{\varepsilon }_{{\rm{\min }}}}{2},$$

(4)

In the current study, the maxima (35% of the full database) are converted to amplitudes through the load ratio

$${R}_{\sigma }=\frac{{\sigma }_{{\rm{\min }}}}{{\sigma }_{{\rm{\max }}}}\quad {\rm{or}}\quad {R}_{\varepsilon }=\frac{{\varepsilon }_{{\rm{\min }}}}{{\varepsilon }_{{\rm{\max }}}},$$

(5)

$${\sigma }_{{\rm{a}}}=\frac{1-{R}_{\sigma }}{2}{\sigma }_{{\rm{\max }}}\quad {\rm{or}}\quad {\varepsilon }_{{\rm{a}}}=\frac{1-{R}_{\varepsilon }}{2}{\varepsilon }_{{\rm{\max }}}.$$

(6)

For the da/dN-ΔK data, the SIF range is

$$\Delta K={K}_{{\rm{\max }}}-{K}_{{\rm{\min }}}.$$

(7)

For analysis, scattered fatigue data of the S-N and ε-N relations are fitted by assuming a log-normal distribution with a constant variance by following ASTM E739-10⁴⁷, that is

$${\log }_{10}N=A+{B\log }_{10}{{\rm{\sigma }}}_{{\rm{a}}}\,{\rm{o}}{\rm{r}}\,{\log }_{10}N=A+{B\log }_{10}{{\rm{\varepsilon }}}_{{\rm{a}}},$$

(8)

where A and B are the fitting parameters. The S-N relation can be converted to the form of the Basquin’s equation

$${\sigma }_{{\rm{a}}}={A}_{1}{\left(N\right)}^{{B}_{1}},$$

(9)

where A₁ and B₁ are the fitting parameters.

The da/dN-ΔK data are fitted by the Paris equation

$${\rm{d}}a/{\rm{d}}N=C{(\Delta K)}^{m},$$

(10)

where C and m are the fitting parameters.

Data Records

The FatigueData-AM2022 database⁴⁸ collects experimental S-N, ε-N, and da/dN-ΔK data of AM alloys. The studies on structural components or architectured materials are not included^49,50. Data are collected for fatigue tests under uniaxial or bending conditions. Fatigue performance under variable, torsional, and multiaxial loads are reported in only a few studies at this stage and are not incorporated to maintain data integrity. The FatigueData-AM2022 database⁴⁸ is available as MAT (MATLAB), JSON, and EXCEL files at https://doi.org/10.6084/m9.figshare.22337629. The MAT and JSON files are formatted into a hierarchical tree structure. The tree nodes that directly store data values are called data entries. Data entries include string and numeric data types. Text data such as titles, types of AM, and fatigue tests are stored as strings. Data with multiple strings such as authors, countries, and institutions are stored as string arrays. The year of publication is defined as a numeric number, and other numeric data such as fatigue data, parameters of AM, and load ratios are stored in the form of numeric arrays. The tree nodes used to group data entries are called data structs. Multiple structs such as articles or fatigue datasets are arranged into struct arrays. To facilitate programming implementation and data acquisition, keys are defined for data entries, structs, and struct arrays (Fig. 4 and Tables 3–5).

Table 3 Contents of the struct of ‘metadata’ and children nodes of ‘fatigue datasets’.

Full size table

Table 4 Contents of the struct of ‘AM parameters’, dependent on the types of AM.

Full size table

Table 5 Contents of the struct in the ‘processing parameters’ struct array, dependent on the types of processing.

Full size table

The structure of the FatigueData-AM2022 database⁴⁸ is summarized in Fig. 4. The root node is the database, containing children nodes of articles and a default unit system (e.g. MPa for stress, °C for temperature, μm for layer thickness, W for power). Raw numeric data are converted to the default units of data entries. Articles are stored as a struct array, and each article contains two structs of metadata and scientific data. Metadata contains data entries such as the titles and authors of articles. Scientific data store a struct array of fatigue datasets, each of which is obtained from experimental tests under different conditions. A fatigue dataset contains 6 structs (fatigue, materials, AM, processing, testing, and static mechanical properties), under which multiple data entries, structs, or struct arrays are defined (Table 3). A rating score is assigned to each fatigue dataset to measure the quality of data, which will be explained in the next section. The struct of AM parameters and processing parameters depends on their type, as shown in Tables 4, 5, respectively. The processing parameters are organized as a struct array, ‘proc_para’, for it may contain multiple steps. The processing sequence is recorded in the ‘proc_seq’ array. The processing parameters can be identified in the ‘proc_para’ array through the index entry in ‘proc_seq’.

The terminology of data types is largely inherited from MATLAB (the MAT file). Exceptions are string arrays and the struct array of processing parameters, which correspond to cell arrays in the MAT file. For the JSON file, the struct is defined as a dictionary, and all types of arrays are defined as lists. The FatigueData-AM2022 database⁴⁸ is also flattened into an EXCEL file, including 4 worksheets. The worksheets of ‘S-N’, ‘e-N’, and ‘dadn’ store S-N, ε-N, and da/dN-ΔK data, respectively. In these 3 worksheets, each row stores the index of a fatigue dataset and a data descriptor (S/ε, N, and the run-out flag for ‘S-N’/‘e-N’, da/dN and ΔK for ‘dadn’). The da/dN-ΔK data extracted by color stores all matched pixels. The number of data points exceeds the maximum number of rows allowed by EXCEL (1,048,576). As a result, 500 data points are sampled from each dataset and then recorded. In the 4th worksheet of ‘parameter’, each row stores the index of a fatigue dataset and its contents. Each column corresponds to a data entry. Data in the ‘parameter’ worksheet is linked to the other three through the index of fatigue datasets.

With the database structure outlined above, the data entries are explained here in detail. The ‘fatigue data’ array store N or ΔK in the first column, and the values of σ_a, ε_a or da/dN in the second column. ε_a stands for the amplitude of total strain including the elastic or plastic components. The third column stores the run-out flag for S-N and ε-N data, where ‘1’ denotes the test stops before failure (run-out) and ‘0’ denotes failure. The fatigue life and the FCG rate are sensitive to material anisotropy. In this work, the direction of specimens is measured by an angle between the building platform in AM and the loading direction⁵¹. The size effect of AM specimens could be significant due to the limited accuracy of printing, the presence of defects, and residual stress^52,53,54. The size of the critical cross-section stores the diameter for specimens with circular cross-sections, the outer and inner diameters for those with annular cross-sections, and the width and thickness for those rectangular cross-sections, respectively. The shapes of the cross-sections are stored in the description of specimens (‘spec_desc’). In the numeric arrays of other data entries, a single value stands for a specific value or the mean, and two values stand for the lower and upper bound, respectively.

For the convenience of comparison between string data, unified nomenclature is used for data entries such as types of AM, materials, machines, affiliations, and funding agencies. 98% of the AM types can be classified into four categories of laser powder bed fusion (L-PBF), electron beam powder bed fusion (E-PBF), powder-based directed energy deposition (P-DED), and wire-based directed energy deposition (W-DED). Other AM types are recorded by their names such as binder jetting and metal extrusion. The default feedstock type is ‘powder’ for L-PBF, E-PBF, and P-DED, and ‘wire’ for W-DED.

In our database, data entries not reported explicitly are recorded as empty arrays (MAT), lists (JSON), strings (MAT and JSON), or cells (EXCEL). ‘As-built’ is assigned to surface treatment, ‘NHT’ is assigned to heat treatment, and ‘25 °C’ is assigned to preheat temperature if they are not applied (NA). We also assume that the testsing are uniaxial and conducted under an ambient environment (25 °C, air) with a stress concentration factor, K_t = 1 if not specified. The default load control is ‘force’ for S-N, ‘strain’ for ε-N, ‘load’ for da/dN-ΔK, and ‘displacement’ for very high-cycle fatigue (VHCF) irrespectively of data types. It is suggested that optional procedures or settings should be stated as NA in reporting fatigue data if not specifically stated.

In summary, the FatigueData-AM2022 database⁴⁸ covers 116 types of AM alloys in total. 459 articles report 1,610 S-N datasets with 15,146 data points, 79 articles report 236 ε-N datasets with 1,840 data points, and 135 articles report 614 da/dN-ΔK datasets (Fig. 4). 65% of data are S-N data used to measure fatigue life in the HCF regime and for safe-life design^55,56,57. Critical components in the aerospace and power industry under harsh conditions also require ε-N and da/dN-ΔK data.

Technical Validation

The performance metrics of figure, table, and text processing show that the F1 scores of automated extraction are ~60–90% (Table 2). All data records are manually examined and corrected to produce a high-quality database. Subsequent inspection of 50 randomly chosen articles shows that the precision is improved to be >98%.

One of the practical issues in extracting data from figures is the distortion of symbols and axis ticks after pixelation, which makes it difficult to determine the positions of centroids with high accuracy. Comparing S-N and ε-N data extracted from figures and those from the tables, if both of them were published, shows inconsistency in less than 5% of the 40 articles due to the uncertainties in locating the data points. The fitting parameters of data using Eq. 8 are compared with values reported in articles, also showing inconsistency <5%.

Representative data and their statistics are plotted in Fig. 5 for illustration and the quality of data is assessed by the domain knowledge. S-N data for the 4 mostly reported AM alloys (Ti-6Al-4V, 316 L, AlSi10Mg, and IN718) are included in Fig. 5a and the fatigue life decreases as the stress amplitude increases. The fatigue strength of Ti-6Al-4V and IN718 alloys are superior, followed by 316 L and AlSi10Mg (Fig. 5a). The statistic of materials, types of AM, and surface treatment of S-N datasets are summarized in Fig. 5b. Ti-6Al-4V occupies 90% of the data for AM titanium alloys, and IN718 occupies 77% for AM nickel alloys. The high percentage of occupations stems from their dominance in conventional titanium and nickel alloys for the high strength and mature manufacturing procedures^58,59. Though AlSi10Mg is not very popular among conventional aluminum alloys, it accounts for 66% of AM aluminum alloys due to its good printability⁶⁰. 316 L accounts for only 43% of AM steels and other types also take a share, signaling the diversity in the applications of steels⁶¹. It is noted that most of the fatigue specimens are prepared by PBF, especially L-PBF (83%), which is the most mature and commercialized AM technique (Fig. 5b)⁶¹. The layer-by-layer printing process and non-equilibrium nature of AM may result in poor surface quality, to which the S-N data are susceptible. Different types of surface treatment are investigated (Fig. 5b).

Representative ε-N and da/dN-ΔK data are shown in Fig. 5c, d. The fatigue life decreases as strain amplitude increases (Fig. 5c), and the FCG rate increases with the SIF range (Fig. 5d). The quality of data is further assessed by the relationship between fatigue data and other properties of the alloys, which is demonstrated here using the S-N data as an example. The relation between fatigue strength (σ_f) and UTS (σ_u), and the effects of loading and processing conditions are well-known for conventional alloys^2,62,63. Fig. 6a confirms the positive correlation between σ_f and σ_u, that is, high σ_u indicates high resistance to fatigue by suppressing damage accumulation. The ratio between σ_f and σ_u (0.2–0.7) for AM alloys is close to that of conventional alloys (0.25–0.65)².

S-N tests are commonly conducted at specific stress ratios, R_σ, which could introduce the effect of mean stress, σ_m = (σ_max + σ_min)/2. The relation between R_σ and σ_m can be derived from Eq. 5, which is ${R}_{\sigma }=1-\frac{2{\sigma }_{{\rm{a}}}}{{\sigma }_{{\rm{m}}}+{\sigma }_{{\rm{a}}}}$. Figure 6b shows the performance of AM Ti-6Al-4V tested under R_σ = −1 (σ_m = 0) and 0.1 (σ_m = 0.55σ_max). The mean tensile stress downgrades the fatigue strength even under strong data dispersion, which agrees with the domain knowledge of conventional alloys as well.

There are limitations in the applications of fatigue databases constructed from open sources in comparison with the datasets released from authoritative institutions. In addition to the diversity in material fabrication, sample preparation, and surface finishing of the specimens, the incompatibility in testing standards and incompleteness of records also lead to difficulties in improving the quality of data, as well as the integration with authoritative databases or new data reported in the literature. A rating system is introduced for the data to be used in the design of structural integrity. Data entries can be assigned with weights according to the domain knowledge or their covariance with fatigue data. Additional measures such as the number of fatigue data⁴⁷, the number of citations of the publication, and the accuracy of data extraction could also be introduced. For each fatigue dataset, a rating score between 0 and 1 is computed as the weighted summation of non-empty entries. The scoring algorithm is subjective, and we leave this work to data users. Here, for the sake of simplicity, we assume equal weights for all the entries (Fig. 7a). Surface and heat treatment (including HIP and NHT) are regarded as two separate entries of processing parameters. We find that most datasets are rated with scores ranging from 0.5 to 0.9 since not all of the data entries are documented. 87% of the datasets have scores higher than 0.6, which contain essential information such as types of materials, types of AM, and fatigue testing. Fill rates (FRs) of data entries counted over all the datasets measures the quality of the database (Fig. 7b), which is expected to be not high for the diversity of data sources. The types of materials (e.g. Ti-6Al-4V, IN718), AM (e.g. PBF, DED), fatigue testing (e.g. uniaxial, bending), and load ratios are essential information and are provided in most AM fatigue articles. For the data entries related to AM and processing, the FRs of AM machine, layer thickness, the direction of specimens, heat treatment, and surface treatment are higher than 70% whereas other entries are less filled. For fatigue testing, 80% articles reported the loading frequency since it could vary by 4 orders of magnitudes in practice. The effects of frequency could be significant as the heating effect is introduced, for example, by plastic dissipation in LCF or vibration in VHCF. In addition, the strain rate is proportional to the frequency, to which the damage processes could be susceptible, and in a corrosive environment, material degradation is rate-dependent as well^64,65. Surprisingly, only 40% articles reported the standard of fatigue testing they followed. Considering the variation in microstructures and (as-built) surface conditions, the implementation of traditional fatigue testing standards for AM fatigue research should be assessed⁶⁶. New designs of specimens, e.g. in miniature types⁶⁷, and testing techniques such as VHCF are also worth further discussion. FRs of static mechanical properties are no more than 50% since the data dispersion is not high.

Our results highlight the need for standards of AM fatigue testing as well as norms of reporting data in journals, conference proceedings, and technical reports, which are crucial for the development of high-quality databases and data-centric research. A unified language of fatigue data (ULFD) is suggested here according to related standards for AM, processing, and testing⁶⁸. The current database can be exported using the ULFD (Fig. 7c), which not only outlines the workflow of database construction but also guides data analysis and experimental planning.

Usage Notes

Data dispersion and system deviation should be noted while analyzing fatigue data reported in the literature. For example, the fatigue strength of AM Ti-6Al-4V is not only inferior to its conventional counterpart as reported in the NIMS database but also shows a larger scatter (Fig. 8a). Comparison to the MMPDS data leads to the same conclusion. To quantify the degree of dispersion, the log-normal probability density function ${p}_{{\rm{f}}}\left(x\right)=\frac{1}{s\sqrt{2\pi }}\exp \left[-\frac{1}{2}{\left(\frac{ln(x)-\mu }{s}\right)}^{2}\right]$ is assumed and fitted using Eq. 9 to compute the mean, μ, and variance, s of the fatigue strength after 10⁶ cycles (Fig. 8b). The values of s for the datasets range from 1.6 × 10⁻⁴ to 45.1 × 10⁻⁴, most of which are higher than the values in NIMS 1100 class (3.1 × 10⁻⁴) and 900 class for Ti-6Al-4V (1.7 × 10⁻⁴). AM data are more scattered than the NIMS data regardless of the types of materials, which can be attributed to the diversity in material microstructures including the defects. Optimizing AM parameters or post-processing procedures could reduce the dispersion of fatigue performance and better serve critical applications. Although displaying a more scattered nature compared to authoritative databases, AM data collected from the literature still provide key insights into the material properties and guidelines for fatigue design (Fig. 5).

In addition to data dispersion, Fig. 8b shows that system deviation exists among fatigue data from different studies. For example, hot isostatic pressing (HIP) is an effective high-pressure, high-temperature procedure to reduce internal (porous) defects in alloys, which improves their HCF performance by suppressing crack initiation. The effect of HIP on fatigue performance is compared to that of ordinary heat treatment that operates at lower temperatures without pressurization (Fig. 8c). The two sets of data can hardly be distinguished due to not only data dispersion, but also system deviation resulting from differences in the specimen preparation and testing procedures. To resolve this issue, one of the published HIP fatigue data is selected as a reference. All HIP fatigue data are then fitted by Basquin’s equation (Eq. 9) and scaled to the reference. The scaling factor for σ_a at specific cycles N is calculated as

$$\alpha =\frac{{A}_{1}^{{\rm{ref}}}{\left(N\right)}^{{B}_{1}^{{\rm{ref}}}}}{{A}_{1}{\left(N\right)}^{{B}_{1}}},$$

(11)

where the superscript ‘ref’ denotes the reference data. The heat treatment (HT) data are then scaled using the value of α for the HIP data reported in the same articles, that is, ${\sigma }_{{\rm{a}}}^{{\rm{HT}},{\rm{scaled}}}=\alpha {\sigma }_{{\rm{a}}}^{{\rm{HT}}}$. The results clearly show that HIP outperforms HT in improving the HCF performance, where fatigue life is controlled by crack initiation (Fig. 8d). However, HT seems to be superior for LCF (N < 10⁴), where plastic deformation is crucial. This can be explained by the process of grain coarsening in HIP, which weakens the resistance of alloys to plastic deformation⁶⁹.

Our database lays the ground for data-driven material screening and life estimation of AM components, offering cost-effective solutions for engineering design. Critical analysis of the entries in the database offers key insights into technical roadmapping⁷⁰, which could optimize the investment strategy in research and development. Our database can also serve as a training dataset for NLP, ML, and CV models to improve the performance of model predictions. In addition, the current approach can be extended to other information on AM alloys and fatigue data of other alloys. However, extracting data from earlier literature for conventional alloys could suffer from challenges in processing image-based PDFs, where both text and figures/tables are of low quality and difficult to extract. Future work will focus on improving the level of automation of the current workflow and addressing the problems of parsing early documents.

Code availability

The scripts utilized to extract information from figures, tables, and text are mainly based on open-source codes such as ChemDataExtractor 2.0⁴³, table extractor³⁹, and Simple Transformer (https://simpletransformers.ai/), respectively. The in-house scripts for data extraction and analysis are publicly released at the GitHub repository (https://github.com/xuzpgroup/ZianZhang/tree/main/FatigueData-AM2022), which can be used by acknowledging the current article and under the MIT license⁷¹. These scripts include a detailed, step-by-step tutorial for loading and analyzing the dataset in the repository.

References

Suresh, S. Fatigue of Materials (Cambridge University Press, 1998).
Stephens, R. I., Fatemi, A., Stephens, R. R. & Fuchs, H. O. Metal Fatigue in Engineering (John Wiley & Sons, 2000).
Agrawal, A. et al. Exploration of data science techniques to predict fatigue strength of steel from composition and processing parameters. Integr. Mater. Manuf. Innov. 3, 90–108 (2014).
Article Google Scholar
Yaghoobi, M. et al. PRISMS-fatigue computational framework for fatigue analysis in polycrystalline metals and alloys. npj Comput. Mater. 7, 38 (2021).
Article ADS CAS Google Scholar
Battelle Memorial Institute. Metallic Materials Properties Development and Standardization (MMPDS-17) (Battelle Memorial Institute, 2022).
Furuya, Y., Nishikawa, H., Hirukawa, H., Nagashima, N. & Takeuchi, E. Catalogue of NIMS fatigue data sheets. Sci. Technol. Adv. Mater. 20, 1055–1072 (2019).
Article PubMed PubMed Central Google Scholar
Kononova, O. et al. Opportunities and challenges of text mining in materials research. iScience 24, 102155 (2021).
Article ADS PubMed PubMed Central Google Scholar
Herzog, D., Seyda, V., Wycisk, E. & Emmelmann, C. Additive manufacturing of metals. Acta Mater. 117, 371–392 (2016).
Article ADS CAS Google Scholar
Nadammal, N. et al. Critical role of scan strategies on the development of microstructure, texture, and residual stresses during laser powder bed fusion additive manufacturing. Addit. Manuf. 38, 101792 (2021).
CAS Google Scholar
Li, Y., Liang, X., Yu, Y., Wang, D. & Lin, F. Review on additive manufacturing of single-crystal nickel-based superalloys. Chin. J. Mech. Eng.: Addit. Manuf. Front. 1, 100019 (2022).
Google Scholar
Sanaei, N. & Fatemi, A. Defects in additive manufactured metals and their effect on fatigue performance: A state-of-the-art review. Prog. Mater. Sci. 117, 100724 (2021).
Article CAS Google Scholar
Shao, S., Khonsari, M. M., Guo, S., Meng, W. J. & Li, N. Overview: Additive manufacturing enabled accelerated design of Ni-based alloys for improved fatigue life. Addit. Manuf. 29, 100779 (2019).
CAS Google Scholar
Zhao, L. et al. Review on the correlation between microstructure and mechanical performance for laser powder bed fusion AlSi10Mg. Addit. Manuf. 56, 102914 (2022).
CAS Google Scholar
Molaei, R. et al. Fatigue of additive manufactured Ti-6Al-4V, Part II: The relationship between microstructure, material cyclic properties, and component performance. Int. J. Fatigue 132, 105363 (2020).
Article CAS Google Scholar
Cao, M., Liu, Y. & Dunne, F. P. A crystal plasticity approach to understand fatigue response with respect to pores in additive manufactured aluminium alloys. Int. J. Fatigue 161, 106917 (2022).
Article CAS Google Scholar
Zhan, Z. & Li, H. Machine learning based fatigue life prediction with effects of additive manufacturing process parameters for printed SS 316L. Int. J. Fatigue 142, 105941 (2021).
Article CAS Google Scholar
Maleki, E. et al. On the efficiency of machine learning for fatigue assessment of post-processed additively manufactured AlSi10Mg. Int. J. Fatigue 160, 106841 (2022).
Article CAS Google Scholar
Li, P., Warner, D., Fatemi, A. & Phan, N. Critical assessment of the fatigue performance of additively manufactured Ti-6Al-4V and perspective for future research. Int. J. Fatigue 85, 130–143 (2016).
Article CAS Google Scholar
Chern, A. H. et al. A review on the fatigue behavior of Ti-6Al-4V fabricated by electron beam melting additive manufacturing. Int. J. Fatigue 119, 173–184 (2019).
Article CAS Google Scholar
Afkhami, S., Dabiri, M., Piili, H. & Björk, T. Effects of manufacturing parameters and mechanical post-processing on stainless steel 316L processed by laser powder bed fusion. Mater. Sci. Eng., A 802, 140660 (2021).
Article CAS Google Scholar
Lesperance, X., Ilie, P. & Ince, A. Very high cycle fatigue characterization of additively manufactured AlSi10Mg and AlSi7Mg aluminium alloys based on ultrasonic fatigue testing. Fatigue Fract. Eng. Mater. Struct. 44, 876–884 (2021).
Article CAS Google Scholar
UNESCO. UNESCO Recommendation on Open Science https://doi.org/10.54677/MNMH8546 (2021).
Woelfle, M., Olliaro, P. & Todd, M. H. Open science is a research accelerator. Nat. Chem. 3, 745–748 (2011).
Article CAS PubMed Google Scholar
Kononova, O. et al. Text-mined dataset of inorganic materials synthesis recipes. Sci. Data 6, 203 (2019).
Article PubMed PubMed Central Google Scholar
Court, C. J. & Cole, J. M. Auto-generated materials database of Curie and Néel temperatures via semi-supervised relationship extraction. Sci. Data 5, 180111 (2018).
Article CAS PubMed PubMed Central Google Scholar
Kumar, P., Kabra, S. & Cole, J. M. Auto-generating databases of yield strength and grain size using ChemDataExtractor. Sci. Data 9, 292 (2022).
Article CAS PubMed Central Google Scholar
Chen, S. et al. Fatigue dataset of high-entropy alloys. Sci. Data 9, 381 (2022).
Article CAS PubMed PubMed Central Google Scholar
Fortunato, S. et al. Science of science. Science 359, eaao0185 (2018).
Article PubMed PubMed Central Google Scholar
ISO/ASTM 52900:2015. Additive Manufacturing–General Principles–Terminology (ISO/ASTM International, 2015).
ASTM F3413–19. Guide for Additive Manufacturing–Design–Directed Energy Deposition (ASTM International, 2019).
Thompson, S. M., Bian, L., Shamsaei, N. & Yadollahi, A. An overview of Direct Laser Deposition for additive manufacturing; Part I: Transport phenomena, modeling and diagnostics. Addit. Manuf. 8, 36–62 (2015).
Google Scholar
Gu, D. D., Meiners, W., Wissenbach, K. & Poprawe, R. Laser additive manufacturing of metallic components: Materials, processes and mechanisms. Int. Mater. Rev. 57, 133–164 (2012).
Article CAS Google Scholar
Frazier, W. E. Metal additive manufacturing: A review. J. Mater. Eng. Perform. 23, 1917–1928 (2014).
Article CAS Google Scholar
Murr, L. E. et al. Metal fabrication by additive manufacturing using laser and electron beam melting technologies. J. Mater. Sci. Technol. 28, 1–14 (2012).
Article CAS Google Scholar
Suwanpreecha, C. & Manonukul, A. A review on material extrusion additive manufacturing of metal and how it compares with metal injection moulding. Metals 12, 429 (2022).
Article CAS Google Scholar
Liu, Y. et al. RoBERTa: A robustly optimized BERT pretraining approach. Preprint at https://arxiv.org/abs/1907.11692 (2019).
Kim, E. et al. Machine-learned and codified synthesis parameters of oxide materials. Sci. Data 4, 170127 (2017).
Article CAS PubMed PubMed Central Google Scholar
Berchmans, D. & Kumar, S. Optical character recognition: An overview and an insight. 2014 Int. Conf. on Control, Instrumentation, Commun. Comput. Technol., 1361–1365 (2014).
Jensen, Z. et al. A machine learning approach to zeolite synthesis enabled by automatic literature data extraction. ACS Cent. Sci. 5, 892–899 (2019).
Article CAS PubMed PubMed Central Google Scholar
Zhu, M. & Cole, J. M. PDFDataExtractor: A tool for reading scientific text and interpreting metadata from the typeset literature in the portable document format. J. Chem. Inf. Model. 62, 1633–1643 (2022).
Article CAS PubMed PubMed Central Google Scholar
Devlin, J., Chang, M. W., Lee, K. & Toutanova, K. BERT: Pre-training of deep bidirectional transformers for language understanding. Preprint at https://arxiv.org/abs/1810.04805 (2018).
Loshchilov, I. & Hutter, F. Decoupled weight decay regularization. Preprint at https://arxiv.org/abs/1711.05101 (2017).
Mavracic, J., Court, C. J., Isazawa, T., Elliott, S. R. & Cole, J. M. ChemDataExtractor 2.0: Autopopulated ontologies for materials science. J. Chem. Inf. Model. 61, 4280–4289 (2021).
Article CAS PubMed Google Scholar
Brown, T. et al. Language models are few-shot learners. Adv. Neural Inf. Process. Syst. 33, 1877–1901 (2020).
Google Scholar
Dunn, A. et al. Structured information extraction from complex scientific text with fine-tuned large language models. Preprint at https://arxiv.org/abs/2212.05238 (2022).
OpenAI. GPT-4 technical report. Preprint at https://arxiv.org/abs/2303.08774 (2023).
ASTM E739–10. Standard Practice for Statistical Analysis of Linear or Linearized Stress–Life (S–N) and Strain–Life (ε–N) Fatigue Data (ASTM International, 2015).
Zhang, Z. & Xu, Z. Fatigue database of additively manufactured alloys. figshare. https://doi.org/10.6084/m9.figshare.22337629 (2023).
Li, P., Warner, D. & Phan, N. Predicting the fatigue performance of an additively manufactured Ti-6Al-4V component from witness coupon behavior. Addit. Manuf. 35, 101230 (2020).
CAS Google Scholar
Gu, D. et al. Material-structure-performance integrated laser-metal additive manufacturing. Science 372, eabg1487 (2021).
Article CAS PubMed Google Scholar
Xu, Z., Liu, A. & Wang, X. Fatigue performance and crack propagation behavior of selective laser melted AlSi10Mg in 0°, 15°, 45° and 90° building directions. Mater. Sci. Eng., A 812, 141141 (2021).
Article CAS Google Scholar
Murchio, S. et al. Additively manufactured Ti-6Al-4V thin struts via laser powder bed fusion: Effect of building orientation on geometrical accuracy and mechanical properties. J. Mech. Behav. Biomed. Mater. 119, 104495 (2021).
Article CAS PubMed Google Scholar
Li, P. et al. Towards predicting differences in fatigue performance of laser powder bed fused Ti-6Al-4V coupons from the same build. Int. J. Fatigue 126, 284–296 (2019).
Article CAS Google Scholar
Levkulich, N. et al. The effect of process parameters on residual stress evolution and distortion in the laser powder bed fusion of Ti-6Al-4V. Addit. Manuf. 28, 475–484 (2019).
CAS Google Scholar
Forschungskuratorium Maschinenbau (FKM). FKM-Guideline: Analytical Strength Assessment of Components in Mechanical Engineering (VDMA Verlag, 2003).
EN 1993–1–9. Eurocode 3: Design of Steel Structures–Part 1–9: Fatigue (European Committee for Standardization, 2005).
ANSI/ASME B106.1 M–1985. Design of Transmission Shafting (The American Society of Mechanical Engineers, 1985).
Leyens, C. & Peters, M. Titanium and Titanium Alloys: Fundamentals and Applications (John Wiley & Sons, 2003).
Paulonis, D. F. & Schirra, J. J. Alloy 718 at Pratt & Whitney: Historical perspective and future challenges. Superalloys 718, 625, 706 and Various Derivatives, 13–23 (2001).
Aboulkhair, N. T. et al. 3D printing of aluminium alloys: Additive manufacturing of aluminium alloys using selective laser melting. Prog. Mater. Sci. 106, 100578 (2019).
Article CAS Google Scholar
Narasimharaju, S. R. et al. A comprehensive review on laser powder bed fusion of steels: Processing, microstructure, defects and control methods, mechanical properties, current challenges and future trends. J. Manuf. Process. 75, 375–414 (2022).
Article Google Scholar
Liu, R., Zhang, P., Zhang, Z., Wang, B. & Zhang, Z. A practical model for efficient anti-fatigue design and selection of metallic materials: I. Model building and fatigue strength prediction. J. Mater. Sci. Technol. 70, 233–249 (2021).
Article Google Scholar
Liu, R., Zhang, P., Zhang, Z., Wang, B. & Zhang, Z. A practical model for efficient anti-fatigue design and selection of metallic materials: II. Parameter analysis and fatigue strength improvement. J. Mater. Sci. Technol. 70, 250–267 (2021).
Article Google Scholar
Tahmasbi, K., Alharthi, F., Webster, G. & Haghshenas, M. Dynamic frequency-dependent fatigue damage in metals: A state-of-the-art review. Forces Mech. 10, 100167 (2023).
Article Google Scholar
Milne, L., Gorash, Y., Comlekci, T. & MacKenzie, D. Frequency effects in ultrasonic fatigue testing (UFT) of Q355B structural steel. Procedia Struct. Integrity 42, 623–630 (2022).
Article Google Scholar
Hrabe, N. W. et al. Findings from the NIST/ASTM workshop on mechanical behavior of additive manufacturing components. Advanced Manufacturing Series (NIST AMS), 1–13 (2016).
Nicoletto, G. Smooth and notch fatigue behavior of selectively laser melted Inconel 718 with as-built surfaces. Int. J. Fatigue 128, 105211 (2019).
Article CAS Google Scholar
Wang, Z. et al. ULSA: Unified language of synthesis actions for the representation of inorganic synthesis protocols. Digital Discov. 163, 313–324 (2022).
Article Google Scholar
Schijve, J. Fatigue of Structures and Materials (Springer, 2009).
De Weck, O. L. Technology Roadmapping and Development: A Quantitative Approach to the Management of Technology (Springer Nature, 2022).
Saltzer, J. H. The origin of the “MIT license”. IEEE Ann. Hist. Comput. 42, 94–98 (2020).
Article Google Scholar

Download references

Acknowledgements

This study was supported by the National Natural Science Foundation of China through grants 11825203, 11832010, 11921002, 52090032, 12122204, and 11872150.

Author information

Authors and Affiliations

Tsinghua University, Applied Mechanics Laboratory and Department of Engineering Mechanics, Beijing, 100084, China
Zian Zhang & Zhiping Xu

Authors

Zian Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Zhiping Xu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Z.X. conceived and supervised the research. Z.Z. performed the work. Both authors participated in discussing the results and preparing the manuscript.

Corresponding author

Correspondence to Zhiping Xu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhang, Z., Xu, Z. Fatigue database of additively manufactured alloys. Sci Data 10, 249 (2023). https://doi.org/10.1038/s41597-023-02150-x

Download citation

Received: 04 January 2023
Accepted: 12 April 2023
Published: 02 May 2023
DOI: https://doi.org/10.1038/s41597-023-02150-x

This article is cited by

Prediction method of longitudinal surface settlement caused by double shield tunnelling based on deep learning
- Wentao Shang
- Yan Li
- Xiangrong Gao
Scientific Reports (2024)
Fatigue database of complex metallic alloys
- Zian Zhang
- Haoxuan Tang
- Zhiping Xu
Scientific Data (2023)