Introduction

The medicinal plant based formulations have long been used for the treatment of various human ailments since ancient times till modern days1,2. The unsustainable use of these plants has created a negative pressure on their biodiversity and availability; which also leads to unethical malpractices in the current times by the practitioners and traders of the medicinal plants3,4,5,6,7. During recent years, there have also been reports pertaining to the toxicity of various plant based formulations. Such complains, however, are also being attributed to a larger extent with the unethical malpractices such as mislabeling, fraudulently replacement as well as unintentional negligence8. Since most of the times the real identity of the plant remains spurious; therefore, unintended plant formulation is given to the patient, leading to toxicity8. Thus, the correct identification and authentication of medicinal plants for their safe use is the need of the hour and seeks global attention.

DNA barcoding9 has emerged as a modern reliable tool for the identification and authentication of individual medicinal and herbal plants at molecular level7,10,11. This however, requires extraction of PCR quality DNA from such material. Extraction of DNA from herbal and aromatic plants is often problematic, because these plants contain high levels of secondary metabolites including lipids, phenolic compounds, and viscous polysaccharides that can interfere with downstream molecular applications. The problem becomes even worse if the material under investigation is dried herb or herbal powder, since the quantity and purity of DNA often recovered from such material is not up to the mark, leading to the failure of downstream molecular applications12. Thus, the DNA isolation from such herbal materials so far heavily depends on expensive commercial kits3,12,13,14,15.

Here, we have developed a novel method for the extraction of PCR quality DNA from aromatic herbs and dried herbal powders, without the use of any expensive commercial DNA extraction kit. Our method is modified from the cetyltrimethylammonium bromide (CTAB) method16 and the key novel step in this method is the use of cow skim milk (0.1% fat) during the CTAB lysis of dried herbs and aromatic plants. Skim milk possibly acts by adsorbing the DNA and competing with other adsorption competitors and impurities present in the crude lysates17. In the second stage of our procedure, the skim milk adsorbed DNA is purified and separated from co-extracted impurities using routine phenol:chloroform extraction18.

Material and methods

The plant material

Various dried herbs, which often include the dried seed, stem part, flowers or any other part of various herbal plants, are being sold in the online marketplace in India. Since these herbs are dried, these are often stored in shops and other marketplace at room temperature. We ordered small quantities of a total of 18 randomly selected dried herbs from an online herbal marketplace (see Tables 1, 2, Fig. 1), which were delivered to us in individually sealed and labeled small packets by routine parcel at room temperature. Once arrived in the lab, a specific voucher code was given to each sample (see Table 2) and these were stored at room temperature until further processing. Further, the 35 fresh aromatic herbal plants (mostly the fresh leaves) were randomly picked up from countryside land near Hyderabad, India (Supplementary Fig. 1, Supplementary Table 3). The five ayurvedic powders included in this study were personally obtained by SKV from a traditional practitioner of Ayurveda (see Supplementary Fig. 2, Supplementary Table 4).

Table 1 Details of the 12 randomly selected dried herbs (Photographs of each of these herbs are provided in Fig. 1) used for the standardization of the DNA extraction procedure developed in this study.
Table 2 Various dried herbs included in this study to validate the DNA extraction procedure and the success of downstream molecular analysis.
Figure 1
figure 1

Dried herbs included in this study. Refer to Table 2 for the details.

Figure 2
figure 2

Agarose gel images showing the patterns of isolated DNA and amplified PCR products obtained from dried herbs included in this study. The DNA was isolated from 12 randomly selected dried herbs (Table 1) without (a) or with (c) addition of skim milk in the first step of DNA isolation as described in Methods section. The DNA obtained from both sets of experiments was subjected to PCR amplification using modified matK primers developed in this study (see material and methods section). The PCR band obtained from untreated samples are shown in panel ‘b’; and the PCR band obtained from skim milk treated samples are shown in panel ‘d’. The lanes marked as ‘+ve’ in panels ‘b’ and ‘d’ are the ‘positive’ control for PCR and the lanes marked as ‘–ve’ are the ‘internal negative control’ for PCR reactions. The molecular weight markers (Lane M) used are Thermo Scientific GeneRuler™ 1 kb DNA Ladder (panel ‘a’ and ‘c’) and Thermo Scientific GeneRuler™ 100 bp Plus DNA Ladder (panel ‘b’ and ‘d’). The uncropped, multiple original exposures of the full length agarose gels displayed here are shown in Supplementary Fig. 4a–d.

DNA extraction: the first step

One gram of plant material (dried herbs, fresh aromatic herbs or ayurvedic powders) was manually crushed in liquid nitrogen using a mortar and pestle. Four ml of CTAB isolation buffer (2% Cetyl trimethylammonium bromide, 1.4 M NaCl, 20 mM EDTA, 100 mM Tris–HCl, pH 8.0 and 1% Polyvinylpyrrolidone 40, all from Sigma Aldrich) and 100 μl of proteinase k (from 20 mg/ml stock; Sigma Aldrich) was added into the fine powder and the mix was transferred into 50 ml polypropylene tube. Four ml of fresh skim milk (0.1% fat) was then added to the tube followed by incubation at either 37 °C for overnight (in case of freshly picked up aromatic herbs), or at 56 °C for overnight (in case of dried herbs and ayurvedic powders) on a rotating wheel in hybridization oven. Four ml of milliQ water was added in the untreated samples instead of skim milk to keep the volumes and concentration of detergent etc. constant for the treated and untreated samples. After the overnight incubation, 5 ml of chloroform/isoamyl alcohol (24:1 v/v) was added to each sample, gently mixed for 10 min and centrifuged at 10,000 rcf for 10 min at 4 °C. The supernatant was transferred to a new 15 ml polypropylene tube, 0.1 volume of 7.5 M ammonium acetate was added and mixed by inverting the tube 4–5 times. After mixing, two volume of 100% chilled ethanol was added to the sample to precipitate the DNA upon centrifugation at 10,000 rcf for 10 min at 4 °C. A big white or colored pellet of skim milk adsorbed DNA and other precipitated impurities is obtained at this stage, which is then washed with 70% ethanol, air dried and resuspended in 1 ml TE buffer (10 mM Tris, pH 8.0, 1 mM EDTA).

DNA extraction: the second step

In the second step the precipitated skim milk adsorbed DNA was extracted to obtain the PCR quality DNA. For this, 1 ml CTAB buffer was added to the resuspended DNA solution from the above step, and incubated at room temperature for 5 min. The DNA was then extracted using routine phenol:chloroform procedure18. In the final step, the DNA was precipitated using 0.1 V of 3 M Sodium acetate (pH 5.2) and 0.6 V of chilled isopropanol. The pellet of pure DNA obtained was washed with 70% ethanol, air-dried and resuspended in 100 μl of TE buffer (10 mM Tris, pH 8.0, 0.1 mM EDTA) for downstream PCR amplification.

PCR using animal specific universal primers mcb398 and mcb869

In order to assess the presence of contaminating DNA from the skim milk, the DNA obtained from the untreated and skim milk treated plant samples were amplified using animal specific universal primers mcb398 and mcb869 previously developed by us19 to generate the species-specific molecular signature (Supplementary Fig. 5). The, amplification was carried out in a 20 μl reaction volume containing 10 μl of 2 × Emrald Taq PCR mix (Takara, USA), 5 pM of each primer (mcb398 and mcb869), 7 μl miliQ water, and 1.0 μl template DNA (30–50 ng). The PCR conditions were: an initial denaturation at 95 °C for 5 min, followed by 35 cycles each of denaturation at 94 °C for 1 min, annealing at 51 °C for 1 min, and extension at 72 °C for 2 min. The extension step at the 35th cycle was held for 10 min. The 5 μl of PCR amplicons were loaded in 1% agarose gel to check the PCR success. The PCR products obtained were sequenced using Sanger sequencing in 3730Xl DNA Analyser (Thermo Fisher Scientific, USA) on both strands in duplicate and the sequence resolved were blasted against nr databases of NCBI using BLAST program and the ID of the species of contaminating DNA from skim milk was established using the procedure as described before19.

Modified matK primers, PCR amplification and sequencing

The matK primers used in this study were modified from Jing et al.20 to give a robust amplification from a wide range of plant taxa. For this, more than 20 thousand available full and partial matK gene sequences were obtained from GenBank database of National Center for Biotechnology Information and aligned using ClustalX2 programme21 (data not shown). The aligned sequences were opened in BioEdit Sequence Alignment Editor V7.2.522 and manually searched for the highly conserved regions to develop universal degenerate primers. The final sequence of the primers used in this study was—matK472F: 5′-CCCRTYCATCTGGAAATCTTGGTTC-3′ and CCMB-matK1248R: 5′-GCTRTRATAATGAGAAAGATTTCTGC-3′. The PCR reactions were conducted in a 20 μl mixture system containing 10 μl 2 × Emrald Taq PCR mix (Takara, USA), 5 pM of each primer, 7 μL miliQ water, and 1.0 μL template DNA (30–50 ng). The PCR conditions were: an initial denaturation at 95 °C for 5 min, followed by 35 cycles each of denaturation at 94 °C for 1 min, annealing at 54 °C for 1 min, and extension at 72 °C for 2 min. The final extension at 72 °C was held for 10 min. The PCR products obtained were sequenced using Sanger sequencing in 3730Xl DNA Analyser (Thermo Fisher Scientific, USA) on both strands in duplicate and the sequence resolved were blasted against nr databases of NCBI using BLAST program23 and the ID of the plant material was established using the procedure as described before19.

Results and discussion

DNA isolation of herbal material can be a challenge due to inhibitors that result in sub-optimal DNA quality. This study takes a look at using skimmed-milk to enhancing the standard CTAB protocol in extracting DNA from herbal material. Here, we have shown that our modified CTAB extraction protocol using skim milk was able to generate quality DNA to successfully amplify PCR products. In addition, due to the issues surrounding the difficulty in the identification of herbal plant, we created our own primers to determine whether the herb materials used in the study have been correctly identified.

Skim milk is reported to prevent the degradation of DNA and its adsorption by soil colloids17,24. Skim milk was also found effective in extraction of DNA from recalcitrant soil that strongly adsorb DNA, indicating that it acts as the adsorbent competitors present in the crude lysates25. Taking the clue from above studies, we anticipated that the skim milk might also help in the extraction of pure DNA from dried aromatic herbs and herbal powders, where it is difficult to extract pure DNA due to the presence various secondary metabolites that inhibit the PCR reactions and hamper the downstream molecular applications. Our assumption was indeed found true, which was tested and validated in this study. The DNA was isolated from 12 randomly selected dried herbs (see “Methods”, Table 1, Fig. 1) with or without addition of skim milk in the first step of DNA isolation as described in Methods section. The DNA obtained from both sets of experiments was subjected to PCR amplification using modified matK primers developed in this study as described in Methods section. When the DNA was isolated from these dried herbs without addition of skim milk in the first step, only four samples produced desirable PCR product (Fig. 2b). The intensity of the PCR bands was low and multiple isolates from some of the samples did not produce reproducible results (Fig. 2b). However, skim milk treated samples consistently yielded PCR quality DNA from all of the 12 samples tested and a specific band of expected size and high intensity was obtained (Fig. 2d). Contamination with exogenous DNA from skim milk was negligible as the genomic DNA was not detected in skim milk treated internal blank controls (data not shown), and subsequent PCR of matK gene did not produce any detectable band in internal negative control (Fig. 2d). However, the presence of animal specific exogenous DNA in skim milk treated herb DNA samples was confirmed by the PCR amplification of these samples using animal specific universal primers (mcb398 and mcb869)19 (Supplementary Fig. 5). A band of specific size (472 bp) was obtained in skim milk treated dried herbs samples; however, no band was detected in the DNA from untreated dried herbs using these animal specific universal primers (Supplementary Fig. 5). The species origin of the contaminating DNA from skim milk was confirmed to that of Bos taurus (cow) as summarized in Supplementary Tables 1 and 2. This experiment confirmed that the presence of animal origin exogenous DNA from skim milk did not affect the matK based PCR amplication of plant DNA for downstream analysis.

The applicability of this method of DNA isolation from herbs was further assessed and validated with an additional 35 randomly picked up fresh aromatic herb samples collected from countryside land near Hyderabad, India (Supplementary Fig. 1, Supplementary Table 3), six additional dried herbs (total 18, Table 2) obtained from online herbal marketplace (Fig. 1, Table 2), and five herbal powders obtained from a traditional practitioner of Ayurveda (Supplementary Fig. 2, Supplementary Table 4). For all the samples tested, our method successfully yielded PCR quality DNA, which was subsequently amplified and sequenced using matK primers confirming that the DNA obtained was good enough for the downstream PCR applications (All nucleotide Sequences deposited in NCBI database as accession Nos MN006706 to MN006768). The matK based DNA typing9 was also able to establish the identity of the fresh aromatic herbs (Supplementary Fig. 1) either at the species level or at the genus level with high BLAST scores as shown in Supplementary Table 3.

Furthermore, we also checked the authenticity of the dried herbs and herbal powders included in this study using matK based DNA typing. It was found that of the 18 dried herbs tested, a total of 15 were identified as the same herb as it was mentioned on its label by online seller; however, plant species origin of one of the dried samples, which was sold to us in online marketplace with the spurious name ‘Dashmool Bharad’ (Sample—H7), was found to be the that of Tribulus sp. (Fig. 1, Table 2). Similarly, the identity of two other herbal samples (Samples—H12 and H21 in Fig. 1) was found spurious (see footnotes in Table 2) either due to mislabeling or fraudulently replacement, indicating the incidences of malpractices in online herbal marketplace.

Furthermore, three of the five ayurvedic powders were also identified as that of the same herb’s origin as labeled by local ayurvedic practitioner; however, the identity of two of the samples was found spurious (Supplementary Fig. 2, Supplementary Table 4). These findings were confirmed by careful examination of high BLAST scores of the IDs of each specimen tested with multiple specimen vouchers entries in ncbi database, and also by inclusion of reference samples for some of the specimen tested in this study (Supplementary Fig. 3, Supplementary Table 5). These results provided initial DNA based evidence of the instances of malpractices in online herbal marketplace, these findings; however, need to be confirmed by comparing with a high fidelity reference database of authentic, dried medicinal herbs being used in various systems of plants based medicines around the world, which still needed to be done. Further investigation on such mislabeling or fraudulently replacement of dried herbs in online herbal marketplace will be presented elsewhere in detail as this is beyond the scope of this work.

This study provides a novel, validated method of DNA isolation from dried aromatic herbs and herbal powders, which could be used as an economic alternative to successfully extract PCR quality DNA from such plant materials. The presented milk protocol was an “improved” protocol compared to the standard CTAB protocol of DNA isolation as shown in this study. Nevertheless we have not compared this protocol with alternative DNA extraction protocols and commercial kits; it would be interesting to see whether it outperforms even the alternative extraction protocols and available commercial kits for plant DNA extraction. In its current format, our method seems to be labor-intensive in terms of time relative to the commercial kits, but it is possible that inclusion of novel step of pretreatment of plant samples with skim milk may even enhance the success rates and efficiency of such kits in future.

This study further present a set of robust matK primers which could be used as plant barcoding resource in future studies. While validating the utility of our method of plant DNA isolation and typing in a brief molecular survey of randomly selected dried herbs obtained from online marketplace and herbal powders obtained from local ayurvedic practitioner, we also noticed the instances of malpractices or mislabeling of dried herbs and herbal powders, which are reported in this study and further seek an urgent attention. This also indicates the urgent need for a global effort to develop a high fidelity reference database of authentic, dried medicinal herbs which are being used in various systems of plants based medicines around the world.