Evolutionary, computational, and biochemical studies of the salicylaldehyde dehydrogenases in the naphthalene degradation pathway

Jia, Baolei; Jia, Xiaomeng; Kim, Kyung Hyun; Pu, Zhong Ji; Kang, Myung-Suk; Jeon, Che Ok

doi:10.1038/srep43489

Article
Open access
Published: 24 February 2017

Baolei Jia^1,2,
Xiaomeng Jia²,
Kyung Hyun Kim²,
Zhong Ji Pu³,
Myung-Suk Kang⁴ &
…
Che Ok Jeon²

Scientific Reports volume 7, Article number: 43489 (2017)

Abstract

Salicylaldehyde (SAL) dehydrogenase (SALD) is responsible for the oxidation of SAL to salicylate using nicotinamide adenine dinucleotide (NAD⁺) as a cofactor in the naphthalene degradation pathway. We report the use of a protein sequence similarity network to make functional inferences about SALDs. Network and phylogenetic analyses indicated that SALDs and the homologues are present in bacteria and fungi. The key residues in SALDs were analyzed by evolutionary methods and a molecular simulation analysis. The results showed that the catalytic residue is most highly conserved, followed by the residues binding NAD⁺ and then the residues binding SAL. A molecular simulation analysis demonstrated the binding energies of the amino acids to NAD⁺ and/or SAL and showed that a conformational change is induced by binding. A SALD from Alteromonas naphthalenivorans (SALDan) that undergoes trimeric oligomerization was characterized enzymatically. The results showed that SALDan could catalyze the oxidation of a variety of aromatic aldehydes. Site-directed mutagenesis of selected residues binding NAD⁺ and/or SAL affected the enzyme’s catalytic efficiency, but did not eliminate catalysis. Finally, the relationships among the evolution, catalytic mechanism, and functions of SALD are discussed. Taken together, this study provides an expanded understanding of the evolution, functions, and catalytic mechanism of SALD.

Introduction

Naphthalene (C₁₀H₈; CAS number 91-20-3), which is the most abundant polycyclic aromatic hydrocarbon (PAH), is a contaminant that is found environmentally as a constituent of coal tar, crude oil, and cigarette smoke¹. Naphthalene and its substituted derivatives are also used in chemical manufacturing as a chemical intermediate for many commercial products ranging from pesticides to plastics. Humans are exposed to naphthalene through a wide range of mechanisms, resulting in the production of reactive metabolites that deplete glutathione and result in oxidative stress². Based on its abundance and toxicity, naphthalene has been identified as a priority pollutant and a possible human carcinogen by the Environmental Protection Agency of the USA³. As the simplest PAH, naphthalene has been used as a model compound for studies on the metabolism of PAHs by microorganisms⁴.

Chemical, physical, and biological methods have been used for naphthalene remediation⁵. Among these, microbial biodegradation methods have been favored because of their environmental friendliness, effectiveness, and low cost⁶. Bacterial strains isolated from contaminated soil or sediments, such as Pseudomonas spp., Bacillus spp.⁷, Burkholderia spp., Comamonas spp., and Rhodococcus sp.⁶, are some of the best-studied naphthalene-degrading bacteria. Our previous work, based on two years of monitoring, demonstrated that Alteromonas naphthalenivorans is a key biodegrader of PAH in crude oil-contaminated coastal sediment⁸. PAH biodegradation using filamentous fungi (including white rot fungi) such as Phanerochaete chrysosporium, Pleurotus ostreatus, and Trametes versicolor has been reported^9,10. Some Ascomycota fungi such as Fusarium sp. and Aspergillus sp. have also been reported to degrade naphthalene¹¹. The majority of reported naphthalene degradation pathways in bacteria are aerobic and can be divided into two stages: the upper pathway transforms naphthalene to salicylate and the lower pathway converts salicylate to tricarboxylic acid cycle intermediates through meta-cleavage pathway enzymes¹². The fungi metabolize naphthalene with the enzymes lignin peroxidase, manganese peroxidase, laccase, cytochrome P450, and epoxide hydrolase¹³.

During naphthalene degradation in bacteria, salicylaldehyde (SAL) dehydrogenase (EC 1.2.1.65, denoted as SALD) catalyzes the oxidation of SAL to salicylate using NAD⁺ as a cofactor. SALD is considered to be the last enzyme in the upper catabolic pathway and it plays an important role in connecting the upper pathway to the lower catabolic pathway, which leads to the production of tricarboxylic acid cycle intermediates¹². Two genes encoding SALD were discovered in Pseudomonas putida ND6, namely NahV and NagF, which showed a 72% identity to each other in their conserved regions. The corresponding enzymes showed different but overlapping properties, thus ensuring that single gene mutants could survive in naphthalene-containing environments^14,15. The SALD from Pseudomonas sp. strain C6 was found to be a functional homotrimer and showed broad substrate specificity¹⁶. The crystal structure of the SALD from P. putida G7 (SALDpp) was determined and showed α/β folding with three domains, namely the oligomerization, cofactor-binding, and catalytic domains. The SAL was buried in a deep pocket in the structure where the catalytic Cys284 and Glu250 residues were located. The cysteine residue was able to attack the carbonyl carbon of the substrate and the glutamic acid residue functioned as a general base. In addition, the residues Arg157, Gly150, and Trp96 were found to play an important role in determining the specificity of the enzyme toward aromatic and aliphatic aldehydes^17,18.

SALD belongs to the aldehyde dehydrogenase (EC 1.2.1.3) superfamily, the members of which are responsible for the oxidation of a wide variety of aliphatic and aromatic aldehydes to carboxylic acids using nicotinamide adenine dinucleotide (NAD⁺) or nicotinamide adenine dinucleotide phosphate (NADP⁺) as a coenzyme. The overall reaction catalyzed by the aldehyde dehydrogenases is: RCHO + NAD⁺ + H₂O → RCOOH + NADH + H⁺. Our previous work demonstrated that SALD from A. naphthalenivorans (SALDan) was specifically up-regulated in response to naphthalene¹⁹. To gain insights into the function, evolution, and catalytic mechanism of SALD, we performed a comprehensive analysis using a sequence similarity network (SSN), a phylogenetic tree, molecular dynamics (MD), kinetics, and mutation analysis using SALDan (Uniprot AC: F5Z5S7) as a template. SALDan showed 63% identity to SALDpp (Uniprot AC: Q1XGL7). We found that SALD is present in both bacteria and fungi. The catalytic cysteine and the amino acids binding NAD⁺ have been highly conserved during evolution. SALDan can bind NAD⁺ strongly and SAL binding can decrease the binding energy to NAD⁺. We further purified the wild-type and mutant SALDan proteins. Biochemical assays of the enzymes supported the evolutionary and MD analysis results.

Results

Distribution and evolution of SALD

To determine the distribution of SALD in the biosphere, the protein sequences were acquired by searching the UniProt database²⁰ using the SALDan sequence as a query with an e-value threshold of 10⁻⁸⁰ (>40% sequence identity). This threshold was chosen because several studies have shown that sequences that share >40% identity are very likely to share functional similarity, as judged by Enzyme Commission numbers²¹. In total, 2039 proteins were obtained and are listed in Supplementary Dataset 1. To further clarify the distribution of these proteins and their relationships, a network for the 2039 protein sequences was constructed using the Enzyme Function Initiative-Enzyme Similarity Tool²² with an e-value threshold of 10⁻¹⁵⁰ (>60% sequence identity was the cutoff for an edge between nodes [i.e., proteins]). The proteins were mainly separated into 11 groups and each protein was colored according to its taxonomic classification (Fig. 1). Further analysis of the proteins from the NCBI metagenomics protein database (env_nr) showed that the homologues of SALD from environmental samples could be classified into groups 1, 5, and 11 (Supplementary Fig. 1). Members of the enzymes were found in the domain “bacteria” and kingdom “fungi”. In bacteria, at the class level, the proteins were mainly distributed in Actinobacteria (7.19%), Alphaproteobacteria (16.80%), Betaproteobacteria (31.33%), Firmicutes (3.65%), and Gammaproteobacteria (21.58%). In fungi, at the phylum level, the prevalence of SALD genes was 0.55% in Ascomycota and 17.80% in Basidiomycota.

**Figure 1: Taxonomic distribution of SALDs.**

To provide a more detailed view of the evolutionary relationships across the groups, we performed a phylogenetic analysis using the proteins in the clusters assigned based on sequence comparisons (Fig. 2). Proteins from the same cluster always clustered together and were well-separated in the phylogenetic tree, except that clusters 2 and 9 from Ascomycota were clustered in the same branch. The separation of these groups had a high level of bootstrap support in the phylogenetic tree. Meanwhile, the proteins from bacteria were gathered in a clade with a high level of bootstrap support. The proteins from fungi formed separate branches in the phylogenetic tree. Cluster 7 from Ascomycota and cluster 8 from Basidiomycota were much closer to the proteins of bacteria and clusters 2 and 9 formed a distinct branch.

**Figure 2: Maximum likelihood phylogenetic tree for 2039 proteins from bacteria and fungi generated using MEGA.**

Conservation and coevolution of amino acids in SALD sequences

To examine the conservation and coevolution of primary sequences of SALD, we first determined consensus sites based on multiple sequence alignments (MSA) of 2039 sequences, using SALDan as the reference sequence to display the MSA and the conservation of the residues (Fig. 3). The most highly conserved amino acid was found to be Cys284 (Fig. 3A,B), which functions as the active site nucleophile in the dehydrogenase reaction. The other conserved amino acids were predominantly hydrophobic, including Pro147, Asn149, Phe226, Gly228, Gly281, Gln282, and Phe381. Glu250, which may interact with the aldehyde substrate, is also in the list. We further investigated the coevolution of SALD amino acids using mutual information (MI)²³ (Fig. 3A,B). If two residues share a high MI score, they are most likely coevolving, meaning that to maintain a given enzymatic function, a mutation of one residue is linked to a specific compensatory mutation of the other residue^24,25. The MI network for 2039 SALD members revealed that higher MI values (the top 10% of MI values) were evenly distributed across all amino acid positions from the N-terminus to the C-terminus (Fig. 3A). The 11 most conserved residues were chosen for further analysis. These conserved residues formed a connected network, indicating that these residues also shared a significant MI value (Fig. 3B). The strong correspondence observed between the conserved residues and the coevolving residue positions is consistent with previous studies^26,27. Mapping the top coevolving and conserved residues onto the SALD structure illustrated the distances between and communication among the amino acids in this network (Fig. 3C). The mapping of the conserved amino acids revealed that they surround NAD⁺ and SAL. Among those amino acids, Asn149 and Glu250 may bind SAL, while Trp148, Phe226, Gly228, and Phe381 may bind NAD⁺. Thus, we propose here that the conserved and coevolving amino acids in SALD play important roles in catalysis and in substrate and cofactor binding.

**Figure 3: Conserved and coevolved amino acid residues in SALDs represented using SALDan as a reference sequence.**

MD simulation

To explore the potential function of the amino acids in SALD, we first constructed a 3D model of SALDan by homology modeling using the Modeller 9 program²⁸ based on the crystal structures of SALDpp (PDB ID: 4JZ6) and other aldehyde dehydrogenases (PDB ID: 4FR8, 4O6R, 4NMK, 2O2P, and 3PQA). The overall stereochemical parameters for the modeled proteins were measured using G-factor generated by PROCHECK²⁹, which showed that 99.8% of the residues were found in allowed regions of the Ramachandran plot (Supplementary Fig. 2). Moreover, 94.3% of the total amino acids were positioned in the most favored regions of the Ramachandran plot. The model of SALDan was used for the further analysis of substrate binding, the molecular dynamics (MD) simulation, and the calculation of binding free energy.

The complexes of SALD with SAL and/or NAD⁺ were built by superimposing these ligands from the crystal structures of SALDpp complexed with SAL¹⁷ and human aldehyde dehydrogenase complexed with NAD⁺ on that of SALDan³⁰. The complex structure based on the superimposition results was used as the initial structure for MD simulations, which were performed using GROMACS software³¹ to investigate the conformational changes and protein internal motions within a nanosecond timescale for apo-SALDan, SALDan-NAD⁺, SALDan-SAL, and SALDan-NAD⁺-SAL. In the simulation, the root-mean-square deviation (RMSD) is a crucial indicator of the convergence of protein structure changes over the course of a simulation. The backbone RMSD of apo-SALDan equilibrated around 0.28 nm after 20 ns of simulation. The backbone RMSDs of SALDan-NAD⁺, SALDan-SAL, and SALDan-NAD⁺-SAL equilibrated around 0.35 nm over the same time frame, as shown in Fig. 4A. SALDan in complex with its substrate and/or cofactor showed a higher RMSD value than the apoenzyme did. This suggests that substrate or cofactor binding causes a conformational change in SALDan. Based on the RMSD analysis, the first 10 ns of the MD trajectory was discarded and the remaining 30 ns trajectory was used in the production analysis.
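To illustrate the backbone RMSD discussed above, the following is a minimal sketch rather than the GROMACS implementation. It superimposes one coordinate set on a reference with the Kabsch algorithm and reports the residual RMSD. The function name `kabsch_rmsd` and the toy coordinates are hypothetical, and coordinates are assumed to be in nm.

```python
import numpy as np

def kabsch_rmsd(coords, ref):
    """RMSD between two (N, 3) coordinate sets after optimal superposition."""
    # Remove translation by centering both sets on their centroids.
    p = coords - coords.mean(axis=0)
    q = ref - ref.mean(axis=0)
    # Covariance matrix and its SVD give the optimal rotation (Kabsch).
    u, _, vt = np.linalg.svd(p.T @ q)
    # Guard against an improper rotation (reflection).
    d = np.sign(np.linalg.det(vt.T @ u.T))
    rot = vt.T @ np.diag([1.0, 1.0, d]) @ u.T
    p_fit = p @ rot.T
    return float(np.sqrt(((p_fit - q) ** 2).sum(axis=1).mean()))

# Toy check (hypothetical data): a rotated and shifted copy of the
# reference should superimpose almost perfectly.
ref = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0],
                [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]])
theta = np.pi / 3
rot_z = np.array([[np.cos(theta), -np.sin(theta), 0.0],
                  [np.sin(theta),  np.cos(theta), 0.0],
                  [0.0, 0.0, 1.0]])
moved = ref @ rot_z.T + np.array([0.5, -0.2, 1.0])
rmsd_value = kabsch_rmsd(moved, ref)
print(rmsd_value)  # close to zero
```

Applied frame by frame against the starting structure, a function of this kind yields the RMSD-versus-time curve whose plateau is used to choose the equilibration cutoff.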

**Figure 4: Molecular dynamic analysis of SALDan.**

The predicted binding mode of SALDan with NAD⁺ based on the MD simulation is illustrated in Fig. 4B. The MD analyses revealed that SALDan can bind SAL, with the side chains of the Asn149, Gly150, Val153, Leu154, Glu250, Ile283, Met285, Asn440, and Tyr446 residues localized to the active region. The NAD⁺ molecule has numerous interactions with residues in the α/β folds through hydrophobic interactions and hydrogen bonds. Briefly, the adenine dinucleotide part of NAD⁺ is stabilized by Gly208, Glu209, Val212, Phe226, Gly228, Val232, Ile236, Glu379, and Phe381. The dinucleotide also forms hydrogen bonds with Lys172, Glu175, and Asn213. The nicotinamide of NAD⁺ interacts with Trp148 and Asn149 via hydrogen bond formation. Pro147, Leu251, and Gly252 contribute to binding through hydrophobic interactions. Cys284 is positioned close to both NAD⁺ and SAL, implicating it as a potentially important residue.

In contrast to RMSD, which is a global measurement of the protein motion, root-mean-square fluctuation (RMSF) can be used to analyze the flexibility of each residue present in the systems (Fig. 4C). The results of RMSF calculations showed that the four systems followed a similar pattern of fluctuation, and the highest RMSF values were calculated for the amino acids at the C-terminus, which suggested that the C-terminus is the most flexible region of SALDan. However, there are differences in the fluctuation patterns of the complexes and the apo-protein during the 30 ns of simulation. The average RMSF values per residue in the NAD⁺ binding sites for apo-SALDan and the SALDan-NAD⁺ complex were 0.76 and 0.62 Å, respectively. The average RMSF values per residue for the SAL binding sites of apo-SALDan and the SALDan-SAL complex were 0.71 and 0.64 Å, respectively. After binding both NAD⁺ and SAL, the average RMSF values per residue in the NAD⁺ binding sites and SAL binding sites were 0.64 and 0.65 Å, respectively. This indicates that these residues become more rigid after binding to the cofactor and/or substrate. We suggest that this decrease in flexibility allows the protein to undergo the proper conformational change for catalysis.
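The per-residue RMSF and the binding-site averages quoted above can be sketched as follows. This is an illustrative calculation, not the authors' analysis script. It assumes a trajectory that has already been superimposed on a reference and reduced to one position per residue. The array layout, the function name `rmsf_per_residue`, and the toy data are hypothetical.

```python
import numpy as np

def rmsf_per_residue(traj):
    """RMSF for each residue.

    traj: array of shape (n_frames, n_residues, 3), pre-aligned coordinates
    with one representative position per residue (e.g. C-alpha).
    """
    mean_pos = traj.mean(axis=0)                    # time-averaged position
    sq_dev = ((traj - mean_pos) ** 2).sum(axis=2)   # squared displacement per frame
    return np.sqrt(sq_dev.mean(axis=0))             # root of the time average

# Toy trajectory (hypothetical): residue 0 is fixed, residue 1 oscillates
# by +/- 1 unit along x around its mean position.
toy_traj = np.array([
    [[0.0, 0.0, 0.0], [1.0, 0.0, 0.0]],
    [[0.0, 0.0, 0.0], [-1.0, 0.0, 0.0]],
])
rmsf = rmsf_per_residue(toy_traj)
print(rmsf)  # residue 0 -> 0.0, residue 1 -> 1.0

# Averaging over a set of binding-site residue indices (hypothetical indices).
site_indices = [0, 1]
site_mean = float(rmsf[site_indices].mean())
print(site_mean)  # 0.5
```

Comparing `site_mean` between the apo and ligand-bound trajectories is the kind of comparison reported in the paragraph above.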

Free energy analysis for the wild-type complexes

Quantification of the average energy of the interaction between NAD⁺/SAL and the specific residues located in the binding sites could provide further insight into which residues are important for ligand binding. Therefore, ligand-residue interaction decomposition was performed by the molecular mechanics/Poisson–Boltzmann surface area (MM/PBSA) method using the g_mmpbsa package^32,33. The summations of the per-residue interaction free energies were separated into molecular mechanics energy (ΔE_MM), polar binding energy (ΔG_polar), non-polar solvation free energy (ΔG_np), and total contribution (ΔG_total). The binding residues listed in Fig. 4B were selected and the energy contributions from those residues are summarized in Fig. 5. For the SALDan-NAD⁺ complex, Glu175, Glu209, and Glu379 had an appreciable polar binding energy (ΔG_polar) contribution, with ΔG_polar values of −48.21, −29.99, and −43.72 kcal/mol, respectively (Fig. 5). In addition, the residues Trp148, Asn149, Gly208, Val212, Phe226, Val232, and Phe381, which all had ΔG_np values of ≤−6.8 kcal/mol, had strong hydrophobic interactions with the ligand. In the SALDan-SAL complex, Val153, Leu154, Ile283, and Asn440 in the binding cavity had appreciable ΔE_MM values of −3.79, −2.74, −3.38, and −4.34 kcal/mol, respectively, indicating that the binding cavity of SALD binds the substrate effectively. The calculated ΔG_total values for the ligands of the SALDan-NAD⁺ and SALDan-SAL complexes contributed by the selected residues were −82.87 kcal/mol and −9.59 kcal/mol, respectively, which indicated that the protein can bind NAD⁺ much more strongly than SAL. The binding energy of the ligands in the SALDan-NAD⁺-SAL complex was also analyzed. Interestingly, the calculated ΔG_total values for NAD⁺ and SAL in the complex changed to 0.04 kcal/mol and −14.45 kcal/mol, respectively. The dramatic change in the ΔG_total value for NAD⁺ binding after ternary complex formation suggested that a conformational change occurred in the NAD⁺ binding site, which may facilitate NADH release. In contrast, binding to SAL was strengthened after the formation of the ternary complex. Among the binding amino acids, Ile283 and Asn440 made the largest contributions to this strengthening, with ΔG_total values that decreased by 1.74 kcal/mol and 4.37 kcal/mol, respectively. Based on the above data, it would appear that the amino acids in the active site have varying contributions to ligand binding and a conformational change is induced during the binding process.
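For orientation, the energy terms named above can be related as follows. This assumes the usual additive MM/PBSA decomposition, in which the total contribution is the sum of the three listed terms; the text itself does not spell out the equation. The superscripts "binary" and "ternary" are labels introduced here for the two-component and three-component complexes.

$$
\Delta G_{\text{total}} = \Delta E_{\text{MM}} + \Delta G_{\text{polar}} + \Delta G_{\text{np}}
$$

$$
\Delta\Delta G = \Delta G_{\text{total}}^{\text{ternary}} - \Delta G_{\text{total}}^{\text{binary}}
$$

With the values reported above, this gives

$$
\Delta\Delta G_{\text{NAD}^+} = 0.04 - (-82.87) = +82.91\ \text{kcal/mol}, \qquad
\Delta\Delta G_{\text{SAL}} = -14.45 - (-9.59) = -4.86\ \text{kcal/mol},
$$

so the NAD⁺ contribution is almost entirely lost in the ternary complex while the SAL contribution becomes more favorable.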

Purification of SALDan

The gene encoding SALDan was cloned and overexpressed in E. coli and SALDan was purified using an Ni-NTA column (Supplementary Fig. 3). The molecular mass of purified SALDan was approximately 53 kDa. To examine the native structure of SALDan, the purified protein was analyzed by gel filtration chromatography (Fig. 6A) and protein cross-linking (Fig. 6B). One peak was eluted from the gel filtration column at approximately 160 kDa. To determine whether SALDan can form a stable complex under in vitro conditions, cross-linking analysis was performed by incubating SALDan with aldehyde for various periods. The cross-linked products were then separated by SDS-PAGE followed by Coomassie blue staining. The results showed that the subunits of the SALDan proteins are cross-linked to various intermediates corresponding to sizes from monomers to trimers. Upon prolonged incubation, no additional band appeared. These results suggest that SALDan consists of three subunits in its native form.
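The oligomeric state inferred above follows from simple arithmetic on the two reported masses. The snippet below is only a worked check of that ratio, and the variable names are illustrative.

```python
# Reported values from the purification and gel filtration experiments.
subunit_kda = 53.0    # approximate mass of the purified SALDan subunit
native_kda = 160.0    # approximate mass of the gel filtration peak

ratio = native_kda / subunit_kda
n_subunits = round(ratio)
print(f"{ratio:.2f} subunits per native complex, i.e. about {n_subunits}")
```

A ratio close to 3 agrees with the cross-linking result, in which no species larger than a trimer appeared.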

**Figure 6: Oligomerization of SALDan.**

Kinetics of SALDan

The influences of pH and temperature on the activity of purified SALDan were examined. The specific activity of purified SALDan toward SAL was examined in the pH range 4.5–9.0 using a mixture of different buffers including sodium acetate, HEPES, glycine, and sodium phosphate (Supplementary Fig. 4A). The optimum pH for enzyme activity was approximately 7.5, which is well within the pH range (pH 6.0–9.0) that is suitable for the growth of A. naphthalenivorans³⁴. The optimum pH of SALDan was different from that of the SALD of Pseudomonas spp. C6 and G7 (optimal pH 8.0–8.5), which reflects the differences in the environmental conditions of these organisms. SALDan activity was measured at temperatures from 15 to 45 °C (Supplementary Fig. 4B) and was high from 25 to 35 °C. SALDan catalyzes a two-substrate reaction. The K_m values for NAD⁺ and SAL were determined by varying the concentration of one substrate (SAL or NAD⁺) in the presence of a constant concentration of the other substrate. The plot of velocity versus [NAD⁺] showed a typical Michaelis–Menten profile. The K_m and V_max values for NAD⁺ were 39.5 ± 3.2 μM and 94.1 ± 11.5 U/mg, respectively (Table 1). A direct plot of velocity versus [SAL] was not observed to be hyperbolic in nature (as for linear inhibition), but had an unusual shape. The velocity increased with SAL concentration and peaked at 40 μM. The activity was approximately 80% of the maximal velocity when assayed at 200 μM, which is characteristic of substrate inhibition. The K_m and V_max for SAL were calculated to be 3.8 ± 0.5 μM and 49.5 ± 4.5 U/mg, respectively, and the K_i of SALDan was 378.7 ± 45.2 μM (Table 1).
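The substrate inhibition behavior described above can be reproduced with a short calculation. The sketch assumes the standard substrate inhibition rate law, v = V_max[S] / (K_m + [S] + [S]²/K_i); the text does not state which equation was fitted, so this model choice is an assumption. The function names are illustrative.

```python
import math

def substrate_inhibition_rate(s, vmax, km, ki):
    """Rate under the standard substrate inhibition model (assumed here)."""
    return vmax * s / (km + s + s * s / ki)

def optimal_substrate(km, ki):
    """Substrate concentration that maximizes the rate in this model."""
    return math.sqrt(km * ki)

# Reported SAL parameters: V_max in U/mg, K_m and K_i in micromolar.
VMAX, KM, KI = 49.5, 3.8, 378.7

s_opt = optimal_substrate(KM, KI)
v_opt = substrate_inhibition_rate(s_opt, VMAX, KM, KI)
v_200 = substrate_inhibition_rate(200.0, VMAX, KM, KI)
fraction_at_200 = v_200 / v_opt

print(f"optimal [SAL] ~ {s_opt:.1f} uM")                   # about 37.9 uM
print(f"rate at 200 uM / peak rate ~ {fraction_at_200:.2f}")  # about 0.78
```

Under this assumed model, the fitted K_m and K_i predict a velocity maximum near 38 μM and roughly 78% of the peak rate at 200 μM, in line with the observed peak at 40 μM and the approximately 80% activity at 200 μM.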

Table 1 Substrate specificity of the recombinant SALDan.

The substrate specificity of SALDan was investigated using a variety of aldehydes (Table 1, Supplementary Fig. 5). SALDan oxidizes a wide range of aldehydes, especially aromatic aldehydes (SAL, benzaldehyde, chlorobenzaldehyde, nitrobenzaldehyde, and naphthaldehyde). The apparent K_m values for aromatic aldehydes ranged from 2.3 to 6.5 μM. V_max values for aromatic aldehydes also showed a narrow range, from 22.6 to 74.2 U/mg. Finally, the catalytic efficiencies (apparent k_cat/K_m, in s⁻¹·μM⁻¹) for these substrates were all within the same order of magnitude. SALDan catalyzes the oxidation of linear short-chain aliphatic aldehydes with k_cat values from 30.4 to 54.0 s⁻¹. However, SALDan exhibited K_m values toward short-chain aliphatic aldehydes that were much higher than those toward aromatic substrates, so the k_cat/K_m values for short-chain aliphatic aldehydes were far lower than those for aromatic aldehydes.
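To make the link between V_max (U/mg) and k_cat (s⁻¹) explicit, the following sketch shows one common conversion. It assumes that 1 U corresponds to 1 μmol of product per minute and that each 53 kDa subunit carries one active site. Neither assumption is stated in the text above, and the function names are hypothetical.

```python
def kcat_from_vmax(vmax_u_per_mg, subunit_kda):
    """Convert V_max (umol/min/mg) to k_cat (1/s).

    A subunit mass of X kDa equals X mg per umol of subunits, so
    V_max * X is the turnover per minute per active site (assumed one
    active site per subunit).
    """
    return vmax_u_per_mg * subunit_kda / 60.0

def catalytic_efficiency(kcat_per_s, km_um):
    """k_cat/K_m in 1/(s*uM)."""
    return kcat_per_s / km_um

# Worked example with the reported SAL parameters (apparent values).
kcat_sal = kcat_from_vmax(49.5, 53.0)
eff_sal = catalytic_efficiency(kcat_sal, 3.8)
print(f"k_cat ~ {kcat_sal:.1f} 1/s")  # about 43.7 1/s
print(f"k_cat/K_m ~ {eff_sal:.1f} 1/(s*uM)")
```

Because k_cat values vary little across the substrates, differences in K_m dominate the catalytic efficiency, which is why the high K_m values for short-chain aliphatic aldehydes translate into much lower k_cat/K_m.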

Site-directed mutagenesis of SALDan

The amino acid evolution and MD simulation analyses revealed that Asn149 is highly conserved and involved in both NAD⁺ and SAL binding. Additional free energy analysis further demonstrated that Val153 and Glu175 contributed significant free energy to binding SAL and NAD⁺, respectively. These three amino acids were each mutated to alanine to study their functions (Supplementary Fig. 2). The catalytic efficiency of N149A for SAL decreased about threefold compared with that of the wild-type protein (Table 2, Supplementary Fig. 6). In contrast, V153A had no significant effect on the kinetic parameters for either NAD⁺ or SAL. Finally, the mutation of Glu to Ala at position 175 had a remarkable effect on the catalytic activity. The apparent K_m value of E175A for NAD⁺ increased about twofold and the k_cat/K_m toward the cofactor decreased fourfold. The K_m value of the mutant toward SAL also increased and the catalytic efficiency decreased sixfold. These site-directed mutagenesis experiments demonstrated that mutation of the three selected amino acids affected enzyme activity to some extent but did not abolish it.

Table 2 Apparent kinetic parameters of the wild-type and mutant SALDan enzymes.

Discussion

In this study, we first performed a large-scale in silico analysis of SAL dehydrogenases (SALDs), which revealed that SALDs are distributed among both bacteria and fungi. The evolutionary relationship of these enzymes was established by the SSN and phylogenetic tree analyses. Evolutionary studies indicated that the residues that are directly involved in substrate/cofactor binding and catalytic activity are highly conserved and have coevolved. Furthermore, the amino acids in SALD from A. naphthalenivorans (SALDan) that contributed to binding the substrate/cofactor were identified by MD simulation and free energy analysis. Finally, we purified and characterized SALDan, which was found to be capable of oxidizing aromatic aldehydes and had broad substrate specificity. Point mutations of the amino acids binding the substrate/cofactor reduced the catalytic efficiency to varying degrees but did not abolish enzyme activity.

SALD and the homologues were particularly abundant in bacteria (81.65%), followed by fungi (18.35%) (Fig. 1). This may be because the majority of PAH-degrading microorganisms are bacteria and their metabolic mechanisms and pathways have been well-studied³⁵. In fungi, the phylum Ascomycota, the subphylum Mucoromycotina, and the phylum Basidiomycota are the main contributors to PAH degradation in polluted environments³⁶. Our studies also showed that the proteins showing high similarity to SALD existed in the phyla Ascomycota and Basidiomycota of the fungal kingdom. The detailed pathway of PAH degradation in fungi is not clear and several possible degradation pathways have been suggested^37,38. In the PAH degradation process of fungi, fungal enzymes such as lignin peroxidase, laccase, cytochrome P450 monooxygenase, epoxide hydrolases, lipases, protease, and dioxygenases may be involved in oxidizing the PAH compounds³⁷. To our knowledge, the fungal proteins showing high similarity to SALD in bacteria have not been reported or reviewed by the Swiss-Prot database (Fig. 1). Considering that the fungal strains harboring the enzymes may be capable of degrading PAH, we speculate that the SALD-homologous proteins in fungi may also play a role in aromatic hydrocarbon degradation.

The aldehyde dehydrogenase family members contain two conserved sites: a cysteine active site and a glutamic acid active site³⁹. The cysteine residue acts as a nucleophile and attacks the carbonyl carbon of the aldehyde to form an intermediate⁴⁰, while the glutamic acid residue acts as a general base in the hydrolysis of the acyl-enzyme intermediate⁴¹. In the evolutionary analysis of SALD, one cysteine residue was found to be highly conserved, while the Glu residue corresponding to the active site was not as conserved as the cysteine (Fig. 2). This may be because the enzymes have very distinct catalytic properties, even though their subunit structures are similar. Further experiments also showed that site-directed mutagenesis of the conserved glutamic acid residues 209 and 333 to glutamine residues in class 3 human aldehyde dehydrogenase had only marginal effects on enzyme activity⁴². In addition to the catalytic residues, five residues (Pro147, Trp148, Asn149, Gly228, and Phe381) that bind NAD⁺ and one residue (Asn149) that binds SAL were also found to be highly conserved through MD simulation using SALDan as the example (Fig. 2). The ordering of conservation was determined as catalytic residues > NAD⁺-binding residues > SAL-binding residues. Among these residues, Asn149 in SALDan stabilizes both NAD⁺ and SAL. Mutation of Asn149 caused the catalytic efficiency toward both substrates to decrease by 50% (Table 2). Meanwhile, mutation of Glu175, which provides the lowest free energy for binding NAD⁺, increased K_m for NAD⁺ by 5-fold. However, the mutation of Val153, which exhibited the lowest free energy for binding SAL, had a marginal effect on the catalytic parameters. This may be because Val to Ala replacement does not affect the positioning of the amino acids. On the other hand, because SALD and its homologous proteins always show a broad substrate specificity and have a wide pocket to bind the aromatic ring aldehydes, the mutation of one residue will not affect the enzyme activity dramatically.

The aldehyde dehydrogenases exhibit diverse oligomeric states, including dimer, trimer, tetramer, and hexamer⁴³. The aldehyde dehydrogenase from Corynebacterium glutamicum exists in dimeric, trimeric, and tetrameric forms, as revealed by gel filtration chromatography⁴⁴. In the case of SAL dehydrogenase, SALDpp generated a dimeric biological unit, as shown by the determination of its crystal structure⁴⁵. The enzyme from Pseudomonas sp. C6, which shares 67% sequence identity with SALDpp, has a trimeric form based on gel filtration chromatography analysis¹⁶. SALDan was also determined to be trimeric based on gel filtration chromatography and protein cross-linking analyses. Despite having different modes of oligomerization, each monomer can be divided into three domains: a cofactor binding domain composed of a core that resembles the Rossmann fold, a catalytic α/β domain, and a small protruding domain that enables oligomerization^45,46. The oligomerization domains of SALD are located in the β7, β21, and C-terminus regions (Fig. 7). RMSF analysis showed that the highest amount of fluctuation was present in the C-terminus region, which suggested that the oligomerization domain is highly flexible. Besides, this domain is less evolutionarily conserved (Figs 2 and 7). Therefore, we speculate that the diversity of the oligomeric forms of the aldehyde dehydrogenases is caused by both the instability in structure and the low evolutionary conservation of the oligomerization domain.

**Figure 7: Sequence alignments of aldehyde dehydrogenases towards the aromatic aldehydes with broad specificity.**

The substrate specificities of the aldehyde dehydrogenases are determined by their catalytic domains. SALDan, SALDpp, and SALD from Pseudomonas sp. C6 all show activity towards a broad range of aromatic aldehyde substrates. The aldehyde dehydrogenase from Corynebacterium glutamicum, which has three oligomeric forms, catalyzes the oxidization of p-hydroxybenzaldehyde, 3,4-dihydroxybenzaldehyde, o-phthaldialdehyde, cinnamaldehyde, syringaldehyde, and benzaldehyde⁴⁴. The same properties have been reported for the aldehyde dehydrogenases from Geobacillus thermodenitrificans⁴⁷ and Brevibacterium sp. KU1309⁴⁸. Sequence alignments of the enzymes that have a broad substrate specificity showed that the catalytic residues are highly conserved, and the cofactor binding residues and core region of the cofactor binding domain are also well conserved. However, the amino acids that bind the aromatic aldehydes are less conserved than other regions, although all of the amino acids fulfilling this function were hydrophobic (Fig. 7). The property of a low evolutionary conservation can be extended to all SALDs (Fig. 2). The structure of SALDpp highlighted that the dimensions of the substrate binding pocket and its hydrophobic environment allow a broad substrate spectrum. Here, we propose that the evolutionary flexibility of the amino acids in the substrate binding pocket also contributes to the wide variety of substrates of the SALD subfamily.

In conclusion, our results highlight that SALD and its homologous proteins exist in both bacteria and fungi. The key residues for catalysis and cofactor binding have been conserved during evolution, but the residues binding the substrate are less well conserved. The domain for oligomerization with a flexible structure is also less well conserved. Finally, we characterized SALDan as having a broad range of substrates, which may be related to the evolutionary flexibility of the amino acids in the substrate binding pocket.

Materials and Methods

Collection of SALD and construction of SSN

A protein BLAST search was performed using SALDan as the query sequence against the UniProt and env_nr databases with a cut-off e-value of 10⁻⁸⁰ (>40% sequence identity)²⁰. The homologous proteins identified in the two databases are listed in Supplementary Dataset 1 and 2, respectively. An SSN of the homologous proteins obtained via BLAST was constructed using the Enzyme Function Initiative-Enzyme Similarity Tool²² and visualized with Cytoscape 3.3⁴⁹. Each node in the network represents a protein, and each edge indicates that the two connected nodes share significant similarity, with an e-value below the selected cutoff.
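The node and edge rule described above can be sketched in a few lines of Python. This is only an illustration and not the EFI-EST implementation: the function name `build_ssn_edges`, the protein identifiers, and the use of 10⁻⁸⁰ as the default edge cutoff are assumptions made for the example (the text does not state the cutoff selected for the network itself).

```python
def build_ssn_edges(hits, evalue_cutoff=1e-80):
    """Return undirected edges {(protein_a, protein_b): best_evalue}.

    `hits` is an iterable of (query, subject, evalue) tuples, e.g. parsed
    from an all-by-all BLAST table. An edge is kept only when the e-value
    is below the cutoff; self-hits are ignored.
    """
    edges = {}
    for query, subject, evalue in hits:
        if query == subject or evalue >= evalue_cutoff:
            continue
        key = tuple(sorted((query, subject)))  # A-B and B-A are the same edge
        # keep the smallest (most significant) e-value seen for the pair
        edges[key] = min(evalue, edges.get(key, evalue))
    return edges


# Hypothetical hits, for illustration only
hits = [
    ("SALD_A", "SALD_B", 1e-120),
    ("SALD_B", "SALD_A", 1e-118),
    ("SALD_A", "ALDH_X", 1e-40),   # too weak: no edge
    ("SALD_A", "SALD_A", 0.0),     # self-hit: ignored
]
edges = build_ssn_edges(hits)
```

Tightening `evalue_cutoff` removes edges and splits the network into smaller clusters, which is the usual way such a network is explored.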

MSA and coevolving protein residues

MSAs of protein sequences were performed using the ClustalW (version 2) software program⁵⁰. Phylogenetic trees were constructed with MEGA7 using the neighbor-joining method, and a bootstrap test was carried out with 1000 iterations^51,52. Analysis of coevolving residues was carried out by calculating the MI between two positions in the MSA. MI reflects the extent to which knowing the amino acid at one position can predict the amino acid identity at another position. MI was calculated between pairs of columns in the MSA using the MISTIC approach and web server²³.
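As a rough illustration of what MI measures, the following sketch computes the plain (uncorrected) mutual information between two alignment columns. It is not the MISTIC procedure, which may apply additional corrections that are not reproduced here; the function names and the toy alignment are hypothetical.

```python
import math
from collections import Counter


def column(msa, index):
    """Extract one column (a list of residues) from a list of aligned sequences."""
    return [seq[index] for seq in msa]


def mutual_information(col_i, col_j):
    """Plain mutual information (in bits) between two alignment columns."""
    if len(col_i) != len(col_j):
        raise ValueError("columns must have the same number of sequences")
    n = len(col_i)
    count_i = Counter(col_i)
    count_j = Counter(col_j)
    count_ij = Counter(zip(col_i, col_j))
    mi = 0.0
    for (a, b), c in count_ij.items():
        p_ab = c / n
        p_a = count_i[a] / n
        p_b = count_j[b] / n
        mi += p_ab * math.log2(p_ab / (p_a * p_b))
    return mi


# Toy alignment: columns 0 and 1 always change together, column 2 varies independently
msa = ["AKL", "AKV", "GEL", "GEV"]
mi_coupled = mutual_information(column(msa, 0), column(msa, 1))
mi_independent = mutual_information(column(msa, 0), column(msa, 2))
```

In the toy alignment, knowing the residue in column 0 fully determines column 1 (one bit of information) but says nothing about column 2 (zero bits).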

MD simulation and binding free energy calculation

The three-dimensional structure of SALDan was modeled using the Modeller 9 software program²⁸. Structural validation of the enzyme was performed by creating a Ramachandran plot using the PROCHECK server²⁹. The SALDan-SAL complex was built by superimposing the crystal structure of SALDpp (PDB ID: 4JZ6) on that of SALDan, then deleting the protein structure in 4JZ6. The SALDan-NAD⁺ complex was built by superimposing the crystal structure of human aldehyde dehydrogenase (PDB ID: 4FR8) on that of SALDan, then deleting the protein structure in 4FR8. The SALDan-NAD⁺-SAL complex was built by the same approach, superimposing the structure of SALDan-NAD⁺ on that of SALDan-SAL^53,54. Each structure was used as a starting geometry, and parameters for NAD⁺ and SAL were generated with the ACPYPE tool using the general AMBER force field⁵⁵ with AM1-BCC atomic charges. Each system was solvated in a cubic box of explicit TIP3P water molecules with a 10 Å buffer along each dimension. Each complex system was neutralized by adding explicit counter ions (Na⁺/Cl⁻). The MD simulations were performed using the GROMACS 5.0.4 software program with the AMBER force field⁵⁶ on a Linux operating system. To ensure that the system was relaxed and had no serious clashes or unsuitable geometry, the potential energy of the system was first minimized by the steepest descent method. The system was then gently heated by raising the temperature from 0 to 300 K at constant volume under periodic boundary conditions. A velocity-rescaling⁵⁷ thermostat was used for temperature coupling, and a run with a constant number of atoms, volume, and temperature (NVT) was performed for 200 ps at 300 K with a coupling constant of 0.1 ps. A run with a constant number of atoms, pressure, and temperature (NPT) was then performed for 400 ps at 1 bar with a coupling constant of 0.1 ps. The isotropic Berendsen protocol was used for pressure coupling. Long-range electrostatic effects were modeled using the particle-mesh-Ewald method⁵⁸. A 12 Å cutoff was applied to Lennard–Jones and electrostatic interactions. Bonds involving hydrogen atoms were constrained using the LINCS algorithm⁵⁹. Afterwards, the production MD simulation was run for 40 ns at constant temperature and pressure. The gmx rmsd and gmx rmsf programs in GROMACS 5.0.4 were used to obtain the RMSD and RMSF, respectively³¹.
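For readers unfamiliar with the two trajectory metrics, the definitions can be sketched as follows. This is a simplified illustration and not what the GROMACS tools do internally: it assumes the frames are already superimposed on the reference, uses no mass weighting, and the function names and coordinates are hypothetical.

```python
import math


def rmsd(coords, reference):
    """Root-mean-square deviation between two sets of (x, y, z) coordinates.

    Assumes both structures are already superimposed (no fitting is done).
    """
    total = 0.0
    for (x, y, z), (rx, ry, rz) in zip(coords, reference):
        total += (x - rx) ** 2 + (y - ry) ** 2 + (z - rz) ** 2
    return math.sqrt(total / len(reference))


def rmsf(frames):
    """Per-atom root-mean-square fluctuation around the time-averaged position."""
    n_frames = len(frames)
    n_atoms = len(frames[0])
    result = []
    for atom in range(n_atoms):
        mean = [sum(f[atom][k] for f in frames) / n_frames for k in range(3)]
        msf = sum(
            sum((f[atom][k] - mean[k]) ** 2 for k in range(3)) for f in frames
        ) / n_frames
        result.append(math.sqrt(msf))
    return result


# Two toy frames with two atoms each (arbitrary units)
frames = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(0.0, 0.0, 0.0), (1.0, 2.0, 0.0)],
]
rmsd_value = rmsd(frames[1], frames[0])
rmsf_values = rmsf(frames)
```

RMSD summarizes how far a whole structure has drifted from a reference at one time point, whereas RMSF reports how much each atom or residue moves over the whole trajectory.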

Calculation of binding free energy

Two hundred and fifty snapshots were extracted from the last 25 ns of the MD trajectory at an interval of 100 ps. The MM/PBSA method was performed using the g_mmpbsa package^32,33 to calculate the binding free energy of the enzyme and substrate. The MM/PBSA method can be conceptually summarized as three energetic terms:

$$\Delta G_{\mathrm{total}} = \Delta E_{\mathrm{MM}} + \Delta G_{\mathrm{sol}} - T\Delta S$$

where ΔG_total denotes the binding free energy, ΔE_MM denotes the difference in molecular mechanics energy between the complex and each binding partner in a vacuum, ΔG_sol denotes the solvation free energy, and TΔS denotes the entropy change. ΔE_MM can be further divided into two parts:

$$\Delta E_{\mathrm{MM}} = \Delta E_{\mathrm{ele}} + \Delta E_{\mathrm{vdw}}$$

where ΔE_ele and ΔE_vdw denote the electrostatic interaction and van der Waals energy in a vacuum, respectively. In addition, the solvation free energy can also be divided into two parts:

$$\Delta G_{\mathrm{sol}} = \Delta G_{\mathrm{polar}} + \Delta G_{\mathrm{np}}$$

where ΔG_polar and ΔG_np denote the polar and non-polar solvation free energies, respectively. For ΔG_polar, the dielectric constants of the solute and solvent were set to 2.0 and 80.0, respectively, in our calculations. For ΔG_np, the values of coefficients γ and β were set to 0.0054 kcal/mol/Å² and 0.92 kcal/mol, respectively.

The entropy change (TΔS) arises from changes in the translational, rotational, and vibrational degrees of freedom. The calculation of entropy change is extremely time-consuming and inaccurate, and for similar protein-inhibitor complex systems the entropy change is similar⁶⁰. Therefore, the entropy change was not calculated in our study.
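The bookkeeping behind the energy terms above can be sketched as a short calculation. This is not the g_mmpbsa code: it assumes the usual linear relation between the non-polar solvation energy of each species and its solvent-accessible surface area (G_np = γ·SASA + β), it omits the entropy term as described in the text, and every function name and input number is hypothetical.

```python
GAMMA = 0.0054  # kcal/mol/Å², value given in the text
BETA = 0.92     # kcal/mol, value given in the text


def g_nonpolar(sasa):
    """Non-polar solvation energy of one species (assumed linear SASA model)."""
    return GAMMA * sasa + BETA


def binding_free_energy(d_e_ele, d_e_vdw, d_g_polar,
                        sasa_complex, sasa_protein, sasa_ligand):
    """Binding free energy for one snapshot, with the -TΔS term omitted."""
    d_e_mm = d_e_ele + d_e_vdw
    d_g_np = (g_nonpolar(sasa_complex)
              - g_nonpolar(sasa_protein)
              - g_nonpolar(sasa_ligand))
    d_g_sol = d_g_polar + d_g_np
    return d_e_mm + d_g_sol


def average(values):
    """Mean over snapshots."""
    values = list(values)
    return sum(values) / len(values)


# Number of snapshots implied by the sampling scheme in the text
n_snapshots = (25 * 1000) // 100  # last 25 ns, one frame every 100 ps

# Hypothetical per-snapshot inputs (kcal/mol and Å²), for illustration only
dg_one = binding_free_energy(-10.0, -20.0, 15.0, 1000.0, 900.0, 300.0)
dg_mean = average([dg_one, dg_one - 1.0, dg_one + 1.0])
```

Because the constant β appears once for the complex and once for each partner, it contributes −β to the binding term, while γ scales the surface area buried on binding.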

Cloning and site-directed mutagenesis of SALDan

The SALDan gene was amplified by PCR from A. naphthalenivorans genomic DNA using the following oligonucleotide primers: forward: 5′-CG GGATCC ATG AAT AAT CAA GAA CTC TTA AGC-3′ and reverse: 5′-CCC AAGCTT TCA TAT CGGA TAT ATC AGAG AGT T-3′ (the primers carry the restriction enzyme sites for BamHI, GGATCC, and HindIII, AAGCTT). The PCR product and the pET28a vector were digested with these two restriction enzymes. The ligation products were transformed into Escherichia coli BL21 (DE3) cells by electroporation and confirmed by sequencing.

The primers used for the single amino acid mutants were as follows: N149A, forward primer, 5′-ATA GCA CCG TGG GCC GGG CCT ATT GTA TTA-3′; reverse primer, 5′-TAA TAC AAT AGG CCC GGC CCA CGG TGC TAT-3′; V153A, forward primer, 5′-AAT GGG CCT ATT GTA TTA GCG GCT CGG-3′; reverse primer, 5′-CCG AGC CGC TAA TAC AAT AGG CCC ATT-3′; E175A, forward primer, 5′-TTTAAAGCTTCAGAAGTAAGTCCTAAA-3′; reverse primer, 5′-GTT GGG ACA GTC TTC CAG CCC-3′; E175A, forward primer, 5′-ATG CTC CAC CAA ATT GGG CAC-3′; reverse primer, 5′-TTT AGG ACT TAC TTC TGA AGC TTT AAA-3′. The PCR was performed using Pfu polymerase, and the cycling parameters were: 95 °C for 5 min (one cycle), 95 °C for 30 s, and 68 °C for 12 min (12 cycles). After amplification, the PCR mixture was digested with DpnI and then transformed into E. coli BL21(DE3) by electroporation⁶¹. The mutants were confirmed by DNA sequencing.
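Mutagenic primer pairs of this kind are normally exact reverse complements of each other, which is easy to check. The sketch below is only a convenience check on the sequences as printed above, not part of the authors' protocol; the function names and dictionary keys are hypothetical. The third entry pairs the first forward and the last reverse sequence printed under the E175A label, which turn out to be complementary as listed.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean(seq):
    """Remove the spaces used to group codons and normalize case."""
    return seq.replace(" ", "").upper()


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return clean(seq).translate(COMPLEMENT)[::-1]


def is_complementary_pair(forward, reverse):
    """True when the reverse primer is the exact reverse complement of the forward primer."""
    return reverse_complement(forward) == clean(reverse)


# Sequences copied from the text (5' to 3')
primer_pairs = {
    "N149A": ("ATA GCA CCG TGG GCC GGG CCT ATT GTA TTA",
              "TAA TAC AAT AGG CCC GGC CCA CGG TGC TAT"),
    "V153A": ("AAT GGG CCT ATT GTA TTA GCG GCT CGG",
              "CCG AGC CGC TAA TAC AAT AGG CCC ATT"),
    "E175A (first forward, last reverse as listed)": (
        "TTTAAAGCTTCAGAAGTAAGTCCTAAA",
        "TTT AGG ACT TAC TTC TGA AGC TTT AAA"),
}
pair_check = {name: is_complementary_pair(f, r) for name, (f, r) in primer_pairs.items()}
```

A check like this is a quick way to catch transcription errors in a primer list before ordering oligonucleotides.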

Expression and purification of SALDan

E. coli BL21(DE3) cells containing the pET28a-SALDan plasmid were cultured in 2 L of LB broth containing kanamycin (30 μg·mL⁻¹) at 37 °C for 3 h. When the OD₆₀₀ reached 0.7, isopropyl-β-d-thiogalactopyranoside was added to a final concentration of 1 mM to induce protein expression. After 4 h of culture with shaking, the cells were harvested by centrifugation for 10 min at 4 °C⁶². The cell pellets were resuspended in lysis buffer containing 50 mM Tris-HCl (pH 8.0), 300 mM NaCl, 20 mM 2-mercaptoethanol, and 20 mM imidazole. The cell suspension was sonicated, and the supernatant was collected following centrifugation and loaded onto a Ni-NTA column. After washing the column with lysis buffer, SALDan was eluted using an imidazole gradient (50–250 mM). The purified SALDan was visualized after separation by 12% sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE). After dialysis against 50 mM HEPES buffer (pH 8.0) containing 150 mM NaCl and 20 mM 2-mercaptoethanol to remove metal ions, the purified proteins were stored at −80 °C. Protein concentrations were estimated by the Bradford method using bovine serum albumin as a standard⁶³.

Effects of temperature and pH on SALDan activity

The activity of the purified proteins was determined using SAL as a substrate. A reaction volume of 500 μL was prepared in a microfuge tube containing purified SALDan (0.1 μM), SAL (100 μM), and NAD⁺ (100 μM) dissolved in 50 mM HEPES (pH 7.5) containing 150 mM NaCl. The SALDan activity was monitored spectrophotometrically as the rate of appearance of NADH at 340 nm. The specific activity of the enzyme (units·mg⁻¹·min⁻¹) was expressed as the amount of enzyme required to produce 1 μM of NADH under the assay conditions. The influence of pH on the activity was determined using the protocol described above, with the exception of replacing the Tris-HCl buffer with 50 mM sodium acetate (pH 3.0–5.0), 50 mM 2-(N-morpholino)ethanesulfonic acid (pH 5.0–7.5), 50 mM HEPES (pH 8.0–8.5), 50 mM glycine (pH 9.0–10.0), or 50 mM sodium phosphate (pH 11.0)⁵². All assays were performed at the optimal temperature. To determine the influence of temperature on the enzymatic activity, the reactions were performed at 15, 20, 25, 30, 35, 40, and 45 °C.
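Converting an absorbance slope at 340 nm into a rate of NADH formation follows from the Beer-Lambert law. The sketch below assumes the commonly used molar absorption coefficient of NADH at 340 nm (6220 M⁻¹ cm⁻¹) and a 1 cm path length; neither value is stated in the text, and the function name and input slope are hypothetical.

```python
EPSILON_NADH_340 = 6220.0  # M^-1 cm^-1, assumed literature value (not given in the text)


def nadh_rate_um_per_min(delta_a340_per_min, path_cm=1.0, epsilon=EPSILON_NADH_340):
    """Rate of NADH formation in µM/min from the absorbance slope at 340 nm.

    Beer-Lambert: A = epsilon * c * l, so dc/dt = (dA/dt) / (epsilon * l).
    """
    molar_per_min = delta_a340_per_min / (epsilon * path_cm)
    return molar_per_min * 1e6  # convert M to µM


# Hypothetical slope of 0.0622 absorbance units per minute
rate = nadh_rate_um_per_min(0.0622)
```

With these assumptions, a slope of 0.0622 absorbance units per minute corresponds to roughly 10 µM of NADH formed per minute.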

Kinetics assay of SALDan

For kinetic studies of aldehydes, the initial velocities of the enzymatic reactions were examined by varying the concentrations of various aromatic as well as aliphatic aldehydes in the presence of NAD⁺ (100 μM) and the enzyme (0.1 μM) under optimal conditions. For substrates that absorb at 340 nm, thus interfering with the calculation of the initial rate, their molar coefficients (ε) were calculated as described previously⁶⁴ and are presented in Table S1. The apparent kinetic parameters were calculated using the equation:

The apparent kinetic parameters for NAD⁺ were determined at a fixed concentration of SAL (50 μM) and a varying concentration of NAD⁺ (0–500 μM). The apparent K_m and V_max values were calculated by using the equation:

All the activity data were determined by three separate experiments with at least three technical replicates.
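The equations used for fitting are not reproduced in this text. As a minimal sketch, assuming the standard Michaelis-Menten form v = V_max[S]/(K_m + [S]), apparent parameters can be estimated from initial-rate data with a Hanes-Woolf linearization. The function names and the synthetic data are hypothetical, and nonlinear regression is generally preferred for real, noisy measurements.

```python
def michaelis_menten(s, vmax, km):
    """Initial velocity predicted by the Michaelis-Menten equation."""
    return vmax * s / (km + s)


def fit_hanes_woolf(s_values, v_values):
    """Estimate (Vmax, Km) from the Hanes-Woolf plot.

    [S]/v = [S]/Vmax + Km/Vmax is linear in [S]; ordinary least squares
    gives slope = 1/Vmax and intercept = Km/Vmax. Points with [S] = 0
    (v = 0) must be excluded before calling this function.
    """
    xs = list(s_values)
    ys = [s / v for s, v in zip(s_values, v_values)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    vmax = 1.0 / slope
    km = intercept * vmax
    return vmax, km


# Synthetic, noise-free data (µM and µM/min), for illustration only
substrate_um = [5.0, 10.0, 25.0, 50.0, 100.0, 200.0]
rates = [michaelis_menten(s, vmax=2.0, km=25.0) for s in substrate_um]
vmax_fit, km_fit = fit_hanes_woolf(substrate_um, rates)

# Turnover number, using the 0.1 µM enzyme concentration stated in the text
kcat_per_min = vmax_fit / 0.1
```

On noise-free data the linearization recovers the input parameters exactly; on experimental data it distorts the error structure, which is why a direct nonlinear fit is usually the better choice.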

Additional Information

How to cite this article: Jia, B. et al. Evolutionary, computational, and biochemical studies of the salicylaldehyde dehydrogenases in the naphthalene degradation pathway. Sci. Rep. 7, 43489; doi: 10.1038/srep43489 (2017).

Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

Schreiner, C. Genetic toxicity of naphthalene: a review. J. Toxicol. Env. Heal. B, Part B 6, 161–183 (2003).
Stohs, S. J., Ohia, S. & Bagchi, D. Naphthalene toxicity and antioxidant nutrients. Toxicology 180, 97–105 (2002).
Preuss, R., Angerer, J. & Drexler, H. Naphthalene—an environmental and occupational toxicant. Int. Arch. Occup. Environ. Health 76, 556–576 (2003).
Brusick, D. Critical assessment of the genetic toxicity of naphthalene. Regulatory Toxicol. Pharmacol. 51, 37–42 (2008).
Gan, S., Lau, E. V. & Ng, H. K. Remediation of soils contaminated with polycyclic aromatic hydrocarbons (PAHs). J. Hazard Mater. 172, 532–549 (2009).
Haritash, A. K. & Kaushik, C. P. Biodegradation aspects of polycyclic aromatic hydrocarbons (PAHs): A review. J. Hazard Mater. 169, 1–15 (2009).
Annweiler, E. et al. Naphthalene degradation and incorporation of naphthalene-derived carbon into biomass by the thermophile Bacillus thermoleovorans. Appl. Environ. Microbiol. 66, 518–523 (2000).
Jin, H. M., Kim, J. M., Lee, H. J., Madsen, E. L. & Jeon, C. O. Alteromonas as a key agent of polycyclic aromatic hydrocarbon biodegradation in crude oil-contaminated coastal sediment. Environ. Sci. Technol. 46, 7731–7740 (2012).
Mollea, C., Bosco, F. & Ruggeri, B. Fungal biodegradation of naphthalene: microcosms studies. Chemosphere 60, 636–643 (2005).
Mao, J. & Guan, W. Fungal degradation of polycyclic aromatic hydrocarbons (PAHs) by Scopulariopsis brevicaulis and its application in bioremediation of PAH-contaminated soil. Acta. Agric. Scand. Sect. B Soil Plant Sci. 66, 399–405 (2016).
Aranda, E. Promising approaches towards biotransformation of polycyclic aromatic hydrocarbons with Ascomycota fungi. Curr. Opin. Biotechnol. 38, 1–8 (2016).
Suenaga, H. et al. Novel organization of aromatic degradation pathway genes in a microbial community as revealed by metagenomic analysis. ISME J. 3, 1335–1348 (2009).
Cerniglia, C. E. & Sutherland, J. B. In Handbook of Hydrocarbon and Lipid Microbiology (ed. Kenneth, N. T.) 2079–2110 (Springer Berlin Heidelberg, 2010).
Zhao, H., Li, Y., Chen, W. & Cai, B. A novel salicylaldehyde dehydrogenase-NahV involved in catabolism of naphthalene from Pseudomonas putida ND6. Chin. Sci. Bull. 52, 1942–1948 (2007).
Li, S., Li, X., Zhao, H. & Cai, B. Physiological role of the novel salicylaldehyde dehydrogenase NahV in mineralization of naphthalene by Pseudomonas putida ND6. Microbiol. Res. 166, 643–653 (2011).
Singh, R., Trivedi, V. D. & Phale, P. S. Purification and characterization of NAD⁺-dependent salicylaldehyde dehydrogenase from carbaryl-degrading Pseudomonas sp. strain C6. Appl. Biochem. Biotechnol. 172, 806–819 (2014).
Coitinho, J. B., Costa, D. M., Guimaraes, S. L., de Goes, A. M. & Nagem, R. A. Expression, purification and preliminary crystallographic studies of NahF, a salicylaldehyde dehydrogenase from Pseudomonas putida G7 involved in naphthalene degradation. Acta. Crystallogr. F Struct. Biol. Commun. 68, 93–97 (2012).
Gullo, M., Caggia, C., De Vero, L. & Giudici, P. Characterization of acetic acid bacteria in “traditional balsamic vinegar”. Int. J. Food Microbiol. 106, 209–212 (2006).
Jin, H. M. et al. Genome-wide transcriptional responses of Alteromonas naphthalenivorans SN2 to contaminated seawater and marine tidal flat sediment. Sci. Rep. 6, 21796 (2016).
UniProt Consortium. UniProt: a hub for protein information. Nucleic Acids Res 43, D204–D212 (2015).
Pearson, W. R. In Current Protocols in Bioinformatics (ed. Andreas, D. B. et al.) Chapter 3, Unit 3.1 (2013).
Gerlt, J. A. et al. Enzyme Function Initiative-Enzyme Similarity Tool (EFI-EST): A web tool for generating protein sequence similarity networks. Biochim. Biophys. Acta. 1854, 1019–1037 (2015).
Simonetti, F. L., Teppa, E., Chernomoretz, A., Nielsen, M. & Marino Buslje, C. MISTIC: Mutual information server to infer coevolution. Nucleic Acids Res. 41, W8–14 (2013).
Jia, B., Jia, X., Kim, K. H. & Jeon, C. O. Integrative view of 2-oxoglutarate/Fe(II)-dependent oxygenase diversity and functions in bacteria. BBA-Gen. Subjects 1861, 323–334 (2017).
Petit, D. et al. Integrative view of α2,3-sialyltransferases (ST3Gal) molecular and functional evolution in deuterostomes: significance of lineage specific losses. Mol. Biol. Evol. 32, 906–927 (2014).
Tse, A. & Verkhivker, G. M. Molecular determinants underlying binding specificities of the ABL kinase inhibitors: combining alanine scanning of binding hot spots with network analysis of residue interactions and coevolution. PLoS One 10, e0130203 (2015).
Yeang, C. H. & Haussler, D. Detecting coevolution in and among protein domains. PLoS Comput. Biol. 3, e211 (2007).
Webb, B. & Sali, A. In Protein Structure Prediction (ed. Daisuke Kihara) 1–15 (Springer New York, 2014).
Laskowski, R. A., MacArthur, M. W., Moss, D. S. & Thornton, J. M. PROCHECK: a program to check the stereochemical quality of protein structures. J. Appl. Cryst. 26, 283–291 (1993).
Lang, B. S. et al. Vascular bioactivation of nitroglycerin by aldehyde dehydrogenase-2: reaction intermediates revealed by crystallography and mass spectrometry. J. Biol. Chem. 287, 38124–38134 (2012).
Abraham, M. J. et al. GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers. SoftwareX 1–2, 19–25 (2015).
Genheden, S. & Ryde, U. The MM/PBSA and MM/GBSA methods to estimate ligand-binding affinities. Expert Opin. Drug Discov. 10, 449–461 (2015).
Kumari, R., Kumar, R. & Lynn, A. g_mmpbsa—A GROMACS tool for high-throughput mm-pbsa calculations. J. Chem. Inf. Model. 54, 1951–1962 (2014).
Jin, H. M., Kim, K. H. & Jeon, C. O. Alteromonas naphthalenivorans sp. nov., a polycyclic aromatic hydrocarbon-degrading bacterium isolated from tidal-flat sediment. Int. J. Syst. Evol. Microbiol. 65, 4208–4214 (2015).
Seo, J.-S., Keum, Y.-S. & Li, Q. X. Bacterial degradation of aromatic compounds. Int. J. Environ. Res. Public Health 6, 278–309 (2009).
Aranda, E. Promising approaches towards biotransformation of polycyclic aromatic hydrocarbons with Ascomycota fungi. Curr. Opin. Biotechnol. 38, 1–8 (2016).
Kadri, T. et al. Biodegradation of polycyclic aromatic hydrocarbons (PAHs) by fungal enzymes: A review. J. Environ. Sci. 10.1016/j.jes.2016.08.023 (2016).
Marco-Urrea, E., García-Romera, I. & Aranda, E. Potential of non-ligninolytic fungi in bioremediation of chlorinated and polycyclic aromatic hydrocarbons. New Biotechnol. 32, 620–628 (2015).
Chang, H.-Y. & Mitchell, A. Dionysian mysteries-the aldehyde dehydrogenase (ALDH) family. Interpro Protein Focus http://interprodb.blogspot.kr/2014/05/dionysian-mysteries-aldehyde.html (2014).
Zhao, C. et al. Identification and characterization of aldehyde dehydrogenase 9 from Lampetra japonica and its protective role against cytotoxicity. Comp. Biochem. Physiol. B Biochem. Mol. Biol. 187, 102–109 (2015).
Di Costanzo, L., Gomez, G. A. & Christianson, D. W. Crystal structure of lactaldehyde dehydrogenase from Escherichia coli and inferences regarding substrate and cofactor specificity. J. Mol. Biol. 366, 481–493 (2007).
Mann, C. J. & Weiner, H. Differences in the roles of conserved glutamic acid residues in the active site of human class 3 and class 2 aldehyde dehydrogenases. Protein Sci. 8, 1922–1929 (1999).
Tanner, J. J. SAXS fingerprints of aldehyde dehydrogenase oligomers. Data Brief 5, 745–751 (2015).
Ding, W. et al. Functional characterization of a vanillin dehydrogenase in Corynebacterium glutamicum. Sci. Rep. 5, 8044 (2015).
Coitinho, J. B. et al. Structural and kinetic properties of the aldehyde dehydrogenase NahF, a broad substrate specificity enzyme for aldehyde oxidation. Biochemistry 55, 5453–5463 (2016).
Cobessi, D. et al. Apo and holo crystal structures of an NADP-dependent aldehyde dehydrogenase from Streptococcus mutans. J. Mol. Biol. 290, 161–173 (1999).
Li, X. et al. Characterization of a broad-range aldehyde dehydrogenase involved in alkane degradation in Geobacillus thermodenitrificans NG80-2. Microbiol. Res. 165, 706–712 (2010).
Hirano, J., Miyamoto, K. & Ohta, H. Purification and characterization of aldehyde dehydrogenase with a broad substrate specificity originated from 2-phenylethanol-assimilating Brevibacterium sp. KU1309. Appl. Microbiol. Biotechnol. 76, 357–363 (2007).
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Larkin, M. A. et al. Clustal W and Clustal X version 2.0. Bioinformatics 23, 2947–2948 (2007).
Kumar, S., Stecher, G. & Tamura, K. MEGA7: Molecular Evolutionary Genetics Analysis version 7.0 for bigger datasets. Mol. Biol. Evol. 33, 1870–1874 (2016).
Jia, B. et al. A zinc-dependent protease AMZ-tk from a thermophilic archaeon is a new member of the archaemetzincin protein family. Front. Microbiol. 6, 1380 (2015).
Zhang, Y., Zheng, Q., Zhang, J. & Zhang, H. Insights into the epimerization activities of RaCE and pAGE: the quantum mechanics/molecular mechanics simulations. RSC Adv. 5, 102284–102293 (2015).
Kufareva, I. & Abagyan, R. Methods of protein structure comparison. Methods Mol. Biol. 857, 231–257 (2012).
da Silva, A. W. S. & Vranken, W. F. ACPYPE-Antechamber python parser interface. BMC Res. Notes 5, 1 (2012).
Lindorff‐Larsen, K. et al. Improved side-chain torsion potentials for the Amber ff99SB protein force field. Proteins 78, 1950–1958 (2010).
Bussi, G., Donadio, D. & Parrinello, M. Canonical sampling through velocity rescaling. J. Chem. Phys. 126, 014101 (2007).
Darden, T., York, D. & Pedersen, L. Particle mesh Ewald: An N⋅log(N) method for Ewald sums in large systems. J. Chem. Phys. 98, 10089–10092 (1993).
Hess, B., Bekker, H., Berendsen, H. J. & Fraaije, J. G. LINCS: a linear constraint solver for molecular simulations. J. Comput. Chem. 18, 1463–1472 (1997).
Homeyer, N. & Gohlke, H. Free energy calculations by the molecular mechanics poisson−boltzmann surface area method. Mol. Inform. 31, 114–122 (2012).
Guan, Q. et al. Cloning, purification and biochemical characterisation of an organic solvent-, detergent-, and thermo-stable amylopullulanase from Thermococcus kodakarensis KOD1. Process Biochem. 48, 878–884 (2013).
Jia, B. & Jeon, C. O. High-throughput recombinant protein expression in Escherichia coli: current status and future perspectives. Open Biol. 6, 160196 (2016).
Bradford, M. M. A rapid and sensitive method for the quantitation of microgram quantities of protein utilizing the principle of protein-dye binding. Anal. Biochem. 72, 248–254 (1976).
MacKintosh, R. W. & Fewson, C. A. Benzyl alcohol dehydrogenase and benzaldehyde dehydrogenase II from Acinetobacter calcoaceticus. Purification and preliminary characterization. Biochem. J. 250, 743–751 (1988).

Acknowledgements

This work was supported by grants from the National Institute of Biological Resources (NIBR No. 2015-02-066) of the Ministry of Environment and the Strategic Initiative for Microbiomes in Agriculture and Food of the Ministry of Agriculture, Food and Rural Affairs as part of the multi-ministerial Genome Technology to Business Translation Program, Republic of Korea.

Author information

Authors and Affiliations

School of Bioengineering, Qilu University of Technology, Jinan, 250353, China
Baolei Jia
Department of Life Science, Chung-Ang University, Seoul, 06974, Republic of Korea
Baolei Jia, Xiaomeng Jia, Kyung Hyun Kim & Che Ok Jeon
School of Life Science and Biotechnology, Dalian University of Technology, Dalian, 116024, China
Zhong Ji Pu
Microorganism Resources Division, National Institute of Biological Resources, Incheon, 22689, Republic of Korea
Myung-Suk Kang

Contributions

B.J. and C.J. designed the research; B.J. performed main experiment and data analysis. X.J., K.H.K., Z.J.P., and M.S.K. conducted experiments, analyzed data, provided key reagents, and revised the manuscript; C.J. supervised the study and obtained funding. All authors read and approved the final version of the manuscript.

Corresponding authors

Correspondence to Baolei Jia or Che Ok Jeon.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Table and Figures (PDF 540 kb)

Supplementary Dataset 1 and 2 (XLS 206 kb)

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

About this article

Received: 14 November 2016
Accepted: 24 January 2017
Published: 24 February 2017
DOI: https://doi.org/10.1038/srep43489
