Structure based function-annotation of hypothetical protein MGG_01005 from Magnaporthe oryzae reveals it is the dynein light chain orthologue of dynlt1/3

Magnaporthe oryzae is a model fungal plant pathogen employed for studying plant-fungi interactions. Whole genome sequencing and bioinformatics analyses revealed that this fungal pathogen has more than 12,000 protein-coding genes with 65% of the genes remaining functionally un-annotated. Here, we determine the structure of the hypothetical protein, MGG_01005 and show that it is the Magnaporthe oryzae Dynein light chain Tctex-type 1 (dynlt1/3), demonstrated by its structural similarity to other orthologous dynlt1 proteins and its conserved interaction with the N-terminus of the Magnaporthe oryzae dynein intermediate chain, MoDyn1I2. In addition, we present the structure of the MGG_01005-MoDyn1I2 complex together with mutagenesis studies that reveals a di-histidine motif interaction with a glutamate residue in the dynein intermediate chain within a conserved molecular interface. These results demonstrate the utility of structure-based annotation and validate it as a viable approach for the molecular assignment of hypothetic proteins from phyto-pathogenic fungi.

1 Overall quality at a glance i O The following experimental techniques were used to determine the structure:

X-RAY DIFFRACTION
The reported resolution of this entry is 2.40 Å.
Percentile scores (ranging between 0-100) for global validation metrics of the entry are shown in the following graphic. The table shows the number of entries on which the scores are based.

Metric
Whole archive (#Entries) Similar resolution (#Entries, resolution range(Å)) R The table below summarises the geometric issues observed across the polymeric chains and their t to the electron density. The red, orange, yellow and green segments on the lower bar indicate the fraction of residues that contain outliers for >=3, 2, 1 and 0 types of geometric quality criteria. A grey segment represents the fraction of residues that are not modelled. The numeric value for each fraction is indicated below the corresponding segment, with a dot representing fractions <=5% The upper red bar (where present) indicates the fraction of residues that have poor t to the electron density. The numeric value is given above the bar.
Mol Chain Length Quality of chain In the tables below, the ZeroOcc column contains the number of atoms modelled with zero occupancy, the AltConf column contains the number of residues with at least one atom in alternate conformation and the Trace column contains the number of residues modelled with at most 2 atoms.
Molecule 1  3 Residue-property plots i O These plots are drawn for all protein, RNA and DNA chains in the entry. The rst graphic for a chain summarises the proportions of errors displayed in the second graphic. The second graphic shows the sequence view annotated by issues in geometry and electron density. Residues are colorcoded according to the number of geometric quality criteria for which they contain at least one outlier: green = 0, yellow = 1, orange = 2 and red = 3 or more. A red dot above a residue indicates a poor t to the electron density (RSRZ > 2). Stretches of 2 or more consecutive residues without any outlier are shown as a green connector. Residues present in the sample, but not in the model, are shown in grey.
• Molecule 1: Uncharacterized protein Chain A: There are no bond length outliers.
There are no bond angle outliers.
There are no chirality outliers.

Mol Chain Non-H H(model) H(added) Clashes Symm-Clashes
All (28)  In the following table, the Percentiles column shows the percent Ramachandran outliers of the chain as a percentile score with respect to all X-ray entries followed by that with respect to entries of similar resolution.
The Analysed column shows the number of residues for which the backbone conformation was analysed, and the total number of residues.
Mol Chain Analysed Favoured Allowed Outliers Percentiles There are no Ramachandran outliers to report.

Protein sidechains i O
In the following table, the Percentiles column shows the percent sidechain outliers of the chain as a percentile score with respect to all X-ray entries followed by that with respect to entries of similar resolution.
The Analysed column shows the number of residues for which the sidechain conformation was analysed, and the total number of residues.
Mol Chain Analysed Rotameric Outliers Percentiles  (and percentage) of RSRZ outliers, followed by percent RSRZ outliers for the chain as percentile scores relative to all X-ray entries and entries of similar resolution. The OWAB column contains the minimum, median, 95 th percentile and maximum values of the occupancy-weighted average B-factor per residue. The column labelled`Q< 0.9' lists the number of (and percentage) of residues with an average occupancy less than 0.9.