Академический Документы
Профессиональный Документы
Культура Документы
ScienceDirect
RESEARCH ARTICLE
State Key Laboratory of Crop Stress Biology for Arid Areas, College of Plant Protection, Northwest A&F University, Yangling
712100, P.R.China
Abstract
Protein kinases play an important role in every aspect of cellular life. In this study, we systemically identified protein
kinases from the predicted proteomes of 59 representative fungi from Ascomycota and Basidiomycota. Comparative
analysis revealed that fungi from Ascomycota and Basidiomycota differed in the number and variety of protein kinases.
Some groups of protein kinases, such as calmodulin/calcium regulated kinases (CMGC) and those with the highest group
percentages are the most prevalent protein kinases among all fungal species tested. In contrast, the STE group (homologs
of the yeast STE7, STE11 and STE20 genes), was more abundant in Basidiomycetes than in Ascomycetes. Importantly,
the distribution of some protein kinase families appeared to be subphylum-specific. The tyrosine kinase-like (TKL) group
had a higher protein kinase density in Agaricomycotina fungi. In addition, the distribution of accessory domains, which could
have functional implications, demonstrated that usage bias varied between the two phyla. Principal component analysis
revealed a divergence between the main functional domains and associated domains in fungi. This study provides novel
insights into the variety and expansion of fungal protein kinases between Ascomycota and Basidiomycota.
information for understanding how Ascomycota and stage. Lastly, the important role of PKs in cell life was not
Basidiomycota fungi evolved. emphasized and the relationship of PKs to the life cycle of
Protein kinases (PKs) constitute a key class of enzymes fungi was not explored. As many fungal genomes have been
and regulate numerous cellular processes, including sequenced and new tools are available to classify PKs, it is
mitosis, communication, differentiation, metabolism, and necessary to analyze fungal PKs with new methods.
transcription (Hanks and Hunter 1995; Cohen 2000). In this study, we identified and compared the full
By adding a phosphate group from ATP/GTP to the side repertoires of PKs from 59 representative fungal species
chains of target proteins, PKs often profoundly alter the from Ascomycota and Basidiomycota using Kinannote.
biological activity of the target molecules (LaRonde-LeBlanc All 59 fungi can be classified into six sub-phyla, as
and Wlodawer 2005; Wuichet et al. 2010). All PKs can demonstrated by Hibbett et al. (2007). The six sub-phyla
be classified into two super-families: 1) eukaryotic or are Pucciniomycetes, Ustilaginomycetes, Agaricomycotina,
conventional protein kinases (ePKs), which are the most Pezizomycetes, Saccharomycetes and Taphrinomycetes,
prevalent and have a common ancestry; and 2) atypical and the evolutionary stages of these sub-phyla are
protein kinases (aPKs) (Hanks and Hunter 1995). ePKs mentioned in order. The 59 fungi PKs were classified into
have two functional domains, a catalytic domain which binds 10 groups and 119 families/sub-families using different
and phosphorylates target proteins, and a regulatory region. strategies. In addition, we identified all domains associated
The catalytic domain of ePKs contains a core domain of with PKs according to the Pfam database (http://pfam.xfam.
250–300 amino acid residues that is divided further into 12 org/) classification, followed by identifying the differences
sub-domains that have highly conserved individual amino between Ascomycota and Basidiomycota.
acid residues and motifs (Hanks 2003). aPKs lack sequence
similarity to the ePK catalytic domain, but do have kinase 2. Materials and methods
activity (Hanks and Hunter 1995). Based on the catalytic
domain, the ePK super-family can be classified into several The procedures used in this study can be classified into
major groups (Hanks and Hunter 1995; Becker and Joost three phases (Fig. 1). First, fungal proteomes were collected
1998; Manning et al. 2002b), and members of these groups from a genome database. Second, Kinannote was used
are grouped according to broad functional categories to annotate proteomes. Finally, the presence of each
with distinct sequences and structural features. Several domain within a specific fungal group was determined and
classification schemes for PKs have been proposed in the statistically analyzed.
literatures. In their original classification work, Hanks and
Hunter (1995) took into consideration conservation and 2.1. Data collection
phylogenetic analyses of catalytic domains. This has been
followed by several research groups for PK classification The proteomes of 17 fungi were downloaded from the Fungal
(Manning et al. 2002a; Krupa et al. 2004; Scheeff and Genome Initiative site of the Broad Institute (http://www.
Bourne 2005; Martin et al. 2009). The database KinBase broadinstitute.org/), 19 were obtained from GenBank of the
(http://www.kinase.com/kinbase) has been developed for National Center of Biological Information (NCBI) (http://www.
PK classification (Manning et al. 2002b; Martin et al. 2009). ncbi.nlm.nih.gov/), and the remaining 23 were downloaded
Currently, over 3 000 PK genes divided into 10 groups can from the United States Department of Energy (DOE) Joint
be found in KinBase. Goldberg et al. (2013) designed a
new software, Kinannote, which can identify and classify
members of the PK super-families. Kinannote significantly Predicted proteome
improved the average sensitivity and precision for full
classification of conserved PKs and had been implementes Kinannote
HMMER search
in several PK annotation programs.
Several hundreds of fungal genomes have been
Motif scoring
published (Zhao et al. 2013), including Ascomycota and
Basidiomycota, allowing for representative fungal data to
BLAST against local database and
be utilized. Kosti et al. (2010) compared and analyzed the classification to 10 groups
PKs from 30 fungi, allowing some meaningful discoveries,
although their study had several weaknesses. First, they
Pfam analysis Data analysis
were unable to develop a precise classification of PKs that
could be correlated to specific functions. Second, the PKs of
the 30 fungi were not compared on the basis of evolutionary Fig. 1 Flow chart of the protein kinase (PK) analysis pipeline.
98 PEI Guo-liang et al. Journal of Integrative Agriculture 2019, 18(1): 96–107
Genome Institute (JGI) site (http://genome.jgi-psf.org/). using SPSS Statistic 20.0 Software (http://www.01.ibm.com/
software/analytics/spss/).
2.2. Kinase annotation and statistical analysis
2.3. Domains searches
The Kinannote Program, downloaded from Sourceforge
(http://sourceforge.net/projects/kinannote/), was used to InterProScan 5.20 was used to identify domain sequences
extract and classify putative PKs from all fungal proteomes. from each putative PKs by searching sequences against the
Kinannote identified and classified PKs in three phases. Pfam A (V30) HMMs library (Jones et al. 2014). A minimum
First, the proteomes were searched with a PK hidden score value of e-value of 1×10–4 was used.
Markov models (HMMs) and a relaxed cutoff was used for
PK identification. Candidates were searched using position- 2.4. Principal component analysis (PCA)
specific scoring matrices and against a local version of
KinBase using BLAST, and BLAST results were then parsed. The 59 fungi were classified according to evolutionary
Second, BLAST results were used to identify conserved PKs stages, limiting the data to those domains present in at
with poor HMM scores. Finally, BLAST search results were least half of the fungi (at least 30). The PCA procedure was
applied to classify PKs. applied PKs and the domains identified as PK sequences,
The PK groups are divided as follows: The CMGC group using the GNU R Software in the ade4 Package (Dray and
was named after a set of families (CDK, MAPK, GSK3 and Dufour 2007).
CLK) and has a diverse set of functions, which include cell
cycle control, MAPK (mitogen-activated protein kinase) 3. Results and discussion
signaling, splicing, and other unknown functions. The other
group consists of several families and some unique PKs that 3.1. Distribution of PKs
are clearly ePKs but do not fit into other ePK groups. The
AGC group was named after the protein kinase A, G, and The predicted proteomes of 59 Basidiomycetes
C families (PKA, PKC and PKG) and contains many core and Ascomycetes fungi representing six sub-phyla
intracellular signaling PKs which are modulated by cyclic (Pucciniomycetes, Ustilaginomycetes, Agaricomycotina,
nucleotides, phospholipids and calcium. The STE group Pezizomycetes, Saccharomycetes and Taphrinomycetes)
includes homologs of the yeast STE7, STE11 and STE20 (Appendix B) were systematically screened for different
proteins, which form the MAPK cascade transducing signals families of kinomes using Kinannote Software. These
from the cell surface to the nucleus. The Ser/Thr group fungi also represent the five types of nutritional modes:
consists of only one family. The CK1 group is a small but saprophytic, facultative parasitic, hemi-biotrophic, biotrophic
ancient family, originally known as Casein Kinase 1 (from a and mutualistic symbiotic fungi, including pathogens of
biochemically assay with a non-physiological substrate), and plants, human, and insects as well as non-pathogenic fungi.
has been renamed Cell Kinase 1. The TKL group is most The focus of this study was on the fungal evolutionary status,
similar to tyrosine kinases. The PAB group was named after also taking into account other features of fungi, such as
the PAB-dependent poly(A)-specific ribonuclease subunit, nutritional modes and pathogenicity.
a catalytic subunit of the poly(A)-nuclease de-adenylation A total of 7 557 putative PK sequences were obtained.
complex, which have PK activity. There are two large They were classified into 119 families/sub-families and 10
groups, RGC and TK, which are not found in fungi and are PK groups (RGC and TK groups were not represented). All
not discussed in this article. It should be noted that the putative PKs were identified in fungal predicted proteomes.
classifications determined using the Kinannote database, As shown in Fig. 2, the distribution of the 10 PK groups was
based on KinBase, is slightly different from the newer version classified using Kinannote Software. All PK groups in fungi
of KinBase. We annotated all the families of PKs using can be classified into three classes by group percentage,
Saccharomyces cerevisiae S290C genes, and genes which the number of genes in a PK group compared to the total
cannot be found in S. cerevisiae were identified by other number of all PKs, which were then adjusted by a median
fungal genes (Appendix A). In order to avoid unnecessary value. The most populated PK groups were “CMGC” and
misunderstanding in different classification, we do not “Other”, averaging >20% from all fungi. The AGC, CAMK,
discuss the functions of PKs from any special fungus. STE and Ser/Thr groups comprised approximately 10% of
All data were extracted using Kinannote Software. all fungal PKs, while the CK1, PAB and TKL groups were
OriginPro 8.5 (http://originlab.com/index.aspx?go= the least populated groups (Appendix C). Most PK groups
PRODUCTS/OriginPro) was used to generate statistical were stably distributed in all fungi studied, especially the
box charts. Independent sample t-tests were performed CK1 and PAB groups. However, in the TKL and the Ser/Thr
PEI Guo-liang et al. Journal of Integrative Agriculture 2019, 18(1): 96–107 99
Basidiomycota
Ustilaginomycotina Ustilago maydis
Ceraceosorus bombacis
Malassezia globosa
Botryobasidium botryosum
Paxillus involutus
Laccaria bicolor
Hebeloma cylindrosporum
Calocera viscosa
Agaricomycotina
Calocera cornea
Coniophora puteana
Cryptococcus neoformans
Cryptococcus gattii
Moniliophthora perniciosa
Nectria haematococca
Cordyceps militaris AGC
Metarhizium acridum
Neurospora crassa Atypica
Gaeumannomyces graminis var. tritici
Metarhizium anisopliae CAMK
Arthroderma benhamiae
CK1
Cladosporium fulvum
Aspergillus clavatus CMGC
Mycosphaerella graminicola
Pyrenophora teres f. sp. teres Other
Ajellomyces capsulatus
PAB
Trichophyton rubrum
Pezizomycotina
Magnaporthe oryzae STE
Cochliobolus sativus
Penicillium marneffei TKL
Ascomycota
Coccidioides immitis
Arthrobotrys oligospora
Ser/Thr
Phaeosphaeria nodorum
Dothistroma septosporum
Botryotinia fuckeliana
Sclerotinia sclerotiorum
Fusarium graminearum
Tuber melanosporum
Chaetomium thermophilum
Blumeria graminis f. sp. hordei
Saccharomyces cerevisiae
Debaryomyces hansenii
Candida tropicalis
Saccharomycotina
Candida lusitaniae
Candida albicans
Lodderomyces elongisporus
Schizosaccharomyces japonicus
Schizosaccharomyces cryophilus
Schizosaccharomyces octosporus
Schizosaccharomyces pombe
Pneumocystis jirovecii
Taphrinomycotina Taphrina deformans
Fig. 2 Comparative analysis of fungal protein kinases (PKs). The number of kinase genes is represented as horizontal bars.
Colors indicate the major kinase groups: AGC, Atypica, CAMK, CK1, CMGC, other, PAB, STE, TKL and Ser/Thr protein kinase.
groups, the number of PKs highly varied among different (2010) reported a large number of TKL protein kinase
fungi (Appendix D). The species with the most PKs (389) genes in Laccaria bicolor. In the present study, TKL gene
was Botryobasidium botryosum, whereas Moniliophthora duplication was found not only in L. bicolor, but also in most
perniciosa only had 48 PKs. Agaricomycotina. Duplicated genes mostly belonged to
The differences in PKs among fungi were not limited TKL/TKL-CCIN group. In fact, the TKL/TKL-CCIN group
to the number but also in types of kinases. Kosti et al. was only found in Agaricomycetes, whereas Cryptococcus
100 PEI Guo-liang et al. Journal of Integrative Agriculture 2019, 18(1): 96–107
gattii and Calocera cornea had no genes encoding TKL/ gene functions. These results suggest that the function of
TKL-CCIN, suggesting that duplicated TKL/TKL-CCIN genes some PK families/sub-families may change and play an
could be a potential classifier for Agaricomycetes species. increasingly important role in the fungal cellular lifecycle.
The data show that PK gene duplication is a common The 119 families/sub-families vary greatly in the
phenomenon in fungi. Besides the TKL/TKL-CCIN family, distribution and abundance among fungi (Fig. 3). For
which was largely observed in Agaricomycetes, and the example, different members of the families Other/SCY1,
Ser/Thr group, which was identified in all fungi, there are CMGC/CK2, STE/STE20/PAKA and STE/STE20/YSK are
11 families/sub-families that were represented by more than present in the examined fungi. For the other 13 families/
two copies in each fungus. For example, the families/sub- sub-families, such as CMGC/CDKL, AGC/RSK/RSKP90 and
families ratio of other/AGAK1 was 5.5, indicating that the STE/STE20/KHS, only a single member was identified in one
fungi containing these families/sub-families of PKs have at predicted proteome, similar to the distribution in Arabidopsis
least five copies of other/AGAK1 genes. Duplicate genes thaliana, whereas many copies were identified in human and
are believed to be a major mechanism for establishing new mouse genomes (Manning et al. 2002b; Caenepeel et al.
0 10 20 30 40 50 59 0 10 20 30 40 50 59
AGC AGC Other Other/SCY1
AGC/NDR/NDR Other/HAL
AGC/PDK1 Other/BUB
AGC/AKT Other/CDC7
AGC/RSK/RSK-UNCLASSIFIED Other/IRE
AGC/NDR/NDR-UNCLASSIFIED Other/ULK/ULK
AGC/PKA Other/AUR
AGC/PKC Other/NEK
AGC/YANK Other/RAN
AGC/RSK/RSKP70 Other/VPS15
AGC/NDR Other/NAK
AGC/DMPK Other/TTK
AGC/RSK Other/BUD32
AGC/DMPK/GEK Other/PEK/GCN2
AGC/DMPK/ROCK Other/IKS
AGC/RSK/RSKP90 Other/HASPIN
AGC/SGK Other/PLK
Atypical Other/CAMKK/CAMKK-META
Atypical/ABC1/ABC1-C Other/WEE/WEE-UNCLASSIFIED
Atypical/ABC1/ABC1-B Other/CAMKK
Atypical/RIO/RIO2 Other/NAK/BIKE
Atypical/RIO/RIO1 Other/CAMKK/ELM
Atypical/ABC1 Other/PEK
Atypical/ABC1/ABC1-A
Other/WEE
Atypical/PI4K
Other/PLK/PLK1
Atypical/PIKK/FRAP
Other/AGAK1
Atypical/HISK
Other/ULK
Atypical/PIKK
Other/CILIATE-A1
Atypical/BCR
Other/NAK/MPSK
Atypical/RIO
Other/NEK/NEK2
CAMK CAMK
Other/PLK/SAK
CAMK/CAMK1/CAMK1-RCK
CAMK/CAMKL/AMPK
CMGC CMGC/CK2
CMGC/CDK/CRK7
CAMK/CAMKL/KIN1
CMGC/SRPK
CAMK/CAMKL/KIN4
CMGC/CDK
CAMK/CAMK1
CMGC/CDK/CDC2
CAMK/CAMKL/PASK
CMGC/CDK/CDK8
CAMK/CAMK1/CAMK1-CMK
CMGC/CLK
CAMK/CAMKL/GIN4
CAMK/CAMKL/CHK1 CMGC/MAPK
CAMK/CAMKL CMGC/MAPK/P38
CAMK/RAD53 CMGC/CDK/CDK5
CAMK/CAMKL/MARK CMGC/CDK/CDK7
CAMK/CAMKL/BRSK CMGC/DYRK/YAK
CMGC/GSK
STE STE/STE20/PAKA CMGC/DYRK/DYRK2
STE/STE20/YSK CMGC/RCK/MAK
STE/STE11/BCK1 CMGC/DYRK/PRP4
STE/STE11/SSK CMGC/MAPK/ERK1
STE/STE11 CMGC
STE/STE7 CMGC/CDK/CDK9
STE/STE7/MEK1 CMGC/RCK
STE/STE11/CDC15 CMGC/CDK/PITSLRE
STE/STE7/MKK CMGC/DYRK
STE/STE20/FRAY CMGC/CDKL
STE/STE20
STE TKL TKL
STE/STE20/KHS TKL/LISK/LISK-DD1
TKL/TKL-CCIN
CK1 CK1/CK1/CK1-D
CK1/CK1/CK1-G PAB PAB-dependent
CK1/CK1
CK1 Ser/Thr Serine/Threonine
Fig. 3 Distribution of each protein kinase (PK) group among the 59 fungi. The colums showed the numbers of fungi that have
the members of the specific kinase group.
PEI Guo-liang et al. Journal of Integrative Agriculture 2019, 18(1): 96–107 101
2004; Champion et al. 2004). comparison to the kinome size. At the sub-phylum level,
only Agaricomycetes had remarkable gene duplication in
3.2. Differences between Basidiomycetes and As- PKs (Appendix E).
comycetes In addition to those already mentioned above, several
PK families/sub-families were only observed in either
The studied fungi, which belong to Basidiomycetes and Basidiomycetes or Ascomycetes, but not both. Ten and
Ascomycetes, had different types and numbers of PKs. 13 PK families/sub-families are only found in Ascomycetes
Although these fungi of two phyla were separated long or in Basidiomycetes (Table 1). It should be noted that
ago, their PKs in some groups are still highly analogous, some families/sub-families, such as TKL/TKL-CCIN, Other/
however, the high homology was not found across all PK AGAK1 and TKL/LISK/LISK-DD1 in Basidiomycetes,
groups. Interestingly, the number of PKs in the STE group have similar numbers of duplicated genes, such as
is more abundant in Basidiomycetes than in Ascomycetes Other/NAK/BIKE, AGC/RSK/RSKP70 and Atypical/HISK
(Fig. 4-A, P<0.01). In contrast, the percentage of TKL in Ascomycetes. Interestingly, both TKL/TKL-CCIN and
in Basidiomycetes was higher than in Ascomycetes, Other/AGAK1 are only found in Agaricomycetes at high
whereas the group percentages of CMGC and other PKs
numbers. In contrast, TKL/LISK/LISK-DD1, STE/STE20/
in Basidiomycetes is lower than in Ascomycetes (Fig. 4-B,
FRAY and AGC/DMPK, which have more than 10 copies,
P<0.01). This result may indicate that Basidiomycetes
can be found in most Basidiomycetes. Other/NAK/BIKE
depend less on CMGC and other PKs during the cell cycle.
and AGC/RSK/RSKP70, which also have more than 10
PK gene duplication in Agaricomycetes may partially
copies, are found in most Ascomycetes and have no
explain why the STE group is more abundant in
obvious sub-phylum features.
Basidiomycetes than in Ascomycetes. In addition, the
The kinase percentage (the ratio of kinase number
decreased group percentage for CMGC and Other PKs
to gene number) differed between 0.5 and 2.5% among
in Basidiomycetes compared to Ascomycetes could be
different sub-phyla (Fig. 5). For most Taphrinomycetes,
partially due to gene amplification of TKL/TKL-CCIN. The
the kinase percentage was >2%, greater than all other
analysis indicates that, although Basidiomycetes fungi
fungi. Only Taphrina deformans has a kinase percentage
have bigger genomes than Ascomycetes, the genome
differences between these two phyla are not significant in of only 1.4%. As few fungal kinase percentages reach the
level of Taphrinomycetes, this may serve as a marker for
Taphrinomycetes species.
A B
60 Basidiomycota
Ascomycota Table 1 Phylum-specific protein kinase families/sub-families
25 Max.
99% Families Basidiomycetes Ascomycetes
95%
50 75% TKL/TKL-CCIN 552 0
Mean
Media Other/AGAK1 33 0
20 25%
TKL/LISK/LISK-DD1 18 0
Groups percentage (%)
40 5%
STE/STE20/FRAY 17 0
Number of kinases
1%
Min.
AGC/DMPK 12 0
15 30 AGC/DMPK/GEK 3 0
STE/STE20/KHS 1 0
CK1 1 0
10 20 CAMK/CAMKL/BRSK 1 0
Other/NEK/NEK2 1 0
AGC/RSK/RSKP90 1 0
10
5 AGC/DMPK/ROCK 1 0
Atypical/BCR 1 0
0 Other/NAK/BIKE 0 45
0 AGC/RSK/RSKP70 0 32
STE TKL CMGC Other Atypical/HISK 0 5
CMGC/DYRK 0 3
Other/CILIATE-A1 0 1
Fig. 4 Difference in protein kinases (PKs) between Ascomycetes
CMGC/CDKL 0 1
and Basidiomycetes. A, the number of PKs in group STE was
plotted for Ascomycetes and Basidiomycetes. B, the group AGC/SGK 0 1
percentage of PKs in groups TKL, CMGC and other were Other/PLK/SAK 0 1
plotted for Ascomycetes and Basidiomycetes respectively Atypical/RIO 0 1
(t-test, P<0.01). Other/NAK/MPSK 0 1
102 PEI Guo-liang et al. Journal of Integrative Agriculture 2019, 18(1): 96–107
PH domain C1 domain
RIO1 family
Byr2
Response regulator receiver
Rio2, N-terminal
Protein tyrosine kinase
SAM domain
C2 domain
DUF3635
SNF1-like protein kinase
Ubiquitin associated domain
Fungal kinase associated-1
Kinase associated-1 domain Anticodon binding domain
WD domain, G-beta repeat Ribonuclease 2-5A
His kinase A DUF3543
Fig. 6 Distribution of domains in protein kinases (PKs). A, distribution of the number of domains found in all fungal kinases over
the 10 groups of PKs. B and C, distribution of the number of domains found in Ascomycetes and Basidiomycetes. Domains with
kinase catalytic activity are colored deep blue, while the others in green.
any domain in this phylum. and these two groups are equally represented. In the
The distribution of kinase associated domains among CAMK group, there were more domains exclusive to
the 10 PK groups is different. As shown in Fig. 7-A, the Basidiomycetes than Ascomycetes, but this was reversed
CAMK group contains 38 types of domains, making it the in the CMGC and Ser/Thr groups. For the TKL and CK1
most complicated group of all PKs. This indicates that groups, no such domains were found in Ascomycetes.
kinase proteins in the CAMK group have a richness of The PAB group only had common domains found in both
functional domains and a flexibility/expansibility in function. Basidiomycetes and Ascomycetes.
The CK1 group contains two types of domains, containing
only a protein kinase domain and a casein kinase 1 gamma 3.4. PKs in different nutrition modes and pathoge-
C terminus which lacks kinase activity. In contrast, the nicity of fungi
distribution of 10 common types of kinase domains among
the 10 PK groups is different. As shown in Fig. 7-B, in most Different fungi live in very particular conditions, and PKs are
cases, the kinase proteins in the “Other” PK group contain a utilized to best adapt to their specific environment. Thus,
protein kinase domain and a small domain which has kinase it is important to uncover differences in fungal life styles,
activity. In contrast, kinase proteins in the TKL group contain such as methods for obtaining nutrients or invading a host.
both a protein kinase domain and a protein tyrosine kinase, In the present study, the evolutionary status of each fungal
which takes up almost half of the entire sequence. This is species is inextricably tied to survival. Except Pneumocystis
similar to the CAMK group, which contains more domain jirovecii and T. deformans, most Saccharomycotina and
types, reducing the protein kinase domain percentage Taphrinomycotina fungal species are yeasts, which have
(Fig. 7-B). For the AGC and Atypical groups, domains the same nutritional mode and do not invade hosts. This
are found exclusively in Basidiomycetes or Ascomycetes, suggests that the nutritional modes within the same sub-
104 PEI Guo-liang et al. Journal of Integrative Agriculture 2019, 18(1): 96–107
phylum can be compared to determine significant differences Seven PK families, which form the largest group among
based on the same evolutionary status. Thus, nutritional all studied groups, only appear in plant pathogenic fungi,
modes and pathogenicity affect the evolutionary status of suggesting that these fungi may have different strategies
fungi. in adapting to kinds of plants, such as monocotyledons and
As shown in Fig. 8-A, pathogenic fungi with different dicotyledons. Some fungi from the Saccharomycotina and
hosts (including plant, insect and human pathogens) and Agaricomycotina were used as a negative control group
yeasts were compared. Interestingly, all fungi with different to determine the differences between pathogenic and
hosts and yeasts have particular families of kinases. The non-pathogenic fungi in the five PK families found in this
particular PK families of human pathogenic fungi are Other/ group.
CILIATE-A1 and Other/PLK/SAK. The AGC/SGK family is The differences in PKs between fungi with different
only found in insect pathogenic fungi, including Ajellomyces nutritional modes were compared (Fig. 8-B). Common
capsulatus, Cordyceps militaris and Metarhizium anisopliae. PK families were the most prevalent and 84 families were
A Percentage (%) B
0 10 20 30 40
Atypical Protein kinase domain
CAMK
Protein tyrosine kinase
Other TKL
Protein kinase C terminal domain
Ser/Thr PAB
C1 domain
CMGC Ser/Thr RIO1 family
Fig. 7 Domain distribution in protein kinase (PK) groups. A, the number of domains found in PK groups. B, the distribution of
kinase catalytic domains between PK groups. The x-axis denotes the percentage of genes contained the domains in PK groups.
Facu
ltativ
A Insect Plant B e pa
rasit
2 ic
c
7
hi
1
p
N
ro
on 0 1
ot
an
0 e 1
Bi
5 3 1 0
Hemibio
0
um
2 0 2
H
0 1
1 0 3 0
2 5 0 3
trophic
84 0
2
0 1
3 85 0 1
0 0
0 1
1
0 1
9 0 2
Sy
gi
m
6
fun
bio
ic
1 hyt
tic
aprop
S
Fig. 8 Venn diagrams of shared protein kinase (PK) families and unique PK families in fungi. A, Venn diagram showing the
common and specific PKs among fungi. Three hosts of pathogenic fungi (human, insect, plant) and yeast were used to generate
the Venn diagram. In the graph were reported the number of species-specific and common PKs. B, Venn diagram showing the
common and specific PKs in five modes of nutrition in fungi. Five types of nutritional modes: saprophytic, facultative parasitic,
hemi-biotrophic, biotrophic, and symbiotic fungi were used to generate the Venn diagram. In the graph were reported the number
of species-specific and common PKs.
PEI Guo-liang et al. Journal of Integrative Agriculture 2019, 18(1): 96–107 105
found. Dissimilar from other kinds of nutritional modes, stands for one or several PK families/sub-families, which
mutualistic symbiotic fungi show no particular PK families. were clustered together in small groups. Where circles
This result confirms that mutualistic symbiotic fungi have intersect with each other a bias for PK families/sub-families
lost many functional genes during the symbiosis and part within different groups of fungi is suggested. As illustrated
of the gene regulatory networks is dependent on the host in Fig. 9, the circle of Agaricomycotina covers the largest
regulation system. In contrast, saprophytic fungi have the area and gives a good description of PK gene amplification
largest amount of PK families. This suggested that this kind in this sub-phylum. The two sub-phyla Taphrinomycotina
of fungi must be more flexible in order to survive complex and Saccharomycotina cover an area close to one another,
environments. More details on the PK families/sub-families providing a good indication that most of the fungi in these
in this analysis are included in Appendix F. two sub-phyla are yeasts and have a close evolutionary
status and similar lifestyles. It should be noted that all of
3.5. Principal component analysis of fungal kinases the circles are consistent with evolutionary status, and the
discontinuous distribution suggested that evolution of the
PCA is one of the most useful statistical tools for analyzing PKs are complicated and associated with the lifestyle.
multivariate data and has been widely applied to analyze In parallel, 26 of the most common domains were
biological data. PCA transforms a number of correlated analyzed (Fig. 10). This is distinguished from the result of
variables into a smaller number of uncorrelated variables,
which are called principal components (PCs) with a minimal
loss of information. The reduced numbers of top ranked PCs
are calculated by projecting samples onto spaces spanned by
“eigenvectors” of a sample covariance matrix and selecting
the “eigenvectors” that comprise the largest contribution of
the sample variation. There are two approaches to perform
PCA, using “eigenvalue” decomposition (P-mode), or
singular value decomposition (Q-mode). The present study
utilized the ade4 package of R language to analyze the data
with the “eigenvalue” decomposition (P-mode) method,
which uses the covariance relationships between markers,
focusing on the differentiation of PK families/sub-families
with different evolutionary status. Discovering relationships
PC2 value
the PK families/sub-families, as this figure was drawn using differentiated for the five nutritional modes: saprophytic,
common domains, showing that the areas of each circle facultative parasitic, hemi-biotrophic, biotrophic, and
intersect each other more steadily and continuously, except mutualistic symbiotic fungi (Appendix G). As a result, the PK
the areas of Taphrinomycotina and Saccharomycotina which composition for biotrophic fungi was remarkably consistent
are irregularly distributed. This does not reflect a clear with the symbiotic fungi, as most the areas coincided.
correlation between domain distributions and taxonomic Although the hemi-biotrophic and biotrophic fungi have
classification, while also reflecting the divergence between similar stages in their lifecycles, there is a difference in the
main functional domains and associated domains, which PK composition between these two kinds of fungi. The
have different roles in natural selection. largest area covered by saprophytic fungi may suggest that
In contrast, when the PK families/sub-families for this kind of fungi have a complete PK composition compared
fungi from different nutritional modes and pathogenicity with the four other kinds of nutritional modes. Clustering the
were analyzed by PCA, 48 of the 59 fungi were clearly PK families/sub-families of pathogenic fungi with different
hosts (plant, insect and human) and yeasts revealed that
the PK compositions of plant pathogenic fungi and insect
pathogenic fungi are almost identical as all of the areas
covered by insect pathogenic fungi were surrounded by plant
pathogenic fungi (Appendix H). Human pathogenic fungi are
very different from plant pathogenic and insect pathogenic
fungi. The greatest area covered by yeasts, which do not
have a host, suggests that there is a big difference in PK
composition between pathogenic and non-pathogenic fungi.
4. Conclusion
domains. Furthermore, the relationships between different structure and classification. The FASEB Journal, 9,
classifications of fungi, such as evolutionary status, 576–596.
nutritional mode and pathogenicity, indicate that the Hibbett D S, Binder M, Bischoff J F, Blackwell M, Cannon P F,
Eriksson O E, Huhndorf S, James T, Kirk P M, Lücking R.
distribution of PK families is influenced by both evolutionary
2007. A higher-level phylogenetic classification of the fungi.
status and by fungal lifestyle. Thus, the present study
Mycological Research, 111, 509–547.
provides insights into the variety and expansion of PK Jones P, Binns D, Chang H Y, Fraser M, Li W, McAnulla C,
families, which will be useful in understanding the differences McWilliam H, Maslen J, Mitchell A, Nuka G, Pesseat S,
in regulatory mechanisms among different species of fungi. Quinn A F, Sangrador-Vegas A, Scheremetjew M, Yong
S Y, Lopez R, Hunter S. 2014. InterProScan 5: Genome-
Acknowledgements scale protein function classification. Bioinformatics, 30,
1236–1240.
Kosti I, Mandel-Gutfreund Y, Glaser F, Horwitz B A. 2010.
This work was supported by the National Science &
Comparative analysis of fungal protein kinases and
Technology Pillar Program of China during the Twelfth
associated domains. BMC Genomics, 11, 133.
Five-Year Plan period (2012BAD19B04), the National Krupa A, Abhinandan K, Srinivasan N. 2004. KinG: A database
Natural Science Foundation of China (31371924), the 111 of protein kinases in genomes. Nucleic Acids Research,
Project from the Ministry of Education of China (B07049), 32, D153–D155.
and the National Basic Research Program of China LaRonde-LeBlanc N, Wlodawer A. 2005. The RIO kinases:
(2013CB127700). We thank Prof. Liu Huiquan, College An atypical protein kinase family required for ribosome
biogenesis and cell cycle progression. Biochimica et
of Plant Protection, Northwest A&F University, China for
Biophysica Acta (BBA: Proteins and Proteomics), 1754,
helpful discussion.
14–24.
Manning G, Plowman G D, Hunter T, Sudarsanam S. 2002a.
Appendices associated with this paper can be available on Evolution of protein kinase signaling from yeast to man.
http://www.ChinaAgriSci.com/V2/En/appendix.htm Trends in Biochemical Sciences, 27, 514–520.
Manning G, Whyte D B, Martinez R, Hunter T, Sudarsanam
References S. 2002b. The protein kinase complement of the human
genome. Science, 298, 1912–1934.
Becker W, Joost H G. 1998. Structural and functional Martin D M, Miranda-Saavedra D, Barton G J. 2009. Kinomer
characteristics of Dyrk, a novel subfamily of protein kinases v. 1.0: A database of systematically classified eukaryotic
with dual specificity. Progress in Nucleic Acid Research and protein kinases. Nucleic Acids Research, 37, D244–D250.
Molecular Biology, 62, 1–17. Mclaughlin D J, Hibbett D S, Lutzoni F, Spatafora J W, Vilgalys
Caenepeel S, Charydczak G, Sudarsanam S, Hunter T, R. 2009. The search for the fungal tree of life. Trends in
Manning G. 2004. The mouse kinome: Discovery and Microbiology, 17, 488.
comparative genomics of all mouse protein kinases. Scheeff E D, Bourne P E. 2005. Structural evolution of the
Proceedings of the National Academy of Sciences of the protein kinase-like superfamily. PLoS Computational
United States of America, 101, 11707–11712. Biology, 1, e49.
Champion A, Kreis M, Mockaitis K, Picaud A, Henry Y. 2004. Stajich J E, Berbee M L, Blackwell M, Hibbett D S, James T
Arabidopsis kinome: After the casting. Functional & Y, Spatafora J W, Taylor J W. 2009. The fungi. Current
Integrative Genomics, 4, 163–187. Biology, 19, R840–R845.
Cohen P. 2000. The regulation of protein function by multisite Taylor J W, Berbee M L. 2006. Dating divergences in the
phosphorylation - A 25 year update. Trends in Biochemical fungal tree of life: Review and new analyses. Mycologia,
Sciences, 25, 596–601. 98, 838–849.
Dray S, Dufour A B. 2007. The ade4 package: Implementing Wang H, Guo S, Huang M, Thorsten L H, Wei J. 2010.
the duality diagram for ecologists. Journal of Statistical Ascomycota has a faster evolutionary rate and higher
Software, 22, 1–20. species diversity than Basidiomycota. Science China (Life
Goldberg J M, Griggs A, Smith J L, Haas B, Wortman J, Sciences), 53, 1163–1169.
Zeng Q. 2013. Kinannote, a computer program to identify Wuichet K, Cantwell B J, Zhulin I B. 2010. Evolution and phyletic
and classify members of the eukaryotic protein kinase distribution of two-component signal transduction systems.
superfamily. Bioinformatics, 29, 2387–2394. Current Opinion in Microbiology, 13, 219–225.
Hanks S K. 2003. Genomic analysis of the eukaryotic protein Zhao Z, Liu H, Wang C, Xu J R. 2013. Comparative analysis of
kinase superfamily: A perspective. Genome Biology, 4, 111. fungal genomes reveals different plant cell wall degrading
Hanks S K, Hunter T. 1995. Protein kinases 6. The eukaryotic capacity in fungi. BMC Genomics, 14, 274.
protein kinase superfamily: Kinase (catalytic) domain