
Journal of Integrative Agriculture 2019, 18(1): 96–107

Available online at www.sciencedirect.com

ScienceDirect

RESEARCH ARTICLE

Comparative analysis of protein kinases and associated domains between Ascomycota and Basidiomycota

PEI Guo-liang, GUO Jun, WANG Qin-hu, KANG Zhen-sheng

State Key Laboratory of Crop Stress Biology for Arid Areas, College of Plant Protection, Northwest A&F University, Yangling
712100, P.R.China

Abstract
Protein kinases play an important role in every aspect of cellular life. In this study, we systematically identified protein
kinases from the predicted proteomes of 59 representative fungi from Ascomycota and Basidiomycota. Comparative
analysis revealed that fungi from Ascomycota and Basidiomycota differed in the number and variety of protein kinases.
Some groups of protein kinases, such as the CMGC group (named after the CDK, MAPK, GSK3 and CLK families) and the
Other group, had the highest group percentages and were the most prevalent protein kinases among all fungal species tested.
In contrast, the STE group (homologs of the yeast STE7, STE11 and STE20 genes) was more abundant in Basidiomycetes than in Ascomycetes. Importantly,
the distribution of some protein kinase families appeared to be subphylum-specific. The tyrosine kinase-like (TKL) group
had a higher protein kinase density in Agaricomycotina fungi. In addition, the distribution of accessory domains, which could
have functional implications, demonstrated that usage bias varied between the two phyla. Principal component analysis
revealed a divergence between the main functional domains and associated domains in fungi. This study provides novel
insights into the variety and expansion of fungal protein kinases between Ascomycota and Basidiomycota.

Keywords: protein kinases, associated domains, Ascomycota, Basidiomycota

Received 27 April, 2018 Accepted 21 June, 2018
PEI Guo-liang, E-mail: peiguoliang@nwafu.edu.cn; Correspondence KANG Zhen-sheng, Tel/Fax: +86-29-87080061, E-mail: kangzs@nwsuaf.edu.cn; GUO Jun, Tel/Fax: +86-29-87082439, E-mail: guojunwgq@nwsuaf.edu.cn
© 2019 CAAS. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
doi: 10.1016/S2095-3119(18)62022-2

1. Introduction

Dikarya is a subkingdom of fungi that contains the two largest fungal phyla, Ascomycota and Basidiomycota, accounting for the vast majority of terrestrial fungi (Stajich et al. 2009). Including molds, mushrooms, rusts, smuts and yeasts, Ascomycota and Basidiomycota make essential contributions to the biosphere, the bio-product industry and medicine, and include animal, human and plant pathogens. Although separated by almost two billion years, and comprising a huge number of family members, Ascomycota and Basidiomycota have retained the common feature of dikaryotic hyphae, and thus belong to the Dikarya (Mclaughlin et al. 2009). In addition to similarities in mating, in which nuclear fusion does not directly follow gamete fusion (Taylor and Berbee 2006), Ascomycota and Basidiomycota share several general characteristics, as well as some specialties (Wang et al. 2010). Thus, it is important to study functional protein families in the Dikarya in order to understand the molecular characteristics of these organisms. Such studies may provide fundamental
information for understanding how Ascomycota and Basidiomycota fungi evolved.

Protein kinases (PKs) constitute a key class of enzymes and regulate numerous cellular processes, including mitosis, communication, differentiation, metabolism, and transcription (Hanks and Hunter 1995; Cohen 2000). By adding a phosphate group from ATP/GTP to the side chains of target proteins, PKs often profoundly alter the biological activity of the target molecules (LaRonde-LeBlanc and Wlodawer 2005; Wuichet et al. 2010). All PKs can be classified into two super-families: 1) eukaryotic or conventional protein kinases (ePKs), which are the most prevalent and have a common ancestry; and 2) atypical protein kinases (aPKs) (Hanks and Hunter 1995). ePKs have two functional domains, a catalytic domain which binds and phosphorylates target proteins, and a regulatory region. The catalytic domain of ePKs contains a core domain of 250–300 amino acid residues that is divided further into 12 sub-domains that have highly conserved individual amino acid residues and motifs (Hanks 2003). aPKs lack sequence similarity to the ePK catalytic domain, but do have kinase activity (Hanks and Hunter 1995). Based on the catalytic domain, the ePK super-family can be classified into several major groups (Hanks and Hunter 1995; Becker and Joost 1998; Manning et al. 2002b), and members of these groups are grouped according to broad functional categories with distinct sequences and structural features. Several classification schemes for PKs have been proposed in the literature. In their original classification work, Hanks and Hunter (1995) took into consideration conservation and phylogenetic analyses of catalytic domains. This approach has been followed by several research groups for PK classification (Manning et al. 2002a; Krupa et al. 2004; Scheeff and Bourne 2005; Martin et al. 2009). The database KinBase (http://www.kinase.com/kinbase) has been developed for PK classification (Manning et al. 2002b; Martin et al. 2009). Currently, over 3 000 PK genes divided into 10 groups can be found in KinBase. Goldberg et al. (2013) designed a new software, Kinannote, which can identify and classify members of the PK super-families. Kinannote significantly improved the average sensitivity and precision for full classification of conserved PKs and has been implemented in several PK annotation programs.

Several hundred fungal genomes have been published (Zhao et al. 2013), including those of Ascomycota and Basidiomycota, allowing representative fungal data to be utilized. Kosti et al. (2010) compared and analyzed the PKs from 30 fungi, allowing some meaningful discoveries, although their study had several weaknesses. First, they were unable to develop a precise classification of PKs that could be correlated to specific functions. Second, the PKs of the 30 fungi were not compared on the basis of evolutionary stage. Lastly, the important role of PKs in cell life was not emphasized and the relationship of PKs to the life cycle of fungi was not explored. As many fungal genomes have been sequenced and new tools are available to classify PKs, it is necessary to analyze fungal PKs with new methods.

In this study, we identified and compared the full repertoires of PKs from 59 representative fungal species from Ascomycota and Basidiomycota using Kinannote. All 59 fungi can be classified into six sub-phyla, as demonstrated by Hibbett et al. (2007). The six sub-phyla are Pucciniomycotina, Ustilaginomycotina, Agaricomycotina, Pezizomycotina, Saccharomycotina and Taphrinomycotina, listed here in order of their evolutionary stage. The PKs of the 59 fungi were classified into 10 groups and 119 families/sub-families using different strategies. In addition, we identified all domains associated with PKs according to the Pfam database (http://pfam.xfam.org/) classification, and then identified the differences between Ascomycota and Basidiomycota.

2. Materials and methods

The procedures used in this study can be classified into three phases (Fig. 1). First, fungal proteomes were collected from a genome database. Second, Kinannote was used to annotate proteomes. Finally, the presence of each domain within a specific fungal group was determined and statistically analyzed.

[Figure 1 boxes: Predicted proteome; Kinannote (HMMER search; Motif scoring; BLAST against local database and classification to 10 groups); Pfam analysis; Data analysis.]
Fig. 1 Flow chart of the protein kinase (PK) analysis pipeline.

2.1. Data collection

The proteomes of 17 fungi were downloaded from the Fungal Genome Initiative site of the Broad Institute (http://www.broadinstitute.org/), 19 were obtained from GenBank of the National Center for Biotechnology Information (NCBI) (http://www.ncbi.nlm.nih.gov/), and the remaining 23 were downloaded from the United States Department of Energy (DOE) Joint
Genome Institute (JGI) site (http://genome.jgi-psf.org/).

2.2. Kinase annotation and statistical analysis

The Kinannote Program, downloaded from Sourceforge (http://sourceforge.net/projects/kinannote/), was used to extract and classify putative PKs from all fungal proteomes. Kinannote identified and classified PKs in three phases. First, the proteomes were searched with a PK hidden Markov model (HMM) and a relaxed cutoff was used for PK identification. Candidates were searched using position-specific scoring matrices and against a local version of KinBase using BLAST, and BLAST results were then parsed. Second, BLAST results were used to identify conserved PKs with poor HMM scores. Finally, BLAST search results were applied to classify PKs.

The PK groups are divided as follows: The CMGC group was named after a set of families (CDK, MAPK, GSK3 and CLK) and has a diverse set of functions, which include cell cycle control, MAPK (mitogen-activated protein kinase) signaling, splicing, and other unknown functions. The Other group consists of several families and some unique PKs that are clearly ePKs but do not fit into other ePK groups. The AGC group was named after the protein kinase A, G, and C families (PKA, PKG and PKC) and contains many core intracellular signaling PKs which are modulated by cyclic nucleotides, phospholipids and calcium. The STE group includes homologs of the yeast STE7, STE11 and STE20 proteins, which form the MAPK cascade transducing signals from the cell surface to the nucleus. The Ser/Thr group consists of only one family. The CK1 group is a small but ancient family, originally known as Casein Kinase 1 (from a biochemical assay with a non-physiological substrate), and has been renamed Cell Kinase 1. The TKL group is most similar to tyrosine kinases. The PAB group was named after the PAB-dependent poly(A)-specific ribonuclease subunit, a catalytic subunit of the poly(A)-nuclease de-adenylation complex, which has PK activity. There are two large groups, RGC and TK, which are not found in fungi and are not discussed in this article. It should be noted that the classification determined using the Kinannote database, which is based on KinBase, is slightly different from that of the newer version of KinBase. We annotated all the families of PKs using Saccharomyces cerevisiae S288C genes, and genes which cannot be found in S. cerevisiae were identified by other fungal genes (Appendix A). To avoid confusion between classification schemes, we do not discuss the functions of PKs from any particular fungus.

All data were extracted using Kinannote Software. OriginPro 8.5 (http://originlab.com/index.aspx?go=PRODUCTS/OriginPro) was used to generate statistical box charts. Independent sample t-tests were performed using SPSS Statistics 20.0 Software (http://www.01.ibm.com/software/analytics/spss/).

2.3. Domain searches

InterProScan 5.20 was used to identify domain sequences from each putative PK by searching sequences against the Pfam-A (v30) HMM library (Jones et al. 2014). An E-value threshold of 1×10⁻⁴ was used.

2.4. Principal component analysis (PCA)

The 59 fungi were classified according to evolutionary stages, limiting the data to those domains present in at least half of the fungi (at least 30). The PCA procedure was applied to the PKs and the domains identified in PK sequences, using the GNU R Software with the ade4 Package (Dray and Dufour 2007).

3. Results and discussion

3.1. Distribution of PKs

The predicted proteomes of 59 Basidiomycetes and Ascomycetes fungi representing six sub-phyla (Pucciniomycotina, Ustilaginomycotina, Agaricomycotina, Pezizomycotina, Saccharomycotina and Taphrinomycotina) (Appendix B) were systematically screened for PK families using Kinannote Software. These fungi also represent the five types of nutritional modes: saprophytic, facultative parasitic, hemi-biotrophic, biotrophic and mutualistic symbiotic fungi, including pathogens of plants, humans and insects as well as non-pathogenic fungi. The focus of this study was on the fungal evolutionary status, also taking into account other features of fungi, such as nutritional modes and pathogenicity.

A total of 7 557 putative PK sequences were obtained from the predicted fungal proteomes. They were classified into 119 families/sub-families and 10 PK groups (RGC and TK groups were not represented). Fig. 2 shows the distribution of the 10 PK groups as classified by Kinannote Software. All PK groups in fungi can be divided into three classes by group percentage, defined as the number of genes in a PK group relative to the total number of PKs and then adjusted by a median value. The most populated PK groups were “CMGC” and “Other”, each averaging >20% of PKs across all fungi. The AGC, CAMK, STE and Ser/Thr groups each comprised approximately 10% of all fungal PKs, while the CK1, PAB and TKL groups were the least populated groups (Appendix C). Most PK groups were stably distributed in all fungi studied, especially the CK1 and PAB groups. However, in the TKL and the Ser/Thr
[Figure 2: horizontal stacked bar chart. x-axis: number of kinase genes (0–400). y-axis: the 59 fungal species, grouped by phylum and sub-phylum as listed below. Legend: AGC, Atypical, CAMK, CK1, CMGC, Other, PAB, STE, TKL, Ser/Thr.]

Basidiomycota
Pucciniomycotina: Puccinia striiformis, Melampsora laricis-populina, Puccinia graminis, Microbotryum violaceum, Rhodotorula graminis, Mixia osmundae
Ustilaginomycotina: Sporisorium reilianum, Tilletiaria anomala, Ustilago maydis, Ceraceosorus bombacis, Malassezia globosa
Agaricomycotina: Botryobasidium botryosum, Paxillus involutus, Laccaria bicolor, Hebeloma cylindrosporum, Calocera viscosa, Calocera cornea, Coniophora puteana, Cryptococcus neoformans, Cryptococcus gattii, Moniliophthora perniciosa

Ascomycota
Pezizomycotina: Nectria haematococca, Cordyceps militaris, Metarhizium acridum, Neurospora crassa, Gaeumannomyces graminis var. tritici, Metarhizium anisopliae, Arthroderma benhamiae, Cladosporium fulvum, Aspergillus clavatus, Mycosphaerella graminicola, Pyrenophora teres f. sp. teres, Ajellomyces capsulatus, Trichophyton rubrum, Magnaporthe oryzae, Cochliobolus sativus, Penicillium marneffei, Coccidioides immitis, Arthrobotrys oligospora, Phaeosphaeria nodorum, Dothistroma septosporum, Botryotinia fuckeliana, Sclerotinia sclerotiorum, Fusarium graminearum, Tuber melanosporum, Chaetomium thermophilum, Blumeria graminis f. sp. hordei
Saccharomycotina: Saccharomyces cerevisiae, Debaryomyces hansenii, Candida tropicalis, Candida lusitaniae, Candida albicans, Lodderomyces elongisporus
Taphrinomycotina: Schizosaccharomyces japonicus, Schizosaccharomyces cryophilus, Schizosaccharomyces octosporus, Schizosaccharomyces pombe, Pneumocystis jirovecii, Taphrina deformans

Fig. 2 Comparative analysis of fungal protein kinases (PKs). The number of kinase genes is represented as horizontal bars.
Colors indicate the major kinase groups: AGC, Atypical, CAMK, CK1, CMGC, Other, PAB, STE, TKL and Ser/Thr protein kinase.
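
The group percentage used in Section 3.1 (the number of genes in a PK group relative to the total number of PKs) can be illustrated with a minimal Python sketch. The per-species counts below are hypothetical placeholders rather than values from this study, the function name `group_percentages` is an assumption made for illustration, and taking the median across species is only one plausible reading of the median adjustment described in the text.

```python
from statistics import median


def group_percentages(counts):
    """Return each PK group's share (%) of all PKs in one species."""
    total = sum(counts.values())
    return {group: 100.0 * n / total for group, n in counts.items()}


# Hypothetical per-species kinase counts (illustrative only, not study data).
kinomes = {
    "species_A": {"CMGC": 25, "Other": 25, "AGC": 15, "CAMK": 15,
                  "STE": 10, "CK1": 5, "TKL": 5},
    "species_B": {"CMGC": 30, "Other": 20, "AGC": 10, "CAMK": 20,
                  "STE": 10, "CK1": 5, "TKL": 5},
}

per_species = {name: group_percentages(c) for name, c in kinomes.items()}

# Median group percentage across species, one value per PK group.
groups = sorted({g for c in kinomes.values() for g in c})
median_pct = {
    g: median(p.get(g, 0.0) for p in per_species.values()) for g in groups
}

# Print groups from most to least populated.
for g in sorted(median_pct, key=median_pct.get, reverse=True):
    print(f"{g}: {median_pct[g]:.1f}%")
```

Sorting the medians in this way separates heavily populated groups from sparsely populated ones, which is the kind of ranking the three classes in the text describe.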

groups, the number of PKs varied highly among different fungi (Appendix D). The species with the most PKs (389) was Botryobasidium botryosum, whereas Moniliophthora perniciosa only had 48 PKs.

The differences in PKs among fungi were not limited to the number but also extended to the types of kinases. Kosti et al. (2010) reported a large number of TKL protein kinase genes in Laccaria bicolor. In the present study, TKL gene duplication was found not only in L. bicolor, but also in most Agaricomycotina. Duplicated genes mostly belonged to the TKL/TKL-CCIN group. In fact, the TKL/TKL-CCIN group was only found in Agaricomycetes, whereas Cryptococcus

gattii and Calocera cornea had no genes encoding TKL/TKL-CCIN, suggesting that duplicated TKL/TKL-CCIN genes could be a potential classifier for Agaricomycetes species.

The data show that PK gene duplication is a common phenomenon in fungi. Besides the TKL/TKL-CCIN family, which was largely observed in Agaricomycetes, and the Ser/Thr group, which was identified in all fungi, there are 11 families/sub-families that were represented by more than two copies in each fungus. For example, the families/sub-families ratio of Other/AGAK1 was 5.5, indicating that the fungi containing this family carry, on average, more than five copies of Other/AGAK1 genes. Gene duplication is believed to be a major mechanism for establishing new gene functions. These results suggest that the function of some PK families/sub-families may change and play an increasingly important role in the fungal cellular lifecycle.

The 119 families/sub-families vary greatly in their distribution and abundance among fungi (Fig. 3). For example, different members of the families Other/SCY1, CMGC/CK2, STE/STE20/PAKA and STE/STE20/YSK are present in the examined fungi. For the other 13 families/sub-families, such as CMGC/CDKL, AGC/RSK/RSKP90 and STE/STE20/KHS, only a single member was identified in one predicted proteome, similar to the distribution in Arabidopsis thaliana, whereas many copies were identified in human and mouse genomes (Manning et al. 2002b; Caenepeel et al.

[Figure 3: horizontal bar charts, one panel per PK group (AGC, Atypical, CAMK, STE, CK1, Other, CMGC, TKL, PAB, Ser/Thr). y-axis: PK families/sub-families within each group. x-axis: number of fungi (0–59).]

Fig. 3 Distribution of each protein kinase (PK) group among the 59 fungi. The columns show the number of fungi that have
members of the specific kinase group.
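
The two quantities discussed around Fig. 3, namely the number of fungi that carry a given family and the families/sub-families ratio (copies per fungus among the fungi that contain the family), can be sketched as follows. The species names and counts are hypothetical and chosen only so that one family reproduces a ratio of 5.5; `family_distribution` is an illustrative helper, not part of Kinannote.

```python
def family_distribution(kinomes):
    """Map each family to (number of fungi carrying it, mean copies per carrier)."""
    summary = {}
    for families in kinomes.values():
        for family, copies in families.items():
            if copies > 0:
                n_fungi, total = summary.get(family, (0, 0))
                summary[family] = (n_fungi + 1, total + copies)
    return {fam: (n, total / n) for fam, (n, total) in summary.items()}


# Hypothetical family counts per species (illustrative only, not study data).
kinomes = {
    "species_A": {"Other/SCY1": 1, "Other/AGAK1": 6, "CMGC/CK2": 2},
    "species_B": {"Other/SCY1": 1, "Other/AGAK1": 5, "CMGC/CK2": 1},
    "species_C": {"Other/SCY1": 2, "CMGC/CK2": 1},
}

dist = family_distribution(kinomes)
for family, (n_fungi, ratio) in sorted(dist.items()):
    print(f"{family}: present in {n_fungi} fungi, {ratio:.2f} copies per carrier")
```

Because the ratio is an average over carriers only, a value of 5.5 says that carriers hold more than five copies on average, not that every carrier holds at least five.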

2004; Champion et al. 2004).

3.2. Differences between Basidiomycetes and Ascomycetes

The studied fungi, which belong to Basidiomycetes and Ascomycetes, had different types and numbers of PKs. Although the fungi of these two phyla separated long ago, their PKs in some groups are still highly analogous; however, this high homology was not found across all PK groups. Interestingly, PKs in the STE group are more abundant in Basidiomycetes than in Ascomycetes (Fig. 4-A, P<0.01). Similarly, the group percentage of TKL in Basidiomycetes was higher than in Ascomycetes, whereas the group percentages of CMGC and Other PKs in Basidiomycetes were lower than in Ascomycetes (Fig. 4-B, P<0.01). This result may indicate that Basidiomycetes depend less on CMGC and Other PKs during the cell cycle. PK gene duplication in Agaricomycetes may partially explain why the STE group is more abundant in Basidiomycetes than in Ascomycetes. In addition, the decreased group percentage for CMGC and Other PKs in Basidiomycetes compared to Ascomycetes could be partially due to gene amplification of TKL/TKL-CCIN. The analysis indicates that, although Basidiomycetes fungi have bigger genomes than Ascomycetes, the genome differences between these two phyla are not significant in comparison to the kinome size. At the sub-phylum level, only Agaricomycetes had remarkable gene duplication in PKs (Appendix E).

In addition to those already mentioned above, several PK families/sub-families were only observed in either Basidiomycetes or Ascomycetes, but not both. Ten and 13 PK families/sub-families are found only in Ascomycetes and only in Basidiomycetes, respectively (Table 1). It should be noted that some of these families/sub-families are represented by multiple duplicated genes, such as TKL/TKL-CCIN, Other/AGAK1 and TKL/LISK/LISK-DD1 in Basidiomycetes, and Other/NAK/BIKE, AGC/RSK/RSKP70 and Atypical/HISK in Ascomycetes. Interestingly, both TKL/TKL-CCIN and Other/AGAK1 are only found in Agaricomycetes at high numbers. In contrast, TKL/LISK/LISK-DD1, STE/STE20/FRAY and AGC/DMPK, which have more than 10 copies, can be found in most Basidiomycetes. Other/NAK/BIKE and AGC/RSK/RSKP70, which also have more than 10 copies, are found in most Ascomycetes and have no obvious sub-phylum features.

The kinase percentage (the ratio of kinase number to gene number) ranged from 0.5 to 2.5% among different sub-phyla (Fig. 5). For most Taphrinomycetes, the kinase percentage was >2%, greater than all other fungi. Only Taphrina deformans has a kinase percentage of only 1.4%. As few fungal kinase percentages reach the level of Taphrinomycetes, this may serve as a marker for Taphrinomycetes species.
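
The between-phylum comparisons in this section rely on independent-sample t-tests, which the authors ran in SPSS. As a minimal sketch of the underlying statistic, the Python code below computes Student's t with a pooled variance and its degrees of freedom. The two samples are hypothetical STE-group counts invented for illustration, and the pooled (equal-variance) form is an assumption, since the text does not say which variant was used. The P value is not computed here because the Python standard library has no t-distribution function.

```python
from math import sqrt
from statistics import mean, variance


def pooled_t(sample_a, sample_b):
    """Student's two-sample t statistic (pooled variance) and degrees of freedom."""
    na, nb = len(sample_a), len(sample_b)
    df = na + nb - 2
    pooled_var = ((na - 1) * variance(sample_a)
                  + (nb - 1) * variance(sample_b)) / df
    t = (mean(sample_a) - mean(sample_b)) / sqrt(pooled_var * (1 / na + 1 / nb))
    return t, df


# Hypothetical numbers of STE-group kinases per species (not study data).
basidio_ste = [14, 16, 15, 17, 18]
asco_ste = [10, 11, 12, 11, 11]

t_stat, dof = pooled_t(basidio_ste, asco_ste)
print(f"t = {t_stat:.3f}, df = {dof}")
```

The resulting t and degrees of freedom would then be compared against a t distribution (for example in a statistics package) to obtain the P value reported alongside Fig. 4.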

[Figure 4: box plots comparing Basidiomycota and Ascomycota. A, y-axis: number of kinases, STE group. B, y-axis: group percentage (%), TKL, CMGC and Other groups. Box-plot legend: Max., 99%, 95%, 75%, Mean, Median, 25%, 5%, 1%, Min.]
Fig. 4 Difference in protein kinases (PKs) between Ascomycetes and Basidiomycetes. A, the number of PKs in group STE was plotted for Ascomycetes and Basidiomycetes. B, the group percentages of PKs in groups TKL, CMGC and Other were plotted for Ascomycetes and Basidiomycetes, respectively (t-test, P<0.01).

Table 1 Phylum-specific protein kinase families/sub-families
Families                 Basidiomycetes   Ascomycetes
TKL/TKL-CCIN             552              0
Other/AGAK1              33               0
TKL/LISK/LISK-DD1        18               0
STE/STE20/FRAY           17               0
AGC/DMPK                 12               0
AGC/DMPK/GEK             3                0
STE/STE20/KHS            1                0
CK1                      1                0
CAMK/CAMKL/BRSK          1                0
Other/NEK/NEK2           1                0
AGC/RSK/RSKP90           1                0
AGC/DMPK/ROCK            1                0
Atypical/BCR             1                0
Other/NAK/BIKE           0                45
AGC/RSK/RSKP70           0                32
Atypical/HISK            0                5
CMGC/DYRK                0                3
Other/CILIATE-A1         0                1
CMGC/CDKL                0                1
AGC/SGK                  0                1
Other/PLK/SAK            0                1
Atypical/RIO             0                1
Other/NAK/MPSK           0                1
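
The counts of phylum-specific families quoted in the text (13 found only in Basidiomycetes and 10 found only in Ascomycetes) can be re-derived from Table 1. The sketch below transcribes the table into a Python dictionary and partitions the families; the variable names are illustrative assumptions, while the numbers are those given in Table 1.

```python
# Table 1 transcribed as: family -> (copies in Basidiomycetes, copies in Ascomycetes)
TABLE_1 = {
    "TKL/TKL-CCIN": (552, 0), "Other/AGAK1": (33, 0),
    "TKL/LISK/LISK-DD1": (18, 0), "STE/STE20/FRAY": (17, 0),
    "AGC/DMPK": (12, 0), "AGC/DMPK/GEK": (3, 0),
    "STE/STE20/KHS": (1, 0), "CK1": (1, 0),
    "CAMK/CAMKL/BRSK": (1, 0), "Other/NEK/NEK2": (1, 0),
    "AGC/RSK/RSKP90": (1, 0), "AGC/DMPK/ROCK": (1, 0),
    "Atypical/BCR": (1, 0), "Other/NAK/BIKE": (0, 45),
    "AGC/RSK/RSKP70": (0, 32), "Atypical/HISK": (0, 5),
    "CMGC/DYRK": (0, 3), "Other/CILIATE-A1": (0, 1),
    "CMGC/CDKL": (0, 1), "AGC/SGK": (0, 1),
    "Other/PLK/SAK": (0, 1), "Atypical/RIO": (0, 1),
    "Other/NAK/MPSK": (0, 1),
}

# A family is phylum-specific when it has copies in one phylum and none in the other.
basidio_only = [f for f, (b, a) in TABLE_1.items() if b > 0 and a == 0]
asco_only = [f for f, (b, a) in TABLE_1.items() if a > 0 and b == 0]

# Families with more than 10 copies, as singled out in the text.
abundant = [f for f, (b, a) in TABLE_1.items() if max(b, a) > 10]

print(len(basidio_only), "families only in Basidiomycetes")
print(len(asco_only), "families only in Ascomycetes")
print("More than 10 copies:", ", ".join(abundant))
```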

[Figure 5: box plot. y-axis: kinase percentage (%), 0–3.0. x-axis: sub-phyla A–F. Box-plot legend: Max., 99%, 95%, 75%, Mean, Median, 25%, 5%, 1%, Min.]
Fig. 5 Different percentages of protein kinases (PKs) for sub-classification in Ascomycetes and Basidiomycetes. The percentage of PKs was plotted for sub-classifications in Ascomycetes and Basidiomycetes. A, Pucciniomycotina; B, Ustilaginomycotina; C, Agaricomycotina; D, Pezizomycotina; E, Saccharomycotina; F, Taphrinomycotina. The ratio of the kinome in Saccharomycotina was more abundant than other fungi studied.

3.3. Domain distribution of PKs

Protein kinases play an important role in the cell cycle and function in protein interaction networks. In order to establish contact with other proteins and initiate different functions, most protein kinases have other domains in addition to the catalytic domain. In terms of protein kinase sequence, the catalytic domain is frequently tethered to one or more non-kinase domains that are responsible for regulation, substrate specificity, scaffolding, or other functions. In previous studies, researchers focused on both the catalytic and non-kinase domains (Manning et al. 2002a; Kosti et al. 2010), which provides a rationale for searching the putative kinases against the Pfam database in the present study. Kinases were annotated to determine the identity and number of domains flanking the catalytic domain.

Among the 7 557 PKs identified from the studied 59 fungal species, 7 419 significantly matched 10 429 Pfam domains belonging to 181 different domain types. These results expand the study conducted by Kosti et al. (2010), with more kinase sequences in more groups, increasing from an average of 99.2 PK sequences per fungal species to 128 sequences, although the average of 1.4 domains per sequence did not change. Interestingly, the type of domains and distribution of some domains changed greatly. These differences between the two studies could partly be due to the total number of kinase domains examined and changes in the version of Pfam. In the present study, the following 17 domain types had kinase activity: protein kinase domain (7 162 domains), protein tyrosine kinase (581), protein kinase C terminal domain (210), C1 domain (123), RIO1 family (103), kinase associated domain 1 (66), adenylate sensor of SNF1-like protein kinase (61), Ras-binding domain of Byr2 (52), fungal kinase associated domain (44), His kinase A (phospho-acceptor) domain (35), histidine kinase, DNA gyrase B, and HSP90-like ATPase (31), lipopolysaccharide kinase (Kdo/WaaP) family (26), lipopolysaccharide kinase (Kdo/WaaP) family (26), phosphatidylinositol 3- and 4-kinase (25), yeast phosphatidylinositol-4-OH kinase Pik1 (8), PIK domain (2) and ecdysteroid kinase (1). The protein kinase domain was the most common type of kinase domain identified in this study, representing 68.7% of all 10 429 domains identified. Fig. 6-A shows 30 of the most common domain types found in PKs, including 11 of 17 domains that have kinase activity.

Interestingly, the distribution of domains between Basidiomycetes and Ascomycetes is different. There are 3 088 PK sequences with 4 190 domains in Basidiomycetes, while there are 4 469 protein kinase sequences with 6 239 domains in Ascomycetes. Among the 181 different domain types, 67 were only observed in Basidiomycetes and 59 were only observed in Ascomycetes. Only 55 domain types were found in both Basidiomycetes and Ascomycetes. The 16 most common types of kinase domains for each phylum are shown in Fig. 6-B and C. Tyrosine kinase domains are more abundant in Basidiomycetes than in Ascomycetes. In contrast, the protein kinase C terminal domain and ABC1 family (which lack kinase activity) are more abundant in Ascomycetes than in Basidiomycetes. Interestingly, like the prevalence of TKL/TKL-CCIN in Agaricomycetes, tyrosine kinase domains are highly represented in Agaricomycetes and the TKL/TKL-CCIN family. In most cases, the frequency of domains lacking kinase activity found in kinase sequences was very low, and 91 of 181 domain types were only observed once among the 7 557 protein kinase sequences. In contrast, some domain types were found at high frequencies in the present study. Fifteen copies of HET (accession number PF06985) were identified in Ascomycetes, while the alpha/beta hydrolase fold (accession number PF07859) was observed at five copies in Basidiomycetes, the highest number found for

[Figure 6: horizontal bar charts of domain counts. x-axis: number of domains (0–400), with off-scale bars labeled by their counts. A, most common domain types in all fungal kinases (protein kinase domain, 7 192; protein tyrosine kinase, 581). B, protein kinase domain, 2 714; protein tyrosine kinase, 581. C, protein kinase domain, 4 478.]

Fig. 6 Distribution of domains in protein kinases (PKs). A, distribution of the number of domains found in all fungal kinases over
the 10 groups of PKs. B and C, distribution of the number of domains found in Basidiomycetes and Ascomycetes, respectively. Domains with
kinase catalytic activity are colored deep blue, while the others are in green.
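
The totals reported in Section 3.3 can be cross-checked with simple arithmetic. The sketch below uses only numbers quoted in the text; the variable names are illustrative. It shows that the per-phylum sequence and domain counts add up to the overall totals, that the three classes of domain types add up to 181, and that the 68.7% figure is reproduced when the 7 162 protein kinase domains are divided by all 10 429 matched domains.

```python
# Numbers quoted in Section 3.3.
basidio = {"sequences": 3088, "domains": 4190}
asco = {"sequences": 4469, "domains": 6239}
domain_types = {"basidio_only": 67, "asco_only": 59, "shared": 55}
matched_sequences = 7419       # PKs with a significant Pfam match
protein_kinase_domains = 7162  # count given in the text
n_species = 59

total_sequences = basidio["sequences"] + asco["sequences"]
total_domains = basidio["domains"] + asco["domains"]
total_types = sum(domain_types.values())

domains_per_sequence = total_domains / matched_sequences
pk_domain_share = 100.0 * protein_kinase_domains / total_domains
mean_pks_per_species = total_sequences / n_species

print(f"PK sequences: {total_sequences}")
print(f"Pfam domains: {total_domains}")
print(f"Domain types: {total_types}")
print(f"Domains per matched sequence: {domains_per_sequence:.1f}")
print(f"Protein kinase domain share: {pk_domain_share:.1f}%")
print(f"Mean PKs per species: {mean_pks_per_species:.0f}")
```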

any domain in this phylum.

The distribution of kinase associated domains among the 10 PK groups is different. As shown in Fig. 7-A, the CAMK group contains 38 types of domains, making it the most complicated group of all PKs. This indicates that kinase proteins in the CAMK group have a richness of functional domains and a flexibility/expansibility in function. The CK1 group contains only two types of domains: a protein kinase domain and a casein kinase 1 gamma C terminus, which lacks kinase activity. The distribution of the 10 common types of kinase domains among the 10 PK groups is also different. As shown in Fig. 7-B, in most cases, the kinase proteins in the “Other” PK group contain a protein kinase domain and a small domain which has kinase activity. In contrast, kinase proteins in the TKL group contain both a protein kinase domain and a protein tyrosine kinase domain, which takes up almost half of the entire sequence. This is similar to the CAMK group, which contains more domain types, reducing the protein kinase domain percentage (Fig. 7-B). For the AGC and Atypical groups, domains are found exclusively in Basidiomycetes or Ascomycetes, and these two groups are equally represented. In the CAMK group, there were more domains exclusive to Basidiomycetes than Ascomycetes, but this was reversed in the CMGC and Ser/Thr groups. For the TKL and CK1 groups, no such domains were found in Ascomycetes. The PAB group only had common domains found in both Basidiomycetes and Ascomycetes.

3.4. PKs in different nutrition modes and pathogenicity of fungi

Different fungi live in very particular conditions, and PKs are utilized to best adapt to their specific environment. Thus, it is important to uncover differences in fungal life styles, such as methods for obtaining nutrients or invading a host. In the present study, the evolutionary status of each fungal species is inextricably tied to survival. Except Pneumocystis jirovecii and T. deformans, most Saccharomycotina and Taphrinomycotina fungal species are yeasts, which have the same nutritional mode and do not invade hosts. This suggests that the nutritional modes within the same sub-phylum can be compared to determine significant differences based on the same evolutionary status. Thus, nutritional modes and pathogenicity affect the evolutionary status of fungi.

As shown in Fig. 8-A, pathogenic fungi with different hosts (including plant, insect and human pathogens) and yeasts were compared. Interestingly, all fungi with different hosts and yeasts have particular families of kinases. The particular PK families of human pathogenic fungi are Other/CILIATE-A1 and Other/PLK/SAK. The AGC/SGK family is only found in insect pathogenic fungi, including Ajellomyces capsulatus, Cordyceps militaris and Metarhizium anisopliae. Seven PK families, which form the largest group among all studied groups, only appear in plant pathogenic fungi, suggesting that these fungi may have different strategies in adapting to different kinds of plants, such as monocotyledons and dicotyledons. Some fungi from the Saccharomycotina and Agaricomycotina were used as a negative control group to determine the differences between pathogenic and non-pathogenic fungi in the five PK families found in this group.

The differences in PKs between fungi with different nutritional modes were compared (Fig. 8-B). Common PK families were the most prevalent and 84 families were

[Figure 7 image not reproduced. Panels A and B are bar charts over the PK groups Atypical, CAMK, Other, TKL, Ser/Thr, PAB, CMGC, STE, AGC and CK1, with the axis labeled Percentage (%). Domain legend: Protein kinase domain; Protein tyrosine kinase; Protein kinase C terminal domain; C1 domain; RIO1 family; Kinase associated domain 1; Adenylate sensor of SNF1-like protein kinase; Ras-binding domain of Byr2; Fungal kinase associated-1 domain; His kinase A domain.]
Fig. 7 Domain distribution in protein kinase (PK) groups. A, the number of domains found in PK groups. B, the distribution of kinase catalytic domains between PK groups. The x-axis denotes the percentage of genes containing the domains in PK groups.
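
The percentages plotted in Fig. 7 can be illustrated with a minimal sketch. It assumes each kinase is available as a pair of PK group and list of annotated domain names; the function name `domain_percentages` and the toy records are hypothetical and are not data or code from the study.

```python
from collections import defaultdict


def domain_percentages(proteins):
    """Return {group: {domain: percent of proteins in the group carrying it}}.

    `proteins` is an iterable of (pk_group, [domain names]) pairs.
    """
    group_sizes = defaultdict(int)
    domain_counts = defaultdict(lambda: defaultdict(int))
    for group, domains in proteins:
        group_sizes[group] += 1
        for domain in set(domains):  # count a domain once per protein
            domain_counts[group][domain] += 1
    return {
        group: {d: 100.0 * n / group_sizes[group] for d, n in counts.items()}
        for group, counts in domain_counts.items()
    }


# Hypothetical toy annotations, for illustration only.
toy_proteins = [
    ("TKL", ["Protein kinase domain", "Protein tyrosine kinase"]),
    ("TKL", ["Protein tyrosine kinase"]),
    ("AGC", ["Protein kinase domain", "Protein kinase C terminal domain"]),
    ("AGC", ["Protein kinase domain"]),
]
percentages = domain_percentages(toy_proteins)
```

Counting each domain once per protein matches the caption's wording (percentage of genes containing the domain), rather than counting repeated domain copies.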

[Figure 8 image not reproduced. Panel A is a Venn diagram with the sets Insect, Plant, Human and None. Panel B is a Venn diagram with the sets Facultative parasitic, Biotrophic, Hemibiotrophic, Symbiotic and Saprophytic fungi.]
Fig. 8 Venn diagrams of shared protein kinase (PK) families and unique PK families in fungi. A, Venn diagram showing the common and specific PKs among fungi. Three hosts of pathogenic fungi (human, insect, plant) and yeast were used to generate the Venn diagram. The graph reports the numbers of species-specific and common PKs. B, Venn diagram showing the common and specific PKs in five modes of nutrition in fungi. Five types of nutritional modes (saprophytic, facultative parasitic, hemi-biotrophic, biotrophic, and symbiotic fungi) were used to generate the Venn diagram. The graph reports the numbers of species-specific and common PKs.
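
The region counts of a Venn diagram such as Fig. 8 can be derived with plain set operations. The sketch below assumes that each group of fungi has already been reduced to the set of PK families observed in it; the function name `venn_regions` and the toy sets are hypothetical (only the family names Other/PLK/SAK and AGC/SGK are taken from the text, the rest are placeholders).

```python
from itertools import combinations


def venn_regions(family_sets):
    """Map each combination of group names to the PK families found in
    exactly those groups and in no other group."""
    names = sorted(family_sets)
    regions = {}
    for size in range(1, len(names) + 1):
        for combo in combinations(names, size):
            # Families shared by every group in this combination.
            inside = set.intersection(*(family_sets[n] for n in combo))
            # Families seen in any group outside this combination.
            others = [family_sets[n] for n in names if n not in combo]
            outside = set().union(*others)
            exclusive = inside - outside
            if exclusive:
                regions[combo] = exclusive
    return regions


# Hypothetical toy input, for illustration only.
toy_sets = {
    "human": {"shared_family", "Other/PLK/SAK"},
    "insect": {"shared_family", "AGC/SGK", "plant_insect_family"},
    "plant": {"shared_family", "plant_insect_family"},
}
regions = venn_regions(toy_sets)
region_sizes = {combo: len(families) for combo, families in regions.items()}
```

The size of the region keyed by all group names corresponds to the central number of the diagram (the common PK families), and single-name keys give the group-specific families.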

found. Unlike fungi with other nutritional modes, mutualistic symbiotic fungi show no particular PK families. This result confirms that mutualistic symbiotic fungi have lost many functional genes during the symbiosis and that part of the gene regulatory networks is dependent on the host regulation system. In contrast, saprophytic fungi have the largest number of PK families. This suggests that these fungi must be more flexible in order to survive complex environments. More details on the PK families/sub-families in this analysis are included in Appendix F.

3.5. Principal component analysis of fungal kinases

PCA is one of the most useful statistical tools for analyzing multivariate data and has been widely applied to analyze biological data. PCA transforms a number of correlated variables into a smaller number of uncorrelated variables, which are called principal components (PCs), with a minimal loss of information. The reduced number of top-ranked PCs is calculated by projecting samples onto spaces spanned by “eigenvectors” of a sample covariance matrix and selecting the “eigenvectors” that comprise the largest contribution of the sample variation. There are two approaches to perform PCA: “eigenvalue” decomposition (P-mode) or singular value decomposition (Q-mode). The present study utilized the ade4 package of the R language to analyze the data with the “eigenvalue” decomposition (P-mode) method, which uses the covariance relationships between markers, focusing on the differentiation of PK families/sub-families with different evolutionary status. Discovering relationships between fungi of different evolutionary status is a difficult task, especially as the dissimilarities are based on individual elements within the six sub-phyla, which include fungi at the same evolutionary status but with different lifestyles. This is further complicated by difficulties in finding species from all six sub-phyla with similar lifestyles that also have published and sequenced genomes. As a result, the PCA inevitably mirrors both the evolutionary aspects of a genome and conservation of the functional genome.

The methodology for this research is not entirely consistent with previous studies. First, all fungi were grouped by evolutionary status, followed by listing the frequency and type of the most common PK families/sub-families or domains found among all fungal kinomes. Next, PCA was utilized to cluster fungi with PK families/sub-families or domains based on grouping data. Fig. 9 shows the PCA clustering of the different fungi, based on the frequency and type of the 76 most common PK families/sub-families found among all fungal kinomes. This figure is composed of circles and dots, and the area covered by each circle depends on the positions of the dots, suggesting the frequency and type of PK families/sub-families within the group. Each dot stands for one or several PK families/sub-families, which were clustered together in small groups. Where circles intersect with each other, a bias for PK families/sub-families within different groups of fungi is suggested. As illustrated in Fig. 9, the circle of Agaricomycotina covers the largest area and gives a good description of PK gene amplification in this sub-phylum. The two sub-phyla Taphrinomycotina and Saccharomycotina cover areas close to one another, providing a good indication that most of the fungi in these two sub-phyla are yeasts and have a close evolutionary status and similar lifestyles. It should be noted that all of the circles are consistent with evolutionary status, and the discontinuous distribution suggests that the evolution of the PKs is complicated and associated with lifestyle.

[Figure 9 image not reproduced. Axes: PC1 value and PC2 value; circles labeled A to F.]

Fig. 9 Principal component analysis (PCA) result for protein kinases (PKs) from 59 fungi in six sub-phyla. A total of 76 of 119 PK families, which could be found in at least half of these fungi, were used to generate the PCA figure. Each dot represents one cluster of PKs. The colors for the sub-phyla Pucciniomycotina (A), Ustilaginomycotina (B), Agaricomycotina (C), Pezizomycotina (D), Saccharomycotina (E), and Taphrinomycotina (F) are black, red, green, navy blue, light blue, and pink, respectively.

In parallel, 26 of the most common domains were analyzed (Fig. 10). This result differs from that of the PK families/sub-families: because this figure was drawn using common domains, the areas of the circles intersect each other more steadily and continuously, except for the areas of Taphrinomycotina and Saccharomycotina, which are irregularly distributed. This does not reflect a clear correlation between domain distributions and taxonomic classification, while also reflecting the divergence between main functional domains and associated domains, which have different roles in natural selection.

In contrast, when the PK families/sub-families for fungi from different nutritional modes and pathogenicity were analyzed by PCA, 48 of the 59 fungi were clearly differentiated for the five nutritional modes: saprophytic, facultative parasitic, hemi-biotrophic, biotrophic, and mutualistic symbiotic fungi (Appendix G). As a result, the PK composition for biotrophic fungi was remarkably consistent with that of the symbiotic fungi, as most of the areas coincided. Although the hemi-biotrophic and biotrophic fungi have similar stages in their lifecycles, there is a difference in the PK composition between these two kinds of fungi. The largest area covered by saprophytic fungi may suggest that this kind of fungi has a complete PK composition compared with the four other nutritional modes. Clustering the PK families/sub-families of pathogenic fungi with different hosts (plant, insect and human) and yeasts revealed that the PK compositions of plant pathogenic fungi and insect pathogenic fungi are almost identical, as all of the areas covered by insect pathogenic fungi were surrounded by those of plant pathogenic fungi (Appendix H). Human pathogenic fungi are very different from plant pathogenic and insect pathogenic fungi. The greatest area covered by yeasts, which do not have a host, suggests that there is a big difference in PK composition between pathogenic and non-pathogenic fungi.
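
The two steps described in this section, keeping the families found in at least half of the fungi and then running a PCA by eigenvalue decomposition of the sample covariance matrix, can be sketched as follows. The study itself used the ade4 package in R; this is a swapped-in NumPy illustration. The function names `prevalent_columns` and `pca_eig`, the layout of the count matrix (rows as fungi, columns as PK families) and the toy values are all assumptions, not the authors' code or data.

```python
import numpy as np


def prevalent_columns(counts, min_fraction=0.5):
    """Indices of columns (families) present in at least `min_fraction`
    of the rows (fungi); 'present' means a count above zero."""
    counts = np.asarray(counts)
    presence = (counts > 0).mean(axis=0)
    return np.flatnonzero(presence >= min_fraction)


def pca_eig(data, n_components=2):
    """PCA by eigen-decomposition of the sample covariance matrix.

    Expects at least two columns. Returns the projected scores, the
    selected eigenvectors and the fraction of variance each explains.
    """
    data = np.asarray(data, dtype=float)
    centered = data - data.mean(axis=0)
    cov = np.cov(centered, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)  # ascending eigenvalues
    order = np.argsort(eigvals)[::-1][:n_components]
    components = eigvecs[:, order]
    scores = centered @ components
    explained = eigvals[order] / eigvals.sum()
    return scores, components, explained


# Hypothetical toy counts: 4 fungi (rows) x 4 PK families (columns).
toy_counts = np.array([
    [1, 2, 0, 0],
    [2, 4, 1, 0],
    [3, 6, 0, 0],
    [4, 8, 1, 3],
])
kept = prevalent_columns(toy_counts)           # last family is dropped
scores, components, explained = pca_eig(toy_counts[:, kept])
```

Plotting the first two columns of `scores` against each other gives a PC1 versus PC2 scatter of the kind shown in Figs. 9 and 10; grouping and circle drawing are left out of this sketch.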

[Figure 10 image not reproduced. Axes: PC1 value and PC2 value; circles labeled A to F.]

Fig. 10 Principal component analysis (PCA) result for associated domains from 59 fungi in six sub-phyla. A total of 26 associated domains, which can be found in at least half of these fungi, were used to generate the PCA figure. Each dot represents one cluster of PKs. The colors for the sub-phyla Pucciniomycotina (A), Ustilaginomycotina (B), Agaricomycotina (C), Pezizomycotina (D), Saccharomycotina (E), and Taphrinomycotina (F) are black, red, green, navy blue, light blue and pink, respectively.

4. Conclusion

We systematically identified PKs from the predicted proteomes of 59 representative fungi belonging to Ascomycota and Basidiomycota. Comparative analysis revealed that fungi from Ascomycota and Basidiomycota exhibited a high diversity in the number and variety of PKs. Two PK groups, CMGC and Other, were the most prevalent, followed by AGC, CAMK, STE and Ser/Thr, whereas CK1, PAB and TKL were the least populated groups. On the other hand, the STE group was more abundant in Basidiomycetes than in Ascomycetes. Importantly, the distribution of some PK families appeared to be phylum-specific. Ten families/sub-families were found only in Ascomycetes, and 13 families/sub-families were found only in Basidiomycota. More interestingly, the TKL/TKL-CCIN group had remarkable gene duplication in many Agaricomycotina species and this appeared to be subphylum-specific. Although there were different kinase percentages among the six sub-phyla, this is not an ideal method for classifying sub-phyla, but in most cases it should be a practical method for identifying Taphrinomycetes. In addition, the distribution of accessory domains, which may have functional implications, has a usage bias that varies between the two phyla. The protein kinase Tyr, which has kinase catalytic activity, is highly represented in Agaricomycotina. The PCA demonstrates that there is a divergence between the main functional domains and associated domains, and that the two parts in the PK sequences have different roles in natural selection, allowing for more selection and choice with the associated

domains. Furthermore, the relationships between different classifications of fungi, such as evolutionary status, nutritional mode and pathogenicity, indicate that the distribution of PK families is influenced by both evolutionary status and by fungal lifestyle. Thus, the present study provides insights into the variety and expansion of PK families, which will be useful in understanding the differences in regulatory mechanisms among different species of fungi.

Acknowledgements

This work was supported by the National Science & Technology Pillar Program of China during the Twelfth Five-Year Plan period (2012BAD19B04), the National Natural Science Foundation of China (31371924), the 111 Project from the Ministry of Education of China (B07049), and the National Basic Research Program of China (2013CB127700). We thank Prof. Liu Huiquan, College of Plant Protection, Northwest A&F University, China for helpful discussion.

Appendices associated with this paper can be available on http://www.ChinaAgriSci.com/V2/En/appendix.htm

References

Becker W, Joost H G. 1998. Structural and functional characteristics of Dyrk, a novel subfamily of protein kinases with dual specificity. Progress in Nucleic Acid Research and Molecular Biology, 62, 1–17.
Caenepeel S, Charydczak G, Sudarsanam S, Hunter T, Manning G. 2004. The mouse kinome: Discovery and comparative genomics of all mouse protein kinases. Proceedings of the National Academy of Sciences of the United States of America, 101, 11707–11712.
Champion A, Kreis M, Mockaitis K, Picaud A, Henry Y. 2004. Arabidopsis kinome: After the casting. Functional & Integrative Genomics, 4, 163–187.
Cohen P. 2000. The regulation of protein function by multisite phosphorylation - A 25 year update. Trends in Biochemical Sciences, 25, 596–601.
Dray S, Dufour A B. 2007. The ade4 package: Implementing the duality diagram for ecologists. Journal of Statistical Software, 22, 1–20.
Goldberg J M, Griggs A, Smith J L, Haas B, Wortman J, Zeng Q. 2013. Kinannote, a computer program to identify and classify members of the eukaryotic protein kinase superfamily. Bioinformatics, 29, 2387–2394.
Hanks S K. 2003. Genomic analysis of the eukaryotic protein kinase superfamily: A perspective. Genome Biology, 4, 111.
Hanks S K, Hunter T. 1995. Protein kinases 6. The eukaryotic protein kinase superfamily: Kinase (catalytic) domain structure and classification. The FASEB Journal, 9, 576–596.
Hibbett D S, Binder M, Bischoff J F, Blackwell M, Cannon P F, Eriksson O E, Huhndorf S, James T, Kirk P M, Lücking R. 2007. A higher-level phylogenetic classification of the fungi. Mycological Research, 111, 509–547.
Jones P, Binns D, Chang H Y, Fraser M, Li W, McAnulla C, McWilliam H, Maslen J, Mitchell A, Nuka G, Pesseat S, Quinn A F, Sangrador-Vegas A, Scheremetjew M, Yong S Y, Lopez R, Hunter S. 2014. InterProScan 5: Genome-scale protein function classification. Bioinformatics, 30, 1236–1240.
Kosti I, Mandel-Gutfreund Y, Glaser F, Horwitz B A. 2010. Comparative analysis of fungal protein kinases and associated domains. BMC Genomics, 11, 133.
Krupa A, Abhinandan K, Srinivasan N. 2004. KinG: A database of protein kinases in genomes. Nucleic Acids Research, 32, D153–D155.
LaRonde-LeBlanc N, Wlodawer A. 2005. The RIO kinases: An atypical protein kinase family required for ribosome biogenesis and cell cycle progression. Biochimica et Biophysica Acta (BBA: Proteins and Proteomics), 1754, 14–24.
Manning G, Plowman G D, Hunter T, Sudarsanam S. 2002a. Evolution of protein kinase signaling from yeast to man. Trends in Biochemical Sciences, 27, 514–520.
Manning G, Whyte D B, Martinez R, Hunter T, Sudarsanam S. 2002b. The protein kinase complement of the human genome. Science, 298, 1912–1934.
Martin D M, Miranda-Saavedra D, Barton G J. 2009. Kinomer v. 1.0: A database of systematically classified eukaryotic protein kinases. Nucleic Acids Research, 37, D244–D250.
Mclaughlin D J, Hibbett D S, Lutzoni F, Spatafora J W, Vilgalys R. 2009. The search for the fungal tree of life. Trends in Microbiology, 17, 488.
Scheeff E D, Bourne P E. 2005. Structural evolution of the protein kinase-like superfamily. PLoS Computational Biology, 1, e49.
Stajich J E, Berbee M L, Blackwell M, Hibbett D S, James T Y, Spatafora J W, Taylor J W. 2009. The fungi. Current Biology, 19, R840–R845.
Taylor J W, Berbee M L. 2006. Dating divergences in the fungal tree of life: Review and new analyses. Mycologia, 98, 838–849.
Wang H, Guo S, Huang M, Thorsten L H, Wei J. 2010. Ascomycota has a faster evolutionary rate and higher species diversity than Basidiomycota. Science China (Life Sciences), 53, 1163–1169.
Wuichet K, Cantwell B J, Zhulin I B. 2010. Evolution and phyletic distribution of two-component signal transduction systems. Current Opinion in Microbiology, 13, 219–225.
Zhao Z, Liu H, Wang C, Xu J R. 2013. Comparative analysis of fungal genomes reveals different plant cell wall degrading capacity in fungi. BMC Genomics, 14, 274.

Section editor WAN Fang-hao


Managing editor ZHANG Juan
