Preprint Article Version 2 Preserved in Portico This version is not peer-reviewed

Genome-Wide Identification and Analysis of Cell Cycle Genes in Betula pendula

Version 1 : Received: 8 December 2021 / Approved: 9 December 2021 / Online: 9 December 2021 (10:38:21 CET)
Version 2 : Received: 22 December 2021 / Approved: 23 December 2021 / Online: 23 December 2021 (11:34:00 CET)

How to cite: Li, Y.; Chen, S.; Liu, Y.; Huang, H. Genome-Wide Identification and Analysis of Cell Cycle Genes in Betula pendula. Preprints 2021, 2021120149. https://doi.org/10.20944/preprints202112.0149.v2 Li, Y.; Chen, S.; Liu, Y.; Huang, H. Genome-Wide Identification and Analysis of Cell Cycle Genes in Betula pendula. Preprints 2021, 2021120149. https://doi.org/10.20944/preprints202112.0149.v2

Abstract

Research Highlights: This study identified the cell cycle genes in birch that likely play important roles during plant growth and development. This analysis provides a basis for understanding the regulatory mechanism of various cell cycles in Betula pendula. Background and Objectives: The cell cycle factors not only influence cell cycle progression together, but also regulate accretion, division and differentiation of cells, and then regulate growth and development of plant. In this study, we identified the putative cell cycle genes in B. pendula genome, based on the annotated cell cycle genes in A. thaliana. It could serve as a foundation for further functional studies. Materials and Methods: The transcript abundance was determined for all the cell cycle genes in xylem, root, leaf and flower tissues using RNA-seq technology. Results: We identified 59 cell cycle gene models in the genome of B. pendula, 17 highly expression genes among them. These genes were BpCDKA.1, BpCDKB1.1, BpCDKB2.1, BpCKS1.2, BpCYCB1.1, BpCYCB1.2, BpCYCB2.1, BpCYCD3.1, BpCYCD3.5, BpDEL1, BpDpa2, BpE2Fa, BpE2Fb, BpKRP1, BpKRP2, BpRb1 and BpWEE1. Conclusions: We identified 17 core cell cycle genes in the genome of birch by combining phylogenetic analysis and tissue specific expression data.

Keywords

Betula pendula; cell cycle; Cyclin; RNA-seq

Subject

Biology and Life Sciences, Forestry

Comments (1)

Comment 1
Received: 23 December 2021
Commenter: yijie li
Commenter's Conflict of Interests: Author
Comment: ArticleGenome-Wide Identification and Analysis of Cell Cycle Genes in Betula pendulaCitation: , A. Genome-Wide Identification and Analysis of Cell Cycle Genes in Betula pendula. Forests 2021, 12, x. https://doi.org/10.3390/xxxxxAcademic Editor: Received: Accepted: Published: datePublisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.Copyright: © 2021 by the authors. Submitted for possible open access publication under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).Yijie Li 1, Song Chen 1, Yuhang Liu 1, Haijiao Huang 1*1    State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin, China*   Correspondence: Haijiao Huang, e-mail: haijiao_sea@163.comAbstract: Research Highlights: This study identified the cell cycle genes in birch that likely play important roles during plant growth and development. This analysis provides a basis for understanding the regulatory mechanism of various cell cycles in Betula pendula. Background and Objectives: The cell cycle factors not only influence cell cycle progression together, but also regulate accretion, division and differentiation of cells, and then regulate growth and development of plant. In this study, we identified the putative cell cycle genes in B. pendula genome, based on the annotated cell cycle genes in A. thaliana. It could serve as a foundation for further functional studies. Materials and Methods: The transcript abundance was determined for all the cell cycle genes in xylem, root, leaf and flower tissues using RNA-seq technology. Results: We identified 59 cell cycle gene models in the genome of B. pendula, 17 highly expression genes among them. These genes were BpCDKA.1, BpCDKB1.1, BpCDKB2.1, BpCKS1.2, BpCYCB1.1, BpCYCB1.2, BpCYCB2.1, BpCYCD3.1, BpCYCD3.5, BpDEL1, BpDpa2, BpE2Fa, BpE2Fb, BpKRP1, BpKRP2, BpRb1 and BpWEE1. Conclusions: We identified 17 core cell cycle genes in the genome of birch by combining phylogenetic analysis and tissue specific expression data.Keywords: Betula pendula; cell cycle; Cyclin; RNA-seq  IntroductionMany important life processes are closely related to mitosis in higher organisms. The regulation mechanism of eukaryotic cell division cycle is one of the hot topics in cell biology and molecular biology. Research on the regulation of plant cell cycle started later than that of mammals and yeast. Great progress has been made in the research of cell cycle in higher plant in recent years [1-4]. The progression of cell cycle is the result of interaction between the gene expression and the external factors. The cell cycle in higher plant is strictly regulated in the course of its growth and development.The concept of cell cycle was brought forward by Howard and Pelcin 1953 [5], which was divided into the intermitotic phase (G1, S, and G2) and mitotic phase (M). Growth and development of plant depend on accretion, division and differentiation of cells, while cell cycle involved into these processes. Recent studies have shown that, during regulation of hormone, nutriment substance and other growth signals, Cyclin D (CYCD) was expressed first, and binds to cyclin dependent kinase A (CDKA) to form a complex. The complex is activated by the action of CDK activating kinase (CAK) and cyclin-dependent kinase inhibitor (CKI) or KIP-related proteins (KRPs). The activated complex attenuates the inhibitory effect of retino-blastoma protein-related (RBR) and E2F (E2 factor) a-b/DP through phosphorylation, and release transcript factor E2Fa-b/DP [6]. While E2F/DPs could promote the expression of genes required for G1 conversion to S phase (DNA synthesis phase). After entering the S phase, CYCA binds to CDKA, and it was combined with CDK subgroup cyclin-dependent kinase subunit (CKS) and CYCB synthesized during the development to G2 phase. To remove the inhibitory phosphate group from the tyrosine phosphatase, activate the CDKB, and enter the M phase. At the end of M phase, cyclin proteins are hydrolyzed through the anaphase promoting complex (APC) protein pathway, and exit the mitosis. A whole cell cycle is completed [7,8].Since the cell cyclins have been found in sea urchins by Hunt in the 1980s [9], tremendous advances have been made in the molecular mechanisms of the cell cycle. This provides a positive direction for the study of tumors and other physiological diseases caused by cell cycle regulation [10]. The most significant molecular structure feature of cyclin is its conserved domain sequence, known as cyclin box, which consists of about 100 amino acid residues. The cyclin framework is the core structure of cyclin. During the cell cycle, specific cyclins rely on their own unique cyclin frames to recognize specific cyclin-dependent kinase (CDK), and form a complex with it, thus showing specific CDK kinase activity [11]. Many different cyclins have been found, which have different expression patterns in different organs, tissues, and cell types of various organisms [12].Betula pendula is a pioneer boreal tree that can be induced to flower within one year [13,14]. It is one of the tree species with important application value and development potential in northeast of China. As an important timber tree, it can help us understand how cell cycle genes regulate the growth and development of birch, which will greatly contribute to the application of B. pendula in industrial production and ornamental as-pects. Fortunately, the genome sequence of birch [13] has become available in the last few years, which can help us to accurately identify the genes related to cell cycle. In this study, we identified cell cycle genes that likely play a very important role during plant growth and development. This provides a basis for understanding the expression processes and regulatory mechanism of various cell cycles in B. pendula, and may serve as a foundation for further functional studies. Materials and Methods 2.1. Identification of B. pendula cell cycle genes and physical and chemical properties analysisThe B. pendula genome was used for the identification of the cell cycle genes according to the previous publication [15]. We downloaded the genomic information and protein sequences of B. pendula form the Phytozome database (https://phytozome.jgi.doe.gov/pz/portal.html) and the protein sequence of A. thaliana cell cycle gene family members from the TAIR (https://www.arabidopsis.org/) database. The identification of the cell cycle genes of B. pendula was performed using the BLASTP [16] program to search (E value is set to 1e-5). In addition, all the genes were further manually examined using the Conserved Domain Database of NCBI [17] to confirm if they were correctly annotated. We then divided them into eight subgroups based on their functional type in A. thaliana. Then, we used ExPASy-ProtParam Tool (http://web.expasy.org/protparam/) to determine the physical and chemical parameters of  the cell cycle genes, including the number of amino acids, molecular weight and isoelectric point (pI).2.2. Chromosome distribution of the B. pendula cell cycle genesAccording to the starting position of B. pendula cell cycle genes on the birch chromosomes, the chromosome distribution of 59 cell cycle genes was analyzed, and the chromosome position image of B. pendula cell cycle gene was determined using the TBtools software.2.3. Phylogenetic analyses of B. pendula cell cycle genes, Gene structure and Conserved sequence and specific motif analysisTo investigate the phylogenetic relationships of the cell cycle genes of B. pendula, a phylogenetic tree was constructed for each subgroup according to the previous publication [18]. We performed a multiple sequence alignment. Then, the phylogenetic trees of each subgroup were built using MEGA 5.05 with 500 bootstrap trials. Representative trees were selected using the Neighbor-Joining method.In order to understand the structural diversity of B. pendula cell cycle genes, we performed exon/intron analysis. In order to understand the functional regions of birch cell cycle proteins and analyze the structural differences of birch cell cycle genes. We used the online software MEME (Multiple Em for Motif Elicitation, Version 5.4.1, http://meme-suite.org/tools/meme) to analyze the conserved amino acid motifs of B. pendula cyclin. TBtools was used to analyze conserved amino acid motifs. The CDS sequence of Betula pendula was extracted from the genomic structure information of the genome (https://phytozome-next.jgi.doe.gov/report/gene/Bplatyphylla_v1_1), and its intron and exon structure were visualized with TBtools.2.4. RNA-seq expression analysis of B. pendula cell cycle genesTo investigate the expression patterns of B. pendula cell cycle genes in different tissues, transcriptome data (PRJNA535361) was downloaded from [15] the public database of NCBI SRA. The clean reads of each sample were obtained by filtering out reads of low quality and the low quality reads was filtered using fastp. All the clean reads were aligned to the B. pendula reference genome using bowtie2. The RNA-seq (RNA-sequencing) data were then analyzed using the RSEM (RNA-seq by Expectation-Maximization) pipeline [19] and the data were processed using a paired-end sequencing mode. The number of RNA-seq fragments corresponding to each gene were estimated and normalized to TPM (transcripts per kilobase million) value. The expression profiles of the cell cycle genes were shown as Log2(TPM+1) conversion value, and the heat map was constructed by TBtools. Results.3.1. Identification of Betula pendula cell cycle genes and physical and chemical properties analysisThe annotated genes in B. pendula genome were used to identify putative cell cycle genes, based on the annotated cell cycle genes in A. thaliana. In total, 59 gene models (Table 1) were identified as putative cell cycle genes in B. pendula genome. The 59 genes contain 15 cyclin-dependent kinases (CDKs), 2 cyclin-dependent kinase subunit (CKSs), 27 Cyclins (CYCs), 3 E2 factor (E2Fs), 2DPs, 2 DP-E2F-like (DELs), 4 KIP-related proteins (KRPs), 2 Rbs, and 2 WEEs, respectively. Among these cell cycle genes, CYC is the largest family that contains 27 members, while CKSDELRb and WEE are all the smallest families containing only two members. Rb and WEE are also the smallest families in A. thaliana containing only one member. Analysis of protein characteristics showed that the size of the cell cycle gene protein ranges from 69 amino acids (Bpev01.c0457.g0045) to 1316 amino acids (Bpev01.c1113.g0001), and the relative molecular mass ranges from 7 kDa to 14 kDa. The predicted isoelectric point also varies greatly from 4.42 (Bpev01.c0579.g0010) to 9.69 (Bpev01.c1061.g0010), which indicates that different cyclins may work in different microenvironments. The detailed information of the protein molecular weight, isoelectric point and amino acid number of the gene family are shown in Table 1.Table 1. Putative cell cycle genes in Betula pendula.Gene familyGene nameGene ID Deduced number of amino acidsMolecularweight (Da)Isoelectricpoint (pI)InstabilityindexGrand average ofhydropathicity        CDKCDKA.1Bpev01.c0957.g001329533777.936.4239.45-0.247CDKB1.1Bpev01.c0224.g001330534519.948.1630.49-0.272CDKB2.1Bpev01.c0480.g005831936190.129.0430.26-0.297CDKC1.1Bpev01.c0000.g017951557319.579.2244.26-0.810CDKC1.2Bpev01.c0275.g005664971959.859.1147.61-0.579CDKC1.3Bpev01.c0344.g001272180003.759.2847.51-0.657CDKC1.4Bpev01.c0349.g003169877679.089.3043.51-0.563CDKC1.5Bpev01.c0420.g001956362760.249.3654.61-0.681CDKC1.6Bpev01.c0745.g000571179499.189.3051.47-0.664CDKC1.7Bpev01.c1061.g001071179560.559.6948.78-0.634CDKC1.8Bpev01.c1202.g005356863441.509.6351.02-0.575CDKD.1Bpev01.c1443.g000241546691.889.3636.70-0.391CDKE1.1Bpev01.c0263.g001211112348.096.0334.85-0.374CDKE1.2Bpev01.c0390.g001547853271.819.3041.51-0.461CDKF.1Bpev01.c0389.g005647453297.394.5153.09-0.434CyclinsCYCA1.1Bpev01.c0118.g002949856182.498.1749.57-0.364CYCA1.2Bpev01.c0706.g000523827110.955.3552.20-0.202CYCA1.3Bpev01.c1588.g000449354391.906.4356.98-0.220CYCA2.1Bpev01.c0167.g000652159705.008.9948.34-0.263CYCA2.2Bpev01.c0207.g001049155055.528.6346.18-0.243CYCA2.3Bpev01.c1398.g001236541875.905.2061.96-0.336CYCA2.4Bpev01.c1588.g000551456762.038.1946.44-0.234CYCA3.1Bpev01.c1764.g000136140479.129.2939.11-0.247CYCA3.2Bpev01.c1028.g000138143109.828.8343.20-0.355CYCB1.1Bpev01.c1009.g000845950545.599.0038.21-0.207CYCB1.2Bpev01.c0645.g003342747430.698.7350.72-0.264CYCB2.1Bpev01.c0022.g012943549791.135.3950.14-0.365CYCB2.2Bpev01.c0455.g001139445186.844.8246.83-0.117CYCB2.3Bpev01.c0134.g010443549391.855.6348.64-0.269CYCB3.1Bpev01.c1259.g001322126057.576.3932.870.011CYCD1.1Bpev01.c0848.g004232536316.285.3161.70-0.215CYCD3.1Bpev01.c0157.g001938243607.705.1962.70-0.238CYCD3.2Bpev01.c0506.g001312813598.629.3071.590.009CYCD3.3Bpev01.c0106.g001314114557.409.1089.39-0.343CYCD3.4Bpev01.c0229.g003114014728.157.8958.460.184CYCD3.5Bpev01.c0015.g005437442291.385.0864.20-0.111CYCD3.6Bpev01.c0640.g002037442444.175.2252.89-0.295CYCD4.1Bpev01.c0018.g005535239061.575.2648.70-0.080CYCD4.2Bpev01.c0645.g002529032331.386.6649.71-0.004CYCD6.1Bpev01.c0469.g000930935275.726.0344.03-0.081CYCD6.2Bpev01.c1653.g000435240349.929.2753.390.023CYCH.1Bpev01.c1947.g000652059565.088.4040.98-0.418CKSCKS1.1Bpev01.c1113.g00011316148157.796.7047.53-0.523CKS1.2Bpev01.c1602.g00088610264.609.0563.75-0.981RbRb1Bpev01.c0457.g00451019112457.117.2851.61-0.232Rb2Bpev01.c2803.g0002697110.275.0525.090.375E2F/DPE2FaBpev01.c0105.g001247351575.735.1049.59-0.595E2FbBpev01.c2596.g000247552376.334.8450.61-0.692E2FcBpev01.c0214.g003345651109.765.6155.05-0.807 DPa1Bpev01.c0423.g000334638243.675.6260.94-0.758Dpa2Bpev01.c0427.g001374884137.369.2640.81-0.288DELDEL1Bpev01.c0813.g001137742243.328.8041.91-0.693DEL2Bpev01.c0094.g005335139730.688.6447.44-0.721RPKRP1Bpev01.c0000.g009724527423.726.7660.33-0.822KRP2Bpev01.c0016.g006924226897.627.8453.03-1.146KRP3Bpev01.c2423.g000318320002.405.5553.69-0.507KRP4Bpev01.c0027.g018120923217.595.3678.83-0.880WEEWEE1Bpev01.c0579.g000449855758.406.7452.91-0.446WEE2Bpev01.c0579.g00109710666.774.4252.08-0.464 3.2. Chromosome distribution of cell cycle genes in B. pendulaBased on the genomic information of B. pendula, the chromosomal distribution of the 59 B. pendula cell cycle genes was analyzed. According to chromosome location analysis, these cell cycle genes are unevenly distributed on the 14 chromosomes of B. pendula (Figure 1). Chromosome 11 contains the most cell cycle genes (9), followed by chromosome 6 (8). There are 6 cell cycle genes on chromosomes 1 and 3, and only 1 cell cycle gene on chromosomes 2, 8 and 12.Figure 1. Chromosome distribution of B. pendula cell cycle genes members in birch.3.3. Identification and analysis of cyclin dependent kinases (CDK) gene family Members of Betula platyphyllaThere are many regulators of cell cycle in plants, most of them have special serine/threonine protein kinase activity, because they bind to cyclins to function, and are named as cyclin dependent kinases (CDKs). According to their structural and functional similarities with animal and yeast CDKs and their conserved PSTAIRE domains that bind to cyclins, plant CDKs were divided into 8 groups: CDKA, CDKB, CDKC, CDKD, CDKE, CDKF, CDKG and CDKLIKE [4,20]. In this study, we identified 5 groups of CDKs: BpCDKA, BpCDKB, BpCDKC, BpCDKD, BpCDKE and BpCDKF. CDKA.1 plays a key role in the process of leaf cell division and differentiation and the development of leaf [21]. CDKB1.1 can prolong hypocotyl cells, promote cotyledon cell development, and regulate stomatal development of Arabidopsis thaliana [2,22]. The mutation of CDKB2 has been shown to impact meristem seriously [23].We identified 15 BpCDKs in the B. pendula genome. A phylogenetic tree was constructed for the BpCDKs (Figure 2a) to reveal the evolutionary relationships within these groups. Seven different conserved domains and special motifs of BpCDKs protein were identified using MEME tool (Figure 2c). All the BpCDKs proteins contain at least one conserved amino acid motif. For example, BpCDKE1.1 only contains motif 2, while the rest of BpCDKs proteins contain 1, 2, and 3 conserved amino acid motifs. The conserved motifs of each BpCDKs protein branch are similar in composition, indicating that these members have a close evolutionary relationship [24]. In addition, most members of the BpCDKs protein contain motif 1, motif 2, motif 3, and motif6, these conservative motifs may have an important influence on the function of BpCDKs protein. The gene structure helps to further understand the gene family. In the BpCDK family, there are at most 13 introns (BpCDKC1.1 and BpCDKE1.2), and at least one intron (BpCDKC1.8 and BpCDKE1.1). Most genes in the BpCDKs family contain 7-8 introns (Figure 2b), and the fact that most members of the same subfamily share a similar exon/intron structure strengthens the observed phylogenetic distribution.Figure 2. Phylogenetic analysis; exon/Intron genomic structure and protein motif organization of CDK in B. pendula. An unrooted phylogenetic tree was constructed using MEGA5.05 by the neighbor-joining method. Gene structure of the corresponding BpCDKs genes, TBtools software was used to visualize gene structure. The yellow boxes represent exons and grey lines represent introns. Use MEME Web server to analyze the distribution of conserved motifs in BpCDKs protein. The protein motif figure of BpCDKs was constructed by TBtools software. (a) Phylogenetic analysis of BpCDKs; (b) Exon/Intron genomic structure of BpCDKs; (c) Protein motif organization of BpCDKs.3.4. Identification and analysis of cyclins (CYC) gene family Members of Betula platyphyllaMonomeric CDKs have no kinase activity and must associate with regulatory proteins called cyclins to be activated. There is common molecular structure among various cyclins, which contain a rather conservative amino acid sequence called cyclin frame to mediate the binding to CDK and regulate the activity of CDK. In plant, cyclins can be grouped into M-cyclin (containing A- and B-type cyclins) and G1- specific cyclins (designated D-type cyclins). C-cyclin and H-cyclin have been confirmed, and only CYCH.1 could activate CDK [25].All four types of cyclins known in plants were identified. A total of 27 BpCYCs genes were detected in the B. pendula genome, including nine A-type, six B-type, eleven D- type, and one H- type. An evolutionary tree was built for BpCYCs. The MEME tool was used to identify five different conserved amino acid motifs of the CYC protein (Figure 3c). All BpCYCs proteins contain at least one conserved amino acid motif. For example, BpCYCD3.4, BpCYCD3.2, and BpCYCD3.3 only contain motif 2, BpCYCA1.2 only contains motif 3, and most of the other BpCYCs proteins contains 1, 2, 3, and 4 conservative amino acid motifs, indicating that these motifs may have an important influence on the function of BpCYCs protein. It can be seen from Figure 3b that the BpCYCs family has a similar intron structure (Figure 3b). The intron-exon organization of the BpCYCs family is similar to that of Arabidopsis, this indicates that CYC is highly conserved in plants in an evolutionary manner.Figure 3. Phylogenetic analysis; exon/Intron genomic structure and protein motif organization of CYC in B. pendula. An unrooted phylogenetic tree was constructed using MEGA5.05 by the neighbor-joining method. Gene structure of the corresponding BpCYCs genes, TBtools software was used to visualize gene structure. The yellow boxes represent exons and grey lines represent introns. Use MEME Web server to analyze the distribution of conserved motifs in BpCYCs protein. The protein motif figure of BpCYCs was constructed by TBtools software. (a) Phylogenetic analysis of BpCYCs; (b) Exon/Intron genomic structure of BpCYCs; (c) Protein motif organization of BpCYCs.3.5. Identification and analysis of cyclin dependent kinases subunit (CKS) gene family Members of Betula platyphyllaCDK subunit (CKS) proteins act as docking factors that mediate the interaction of CDKs with putative substrates and regulatory proteins. There are two CDK subunit genes in Arabidopsis described previously [4]. In this study, we identified two BpCKSs in the B. pendula genome. It can be seen that these two genes have the same motif, but their gene structures are quite different.Figure 4. Exon/Intron genomic structure and protein motif organization of CKS in B. pendula. Gene structure of the corresponding BpCKSs genes, TBtools software was used to visualize gene structure. The yellow boxes represent exons and grey lines represent introns. Use MEME Web server to analyze the distribution of conserved motifs in BpCKSs protein. The protein motif figure of BpCKSs was constructed by TBtools software. (a) Exon/Intron genomic structure of BpCKSs; (b) Protein motif organization of BpCKSs.3.6. Identification and analysis of Rb and ubiquitin-conjugating enzyme factor and DP (E2F/DP) gene family Members of Betula platyphyllaRb regulates the expression of many essential genes in cell cycle progression by regulating the activity of E2F transcription factor. Only one Rb could be identified in the Arabidopsis genome [4]. We identified two BpRbs in the B. pendula genome. E2F transcription factors, composed of E2F and DP, play a decisive role in plant cell size control [26].We identified three BpE2Fs and towBpDPs in the B. pendula genome. Two DP-E2F-like (DEL) were identified in the B. pendula genome, because they form a distinct class. A phylogenetic tree was generated for these genes, which contains for groups (Figure 5a). Through the analysis of conservative motifs, it can be seen that both E2F and DP families contain conservative motif 1 (Figure 5c), indicating that conservative motif 1 is highly conserved during evolution. Except for BpRb2 and BpDPa2, both intron and exon structures contain highly similar and numerous introns (Figure 5b). Figure 5. Phylogenetic analysis; exon/Intron genomic structure and protein motif organization of E2F, DP, DEL and Rb in B. pendula. An unrooted phylogenetic tree was constructed using MEGA5.05 by the neighbor-joining method. Gene structure of the corresponding BpE2FsBpDPsBpDELs and BpRbs genes, TBtools software was used to visualize gene structure. The yellow boxes represent exons and grey lines represent introns. Use MEME Web server to analyze the distribution of conserved motifs in BpE2FsBpDPsBpDELs and BpRbs protein. The protein motif figure of BpE2FsBpDPsBpDELs and BpRbs was constructed by TBtools software. (a) Phylogenetic analysis of BpE2FsBpDPsBpDELs and BpRbs; (b) Exon/Intron genomic structure of BpE2FsBpDPsBpDELs and BpRbs; (c) Protein motif organization of BpE2FsBpDPsBpDELs and BpRbs.3.7. Identification and analysis of KIP-related proteins (KRP) and WEE gene family Members of Betula pendulaThe activity of CYC-CDK is also regulated by an inhibitory protein CKI (also known as KRP). Seven CKI genes belonging to the group of Kip/Cip CKIs have been described previously for Arabidopsis, designated KRP1 to KRP7 [27]. In this study, we have identified four BpKRPs in the B. pendula genome. These four genes all have motif 1 (Figure 6c). BpKRP1 and BpKRP2 also contain the same motif 2, and both contain 3-4 introns (Figure 6b), and have similar structures.CDK/cyclin activity is regulated negatively by phosphorylation of the CDK subunit by the WEE1 kinase and positively when the inhibitory phosphate groups are removed by the CDC25 phosphatase. Two BpWEEs were identified in the B. pendula genome, their conserved motifs are similar in structure, while there are only two introns in BpWEE2.Figure 6. Phylogenetic analysis; exon/Intron genomic structure and protein motif organization of KRP and WEE in B. pendula. An unrooted phylogenetic tree was constructed using MEGA5.05 by the neighbor-joining method. Gene structure of the corresponding BpKRPs, BpWEEs genes, TBtools software was used to visualize gene structure. The yellow boxes represent exons and grey lines represent introns. Use MEME Web server to analyze the distribution of conserved motifs in BpKRPs, BpWEEs protein. The protein motif figure of BpKRPs, BpWEEs was constructed by TBtools software. (a) Phylogenetic analysis of BpKRPs, BpWEEs; (b) Exon/Intron genomic structure of BpKRPs, BpWEEs; (c) Protein motif organization of BpKRPs, BpWEEs.3.8. RNA-seq expression analysis of B. pendula cell cycle genesWe applied quantitative criteria to assign genes that are likely to be cell cycle genes based on transcript abundance and specificity. The tissue specific expressional data include xylem, roots, leaves and flowers. We calculated the total expression of the 59 identified genes in xylem and selected 17 genes which have a high expression in leaves or xylem or flower (Figure 7). The 17 cell cycle genes were BpCDKA.1, BpCDKB1.1, BpCDKB2.1, BpCKS1.2, BpCYCB1.1, BpCYCB1.2, BpCYCB2.1, BpCYCD3.1, BpCYCD3.5, BpDEL1, BpDpa2, BpE2Fa, BpE2Fb, BpKRP1, BpKRP2, BpRb1 and BpWEE1.In the BpCDK family, BpCDKA.1 is abundant in xylem. In addition, BpCDKA.1, BpCDKB1.1 and BpCDKB2.1 were highly expressed in leaves. BpCDKA.1 is highly expressed in all the four investigated tissues, which indicated BpCDKA.1 may play multiple roles in different tissues. The most similar genes to BpCDKA.1, BpCDKB1.1 and BpCDKB2.1 in A. thaliana are CDKA; 1 (AT3G48750), CDKB1.1 (AT3G54180) and CDKB2.1 (AT1G76540).A total of 27 BpCYCs were detected in the B. pendula genome, of which BpCYCD3.5 is abundant in flower and leaves. The gene most similar to BpCYCD3.5 in A. thaliana is AT3G50070. In addition to this, BpCYCB1.1BpCYCB1.2BpCYCB2.1 and BpCYCD3.1 are highly expressed in leaves.In the CKS family of birch, BpCKS1.2 was most abundant in the leaf and expressed at moderate levels in the other three tissues. The gene most similar to BpCKS1.2 in A. thaliana is AT2G27960.BpRb1 is abundant in leaves BpRb1 is most similar to AT3G12280. ZmRb1 binds to D- type cyclins in plants, is highly expressed in differentiated cells, and regulates leaf development at temporal and spatial level [28]. BpE2Fa and BpE2Fb are abundant in leaves. BpE2Fa and BpE2Fb are most similar to AT2G36010 and AT5G22220, respectively. Two BpDPs were identified in the B. pendula genome, of which BpDP2 is abundant in xylem, and this gene is similar to AT5G02470. BpDEL1 is abundant in leaves. This gene is similar to AT3G48160 in A. thaliana. In the KRP family of birch, BpKRP1 was most abundant in the xylem and BpKRP2 also was expressed at a high level in the xylem. These two genes are most similar to AT2G23430 in A. thaliana. Moreover, BpKRP1 and BpKRP2 are also highly expressed in flower and leaves. BpWEE1 is abundant in leaves. This gene is similar to AT1G02970.Figure 7. The heat map shows the expression of cell cycle genes in different parts of the birch tissue. Highly or lowly expressed genes are colored red or blue, respectively. DiscussionPrevious studies have identified many cell cycle genes [29], but the genetic and biochemical roles of the birch cell cycle genes need to be better defined. In this study, we identified a total of 59 cell cycle genes in B. pendula, which should help clarifying the molecular mechanism of plant growth and development in B. pendula. Plant cell cycle could be regulated by altered expression of some G1-S and G2-M checkpoints genes in cells [3].G1-S phase was one of the most important checkpoints among all the cell cycle, and CycD genes have been indicated as a sensor of extracellular growth condition [1]. Over expression of CycD3;1 in Arabidopsis thaliana could induce B-type cyclin expression, resulting in not only an increase in endoreduplication but also in mitosis [30]. A further study revealed that CYCLIN B1; 2 was the mitosis promoting factor [31]. CYCLIN B1;2 expression can promote nuclear and cellular division, which is sufficient to trigger endoreduplication to mitosis, but not sufficient enough to increase cell cycle rounds [31]. In contrast with our results, BpCYCB1.1BpCYCB1.2BpCYCB2.1 and BpCYCD3.1 are highly expressed in leaves, and BpCYCD3.5 is abundant in flower and leaves (Figure 7). These genes with high expression levels in birch tissues contain CYCD3.1 and CYCB1.2, indicating that these two genes in birch may also play a very important role in cell division. Gene structure analysis found that the gene sequence structure of BpCYCs family members is similar (Figure 3b), indicating that their gene structure is highly conserved during evolution. Both the pistil cell death and stamen cell arrest are involved in cell cycle regulation in maize sex determination, CYCA, CYCB and CDK were highly expressed in the developing pistil and stamen, while WEE1 and CKI were only expressed in the arresting stamen [32]. In our study, part of genes was highly expressed in flower, such as BpCYCD3.5, BpCKS1.2, BpCDKA.1 (Figure 7). However, birch has unisexual flowers on separate male and female inflorescences (catkins) [12,33,34]. How the cell cycle genes regulate the flower development process of birch needs our further research. ConclusionsCell cycle genes are closely related to all life activities of plants, we identified 17 core cell cycle genes in the genome of birch by combining phylogenetic analysis, gene structure analysis and tissue specific expression data, provide some help for better application of cell cycle genes and modern molecular breeding.Author Contributions: Conceptualization, Haijiao Huang; software, Yijie Li and Song Chen; validation, Haijiao Huang; writing-original draft preparation, Yijie Li and Song Chen; writing-review and editing, Yijie Li; project administration, Haijiao Huang; funding acquisition, Haijiao Huang. All authors have read and agreed to the published version of the manuscript.Funding: This research was funded by the Fundamental Research Funds for the Central Universities (2572018BW06), the National Natural Science Foundation of China (31800556).Acknowledgments: We thank the reviewers and editors who provided constructive comments on our manuscript.Conflicts of Interest: The authors declare no conflict of interest.References De Veylder, L. The Discovery of Plant D-Type Cyclins. The Plant cell 2019, 31, 1194-1195, doi:10.1105/tpc.19.00277. Boudolf, V.; Lammens, T.; Boruc, J.; Van Leene, J.; Van Den Daele, H.; Maes, S.; Van Isterdael, G.; Russinova, E.; Kondorosi, E.; Witters, E., et al. CDKB1;1 Forms a Functional Complex with CYCA2;3 to Suppress Endocycle Onset. Plant physiology 2009, 150, 1482-1493, doi:10.1104/pp.109.140269. Boudolf, V.; Vlieghe, K.; Beemster, G.T.S.; Magyar, Z.; Acosta, J.A.T.; Maes, S.; Van Der Schueren, E.; Inze, D.; De Veylder, L. The plant-specific cyclin-dependent kinase CDKB1;1 and transcription factor E2Fa-DPa control the balance of mitotically dividing and endoreduplicating cells in Arabidopsis. The Plant cell 2004, 16, 2683-2692, doi:10.1105/tpc.104.024398. Vandepoele, K.; Raes, J.; De Veylder, L.; Rouze, P.; Rombauts, S.; Inze, D. Genome-wide analysis of core cell cycle genes in Arabidopsis. The Plant cell 2002, 14, 903-916, doi:10.1105/tpc.010445. R. Pelc;Alma Howard Effect of Irradiation on DNA Synthesis in Vicia as Shown by Autoradiographs [J] Acta Radiologica,1954 Inze, D.; De Veylder, L. Cell cycle regulation in plant development. Annual review of genetics 2006, 40, 77-105, doi:10.1146/annurev.genet.40.110405.090431. Kosugi, S.; Ohashi, Y. Interaction of the Arabidopsis E2F and DP proteins confers their concomitant nuclear translocation and transactivation. Plant physiology 2002, 128, 833-843, doi:10.1104/pp.010642. Boniotti, M.B.; Gutierrez, C. A cell-cycle-regulated kinase activity phosphorylates plant retinoblastoma protein and contains, in Arabidopsis, a CDKA/cyclin D complex. The Plant journal : for cell and molecular biology 2001, 28, 341-350, doi:10.1046/j.1365-313x.2001.01160.x. Evans T et al. Cyclin: a protein specified by maternal mRNA in sea urchin eggs that is destroyed at each cleavage division.[J]. Cell, 1983, 33(2) : 389-96 Drakare, S.; Lennon, J.J.; Hillebrand, H. The imprint of the geographical, evolutionary and ecological context on species-area relationships. Ecol Lett 2006, 9, 215-227, doi:10.1111/j.1461-0248.2005.00848.x. Finlay, B.J. Global dispersal of free-living microbial eukaryote species. Science 2002, 296, 1061-1063, doi:10.1126/science.1070710. Fenchel, T.; Finlay, B.J. The ubiquity of small species: Patterns of local and global diversity. Bioscience 2004, 54, 777-784, doi:Doi 10.1641/0006-3568(2004)054[0777:Tuossp]2.0.Co;2. Salojarvi, J.; Smolander, O.P.; Nieminen, K.; Rajaraman, S.; Safronov, O.; Safdari, P.; Lamminmaki, A.; Immanen, J.; Lan, T.Y.; Tanskanen, J., et al. Genome sequencing and population genomic analyses provide insights into the adaptive landscape of silver birch. Nature genetics 2017, 49, 904-+, doi:10.1038/ng.3862. Huang, H.J.; Wang, S.; Jiang, J.; Liu, G.F.; Li, H.Y.; Chen, S.; Xu, H.W. Overexpression of BpAP1 induces early flowering and produces dwarfism in Betula platyphylla x Betula pendula. Physiologia plantarum 2014, 151, 495-506, doi:10.1111/ppl.12123. Chen, S.; Lin, X.; Zhang, D.W.; Li, Q.; Zhao, X.Y.; Chen, S. Genome-Wide Analysis of NAC Gene Family in Betula pendula. Forests 2019, 10, doi:Artn 74110.3390/F10090741. Camacho, C.; Coulouris, G.; Avagyan, V.; Ma, N.; Papadopoulos, J.; Bealer, K.; Madden, T.L. BLAST plus : architecture and applications. BMC bioinformatics 2009, 10, doi:Artn 42110.1186/1471-2105-10-421. Marchler-Bauer, A.; Bryant, S.H. CD-Search: protein domain annotations on the fly. Nucleic acids research 2004, 32, W327-W331, doi:10.1093/nar/gkh454. Gang, H.X.; Li, R.H.; Zhao, Y.M.; Liu, G.F.; Chen, S.; Jiang, J. Loss of GLK1 transcription factor function reveals new insights in chlorophyll biosynthesis and chloroplast development. Journal of experimental botany 2019, 70, 3125-3138, doi:10.1093/jxb/erz128. Langmead, B.; Salzberg, S.L. Fast gapped-read alignment with Bowtie 2. Nature methods 2012, 9, 357-U354, doi:10.1038/Nmeth.1923. Li, B.; Dewey, C.N. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC bioinformatics 2011, 12, doi:Artn 32310.1186/1471-2105-12-323. Guo, J.; Song, J.; Wang, F.; Zhang, X.S. Genome-wide identification and expression analysis of rice cell cycle genes. Plant molecular biology 2007, 64, 349-360, doi:10.1007/s11103-007-9154-y. Adachi, S.; Nobusawa, T.; Umeda, M. Quantitative and cell type-specific transcriptional regulation of A-type cyclin-dependent kinase in Arabidopsis thaliana. Developmental biology 2009, 329, 306-314, doi:10.1016/j.ydbio.2009.03.002. Boudolf, V.; Barroco, R.; Engler, J.D.; Verkest, A.; Beeckman, T.; Naudts, M.; Inze, D.; De Veylder, L. B1-type cyclin-dependent kinases are essential for the formation of stomatal complexes in Arabidopsis thaliana. The Plant cell 2004, 16, 945-955, doi:10.1105/tpc.021774. Wang Y, Wang Q, Zhao Y, Han G, Zhu S. Systematic analysis of maize class III peroxidase gene family reveals a conserved subfamily involved in abiotic Cai et al. BMC Genomics (2021) 22:314 Page 18 of 19 stress response. Gene. 2015;566(1):95–108. https://doi.org/10.1016/j.gene.201 5.04.041 Andersen, S.U.; Buechel, S.; Zhao, Z.; Ljung, K.; Novak, O.; Busch, W.; Schuster, C.; Lohmann, J.U. Requirement of B2-type cyclin-dependent kinases for meristem integrity in Arabidopsis thaliana. The Plant cell 2008, 20, 88-100, doi:10.1105/tpc.107.054676. Huntley, R.; Healy, S.; Freeman, D.; Lavender, P.; Murray, J.A.H. The maize retinoblastoma protein homologue ZmRb-1 is regulated during leaf development and displays conserved interactions with G1/S regulators and plant cyclin D (CycD) proteins. Plant molecular biology 1998, 37, 155-169. Sabelli, P.A.; Larkins, B.A. Regulation and function of retinoblastoma-related plant genes. Plant Science 2009, 177, 540-548, doi:10.1016/j.plantsci.2009.09.012. Shimotohno, A.; Umeda-Hara, C.; Bisova, K.; Uchimiya, H.; Umeda, M. The plant-specific kinase CDKF;1 is involved in activating phosphorylation of cyclin-dependent kinase-activating kinases in Arabidopsis. The Plant cell 2004, 16, 2954-2966, doi:DOI 10.1105/tpc.104.025601. De Veylder, L.; Beeckman, T.; Beemster, G.T.; Krols, L.; Terras, F.; Landrieu, I.; van der Schueren, E.; Maes, S.; Naudts, M.; Inze, D. Functional analysis of cyclin-dependent kinase inhibitors of Arabidopsis. The Plant cell 2001, 13, 1653-1668, doi:10.1105/tpc.010087. Schnittger, A.; Schobinger, U.; Bouyer, D.; Weinl, C.; Stierhof, Y.D.; Hulskamp, M. Ectopic D-type cyclin expression induces not only DNA replication but also cell division in Arabidopsis trichomes. Proceedings of the National Academy of Sciences of the United States of America 2002, 99, 6410-6415, doi:10.1073/pnas.092657299. Schnittger, A.; Schobinger, U.; Stierhof, Y.D.; Hulskamp, M. Ectopic B-type cyclin expression induces mitotic cycles in endoreduplicating Arabidopsis trichomes. Current biology : CB 2002, 12, 415-420, doi:10.1016/s0960-9822(02)00693-0. Lee, J.R.; Kim, J.C. Temporal and Spatial Regulation of Cell Cycle Genes during Maize Sex Determination. Journal of Life Science 2006, 16, 828-833. Wang, S.; Huang, H.J.; Han, R.; Liu, C.Y.; Qiu, Z.N.; Liu, G.F.; Chen, S.; Jiang, J. Negative feedback loop between BpAP1 and BpPI/BpDEF heterodimer in Betula platyphylla x B. pendula. Plant Science 2019, 289, doi:ARTN 11028010.1016/j.plantsci.2019.110280. Wang, S.; Huang, H.J.; Han, R.; Chen, J.Y.; Jiang, J.; Li, H.Y.; Liu, G.F.; Chen, S. BpAP1 directly regulates BpDEF to promote male inflorescence formation in Betula platyphylla x B. pendula. Tree physiology 2019, 39, 1046-1060, doi:10.1093/treephys/tpz021. 
+ Respond to this comment

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 1
Metrics 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.