CRISPR/Cas9-Mediated Adenine Base Editing in Rice Genome
Li Hao, Qin Ruiying, Liu Xiaoshuang, Liao Shengxiang, Xu Rongfang, Yang Jianbo, Wei Pengcheng
Key Laboratory of Rice Genetics & Breeding, Institute of Rice Research, Anhui Academy of Agricultural Science;
Corresponding author: WEI Pengcheng (weipengcheng@gmail.com); YANG Jianbo (yjianbo@263.net)
Abstract

SUMMARY Precise base editing is highly desired in plant functional genomic research and crop molecular breeding. In this study, we constructed a rice-codon optimized adenine base editor (ABE)-nCas9 tool that induced targeted A∙T to G∙C point mutation of a key single nucleotide polymorphism site in an important agricultural gene. Combined with the modified single-guide RNA variant, our plant ABE tool can efficiently achieve adenine base editing in the rice genome.

The CRISPR/Cas9 system has been successfully employed in targeted genome mutagenesis in rice and various plant species, providing a promising strategy to accelerate crop improvement. The system is generally used to create gene knockouts or loss-of-function mutations by inducing the random deletions or insertions through the error-prone non-homologous end joining (NHEJ) mechanism. However, a number of agricultural traits are conferred by single nucleotide polymorphisms (SNPs). Despite extensive efforts in developing reliable point mutations or base replacement methods through the homology-directed repair (HDR) mechanism (Miki et al, 2018), it is haunted by a troublesome fact that the low amounts of donor DNA templates limit the efficiency of precise gene editing in plant cells. Base editing is a novel approach that induces single base conversion without requiring HDR or DNA double-strand breaks (DSBs) (Komor et al, 2016). The most well-known base editor tool, base editor 3 (BE3), is consisted of an nCas9 (Cas9-D10A nickase), the rat APOBEC1 cytidine deaminase, and the uracil DNA glycosylase inhibitor (UGI) (Komor et al, 2016). Cooperating with single-guide RNA (sgRNA), the nCas9 fusion protein is directed to the target genome site, and efficiently induces C to T or G to A substitutions (Komor et al, 2016). The BE3 system is highly capable of precise crop breeding by mediating C∙ G to T∙ A conversion at key functional SNP loci associated with various important agricultural traits, such as herbicide resistance, plant type and nitrogen use efficiency (Lu and Zhu, 2017; Zong et al, 2017).

A∙ T to G∙ C mutations frequently occur as functional SNPs. Recently, a novel base editing tool, the adenine base editor (ABE), was developed by extensive protein engineering based on the bacterial tRNA adenosine deaminase (TadA) (Gaudelli et al, 2017). Natural TadA converts A to inosine (I) in tRNA but not DNA. After several rounds of protein evolution, several Escherichia coli TadA (ecTadA) mutants (TadA* ) with DNA- modifying capabilities were created (Gaudelli et al, 2017). To assemble the ABE, a heterodimer consisting of TadA* and wild-type TadA was formed into a modified Cas9 enzyme. In mammalian cells, the late-stage ABE tool, e.g., ABE7.10, can be programmed to induce A∙ T to G∙ C conversion at target loci without undesired InDels or off-target effects (Gaudelli et al, 2017). In this study, we reported the efficient A∙ T to G∙ C conversion in the rice genome using the ABE7.10 system.

We first synthesized the rice codon-optimized ecTadA- XTEN-TadA* 7.10, which was then attached to the N-terminal SpCas9 (D10A) nickase (Supplemental Fig. 1). The sequence of the recombinant protein was cloned to our previously modified pHUN411 binary vector under the control of a maize ubiquitin promoter. This vector, designated as pHUN411-ABE (Fig. 1-A), still contains a rice U3 promoter cassette for protospacer sequence integration and sgRNA expression (Xing et al, 2014).

The vector was tested in the rice amylose synthesis gene Wx. Wx-mq is a minor mutant allele that results in low amylose content in rice endosperm (Sato et al, 2002). The Wx-mqallele contains a T to C substitution at position 595, resulting in the replacement of a tyrosine by a histidine at residue 191. To mimic this mutation, a sgRNA (Wx-sg) was designed (Fig. 1-B) and constructed into the pHUN411-ABE vector. The construct was transformed into rice plants via Agrobacteria, and 28 independent transgenic events were regenerated. The target mutants were screened by high-resolution melting analysis and then confirmed by PCR sequencing. Two targeted mutants were identified (Supplemental Table 2). One line (Wx-6, the 2nd chromatogram) harbored a T to C substitution at position 6 (the first nucleotide of the 20-bp sgRNA distal from PAM is position 1) (Fig. 1-C), whereas the other line (Wx-15, the 3rd chromatogram) had two T to C mutations at positions 5 and 6 (Fig. 1-C). The genotype of the Wx-15 line was further assessed by clone sequencing. We found that approximately 16.67% clones had the desired T to C conversion at position 5, and approximately 27.78% clones harbored the substitution at position 6 (Supplemental Fig. 2). To increase editing efficiency, an extended version of the sgRNA was used to replace the conventional sequence in pHUN411-ABE (Supplemental Fig. 3) (Dang et al, 2015), generating the pHUN411-ABE-sg2.0 vector. The Wx-sg was fused into the ABE-sg2.0 binary vector, and base changes were observed in 5 out of the 33 transgenic lines (Fig. 1-D). Among these lines, Wx-sg2.0-7 and Wx-sg2.0-22 were edited at positions 5 and 6, respectively. Three lines (Wx-sg2.0-2, Wx-sg2.0-15 and Wx-sg2.0-21) were potentially chimeric (Supplemental Table 2). After detailed genotyping, Wx-sg2.0-21 showed that the T at positions 5 and 6 was simultaneously mutated to C in a single strand (Supplemental Fig. 2). Furthermore, a T to C substitution at position 9 was detected in Wx-sg2.0-15 (Fig. 1-C and Supplemental Fig. 2), although the editing window of ABE was believed to be limited to 4 bases, ranging from positions 4 to 7 (Gaudelli et al, 2017).

Fig. 1. Targeted adenine base editing (ABE) in rice genome.
A, Schematic diagrams of rice codon-optimized tools pHUN411-ABE and pHUN411-ABE-sg2.0. B, Schematic illustration of the target site in the Wx gene. The protospacer adjacent motif (PAM) and sgRNA sequences are underlined. The desired point mutation of Wx-mq is indicated in red. C, Representative Sanger sequencing chromatograms of the ABE-edited Wx allele. Arrows mark the T to C substitutions. The protospacer sequence is underlined. D, The base editing efficiency of the different sgRNAs and different targets. E, Schematic illustration of the target site in the GL2 and OsGRF2 genes by the GL2-sg protospacer. The PAM and sgRNA sequences are underlined. The binding sequence of the miR396 is indicated in red. The asterisks show the mutation sites in the wild-type GL2 allele with large grain size. F, Representative Sanger sequencing chromatograms of the ABE-edited GL2 allele.
RB, Right border; LB, Left border; NTL, Number of transgenic line; NEL, Number of edited line; BEE, Base editing efficiency.

To further test the editing feature of the ABE vectors in rice, we targeted the GL2/OsGRF4 and OsGRF3 genes, which is responsible for grain size and yield. It has been shown that the 2-bp substitution mutation located in the miR396 recognition sequence of GL2 disrupts miR396-directed GL2 transcript cleavage, resulting in heavier grain weight and higher yield (Che et al, 2015). Fig. 1-E shows that the miR396 binding site in GL2 was targeted by sgRNA GL2-sg. This protospacer also exhibits 100% sequence identity to the miR396-binding region of OsGRF3(Fig. 1-E) (Liu et al, 2014). We found that 4 out of the 35 transgenic lines of the pHUN411-ABE-GL2-sg harbored the targeted base mutations (Fig. 1-D and Supplemental Table 3). Among these, three lines had the substitution in the target region of both GL2 and OsGRF3, whereas one harbored the mutation in GL2 but not OsGRF3(Supplemental Table 3). At the target region in GL2, three lines carried the T to C substitution at position 7, and the T to C substitution at position 8 was only observed in one transgenic event (Fig. 1-F and Supplemental Table 3). In addition, we observed that the mutation only occurred at position 7 in the target of OsGRF3 (Supplemental Figs. 3 and 4, and Supplemental Table 3), suggesting that the targeting feature of the ABE tool has position preference for the editing window. The protospacer was also fused to the extended sgRNA variant, and 4 out of 30 transgenic lines harbored the targeted base mutation in the GL2 sequence (Fig. 1-D and Supplemental Table 3). Furthermore, editing in the target region of OsGRF3 was also detected in five independent lines (achieving 16.67% efficiency), which included three lines that were targeted at both sites (Fig. 1-D and Supplemental Table 3). In most of the mutated plants, the conversion of nucleotide T at positions 7 and/or 8 into C was heterozygous or chimeric (Supplemental Fig. 5 and Supplemental Table 3). Off-target effects of the plant ABE tool were also detected. The most frequent off-target region in GL2-sg was located in the miR396-binding site of OsGRF5, with a single base mismatch (Supplemental Fig. 6) (Liu et al, 2014). Assessment of all transgenic plants using both vectors showed no off-target mutations (Supplemental Table 4).

In this study, we reported the efficient A∙ T to G∙ C base editing in rice. The A∙ T to G∙ C mutation can result desired amino acid substitution or potential interference of miRNA binding in the target regions (Supplemental Fig. 7). We noticed the mutation frequency induced by the pHUN411-ABE vector was < 10% at the Wx and GL2 targets. A possible explanation is that a high resolution method (HRM) analysis was used to pre-screen the target mutations, which may ignore some lines with low-frequency chimeric mutations. In plants, the mutagenesis efficiency of the Cas9 variant SpCas9-VQR could be significantly increased by the adapted sgRNA with extended Cas9-binding hairpin structure and mutations on the potential terminator of the pol III promoter (Hu et al, 2018). Using this version of the sgRNA variant, the base editing efficiency of the Cas9-ABE tool was substantially increased, suggesting that the modification of the sgRNA structure improves the efficiency of the ABE tool. Three groups also most recently and independently described a similar A∙ T to G∙ C base editing by ABE in plants (Hua et al, 2018; Li et al, 2018; Yan et al, 2018). The rBE14 system exhibits 16.67% to 62.26% editing efficiency at three different genomic loci in rice (Yan et al, 2018). The pRABEsp-OsU6 system could achieve as high as 26% editing efficiency in the T0 rice transgenic population (Hua et al, 2018), and the PABE-7 system, developed by a series optimization on position of TadA, nuclear localization sequences locations, and also sgRNA expression, strategies exhibits A to G substitution efficiency of 15.8% to 59.1% in the stable transformed rice lines (Li et al, 2018). The nucleotide sequence context might be a major contributing factor to the efficiency variations of different SpCas9-based plant ABE tools. It will be interesting to test them, including our ABE system, in the same target loci under the same transformation procedure. Furthermore, a recent study in mammalian cells indicated that base editing efficiency could be affected by multiple factors during engineering, including the different commercial source of codon-optimization (Koblan et al, 2018). Taken together, the plant ABE systems greatly expand the application of CRISPR-Cas9 tools as well as provide a reliable and efficient method for fundamental plant research and precise crop breeding.

ACKNOWLEDGEMENTS

We appreciate Prof. Chen Qijun for providing the pHUN411 vector. This study was supported by the National Natural Science Foundation of China (Grant No. 31501239) and the Genetically Modified Breeding Major Projects (Grant No. 2016ZX08010-002-008).

SUPPLEMENTAL DATA

The following materials are available in the online version of this article at http://www.sciencedirect.com/science/journal/16726308; http://www.ricescience.org.

Supplemental File 1. Materials and methods used in this study.

Supplemental Table 1. The primers used in this study.

Supplemental Table 1 The primers used in this study.

Supplemental Table 2. Identification of targeted wxmutants induced by adenine base editor tools in transgenic rice.

Supplemental Table 2 Identification of targeted wxmutants induced by ABE tools in transgenic rice

Supplemental Table 3. Identification of targeted gl2and/orosgrf3mutants induced by adenine base editor tools in transgenic rice.

Supplemental Table 3 Identification of targeted gl2and/orosgrf3mutants induced by ABE tools in transgenic rice.

Supplemental Table 4. Detection of the putative off-target effects of the rice adenine base editor tools on the high-similarity site of the GL2-sg protospacer.

Supplemental Table 4 Detection of the putative off-target effects of the rice ABE tools on the high-similarity site of the GL2-sg protospacer.

Supplemental Fig. 1. Sequence of rice codon-optimized ABE7.10.

Fig. 1. Sequence of rice codon-optimized ABE7.10.
The sequence of wild type ecTadA, linker, TadA* 7.10, nCas9(D10A) and NLS was labeled by blue, orange, red, black and cyan respectively.

Supplemental Fig. 2. Genotyping of some targetedwx mutants.

Fig. 2 Genotyping of some targetedwx mutants.
The mutations in the target region are indicated in red. The genotype of the wx targeted mutant was analyzed by PCR sequencing. The numbers located far right indicate the ratio of the number of clones with the corresponding genotype to the number of total sequenced clones. WT, wild-type; Tx to C, the T at position x is mutated to C.

Supplemental Fig. 3. The sequence of the extended sgRNA scaffold.

Fig. 3 The sequence of the extended sgRNA scaffold.
The structures of the sgRNA-scaffold-2.0 sequence were generated with RNAfold and visualized using VARNA (Darty et al., 2009). The adapted and extended RNA is shown in red.

Supplemental Fig. 4. Sanger sequencing chromatograms of the OsGRF3 target region of representative ABE-Gl2-sg edited lines.

Fig. 4 Sanger sequencing chromatograms of the OsGRF3 target region of representative ABE-Gl2-sg edited lines.
The targeted T to C mutation(s) is indicated by arrow(s). The target sequence is underlined.

Supplemental Fig. 5. Genotyping of some transgenic plants targeted by ABE-GL2-sg.

Fig. 5 Genotyping of some transgenic plants targeted by ABE-GL2-sg.
The mutations within the target region are indicated in red. The genotype of the GL2 and OsGRF3 target regions of the mutant was analyzed by PCR sequencing.

Supplemental Fig. 6. Alignment of the miR396 binding region in the rice growth-regulating factor family.

Fig. 6 Alignment of the miR396 binding region in the rice growth-regulating factor family.
The GL2-sg shows 100% sequence identity with GL2 and OsGRF3, whereas there is one base mismatch with that of OsGRF5.

Supplemental Fig. 7. The mutation of protein sequence resulting from the adenine base editor-induced editing.

Fig. 7 The mutation of protein sequence resulting from the adenine base editor-induced editing.
The amino acid of the target region of Wx (A) and GL2/OsGRF3 (B) was indicated. The PAM and protospacer sequence was labeled by blue and orange, respectively.

The authors have declared that no competing interests exist.

Reference
[1] Che R H, Tong H N, Shi B H, Liu Y Q, Fang S R, Liu D P, Xiao Y H, Hu B, Liu L C, Wang H R, Zhao M F, Chu C C. 2015. Control of grain size and rice yield by GL2-mediated brassinosteroid responses. Nat Plants, 2: 15195. [Cited Within:1]
[2] Dang Y, Jia G X, Choi J, Ma H M, Anaya E, Ye C T, Shankar P, Wu H Q. 2015. Optimizing sgRNA structure to improve CRISPR- Cas9 knockout efficiency. Genome Biol, 16: 280. [Cited Within:1]
[3] Gaudelli N M, Komor A C, Rees H A, Packer M S, Badran A H, Bryson D I, Liu D R. 2017. Programmable base editing of A∙T to G∙C in genomic DNA without DNA cleavage. Nature, 551: 464-471. [Cited Within:4]
[4] Hu X X, Meng X B, Liu Q, Li J Y, Wang K J. 2018. Increasing the efficiency of CRISPR-Cas9-VQR precise genome editing in rice. Plant Biotechnol J, 16(1): 292-297. [Cited Within:1]
[5] Hua K, Tao X P, Yuan F T, Wang D, Zhu J K. 2018. Precise A∙T to G∙C base editing in the rice genome. Mol Plant, 11(4): 627-630. [Cited Within:2]
[6] Koblan L W, Doman J L, Wilson C, Levy J M, Tay T, Newby G A, Maianti J P, Raguram A, Liu D R. 2018. Improving cytidine and adenine base editors by expression optimization and ancestral reconstruction. Nat Biotechnol, doi: DOI:10.1038/nbt.4172. [Cited Within:1]
[7] Komor A C, Kim Y B, Packer M S, Zuris J A, Liu D R. 2016. Programmable editing of a target base in genomic DNA without double-strand ed DNA cleavage. Nature, 533: 420-424. [Cited Within:3]
[8] Li C, Zong Y, Wang Y P, Jin S, Zhang D B, Song Q N, Zhang R, Gao C X. 2018. Expand ed base editing in rice and wheat using a Cas9-adenosine deaminase fusion. Genome Biol, 19: 59. [Cited Within:2]
[9] Liu H H, Guo S Y, Xu Y Y, Li C H, Zhang Z Y, Zhang D J, Xu S J, Zhang C, Chong K. 2014. OsmiR396d-regulated OsGRFs function in floral organogenesis in rice through binding to their targets OsJMJ706 and OsCR4. Plant Physiol, 165(1): 160-174. [Cited Within:2]
[10] Lu Y, Zhu J K. 2017. Precise editing of a target base in the rice genome using a modified CRISPR/Cas9 system. Mol Plant, 10(3): 523-525. [Cited Within:1]
[11] Miki D, Zhang W X, Zeng W J, Feng Z Y, Zhu J K. 2018. CRISPR/ Cas9-mediated gene targeting in Arabidopsis using sequential transformation. Nat Commun, 9: 1967. [Cited Within:1]
[12] Sato H, Suzuki Y, Sakai M, Imbe T. 2002. Molecular characterization of Wx-mq, a novel mutant gene for low-amylose content in endosperm of rice( Oryza sativa L. ). Jpn J Breeding, 52(2): 131-135 [Cited Within:1]
[13] Xing H L, Dong L, Wang Z P, Zhang H Y, Han C Y, Liu B, Wang X C, Chen Q J. 2014. A CRISPR/Cas9 toolkit for multiplex genome editing in plants. BMC Plant Biol, 14: 327. [Cited Within:1]
[14] Xu R F, Yang Y C, Qin R Y, Li H, Qiu C H, Li L, Wei P C, Yang J B. 2016. Rapid improvement of grain weight via highly efficient CRISPR/Cas9-mediated multiplex genome editing in rice. J Genet Genom, 43(8): 529-532. [Cited Within:1]
[15] Yan F, Kuang Y J, Ren B, Wang J W, Zhang D W, Lin H H, Yang B, Zhou X P, Zhou H B. 2018. Highly efficient A∙T to G∙C base editing by Cas9n-guided tRNA adenosine deaminase in rice. Mol Plant, 11(4): 631-634. [Cited Within:2]
[16] Zong Y, Wang Y P, Li C, Zhang R, Chen K L, Ran Y D, Qiu J L, Wang D W, Gao C. 2017. Precise base editing in rice, wheat and maize with a Cas9-cytidine deaminase fusion. Nat Biotechnol, 35(5): 438-440. [Cited Within:1]