Selection of highly efficient small interference RNA (SiRNA) targeting mammalian genes

Abstract

RNAi is the method of silencing the expression of targeted genes. RNAi applications include gene function analysis and target validation. Designing highly efficient small interference RNA (siRNA) sequence with maximum target specificity for mammalian RNAi is one of important topics in recent years. In this work, a statistical analysis of the information for a large number (3734) of siRNA presented in the database available on the internet is done. This is to improve the design of efficient siRNA molecules. The (3734) siRNAs are classified according to their efficiency to three groups (high efficient, moderate efficient and low efficient). Thirteen properties (positional and thermodynamics) are identified in the high efficient group in the primary statistical study. In the final statistical study, the average weight of each identified property is calculated. A very good linear correlation was found between the average percentage efficiency and the weighted score of siRNA properties. It is found that the most important feature of highly efficient siRNA is the difference in binding energy between the 5’ end and the 3’ end of the anti-sense strand. The (RISC) activation step is a critical step in RNAi process where the efficiency of this process depends on the instability of the 5’ end of the anti-sense strand.

Share and Cite:

El-lakkani, A. , Elgawad, W. and Sayed, E. (2013) Selection of highly efficient small interference RNA (SiRNA) targeting mammalian genes. Journal of Biophysical Chemistry, 4, 72-79. doi: 10.4236/jbpc.2013.42010.

1. INTRODUCTION

RNA interference (RNAi) is the process through which double stranded RNA (dsRNA) molecules inhibit gene expression by mediating sequence-specific messenger RNA (mRNA) degradation. The process starts when dsRNA molecules are degraded into short interfering RNA (siRNA) molecules, about 21 - 23 nucleotides in length, by the RNase enzyme Dicer. These siRNAs are then incorporated into a ribonucleoprotein complex named RNA-induced silencing complex (RISC). The RISC helicase subunit unwinds a duplex siRNA into two singlestranded RNAs, namely the passenger strand (sense) and the guide strand (antisense). The passenger strand will be degraded, and the guide strand is incorporated into the RNA-induced silencing complex (RISC) and guides it to bind with the complementary mRNA. The mRNA can be inhibited or degraded, depending on the degree of pairmatching [1-3].

Multiple statistical and computational models have been proposed in recent years to design functional siRNA depending on a large number of “features”. These features can be roughly classified into three categories, the first category is sequence features, the second category includes features defined based on the thermodynamics of the siRNA and the third category of features are defined based on the target sites on the mRNA, including target location-related features [4].

In this work, statistical studies are done on a large number of data (3734 siRNA) available on internet: http://gesteland.genetics.utah.edu/members/olgaM/siRN A database September 2006.xls. In the primary statistical study, thirteen properties (positional and thermodynamics) are identified in the high efficient siRNA group. In the final statistical study, the average weight of each identified property is computed. The advantages of this study are the average weights obtained for the identified thermodynamics properties of siRNA and it is done on a large number of data.

2. MATERIAL AND METHODS

The database contain a large number of mammalian experimentally validated (siRNAs) targeting mRNA of different mammalian genes [5-7]. The database contains the sequences and the thermo-dynamics properties of the siRNA besides the experimentally determined of the percent of remaining level of the mRNA in cells.

2.1. Classification of the SiRNA

The siRNAs are classified according to their silencing percentage efficiency (calculated from the percentage of the remaining level of mRNA) to three groups: high-efficient siRNA group (experimental efficiency ≥ 80%), moderate-efficient group (efficiency < 80% and ≥ 50%), and low-efficient group (efficiency < 50%).

2.2. Statistical Studies

Positional properties of the anti-sense strand (AS) of siRNA, such as the presence of the nucleotide (A/U) or (G/C) in certain position are studied in each group. The thermodynamics properties including Gibbs free energy change ΔG of sense-antisense duplex, Gibbs free energy change ΔG of antisense oligo secondary structure, Gibbs free energy change ΔG of antisense oligo-oligo dimer and the binding energy difference dΔG or (ΔG1 - ΔG18) between the 5’end and the 3’end of (AS) strand are studied in each group. The average GC content in each group is also studied.

From these studies the important positional and thermodynamics properties present significantly in the high efficient group are deduced. The thirteen important chosen properties are:

1) A/U at position 1 of AS strand.

2) A/U at position 2 of AS strand.

3) A/U at position 7 of AS strand.

4) A/U at position 10 of AS strand.

5) A/U at position13 of AS strand.

6) A/U at position 14 of AS strand.

7) G/C at position 19 of AS strand.

8) G/C at position 18 of AS strand.

9) G/C content ranged between (6 - 10) nucleotides of 19 ones of AS strand.

10) ΔG of sense-antisense duplex (≥ −36.720 Kcal/mole).

11) ΔG of antisense oligo secondary structure (≥ −1.240 Kcal/mole).

12) ΔG antisense oligo-oligo dimer (≥ −8.414 Kcal/mole).

(13) The binding energy difference between the 5’end and the 3’end of (AS) strand > 0 Kcal/mole.

2.3. Determining the Weight of Each Property

The 3743 siRNAs in the database are sorted according to the type of properties found in each one (from the thirteen identified properties). If more than one siRNA have the same type of properties then the average corresponding efficiency for them is calculated. From these data the different weights of the thirteen properties are computed by using a developed computer program. The program obtains the best fit of the data to a second order polynomial function (). The best fit weights minimizing the sum squared error function were computed 200 times after randomizing the order in which the properties were considered. Then the average weight of each property was computed.

3. RESULT AND DISCUSSION

Figure 1 shows the percent of siRNAs in the three groups (high-efficient, moderate-efficient and low-efficient) which contain A/U at positions from 1 to 19 of AS strand starting from 5’-AS end. It can be seen from Figure 1 that high percentage of siRNA has A/U at positions (1, 2, 7, 10, 13 and 14 of AS strand starting from the 5’end) are present in the high efficient group i.e. low internal stability in these positions. In contrast the low efficient group is enriched with siRNA molecules that have high internal stability (G/C) in these positions (see Figure 2). Khvorova et al. [8] found in their experimental study that the functional group of siRNA is enriched with siRNA molecules that have a low internal stability at the 5’-AS and an overall low internal stability profile, particularly in positions 9 - 14 100 (counted from 5’-AS end) and this is in contrast to the non-functional group.

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1] Elbashir, S.M., Harborth, J., Lendeckel, W., Yalcin, A., Weber, K. and Tuschl, T. (2001) Duplexes of 21-nucleotide RNAs mediate RNA interference in cultured mammalian cells. Nature, 411, 494-498. doi:10.1038/35078107
[2] Martinez, J., Patkaniowska, A., Urlaub, H. Lührmann, R. and Tuschl, T. (2002) Single-stranded antisense siRNAs guide target RNA cleavage in RNAi. Cell, 110, 563-574. doi:10.1016/S0092-8674(02)00908-X
[3] Hannon, G.J. and Rossi, J.J. (2004) Unlocking the potential of the human genome with RNA interference. Nature, 431, 371-378. doi:10.1038/nature02870
[4] Hsieh, A.C., Bo, R., Manola, J., Vazquez, F., Bare, O., Khvorova, A., Scaringe, S. and Sellers, W.R. (2004) A library of siRNA duplexes targeting the phosphoinositide 3-kinase pathway: Determinants of gene silencing for use in cell-based screens. Nucleic Acids Research, 32, 893-901. doi:10.1093/nar/gkh238
[5] Huesken, D., Lange, J., Mickanin, C., Weiler, J., Asselbergs, F., Warner, J., Meloon, B., Engel, S., Rosenberg, A., Cohen, D., Labow, M., Reinhardt, M., Natt, F. and Hall, J. (2005) Design of a genome-wide siRNA library using an artificial neural network. Nature Biotechnology, 23, 995-1001. doi:10.1038/nbt1005-1315a
[6] Jagla, B., Aulner, N., Kelly, P.D., Song, D., Volchuk, A., Zatorski, A., Shum, D., Mayer, T., De Angelis, D.A., Ouerfelli, O., Rutishauser, U. and Rothman, J.E. (2005) Sequence characteristics of functional siRNAs. RNA, 11, 864-872. doi:10.1261/rna.7275905
[7] Katoh, T. and Suzuki, T. (2007) Specific residues at every third position of siRNA shape its efficient RNAi activity. Nucleic Acids Research, 35, 27-40. doi:10.1093/nar/gkl1120
[8] Khvorova, A., Reynolds, A. and Jayasena, S.D. (2003) Functional siRNAs and miRNAs exhibit strand bias. Cell, 115, 209-216. doi:10.1016/S0092-8674(03)00801-8
[9] Shabalina, S. A., Spiridonov, A.N. and Ogurtsov, A.Y. (2006) Computational models with thermodynamic and composition features improve siRNA design. BMC Bioinformatics, 7, 65-80. doi:10.1186/1471-2105-7-65
[10] Shah, J.K., Garner, H.R., White, M.A., Shames, D.S. and Minna, J.D. (2007) sIR: siRNA information resource, a web-based tool for siRNA sequence design and analysis and an open access siRNA database. BMC Bioinformatics, 8, 178. doi:10.1186/1471-2105-8-178
[11] Reynolds, A., Leake, D., Boese, Q., Scaringe, S., Marshall, W.S. and Khvorova, A. (2004) Rational siRNA design for RNA interference. Nature Biotechnology, 22, 326-330. doi:10.1038/nbt936
[12] Ui-Tei, K., Naito, Y., Takahashi, F., Haraguchi, T., OhkiHamazaki, H., Juni, A., Ueda, R. and Saigo, K. (2004) Guidelines for the selection ofhighly effective siRNA sequences for mammalian and chick RNA interference. Nucleic Acids Research, 32, 936-948. doi:10.1093/nar/gkh247
[13] Chalk, A.M., Wahlestedt, C. and Sonnhammer, E.L. (2004) Improved and automated prediction of effective siRNA. Biochemicaland Biophysical Research Communications, 319, 264-274. doi:10.1016/j.bbrc.2004.04.181
[14] Matveeva, O., Nechipurenko, Y., Rossi, L., Moore, B., Saetrom, P., Ogurtsov, A.Y., Atkins, J.F. and Naito, Y., Yamada, T., Ui-Tei, K., Morishita, S. and Saigo, K. (2004) siDi- rect: Highly effective, target-specific siRNA design software for mammalian RNA interference. Nucleic Acids Research, 32, W124-W129. doi:10.1093/nar/gkh442
[15] Tilesi, F., Fradiani, P., Socci, V., Willems, D. and Ascenzioni, F. (2009) Design and validation of siRNAs and shRNAs. Molecular Therapeutics, 11, 156-164. http://ukpmc.ac.uk/abstract/MED/19330721

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.