Designing, Optimization and Validation of Tetra Primer ARMS-PCR Protocol for Genotyping Single Nucleotide Polymorphism rs4731702 (C/T) of KLF14 Gene Associated with Type 2 Diabetes Mellitus: A Study in San Luis, Argentina ()

1. Introduction
Type 2 Diabetes Mellitus (T2DM) is a chronic disease also called adult-onset diabetes or non-insulin-dependent diabetes and accounts for more than 90% of all diabetes types in the global population. Without proper monitoring, it can lead to serious complications such as cardiovascular disease, kidney disease, nerve disease, and eye disease. T2DM is a life-long condition that requires careful management. The incidence and prevalence have reached epidemic proportions worldwide [1] .
The physiological disturbances of T2DM include abdominal obesity, insulin resistance, elevated low density lipoprotein cholesterol (LDL-c), elevated triglycerides (TG), reduced high density lipoproteins cholesterol (HDL-c) and hypertension. Although environmental factors, particularly caloric excess and physical inactivity, play a role in T2DM; the traits are highly heritable.
To date, approximately 25 genome-wide significant common variant associations with T2DM have been described, mostly through genome-wide association analyses (GWAS) [2] - [12] . The identities of the variants and genes mediating the susceptibility effects at most of these signals have yet to be established, and the known variants account for less than 10% of the overall estimated genetic contribution to T2DM predisposition.
Although GWAS have identified numerous loci influencing these traits individually, but to date, no loci have been found that affect the entire spectrum of T2DM traits. Variants near KLF14, which encodes the transcription factor Krüppel-like Factor 14, have previously been associated with T2DM and HDL cholesterol levels in large GWAS analyses [13] - [18] .
The KLF14 is shown to act as a master role in regulating the expression of adipose genes that are associated with key metabolic traits. Interestingly, expression levels of ten genes (TPMT, ARSD, SLC7A10, C8orf82, APH1B, PRMT2, NINJ2, KLF13, GNB1, MYL5) were associated with a variety of metabolic syndrome traits, including obesity, dyslipidemias and measures of insulin resistance, supporting a trans-causal link between KLF14 expression and these ten genes [19] .
The single nucleotide polymorphism (SNP) of rs4731702 ~ 14 kb upstream of KLF14 has implicated a high correlation with HDL-c and cardiovascular disease [13] [14] [15] [20] .
Maternally derived alleles at rs4731702 in KLF14 are associated with expression of a number of genes that are correlated with insulinemia and other characteristics of metabolic syndrome [19] . Hanson et al. [21] found an association between the diabetes risk allele (C) of the KLF14 SNP rs4731702 and lower insulin sensitivity. The diabetes risk allele also was associated with hyperinsulinemia in a previous study, which suggests increased insulin resistance [16] .
Previous functional studies found that the maternally transmitted T allele of rs4731702 SNP is associated with increased expression of KLF14 in adipose tissue, indicating the presence of a cis expression quantitative trait loci (eQTL) [9] . In addition, the T allele showed a consistent association with increased HDL-c [20] .
Therefore this polymorphism need to be associated with metabolic trait for identification of molecular markers that can be used to identify individuals at risk of developing Diabetes Mellitus at an early stage of life using robust and less costly genotyping techniques. A plethora of methods for genotyping of specific polymorphic loci are currently used and these include Polymerase Chain Reaction-Restriction Fragment Length Polymorphism (PCR-RFLP), Single Strand Conformation Polymorphism (SSCP), direct DNA sequencing, and Tetra-Primer Amplification Refractory Mutation System- Polymerase Chain Reaction (Tetra Primer ARMS-PCR). In developing countries it is very difficult and costly to genotype large number of individuals by methods involving PCR and post-PCR manipulations. To circumvent these problems, simple, fast and cost effective genotyping methods need to be developed. In Tetra Primer ARMS-PCR, 2 pairs of primers in a single PCR tube, can simultaneously amplify both alleles as well as allow amplification of an internal DNA control. This technique has been applied to study different mutations [22] [23] [24] . Ye et al. [25] were the first to describe tetra-primer PCR in which allele-specific amplification is achieved in a single PCR reaction using two outer primers and two allele-specific inner primers. Ye et al. [26] combined tetra-primer PCR with ARMS to form the Tetra Primer ARMS-PCR or T-ARMS technique by introducing deliberate mismatches at position −2 from the 3’ end of inner primers to improve allele specificity. In a single step reaction, the outer primers amplify a large fragment of the target gene, irrespective of its genotype although each inner primer combines with a particular opposite outer primer to generate smaller allele- specific amplicons, which are of different sizes and can easily be discriminated on gel electrophoresis either as homozygous or heterozygous.
In the present study, we developed rapid, efficient, cost effective and allele specific Tetra Primer ARMS-PCR for genotyping rs4731702 (C/T) polymorphism of KLF14, so that these methods can be used to genotype large samples of individuals in order to estimate association of this SNP with individuals at risk of developing Diabetes Mellitus Type 2.
2. Research Design and Methods
2.1. Subjects
The present study was carried out in accordance with the guidelines of the Helsinki Declaration. A total of 60 volunteers (30 patients with type 2 diabetes and 30 healthy age-matched controls) participated in this investigation.
Criteria published by the American Diabetes Association were used to diagnose T2DM [27] . These patients reside in San Luis, Argentina. The protocol for this study was approved by the local Institutional Review Board, and a written informed consent was obtained from each patient to be enrolled. During an initial interview with each patient, they were asked for diseases, medication and smoking histories. Exclusion criteria included liver, kidney and thyroid diseases, as well as the use of anti-lipemic drugs.
2.2. DNA Samples
Genomic DNA was isolated from diabetic patients and healthy volunteers using conventional protocol by Qiagen kits (Qiagen, Inc., Valencia, CA). DNA concentration was detected by UV-VIS spectroscopy and diluted to a final concentration of 20 ng/μL.
2.3. Primer Design, Amplification, and Genotyping of Tetra Primer ARMS-PCR
Based on the GenBank sequence of human KLF14, accession number: NT_007933.15, we designed “in silico” a pair of outer primers and two allele-specific inner primers for Tetra Primer ARMS-PCR, in a free access web (http://cedar.genetics.soton.ac.uk) and then checked for specificity (http://blast.ncbi.nlm.nih.gov/Blast.cgi) (Table 1).
Each PCR reaction was carried out in a total volume of 35 μL, containing 200 ng of template DNA, 1 pmol of inner primer C allele, 10 pmol of inner primer allele T and 10 pmol of each outer primer, 200 μM dNTPs, 2.5 mM MgCl2, 1× buffer, and 1 unit of AmpliTaq Gold DNA polymerase (Perkin-Elmer). The template DNA was denatured for 3 minutes at 95˚C before undergoing 35 cycles of denaturation for 1 minute at 95˚C, primer annealing for 1 minute at 62˚C and extension for 1 minute at 72˚C, and final extension at 72˚C for 3 minutes (Table 1). The resultant products obtained after PCR were separated by electrophoresis on 2.5% agarose gel containing GelRed. The image was visualized and photographed under UV transillumination. Randomly selected 20% of samples were re-genotyped for cross validating initial genotypes. In case of unclear genotyping results, the samples were repeated again in duplicates till clear genotype was available. No genotyping error was observed during cross validation.
2.4. Validation of the Assay
To evaluate the efficiency and accuracy of the assay, selected PCR-amplified DNA samples (n = 3, respectively, for each genotype) were examined by DNA sequencing and the results obtained by Tetra Primer ARMS-PCR were compared with those determined by sequencing.
3. Results
The SNP [rs4731702 (C/T)], an important candidate for identify individuals at risk of
![]()
Table 1. PCR primers and conditions.
developing Diabetes Mellitus Type 2, was genotyped by Tetra Primer ARMS-PCR based methodology. PCR fragments were generated as per expectations for all the loci. All the PCR products were well resolved and sized by agarose gel electrophoresis, allowing easy identification of different genotypes. Heterozygotes and homozygotes were unambiguously assigned from the gel profile.
The size of DNA fragments amplified with these four primers for rs4731702 (C/T) of KLF14 (619 bp control fragment, 158 bp T allele, 515 bp C allele) was suitable for separation on 2.5% agarose gels (Figure 1).
Several factors, including primer concentration and PCR cycling conditions, which can affect PCR specificity and efficiency were optimized. For sets of outer?inner primer pairs we tested Mg2+ concentrations from 1.5 mM to 4 mM and best results were obtained at Mg2+ concentration of 2.5 mM for the primer sets. Gradient PCR was carried out on the BIORAD iCycler to determine the best annealing temperature for primer sets (change per reaction was 1˚C) and optimum annealing temperatures were finalized to be 62˚C for rs4731702 (C/T) loci.
In our study, optimization of primer concentrations was the trickiest one, since the usually suggested outer-inner ratio (1:10) for tetra-primer ARMS PCR was not useful for our primer sets. Thus, we performed PCR with three more ratios (1:10, 1:5, 1:1) of outer-inner primers and 1:1 was found to give best results for T allele and 1:10 for the C allele.
Validation of the developed methodology was done by direct sequencing of representative samples from each set using the outer primers for each SNP. We observed complete concordance between the methods. The genotypes scored from the assay were in 100% accordance with direct sequencing (Figure 2).
4. Discussion
Kruppel-like factors (KLFs) comprise a family of evolutionarily conserved zinc finger- containing transcription factors with diverse regulatory functions in cell growth, proliferation, differentiation and embryogenesis [28] [29] . In fact, KLF14 has been recently
![]()
Figure 1. Agarose gel electrophoresis (2.5%) of polymerase chain reaction (PCR) product of Tetra-primer ARMS-PCR. Lanes 2, 9 and 10 show a typical T/T genotype; lanes 4, 5, 6, 8, 11 and 12 show a typical C/T genotype; lane 3 shows a typical C/C genotype; lane 13 shows (-) control and lanes 1, 7 and 14 show the molecular weight marker.
(a)
(b)
(c)
Figure 2. The sequencing results of three genotypes of the KLF14 gene.
proposed as a master trans-regulator of multiple genes that are associated with metabolic phenotypes in adipose tissue [19] , T regulatory cell differentiation [30] , and lipid-mediated signaling through a distinct epigenetic mechanism [31] .
KLF14 has recently elicited significant attention since extensive genetic studies in humans identified a central role of this protein in the development of metabolic diseases, in particular those that regulate lipid metabolism. In fact, because of its contribution to metabolic diseases, KLF14 has been recently referred to as a “conductor of the metabolic syndrome orchestra” [32] .
KLF14, a maternally expressed imprinted gene without introns, is robustly associated with HDL-c levels, cardiovascular disease, Type 2 Diabetes Mellitus, obesity, and cancer [13] [14] [16] [18] [19] [20] [31] [33] [34] [35] [36] .
Small et al. [19] demonstrated that the type 2 diabetes and HDL-c associated cis- acting eQTL of KLF14 acts as a master trans-regulator of adipose gene expression. The results suggested a trans-causal link between KLF14 expression and ten genes that were associated with a variety of metabolic syndrome traits including obesity, dyslipidemia, and measures of insulin resistance. Moreover, using large scale genome-wide association study data, they showed that five of the ten genes had nearby SNPs that were associated with key metabolic syndrome traits at genome wide significance.
Variants near the transcription factor KLF14 (Kruppel-like factor 14) are robustly associated with both Type 2 Diabetes and HDL-c in large-scale genome-wide association studies conducted in Caucasians [13] [17] [20] [34] . These studies have implicated a group of highly-correlated SNPs including rs4731702~14 kb upstream of KLF14 [13] [17] .
Since transcription factors such as KLF14 typically modulate expression of other genes in trans, Small et al. [19] tested for association between rs4731702 and expression levels of ~24 K probes (16,663 genes) and they focused on ten genes (TPMT, ARSD, SLC7A10, C8orf82, APH1B, PRMT2, NINJ2, KLF13, GNB1, MYL5) showing genome- wide significant trans (GWST) associations (p < 5 × 10−8) driven by rs4731702. Therefore, de Assuncao et al. [31] reported the description of the activity and mechanisms underlying the function of KLF14 as an activator protein and novel regulator of lipid signaling.
In addition, the T allele of rs4731702, is significantly associated with HDL-c levels and a decreased risk of cardiovascular disease and type 2 diabetes [10] [13] [34] .
Functional studies demonstrate that the maternally transmitted T allele of rs4731702 SNP is associated with increased expression of KLF14 in adipose tissue, indicating the presence of a cis expression quantitative trait loci (eQTL) [10] . All this evidence indicates that the KLF14 variant is an important regulator of lipoprotein metabolism. In addition, the T allele carriers have higher ApoA-I levels in the Mulao population in China [33] .
Taken together, the rs4731702 SNP may act in cis to influence the KLF14-associated trans-regulatory network and bring about the cascade of events in lipid metabolism.
Chen et al. [20] detected positive associations of KLF14-rs4731702 with atherosclerotic cardiovascular disease, including ischemic stroke and myocardial infarction phenotypes. Thus, the KLF14 gene is not only an important factor for the regulation of metabolic phenotypes, but also involved in the pathogenesis of atherosclerotic-related phenotypes such as ischemic stroke and myocardial infarction.
Hence, for genetic analysis, fast and economical assays that can be performed with standard PCR instruments are highly desirable. Single nucleotide polymorphism (SNP), a novel molecular marker technology, refers to a sequence polymorphism caused by a single nucleotide mutation at a specific locus in the DNA sequence. This sort of polymorphism includes single base transitions, transversions, insertions and deletions [37] , and the minor allele frequency should be 1% or greater. Of all the SNP mutation types, transitions are the most common (approx. 66.6%) [38] . The workers in the past have used techniques like Polymerase Chain Reaction-Restriction Fragment Length Polymorphism (PCR-RFLP), Single Strand Conformation Polymorphism (SSCP), direct DNA sequencing, and tetra-primer Amplification Refractory Mutation System-Poly- merase Chain Reaction (ARMS-PCR) for genotyping of specific polymorphic nucleotide loci. Among these the tetra-primer ARMS-PCR could be a useful tool for genotyping, since SSCP may not be repeatable some times, direct DNA sequencing is a cumbersome, time consuming, technically demanding and costly procedure and the possibility of getting a restriction site for an enzyme could be rare for genotyping by RFLP.
Compared to other genotyping techniques, Tetra Primer ARMS-PCR has been reported to be a rapid, reliable, simple and economical assay for SNP genotyping [26] [39] [40] . In the other hand, Etlik et al. [41] compared tetra-primer ARMS assay with routinely used methods such as typical PCR-RFLP analysis, real time PCR assay and DNA sequencing. They concluded that although real time PCR and DNA sequencing are sensitive and accurate techniques, Tetra Primer ARMS-PCR assay could be beneficial in terms of total time, cost and applicability in a typical laboratory.
Hence Tetra Primer ARMS-PCR based methodology was developed for genotyping rs4731702 (C/T) in the present study. The assay described here is more convenient than the traditional PCR-RFLP since it eliminates the need for incubation with restriction enzymes. This not only avoids any consequent errors and artifacts from such procedures but also reduces the amount of DNA required for the digestion step in PCR- RFLP. No special equipment and only a small amount of standard PCR reagents are needed in Tetra Primer ARMS-PCR.
The Tetra Primer ARMS-PCR methods described above are the first reported method allowing one to genotype the SNP rs4731702 (C/T) of KLF14 gene with no post- PCR treatment other than electrophoresis. These methods are rapid, simple, reliable, and easy to perform, economical and require minimum level of expertise that can be used for both large-and small-scale genotyping studies.
In summary, the association studies which link genetic variants with lipid levels and susceptibility of atherosclerotic cardiovascular disease risk may also have important significances in pharmacogenetics for guiding personalized treatment of high lipids, especially in our populations which have a high prevalence of T2DM and metabolic syndrome [42] .
5. Conclusions
With the rapid advances in molecular techniques, various methods for genotyping single-nucleotide polymorphisms (SNPs) are available. Still, the search for easy, robust, and less costly techniques continues. We wished to develop a Tetra Primer Amplification Refractory Mutation System-Polymerase Chain Reaction based technique for genotyping the SNP rs4731702 (C/T) of KLF14 gene so that these methodologies could be used to establish association between the studied polymorphism and metabolic trait for detection of molecular markers that can be used to identify individuals at risk of developing Diabetes Mellitus Type 2.
Importantly, our study emphasizes the importance of genotyping T2DM patients according to the rs4731702 (C/T) polymorphism of the KLF14 gene in order to establish efficient prevention policies to reduce the risk of developing T2DM. Finally, genotyping of the rs4731702 (C/T) polymorphism of the KLF14 gene can be of high predictive and interventional value of cardiovascular complications in our, and probably other, populations which have a high prevalence of T2DM.
Acknowledgements
This research was supported by grants from the Universidad Nacional de San Luis (SES-PROICO 2-2814).
Conflict of Interests
The authors declare no conflict of interests with respect to the present paper.