Application of Hartree-Fock Method for Modeling of Bioactive Molecules Using SAR and QSPR

C. B. R. Santos et al.

The central aim of quantum chemistry is to obtain solutions of the Schrödinger equation for the accurate determination of the properties of atomic and molecular systems, a goal advanced by the calculation of accurate wave functions for many diatomic and polyatomic molecules using the self-consistent field (SCF) method. The application of quantum chemical methods in the study and planning of bioactive compounds has become common practice. From the point of view of planning, it is important to note that molecular modeling, a collective term for the theoretical methods and computational techniques used to model or mimic the behavior of molecules, is not intended to arrive at a bioactive molecule simply through the use of computer programs. The choice of method for energy minimization depends on factors such as the size of the molecule, the availability of parameters, stored data and computational resources. Computer-generated molecular models are the result of mathematical equations that estimate the positions and properties of the electrons and nuclei; the calculations explore the characteristics of a structure, providing a new perspective on the molecule. In this work we present studies of the highest occupied molecular orbital (HOMO) energy, the lowest unoccupied molecular orbital (LUMO) energy and the map of molecular electrostatic potential (MEP) using the Hartree-Fock method with different basis sets (HF/3-21G*, HF/3-21G**, HF/6-31G, HF/6-31G*, HF/6-31G** and HF/6-311G), which are of great importance in modern chemistry, biochemistry, molecular biology and other fields of the health sciences. To obtain a significant correlation, it is essential that the descriptors be used appropriately. Thus, quantum chemical calculations are an attractive source of new molecular descriptors that can, in principle, express all the geometrical and electronic properties of molecules and their interactions with the biological receptor.


Introduction
Brazil has the largest genetic diversity of plant species in the world; however, it is estimated that fewer than 10% have been evaluated for their biological characteristics and fewer than 5% have been subjected to detailed phytochemical studies. Despite a recent increase in research in this area, plants remain a relatively underused and potentially very valuable source for the discovery of new biologically active substances [1] [2]. The bioactive compounds present in the plant kingdom have important biological functions and actions and may be considered promoters of human health. The association between the intake of fruits and vegetables and a decreased risk of developing various chronic degenerative disorders, such as cancer, inflammation, cardiovascular disease, cataracts, macular degeneration and others, is already recognized; carotenoids and phenolic compounds are among the groups of bioactive compounds to which such actions are attributed. The seeds and extracts of annatto (Bixa orellana L.) are used as colorants in the food, pharmaceutical and cosmetic industries owing to the predominant presence of the carotenoid bixin [3]. The efficiency of the biological activity of bioactive compounds of plant origin depends on their structure and concentration. In turn, the amount of these substances in plants is largely influenced by genetic factors and environmental conditions, as well as by the degree of maturation and the plant variety, among other things. It is also known that the measured biological capacity is influenced by the substrate used in the assay, by the solvent and extraction technique used, and by the time-temperature combination. Among organic solvents, methanol, which extracts a high amount of bioactive compounds, has been reported as the most effective [4]-[7].
Montrichardia linifera (Arruda) Schott, of the Araceae family and popularly known as "aninga", is an aquatic macrophyte that forms large clonal populations along the rivers and streams of the Amazon. Riverside dwellers consider this plant poisonous because its sap causes skin burns and eye contact can cause blindness [8]. Paradoxically, however, it is widely used in traditional Amazonian medicine, mainly because of the healing properties of its sap and juice, which have been mentioned in the literature since the nineteenth century for the treatment of wounds and ulcers [9]-[12], leading to the hypothesis that this species may contain biologically active substances. However, very little is known about its chemical composition and biological activities, which may eventually also prove useful against human infections caused by parasites; these remain serious problems in tropical and subtropical developing countries, despite the discovery of new antiprotozoal drugs [13]. Among the antimalarial compounds isolated from plants, artemisinin is one of the most important recent discoveries [14]-[16]. Artemisinin (or Qinghaosu, QHS, Figure 1) represents the most relevant advance in the treatment of malaria in the last 20 years [17]. Artemisinin is a sesquiterpene lactone with an endoperoxide group, and it has been used in traditional Chinese medicine for many centuries as a natural product for the treatment of fever and malaria. This drug was isolated by Chinese chemists in the early 1970s from the ancient medicinal plant Artemisia annua L.
Nowadays, artemisinin and its derivatives are widely used around the world because of their potent antimalarial activity, fast action and low toxicity. As a result, they have become recognized as a new generation of antimalarial drugs [18]. Pinheiro, Ferreira and Romero (2001) combined quantum chemical (Hartree-Fock 3-21G) and multivariate analysis methods (PCA, HCA, KNN and SIMCA) to study and propose dihydroartemisinin derivatives. Using PCA and HCA, seven (7) descriptors responsible for classifying the compounds into two distinct classes were selected and, with the construction of the qualitative KNN and SIMCA models, two (2) compounds from a test set of twelve (12) were predicted to have high activity and were proposed [19]. Artemisinin derivatives with antimalarial activity against Plasmodium falciparum, which is resistant to mefloquine, were studied using quantum chemical methods (HF/6-31G*) and the partial least-squares (PLS) method. Three principal components explained 89.55% of the total variance, with Q² = 0.83 and R² = 0.92. From a set of 10 proposed artemisinin derivatives (with unknown antimalarial activity against Plasmodium falciparum), a novel compound was produced with superior antimalarial activity compared with the compounds previously described in the literature [20]. Cardoso et al. (2008) studied artemisinin and some of its derivatives with activity against D-6 strains of Plasmodium falciparum using the HF/3-21G method. To verify the reliability of the geometry obtained, Cardoso et al. compared the structural parameters of the artemisinin trioxane ring with theoretical and experimental values from the literature. The MEP was used in an attempt to identify the key features of the compounds that are necessary for their activity, and these features were used to propose new artemisinin derivatives [21]. There is, however, still a need to discover new antimalarial drugs using the techniques of quantum chemistry, which have been used extensively in the pharmaceutical industry, where predicting the probable activity of a drug from molecular orbital calculations is much less expensive than manufacturing it in order to perform costly tests [22].

Figure 1. (a) The artemisinin structure and the region essential for expression of the biological activity (pharmacophore), visualized using the ChemSketch 12.00 program [23]. (b) Map of molecular electrostatic potential (MEP). (c) and (d) HOMO and LUMO orbital energies. The MEP, HOMO and LUMO were calculated using the Hartree-Fock (HF) method with the HF/6-31G** basis set and visualized with the Molekel program [24].

The Importance and Application of the Methods of Quantum Chemistry to Molecular Modeling
The central importance of quantum chemistry lies in obtaining solutions of the Schrödinger equation for the accurate determination of the properties of atomic and molecular systems. Within physical chemistry, quantum chemistry is used to calculate thermodynamic properties such as the entropy and heat capacity of gases, to interpret molecular spectra, to calculate bond lengths, bond angles and dipole moments, and to understand intermolecular forces. In organic chemistry, quantum chemistry can estimate the stability of molecules, calculate the properties of reaction intermediates, reproduce the aromaticity of organic compounds and simulate nuclear magnetic resonance (NMR) spectra. In analytical chemistry it is used to interpret the intensities of spectral lines. In inorganic chemistry it is used in ligand field theory, where predictions are made and the properties of transition-metal complex ions are explained. In biochemistry, quantum chemical calculations are used to study the conformation of biological molecules, enzyme-substrate binding and the solvation of organic molecules [25]-[28].

[Figure 1 panel labels: HOMO = 0.18339 eV; the LUMO value was not recovered.]
The application of quantum chemical methods in the study and planning of bioactive compounds has become common practice. From the point of view of planning, it is important to note that molecular modeling (a collective term for the theoretical methods and computational techniques used to model or mimic the behavior of molecules) is not intended to arrive at a bioactive molecule simply through the use of computer programs. Because of its complexity, the development of these molecules necessarily involves a multidisciplinary approach, which employs a large set of computational methods in a systematic way to facilitate and optimize the development of bioactive compounds, in constant exchange of information with the groups responsible for chemical synthesis and for evaluating the activity of these compounds. These computational methods can be used as tools for the rational design of bioactive compounds, so called because it is guided by a rational hypothesis about the mechanism of action of these compounds. The action of bioactive molecules is a very complex phenomenon, but one of the paradigms of medicinal chemistry is that the effects of these molecules arise from their interactions or chemical reactions with macromolecular structures present in living systems, which are, in the great majority, proteins. When these proteins are cell receptors, the bioactive molecules are classified as receptor agonists or antagonists; when they are enzymes, the molecules act as enzyme inhibitors [29] [30].

The Origin of Quantum Chemistry Methods
The history and debates related to the so-called "semiempirical method" of London-Eyring-Polanyi are reported in a fascinating work by Nye [31]. The approach of Eyring and Polanyi, which aimed to merge theory with experimental results to construct potential energy surfaces, showed that it is possible to gain insight into the mechanisms of adiabatic reactions, leading to important concepts in the dynamics of chemical reactions, such as the transition state and the activated complex. Already at that time there were debates comparing first-principles approaches with semiempirical ones. In 1933 James and Coolidge [32] published an ab initio calculation that obtained the binding energy of the H$_2$ molecule with an accuracy of 98%. This calculation took a year to complete, but its success increased the confidence of the authors, who then criticized the semiempirical method as a "happy cancellation of errors" that occurs because "terms of considerable importance" are not taken into account [32]. The expression "cancellation of errors" remains to this day in the minds of researchers, although it is not possible to identify precisely what these errors are and, above all, how and why they would cancel. Polanyi counter-argued in 1937: "Personally, I do not attach importance to an exact agreement between theory and experiment at this stage, but I believe that the theory [referring to the semiempirical method] can claim to provide a reasonable description of the mechanism of chemical reactions that would otherwise remain obscured" [33].
In 1941 Hirschfelder presented the calculation of at least one hundred activation energies for several reactions using the London-Eyring-Polanyi method, showing that only in some cases did the values disagree with experiment, with an error of 10 kcal/mol [34]. Hirschfelder defended the semiempirical method as "a method sufficiently flexible so that it could be made consistent with any set of chemical facts, maintaining consistent with the basic principles of quantum mechanics". According to Nye [31], in this statement Hirschfelder correctly identified that an ad hoc character is inherent to semiempirical approaches. This ad hoc character involves risk, since it may well be possible to force agreement between the semiempirical model and experiment even if the model is based on an erroneous theory. The risk is even greater if the model is used for extrapolation. The final triumph of the first-principles approach for the reaction H + H$_2$ → H$_2$ + H came only in 2003, when Mielke and colleagues [35] presented new experimental and theoretical rate constants. The theoretical calculations were obtained using quantum dynamics on an exact adiabatic potential surface including Born-Oppenheimer corrections. Experiment and theory now agree in the range from 167 K to 2112 K, within experimental error, so that the problem can now be considered solved.
The expression "semiempirical" was first used in theoretical chemistry in 1931 by Michael Polanyi (1891-1976) and Henry Eyring (1901-1981) [36] [37] in their attempt to combine thermodynamics, chemical kinetics, quantum mechanics and valence-bond theory. In these 82 years, the semiempirical method of London-Eyring-Polanyi for constructing potential energy surfaces has provided a useful working tool for gaining insight into how physical and chemical processes occur, and for motivating the development of new techniques and experiments to study the temporary combination of atoms called the transition state, as in the work of Ahmed Zewail. The novelty of visual maps of potential energy surfaces, and the language of potential wells and activation barriers, has also become an important pedagogical tool in chemistry. In contrast, first-principles quantum dynamics calculations with an accuracy equivalent to that of Mielke and colleagues [35] remain prohibitive, almost impossible to perform for the vast majority of chemical reactions of practical interest. Undoubtedly, ab initio methods, based entirely on first principles, will eventually prevail. However, if the reaction of the hydrogen atom with the hydrogen molecule could be regarded as definitively settled only 10 years ago, what about all the other chemical and biochemical reactions? [38]

Hartree-Fock Equations
A development of great importance in quantum chemistry was the calculation of accurate wave functions for many diatomic and polyatomic molecules by the self-consistent field method developed by Douglas Hartree [39]. In Hartree-Fock theory the wave function is formed by an antisymmetric linear combination of products of spin-orbitals. The Hartree-Fock wave function for atoms or molecules, which must obey the Pauli exclusion principle, is therefore written as an antisymmetrized product of spin-orbitals, called a Slater determinant. In the restricted, normalized form for a closed-shell system containing 2m electrons, it is given by:

$$\Psi = \frac{1}{\sqrt{(2m)!}}
\begin{vmatrix}
\phi_1(1)\alpha(1) & \phi_1(1)\beta(1) & \cdots & \phi_m(1)\alpha(1) & \phi_m(1)\beta(1) \\
\phi_1(2)\alpha(2) & \phi_1(2)\beta(2) & \cdots & \phi_m(2)\alpha(2) & \phi_m(2)\beta(2) \\
\vdots & \vdots & \ddots & \vdots & \vdots \\
\phi_1(2m)\alpha(2m) & \phi_1(2m)\beta(2m) & \cdots & \phi_m(2m)\alpha(2m) & \phi_m(2m)\beta(2m)
\end{vmatrix} \tag{1}$$

in which each spin-orbital is the product of a spatial function and a spin function ($\alpha$ or $\beta$):

$$\psi_i^{\alpha}(x,\omega) = \phi_i(x)\,\alpha(\omega), \qquad \psi_i^{\beta}(x,\omega) = \phi_i(x)\,\beta(\omega) \tag{2}$$

The one-electron functions $\phi_i(x)$ are known as "orbitals", a term proposed by Mulliken [40], and are the quantum mechanical analogue of the classical orbit [41]. The orbitals are taken to be orthonormal:

$$\int \phi_i^*(x)\,\phi_j(x)\,dx = \langle \phi_i | \phi_j \rangle = \delta_{ij} \tag{3}$$

All elements in a given column of the Slater determinant involve the same spin-orbital, whereas all elements in the same row involve the same electron. When an atom or molecule has a closed-shell configuration, it is represented by a single determinant, unlike an open-shell configuration, for which a sum of Slater determinants is used. The analysis that follows refers to closed-shell systems. Open-shell configurations are not discussed here, in view of the complexity of the formulas. To keep the expressions simple, atomic units are used in the equations.
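The antisymmetry that the Slater determinant builds into the wave function can be checked numerically. The following is a minimal sketch for two electrons, assuming two made-up one-dimensional functions (`phi_a`, `phi_b`) that stand in for spin-orbitals; they are hypothetical and not taken from the paper.

```python
import math

# Hypothetical one-dimensional "orbitals", chosen only for illustration.
def phi_a(x: float) -> float:
    return math.exp(-x * x)

def phi_b(x: float) -> float:
    return x * math.exp(-x * x)

def slater_2e(x1: float, x2: float) -> float:
    """2x2 Slater determinant of phi_a and phi_b with the 1/sqrt(2!) prefactor.

    Rows correspond to electrons (x1, x2), columns to orbitals (phi_a, phi_b).
    """
    return (phi_a(x1) * phi_b(x2) - phi_b(x1) * phi_a(x2)) / math.sqrt(2.0)

psi_12 = slater_2e(0.3, 1.1)     # electron 1 at 0.3, electron 2 at 1.1
psi_21 = slater_2e(1.1, 0.3)     # the two electrons exchanged
psi_same = slater_2e(0.7, 0.7)   # both electrons at the same coordinate
```

Exchanging the two electrons changes the sign of the determinant, and placing both electrons at the same coordinate makes it vanish, which is how the determinant form enforces the Pauli exclusion principle.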
The total Hartree-Fock electronic energy E for a system of electrons and nuclei in its ground state is given, according to the variational theorem, by

$$E = \langle \Psi | \hat{H} | \Psi \rangle \tag{4}$$

where $\Psi$ is the wave function in the form of a Slater determinant and $\hat{H}$ is the non-relativistic Hamiltonian operator, which does not involve the spin coordinates of the system. For molecules, as in our case, $\hat{H}$ is the purely electronic Hamiltonian obtained after the Born-Oppenheimer separation [42], which in atomic units has the form:

$$\hat{H} = -\frac{1}{2}\sum_{i} \nabla_i^2 - \sum_{i}\sum_{A} \frac{Z_A}{r_{iA}} + \sum_{i}\sum_{j \gt i} \frac{1}{r_{ij}} \tag{5}$$

The first term in Equation (5) is the operator for the kinetic energy of the electrons, the second term is the potential energy of attraction between electrons and nuclei, and the last term is the potential energy of repulsion between the electrons. $Z_A$ is the charge of nucleus A, $r_{iA}$ is the distance from electron i to nucleus A and $r_{ij}$ is the distance from electron i to electron j.
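The operator $\hat{H}$ can be separated into two other operators:

$$\hat{H} = \hat{H}_1 + \hat{H}_2 \tag{6}$$

The operator $\hat{H}_1$ accounts for the kinetic energy of the electrons and the potential energy of their interaction with the nuclei:

$$\hat{H}_1 = \sum_{i} \hat{h}(i) \tag{7}$$

The operator $\hat{h}(i)$ is the Hamiltonian for the motion of one electron in the field generated by the nuclei alone:

$$\hat{h}(i) = -\frac{1}{2}\nabla_i^2 - \sum_{A} \frac{Z_A}{r_{iA}} \tag{8}$$

The operator $\hat{H}_2$ is associated with the interelectronic repulsion:

$$\hat{H}_2 = \sum_{i}\sum_{j \gt i} \frac{1}{r_{ij}} \tag{9}$$

Using the wave function (1) and the Hamiltonian (5) in the energy functional (4) gives the total electronic energy of a system represented by a wave function in the form of a Slater determinant:

$$E = 2\sum_{i=1}^{m} h_{ii} + \sum_{i=1}^{m}\sum_{j=1}^{m} \left( 2J_{ij} - K_{ij} \right) \tag{10}$$

This relation reduces the many-electron integration to a set of three-dimensional integrals ($h_{ii}$) and six-dimensional integrals ($J_{ij}$ and $K_{ij}$), written as follows:

$$h_{ii} = \int \phi_i^*(1)\,\hat{h}(1)\,\phi_i(1)\,d\tau_1 \tag{11}$$

$$J_{ij} = \iint \phi_i^*(1)\,\phi_j^*(2)\,\frac{1}{r_{12}}\,\phi_i(1)\,\phi_j(2)\,d\tau_1\,d\tau_2 \tag{12}$$

$$K_{ij} = \iint \phi_i^*(1)\,\phi_j^*(2)\,\frac{1}{r_{12}}\,\phi_j(1)\,\phi_i(2)\,d\tau_1\,d\tau_2 \tag{13}$$

The one-electron integral $h_{ii}$ is the sum of the kinetic energy of an electron in the orbital $\phi_i$ and its potential energy due to the nuclei. The Coulomb integrals $J_{ij}$ and exchange integrals $K_{ij}$ are associated with the interaction between an electron in the orbital $\phi_i$ and another in the orbital $\phi_j$. In classical terms, the Coulomb integral is the interaction energy between two charge distributions, whereas the exchange integral has no classical analogue; it arises from the antisymmetry of the wave function [41]. Varying E in Equation (4) with respect to each orbital and setting $\delta E = 0$, while keeping the orbitals orthonormal, leads to the one-electron equations that define the Hartree-Fock method:

$$\hat{F}(1)\,\phi_i(1) = \varepsilon_i\,\phi_i(1) \tag{14}$$

where the Fock operator is

$$\hat{F}(1) = \hat{h}(1) + \sum_{j=1}^{m} \left[ 2\hat{J}_j(1) - \hat{K}_j(1) \right] \tag{15}$$

and $\hat{J}_j$ and $\hat{K}_j$ are the Coulomb and exchange operators, respectively:

$$\hat{J}_j(1)\,\phi_i(1) = \left[ \int \phi_j^*(2)\,\frac{1}{r_{12}}\,\phi_j(2)\,d\tau_2 \right] \phi_i(1) \tag{16}$$

$$\hat{K}_j(1)\,\phi_i(1) = \left[ \int \phi_j^*(2)\,\frac{1}{r_{12}}\,\phi_i(2)\,d\tau_2 \right] \phi_j(1) \tag{17}$$

The Hartree-Fock equations are generally solved by an iterative procedure (SCF). At the end of the procedure, the $\varepsilon_i$ are the one-electron eigenvalues of the HF system. Each $\varepsilon_i$ is often called an orbital energy and is interpreted as the energy of an electron in the orbital $\phi_i$, resulting from its kinetic energy, its attraction to the nuclei, and its repulsion and exchange energies due to the charge density of all the other electrons. Multiplying Equation (14) by $\phi_i^*(1)$ and integrating gives

$$\varepsilon_i = \int \phi_i^*(1)\,\hat{F}(1)\,\phi_i(1)\,d\tau_1 \tag{18}$$

Substituting Equation (15) in (18), we can relate the orbital energies to the integrals (11), (12) and (13):

$$\varepsilon_i = h_{ii} + \sum_{j=1}^{m} \left( 2J_{ij} - K_{ij} \right) \tag{19}$$

The total electronic energy E can also be found from the orbital energies. This total energy is not, however, simply the sum of the one-electron energies, because that sum counts each electron-electron interaction twice: the repulsion between electrons 1 and 2, for example, contributes to the one-electron energy of both electrons. The second term in the equation below corrects this double counting:

$$E = 2\sum_{i=1}^{m} \varepsilon_i - \sum_{i=1}^{m}\sum_{j=1}^{m} \left( 2J_{ij} - K_{ij} \right) \tag{20}$$

Substituting (19) in (20) gives

$$E = 2\sum_{i=1}^{m} \left[ h_{ii} + \sum_{j=1}^{m} \left( 2J_{ij} - K_{ij} \right) \right] - \sum_{i=1}^{m}\sum_{j=1}^{m} \left( 2J_{ij} - K_{ij} \right) \tag{21}$$

which equals

$$E = 2\sum_{i=1}^{m} h_{ii} + \sum_{i=1}^{m}\sum_{j=1}^{m} \left( 2J_{ij} - K_{ij} \right) \tag{22}$$

with $h_{ii}$ as defined in Equation (11).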
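The double counting of the electron-electron interaction described above can be illustrated with a short numerical sketch. The integrals below are made-up numbers for m = 2 doubly occupied orbitals (an assumption for illustration only, not results from the paper); the code checks that correcting the sum of orbital energies gives the same closed-shell energy as the expression written directly in terms of $h_{ii}$, $J_{ij}$ and $K_{ij}$.

```python
# Illustrative (made-up) integrals for m = 2 doubly occupied orbitals, in hartree.
h = [-1.20, -0.80]                  # one-electron integrals h_ii
J = [[0.60, 0.45], [0.45, 0.50]]    # Coulomb integrals J_ij
K = [[0.60, 0.10], [0.10, 0.50]]    # exchange integrals K_ij (K_ii equals J_ii)

m = len(h)

def two_electron_sum(i: int) -> float:
    """Sum over j of (2 J_ij - K_ij) for orbital i."""
    return sum(2.0 * J[i][j] - K[i][j] for j in range(m))

# Orbital energies: eps_i = h_ii + sum_j (2 J_ij - K_ij)
eps = [h[i] + two_electron_sum(i) for i in range(m)]

# Total electron-electron term: sum_i sum_j (2 J_ij - K_ij)
repulsion = sum(two_electron_sum(i) for i in range(m))

naive_sum = 2.0 * sum(eps)                  # counts each repulsion twice
e_from_eps = naive_sum - repulsion          # corrected total energy
e_from_h = 2.0 * sum(h) + repulsion         # energy written with h_ii, J_ij, K_ij
```

With these numbers the plain sum of orbital energies overshoots the total energy by exactly the electron-electron term, which is the correction subtracted in the text.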

Hartree-Fock-Roothaan Method
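The difficulty of solving the Hartree-Fock equations for molecules arises from the absence of central symmetry. It is therefore necessary to use approximations to obtain the best orbitals. For systems containing many electrons, an approximate way of solving the Hartree-Fock equations consists of expanding the Hartree-Fock orbitals as a linear combination of K basis functions $\chi_\mu$, as proposed by Roothaan [43]; this method is called Hartree-Fock-Roothaan (HFR) or Linear Combination of Atomic Orbitals-Molecular Orbital (LCAO-MO). The orbitals are thus expanded as follows:

$$\phi_i(x) = \sum_{\mu=1}^{K} c_{\mu i}\,\chi_\mu(x) \tag{23}$$

where the $c_{\mu i}$ are the expansion coefficients, which are treated as variational parameters, and the $\chi_\mu$ are the basis functions. For molecules, the $\phi_i(x)$ are molecular orbitals and the $\chi_\mu$ are atomic orbitals. Expanding the orbitals in terms of a finite set of basis functions makes the calculations far more tractable computationally: the integro-differential equations are transformed into algebraic equations for the expansion coefficients [41]. The total electronic energy is obtained when Equation (23) is substituted in (21):

$$E = \sum_{\mu\nu} P_{\mu\nu}\,h_{\mu\nu} + \frac{1}{2}\sum_{\mu\nu\lambda\sigma} P_{\mu\nu}\,P_{\lambda\sigma} \left[ \langle \mu\lambda | \nu\sigma \rangle - \frac{1}{2}\langle \mu\nu | \lambda\sigma \rangle \right] \tag{26}$$

where, for real basis functions and coefficients,

$$P_{\mu\nu} = 2\sum_{i=1}^{m} c_{\mu i}\,c_{\nu i} \tag{27}$$

$$h_{\mu\nu} = \int \chi_\mu(1)\,\hat{h}(1)\,\chi_\nu(1)\,d\tau_1 \tag{28}$$

with $\hat{h}(1)$ given by Equation (8) with i replaced by 1. The integrals $\langle \mu\lambda | \nu\sigma \rangle$ and $\langle \mu\nu | \lambda\sigma \rangle$ are two-electron interaction integrals, represented by

$$\langle \mu\lambda | \nu\sigma \rangle = \iint \chi_\mu(1)\,\chi_\lambda(2)\,\frac{1}{r_{12}}\,\chi_\nu(1)\,\chi_\sigma(2)\,d\tau_1\,d\tau_2 \tag{29}$$

The best coefficients $c_{\mu i}$ are determined by varying the electronic energy given by Equation (26) with respect to them, subject to the orthonormality condition. This leads to the Hartree-Fock-Roothaan equations

$$\sum_{\nu} F_{\mu\nu}\,c_{\nu i} = \sum_{j}\sum_{\nu} S_{\mu\nu}\,c_{\nu j}\,\varepsilon_{ji} \tag{30}$$

where $S_{\mu\nu} = \int \chi_\mu(1)\,\chi_\nu(1)\,d\tau_1$ is the overlap integral and the Fock matrix is defined as

$$F_{\mu\nu} = h_{\mu\nu} + \sum_{\lambda\sigma} P_{\lambda\sigma} \left[ \langle \mu\lambda | \nu\sigma \rangle - \frac{1}{2}\langle \mu\nu | \lambda\sigma \rangle \right] \tag{31}$$

Substituting Equation (31) in (30), the Hartree-Fock-Roothaan equations can be written in the following notation:

$$\sum_{\nu} \left( F_{\mu\nu}\,c_{\nu i} - \sum_{j} S_{\mu\nu}\,c_{\nu j}\,\varepsilon_{ji} \right) = 0, \qquad \mu = 1, 2, \ldots, K \tag{32}$$

Applying a unitary transformation to Equation (32) to diagonalize the matrix $\varepsilon$ gives

$$\sum_{\nu} \left( F_{\mu\nu} - \varepsilon_i\,S_{\mu\nu} \right) c_{\nu i} = 0 \tag{33}$$

Equation (33) can be written in matrix form as

$$\mathbf{F}\mathbf{C} = \mathbf{S}\mathbf{C}\boldsymbol{\varepsilon} \tag{34}$$

The Hartree-Fock-Roothaan equations are solved in the same way as the Hartree-Fock equations, that is, by an iterative process (SCF-LCAO-MO). The $\varepsilon_i$ obtained at the end of the procedure are the eigenvalues of the HFR system.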
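Each cycle of the iterative procedure ends by solving a generalized eigenvalue problem for the orbital energies, given the current Fock and overlap matrices. The sketch below handles only the smallest case, two basis functions, by solving det(F − εS) = 0 in closed form. The matrix elements are illustrative numbers assumed for this example (of the size found for a minimal-basis H$_2$ calculation), and `solve_roothaan_2x2` is a hypothetical helper, not part of any program cited in the paper.

```python
import math

def solve_roothaan_2x2(F, S):
    """Return the two roots eps of det(F - eps*S) = 0, in ascending order.

    F and S are real symmetric 2x2 matrices given as nested lists.
    """
    # det(F - eps*S) = a*eps^2 + b*eps + c
    a = S[0][0] * S[1][1] - S[0][1] ** 2
    b = -(F[0][0] * S[1][1] + F[1][1] * S[0][0] - 2.0 * F[0][1] * S[0][1])
    c = F[0][0] * F[1][1] - F[0][1] ** 2
    disc = math.sqrt(b * b - 4.0 * a * c)
    return sorted([(-b - disc) / (2.0 * a), (-b + disc) / (2.0 * a)])

# Illustrative matrix elements in hartree (assumed values, two equivalent centers).
F = [[-0.3655, -0.5939], [-0.5939, -0.3655]]
S = [[1.0, 0.6593], [0.6593, 1.0]]

eps_occupied, eps_virtual = solve_roothaan_2x2(F, S)

# For two equivalent centers the roots are also known in closed form.
eps_plus = (F[0][0] + F[0][1]) / (1.0 + S[0][1])
eps_minus = (F[0][0] - F[0][1]) / (1.0 - S[0][1])
```

In a full SCF calculation the Fock matrix depends on the coefficients through the density matrix, so this step is repeated, rebuilding F from the new coefficients, until the orbital energies and the density stop changing.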

Hartree-Fock Limit
The Hartree-Fock method uses an electronic wave function composed of a single Slater determinant and therefore provides only an approximate description of the exact wave function, which cannot be described by a single Slater determinant. The exact solution of the Schrödinger equation is not obtained when the spatial orbitals $\phi_i(x)$ are expanded as a linear combination of basis functions $\chi_\mu$; however, the larger and more complete the set of these functions, the greater the flexibility of the expansion for the spin-orbitals and the lower the expectation value of the energy. Larger basis sets lower the HF energy down to a certain limit. This limit is the lowest energy that can be obtained from a single-determinant wave function and is called the Hartree-Fock limit. Even so, because the method is variational, this limiting energy $E_{HF}$ still lies above the exact non-relativistic energy $E_{ex}(NR)$ by the electron correlation energy $E_{corr}$:

$$E_{corr} = E_{ex}(NR) - E_{HF} \tag{35}$$

The SCF method is a valid approach, but it introduces errors in the energy because it describes the interactions between the electrons only approximately. The instantaneous interactions between the electrons should also be considered. The motions of the electrons are correlated with each other, that is, the positions of the different electrons are correlated, and this must be taken into account.
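As a small worked example of the correlation energy, the sketch below applies the definition to the helium atom. The two input energies are commonly quoted literature values, rounded to four decimals and used here as assumptions; they do not come from the paper.

```python
def correlation_energy(e_exact_nr: float, e_hf_limit: float) -> float:
    """Correlation energy: exact non-relativistic energy minus the HF-limit energy."""
    return e_exact_nr - e_hf_limit

# Helium atom, in hartree (assumed literature values, rounded).
E_HF_LIMIT_HE = -2.8617
E_EXACT_NR_HE = -2.9037

e_corr_he = correlation_energy(E_EXACT_NR_HE, E_HF_LIMIT_HE)
```

The result is negative, about -0.042 hartree, which reflects the statement in the text that the Hartree-Fock limit lies above the exact non-relativistic energy.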
The exact value of the energy E, including the electron correlation energy and the relativistic effects $E_{rel}$, is given by:

$$E = E_{HF} + E_{corr} + E_{rel} \tag{36}$$

Application of Split-Valence Basis Sets Used in Molecular Modeling
To perform atomic and molecular calculations, one must choose a basis set formed by mathematical functions. A basis set expands the one-electron wave functions as linear combinations of a finite number of basis functions, which contain a set of parameters to be optimized. The name of a basis set depends on the type of basis functions, on the number of functions employed in the expansion of the one-electron functions (orbitals) and on the characteristics of the parameters to be optimized. A careful choice of these functions is therefore of fundamental importance when accurate results are required. The main types of basis functions and the split-valence basis sets available in computational chemistry programs for calculating molecular properties are presented below:

Slater Type Functions
One-electron atomic systems (hydrogenic atoms) have as solutions of the non-relativistic Schrödinger equation functions of the type

$$\psi_{nlm}(r, \theta, \varphi) = R_{nl}(r)\,Y_{lm}(\theta, \varphi) \tag{37}$$

where n, l and m are the principal, orbital angular momentum and magnetic quantum numbers, and $Y_{lm}(\theta, \varphi)$ is the angular part, called a spherical harmonic. The radial part has the form

$$R_{nl}(r) = N_{nl} \left( \frac{2Zr}{na} \right)^{l} L_{n-l-1}^{2l+1}\!\left( \frac{2Zr}{na} \right) e^{-Zr/na} \tag{38}$$

where $N_{nl}$ is a normalization factor, Z is the nuclear charge, r is the radius, a is a constant and $L_{n-l-1}^{2l+1}$ is an associated Laguerre polynomial (a polynomial in r). The angular part is

$$Y_{lm}(\theta, \varphi) = N_{lm}\,P_l^{m}(\cos\theta)\,e^{im\varphi} \tag{39}$$

with $N_{lm}$ a normalization factor and $P_l^{m}$ the associated Legendre polynomial. Although hydrogenic orbitals are orthogonal, they do not form a complete set of functions. They have limited application because many of the integrals required in molecular orbital calculations are hard to evaluate, especially for high values of the principal quantum number, because of the complexity of the polynomial in r. Slater (1930) therefore proposed a simpler analytical form for the radial function, introducing the Slater type functions (STFs) [42]:

$$R(r) = N\,r^{n^*-1}\,e^{-\zeta r} \tag{40}$$

The orbital exponent is written as

$$\zeta = \frac{Z - s}{n^*} \tag{41}$$

where s is a constant related to the shielding effect of the electrons in the inner shells of the atom and $n^*$ is an effective principal quantum number. The general form of the Slater type functions can therefore be written as follows, which is similar in form to the hydrogenic function:

$$\chi_{nlm}(r, \theta, \varphi) = N\,r^{n^*-1}\,e^{-\zeta r}\,Y_{lm}(\theta, \varphi) \tag{42}$$
Slater proposed empirical rules for selecting the parameters s and $n^*$, aiming at a good approximation to the best atomic orbitals of this type. The values of $\zeta$ were determined variationally by Clementi (1963 and 1967), using the SCF method, for neutral atoms up to ruthenium in their ground states [44] [45]. The exponents $\zeta$ are positive numbers that can be adjusted in the calculation methods. These exponents determine the size of the orbital: large exponents characterize compact (dense) orbitals and small exponents characterize diffuse orbitals.
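A short sketch of how the orbital exponent follows from the shielding constant is given below for a carbon 2p electron. The shielding contributions used (0.35 for each other electron in the same n = 2 group and 0.85 for each electron in the n = 1 shell) are the usual values of Slater's empirical rules, quoted here as an assumption since the paper does not list them; `slater_zeta` is a hypothetical helper name.

```python
def slater_zeta(nuclear_charge: float, shielding: float, n_effective: float) -> float:
    """Orbital exponent zeta = (Z - s) / n*."""
    return (nuclear_charge - shielding) / n_effective

# Carbon (Z = 6), electron in the 2s/2p group:
# three other electrons in the same group and two electrons in the 1s shell.
s_carbon_2p = 3 * 0.35 + 2 * 0.85          # shielding constant s = 2.75
zeta_carbon_2p = slater_zeta(6.0, s_carbon_2p, 2.0)

# A larger exponent gives a more compact orbital: compare with fluorine (Z = 9),
# which has six other electrons in the n = 2 group.
s_fluorine_2p = 6 * 0.35 + 2 * 0.85
zeta_fluorine_2p = slater_zeta(9.0, s_fluorine_2p, 2.0)
```

Under these rules the fluorine 2p exponent comes out larger than the carbon one, consistent with the statement that large exponents describe more compact orbitals.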
The so-called Slater type orbitals (STOs) are formed by the product of the angular part, Equation (39), and the radial part, Equation (40). STOs give reasonable representations of atomic orbitals. However, because they replace the polynomial in r of the hydrogenic orbitals by a single power of r, they do not have the proper number of radial nodes and do not represent the inner part of an orbital well.
One limitation of STOs is that they are not mutually orthogonal, although this can be corrected by using a set of orthogonalized STOs. In addition, the use of these orbitals in molecular SCF calculations makes the multicenter integrals, which involve interactions between electrons, difficult to solve numerically, thereby increasing the computational time.
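The missing radial node can be seen in a simple numerical comparison of a hydrogenic 2s radial function with a Slater-type 2s radial function. This is a minimal sketch in atomic units with Z = 1 and a = 1; both functions are unnormalized, and the exponent `zeta = 0.5` and the grid are arbitrary choices made for illustration.

```python
import math

def hydrogenic_2s(r: float) -> float:
    """Unnormalized hydrogenic 2s radial function for Z = 1: (2 - r) exp(-r/2)."""
    return (2.0 - r) * math.exp(-r / 2.0)

def slater_2s(r: float, zeta: float = 0.5) -> float:
    """Unnormalized Slater-type radial function with n = 2: r exp(-zeta r)."""
    return r * math.exp(-zeta * r)

def count_sign_changes(f, r_min: float = 0.05, r_max: float = 20.0, steps: int = 400) -> int:
    """Count sign changes of f on a uniform radial grid (a proxy for radial nodes)."""
    values = [f(r_min + k * (r_max - r_min) / steps) for k in range(steps + 1)]
    return sum(1 for u, v in zip(values, values[1:]) if u * v < 0.0)

nodes_hydrogenic = count_sign_changes(hydrogenic_2s)
nodes_slater = count_sign_changes(slater_2s)
```

The hydrogenic function changes sign once (at r = 2), while the single power of r in the Slater-type function never does.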

Minimal Basis Sets
A minimal basis set, or "single-zeta" basis, contains a single function to represent each occupied atomic orbital with distinct quantum numbers n and l in the electron configuration. Such a set therefore requires little computational time and can be used in calculations involving large molecules, which is our case. Calculations with a minimal basis set generally use Slater type orbitals. Because of the small size of the basis, minimal basis sets yield only qualitative results for the properties. Nevertheless, until around 1960 electronic structure calculations were all performed with minimal bases. The first widely used semiempirical methods were based on a minimal basis set of STOs [46] [47].

Contracted Gaussian Functions
The use of STOs in electronic structure calculations creates computational problems because of the multicenter integrals that arise, although these orbitals describe the functional behavior of molecular orbitals well. To simplify the calculation of multicenter integrals, Boys (1950) proposed the use of Gaussian type functions (GTFs) in calculations involving molecules [48]:

$$g_{nlm}(r, \theta, \varphi) = N\,r^{n-1}\,e^{-\alpha r^2}\,Y_{lm}(\theta, \varphi)$$

where n and l are the principal and orbital angular momentum quantum numbers and $\alpha$ is the orbital exponent, which is a variational parameter. The use of GTFs in the calculation of electron interaction integrals has an enormous advantage: the product of two of these functions centered at different points is equivalent to a single function centered at a new point [49], so that multicenter integrals are reduced to integrals over functions centered at the same point.
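The product property mentioned above can be verified numerically for s-type Gaussians. The sketch below assumes unnormalized functions exp(-alpha |r - A|^2); the exponents, centers and test point are arbitrary values chosen for illustration, and the helper names are hypothetical.

```python
import math

def s_gaussian(alpha, center, point):
    """Unnormalized s-type Gaussian exp(-alpha * |point - center|^2)."""
    r2 = sum((p - c) ** 2 for p, c in zip(point, center))
    return math.exp(-alpha * r2)

def gaussian_product(alpha, A, beta, B):
    """Return (prefactor, exponent, center) of the single Gaussian equal to the product."""
    p = alpha + beta
    new_center = tuple((alpha * a + beta * b) / p for a, b in zip(A, B))
    ab2 = sum((a - b) ** 2 for a, b in zip(A, B))
    prefactor = math.exp(-alpha * beta / p * ab2)
    return prefactor, p, new_center

alpha, A = 0.8, (0.0, 0.0, 0.0)
beta, B = 1.3, (0.0, 0.0, 1.4)
r = (0.2, -0.1, 0.5)

lhs = s_gaussian(alpha, A, r) * s_gaussian(beta, B, r)   # product of two Gaussians
pref, p, P = gaussian_product(alpha, A, beta, B)
rhs = pref * s_gaussian(p, P, r)                         # one Gaussian at the new center
```

The new center lies on the line joining the two original centers, weighted by the exponents, which is what reduces a two-center product to a one-center function.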
Gaussian type functions describe atomic orbitals less well than Slater type functions, because GTFs do not have the "cusp" in the region near the nucleus. GTFs therefore have a functional behavior different from that observed for molecular orbitals, so that two (2) to five (5) GTFs are needed to represent each STO adequately. Since the number of electron interaction integrals for a basis set of dimension m grows as $m^4$, the greater speed and simplicity of evaluating these integrals with GTFs compensates for the larger number of integrals to be calculated compared with STFs.
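The absence of the cusp can be made concrete by comparing a 1s Slater function with a fixed combination of three Gaussians fitted to it. The exponents and coefficients below are the commonly tabulated three-Gaussian fit to a 1s Slater function with zeta = 1; they are quoted as an assumption and are not taken from the paper.

```python
import math

# Three-Gaussian fit to a 1s Slater function with zeta = 1.0
# (commonly tabulated values, used here as an assumption).
ALPHAS = (0.109818, 0.405771, 2.22766)
COEFFS = (0.444635, 0.535328, 0.154329)

def slater_1s(r: float, zeta: float = 1.0) -> float:
    """Normalized 1s Slater function."""
    return math.sqrt(zeta ** 3 / math.pi) * math.exp(-zeta * r)

def three_gaussian_1s(r: float) -> float:
    """Fixed linear combination of three normalized s-type Gaussians."""
    total = 0.0
    for alpha, d in zip(ALPHAS, COEFFS):
        norm = (2.0 * alpha / math.pi) ** 0.75
        total += d * norm * math.exp(-alpha * r * r)
    return total

def radial_slope(f, r: float, step: float = 1.0e-4) -> float:
    """Forward finite-difference estimate of df/dr at r."""
    return (f(r + step) - f(r)) / step

slope_slater_at_nucleus = radial_slope(slater_1s, 0.0)
slope_gaussian_at_nucleus = radial_slope(three_gaussian_1s, 0.0)
```

At the nucleus the Slater function has a finite negative slope (the cusp), whereas the Gaussian combination is flat and underestimates the amplitude; away from the nucleus, for example at r = 1, the two functions agree closely.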
Solving the SCF equations in an HFR calculation also requires an enormous amount of computational time, which is likewise proportional to the fourth power of the number of basis functions. Moreover, the number of cycles of the iterative SCF procedure increases with the number of coefficients to be optimized. The use of contracted GTFs, formed from linear combinations of primitive Gaussians, is therefore generally more suitable [47] [50] [51].
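The fourth-power growth can be illustrated by counting two-electron integrals for a basis of dimension m. The sketch below gives both the full count, m^4, and the number of distinct integrals for real basis functions once the permutational symmetry of the indices is used; the function names are hypothetical.

```python
def n_two_electron_integrals(m: int) -> int:
    """Total number of two-electron integrals over m basis functions."""
    return m ** 4

def n_unique_two_electron_integrals(m: int) -> int:
    """Distinct two-electron integrals for real basis functions.

    Each electron contributes an unordered pair of functions, and the two
    pairs can themselves be exchanged.
    """
    pairs = m * (m + 1) // 2
    return pairs * (pairs + 1) // 2

counts = {m: (n_two_electron_integrals(m), n_unique_two_electron_integrals(m))
          for m in (2, 10, 100)}
```

Symmetry reduces the count by a factor approaching eight for large m, but the growth remains of fourth order, which is why the cost per integral matters so much.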
The contracted basis functions can be chosen to resemble STOs, HF atomic orbitals or any other set of functions. One type of contracted GTF set is STO-NG. This name denotes a set of contracted GTFs that describes each STF through N primitive GTFs used in the contraction. Each STO is thus approximated as a linear combination of N GTFs, in which the contraction coefficients and exponents are chosen so that the basis function approximates an STF. The computational time decreases, but the results generated by this set of contracted GTFs are not very accurate.
The ns and np STOs are approximated by their respective contracted functions:

$$\phi_{ns}(r) = \sum_{k=1}^{N} d_{ns,k}\,g_{1s}(\alpha_{nk}, r), \qquad \phi_{np}(r) = \sum_{k=1}^{N} d_{np,k}\,g_{2p}(\alpha_{nk}, r)$$

where $g_{1s}$ and $g_{2p}$ are primitive Gaussians, the $d$ are contraction coefficients and the $\alpha_{nk}$ are the Gaussian exponents.

Split-Valence Basis Sets
In order to obtain better results, many studies have aimed at finding improved basis sets. In recent years, besides the basis sets previously described, other sets have come into use in electronic structure calculations. Among these are the split-valence basis sets [42].
Split-valence basis sets are extended sets of contracted GTFs, of which the most usual are 4-31G, 3-21G, 6-31G and 6-311G. In such sets two functions are used for each valence orbital and only one for each inner-shell orbital, and each of these functions may or may not be a linear combination of primitive Gaussians. This is done because the inner shells contribute little to the chemical properties of interest. Because the inner-shell functions are not doubled, the total energy is affected, but the effect is small for dipole moments, valence ionization potentials, charge densities, dissociation energies and other chemical properties. For example, in the 4-31G basis each inner-shell atomic orbital is described by a single contracted GTF formed by a linear combination of four (4) primitive GTFs. For each valence-shell atomic orbital there are two basis functions: one is a contracted GTF formed by a linear combination of three (3) primitive GTFs, which describes the inner part of the valence orbital, and the other is a single primitive GTF, which describes the outer part of the valence orbital. For the first-row atoms (Li to F) the basis is therefore $(1s, 2s', 2p_x', 2p_y', 2p_z', 2s'', 2p_x'', 2p_y'', 2p_z'')$, and for the hydrogen atom, which has no inner shell, it is $(1s', 1s'')$, where the functions marked with (') are inner functions and those marked with ('') are outer functions.
The 6-31G basis has a structure similar to the one just described, except that each inner-shell orbital is represented by a contracted GTF formed by a linear combination of six (6) primitive GTFs. The 6-311G basis adds one primitive GTF to the 6-31G set to represent a further outer valence layer.
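The bookkeeping described above for split-valence sets can be sketched in a few lines of Python. This is only an illustrative count, assuming the usual reading of a name such as 4-31G (the first digit gives the primitives in each inner-shell function, the remaining digits give the primitives in each valence function); the function names are hypothetical and only hydrogen and the first-row atoms Li to F are handled.

```python
# Illustrative bookkeeping for split-valence basis sets without polarization.
# Function names are hypothetical; only H and the atoms Li to F are handled.

def parse_split_valence(name):
    """Turn a name such as '6-31G' into (core_primitives, valence_parts)."""
    core, valence = name.rstrip("G").split("-")
    return int(core), [int(digit) for digit in valence]


def count_functions(name, atom_kind):
    """Return (contracted functions, primitive GTFs) for one atom.

    atom_kind is 'first_row' (Li to F: 1s inner shell, 2s and 2p valence)
    or 'hydrogen' (1s valence only, no inner shell).
    """
    core, valence_parts = parse_split_valence(name)
    if atom_kind == "first_row":
        core_orbitals, valence_orbitals = 1, 4   # 1s | 2s, 2px, 2py, 2pz
    elif atom_kind == "hydrogen":
        core_orbitals, valence_orbitals = 0, 1   # no inner shell | 1s
    else:
        raise ValueError("only 'first_row' and 'hydrogen' are handled")
    contracted = core_orbitals + valence_orbitals * len(valence_parts)
    primitives = core_orbitals * core + valence_orbitals * sum(valence_parts)
    return contracted, primitives


if __name__ == "__main__":
    for basis in ("3-21G", "4-31G", "6-31G", "6-311G"):
        print(basis,
              "first row:", count_functions(basis, "first_row"),
              "hydrogen:", count_functions(basis, "hydrogen"))
```

For 4-31G this gives nine contracted functions per first-row atom (one inner-shell function plus two for each of the four valence orbitals) and two per hydrogen atom, in line with the description in the text.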

Polarization Function
All basis sets discussed so far have a peculiar characteristic: they comprise functions restricted to be centered on the nuclei. However, there is evidence that atomic orbitals distort, or polarize, when they form a molecule. For this reason one should take into account the possibility of a non-uniform displacement of electric charge away from the atomic nucleus, i.e., polarization. In this way a better description is obtained of the changes and deformations of the atomic orbitals within the molecule. One way to account for polarization is to add to the basis functions whose values of l (the orbital angular momentum quantum number) are larger than those occupied in the ground state of the atom. Such functions are called polarization functions. For the hydrogen atom, the ground state is described using only s functions, so p, d, ... functions centered on H are regarded as polarization functions in molecular calculations. In general, including polarization functions in the molecular basis increases the chance of obtaining better results for many properties of chemical interest, such as dissociation energies and dipole moments. In practice, it is not satisfactory to add polarization functions of d and f symmetry to small s and p basis sets; that is, polarization functions should only be added to basis sets that are said to be saturated [52].
Of the split-valence bases with polarization functions, the most commonly used in molecular calculations are STO-3G*, 3-21G*, 6-31G*, 6-31G**, 6-311G* and 6-311G** [53]. The 6-31G* and 6-31G** bases are formed by adding polarization functions to the 6-31G basis. The 6-31G* basis is constructed by adding to 6-31G a set of d-symmetry polarization GTFs (five pure d functions, or six in the Cartesian form of the standard definition) for each atom other than hydrogen and helium, whereas the 6-31G** basis is constructed by adding to 6-31G* a set of three (3) p-symmetry polarization GTFs for each hydrogen atom.
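To make the cost of adding polarization functions concrete, the following sketch counts contracted basis functions for aspirin (molecular formula C9H8O4) in 6-31G, 6-31G* and 6-31G**. It is a rough count under stated assumptions: nine functions per C or O atom and two per H atom in 6-31G, one d set per heavy atom for the first star, and one p set per hydrogen for the second star. The helper name and the `cartesian_d` switch (six Cartesian versus five pure d functions) are hypothetical conveniences of this example.

```python
# Hypothetical helper: count contracted basis functions for a molecule made
# only of hydrogen and first-row atoms, in 6-31G with optional polarization.

ASPIRIN = {"C": 9, "H": 8, "O": 4}  # molecular formula C9H8O4


def n_basis_6_31g(formula, stars=0, cartesian_d=True):
    """Number of basis functions; stars=0 is 6-31G, 1 is 6-31G*, 2 is 6-31G**."""
    heavy = sum(count for element, count in formula.items() if element != "H")
    hydrogens = formula.get("H", 0)
    # 6-31G: 1s + 2 x (2s, 2px, 2py, 2pz) = 9 per heavy atom, 2 per hydrogen
    total = 9 * heavy + 2 * hydrogens
    if stars >= 1:
        # one set of d functions on every atom other than hydrogen
        total += (6 if cartesian_d else 5) * heavy
    if stars >= 2:
        # one set of three p functions on every hydrogen atom
        total += 3 * hydrogens
    return total


if __name__ == "__main__":
    for stars, label in enumerate(("6-31G", "6-31G*", "6-31G**")):
        print(label, n_basis_6_31g(ASPIRIN, stars))
```

Under these assumptions the basis grows from 133 functions in 6-31G to 235 in 6-31G**, which illustrates why the choice of basis set has a direct impact on computational cost.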

Diffuse Functions
Valence basis sets with polarization functions do not give good results in calculations involving anions, because the electron cloud of anionic systems tends to expand. It is therefore necessary to add appropriate diffuse functions as well, because they allow the orbitals to occupy a larger region of space. Diffuse functions are also important in calculations on transition metals, because metal atoms have d-type orbitals, and these have diffuse character. It thus becomes necessary to add diffuse functions to the basis functions associated with the configuration of the neutral metal atom in order to obtain a better description of the metal complex. The great importance of diffuse functions is that they better describe the molecular orbitals farthest from the nuclei [54].
Several two-dimensional molecular drawing programs are available and easy to use, such as ChemWindow, Isis Draw, ChemDraw [68] and Chem3D [69]. They allow figures and diagrams to be prepared with the desired quality and accuracy, and they facilitate documentation and scientific communication. ChemSketch 12.00 [23] is an advanced drawing program that provides molecular properties, optimization and 3D visualization, can name molecules according to IUPAC rules, and has a large database of chemical structures and laboratory materials. The software automatically calculates the valence of each atom and restricts the construction of the molecule according to the octet rule, unless instructed not to apply this restriction. The user can then request the 3D form of the species studied, which opens another window where the species can be rotated in three dimensions and viewed in different representations, each of which highlights the bonds and the spatial arrangement. The design and 3D visualization of drugs, with the steric factors relevant to biological activity, are important for analyzing the size, volume and shape of the molecules [70] [24].
In the area of molecular modeling, graphics construction and drug design, the program HyperChem [73] stands out as a tool specializing in 3D structures of interest to medicinal, pharmaceutical and organic chemistry. The program makes it possible to design complicated molecules. It is also an alternative in the field of spectroscopy: besides being able to simulate NMR spectra a priori by quantum methods, it contains a database of approximately $10^4$ molecules and is applicable to macromolecules as well as small molecules. The software also includes animations, quantum chemical calculations and molecular mechanics.
The choice of method for energy minimization depends on factors related to the size of the molecule, the availability of parameters and stored data, and computational resources. Molecular models generated by the computer are the result of mathematical equations that estimate the positions and properties of the electrons and nuclei; the calculations explore the characteristics of a structure, providing a new perspective on the molecule [74].
In quantum chemistry the most widely used programs are Gaussian and GaussView, which use the laws of quantum mechanics to predict the energies, structures, properties and vibrational frequencies of molecular systems [75]. GaussView 5.0 runs on Windows and is responsible for building and viewing the structures under study, as well as for generating their input for the calculation program, Gaussian 03W. It includes an advanced molecular modeler, which can be used to build molecules and examine them in three dimensions [75].
Gaussian 03W runs on Windows and Linux and performs computations used in the study of reaction mechanisms, equilibrium geometries of neutral molecules, radicals and ions, and the determination of physicochemical parameters. It evaluates structure, reactivity, thermodynamic properties, energy barriers (transition states) and conformational analysis, employing geometry optimization and theoretical calculations of vibrational spectra. The optimization yields the most appropriate structure for the molecule, taking into account the bond lengths, bond angles and stabilization energy calculated by ( )

Application of Hartree-Fock Method Using Computational Tools
Obtaining molecular properties depends on the method and basis set, and these properties are a means of representing the chemical information contained in the molecular structure of the compound studied. Structure-activity relationships (SAR) represent a core aspect of medicinal chemistry. The fact that a small change in structure leads to a small change in biological activity allows chemists to rationalize substitutions at specific positions, giving them the freedom to modify a molecule to improve various properties such as lipophilicity, bioavailability, and so on without sacrificing potency (to a large extent). From the modelers' perspective, the principle that similar structures have similar activities [76] is a cornerstone of quantitative structure-activity relationship (QSAR) modeling [77] [78]. This information is transformed and encoded to address many chemical, pharmacological and toxicological problems in studies of structure-activity, quantitative structure-activity and quantitative structure-property relationships (SAR, QSAR and QSPR) [79]-[81]. Ferreira et al. [82] studied artemisinin and 18 derivatives with antimalarial activity against W-2 strains of Plasmodium falciparum through quantum chemistry and multivariate analysis. The geometry optimization of the structures was carried out using the Hartree-Fock method and the 3-21G** basis set. Maps of molecular electrostatic potential and molecular docking were used to investigate the interaction between the ligands and the receptor (heme).
Santos et al. [83] used the Hartree-Fock method and the 6-31G** basis set to calculate the molecular properties of artemisinin and 20 derivatives with antimalarial activity. Maps of molecular electrostatic potential (MEPs) and molecular docking were used to investigate the interaction between the ligands and the receptor (heme). Principal component analysis and hierarchical cluster analysis were employed to select the most important descriptors related to activity. The correlation between biological activity and molecular properties was obtained using the partial least squares (PLS) and principal component regression (PCR) methods. The PLS and PCR regression models built in this study were also used to predict the antimalarial activity of 30 new artemisinin compounds with unknown activity. The models obtained showed not only statistical significance but also predictive ability. The significant molecular descriptors related to the compounds with antimalarial activity were the hydration energy (HE), the charge on the O11 oxygen atom (QO11), the torsion angle O1-O2-Fe-N2 (D2) and the R maximal index weighted by Sanderson electronegativity (RTe+). These variables led to a physical and structural explanation of the molecular properties that should be selected for when designing new ligands to be used as antimalarial agents.

Case Study on Aspirin
Aspirin, introduced in 1899, was one of the first drugs developed and is still one of the most widely used. An estimated 20 billion aspirin tablets are consumed each year in the United States. Originally intended to ease pain and relieve sore muscles, it proved to be a highly complex drug with unexpected powers and limitations. It turned out that it reduces the incidence of heart attacks and is effective in reducing the incidence of Alzheimer's disease and cancer of the digestive tract. At the same time, however, aspirin attacks the stomach lining, causing bleeding or even ulcers, and usually causes intestinal problems [84]. One of the modes of action of aspirin is blocking an enzyme (a type of protein) called COX-2, which promotes inflammation, pain and fever. Unfortunately, it also interferes with COX-1, a related enzyme that produces hormones essential to the health of the stomach and kidneys. An efficient analgesic and anti-inflammatory agent would ideally leave COX-1 unaffected. Figure 2 shows the structure of aspirin, which acts by transferring part of its molecule, the acetyl group, to COX-2, disabling it. This drug-receptor interaction is irreversible owing to the formation of a covalent bond resulting from the nucleophilic attack of the hydroxyl group of the amino acid serine 530 on the electrophilic acetyl group of aspirin [85].
A substitute for aspirin (a new drug) has to keep this feature of the molecule, and the replacement should retain the general shape and size of the molecule so that it fits into the molecular target in the same manner as aspirin. The structure of aspirin and the region essential for the expression of biological activity (pharmacophore) were visualized using the ChemSketch 12.00 [23] and HyperChem Release 6.02 [73] programs.
Maps of electrostatic potential (MEPs) are a computational tool that aids in understanding the recognition of one molecule by another, as in drug-receptor and enzyme-substrate interactions, because it is through their potentials that chemical species interact with each other in the biological recognition process. Electronic parameters are among the main factors governing the drug-receptor interaction; in this sense, the MEP can be considered an alternative approach to understanding the electrostatic contribution of this drug and its new derivatives to biological activity [86]. Constructing the MEP requires three steps: construction of the density surface of the molecule, construction of the electrostatic potential surface, and application of colors to the resulting surface to designate potential values. One of the frequent topics of theoretical chemistry is the search for improved methods to elucidate the behavior of molecules and other reactive chemical species. Among the numerous existing reactivity indices, the molecular electrostatic potential $V(\mathbf{r})$, which is generated around a molecule by its nuclei and electrons, is notable for being a real physical property that can be determined experimentally by diffraction methods as well as computationally [87].
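For reference, the potential generated by the nuclei and electrons is usually written as follows. This is the standard textbook expression in atomic units, not an equation taken from this work; $Z_A$ and $\mathbf{R}_A$ denote the charge and position of nucleus $A$, and $\rho(\mathbf{r}')$ is the electron density, symbols introduced here for illustration.

```latex
V(\mathbf{r}) = \sum_{A} \frac{Z_A}{\left| \mathbf{R}_A - \mathbf{r} \right|}
              - \int \frac{\rho(\mathbf{r}')}{\left| \mathbf{r}' - \mathbf{r} \right|} \, d\mathbf{r}'
```

The first term is the positive contribution of the nuclei and the second the negative contribution of the electrons, so regions where the electronic term dominates appear as negative potential on the map.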
Figure 3 shows the MEPs of aspirin obtained with the Hartree-Fock method in different basis sets (HF/3-21G*, HF/3-21G**, HF/6-31G, HF/6-31G*, HF/6-31G** and HF/6-311G). In this figure the HF/6-31G** level gave the lowest maximum positive electrostatic potential (blue region), 0.06715 au (atomic units), whereas HF/6-311G gave the highest, 0.07607 au; the variation between them was 0.00892 au (HF/6-311G and HF/6-31G**). HF/6-311G also gave the most negative electrostatic potential (red region), −0.10235 au, whereas HF/3-21G** gave the least negative, −0.09071 au; the variation between them was 0.01164 au (HF/6-311G and HF/3-21G**). The negative potential surface on the carbonyl oxygen atom of the acetyl group indicates that this atom is nucleophilic (affinity for positive nuclei), while the carbonyl carbon is electrophilic (affinity for electrons), in accordance with the literature [85]. We have thus shown that different basis sets give different electrostatic potential values for the same case study. Therefore, how accurately the calculated molecular properties reproduce experimental data depends on the method and basis set. In this case, the MEPs of aspirin in different basis sets were used to evaluate the key features of aspirin through qualitative comparisons in the region of the acetyl group; according to Bernardinelli et al. [88], the geometric form of the electrostatic potential is similar for all active compounds. Accordingly, new derivatives of aspirin must have some structural similarity in their electrostatic potentials that allows them to be recognized in the same way, with similar biological activities [87] [89] [90].
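The variations quoted above are simple differences between the extreme values, and they can be checked with a short script. Only the four values named in the text are used; the potentials at the other levels of theory appear only in Figure 3 and are not reproduced here. The dictionary and function names are hypothetical.

```python
# Extreme MEP values (au) quoted in the text for aspirin. Only the levels of
# theory named in the text are listed; the others are not reproduced here.
MAX_POSITIVE = {"HF/6-31G**": 0.06715, "HF/6-311G": 0.07607}
MIN_NEGATIVE = {"HF/6-311G": -0.10235, "HF/3-21G**": -0.09071}


def spread(values):
    """Difference between the largest and smallest value, rounded to 5 places."""
    return round(max(values.values()) - min(values.values()), 5)


if __name__ == "__main__":
    print("variation of positive potential (au):", spread(MAX_POSITIVE))
    print("variation of negative potential (au):", spread(MIN_NEGATIVE))
```

The two spreads reproduce the 0.00892 au and 0.01164 au reported in the text, which correspond to roughly 12% of the respective extreme values.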
Structure-activity relationships (SAR) indicate molecular structure modifications that increase drug effectiveness. In general, reports show that these modifications are made through small changes in the structure of the lead compound, followed by laboratory trials to quantify the variations in biological activity due to the changes in molecular structure [91].
The quantum-chemical descriptors most widely used in SAR, QSAR and QSPR studies are related to the energies of the frontier orbitals (HOMO and LUMO). The reason is that these properties provide information about the electron-donor and/or electron-acceptor character of a compound and thus about the formation of a charge transfer complex (CTC) [92]. The energies of the highest occupied molecular orbital (HOMO) and the lowest unoccupied molecular orbital (LUMO) are quantum-chemical descriptors that play an important role in chemical reactions and in the formation of many charge transfer complexes [93]. Figure 4 shows the frontier orbitals (HOMO and LUMO) of aspirin with their respective energy values (eV), using the Hartree-Fock method with different basis sets (HF/3-21G*, HF/3-21G**, HF/6-31G, HF/6-31G*, HF/6-31G** and HF/6-311G). The figure delimits the region of the HOMO, which measures the electron-donor character of aspirin, and of the LUMO, which measures the electron-acceptor character. From these definitions, two important features follow: the higher the energy of the HOMO, the greater the electron-donating ability, as observed at HF/6-31G** with HOMO = −0.34408 eV; and the lower the energy of the LUMO, the lower the resistance to accepting electrons, as observed at HF/6-311G with LUMO = 0.06889 eV.
In this figure the HOMO is located in the region around the benzene ring and the acetyl group. When substitutions are made on the aromatic ring or the acetyl group with substituents of high electron density, such as carbonyls, amines or amides, the more pronounced HOMO region is strongly influenced and can give rise to secondary stereoelectronic effects, which may compromise the pharmacological activity of the compound. The energy of the HOMO is directly related to the ionization potential of the compound and characterizes the ability of the molecule to perform nucleophilic attacks. Also in Figure 4, the LUMO is located in the region close to the benzene ring and the carboxylic group; the energy of the LUMO is directly related to the electron affinity and characterizes the susceptibility of the compound to attack by nucleophiles [94].
The energies of the HOMO and LUMO have been used for decades as indices of chemical reactivity and are commonly correlated with other indices, such as electron affinity and ionization potential [95]-[99]. The difference between the energies of the HOMO and LUMO orbitals (gap) is an important indicator of molecular stability. Molecules with a low gap value are generally reactive, while a higher gap value indicates high stability of the molecule, in the sense of low reactivity in chemical reactions [100]. Table 1 shows some molecular properties obtained with the Hartree-Fock method in different basis sets (HF/3-21G*, HF/3-21G**, HF/6-31G, HF/6-31G*, HF/6-31G** and HF/6-311G) and the Pearson correlation matrix [101]-[103], taking the total energy (Etotal) of aspirin as the independent variable and the other properties as dependent variables; this treatment was performed using the Statistica 6.2 program [104]. In this table the correlation among the molecular properties of aspirin is less than or equal to 0.96890 (LUMO), while the correlation between the molecular properties and the total energy is less than or equal to 0.85448 (QC8, the charge on carbon atom 8; see Figure 2). Among the properties obtained, those most relevant to building a QSPR model as a function of the total energy were the volume (−0.78677), hydration energy (−0.45775), GAP (−0.43419) and QC8 (0.85448). Therefore, we can represent the QSPR model according to the values of the statistical parameters in Equation (52) below. The statistical quality [105] of the regression equations was gauged by parameters such as the correlation coefficient (r) or squared correlation coefficient ($r^2$), explained variance ($R_A^2$, i.e., adjusted $R^2$), standard error of estimate (SEE) and variance ratio (F) [106]-[108]. The better regression models were selected on the basis of higher r and F values (F being a statistic for assessing overall significance) and lower SEE.
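The statistical quantities named above can be illustrated with a minimal pure-Python sketch for a one-descriptor regression. It is not the Statistica procedure used in this work, and the data in the usage section are made-up placeholder numbers, not values from Table 1; function names are hypothetical. The formulas are the usual ones for a least-squares line with k = 1 predictor and n observations.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equally long sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def regression_quality(x, y):
    """r, r^2, adjusted R^2, SEE and F for a simple linear regression of y on x."""
    n, k = len(x), 1                      # k = number of predictors
    dof = n - k - 1                       # residual degrees of freedom
    r = pearson_r(x, y)
    r2 = r * r
    mean_y = sum(y) / n
    syy = sum((b - mean_y) ** 2 for b in y)
    residual_ss = max(syy * (1.0 - r2), 0.0)   # guard against rounding below 0
    adjusted_r2 = 1.0 - (1.0 - r2) * (n - 1) / dof
    see = math.sqrt(residual_ss / dof)
    f_ratio = float("inf") if r2 >= 1.0 else (r2 / k) / ((1.0 - r2) / dof)
    return {"r": r, "r2": r2, "adjusted_r2": adjusted_r2, "SEE": see, "F": f_ratio}


if __name__ == "__main__":
    # Placeholder data for six hypothetical calculations; NOT values from Table 1.
    descriptor = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
    response = [1.2, 1.9, 3.2, 3.8, 5.1, 6.1]
    for name, value in regression_quality(descriptor, response).items():
        print(f"{name}: {value:.5f}")
```

With only six basis sets there are few degrees of freedom, so the adjusted $R^2$ and F values penalize every additional descriptor heavily; this is one reason to report them alongside r.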
This type of treatment shows that QSPR studies are important for obtaining molecular properties that take into account different aspects of chemical information. This information can come from experiments, theoretical calculations or simple counting; it can consider the entire molecule, fragments or functional groups; it can require knowledge of the 3D structure of the molecule, its molecular graph or simply its formula; and it can be defined by scalar values, vectors or scalar fields [109].
In the search for a superaspirin, Jacob et al. (2012) [110] synthesized an aspirin-glucose conjugate (Figure 5) in order to study its solubility in water and anticancer activity compared with aspirin. They found that aspirin-glucose was seven times more soluble in water than aspirin and about 8 to 9 times more active than aspirin in inhibiting cell growth in cultured breast, pancreatic and prostate cancer cell lines, while the activity was similar in a benign, non-cancerous cell line. The computer calculations performed for aspirin and aspirin-glucose at HF/6-31G** indicated that the increased anticancer activity of aspirin-glucose is related to its higher GAP value of −0.43232 eV, which points to greater molecular stability and lower chemical reactivity; the GAP value of aspirin at HF/6-31G** was −0.43332 eV, a difference in GAP of about $10^{-3}$ eV between aspirin and aspirin-glucose.
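The size of this difference can be checked directly. The sketch below assumes, from the negative sign of the reported values, that GAP is defined as the HOMO energy minus the LUMO energy; that convention is an inference, not something stated explicitly in the text, and the names are hypothetical.

```python
# GAP values quoted in the text at HF/6-31G** (units as reported in the text).
GAP_ASPIRIN = -0.43332
GAP_ASPIRIN_GLUCOSE = -0.43232


def homo_lumo_gap(e_homo, e_lumo):
    """GAP under the ASSUMED convention E(HOMO) - E(LUMO), which is negative
    whenever the LUMO lies above the HOMO."""
    return e_homo - e_lumo


def gap_difference(gap_a, gap_b):
    """Signed difference gap_b - gap_a, rounded to 5 decimal places."""
    return round(gap_b - gap_a, 5)


if __name__ == "__main__":
    delta = gap_difference(GAP_ASPIRIN, GAP_ASPIRIN_GLUCOSE)
    print("GAP(aspirin-glucose) - GAP(aspirin) =", delta)
```

The difference is 0.001 in the reported units, that is, about 0.2% of the gap itself, so the two molecules are nearly indistinguishable by this descriptor alone.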
Figure 5 shows the map of electrostatic potential for aspirin-glucose, in which the region of negative electrostatic potential, shown in red, is the region essential for the expression of biological activity (pharmacophore). These results are consistent with the conclusion of Jacob et al. that the hydrolysis of aspirin-glucose in human serum is relatively slower than that of aspirin, and that significant anticancer activity was observed at the doses studied under the experimental conditions. The anticancer activity in vitro was much stronger for aspirin-glucose than for aspirin in cancer cell lines. Further studies are therefore needed, among them the use of maps of electrostatic potential as an indicator of the site of chemical reactivity, to confirm this finding in an in vivo system. The high solubility of the glucose conjugate aids the development of an injectable form of aspirin.
This new derivative may in the future be considered a "superaspirin"; it must pass long-term safety tests before being placed on pharmacy shelves, but in time it could replace aspirin and other non-steroidal anti-inflammatory drugs.

Final Considerations
Computational chemistry is considered one of the greatest intellectual achievements of the twentieth century, and it has been the conceptual basis for an understanding of chemistry much deeper than that existing before the 1920s, the period in which the foundations of quantum theory were laid. The impact of this theory on chemistry can be seen in its practical implications in various fields such as spectroscopy, electron microscopy and molecular modeling, among others.
Molecular modeling provides important information for the drug discovery process, because it facilitates obtaining specific molecular properties that can influence the interaction with the receptor and the biological activity.
The Hartree-Fock method provides a quantitative prediction of high quality for a wide variety of chemical and biological systems, but the calculations are time consuming and computationally expensive. A commonly employed strategy is to optimize the geometry with a simpler basis set and then perform "single point" calculations with a more complete basis set, which allows the energy and other molecular properties of a system to be determined at a more sophisticated level of calculation.
The application of the Hartree-Fock method depends on the basis set and on the system to be studied in order to obtain different molecular properties related to the biological activity of bioactive molecules. The computer calculations should describe the characteristics needed to relate to the experimental data of the molecule or set of molecules under study, so that the molecular modeling is effective and/or efficient. However, these descriptors are not completely universal, because they depend on the structures and systems studied. Even though they are based on an energy minimum, the calculated descriptors have values very close to their respective empirical values and indicate electronic trends in the systems under study. The aspects highlighted in this work make it evident that quantum-chemical descriptors have a wide range of applications in SAR, QSAR and QSPR studies, as well as in many areas that integrate fundamental knowledge of Organic Chemistry, Biochemistry, Molecular Biology, Pharmacology and Pharmaceutical Chemistry.
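The two-step strategy just described (optimization with a simpler basis set, then a single point with a larger one) can be sketched as a small script that writes the two input texts. This is a minimal illustration, assuming the plain-text Gaussian input layout (route line, blank line, title, blank line, charge and multiplicity, Cartesian coordinates, blank line) and the `Opt` and `SP` route keywords. The helper name is hypothetical, and the geometry is a rough placeholder for water, not aspirin; in a real workflow the second input would use the optimized coordinates from the first job.

```python
# Hypothetical helper that assembles a Gaussian-style input text.
# The coordinates below are a rough placeholder geometry for WATER (angstrom),
# used only to keep the example short; they are NOT aspirin coordinates.
PLACEHOLDER_ATOMS = [
    ("O", 0.000000, 0.000000, 0.117300),
    ("H", 0.000000, 0.757200, -0.469200),
    ("H", 0.000000, -0.757200, -0.469200),
]


def gaussian_input(route, title, charge, multiplicity, atoms):
    """Return the text of an input file: route, title, charge/multiplicity, geometry."""
    lines = [f"# {route}", "", title, "", f"{charge} {multiplicity}"]
    for symbol, x, y, z in atoms:
        lines.append(f"{symbol:<2} {x:12.6f} {y:12.6f} {z:12.6f}")
    lines.append("")                      # blank line terminating the geometry
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Step 1: geometry optimization with the smaller basis set.
    step1 = gaussian_input("HF/3-21G* Opt", "step 1: optimization",
                           0, 1, PLACEHOLDER_ATOMS)
    # Step 2: single point with the larger basis set; the optimized geometry
    # from step 1 would replace PLACEHOLDER_ATOMS here.
    step2 = gaussian_input("HF/6-31G** SP", "step 2: single point",
                           0, 1, PLACEHOLDER_ATOMS)
    print(step1)
    print(step2)
```

Separating the two steps keeps the expensive basis set out of the many gradient evaluations needed by the optimization, at the price of evaluating properties at a geometry that is not a minimum of the larger basis.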

Figure 1. (a) The artemisinin structure and the region essential for expression of the biological activity (pharmacophore), visualized using the ChemSketch 12.00 program [23]. (b) Map of molecular electrostatic potential (MEP). (c) and (d) HOMO and LUMO orbital energies. The MEP, HOMO and LUMO were calculated using the Hartree-Fock (HF) method with the 6-31G** basis set and visualized with the Molekel program [24].

...tions $\chi_\mu$ are Slater-type or Gaussian-type atomic orbitals. To represent the orbitals accurately, the functions should form a complete set. However, this would require an infinite number of functions, so in practice a finite number of basis functions is used. The orbitals should obey the orthonormality condition; otherwise, a linear transformation can be applied to make them orthonormal.
...the expansion, where the last three parameters are determined by the least squares method [51].
Molekel is free, multiplatform molecular visualization software. It was originally developed at the University of Geneva by Flükiger in the 1990s for Silicon Graphics computers. In 1998, Stefan Portmann took over responsibility and released version 3.0. Version 4.0 was almost platform independent. Further development led to version 4.3, before Stefan Portmann moved on and stopped developing the code. In 2006, the Swiss National Supercomputing Centre (CSCS) restarted the project, and version 5.0 was released on December 21 of the same year.

Table 1. Molecular properties obtained at different basis sets of aspirin and Pearson correlation matrix.