Two laboratory activities are designed to reinforce several important concepts in General Botany course, which is a required course for biology majors at Savannah State University (SSU). The first activity requires students to study the relationship between protein structure and function through observing the 3D structure of Rubisco (ribulose-1,5-biphosphate carboxylase and oxygenase) —the enzyme that catalyzes the first step of the Calvin cycle for photosynthesis. This activity also helps students understand the mechanism of enzymatic action through examining the interaction of Rubisco with its cofactor, substrate, competitive inhibitor, and product. The second activity is designed to help students grasp the concept of plant evolution and phylogeny through analyzing the genetic sequences of Rubisco collected from representative species and determining the evolutionary relationships of these species using bioinformatics tools. Through these two laboratory activities, several important topics are linked together, with Rubisco as a common theme, so that students would develop a holistic and coherent view of plant sciences. Furthermore, students would also gain several important bioinformatics skills that they could use and apply in their future studies and careers.
Botany course is commonly required as part of the curriculum for B.S. in biology at four-year colleges or universities. Through the course, biology students are expected to gain a basic understanding of plants—the photosynthetic organisms that sustain other forms of life on the earth—and relate plants to different aspects of their everyday life, such as food, health, and environment.
General Botany is offered to junior and senior biology students as a required course at SSU. The course covers a broad range of topics of plant sciences at molecular, cellular, and organismic levels; it is accompanied by a laboratory section, which is linked to the lecture topics. In the past, a standard lab manual has been used to conduct the labs by the instructor; they were mostly stand-alone labs and each lab was designed to be completed in 1-hour and 50-minute. These labs tended to be isolated from each other and there was no apparent connection between them.
To foster the inter-connectedness among the topics in biology education and develop students’ bioinformatics skills, we designed two lab activities using Rubisco as a common theme. Rubisco is an essential enzyme for the Calvin cycle of photosynthesis, catalyzing the first step, the rate-limiting step, of the cycle. It converts atmospheric carbon dioxide (CO2) into organic compounds. As one of the most important and abundant enzymes on the earth, Rubisco has been thoroughly studied and characterized. The enzyme is composed of 16 polypeptide subunits (or chains), including eight copies of a large chain and eight copies of a small chain. The large chain is coded by the gene (rbcL) in the chloroplast DNA, and the small chain is coded by the gene (rbcS) found in the nuclear DNA (Portis & Parry, 2007).
Rubisco needs two substrates: CO2 and ribulose 1,5-biphospahte or RuBP (a five-carbon compound). Once the two substrates are gathered in the specific location of the Rubisco’s active site, carbon dioxide is linked to RuBP to form two molecules of phosphoglycerate (3PG or PGA), converting inorganic carbon dioxide into organic compounds in the process (Andersson, 2008).
The structure of Rubisco has mostly been determined by X-ray crystallography technique; some of the 3D images are stored in the protein structure databases such as the PDB (Protein Databank) and MMDB (Molecular Modeling Database) and they can be visualized with Cn3D—the software that displays the structure of a biomolecule (Wang et al., 2000). To take advantage of these resources in the databases, we designed a laboratory exercise that would allow students to examine the structure of Rubisco, including its primary, secondary, tertiary, and quaternary structure, and observe the active site of Rubisco and the physical interactions between Rubisco with its cofactor, substrate, competitive inhibitor, and product.
Rubisco is an ancient enzyme that has evolved over two billion years and is found in most of the photosynthetic organisms, including cyanobacteria, algae, and plants (Chase et al., 1993). Therefore, the evolutionary history of the photosynthetic organisms has been recorded in the genetic sequences of Rubisco. As many plant genomes (including their nuclear and chloroplast genomes) have been sequenced, the genetic data of Rubisco are available for phylogenetic and evolutionary analysis of plants. Therefore, we designed another laboratory activity that would require students to retrieve the protein sequences of Rubisco in photosynthetic organisms from the databases and construct a phylogenetic tree of those species.
Through these two activities, several topics of botany course, such as photosynthesis, plant chemistry, plant genetics, plant phylogeny and evolution could be coherently and logically linked together. In addition, students would learn several basic bioinformatics skills, which are definitely important skills for the future biologists.
Students have been previously exposed to the concept of protein structure and function in their freshman biology course (Principles of Biology); they have also gained the basic understanding of enzymes and their catalytic roles in biochemical reactions in the same course. Those concepts are reinforced again in General Botany course. For example, Rubisco is thoroughly discussed in the context of photosynthesis in the lecture. To further enhance students’ understanding of this enzyme, we designed a computer-based lab that requires students to visualize the structure of Rubisco and its interaction with its cofactor (Mg2+), substrate (RuBP), product (3PG), and competitive inhibitor (D-xylulose-2,2-diol-1,5-bisphosphat or XDP). Students are also encouraged to view the structures of several Rubisco mutants and investigate how a mutation renders the change of the 3D structure of Rubisco and affect its enzymatic function.
In this activity, students are asked to use Cn3D (“see in 3D”) at the NCBI site (https://www.ncbi.nlm.nih.gov/) to view the 3D structure of Rubisco at the MMDB or PDB. There are a number of structures of Rubisco stored in these databases, which have mostly been determined by X-ray diffraction technique. We choose several of them for this activity.
First, students are asked to view the overall 3D shape of the entire Rubisco enzyme. Spinach Rubisco in the database (PDB ID: 8RUC) is selected for this purpose (Andersson, 1996). Students are expected to see sixteen chains (or subunits) of Rubisco, including eight large chains and eight small chains, and how these chains are assembled in space. They are also required to examine the secondary structures in Rubisco, including 14 α-helices and 18 β-sheets in the large subunit and 2 α-helices and 5 β-sheets in the small subunit.
Students are then asked to observe the active site of Rubisco. Eight large subunits of Rubisco are organized into four groups, each of which contains two large chains that are assembled into an antiparallel dimer, with the N-terminal domain of one monomer adjacent to the C-terminal domain of the other. As a result, there are four such dimers in one molecule of Rubisco. The active site is at an interface between monomers within each dimer and arranged around a magnesium ion (Mg2+), which is the co-factor of the enzyme. Students are asked to zoom in to see the detail of the active site—the binding of Mg2+ and 2-carboxyar-abinitol-1,5-diphosphate (or CAP, an analogue of RuBP) to the active site (
Students are also asked to observe the binding of the substrate (RuBP) to the active site of Rubisco through an entry—PDB ID: 1RCX (Taylor & Andersson, 1997a) and the binding of its competitive inhibitor (XDP) to the active site through another entry—PDB ID: 1RCO (Taylor et al., 1996). By inspecting and comparing the inhibitor (XDP) and the natural substrate (RuBP) of the enzyme in their structure and shape (
The 3D structures of several Rubisco mutants are also found in the databases. Through a direct observation and comparison of the structures of a wild-type with a mutated Rubisco, students are able to visualize how a single amino acid change at the level of primary structure of Rubisco results in a change of its 3D structure and affects its substrate binding and enzymatic activity. Those mutants are excellent examples of illustrating the close relationships of the amino acid sequence in a protein with its structure and function. For example, a mutant (PDB ID: 1 UWA) is caused by a substitution of leucine by phenylalanine at 290th position (L290F) in the large subunit of Rubisco from a green algae (Chlamydomonas reinhardtii), as both leucine and phenylalanine are nonpolar amino acids, they are similar in their chemical properties that the substitution does not lead to a global change in the shape of Rubisco; however, it does cause local structural changes and results in a 13% decrease in CO2 binding, and therefore leads to reduced catalytic activity of the enzyme (Karkehabadi et al., 2005).
In addition, students are encouraged to explore some of the revertants in the databases, whose phenotypes have reverted to the normal phenotype by a second mutation. Those revertants help reinforce students’ understanding of the intricate relationship of a protein’s primary structure with its 3D structure and enzymatic function.
Students have been introduced to the taxonomy and evolution of plants in the lecture; as a result, they have gained the basic understanding of the classification and phylogeny of plants. The new area of molecular phylogenetics is also explained to students and principles underlying the phylogenetic analysis of molecular data, such as DNA and protein sequences are also discussed in the lecture.
In this lab activity, students are required to collect protein sequences of the large subunit of Rubisco from at least twenty-five species representing different branches of photosynthetic organisms; those sequences are then compared and analyzed to reveal the evolutionary relationships among those species and generate a phylogenetic tree through the steps described below.
Students are asked to identify and retrieve the protein sequences of the large subunit of Rubisco by searching the protein databases on the NCBI site (https://www.ncbi.nlm.nih.gov/) using the keywords, such as “Rubisco large subunit” and the name of species. Each student needs to retrieve 25 or more protein sequences from different types of photosynthetic organisms, including cyanobacteria, green algae, bryophytes, vascular seedless plants, gymnosperms, and angiosperms (monocots, eudicots, and basal angiosperms).
Once the protein sequences of the Rubisco large subunit are retrieved and formatted, students are asked to generate the multiple sequence alignment of these sequences using Clustal Omega (Thompson et al., 1994), which is available on the European Bioinformatics Institute (EBI) site(https://www.ebi.ac.uk/Tools/msa/clustalo/). Students need to examine the alignment visually and understand the mutation events that have led to the mismatches and gaps in the alignment. Mismatches are generally caused by amino acid substitutions, and gaps are usually generated by indels (that is, insertion or deletion mutations). By inspecting the alignment of the protein sequences, students would gain the basic understanding of how the sequences of Rubisco have been diverged by the specific molecular mutations over the course of evolution.
The Phylogeny Interference Package (PHYLIP) is downloaded from the website (http://evolution.genetics.washington.edu/phylip.html) and used to construct the phylogenetic tree. PHYLIP contains a number of software tools needed for the generation of the tree (Felsenstein, 1989). Specifically, PROTDIST in PHYLIP is used to compute a distance matrix from the alignment of Rubisco protein sequences (obtained from the previous step), which is a table showing the evolutionary distances between all pairs of protein sequences in the dataset; the evolutionary distance is calculated from the number of amino acid differences between a pair of sequences. NEIGHBOR in the same package is then used to generate a neighbor-joining tree (Saito & Nei, 1987) using the distance matrix data generated from PROTDIST. The graphic tree is displayed with Tree View (Page, 1996). Rubisco protein sequence from cyanophyta (or cyanobacteria) is used as an out group to root the tree.
Once a tree is generated, students need to interpret their trees and compare them with the existing trees reconstructed from other data (morphological, anatomical, or molecular) in the textbook or journal articles.
A sample of the phylogenetic trees generated by our students is shown in
Accession No. | Species | Classification |
---|---|---|
ABG53398 | Trichodesmium erythraeum (cyanobacteria) | Bacteria |
YP_764383 | Stigeoclonium helveticum (green algae) | Chlorophyta |
YP_635747 | Chara vulgaris (green algae) | Charophyta |
BAC85044 | Physcomitrella patens (moss) | Bryophyte |
AHN85622 | Marchantia polymorpha (liverwort) | Bryophyte |
YP_004021777 | Equisetum arvense (horsetail) | Vascular Seedless |
NP_848068 | Adiantum capillus-veneris (fern) | Vascular Seedless |
AJT70447 | Pinus armandii (pine) | Gymnosperm |
BAL72600 | Ginkgo biloba (ginkgo) | Gymnosperm |
A6BM42 | Gnetum parvifolium (jointfir) | Gymnosperm |
YP_009158611 | Encephalartos lehmannii (cycad) | Gymnosperm |
NP_904107 | Amborella trichopoda (amborella) | Basal Angiosperm |
ARJ63208 | Illicium anisatum (Japanese star anise) | Basal Angiosperm |
YP_009516884 | Cabomba aquatica (fanwort) | Basal Angiosperm |
AAF34884 | Magnolia grandiflora (magnolia) | Basal Angiosperm |
AUD56413 | Brasenia schreberi (watershield) | Basal Angiosperm |
ACN49416 | Nelumbo nucifera (sacred lotus) | Basal Angiosperm |
YP_007476359 | Trithuria inconspicua | Basal Angiosperm |
ABQ81458 | Ceratophyllum demersum (hornwort) | Basal Angiosperm |
BAA00147 | Oryza sativa (rice) | Monocot |
NP_043033 | Zea mays (maize) | Monocot |
CCW72383 | Musa acuminata (banana) | Monocot |
AEX93806 | Allium cepa (onion) | Monocot |
AAB68400 | Arabidopsis thaliana (thale cress) | Eudicot |
NP_054944 | Spinacia oleracea (spinach) | Eudicot |
AFS41717 | Gossypium hirsutum (cotton) | Eudicot |
AAA19771 | Solanum lycopersicum (tomato) | Eudicot |
ABJ89687 | Coffea arabica (coffee) | Eudicot |
YP_588125 | Helianthus annuus (sunflower) | Eudicot |
YP_009365832 | Citrus lemon (lemon) | Eudicot |
YP_538747 | Glycine max (soybean) | Eudicot |
YP_001109509 | Populus trichocarpa (black cottonwood) | Eudicot |
ABY86790 | Carica papaya (papaya) | Eudicot |
AEX58544 | Theobroma cacao (cacao) | Eudicot |
ADO64983 | Prunus persica (peach) | Eudicot |
YP_567084 | Vitis vinifera (grape) | Eudicot |
angiosperms, three subgroups are well-defined, representing monocot, eudicot, and basal angiosperms (flowering plants which have diverged from the lineage that evolved to monocots and eudicots).
We designed two computer-based activities that were tied to several lecture topics in General Botany, including enzyme chemistry, photosynthesis, and evolution and phylogeny of plants.
We were under the impression that students, in general, showed great interest and enthusiasm for learning during these lab activities, which was demonstrated by their active participation and engagement in these activities. The evaluation of student performance indicated that the implementation of these exercises indeed helped students achieve their learning objectives. For example, most of the students were able to complete these activities independently and submit a well-written report with valid results and correct interpretations. In addition, most of the students performed satisfactorily on the test, demonstrating their understanding of the concepts and mastering of the skills they learned in the lab. However, we were unable to make an accurate assessment of student learning outcomes this time due to the small size of the class (24 students) and the lack of a control group. We will definitely address this oversight in our future studies.
Although these activities were designed and implemented in General Botany course at SSU, they can be easily modified and adapted to teach the similar topics in other biology courses by selecting different groups of proteins or enzymes relevant to that course.
There are several educational implications of this project. First, the traditional labs for General Botany were generally stand-alone labs that were disconnected with each other. These two exercises are developed to foster the interconnectedness of the topics and concepts through a common theme so that students would gain a holistic and coherent view of plant sciences. Second, unlike the traditional recipe-based lab exercises, these two exercises are designed to be inquiry-based and open-ended activities and to promote students’ skills in critical thinking, problem-solving, and data analysis and interpretation. Third, students are introduced to several important bioinformatics skills, such as searching databases, retrieving data, and using software to analyze the data, which have become important skills in any areas of biological sciences. Students could apply these valuable skills in their future studies and careers.
The author declares no conflicts of interest regarding the publication of this paper.
Zhang, X. R. (2019). Teaching Botany Using Bioinformatics Tools. Creative Education, 10, 2137-2146. https://doi.org/10.4236/ce.2019.1010155