Logo of compfungenJournal's HomeManuscript SubmissionAims and ScopeAuthor GuidelinesEditorial BoardHome
Comp Funct Genomics. 2012; 2012: 914843.
Published online 2012 Jan 24. doi:  10.1155/2012/914843
PMCID: PMC3270531

In Silico Identification and Comparative Genomics of Candidate Genes Involved in Biosynthesis and Accumulation of Seed Oil in Plants


Genes involved in fatty acids biosynthesis, modification and oil body formation are expected to be conserved in structure and function in different plant species. However, significant differences in the composition of fatty acids and total oil contents in seeds have been observed in different plant species. Comparative genomics was performed on 261 genes involved in fatty acids biosynthesis, TAG synthesis, and oil bodies formation in Arabidopsis, Brassica rapa, castor bean and soybean. In silico expression analysis revealed that stearoyl desaturase, FatB, FAD2, oleosin and DGAT are highly abundant in seeds, thereby considered as ideal candidates for mining of favorable alleles in natural population. Gene structure analysis for major genes, ACCase, FatA, FatB, FAD2, FAD3 and DGAT, which are known to play crucial role in oil synthesis revealed that there are uncommon variations (SNPs and INDELs) which lead to varying content and composition of fatty acids in seed oil. The predicted variations can provide good targets for seed oil QTL identification, understanding the molecular mechanism of seed oil accumulation, and genetic modification to enhance seed oil yield in plants.

1. Introduction

A major challenge mankind is facing in this century is the gradual exhaustion of the fossil energy resources. The combustion of those fossil fuels used in transportation is one of the key factors responsible for global warming and environment pollution due to large-scale carbon dioxide emissions. Thus, alternative energy sources based on sustainable and ecologically friendly processes are urgently required. At present gasoline or diesel are being largely substituted by two biofuels, bioethanol and biodiesel, capturing 90% of the market [1]. Biodiesel is made from renewable biomass mainly by alkali-catalysed transesterification of triacylglycerols (TAGs) from plant oils [2]. Manipulation of biosynthetic pathways offers a number of exciting opportunities for plant biologists to redesign plant metabolism toward production of specific TAGs.

The biosynthesis of fatty acids in plants begins with the formation of acetyl Co-A from pyruvate. The acetyl CoA produced in plastids is activated to malonyl CoA; the malonyl group is subsequently transferred to acyl carrier protein (ACP) giving rise to malonyl ACP, the primary substrate of the fatty acid synthase complex. The formation of malonyl CoA is the committed step in fatty acid synthesis and is catalyzed by the highly regulated plastidic acetyl CoA carboxylase complex [3]. De novo fatty acid synthesis in the plastids occurs through a repeated series of condensation, reduction, and dehydration reactions that add two carbon units derived from malonyl ACP to the elongating fatty acid chain. A series of condensation reactions proceed with acetyl-CoA and malonyl-ACP, then acyl-ACP acceptors. Three separate condensing enzymes, or 3-ketoacyl-ACP synthases (KAS I–III) are necessary for the production of an 18-carbon fatty acid. Three additional condensation reactions are required; each condensation step to obtain a saturated fatty acid that is two carbons longer than at the start of the cycle. These reactions are catalysed by 3-ketoacyl-ACP reductase (KAR), 3-hydroxyacyl-ACP dehydratase (HD), and enoyl-ACP reductase (ENR). The first desaturation step also occurs in the plastid; while the acyl chain is still conjugated to ACP, a Δ 9-desaturase converts stearoyl ACP to oleoyl ACP. Termination of fatty acid elongation is catalyzed by acyl ACP thioesterases, which are two main types in plants. The FatA class removes oleate from ACP, whereas FatB thioesterases are involved in saturated and unsaturated acyl ACPs, and, in some species, with shorter-chain-length acyl ACPs [46]. After release from ACP, the free fatty acids are exported from the plastid and converted to acyl CoAs. Nascent fatty acids can be incorporated into TAGs in developing seeds [4]. Oleic acid can be further desaturated to oleate acids by FAD2 [7] and FAD6 [8] in the cytosol and the plastid, respectively. Cytosolic and plastid ω-3 desaturations that result in the production of linolenic acids are catalyzed by FAD3 [9] and FAD7 [10], respectively. Fatty acids can be incorporated into TAGs in developing seeds in a number of ways. For example, a series of reactions known as the Kennedy pathway results in the esterification of two acyl chains from acyl CoA to glycerol-3-phosphate to form phosphatidic acid (PA) and, following phosphate removal, diacylglycerol (DAG). A diacylglycerol acyltransferase (DGAT), using acyl CoA as an acyl donor, converts DAG to TAG. Two classes of DGAT enzymes have been isolated [11, 12], and orthologs have been identified in numerous plant species. DAG and phosphatidylcholine (PC) are interchangeable via the action of cholinephosphotransferase, suggesting a route for the flux of fatty acids into and out of PC. Acyl chains from PC can be incorporated into TAG, either via conversion back to DAG or by the action of a phospholipid diacylglycerol acyltransferase (PDAT) that uses PC as an acyl donor to convert DAG to TAG. There are two predominant seed oil storage proteins in plants: caleosin and oleosin. TAG assembled in these storage proteins form oil bodies in seeds.

The fatty acid composition of seed oil varies considerably both between species and within species. The variation of fatty acids occurs both in chain length and degrees of desaturation. Consequently, the fuel properties of biodiesel derived from a mixture of fatty acids are dependent on the composition of fatty acids in seed oil. Altering the fatty acid profile can, therefore, improve fuel properties of biodiesel such as cold-temperature flow characteristics, oxidative stability, and NOx emissions [13].

Fatty acid biosynthetic pathway is highly conserved in plants, but there are significant variations in fatty acid contents and composition in plants (Table 1). What determines differences in the contents and composition of fatty acids and subsequently the total oil yield in the seeds is not understood. The availability of whole genome sequences, ESTs, and individual gene sequences from different oil rich plant species provide an opportunity to investigate what differences in the structure and sequences of genes determine variation in contents and composition so as to identify distinguishing gene signatures to assist in genetic improvement of crop plants either through marker-assisted breeding or by metabolic engineering [32]. Tanhuanpää et al. [33] developed an allele-specific PCR marker for oleic acid by comparing the wild-type and high-oleic allele of the FAD 2 gene locus in spring turnip rape (Brassica rapa ssp. oleifera). The accumulation of ricinoleic acid in transgenic Arabidopsis seeds was doubled by expressing the castor FAH12 hydroxylase in a FAD 2/FAE1 mutant [34]. The FatA and FatB genes of castor bean were heterologously expressed in Escherichia coli for biochemical characterization after purification, resulting in high catalytic efficiency of RcFatA on oleoyl-ACP and palmitoleoyl-ACP and high efficiencies of RcFatB for oleoyl-ACP and palmitoyl-ACP. The expression profile of these genes displayed the highest levels in expanding tissues that typically are very active in lipid biosynthesis such as developing seed endosperm and young expanding leaves [35]. Arabidopsis thaliana gene diacylglycerol acyltransferase (DGAT) coding for a key enzyme in TAG biosynthesis was expressed in tobacco under the control of a strong ribulose-biphosphate carboxylase small subunit promoter. This modification led up to a 20-fold increase in TAG accumulation in tobacco leaves and translated into an overall twofold increase in extracted fatty acids up to 5.8% of dry biomass in Nicotiana tabacum [36]. Dimov and Mollers [37] tested genetic variation for saturated fatty acid content in two sets of modern winter oilseed rape cultivars (Brassica napus L.) in field experiments under typical German growing conditions. They observed highly significant genetic differences among the cultivars for total saturated fatty acid content, which ranged from 6.8% to 8.1%. Singh et al. [38] constructed genetic map using AFLP, RFLP, and SSR markers for oil palm. They detected quantitative trait loci (QTLs) controlling oil quality (measured in terms of iodine value and fatty acid composition) and identified significant QTLs associated with iodine value (IV), myristic acid (C14 : 0), palmitic acid (C16 : 0), palmitoleic acid (C16 : 1), stearic acid (C18 : 0), oleic acid (C18 : 1), and linoleic acid (C18 : 2) content. The Brassica napus mutant line DMS100 carrying a G-to-A base substitution at the 5′ splice site of intron 6 in FAD 3 had reduced C18 : 3 content in oil seeds [39]. These studies suggest that the comparative analysis of oil biosynthesis and accumulation genes is a suitable strategy to investigate the molecular basis of oil content and composition variation in seed oils of different plant species. Additionally, these variations can be used to develop functional markers for increasing selection efficiency by marker- assisted selection in plant breeding.

Table 1
Variations for fatty acids and TAG biosynthesis pathway genes associated with high oil content in different plant species.

In the present study, four plant species, Arabidopsis, Brassica, soybeans, and castor bean were considered for comprehensive analysis of fatty acid biosynthesis genes due to the availability of their genome sequences and several ESTs collections. Moreover, soybeans and brassicas are the biggest source of plant oil in the world, whereas castor bean contains unusual fatty acid ricinoleate that have chemical properties useful for industrial applications. The total seed oil contents of Arabidopsis, castor bean Brassica, and soybean are 30–37%, 40–45%, 30–40%, and 15–20%, respectively (Table 2) [2831]. Plant oils are mostly composed of five common fatty acids, namely, palmitate (16 : 0), stearate (18 : 0), oleate (18 : 1), linoleate (18 : 2) and linolenate (18 : 3), although, depending on the particular species, longer or shorter fatty acids may also be major constituents. These fatty acids differ from each other in terms of acyl chain length and number of double bonds, leading to different physical properties. Here we put forward the questions (1) whether there are common variations in genes, if any, which contribute to increased seed oil content in plants? (2) Which are the major genes responsible for the higher amounts of five fatty acids mentioned above in different plant species? For answering these questions the present study aimed at (1) the identification of candidate genes for fatty acid biosynthesis, TAG synthesis and oil body formation proteins in plant species under study, (2) the comparative structure analysis of these candidate genes, (3) the in silico identification of sequence variations in fatty acid biosynthesis genes, and (4) the in silico association of sequence variations in candidate genes for oil content and composition.

Table 2
Fatty acid composition of four plant species.

2. Materials and Methods

2.1. Retrieval of Sequences

Thirty-two genes involved in the biosynthesis and storage of fatty acids were retrieved from Arabidopsis database (http://lipids.plantbiology.msu.edu/) by referring to the comprehensive lipid gene catalog provided by Beisson et al. [40]. The selected genes covered all the major biochemical events in the biosynthesis and storage of fatty acids [41, 42]. The protein sequences of these genes were used as query against castor bean database in TIGR (http://blast.jcvi.org/er-blast/index.cgi?project=rca1) and soybean database in soybase (http://soybase.org/). Full-length coding sequences of Brassica were downloaded from GenBank (http://www.ncbi.nlm.nih.gov/genbank/GenbankSearch.html). Protein function domains were examined with “CDD” from NCBI (http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml).

2.2. Prediction of Gene Structures

Gene models for castor bean and soybean genomes were downloaded from Phytozome (http://www.phytozome.net/). The Arabidopsis gene models were downloaded from TAIR (http://www.arabidopsis.org/). Arabidopsis, castor bean, Brassica. Rapa, and soybean sequences were further annotated for gene models (open reading frames, including the 5′UTRs and 3′UTRs) using gene prediction algorithms of FGenesH (http://linux1.softberry.com/berry.phtml?topic=fgenesh&group=programs&subgroup=gfind) [4345] (see Table 1 of the Supplementary Material available online at doi:10.1155/2012/914843). Sequence identity among Brassica rapa, castor bean, soybean, and Arabidopsis genes was confirmed using ClustalW in MegAling in DNASTAR (DNASTAR Inc., Madison, WI, USA). The in silico expression status of candidate genes belonging to different families was searched with an e-value cutoff 0.0 in the ESTdb of NCBI (National Centre for Biotechnology Information) at http://www.ncbi.nlm.nih.gov/BLAST/ and TIGR (The Institute of Genomic Research) at http://blast.jcvi.org/er-blast/index.cgi?project=rca1.

3. Results

3.1. Comparative Genomics of Fatty Acid Biosynthesis Genes in Major Oil Seed Plant Species

The fatty acid biosynthesis pathway includes 32 gene families involved in the conversion of acetyl Co-A into different fatty acids and their storage in oil bodies. A total of 68 protein sequences were retrieved for 32 gene families from the comprehensive lipid gene catalog of Arabidopsis [40] and functional domains were identified for each gene family. The 68 protein sequences from Arabidopsis were queried for fatty acid biosynthesis genes in Brassica rapa, soybean, and castor bean databases. A total of 261 genes belonging to 32 gene families were identified and retrieved from four plant species, out of which, 68 were from Arabidopsis, 62 from Brassica rapa, 55 from castor bean, and 76 from soybean (Table 3). Detailed gene structures, exon- intron coordinates of each gene are given in Supplementary Table  1.

Table 3
Oil synthesis and accumulation genes in Arabidopsis, Brassica rapa, castor bean, and soybean.

3.2. Expression Status of Fatty Acid Biosynthesis Genes

In silico expression analysis revealed that for 32 gene families, ESTs were detected for 68 genes in Arabidopsis, 62 genes in Brassica, 49 genes in castor bean, and 76 genes in soybean (Figure 1). Thirteen genes of Arabidopsis, 15 from castor bean, 8 from soybean, and 2 from Brassica showed tissue preferential expression patterns as per their identities to ESTs from tissue-specific libraries. Twenty-two genes from four plant species were expressed in seeds, 4 in leaves, 3 in flower, and 1 in roots (Table 4). FAD 2 and one homolog of Stearoyl desaturase gene had maximum seed ESTs in castor bean.

Figure 1
In silico transcript abundance (based on matching ESTs available in the database) of oil biosynthesis and accumulation genes in different tissues.
Table 4
In silico expression status of fatty acids biosynthesis and accumulation genes.

3.3. Comparative Analysis of Gene Structures in Different Plant Species

Comparative genomics of fatty acid biosynthesis genes was done to understand as what determines differences, if any, for variations in contents and compositions of fatty acids in different plant species. The gene structure analysis revealed that the exon-intron structure of fatty acid biosynthesis genes in castor bean and soybean gene homologs shared more structure similarity in comparison to Arabidopsis fatty acid biosynthesis genes. However, insertion, deletion, and intron size variations were found in castor bean and soybean genes with reference to Arabidopsis. Fatty acid biosynthesis genes of Brassica rapa were not analyzed for gene structure because for most of the Brassica genes only coding DNA sequences were available in the GeneBank.

Conversion of acetyl Co-A to malonyl Co-A by acetyl carboxylase (ACCase) is the most committed step in fatty acid biosynthesis. Exon/intron number and CDS length for ACCase gene was almost same between castor bean (31 exons) and soybean (33 exons), whereas slightly less in Arabidopsis (26 exons). Comparative structural analysis revealed that homomeric ACCase gene from Arabidopsis (1–26 exons) showed microsynteny with castor bean (6–31 exons) and soybean (6–33 exons), with a 3 bp deletion in 8th and 26th exons of castor bean, 3 bp deletion and 3 bp insertion in 29th and 31st exons of soybean, and a 12 bp insertion in 24th and 26th exons of castor bean and soybean, respectively. First five exons of homomeric ACCase in castor bean and soybean (missing in Arabidopsis) showed colinearity for exon size, with the exception of a 3 bp insertion in the first exon of castor bean gene. Sixteenth exon of ACCase in castor bean showed sequence identity to 3 exons (16th, 17th, and 18th) of soybean (Figure 2).

Figure 2
Structure of ACCase gene in Arabidopsis (26 exons), castor bean (31 exons), and soybean (33 exons); thick arrows and thin lines represented exons and introns, respectively. Arabidopsis 1–26 exons showed identity to 6 to 31st and 6 to 33rd exons ...

Two distinct classes of thioesterases, FatA and FatB, are responsible for release of fatty acids from ACP by thioesterases. FatA gene structure was diverse with respect to exons number (varying from 5 to 11) among four plant species. Two homologs of FatA gene were present in Arabidopsis, castor bean, and soybean, whereas FatB gene had 4 homologs in soybean. The first exon of FatB gene had an insertion of 3 bp in castor bean and 27 bp insertion in one of soybean homologs (Glyma0421910) and other three homologs of soybean (Glyma05g08060, Glyma17g12940, and Glyma06g23560) had 6 bp deletion compared to Arabidopsis (Figure 3). An 69 bp insertion of one exon was present in FatB genes of castor bean and soybean but was absent in Arabidopsis. The last exon of FatB (5th exon) in Arabidopsis showed homology to the last exon (6th exon) of one of the homologs of soybean (Glyma04g21910) and last two exons (6th and 7th) of another homolog of soybean (Glyma06g23560), whereas last exon of castor bean showed homology to the last exon of other two homologs of soybean (Glyma05g08060 and Glyma17g12940).

Figure 3
Structure of FatB (palmitoyl thioesterase) gene in Arabidopsis (At1g08510), castor bean (29848.m004677), and four soybean homologs (Glyma0421910, Glyma05g08060, Glyma17g12940, and Glyma06g23560). The 5th exon of FatB in Arabidopsis showed homology to ...

Stearoyl ACP desaturase gene had maximum number of homologs (6 in Arabidopsis, 3 in Brassica, 4 in soybean, and 4 in castor bean) in fatty acid desaturase category of enzymes. Oleoyl desturase (Fad2) and Linoleate desaturase (Fad3) genes showed more relatedness in relation to number and sizes of exons and introns in each homolog among four plant species. Oleoyl desaturase (FAD 2) had only one exon in Arabidopsis, castor bean, and soybean with an insertion of 12 bp in the exon of castor bean and 9 bp insertion in the exon of one homolog of soybean (Glyma09g17170). FAD 3 gene structure was conserved with respect to exon-intron number and size between Arabidopsis, castor bean, and soybean except for first and last exons. A 21 bp deletion in the first exon of castor bean (29681.m001360) and an insertion of 210 and 213 bp was observed in two homologs of soybean (Glyma01g29630 and Glyma07g18350), respectively. Two deletions of 3 and 12 bp were observed in the last exon (8th exon) of castor bean and soybean, respectively. A deletion of 6 bp was observed in the 3rd exon of FAD 3 of castor bean. An SNP (G→A) was also identified at the exon-intron junction of FAD 3 gene in the 3rd exon of one homolog of soybean (Glyma01g29630) with respect to castor bean, Arabidopsis, and other homologs of soybean (Figure 4).

Figure 4
Structure of FAD 3 (linoleoyl desaturase) gene in Arabidopsis (At2g29980), castor bean (29681.m001360), and two soybean homologs (Glyma01g29630, Glyma07g18350). Exon/intron numbers are conserved in FAD 3 while variation in sizes was observed in the first ...

The DGAT gene involved in TAG (Tri-acyl Glyceride) synthesis has two isoforms, DGAT-1 and DGAT-2. These two genes showed variation in number and sizes of exons and introns. DGAT-1 gene had 15 exons in Arabidopsis, 13 exons in castor bean, and 16 exons in soybean. DGAT-2 had 8 exons in Arabidopsis and castor bean and 7 exons in soybean. The detailed comparative genomics of fatty acid biosynthesis genes in 4 oil seed plant species provided insights to undertake identification and utilization of castor bean fatty acid biosynthesis genes and sequence variations for the development of candidate gene markers in Jatropha.

Fatty acid biosynthesis genes showed evolutionary relatedness but there is no synteny in gene order and position of genes on the chromosomes. Location of genes on chromosomes in Arabidopsis and soybean is given in Supplementary Table 2.

4. Discussion

In general, plant oil biosynthesis mostly follows the common biosynthetic pathways for fatty acids in the plastid as well as TAG in the endoplasmic reticulum (ER) and the oil further accumulates in oil bodies. However, there are significant differences for content and composition of seed oil in different plant species. Using comparative genomics, we tried to infer the effect of change in gene structure differences on oil content in different plant species. In this study, 261 genes involved in biosynthesis and accumulation of seed oil were identified in four oil seed plant species, Arabidopsis, Brassica, castor bean, and soybean. The genes corresponded to six different categories (ACCase, desturase, elongase, thioesterase, TAG synthesis and oil body proteins). Gene families corresponding to these six categories of enzymes had multiple copies in plant species with the exception of homomeric ACCase.

In higher plants, many proteins and enzymes are encoded by gene families, and in Arabidopsis, it has been estimated that 20% of genes are members of gene families [46]. The existence of gene families can sometimes reflect additional levels of genetic control or isoforms of proteins with specific functions. Therefore, it is of interest to detect potential gene families involved in the fatty acid biosynthesis pathway. There is a possibility that different copies of fatty acid biosynthesis genes are present in low oil content genotypes which gives leaky phenotypes as in the case of starch biosynthesis pathway where different copies of genes were responsible for low, medium, and high amylase contents in rice [47].

The oil biosynthesis may be limited by the production of fatty acids [48], which is regulated by acetyl CoA carboxylase (ACCase). Reduction of ACCase activity lowered (1.5–16%) the fatty acid content in transgenic seeds [49]. Conversion of acetyl Co-A to malonyl Co-A by acetyl carboxylase (ACCase) is the most committed step in fatty acid biosynthesis. ACCase of castor bean and soybean showed microsynteny to Arabidopsis, with a 3 bp deletion in 8th and 26th exons in castor bean, 3 bp deletion and 3 bp insertion in 29th and 31st exons in soybean and a 12 bp insertion in 24th, and 26th exons of castor bean and soybean, respectively with respect to Arabidopsis. These sequence variations in ACCase genes may be possibly influencing the variations in fatty acid composition and content in seed oil among Arabidopsis, castor bean, and soybean, as fatty acid content and composition was altered in many plant species with the variations in sequences or expression of ACCase gene [19, 50]. Yang et al. [19] identified two SNPs (T→G, G→A) in ACCase gene which lead to increase (1.3%) in oleic acid, lenolenic acid, and lenoleic acid content in maize. Addition of a plastid transit sequence targeted the introduced ACCase protein to chloroplasts, ultimately resulting in a 5% increase in seed oil of rapeseed [50]. The insertion or deletion identified in our analysis between Arabidopsis, castor bean, and soybean might be responsible for reduction or enhancement of ACCase activity, which is associated with the variations in total fatty acid composition in seed oil among these plant species.

Studies in transgenic plants have demonstrated that thioesterases contribute to the regulation of fatty acid chain length [51]. Typically, FatB accepts saturated acyl-ACP substrates of varying length, while FatA is specific to unsaturated fatty acids and acts on C18:1, oleic, acyl-ACPs [51]. In Brassica napus and Arabidopsis, genetic engineering of Acyl-ACP thioesterase (FatB) resulted in maximum increase of 58% in palmitic acid content [52, 53]. Preventing the release of saturated fatty acids from ACP by downregulating FatB, which encodes a palmitoyl ACP thioesterase, lowered the levels of saturated fatty acids [54]. Variations in palmitate content in seed oil in plant species can be related to the variations in FatB gene [27, 52, 53]. Cardinal et al. [27] identified deletion in exon-inrton junction in one homolg of FatB gene which was associated with low palmitic acid content in soybean cultivar Century (N79-2077 and N93-2008). Palmitate content was ~8% in Arabidopsis [55], ~2% in castor bean [56] and 7–11% in soybean [57]. Variations in the amount of palmitic acid in the seeds of Arabidopsis, castor bean, and soybean might be due to deletions in first exon of FatB gene, which can be further utilized for identification of markers associated with high level of palmitate (saturated fatty acid) in total seed oil in plant species desired for biodiesel purpose.

Soybean lines with high levels of oleic acid (85%) and low levels of saturated fatty acids (6%) have been developed using a transgenic strategy that results in downregulation of two genes, FAD 2, and FatB involved in fatty acid synthesis. Downregulation of the FAD 2 gene, encoding a Δ12 fatty acid desaturase, prevented the conversion of oleic acid to polyunsaturated fatty acids, resulting in increased levels of oleic acid. Additionally, preventing the release of saturated fatty acids from acyl carrier protein (ACP) by downregulating FatB gene, which encodes a palmitoyl ACP (acyl carrier protein) thioesterase, lowered the levels of saturated fatty acids [54]. Hu et al. [14] sequenced the FAD 2 gene fragment from the mutant line DMS100 and wild-type line Quantum of Brassica napus, and identified a single nucleotide mutation (C→T) in the FAD 2 gene. This particular mutation created a stop codon (TAG) leading to premature termination of the peptide chain during translation which leads to high oleic acid content in mutant line DMS100. B. napus mutant line DMS100 carrying a G-to-A substitution at the 5′ splice site of intron 6 in FAD 3 had reduced lenolenic acid content in seed oil [39]. In our analysis insertions or deletions in FAD 2 and FAD 3 genes of soybean might be the possible causes of higher oleate and linoleate content in high oil yielding soybean genotypes. Higher amount of ricinoleic acid in castor bean can be due to an insertion in the FAD 2 gene resulting in higher level of oleic acid because oleic acid is further utilized as a substrate by fatty acid hydroxylase (FAH) to convert oleate to ricinoleate. Low level of linoleate in castor bean oil may be due to a deletion in the 3rd exon of FAD 3 gene because each copy of FAD 3 in Arabidopsis and soybean is conserved.

In our analysis, the acyl-CoA:diacylglycerol acyltransferases (DGAT) gene was highly diverse, which might be involved in the overall variation in triacylglycerols in the oil among the plant species as it is a key enzyme in determining the levels of triacylglycerols in seed oils [58, 59]. Burgal et al. [58] demonstrated that coexpressing the castor bean DGAT2 gene with the castor FA 12 hydroxylase resulted in almost double the levels of hydroxylated fatty acids in neutral lipids (up to 30% of total, compared with 17% in the absence of DGAT2). In our study, most of the variations observed in the coding regions are either insertion or deletion of 3 bp or multiple of three that represent codon usage which either leads to shift in reading frame or functional mutation that are expected to be related to oil content. Thus, the sequence variations identified in fatty acid biosynthesis genes in this study can be tested for their functional role in altering content and composition of seed oil in Jatropha.

5. Conclusion

Comparative genomics, for gene structures and coding sequence variations, was performed on 261 genes involved in fatty acids biosynthesis, TAG synthesis, and oil bodies formation in four oil seed plant species, Arabidopsis, Brassica rapa, castor bean, and soybean to understand whether differences in gene structures or coding sequence determine preferential biosynthesis of higher amounts of particular fatty acids and their contents in the seeds of different plant species. Overall comparative gene structure of fatty acid biosynthesis related genes provided an insight to improve oil quality for biodiesel by exploiting the variations for engineering FAD5, FAD6, and FatB genes to enhance the content of saturated fatty acids. The variations in FAD2, FAD3, Stearoyl desaturase, DGAT-1, and DGAT-2 will be helpful to enhance the oil content in plants. The close relationship between genes under study would be helpful for comparative genomics to study these genes in related species for oil content modification.


1. Antoni D, Zverlov VV, Schwarz WH. Biofuels from microbes. Applied Microbiology and Biotechnology. 2007;77(1):23–35. [PubMed]
2. Ma F, Hanna MA. Biodiesel production: a review. Bioresource Technology. 1999;70(1):1–15.
3. Nikolau BJ, Ohlrogge JB, Wurtele ES. Plant biotin-containing carboxylases. Archives of Biochemistry and Biophysics. 2003;414(2):211–222. [PubMed]
4. Mayer KM, Shanklin J. Identification of amino acid residues involved in substrate specificity of plant acyl-ACP thioesterases using a bioinformatics-guided approach. BMC Plant Biology. 2007;7, article 1 [PMC free article] [PubMed]
5. Pollard MR, Anderson L, Fan C, Hawkins DJ, Davies HM. A specific acyl-ACP thioesterase implicated in medium-chain fatty acid production in immature cotyledons of Umbellularia californica. Archives of Biochemistry and Biophysics. 1991;284(2):306–312. [PubMed]
6. Salas JJ, Ohlrogge JB. Characterization of substrate specificity of plant FatA and FatB acyl-ACP thioesterases. Archives of Biochemistry and Biophysics. 2002;403(1):25–34. [PubMed]
7. Okuley J, Lightner J, Feldmann K, Yadav N, Lark E, Browse J. Arabidopsis FAD2 gene encodes the enzyme that is essential for polyunsaturated lipid synthesis. Plant Cell. 1994;6(1):147–158. [PMC free article] [PubMed]
8. Hitz WD, Carlson TJ, Booth JR, Kinney AJ, Stecca KL, Yadav NS. Cloning of a higher-plant plastid omega-6 fatty acid desaturase cDNA and its expression in a cyanobacterium. Plant Physiology. 1994;105(2):635–641. [PMC free article] [PubMed]
9. Reed DW, Schafer UA, Covello PS. Characterization of the Brassica napus extraplastidial linoleate desaturase by expression in Saccharomyces cerevisiae. Plant Physiology. 2000;122(3):715–720. [PMC free article] [PubMed]
10. Iba K, Gibson S, Nishiuchi T, et al. A gene encoding a chloroplast ω-3 fatty acid desaturase complements alterations in fatty acid desaturation and chloroplast copy number of the fad7 mutant of Arabidopsis thaliana. Journal of Biological Chemistry. 1993;268(32):24099–24105. [PubMed]
11. Cases S, Smith SJ, Zheng YW, et al. Identification of a gene encoding an acyl CoA:diacylglycerol acyltransferase, a key enzyme in triacylglycerol synthesis. Proceedings of the National Academy of Sciences of the United States of America. 1998;95(22):13018–13023. [PMC free article] [PubMed]
12. Lardizabal KD, Mai JT, Wagner NW, Wyrick A, Voelker T, Hawkins DJ. DGAT2 is a new diacylglycerol acyltransferase gene family. Purification, cloning, and expression in insect cells of two polypeptides from Mortierella ramanniana with diacylglycerol acyltransferase activity. Journal of Biological Chemistry. 2001;276(42):38862–38869. [PubMed]
13. Serdari A, Lois E, Stournas S. Impact of esters of mono- and dicarboxylic acids on diesel fuel quality. Industrial and Engineering Chemistry Research. 1999;38(9):3543–3548.
14. Hu X, Gilbert MS, Gupta M, Thompson SA. Mapping of the loci controlling oleic and linolenic acid contents and development of fad2 and fad3 allele-specific markers in canola (Brassica napus L.) Theoretical and Applied Genetics. 2006;113(3):497–507. [PubMed]
15. Zhang P, Burton JW, Upchurch RG, Whittle E, Shanklin J, Dewey RE. Mutations in a 9-stearoyl-ACP-desaturase gene are associated with enhanced stearic acid levels in soybean seeds. Crop Science. 2008;48(6):2305–2313.
16. López Y, Nadaf HL, Smith OD, Connell JP, Reddy AS, Fritz AK. Isolation and characterization of the Δ12-fatty acid desaturase in peanut (Arachis hypogaea L.) and search for polymorphisms for the high oleate trait in Spanish market-type lines. Theoretical and Applied Genetics. 2000;101(7):1131–1138.
17. Bilyeu KD, Palavalli L, Sleper DA, Beuselinck PR. Mutations in soybean microsomal omega-3 fatty acid desaturase genes reduce linolenic acid concentration in soybean seeds. Crop Science. 2005;45(5):1830–1836.
18. Ha BK, Monteros MJ, Boerma HR. Development of SNP assays associated with oleic acid QTLs in N00-3350 soybean. Euphytica. 2010;176(3):403–415.
19. Yang X, Guo Y, Yan J, et al. Major and minor QTL and epistasis contribute to fatty acid compositions and oil concentration in high-oil maize. Theoretical and Applied Genetics. 2010;120(3):665–678. [PubMed]
20. Bruner AC, Jung S, Abbott AG, Powell GL. The naturally occurring high oleate oil character in some peanut varieties results from reduced oleoyl-PC desaturase activity from mutation of aspartate 150 to asparagine. Crop Science. 2001;41(2):522–526.
21. Zheng P, Allen WB, Roesler K, et al. A phenylalanine in DGAT is a key determinant of oil content and composition in maize. Nature Genetics. 2008;40(3):367–372. [PubMed]
22. Bachlava E, Dewey RE, Auclair J, Wang S, Burton JW, Cardinal AJ. Mapping genes encoding microsomal ω-6 desaturase enzymes and their cosegregation with QTL affecting oleate content in soybean. Crop Science. 2008;48(2):640–650.
23. Anai T, Yamada T, Kinoshita T, Rahman SM, Takagi Y. Identification of corresponding genes for three low-α-linolenic acid mutants and elucidation of their contribution to fatty acid biosynthesis in soybean seed. Plant Science. 2005;168(6):1615–1623.
24. Aghoram K, Wilson RF, Burton JW, Dewey RE. A mutation in a 3-keto-acyl-ACP synthase II gene is associated with elevated palmitic acid levels in soybean seeds. Crop Science. 2006;46(6):2453–2459.
25. Spencer MM, Pantalone VR, Meyer EJ, Landau-Ellis D, Hyten DL., Jr. Mapping the Fas locus controlling stearic acid content in soybean. Theoretical and Applied Genetics. 2003;106(4):615–619. [PubMed]
26. Pérez-Vich B, Leon AJ, Grondona M, Velasco L, Fernández-Martínez JM. Molecular analysis of the high stearic acid content in sunflower mutant CAS-14. Theoretical and Applied Genetics. 2006;112(5):867–875. [PubMed]
27. Cardinal AJ, Burton JW, Camacho-Roger AM, Yang JH, Wilson RF, Dewey RE. Molecular analysis of soybean lines with low palmitic acid content in the seed oil. Crop Science. 2007;47(1):304–310.
28. Li Y, Beisson F, Pollard M, Ohlrogge J. Oil content of Arabidopsis seeds: the influence of seed anatomy, light and plant-to-plant variation. Phytochemistry. 2006;67(9):904–915. [PubMed]
29. Velasco L, Becker HC. Estimating the fatty acid composition of the oil in intact-seed rapeseed (Brassica napus L.) by near-infrared reflectance spectroscopy. Euphytica. 1998;101(2):221–230.
30. Ramos LCDS, Tango JS, Savi A, Leal NR. Variability for oil and fatty acid composition in castorbean varieties. Journal of the American Oil Chemists’ Society. 1984;61(12):1841–1843.
31. Sangwan NK, Gupta K, Dhindsa KS. Fatty acid composition of developing soybeans. Journal of Agricultural and Food Chemistry. 1986;34(3):415–417.
32. Rubin GM, Yandell MD, Wortman JR, et al. Comparative genomics of the eukaryotes. Science. 2000;287(5461):2204–2215. [PMC free article] [PubMed]
33. Tanhuanpää P, Vilkki J, Vihinen M. Mapping and cloning of FAD2 gene to develop allele-specific PCR for oleic acid in spring turnip rape (Brassica rapa ssp. oleifera) Molecular Breeding. 1998;4(6):543–550.
34. Smith MA, Moon H, Chowrira G, Kunst L. Heterologous expression of a fatty acid hydroxylase gene in developing seeds of Arabidopsis thaliana. Planta. 2003;217(3):507–516. [PubMed]
35. Sánchez-García A, Moreno-Pérez AJ, Muro-Pastor AM, Salas JJ, Garcés R, Martínez-Force E. Acyl-ACP thioesterases from castor (Ricinus communis L.): an enzymatic system appropriate for high rates of oil synthesis and accumulation. Phytochemistry. 2010;71(8-9):860–869. [PubMed]
36. Andrianov V, Borisjuk N, Pogrebnyak N, et al. Tobacco as a production platform for biofuel: overexpression of arabidopsis DGAT and LEC2 genes increases accumulation and shifts the composition of lipids in green biomass. Plant Biotechnology Journal. 2010;8(3):277–287. [PubMed]
37. Dimov Z, Mollers C. Genetic variation for saturated fatty acid content in a collection of European winter oilseed rape material (Brassica napus) Plant Breeding. 2010;129(1):82–86.
38. Singh R, Tan SG, Panandam JM, et al. Mapping quantitative trait loci (QTLs) for fatty acid composition in an interspecific cross of oil palm. BMC Plant Biology. 2009;9:114–133. [PMC free article] [PubMed]
39. Hu X, Sullivan-Gilbert M, Gupta M, Thompson SA. G-to-A mutation at a 5_splice site of fad3c caused impaired splicing in a low linolenic mutant of canola (Brassica napus L.) Plant Biotechnology. 2007;24(4):397–400.
40. Beisson F, Koo A, Ruuska S, et al. Arabidopsis genes involved in acyl lipid metabolism. A 2003 census of the candidates, a study of the distribution of expressed sequence tags in organs, and a web-based database. Plant Physiology. 2003;132(2):681–697. [PMC free article] [PubMed]
41. Mekhedov S, De de IO, Ohlrogge J. Toward a functional catalog of the plant genome. A survey of genes for lipid biosynthesis. Plant Physiology. 2000;122(2):389–401. [PMC free article] [PubMed]
42. Ohlrogge J, Browse J. Lipid biosynthesis. Plant Cell. 1995;7(7):957–970. [PMC free article] [PubMed]
43. Yu J, Hu S, Wang J, et al. A draft sequence of the rice genome (Oryza sativa L. ssp. indica) Science. 2002;296(5565):79–92. [PubMed]
44. Goff SA, Ricke D, Lan TH, et al. A draft sequence of the rice genome (Oryza sativa L. ssp. japonica) Science. 2002;296(5565):92–100. [PubMed]
45. Krill AM, Kirst M, Kochian LV, Buckler ES, Hoekenga OA. Association and linkage analysis of aluminum tolerance genes in maize. Plos One. 2010;5(4) Article ID e9958. [PMC free article] [PubMed]
46. Bevan M, Bancroft I, Bent E, et al. Analysis of 1.9 Mb of contiguous sequence from chromosome 4 of Arabidopsis thaliana. Nature. 1998;391(6666):485–488. [PubMed]
47. Tian Z, Qian Q, Liu Q, et al. Allelic diversities in rice starch biosynthesis lead to a diverse array of rice eating and cooking qualities. Proceedings of the National Academy of Sciences of the United States of America. 2009;106(51):21760–21765. [PMC free article] [PubMed]
48. Bao XM, Ohlrogge J. Supply of fatty acid is one limiting factor in the accumulation of triacylglycerol in developing embryos. Plant Physiology. 1999;120(4):1057–1062. [PMC free article] [PubMed]
49. Thelen JJ, Ohlrogge JB. Metabolic engineering of fatty acid biosynthesis in plants. Metabolic Engineering. 2002;4(1):12–21. [PubMed]
50. Roesler K, Shintani D, Savage L, Boddupalli S, Ohlrogge J. Targeting of the Arabidopsis homomeric acetyl-coenzyme A carboxylase to plastids of rapeseeds. Plant Physiology. 1997;113(1):75–81. [PMC free article] [PubMed]
51. Jones A, Davies HM, Voelker TA. Palmitoyl-acyl carrier protein (ACP) thioesterase and the evolutionary origin of plant acyl-ACP thioesterases. The Plant Cell. 1995;7(3):359–371. [PMC free article] [PubMed]
52. Voelker TA, Hayes TR, Cranmer AM, Turner JC, Davies HM. Genetic engineering of a quantitative trait: metabolic and genetic parameters influencing the accumulation of laurate in rapeseed. Plant Journal. 1996;9(2):229–241.
53. Dormann P, Voelker TA, Ohlrogge JB. Accumulation of palmitate in Arabidopsis mediated by the acyl-acyl carrier protein thioesterase FATB1. Plant Physiology. 2000;123(2):637–643. [PMC free article] [PubMed]
54. Buhr T, Sato S, Ebrahim F, et al. Ribozyme termination of RNA transcripts down-regulate seed fatty acid genes in transgenic soybean. Plant Journal. 2002;30(2):155–163. [PubMed]
55. Bonaventure G, Salas JJ, Pollard MR, Ohlrogge JB. Disruption of the FATB gene in Arabidopsis demonstrates an essential role of saturated fatty acids in plant growth. Plant Cell. 2003;15(4):1020–1033. [PMC free article] [PubMed]
56. Akbar E, Yaakob Z, Kamarudin SK, Ismail M, Salimon J. Characteristic and composition of Jatropha curcas oil seed from Malaysia and its potential as biodiesel feedstock feedstock. European Journal of Scientific Research. 2009;29(3):396–403.
57. Kinney AJ. Genetic engineering of oilseeds for desired traits. In: Setlow JK, editor. Genetic Engineering. Vol. 19. New York, NY, USA: Plenum Press; 1997. pp. 149–166.
58. Burgal J, Shockey J, Lu C, et al. Metabolic engineering of hydroxy fatty acid production in plants: RcDGAT2 drives dramatic increases in ricinoleate levels in seed oil. Plant Biotechnology Journal. 2008;6(8):819–831. [PMC free article] [PubMed]
59. Venegas-Calerón M, Sayanova O, Napier JA. An alternative to fish oils: metabolic engineering of oil-seed crops to produce omega-3 long chain polyunsaturated fatty acids. Progress in Lipid Research. 2010;49(2):108–119. [PubMed]

Articles from Comparative and Functional Genomics are provided here courtesy of Hindawi Publishing Corporation
PubReader format: click here to try


Save items

Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...


  • Nucleotide
    Primary database (GenBank) nucleotide records reported in the current articles as well as Reference Sequences (RefSeqs) that include the articles as references.
  • PubMed
    PubMed citations for these articles

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...