Genome-wide analysis of the Hsf family in soybean and functional identification of GmHsf-34 involvement in drought and heat stresses

BMC Genomics. 2014 Nov 21;15(1):1009. doi: 10.1186/1471-2164-15-1009.

Abstract

Background: High temperature affects organism growth and metabolic activity. Heat shock transcription factors (Hsfs) are key regulators in heat shock response in eukaryotes and prokaryotes. Under high temperature conditions, Hsfs activate heat shock proteins (Hsps) by combining with heat stress elements (HSEs) in their promoters, leading to defense of heat stress. Since the first plant Hsf gene was identified in tomato, several plant Hsf family genes have been thoroughly characterized. Although soybean (Glycine max), an important oilseed crops, genome sequences have been available, the Hsf family genes in soybean have not been characterized accurately.

Result: We analyzed the Hsf genetic structures and protein function domains using the GSDS, Pfam, SMART, PredictNLS, and NetNES online tools. The genome scanning of dicots (soybean and Arabidopsis) and monocots (rice and maize) revealed that the whole-genome replication occurred twice in soybean evolution. The plant Hsfs were classified into 3 classes and 16 subclasses according to protein structure domains. The A8 and B3 subclasses existed only in dicots and the A9 and C2 occurred only in monocots. Thirty eight soybean Hsfs were systematically identified and grouped into 3 classes and 12 subclasses, and located on 15 soybean chromosomes. The promoter regions of the soybean Hsfs contained cis-elements that likely participate in drought, low temperature, and ABA stress responses. There were large differences among Hsfs based on transcriptional levels under the stress conditions. The transcriptional levels of the A1 and A2 subclass genes were extraordinarily high. In addition, differences in the expression levels occurred for each gene in the different organs and at the different developmental stages. Several genes were chosen to determine their subcellular localizations and functions. The subcellular localization results revealed that GmHsf-04, GmHsf-33, and GmHsf-34 were located in the nucleus. Overexpression of the GmHsf-34 gene improved the tolerances to drought and heat stresses in Arabidopsis plants.

Conclusions: This present investigation of the quantity, structural features, expression characteristics, subcellular localizations, and functional roles provides a scientific basis for further research on soybean Hsf functions.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adaptation, Physiological / genetics
  • Amino Acid Motifs
  • Amino Acid Sequence
  • Cell Nucleus / metabolism
  • Chromosomes, Plant / genetics
  • DNA-Binding Proteins / genetics*
  • DNA-Binding Proteins / metabolism
  • Droughts*
  • Exons / genetics
  • Gene Duplication
  • Gene Expression Regulation, Plant
  • Genes, Plant
  • Genome-Wide Association Study*
  • Glycine max / genetics*
  • Glycine max / physiology
  • Heat Shock Transcription Factors
  • Hot Temperature*
  • Introns / genetics
  • Isoelectric Point
  • Molecular Sequence Data
  • Molecular Weight
  • Multigene Family*
  • Phylogeny
  • Plant Proteins / chemistry
  • Plant Proteins / genetics
  • Plant Proteins / metabolism
  • Protein Structure, Tertiary
  • Protein Transport
  • Regulatory Sequences, Nucleic Acid / genetics
  • Sequence Alignment
  • Stress, Physiological / genetics*
  • Transcription Factors / genetics*
  • Transcription Factors / metabolism

Substances

  • DNA-Binding Proteins
  • Heat Shock Transcription Factors
  • Plant Proteins
  • Transcription Factors