Domain architecture of BAF250a reveals the ARID and ARM-repeat domains with implication in function and assembly of the BAF remodeling complex

PLoS One. 2018 Oct 11;13(10):e0205267. doi: 10.1371/journal.pone.0205267. eCollection 2018.

Abstract

BAF250a and BAF250b are subunits of the SWI/SNF chromatin-remodeling complex that recruit the complex to chromatin allowing transcriptional activation of several genes. Despite being the central subunits of the SWI/SNF complex, the structural and functional annotation of BAF250a/b remains poorly understood. BAF250a (nearly 2200 residues protein) harbors an N-terminal DNA binding ARID (~110 residues) and a C-terminal folded region (~250 residues) of unknown structure and function, recently annotated as BAF250_C. Using hydrophobic core analysis, fold prediction and comparative modeling, here we have defined a domain boundary and associate a β-catenin like ARM-repeat fold to the C-terminus of BAF250a that encompass BAF250_C. The N-terminal DNA-binding ARID is found in diverse domain combinations in proteins imparting unique functions. We used a comparative sequence analysis based approach to study the ARIDs from diverse domain contexts and identified conserved residue positions that are important to preserve its core structure. Supporting this, mutation of one such conserved residue valine, at position 1067, to glycine, resulted in destabilization, loss of structural integrity and DNA binding affinity of ARID. Additionally, we identified a set of conserved and surface-exposed residues unique to the ARID when it co-occurs with the ARM repeat containing BAF250_C in BAF250a. Several of these residues are found mutated in somatic cancers. We predict that these residues in BAF250a may play important roles in mediating protein-DNA and protein-protein interactions in the BAF complex.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cell Differentiation
  • Chromatin Assembly and Disassembly / genetics*
  • Computational Biology
  • DNA-Binding Proteins
  • Datasets as Topic
  • Glycine / genetics
  • Molecular Dynamics Simulation
  • Multienzyme Complexes / chemistry
  • Multienzyme Complexes / genetics*
  • Mutagenesis, Site-Directed
  • Mutation
  • Nuclear Magnetic Resonance, Biomolecular
  • Nuclear Proteins / chemistry
  • Nuclear Proteins / genetics*
  • Nuclear Proteins / isolation & purification
  • Protein Binding / genetics
  • Protein Domains / genetics*
  • Recombinant Proteins / chemistry
  • Recombinant Proteins / genetics
  • Recombinant Proteins / isolation & purification
  • Sequence Alignment
  • Transcription Factors / chemistry
  • Transcription Factors / genetics*
  • Transcription Factors / isolation & purification
  • Valine / genetics

Substances

  • ARID1A protein, human
  • DNA-Binding Proteins
  • Multienzyme Complexes
  • Nuclear Proteins
  • Recombinant Proteins
  • Transcription Factors
  • Valine
  • Glycine

Grants and funding

This work is supported by Indian Institute of Science (M.S), IISc-DBT Partnership program (M.S.), Ramalingaswami Fellowship (BT/RLF/Re-entry/23/2013) (M.S.) from Department of Biotechnology (DBT), India, and Early Career Award (ECR/2015/000023) (M.S.) from Department of Science and Technology (DST), India. A.M would like to acknowledge DBT, India for the Research Associateship and India Alliance for current Early Career Fellowship (IA/E/15/1/502321). M.G. would like to acknowledge DST, India for INSPIRE fellowship. Authors acknowledge DST and DBT, India for the NMR facilities at the Indian Institute of Science, Bengaluru. Authors acknowledge ITC facility at National Centre for Biological Sciences, Bengaluru. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.