TY - DATA T1 - Data supporting analysis of evolution of SET genes in insects AU - Jiang, Feng AU - Liu, Qing AU - Yanli Wang AU - Zhang, Jie AU - Huimin Wang AU - Tianqi Song AU - Meiling Yang AU - Xianhui Wang AU - Kang, Le DO - 10.5524/100299 UR - http://gigadb.org/dataset/100299 AB - The SET domain is an evolutionarily conserved motif present in histone lysine methyltransferases, which are important in the regulation of chromatin and gene expression in animals. In this study, we searched for SET domain-containing genes (SET genes) in all of the 147 arthropod genomes sequenced so far to understand the evolutionary history by which SET domain have evolved in insects. Phylogenetic and ancestral state reconstruction analysis revealed an arthropod-specific SET gene family, named SmydA, which is ancestral to arthropod animals and specifically diversified during insect evolution. Considering that pseudogenization is the most probable fate of the new emerging gene copies, we provided experimental and evolutionary evidence to demonstrate their essential functions. Fluorescence in situ hybridization analysis and in vitro methyltransferase activity assays showed that the SmydA-2 gene was transcriptionally active and retained the original histone methylation activity. Expression knockdown by RNA interference significantly increased mortality, implying that the SmydA genes may be essential for insect survival. We further showed predominantly strong purifying selection on the SmydA gene family and a potential association between the regulation of gene expression and insect phenotypic plasticity by transcriptome analysis. Overall, these data suggest that the SmydA gene family retains essential functions that may possibly define novel regulatory pathways in insects. This work provides insights into the roles of lineage-specific domain duplication in insect evolution. KW - Genomic KW - insects KW - domain KW - gene duplication KW - histone modification PY - 2017 PB - GigaScience Database LA - en ER -