TY - DATA T1 - Genomic data of the diploid cotton (Gossypium raimondii) AU - Wang, K AU - Wang, Z AU - Li, F AU - Ye, W AU - Wang, J AU - Song, G AU - Yue, Z AU - Cong, L AU - Shang, H AU - Zhu, S AU - Zou, C AU - Li, Q AU - Yuan, Y AU - Lu, C AU - Wei, H AU - Gou, C AU - Zheng, Z AU - Yin, Y AU - Zhang, X AU - Liu, K AU - Wang, B AU - Song, C AU - Shi, N AU - Kohel, RJ AU - Percy, RG AU - Yu, JZ AU - Zhu, Y AU - Wang, J AU - Yu, S DO - 10.5524/100079 UR - http://gigadb.org/dataset/100079 AB - Cotton is one of the most economically important crop plants worldwide. Its fiber, commonly known as cotton lint, is the principal natural source for the textile industry.We have sequenced and assembled a draft genome of G. raimondii, whose progenitor is the putative contributor of the D subgenome to the economically important fiber-producing cotton species Gossypium hirsutum and Gossypium barbadense. We sequenced the 0.78 Gb genome to a depth of approximately 103 X with short reads from a series of libraries with various insert sizes ( 170 bp, 250 bp, 500 bp, 800 bp, 2 kb, 5 kb, 10 kb, 20 kb and 40 kb) on a HiSeq 2000 sequencer.The assembled scaffolds of high quality sequences total 78.7 Gb, with the contig and scaffold N50 values of 44.9 kb and 2.3 Mb respectively. We identified 40,976 protein-coding genes with an mean length of 1104 bb. KW - Genomic PY - 2014 PB - GigaScience Database LA - en ER -