TY  - DATA
T1  - Genome sequence of the duck (Anas platyrhynchos)
AU  - Huang, Y
AU  - Li, Y
AU  - Burt, DW
AU  - Chen, H
AU  - Zhang, Y
AU  - Qian, W
AU  - Kim, H
AU  - Gan, S
AU  - Zhao, Y
AU  - Li, J
AU  - Yi, K
AU  - Feng, H
AU  - Zhu, P
AU  - Li, B
AU  - Liu, Q
AU  - Fairley, S
AU  - Magor, KE
AU  - Du, Z
AU  - Hu, X
AU  - Goodman, L
AU  - Tafer, H
AU  - Vignal, A
AU  - Lee, T
AU  - Kim, K
AU  - Sheng, Z
AU  - An, Y
AU  - Searle, S
AU  - Herrero, J
AU  - Groenen, MAM
AU  - Crooijmans, RPMA
AU  - Faraut, T
AU  - Cai, Q
AU  - Webster, RG
AU  - Aldridge,
AU  - Warren, WC
AU  - Bartschat, S
AU  - Kehr, S
AU  - Marz, M
AU  - Stadler, PF
AU  - Smith, J
AU  - Kraus, RHS
AU  - Zhao, Y
AU  - Ren, L
AU  - Fei, J
AU  - Morisson, M
AU  - Kaiser, P
AU  - Griffin, DK
AU  - Rao, M
AU  - Pitel, F
AU  - Wang, J
AU  - Li, N
DO  - 10.5524/101001
UR  - http://gigadb.org/dataset/101001
AB  - Available here is the first draft genomic sequence of the duck (Anas platyrhynchos). The duck is a member of Anatidae, a family of birds that includes geese and swans. It is an economically important waterfowl, serving as a source of meat, eggs and feathers. Of special interest to agriculture and medicine is the fact that the duck is a principal natural host of influenza A viruses and harbours all of the 16 haemagglutinin and 9 neuraminidase subtypes currently known, except for H13 and H16. Using Illumina Genome Analyzer sequencing technology, the genome of a 10-week-old female Beijing duck was sequenced, and a total of 77 Gb of paired-end reads (approximately 64-fold coverage of the whole genome) was generated, with an average read length of 50 bp. Using SOAPdenovo to assemble the short reads, a draft genome assembly was constructed, consisting of 78,487 scaffolds and covering 1.1 Gb. The contig N50 and scaffold N50 values were 26 kb and 1.2 Mb, respectively. Superscaffolds were constructed and chromosomal sequences created according to the duck genetic map; this resulted in 47 superscaffolds, which contained 225 scaffolds and spanned 289 Mb. Transcriptomes were also generated from several different tissues, comprising 1.87 million ESTs and approximately 121 million 75-bp and 917 million 90-bp paired-end reads, which were generated using either the 454/Roche Life Sciences Analyzer or Illumina Genome sequencing technology.
KW  - Genomic
PY  - 2013
PB  - GigaScience Database
LA  - en
ER  - 