TY  - DATA
T1  - De novo genome assembly and annotation data for the Murray cod (Maccullochella peelii), Australia's largest freshwater fish
AU  - Austin, Christopher M
AU  - Lee, Yin Peng
AU  - Harrisson, Katherine A
AU  - Tan, Mun Hua
AU  - Croft, Laurence J
AU  - Pavlova, Alexandra
AU  - Sunnucks, Paul
AU  - Gan, Han Ming
DO  - 10.5524/100329
UR  - http://gigadb.org/dataset/100329
AB  - One of the most iconic Australian fish is the Murray cod, Maccullochella peelii (Mitchell, 1838), a freshwater species that can grow to ~1.8 metres in length and live to at least 48 years of age. The Murray cod is of conservation concern as a result of strong population contractions, but is also popular for recreational fishing and is of growing aquaculture interest. In this study, we report the whole genome sequence of the Murray cod to support ongoing population genetics, conservation and management-related research, as well as to better understand the evolutionary ecology and history of the species. A draft Murray cod genome of 633 Mbp (N50 = 109,974 bp; BUSCO and CEGMA completeness of 94.2% and 91.9%, respectively) with an estimated 148 Mbp of putative repetitive sequences was assembled from the combined sequencing data of two individual fish with an identical maternal lineage. 47.2 Gb of Illumina HiSeq data and 804 Mb of Nanopore data were generated from the first individual, while 23.2 Gb of Illumina MiSeq data were generated from the second individual. The inclusion of Nanopore reads for scaffolding, followed by gap-closing using Illumina data, led to a 29% reduction in the number of scaffolds and a 55% and 54% increase in the scaffold and contig N50, respectively. We also report the first transcriptome of the Murray cod, which was subsequently used to annotate the genome, leading to the identification of 26,539 protein-coding genes. We present the whole genome of the Murray cod and anticipate this will be a catalyst for a range of genetic, genomic and phylogenetic studies of the Murray cod and, more generally, of other fish species of the family Percichthyidae.
KW  - Genomic
KW  - Transcriptomic
KW  - Murray Cod
KW  - long reads
KW  - genome
KW  - transcriptome
KW  - hybrid assembly
PY  - 2017
PB  - GigaScience Database
LA  - en
ER  - 