TY - JOUR
T1 - A High-Quality Reference Genome Assembly of the Saltwater Crocodile, Crocodylus porosus, Reveals Patterns of Selection in Crocodylidae
AU - Ghosh, Arnab
AU - Johnson, Matthew G.
AU - Osmanski, Austin B.
AU - Louha, Swarnali
AU - Bayona-Vásquez, Natalia J.
AU - Glenn, Travis C.
AU - Gongora, Jaime
AU - Green, Richard E.
AU - Isberg, Sally
AU - Stevens, Richard D.
AU - Ray, David A.
PY - 2020/1
Y1 - 2020/1
AB - Crocodilians are an economically, culturally, and biologically important group. To improve researchers' ability to study genome structure, evolution, and gene regulation in the clade, we generated a high-quality de novo genome assembly of the saltwater crocodile, Crocodylus porosus, from Illumina short read data from genomic libraries and in vitro proximity-ligation libraries. The assembled genome is 2,123.5 Mb, with N50 scaffold size of 17.7 Mb and N90 scaffold size of 3.8 Mb. We then annotated this new assembly, increasing the number of annotated genes by 74%. In total, 96% of 23,242 annotated genes were associated with a functional protein domain. Furthermore, multiple noncoding functional regions and mappable genetic markers were identified. Upon analysis and overlapping the results of branch length estimation and site selection tests for detecting potential selection, we found 16 putative genes under positive selection in crocodilians, 10 in C. porosus and 6 in Alligator mississippiensis. The annotated C. porosus genome will serve as an important platform for osmoregulatory, physiological, and sex determination studies, as well as an important reference in investigating the phylogenetic relationships of crocodilians, birds, and other tetrapods.
KW - Crocodylus porosus
KW - evolution
KW - selection
UR - http://www.scopus.com/inward/record.url?scp=85077480676&partnerID=8YFLogxK
U2 - 10.1093/gbe/evz269
DO - 10.1093/gbe/evz269
M3 - Article
C2 - 31821505
AN - SCOPUS:85077480676
SN - 1759-6653
VL - 12
SP - 3635
EP - 3646
JO - Genome Biology and Evolution
JF - Genome Biology and Evolution
IS - 1
ER -