Lungfish have the largest known genome of vertebrates. Scientists have failed to assemble it successfully for a long time. Therefore, the research on the transition of vertebrates from aquatic to terrestrial habitats missed a critical link. In a previous study, only 34.5 Gb of genome sequence was obtained from Australian lungfish, and the extremely low core gene completeness (67%) hindered the in-depth research in lungfish. In this study, we completed the analysis of 1.5 TB of third-generation data in three days by using an independently developed third-generation sequencing and assembly algorithm and obtained 39.1 Gb of genome sequence data. The core gene completeness is as high as 95%. Thus, the genome of African lungfish, which is the most difficult to assemble, was analyzed at high quality. Lungfish is known as a “living fossil”; the analysis of the genomic data revealed a three-step scenario for the water-to-land evolution. This achievement signifies that China has reached the international top level in giant genome assembly.
Chromosome-level genome assembly and evolutionary history of the African lungfish
Large memory computer operation site