Other Access

The information on this page (the dataset metadata) is also available in these formats:

JSON RDF

Data Extent

Data from: A High-Quality Genome Assembly from a Single, Field-collected Spotted Lanternfly (Lycorma delicatula) using the PacBio Sequel II System

SLF-spotted lanternfly (Lycorma delicatula); adult winged. Photo: Stephen Ausmus, ARS Image Gallery

A high-quality reference genome is an essential tool for applied and basic research on arthropods. Long-read sequencing technologies may be used to generate more complete and contiguous genome assemblies than alternate technologies, however, long-read methods have historically had greater input DNA requirements and higher costs than next generation sequencing, which are barriers to their use on many samples. Here, we present a 2.3 Gb de novo genome assembly of a field-collected adult female Spotted Lanternfly (Lycorma delicatula) using a single PacBio SMRT Cell. The Spotted Lanternfly is an invasive species recently discovered in the northeastern United States, threatening to damage economically important crop plants in the region. The DNA from one individual female specimen collected in Reading, Berks County, Pennsylvania was used to make one standard, size-selected library with an average DNA fragment size of ~20 kb. The library was run on one Sequel II SMRT Cell 8M, generating a total of 132 Gb of long-read sequences, of which 82 Gb were from unique library molecules, representing approximately 38x coverage of the genome. The assembly had high contiguity (contig N50 length = 1.5 Mb), completeness, and sequence level accuracy as estimated by conserved gene set analysis (96.8% of conserved genes both complete and without frame shift errors). Further, it was possible to segregate more than half of the diploid genome into the two separate haplotypes. The assembly also recovered two microbial symbiont genomes known to be associated with L. delicatula, each microbial genome being assembled into a single contig. We demonstrate that field-collected arthropods can be used for the rapid generation of high-quality genome assemblies, an attractive approach for projects on emerging invasive species, disease vectors, or conservation efforts of endangered species.

Supporting files for the manuscript "A High-Quality Genome Assembly from a Single, Field-collected Spotted Lanternfly (Lycorma delicatula) using the PacBio Sequel II System", include several intermediate versions of the assembly (raw output from Falcon, raw output from Falcon unzip, etc.) as well as the final assembly primary contigs and haplotigs (for the regions of the genome that were phased).

Dataset Info

These fields are compatible with DCAT, an RDF vocabulary designed to facilitate interoperability between data catalogs published on the Web.
FieldValue
Authors
Kingan, Sarah
(ORCID)
Urban, Julie
Lambert, Christine
Baybayan, Primo
Childers, Anna
(ORCID)
Coates, Brad
Scheffler, Brian
(ORCID)
Hackett, Kevin
Korlach, Jonas
(ORCID)
Geib, Scott M.
(ORCID)
Product Type
Dataset
Spatial / Geographical Coverage Area
POLYGON ((-75.915994048119 40.335385813355, -75.915994048119 40.346376494447, -75.897797942162 40.346376494447, -75.897797942162 40.335385813355))
Spatial / Geographical Coverage Location
Female specimen sequenced collected in Reading, Berks County, Pennsylvania (40.34 N, 75.91 W)
Temporal Coverage
2018-08-26
Equipment or Software Used
Intended Use
This is supporting data for the Lycorma delicatula de novo genome assembly.
Use Limitations
None
Publisher
Ag Data Commons
Contact Name
Geib, Scott M.
Contact Email
Public Access Level
Public
Primary Article

Kingan, S. B., Urban, J., Lambert, C. C., Baybayan, P., Childers, A. K., Coates, B. S., Scheffler, B., Hackett, K., Korlach, J., & Geib, S. M. (2019). A High-Quality Genome Assembly from a Single, Field-collected Spotted Lanternfly (Lycorma delicatula) using the PacBio Sequel II System. bioRxiv 627679 [preprint]

License
Funding Source(s)
Agricultural Research Service
2040-22430-026-00-D
Dataset DOI (digital object identifier)
10.15482/USDA.ADC/1503745
Program Code
005:037 - Department of Agriculture - Research and Education
Bureau Code
005:18 - Agricultural Research Service
Modified Date
2019-05-10
Release Date
2019-05-01
Ag Data Commons Keywords: 
  • Genomics & Genetics
  • Genome
  • Genome assembly
  • Genomics & Genetics
  • Genome
  • Genomics & Genetics
  • Plants & Crops
  • Plant health
  • Plants & Crops
State or Territory: 
ISO Topic(s):