Halyomorpha halys Official Gene Sets v1.0 and v1.1

This dataset presents the *Halyomorpha halys* Official Gene Set (OGS) v1.0 and v1.1. The OGS is an integration of automatic gene predictions from NCBI's eukaryotic annotation pipeline, [NCBI Halyomorpha halys Annotation Release 100](https://www.ncbi.nlm.nih.gov/genome/annotation_euk/Halyomorpha_halys/100/), with manual annotations by the research community (performed via the Apollo manual curation software, http://genomearchitect.org/).

Data from: A High-Quality Genome Assembly from a Single, Field-collected Spotted Lanternfly (Lycorma delicatula) using the PacBio Sequel II System

This dataset provides a 2.3 Gb *de novo* genome assembly of a field-collected adult female Spotted Lanternfly (*Lycorma delicatula*), generated using a single PacBio SMRT Cell. Supporting files for the manuscript "A High-Quality Genome Assembly from a Single, Field-collected Spotted Lanternfly (*Lycorma delicatula*) using the PacBio Sequel II System" include several intermediate versions of the assembly (raw output from Falcon, raw output from Falcon unzip, etc.) as well as the final assembly's primary contigs and haplotigs (for the regions of the genome that were phased).

Oncopeltus fasciatus hybrid genome assembly 1.0

The milkweed bug, *Oncopeltus fasciatus*, was sequenced by the Baylor College of Medicine as part of the i5k pilot project (Illumina data). To augment those resources, we present here a hybrid genome assembly that incorporates low-coverage PacBio data, assembled with PBJelly: the *Oncopeltus fasciatus* Hybrid Genome Assembly v1.0.

Cephus cinctus Official Gene Set OGSv1.0 and OGSv1.1

This Official Gene Set is an integration of gene predictions from NCBI's eukaryotic annotation pipeline v8.0 ([NCBI Cephus cinctus Annotation Release 101](https://www.ncbi.nlm.nih.gov/genome/annotation_euk/Cephus_cinctus/101/)) with manual annotations by the research community (done via the Apollo manual annotation software). QC and merging of the dataset were performed using the GFF3toolkit software ([https://github.com/NAL-i5K/GFF3toolkit](https://github.com/NAL-i5K/GFF3toolkit)).

Orussus abietinus mitochondrial genome assembly

The Baylor College of Medicine has sequenced and annotated the *Orussus abietinus* genome as part of the i5k pilot project. This dataset represents a targeted assembly and annotation of the mitochondrial genome.

Athalia rosae mitochondrial genome assembly

The Baylor College of Medicine has sequenced and annotated the *Athalia rosae* genome as part of the i5k pilot project. This dataset represents a separate targeted assembly of the mitochondrial genome.

Orussus abietinus Official Gene Set OGSv1.0

The *Orussus abietinus* genome was recently sequenced and annotated by the Baylor College of Medicine as part of the i5k pilot project. The *Orussus abietinus* research community has manually reviewed and curated the computational gene predictions and generated an official gene set, OGSv1.0. The general procedure for generating this OGS is outlined here: https://github.com/NAL-i5K/I5KNAL_OGS/wiki. OGSv1.0 was produced by merging the gene set OABI-V0.5.3-Models, generated by the Baylor College of Medicine, with community-curated models from the Apollo software, after QC of the Apollo output.