U.S. flag

An official website of the United States government

i5k Workspace

About the i5k Workspace@NAL

The i5k Workspace (https://i5k.nal.usda.gov) is an inclusive genome portal for any arthropod genome project that would like to make use of our resources. We provide download services, BLAST, the JBrowse genome browser, and the Apollo manual curation service. Over 50 arthropod genomes are now part of the i5k Workspace, and users are encouraged to browse the genomes that we host, and contribute to the curation of each genome. For more information about the i5k Workspace, you can read our paper on the i5k Workspace, view our posters and talks, and find our software projects on github. The Ag Data Commons is now hosting a growing number of i5k Workspace datasets.

About the i5k initiative

The i5k initiative is a transformative project that aims to sequence and analyze the genomes of 5,000 arthropod species. The National Agricultural Library has partnered with the i5k initiative to create the i5k Workspace@NAL, which serves any ‘orphaned’ arthropod genome project's hosting needs. For more information about the i5k initiative, read the paper and visit the website.

Filter by Author

i5k Datasets

23 datasets

Manual annotations of Rhyzopertha dominica genome assembly RdoDt3_Drdd8_decomES

    This dataset contains manual annotations from Rhyzopertha dominica community curators, based on genome assembly RdoDt3_Drdd8_decomES.fasta.gz. These annotations are direct exports from Apollo 2.6 (https://doi.org/10.5281/zenodo.5015109), hosted by the i5k Workspace@NAL (https://i5k.nal.usda.gov/). Manual annotations are temporary and will be reviewed by the i5k Workspace@NAL and submitted to NCBI's GenBank database after review.

    Data from: Tripal EUtils - A Tripal module to increase exchange and reuse of genome assembly metadata

      A core component of NCBI’s BioSample metadata are the BioSample “packages” ([https://www.ncbi.nlm.nih.gov/biosample/docs/packages/](https://www.ncbi.nlm.nih.gov/biosample/docs/packages/)). Data submitters can choose a package, which contain a variety of attribute sets, such as plant- or insect-specific attributes, attribute values as recommended by the MIxS standard, etc. Here, we provide suggested ontology term mappings for attributes from the Invertebrate 1.0 and Plant 1.0 packages. This dataset corresponds to Table 4 in the corresponding publication in the journal Database.

      Frankliniella occidentalis Official Gene Set OGSv1.0

        The *Frankliniella occidentalis* genome was recently sequenced and annotated as part of the i5k pilot project by the Baylor College of Medicine. The *Frankliniella occidentalis* research community has manually reviewed and curated the computational gene predictions and generated an official gene set, OGSv1.0. OGSv1.0 was generated by merging gene set FOCC-V0.5.3-Models generated by the Baylor College of Medicine, and community-curated models in the Apollo software, after QC of the Apollo output. After the merge, scaffolds that were likely bacterial contamination were identified by John H. Werren, and gene models overlapping with these contaminated regions were removed from the OGS.

        Ephemera danica Official Gene Set ephdan_OGSv1.0

          This dataset presents the *Ephemera danica* Official Gene Set (OGS) v1.0. The OGS is an integration of automatic gene predictions from *Ephemera danica* genome annotations v0.5.3, with manual annotations by the research community. Manual and automated annotations were lifted over from genome assembly *Ephemera danica* genome assembly v1.0 to genome assembly Edan_2.0 using the coordinates_conversion and remap-gff3 programs.

          Leptinotarsa decemlineata genome annotations v0.5.3

            This dataset presents the Leptinotarsa decemlineata gene set BCM_v_0.5.3, which was generated computationally. RNA-Seq data was used with additional protein homology data for a MAKER automated annotation of the Leptinotarsa decemlineata genome assembly 1.0.

            Leptinotarsa decemlineata genome assembly 1.0

              This dataset presents the Leptinotarsa decemlineata genome v1.0. This assembly version is the pre-release version, prior to filtering and quality control by the [National Center for Biotechnology Information's GenBank resource](http://www.ncbi.nlm.nih.gov/assembly/GCA_000696205.1)

              Halyomorpha halys Official Gene Set v1.2

                This dataset presents the Halyomorpha halys Official Gene Set (OGS) v1.2. OGSv1.2 and is an update of Halyomorpha halys OGSv1.1 (https://doi.org/10.15482/USDA.ADC/1504240) to the coordinates of genome assembly GCA_000696795.3 (https://www.ncbi.nlm.nih.gov/assembly/GCA_000696795.3) using https://github.com/NAL-i5K/coordinates_conversion/.