i5k Workspace

About the i5k Workspace@NAL

The i5k Workspace (https://i5k.nal.usda.gov) is an inclusive genome portal for any arthropod genome project that would like to make use of our resources. We provide download services, BLAST, the JBrowse genome browser, and the Apollo manual curation service. Over 50 arthropod genomes are now part of the i5k Workspace, and users are encouraged to browse the genomes that we host and contribute to the curation of each genome. For more information about the i5k Workspace, you can read our paper on the i5k Workspace, view our posters and talks, and find our software projects on GitHub. The Ag Data Commons is now hosting a growing number of i5k Workspace datasets.

About the i5k initiative

The i5k initiative is a transformative project that aims to sequence and analyze the genomes of 5,000 arthropod species. The National Agricultural Library has partnered with the i5k initiative to create the i5k Workspace@NAL, which serves any ‘orphaned’ arthropod genome project's hosting needs. For more information about the i5k initiative, read the paper and visit the website.

i5k Datasets

44 datasets

Manual annotations of Rhyzopertha dominica genome assembly RdoDt3_Drdd8_decomES

    This dataset contains manual annotations from Rhyzopertha dominica community curators, based on genome assembly RdoDt3_Drdd8_decomES.fasta.gz. These annotations are direct exports from Apollo 2.6 (https://doi.org/10.5281/zenodo.5015109), hosted by the i5k Workspace@NAL (https://i5k.nal.usda.gov/). Manual annotations are temporary and will be reviewed by the i5k Workspace@NAL and submitted to NCBI's GenBank database after review.

Neodiprion lecontei Official Gene Set v1.1

    This dataset presents the Neodiprion lecontei Official Gene Set (OGS) v1.1. It was generated using MAKER v2.31.8, followed by CrossMap re-mapping of coordinates to genome assembly Nlec1.1 (https://www.ncbi.nlm.nih.gov/assembly/GCA_001263575.2/).

Oncopeltus fasciatus Official Gene Set v1.1

    The Oncopeltus fasciatus genome was recently sequenced and annotated as part of the i5k pilot project by the Baylor College of Medicine. The O. fasciatus research community has manually reviewed and curated the computational gene predictions and generated an official gene set, OGSv1.1. This dataset presents the Oncopeltus fasciatus Official Gene Set (OGS) v1.1. The OGS is an integration of automatic gene predictions from MAKER (performed by Dan Hughes at Baylor College of Medicine) with manual annotations by the research community (done via Web Apollo).

Leptinotarsa decemlineata Official Gene Set v1.2

    The Leptinotarsa decemlineata genome was recently sequenced and annotated as part of the i5k pilot project by the Baylor College of Medicine. The L. decemlineata research community has manually reviewed and curated the computational gene predictions and generated an official gene set, OGSv1.2. OGSv1.1 is an integration of automatic gene predictions from MAKER (performed by Dan Hughes at Baylor College of Medicine) with manual annotations by the research community (done via the Apollo manual annotation software). The coordinates of OGSv1.1 were converted to the latest genome assembly, GCF_000500325.1, using coordinates_conversion and remap-gff3, to generate OGSv1.2.

Oncopeltus fasciatus Official Gene Set v1.2

    This dataset presents the Oncopeltus fasciatus Official Gene Set (OGS) v1.2. The OGS is an update of OGSv1.1. Manual annotations from the Apollo manual annotation tool were merged with OGSv1.1 using the NAL's prototype Merge program (https://github.com/NAL-i5K/I5KNAL_OGS).

Cimex lectularius Official Gene Set v1.2

    The Baylor College of Medicine recently sequenced and annotated the Cimex lectularius genome as part of the i5k pilot project. The C. lectularius research community has manually reviewed and curated the computational gene predictions and generated an official gene set, OGSv1.2. This dataset presents the Cimex lectularius Official Gene Set (OGS) v1.2. The OGS is an integration of automatic gene predictions from MAKER with manual annotations by the research community.

Drosophila eugracilis genome annotations v0.5.3 for genome assembly Deug05112011

    This research on Drosophila eugracilis genomics is part of the Drosophila modENCODE project. The Baylor College of Medicine is studying the comparative genomics of eight species of Drosophila: biarmipes, bipectinata, elegans, eugracilis, ficusphila, kikkawai, rhopaloa, and takahashii. RNA-Seq data were used with additional protein homology data for a MAKER automated annotation of the Drosophila eugracilis genome assembly Deug05112011. This gene set is an unstable pre-release (v0.5.3) and is provided to facilitate manual curation and analyses. Gene identifiers from this gene set will not be maintained.

Drosophila ficusphila genome annotations v0.5.3 for genome assembly Dfic02082011

    This research on Drosophila ficusphila genomics is part of the Drosophila modENCODE project. The Baylor College of Medicine is studying the comparative genomics of eight species of Drosophila: biarmipes, bipectinata, elegans, eugracilis, ficusphila, kikkawai, rhopaloa, and takahashii. RNA-Seq data were used with additional protein homology data for a MAKER automated annotation of the Drosophila ficusphila genome assembly Dfic02082011. This gene set is an unstable pre-release (v0.5.3) and is provided to facilitate manual curation and analyses. Gene identifiers from this gene set will not be maintained.