i5k Workspace

About the i5k Workspace@NAL

The i5k Workspace (https://i5k.nal.usda.gov) is an inclusive genome portal for any arthropod genome project that would like to make use of our resources. We provide download services, BLAST, the JBrowse genome browser, and the Apollo manual curation service. Over 50 arthropod genomes are now part of the i5k Workspace, and users are encouraged to browse the genomes that we host, and contribute to the curation of each genome. For more information about the i5k Workspace, you can read our paper on the i5k Workspace, view our posters and talks, and find our software projects on github. The Ag Data Commons is now hosting a growing number of i5k Workspace datasets.

About the i5k initiative

The i5k initiative is a transformative project that aims to sequence and analyze the genomes of 5,000 arthropod species. The National Agricultural Library has partnered with the i5k initiative to create the i5k Workspace@NAL, which serves any ‘orphaned’ arthropod genome project's hosting needs. For more information about the i5k initiative, read the paper and visit the website.

Filter by Author Name

i5k Datasets

25 datasets

Oncopeltus fasciatus Official Gene set v1.1

Oncopeltus fasciatus genome was recently sequenced and annotated as part of the i5k pilot project by the Baylor College of Medicine.
The O. fasciatus research community has manually reviewed and curated the computational gene predictions and generated an official gene set, OGSv1.1. This dataset presents the Oncopeltus fasciatus Official Gene Set (OGS) v1.1. The OGS is an integration of automatic gene predictions from Maker (done by Dan Hughes at Baylor) with manual annotations by the research community (done via Web Apollo).

insects 5000 program

Trichogramma pretiosum genome annotations v0.5.3

This dataset presents the Trichogramma pretiosum gene set BCM_v_0.5.3. RNA-Seq data was used with additional protein homology data for a MAKER automated annotation of the Trichogramma pretiosum genome assembly 1.0.

insects 5000 program

Trichogramma pretiosum genome assembly v1.0

The Baylor College of Medicine has sequenced and annotated the Trichogramma pretiosum genome as part of the i5k pilot project. This dataset presents the Trichogramma pretiosum genome v1.0. This assembly version is the pre-release version, prior to filtering and quality control by the National Center for Biotechnology Information's GenBank resource. Scaffold 109 was identified to be a complete Wolbachia genome, and was removed from the assembly on GenBank and relocated to a separate record (https://www.ncbi.nlm.nih.gov/nuccore/NZ_CM003641.1).

insects 5000 program

Anoplophora glabripennis genome annotations v0.5.3

This dataset presents the Anoplophora glabripennis gene set BCM_v_0.5.3. RNA-Seq data was used with additional protein homology data for a MAKER automated annotation of the Anoplophora glabripennis genome assembly 1.0.

insects 5000 program

Anoplophora glabripennis Official Gene Set OGSv1.2

The Anoplophora glabripennis genome was recently sequenced, assembled and annotated as part of the i5k pilot project by the Baylor College of Medicine, in collaboration with the McKenna Laboratory at the University of Memphis. The Anoplophora glabripennis research community has manually reviewed and curated the computational gene predictions and generated an official gene set, OGSv1.2. OGSv1.2 was generated by merging gene set AGLA-c0.5.3-Models generated by the Baylor College of Medicine, and community-curated models in the Apollo software, after QC of the Apollo output.

insects 5000 program

Pachypsylla venusta genome annotations v0.5.3

This dataset presents the Pachypsylla venusta gene set BCM_v_0.5.3. RNA-Seq data was used with additional protein homology data for a MAKER automated annotation of the Pachypsylla venusta genome assembly 1.0.

insects 5000 program

Orussus abietinus genome annotations v0.5.3

This dataset presents the Orussus abietinus gene set BCM_v_0.5.3. RNA-Seq data was used with additional protein homology data for a MAKER automated annotation of the Orussus abietinus genome assembly 1.0.

insects 5000 program

Athalia rosae genome annotations v0.5.3

This dataset presents the Athalia rosae gene set BCM_v_0.5.3. RNA-Seq data was used with additional protein homology data for a MAKER automated annotation of the Athalia rosae genome assembly 1.0.

insects 5000 program

Cimex lectularius Genome Annotations v0.5.3

The Baylor College of Medicine recently sequenced and annotated the Cimex lectularius genome as part of the i5k pilot project. The C. lectularius research community has manually reviewed and curated the computational gene predictions and generated an official gene set, OGSv1.2. This dataset presents the Cimex lectularius gene set BCM_v_0.5.3, which was generated computationally. RNA-Seq data was used with additional protein homology data for a MAKER automated annotation of the Cimex lectularius genome assembly 1.0.

insects 5000 program

Cimex Lectularius Official Gene Set v1.1

The Baylor College of Medicine recently sequenced and annotated the Cimex lectularius genome as part of the i5k pilot project. The C. lectularius research community has manually reviewed and curated the computational gene predictions and generated an official gene set, OGSv1.2. This dataset presents the Cimex lectularius Official Gene Set (OGS) v1.1. The OGS is an integration of automatic gene predictions from MAKER with manual annotations by the research community.

insects 5000 program