Functional annotation for 15 diverse arthropod genomes

The general approach for functional annotation is to combine GO annotations transferred on the basis of sequence homology (e.g., BLAST) with information about functional motifs (e.g., derived from resources such as Pfam). Gene products are then mapped to metabolic and signalling pathways based on sequence homology or orthology.
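As a rough illustration of the combining step described above, the sketch below merges GO terms from two evidence sources into one record per protein and notes which source supports each term. It is not the pipeline's own code: the function name `combine_go_annotations`, the dictionary-based input layout, and the protein IDs are all assumptions made for this example.

```python
from collections import defaultdict


def combine_go_annotations(homology_hits, motif_hits):
    """Merge per-protein GO terms from two evidence sources.

    Both arguments map a protein ID to an iterable of GO term IDs
    (a hypothetical layout chosen for this sketch).
    Returns {protein_id: {go_term: sorted list of supporting sources}}.
    """
    combined = defaultdict(lambda: defaultdict(set))
    for source, hits in (("homology", homology_hits), ("motif", motif_hits)):
        for protein_id, go_terms in hits.items():
            for term in go_terms:
                combined[protein_id][term].add(source)
    # Convert nested defaultdicts and sets into plain dicts and sorted lists.
    return {
        protein_id: {term: sorted(sources) for term, sources in terms.items()}
        for protein_id, terms in combined.items()
    }


# Illustrative input only: protein IDs and term assignments are made up.
homology = {"prot1": ["GO:0005524", "GO:0004672"], "prot2": ["GO:0003677"]}
motif = {"prot1": ["GO:0004672"], "prot3": ["GO:0016020"]}

annotations = combine_go_annotations(homology, motif)
for protein_id in sorted(annotations):
    print(protein_id, annotations[protein_id])
```

Keeping the evidence source next to each term makes it possible to see later whether a term is supported by homology, by a motif match, or by both.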

We present functional annotation results for 15 arthropod proteomes, generated with an open-source, open-access, containerized pipeline for genome-scale functional annotation of insect proteomes and applied here to a diverse range of arthropod species. More information about the pipeline is available at our readthedocs site. The files for each genome include GOanna, InterProScan and KOBAS predictions.

Arthropod genomes selected for this study and their assembly and annotation statistics.

  1. Apis mellifera (honey bee)
  2. Drosophila melanogaster (fruit fly)
  3. Tribolium castaneum (red flour beetle)
  4. Latrodectus hesperus (western black widow spider)
  5. Limnephilus lunatus (caddisfly)
  6. Oncopeltus fasciatus (large milkweed bug)
  7. Homalodisca vitripennis (glassy-winged sharpshooter)
  8. Eurytemora affinis (calanoid copepod)
  9. Agrilus planipennis (emerald ash borer)
  10. Copidosoma floridanum (parasitoid wasp)
  11. Athalia rosae (turnip sawfly)
  12. Ceratitis capitata (Mediterranean fruit fly)
  13. Cimex lectularius (Cimicidae bed bug)
  14. Varroa destructor (parasitic mite)
  15. Diaphorina citri (Asian citrus psyllid)
Release Date
Spatial / Geographical Coverage Area
POLYGON ((-125.33203125 30.654452824401, -125.33203125 48.848450835898, -74.35546875 48.848450835898, -74.35546875 30.654452824401))
Ag Data Commons
Temporal Coverage
July 6, 2021
Contact Name
Saha, Surya
Contact Email
Public Access Level
Program Code
005:040 - Department of Agriculture - National Research
Bureau Code
005:18 - Agricultural Research Service