Halyomorpha halys Official Gene Set v1.2

This dataset presents the Halyomorpha halys Official Gene Set (OGS) v1.2. OGSv1.2 updates Halyomorpha halys OGSv1.1 (https://doi.org/10.15482/USDA.ADC/1504240) to the coordinates of genome assembly GCA_000696795.3 (https://www.ncbi.nlm.nih.gov/assembly/GCA_000696795.3). The conversion was performed with https://github.com/NAL-i5K/coordinates_conversion/.

The original OGSv1.0 integrates automatic gene predictions from NCBI's eukaryotic annotation pipeline, NCBI Halyomorpha halys Annotation Release 100 (https://www.ncbi.nlm.nih.gov/genome/annotation_euk/Halyomorpha_halys/100/; ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/696/795/GCF_000696795.1_H...), with manual annotations contributed by the research community using the Apollo manual curation software (http://genomearchitect.org/). The community annotations were downloaded from Apollo, quality-checked, and merged with NCBI Halyomorpha halys Annotation Release 100 using the GFF3toolkit software (https://github.com/NAL-i5K/GFF3toolkit/releases/tag/v1.4.4). The merged dataset was then formatted for ingest into the i5k Workspace and GenBank databases, yielding Halyomorpha halys Official Gene Set (OGS) v1.0.

Halyomorpha halys Official Gene Set halhal_OGSv1.1 is a minor update of halhal_OGSv1.0: Alias attributes were added to all manually annotated cathepsin models; six models on contaminated scaffolds were removed; and notes were added to three models located on possibly contaminated scaffolds.

Release Date
Spatial / Geographical Coverage Area
POLYGON ((-172.96875 -85.973919490277, -172.96875 85.513398309887, 194.0625 85.513398309887, 194.0625 -85.973919490277))
Ag Data Commons
Temporal Coverage
January 1, 2019
Contact Name
Sparks, Michael
Contact Email
Public Access Level
Program Code
005:040 - Department of Agriculture - National Research
Bureau Code
005:18 - Agricultural Research Service
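
The Spatial / Geographical Coverage Area value above is a WKT polygon. As a minimal sketch, assuming the value is a single-ring POLYGON with coordinates in longitude-latitude order, the following Python reads the ring and derives its bounding box. The function names `parse_polygon_ring` and `bounding_box` are hypothetical helpers introduced for illustration; they are not part of the Ag Data Commons or DKAN.

```python
import re

# WKT value copied verbatim from the "Spatial / Geographical Coverage Area" field.
WKT = (
    "POLYGON ((-172.96875 -85.973919490277, -172.96875 85.513398309887, "
    "194.0625 85.513398309887, 194.0625 -85.973919490277))"
)


def parse_polygon_ring(wkt):
    """Return the ring of a single-ring WKT POLYGON as (lon, lat) tuples.

    Assumption: coordinates are in longitude-latitude order.
    """
    match = re.fullmatch(r"\s*POLYGON\s*\(\((.*)\)\)\s*", wkt)
    if match is None:
        raise ValueError("not a single-ring WKT POLYGON")
    ring = []
    for pair in match.group(1).split(","):
        lon, lat = pair.split()
        ring.append((float(lon), float(lat)))
    return ring


def bounding_box(ring):
    """Return (min_lon, min_lat, max_lon, max_lat) for a list of (lon, lat)."""
    lons = [lon for lon, _ in ring]
    lats = [lat for _, lat in ring]
    return min(lons), min(lats), max(lons), max(lats)


ring = parse_polygon_ring(WKT)
print(bounding_box(ring))
```

Two things are worth noting if this value is passed to a stricter geometry library. The ring lists four corners without repeating the first point at the end, so a parser that requires closed rings may reject it until the first coordinate is appended again. The eastern longitude (194.0625) is also greater than 180, which suggests the extent was drawn across the antimeridian and is best read as approximately global coverage.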