Halyomorpha halys Official Gene Sets v1.0 and v1.1

The Baylor College of Medicine sequenced and annotated the Halyomorpha halys genome as part of the i5k pilot project. The H. halys research community has manually reviewed and curated the computational gene predictions and generated official gene sets, halhal_OGSv1.0 and halhal_OGSv1.1.

halhal_OGSv1.0 merges automatic gene predictions from NCBI's eukaryotic annotation pipeline, NCBI Halyomorpha halys Annotation Release 100 (https://www.ncbi.nlm.nih.gov/genome/annotation_euk/Halyomorpha_halys/100/; ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/696/795/GCF_000696795.1_H...), with manual annotations made by the research community using the Apollo manual curation software (http://genomearchitect.org/). The community's manual annotations were downloaded from Apollo, quality-checked, and merged with NCBI Halyomorpha halys Annotation Release 100 using the GFF3toolkit software (https://github.com/NAL-i5K/GFF3toolkit/releases/tag/v1.4.4). The merged dataset was then formatted for ingest into the i5k Workspace and GenBank databases, yielding Halyomorpha halys Official Gene Set (OGS) v1.0.

Halyomorpha halys Official Gene Set halhal_OGSv1.1 is a minor update of halhal_OGSv1.0: Alias attributes were added to all manually annotated cathepsin models, six models from contaminated scaffolds were removed, and notes were added to three models located on possibly contaminated scaffolds.
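
For readers who want to inspect the Alias attributes mentioned above, the following is a minimal sketch of listing GFF3 features that carry an Alias attribute. It assumes the gene set is distributed as a standard GFF3 file with attributes in column 9. The function names and the example records are hypothetical and are not taken from the halhal_OGSv1.1 files.

```python
from urllib.parse import unquote


def parse_attributes(column9):
    """Parse a GFF3 attributes column into a dict of tag -> list of values."""
    attrs = {}
    for field in column9.strip().split(";"):
        if not field:
            continue
        tag, _, value = field.partition("=")
        # Multiple values are comma-separated; reserved characters are percent-encoded.
        attrs[tag] = [unquote(v) for v in value.split(",")]
    return attrs


def models_with_alias(gff3_lines, feature_types=("gene", "mRNA")):
    """Yield (ID, aliases) for features of the given types that have an Alias attribute."""
    for line in gff3_lines:
        if line.startswith("##FASTA"):
            break  # sequence section follows; no more feature lines
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.rstrip("\n").split("\t")
        if len(cols) != 9 or cols[2] not in feature_types:
            continue
        attrs = parse_attributes(cols[8])
        if "Alias" in attrs:
            yield attrs.get("ID", [""])[0], attrs["Alias"]


# Made-up records for illustration only; not real halhal_OGSv1.1 content.
example = [
    "##gff-version 3",
    "scaffold_1\texample\tgene\t100\t900\t.\t+\t.\tID=gene1;Name=hypothetical;Alias=CathepsinX",
    "scaffold_1\texample\tgene\t1500\t2500\t.\t-\t.\tID=gene2;Name=other",
]
result = list(models_with_alias(example))
print(result)
```

To run this against a real file, pass an open file handle in place of the example list, since the function only iterates over lines.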

Release Date
Spatial / Geographical Coverage Area
POLYGON ((-168.75 -84.011134538754, -168.75 84.9283209295, 195.46875 84.9283209295, 195.46875 -84.011134538754))
Ag Data Commons
Temporal Coverage
January 1, 2019
Contact Name
Sparks, Michael
Contact Email
Public Access Level
Program Code
005:040 - Department of Agriculture - National Research
Bureau Code
005:18 - Agricultural Research Service