Ag Data Commons
Browse
halhal_OGSv1.2.tar.gz (43.11 MB)

Halyomorpha halys Official Gene Set v1.2

Download (43.11 MB)
Version 2 2024-01-29, 18:12
Version 1 2024-01-25, 22:14
dataset
posted on 2024-01-29, 18:12 authored by Michael SparksMichael Sparks, Adelaide Rhodes, Alexander Martynov, Arun Velamuri, Joshua B. Benoit, Debora Pires Paula, David R. Nelson, Markus Friedrich, Christopher J. Holmes, Hugh M. Robertson, Joshua Rhoades, Stephen Richards, Jack Scanlan, Kristen A. Panfilio, Leslie Pick, Andrew J. Rosendale, Jackson B. Wells, Monica Poelchau, Dawn E. Gundersen-Rindal

This dataset presents the Halyomorpha halys Official Gene Set (OGS) v1.2. OGSv1.2 is an update of Halyomorpha halys OGSv1.1 (https://doi.org/10.15482/USDA.ADC/1504240) to the coordinates of genome assembly GCA_000696795.3 (https://www.ncbi.nlm.nih.gov/assembly/GCA_000696795.3) using https://github.com/NAL-i5K/coordinates_conversion/.

The original OGSv1.0 is an integration of automatic gene predictions from NCBI's eukaryotic annotation pipeline, NCBI Halyomorpha halys Annotation Release 100 (https://www.ncbi.nlm.nih.gov/genome/annotation_euk/Halyomorpha_halys/100/; ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/696/795/GCF_000696795.1_Hhal_1.0), with manual annotations by the research community (performed via the Apollo manual curation software, http://genomearchitect.org/). Manual annotations performed by the community were downloaded from Apollo, QC'd, and merged with NCBI Halyomorpha halys Annotation Release 100 using the GFF3toolkit software (https://github.com/NAL-i5K/GFF3toolkit/releases/tag/v1.4.4). The resulting merged dataset was formatted for ingest into the i5k Workspace and GenBank databases, resulting in Halyomorpha halys Official Gene Set (OGS) v1.0.

Halyomorpha Official Gene Set halhal_OGSv1.1 is a minor update of halhal_OGSv1.0: Alias attributes were added to all manually annotated cathepsin models; six models from contaminated scaffolds were removed; and notes were added to 3 models located on possibly contaminated scaffolds.


Resources in this dataset:

  • Resource Title: Halymorpha halys Official Gene Set OGSv1.2.

    File Name: halhal_OGSv1.2.tar.gz

    Resource Description: The attached tar.gz archive (halhal_OGSv1.2.tar.gz) contains the following files:

    halhal_OGSv1.2.gff. Gff3 of all gene predictions of Halymorpha halys genome annotations OGSv1.2 halhal_OGSv1.2_CDS.fa. CDS sequences of Halymorpha halys genome annotations OGSv1.2 halhal_OGSv1.2_pep.fa. Amino acid sequences of Halymorpha halys genome annotations OGSv1.2 halhal_OGSv1.2_trans.fa. Transcript sequences of Halymorpha halys genome annotations OGSv1.2 readme. Readme file describing Halymorpha halys genome annotations OGSv1.2

Funding

National Human Genome Research Institute: U54 HG003273

History

Data contact name

Sparks, Michael

Data contact email

Michael.Sparks2@USDA.GOV

Publisher

Ag Data Commons

Temporal Extent Start Date

2019-01-01

Theme

  • Not specified

Geographic Coverage

{"type":"FeatureCollection","features":[{"geometry":{"type":"Polygon","coordinates":[[[-172.96875,-85.973919490277],[-172.96875,85.513398309887],[194.0625,85.513398309887],[194.0625,-85.973919490277],[-172.96875,-85.973919490277]]]},"type":"Feature","properties":{}}]}

ISO Topic Category

  • biota

Ag Data Commons Group

  • Insects - i5K

National Agricultural Library Thesaurus terms

genomics; Halyomorpha halys; genes; prediction; genome assembly; sequence analysis; genome

OMB Bureau Code

  • 005:18 - Agricultural Research Service

OMB Program Code

  • 005:040 - National Research

Pending citation

  • No

Public Access Level

  • Public

Preferred dataset citation

Sparks, Michael; Rhodes, Adelaide; Martynov, Alexander; Velamuri, Arun; Benoit, Joshua B.; Pires Paula, Debora; Nelson, David R.; Friedrich, Markus; Holmes, Christopher J.; Robertson, Hugh M.; Rhoades, Joshua; Richards, Stephen; Scanlan, Jack; Panfilio, Kristen A.; Pick, Leslie; Rosendale, Andrew J.; Wells, Jackson B.; Poelchau, Monica; Gundersen-Rindal, Dawn E. (2020). Halyomorpha halys Official Gene Set v1.2. Ag Data Commons. https://doi.org/10.15482/USDA.ADC/1518751

Usage metrics

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC