Blattella germanica Official Gene Set OGSv1.0
The Blattella germanica genome was recently sequenced and annotated as part of the i5k pilot project by the Baylor College of Medicine. The Blattella germanica research community has manually reviewed and curated the computational gene predictions and generated an official gene set, OGSv1.0. The general procedure for generating this OGS is outlined here: https://github.com/NAL-i5K/GFF3toolkit/. QC of community-curated models from the Apollo software was performed using the GFF3toolkit function gff3_QC.py, and errors were fixed using gff3_fix.py. OGSv1.0 was generated by merging the gene set BGER_v0.6.2 (provided by E. Jongepier, doi:10.1038/s41559-017-0459-1) with 1) QC'd and error-corrected community-curated models and 2) semi-automatically predicted miRNAs (references: doi:10.1038/srep37736 and doi.org/10.1186/s12864-017-4177-5) . Subsequently, i5k Workspace (https://i5k.nal.usda.gov/) IDs were generated for all features.
Resources in this dataset:
Resource Title: Blattella germanica Official Gene Set OGSv1.0.
File Name: BGER_OGSv1-0.tar.gz
Resource Description: The attached tar.gz archive (BGER_OGSv1-0.tar.gz) contains the following files: * BGER_OGSv1-0_CDS.fa. CDS sequences of Blattella germanica genome annotations OGSv1.0. * BGER_OGSv1-0_pep.fa. Amino acid sequences of Blattella germanica genome annotations OGSv1.0. * BGER_OGSv1-0_trans.fa. Transcript sequences of Blattella germanica genome annotations OGSv1.0. * BGER_OGSv1-0.gff. Gff3 of all gene predictions of Blattella germanica genome annotations OGSv1.0 * BGER_OGSv1-0_idmap.txt. A mapping file describing ID and name updates from dataset Blattella germanica genome annotations v0.5.3. * readme. This file briefly describes how the dataset Blattella germanica Official Gene Set OGSv1.0 was generated.
Funding
Deutsche Forschungsgemeinschaft: BO2544/11-1
U.S. Department of Housing and Urban Development: NCHHU-0017-13
National Science Foundation: IOS-1557864
Alfred P. Sloan Foundation: 2013-5-35 MBE
History
Data contact name
Harrison, MarkData contact email
m.harrison@uni-muenster.dePublisher
Ag Data CommonsTheme
- Not specified
ISO Topic Category
- biota
Ag Data Commons Group
- Insects - i5K
National Agricultural Library Thesaurus terms
genomics; Blattella germanica; genes; prediction; models; microRNAPending citation
- No
Public Access Level
- Public