U.S. flag

An official website of the United States government

Oncopeltus fasciatus hybrid genome assembly 1.0

    The milkweed bug, *Oncopeltus fasciatus*, was sequenced as part of the i5k pilot project from Baylor College of Medicine (Illumina data). To augment those resources, we present here a hybrid genome assembly with low coverage PacBio data, assembled with PBJelly: the *Oncopeltus fasciatus* Hybrid Genome Assembly v1.0.

    Cephus cinctus Official Gene Set OGSv1.0 and OGSv1.1

      This Official Gene Set is an integration ([NCBI Cephus cinctus Annotation Release 101](https://www.ncbi.nlm.nih.gov/genome/annotation_euk/Cephus_cinctus/101/)) from NCBI's eukaryotic annotation pipeline v8.0 with manual annotations by the research community (done via the Apollo manual annotation software). QC and Merge of the dataset was performed using the GFF3toolkit software ([https://github.com/NAL-i5K/GFF3toolkit](https://github.com/NAL-i5K/GFF3toolkit)).

      Orussus abietinus mitochondrial genome assembly

        The Baylor College of Medicine has sequenced and annotated the Orussus abietinus genome as part of the i5k pilot project. This dataset represents a targeted assembly and annotation of the mitochondrial genome.

        Athalia rosae mitochondrial genome assembly

          The Baylor College of Medicine has sequenced and annotated the Athalia rosae genome as part of the i5k pilot project. This dataset represents a separate targeted assembly of the mitochondrial genome.

          Orussus abietinus Official Gene Set OGSv1.0

            The Orussus abietinus genome was recently sequenced and annotated as part of the i5k pilot project by the Baylor College of Medicine. The Orussus abietinus research community has manually reviewed and curated the computational gene predictions and generated an official gene set, OGSv1.0. The general procedure for generating this OGS is outlined here: https://github.com/NAL-i5K/I5KNAL_OGS/wiki. OGSv1.0 was generated by merging gene set OABI-V0.5.3-Models generated by the Baylor College of Medicine, and community-curated models in the Apollo software, after QC of the Apollo output.

            Athalia rosae Official Gene Set OGSv1.0

              The Athalia rosae genome was recently sequenced and annotated as part of the i5k pilot project by the Baylor College of Medicine. The Athalia rosae research community has manually reviewed and curated the computational gene predictions and generated an official gene set, OGSv1.0. The general procedure for generating this OGS is outlined here: https://github.com/NAL-i5K/I5KNAL_OGS/wiki. OGSv1.0 was generated by merging gene set AROS-V0.5.3-Models generated by the Baylor College of Medicine, and community-curated models in the Apollo software, after QC of the Apollo output.

              Leptinotarsa decemlineata Official Gene set v1.2

                The Leptinotarsa decemlineata genome was recently sequenced and annotated as part of the i5k pilot project by the Baylor College of Medicine. The L. decemlineata research community has manually reviewed and curated the computational gene predictions and generated an official gene set, OGSv1.2. OGSv1.1 is an integration of automatic gene predictions from Maker (performed by Dan Hughes at Baylor College of Medicine) with manual annotations by the research community (done via the Apollo manual annotation software). The coordinates of OGSv1.1 were converted to the latest genome assembly, GCF_000500325.1, using coordinates_conversion and remap-gff3, to generate OGSv1.2.

                Anoplophora glabripennis genome annotations v0.5.3

                  This dataset presents the Anoplophora glabripennis gene set BCM_v_0.5.3. RNA-Seq data was used with additional protein homology data for a MAKER automated annotation of the Anoplophora glabripennis genome assembly 1.0.

                  Lucilia cuprina genome annotations v0.5.3

                    This dataset presents the Lucilia cuprina gene set BCM_v_0.5.3. RNA-Seq data was used with additional protein homology data for a MAKER automated annotation of the Lucilia cuprina genome assembly 1.0. This dataset is free for all use.