Ag Data Commons
Browse

File(s) not publicly available

Data and code from: Identification of a key target for elimination of nitrous oxide, a major greenhouse gas

dataset
posted on 2023-11-30, 11:40 authored by Blake A. Oakley, Quentin ReadQuentin Read, Scott E. Gold, Anthony E. Glenn

Note: Data files will be made available upon manuscript publication

This dataset contains all code and data needed to reproduce the analyses in the manuscript:

IDENTIFICATION OF A KEY TARGET FOR ELIMINATION OF NITROUS OXIDE, A MAJOR GREENHOUSE GAS.
Blake A. Oakley (1), Trevor Mitchell (2), Quentin D. Read (3), Garrett Hibbs (1), Scott E. Gold (2), Anthony E. Glenn (2)

  1. Department of Plant Pathology, University of Georgia, Athens, GA, USA.
  2. Toxicology and Mycotoxin Research Unit, U.S. National Poultry Research Center, United States Department of Agriculture-Agricultural Research Service, Athens, GA, USA
  3. Southeast Area, United States Department of Agriculture-Agricultural Research Service, Raleigh, NC, USA

citation will be updated upon acceptance of manuscript

Brief description of study aims

Denitrification is a chemical process that releases nitrous oxide (N2O), a potent greenhouse gas. The NOR1 gene is part of the denitrification pathway in Fusarium. Three experiments were conducted for this study. (1) The N2O comparative experiment compares denitrification rates, as measured by N2O production, of a variety of Fusarium spp. strains with and without the NOR1 gene. (2) The N2O substrate experiment compares denitrification rates of selected strains on different growth media (substrates). For parts 1 and 2, linear models are fit comparing N2O production between strains and/or substrates. (3) The Bioscreen growth assay tests whether there is a pleiotropic effect of the NOR1 gene. In this portion of the analysis, growth curves are fit to assess differences in growth rate and carrying capacity between selected strains with and without the NOR1 gene.

Code

All code is included in a .zip archive generated from a private git repository on 2022-10-13 and archived as part of this dataset.

The code is contained in R scripts and RMarkdown notebooks. There are two components to the analysis: the denitrification analysis (comprising parts 1 and 2 described above) and the Bioscreen growth analysis (part 3). The scripts for each are listed and described below.

Analysis of results of denitrification experiments (parts 1 and 2)

  • NOR1_denitrification_analysis.Rmd: The R code to analyze the experimental data comparing nitrous oxide emissions is all contained in a single RMarkdown notebook. This script analyzes the results from the comparative study and the substrate study.
  • n2o_subgroup_figures.R: R script to create additional figures using the output from the RMarkdown notebook

Analysis of results of Bioscreen growth assay (part 3)

  • bioscreen_analysis.Rmd: This RMarkdown notebook contains all R code needed to analyze the results of the Bioscreen assay comparing growth of the different strains. It could be run as is. However, the model-fitting portion was run on a high-performance computing cluster with the following scripts:
    • bioscreen_fit_simpler.R: R script containing only the model-fitting portion of the Bioscreen analysis, fit using the Stan modeling language interfaced with R through the brms and cmdstanr packages.
    • job_bssimple.sh: Job submission shell script used to submit the model-fitting R job to be run on USDA SciNet high-performance computing cluster.

Additional scripts developed as part of the analysis but that are not required to reproduce the analyses in the manuscript are in the deprecated/ folder.

Also note the files nor1-denitrification.Rproj (RStudio project file) and gtstyle.css (stylesheet for formatting the tables in the notebooks) are included.

Data

Data required to run the analysis scripts are archived in this dataset, other than strain_lookup.csv, a lookup table of strain abbreviations and full names included in the code repository for convenience. They should be placed in a folder or symbolic link called project within the unzipped code repository directory.

  • N2O_data_2022-08-03/N2O_Comparative_Study_Trial_(n)_(date range).xlsx: These are the data from the N2O comparative study, where n is the trial number from 1-3 and date range is the begin and end date of the trial.
  • N2O_data_2022-08-03/Nitrogen_Substrate_Study_Trial_(n)_(date range).xlsx: These are the data from the N2O substrate study, where n is the trial number from 1-3 and date range is the begin and end date of the trial.
  • Outliers_NOR1_2022/Bioscreen_NOR1_Fungal_Growth_Assay_(substrate)_(oxygen level)_Outliers_BAO_(date).xlsx: These are the raw Bioscreen data files in MS Excel format. The format of each file name includes the substrate (minimal medium with nitrite or nitrate and lysine), oxygen level (hypoxia or normoxia), and date of the run. This repository includes code to process these files, but the processed data are also included on Ag Data Commons, so it is not necessary to run the data processing portion of the code.
  • clean_data/bioscreen_clean_data.csv: This is an intermediate output file in CSV format generated by bioscreen_analysis.Rmd. It includes all the data from the Bioscreen assays in a clean analysis-ready format.

Funding

Agricultural Research Service, 6040-42000-046-000D

History

Data contact name

Read, Quentin

Data contact email

quentin.read@usda.gov

Publisher

Ag Data Commons

Intended use

This dataset is intended to allow reproducing all analyses presented in the above-cited manuscript.

Use limitations

The code included in this dataset is only designed to work with the input data provided and would need to be modified if running similar analyses on different input data.

Temporal Extent Start Date

2021-12-08

Temporal Extent End Date

2022-03-08

Theme

  • Not specified

Geographic Coverage

{"type":"FeatureCollection","features":[{"geometry":{"type":"Point","coordinates":[-83.3563255,33.928033]},"type":"Feature","properties":{}}]}

Geographic location - description

Athens, Georgia, USA

ISO Topic Category

  • environment
  • farming

National Agricultural Library Thesaurus terms

nitrous oxide; greenhouse gases; data collection; denitrification; greenhouse gas emissions; comparative study; models; USDA; oxygen; lysine; hypoxia; normoxia; silver; information processing; nitrogen; sodium nitrite; sodium nitrate; air pollution control; bioassays; microbial growth; culture media; growth curves; computer software; Fusarium verticillioides; Fusarium oxysporum f. sp. vasinfectum; Fusarium graminearum; plant pathogenic fungi; mutants; strains

OMB Bureau Code

  • 005:18 - Agricultural Research Service

OMB Program Code

  • 005:040 - National Research

ARS National Program Number

  • 108

Pending citation

  • No

Public Access Level

  • Public

Preferred dataset citation

Oakley, Blake A.; Read, Quentin D.; Gold, Scott E.; Glenn, Anthony E. (2022). Data and code from: Identification of a key target for elimination of nitrous oxide, a major greenhouse gas. Ag Data Commons. https://doi.org/10.15482/USDA.ADC/1528134