These data are for the Pea Single Plant Plus Collection (PSPPC) and the PSPPC augmented with 25 *P. fulvum *accessions.
The 6 datasets can be divided into two groups. Group 1 consists of 3 datasets labeled “PSPPC” which refer to SNP data pertaining to the USDA Pea Single Plant Plus Collection. Group 2 consists of 3 datasets labeled “PSPPC + P. fulvum” which refer to SNP data pertaining to the PSPPC with 25 accessions of *Pisum fulvum *added. SNPs for each of these groups were called independently; therefore any SNP name that is shared between the PSPPC and PSPPC + *P. fulvum *groups should NOT be assumed to refer to the same locus.
For analysis, SNP data is available in two widely used formats: hapmap and vcf. These files were successfully loaded into the standalone version of TASSEL v. 5.2.25 (http://www.maizegenetics.net/tassel).
Explanations of fields (columns) in the VCF files are contained within commented (##) rows at the top of the file.
The first 11 columns required for the hapmap format are as follows:
rs#- Name of locus (i.e. SNP name)
alleles- Indicates the SNPs for each allele at the locus
chrom- N/A, since markers are unordered.
pos- N/A, since markers are unordered.
strand- N/A, since markers are unordered
assembly#- N/A
center- N/A
protLSID- N/A
assayLSID- N/A
panel- N/A
QCcode- N/A
The fasta sequences containing the SNPs are also available here for such downstream applications as development of primers for platform-specific markers.