Gibpss

Z MetaCentrum
Skočit na navigaci Skočit na vyhledávání

Description

GIbPSs is a package for the analysis of GBS (genotyping-by-sequencing) and RAD (restriction site associated DNA) sequence data. It handles paired-end or single reads from protocols such as GBS, two-enzyme GBS, and ddRAD. It can also analyze forward reads from RAD libraries. Its main purpose is genotyping of individuals from diploid organisms without a reference genome for population genetic and phylogeographic analyses. GIbPSs is written in Perl 5 and runs under Linux and Windows. I have developed it for the analysis of sequence data from "classical" GBS libraries (Elshire et al. 2011). This technique uses a single restriction enzyme. Reads of a locus originate from both strands and display full, partial, or no overlap. This can lead to duplicate locus identification when a reference genome is not available. GIbPSs solves this problem through reverse complement clustering of paired sequence reads. GIbPSs can handle sequences of different lengths and analyze short restriction fragments in adapter-containing sequences. GIbPSs employs a rapid genotyping strategy that eliminates errors due to indel variation without the need for time-consuming multiple alignments. It identifies loci with indel variation with the aid of USEARCH (Edgar, R. C. 2010) and excludes them from the final dataset. GIbPSs stores genotype data in a database that offers multiple options for data exploration, quality filtering, data selection, and export. It produces various data formats for population genetic and phylogenetic programs. Examples are STRUCTURE, Genepop, Arlequin, and files in FASTA, Nexus, and Phylip format that contain DNA characters, integers, or binary data. GIbPSs can export one SNP per locus, encode alleles based on all SNPs of each locus or export full allele sequences.

License

GIbPSs is free software, GNU GPLv3

Usage

Upcoming modulesystem change alert!

Due to large number of applications and their versions it is not practical to keep them explicitly listed at our wiki pages. Therefore an upgrade of modulefiles is underway. A feature of this upgrade will be the existence of default module for every application. This default choice does not need version number and it will load some (usually latest) version.

You can test the new version now by adding a line

source /cvmfs/software.metacentrum.cz/modulefiles/5.1.0/loadmodules

to your script before loading a module. Then, you can list all versions of gibpss and load default version of gibpss as

module avail gibpss/ # list available modules
module load gibpss   # load (default) module


If you wish to keep up to the current system, it is still possible. Simply list all modules by

module avail gibpss

and choose explicit version you want to use.

Available modules are:

createstdat_03.0.pl
data_selector_16.0.pl
depth_analyzer_09.0.pl
export_gt_13.0.pl
export_seq_03.0.pl
export_svars_02.0.pl
export_tables_07.0.pl
fdm_07.0.pl
indel_checker_03.0.pl
indloc_23.0.pl
indpoploc_06.0.pl
pair_finder_07.0.pl
phredi_08.0.pl 
phred_pos_filter_07.0.pl
poploc_09.0.pl

usearch11
Use all .pl scripts with a full path prefix perl /software/gibpss/1.0.2/source/ , ie. perl /software/gibpss/1.0.2/source/createstdat_03.0.pl

Documentation

Documentation is in the folder /software/gibpss/1.0.2/docu

Homepage

https://github.com/ahapke/gibpss