BashGATK

0

Описание

GATK pipelines on Bash

Языки

  • Python56,1%
  • Shell42,9%
  • R1%
22 дня назад
3 месяца назад
месяц назад
месяц назад
10 дней назад
2 месяца назад
2 месяца назад
6 месяцев назад
5 месяцев назад
6 месяцев назад
2 месяца назад
README.md

BashGATK

Bash implementation of GATK-based analysis, designed specifically for paired targeted NGS.

Installation

Install GNU parallel (please reference the original work):

Install Docker and DeepVariant to use it:

Get user into the docker group:

Reboot afterwards.

Install conda environments using the corresponding yml-files in the root directory of this repository:

  • ppp_genebe
  • ppp_oc
  • ppp_proc
  • ppp_spliceai

For example,

GeneBe

Create the .netrc file to use genebe, then change permissions:

OpenCRAVAT

Start the

ppp_oc
environment, then run the following commands:

Parquet output

To output parquet files along with the excel-outputs you will need to install pyarrow, which might conflict being installed via conda in the ppp_spliceai environment.

However, if the pyarrow package is installed via pip in the ppp_spliceai environment, it might work. Version 21.0.0 works all right.

You also will need to create a bash variable doParquetFlag='y'.

Pipeline example

Pipeline example is in the main_example.sh file.

Usage: