Thomas de0cf94401 display for Flowcells 10 months ago
jq_filters b793b4a698 N50 and size for BAMStats 10 months ago
src de0cf94401 display for Flowcells 10 months ago
.gitignore 494efb6235 run 1 year ago
:a e1e6476df8 IdsInput Pod5s 10 months ago
Cargo.lock b94e086725 machete 10 months ago
Cargo.toml b94e086725 machete 10 months ago
README.md b793b4a698 N50 and size for BAMStats 10 months ago

README.md

Install

Dependencies

To build the required HTSlib dependency:

sudo apt install cmake libclang-dev

Usage

Use jq for selecting variants

  • Somatic Variants of chrM (25)

    zcat /data/longreads_basic_pipe/*/diag/somatic_variants.json.gz | \
    jq -L ./jq_filters -C 'include "jq_variants"; [.data[] | select(contig("chrM") and n_in_constit <= 1) | format]'
    

Using jq and find to look for chrM norm coverage

find /data/longreads_basic_pipe/ -name "*_diag_hs1_info.json" -type f -exec sh -c 'basename "$(dirname "$(dirname "$1")")" | tr -d "\n"' sh {} \; -printf "\t" -exec jq -L ./jq_filters -r 'include "jq_bam"; contig_coverage("chrM")' {} \;
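The first column of that output is the sample ID, taken from the directory two levels above each matched file. A minimal sketch of that step on its own, assuming the layout `<root>/<ID>/diag/<file>` suggested by the paths in this README; `sample_id` is a hypothetical helper name, not part of the repository:

```shell
# Hypothetical helper: print the sample ID for a file stored at
# <root>/<ID>/<subdir>/<file> (layout assumed from the paths in this README).
sample_id() {
    basename "$(dirname "$(dirname "$1")")"
}

# Works on the path string alone; the file does not need to exist.
sample_id "/data/longreads_basic_pipe/ADJAGBA/diag/somatic_variants.json.gz"
```

Quoting each nested command substitution keeps the helper safe for paths that contain spaces.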

Using jq to find VEP consequences (cf. https://www.ensembl.org/info/genome/variation/prediction/predicted_data.html)

zcat /data/longreads_basic_pipe/ADJAGBA/diag/somatic_variants.json.gz | jq -L ./jq_filters -C 'include "jq_variants"; consequence("SynonymousVariant")' | bat

Using jq and find to count VEP consequences

find /data/longreads_basic_pipe/ -name "somatic_variants.json.gz" -type f -exec sh -c 'dirname=$(basename "$(dirname "$(dirname "$1")")"); count=$(zcat "$1" | jq -L ./jq_filters -r '\''include "jq_variants"; count_consequence("SynonymousVariant") | [.true_count, .total_count, .proportion] | @tsv'\''); printf "%s\t%s\n" "$dirname" "$count"' sh {} \;
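The command above prints one tab-separated row per sample: sample ID, true_count, total_count, proportion. If a pooled figure across samples is wanted, the rows can be summed with awk. This is only a sketch on made-up numbers, and it assumes that column order; the sample names and counts below are invented for illustration:

```shell
# Made-up per-sample rows: sample, true_count, total_count, proportion.
per_sample=$(printf '%s\t%s\t%s\t%s\n' \
    SAMPLE_A 5 100 0.05 \
    SAMPLE_B 15 100 0.15)

# Sum the counts over all samples and recompute the pooled proportion
# (the per-sample proportion column is ignored on purpose).
pooled=$(printf '%s\n' "$per_sample" |
    awk -F '\t' '{t += $2; n += $3} END {if (n > 0) printf "%d\t%d\t%.4f\n", t, n, t / n}')

printf '%s\n' "$pooled"
```

Recomputing from the summed counts weights each sample by its number of variants, which averaging the per-sample proportions would not do.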

Find recurrence by VEP consequence

find /data/longreads_basic_pipe/ -name "somatic_variants.json.gz" -type f -exec sh -c 'zcat "$1" | jq -L ./jq_filters -r '\''include "jq_variants"; consequence("StopGained") | .[] | select(.has_consequence == true) | [.chr, .position, .ref, .alt] | @tsv'\''' sh {} \; | sort -k1,1V -k2,2n | uniq -c | awk '$1 > 1 {print $2"\t"$3"\t"$4"\t"$5"\t"$1}'
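The recurrence count relies on the tail of that pipeline: identical (chr, position, ref, alt) rows from different samples are sorted next to each other, counted by `uniq -c`, and kept only when seen more than once. A small sketch of that tail on made-up rows, assuming GNU `sort` for the `-V` version ordering:

```shell
# Made-up rows (chr, position, ref, alt), as if collected from several samples.
rows=$(printf '%s\t%s\t%s\t%s\n' \
    chr2 200 C T \
    chr1 100 A G \
    chr1 100 A G \
    chr10 50 G A \
    chr2 200 C T \
    chr1 100 A G)

# Sort so identical rows are adjacent, count them, keep variants seen more
# than once, and move the count to the last column.
recurrent=$(printf '%s\n' "$rows" |
    sort -k1,1V -k2,2n |
    uniq -c |
    awk '$1 > 1 {print $2"\t"$3"\t"$4"\t"$5"\t"$1}')

printf '%s\n' "$recurrent"
```

Here chr1:100 A>G is reported with a count of 3 and chr2:200 C>T with a count of 2, while the chr10 variant is dropped because it occurs only once.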

Reading log files

zcat /data/longreads_basic_pipe/ID/log/deepsomatic/deepvariant_e7ed1.log.gz | jq -r '.log'