Thomas 2aefa191ed up 10 months ago
jq_filters b793b4a698 N50 and size for BAMStats 10 months ago
src 2aefa191ed up 10 months ago
.gitignore 494efb6235 run 1 year ago
:a e1e6476df8 IdsInput Pod5s 10 months ago
Cargo.lock b94e086725 machete 10 months ago
Cargo.toml b94e086725 machete 10 months ago
README.md b793b4a698 N50 and size for BAMStats 10 months ago

README.md

Install

Dependencies

Required for building HTSlib:

sudo apt install cmake libclang-dev

Usage

Use jq for selecting variants

  • Somatic Variants of chrM (25)

    zcat /data/longreads_basic_pipe/*/diag/somatic_variants.json.gz | \
    jq -L ./jq_filters -C 'include "jq_variants"; [.data[] | select(contig("chrM") and n_in_constit <= 1) | format]'
    

Using jq and find to look for chrM norm coverage

find /data/longreads_basic_pipe/ -name "*_diag_hs1_info.json" -type f -exec sh -c 'basename "$(dirname "$(dirname "$1")")" | tr -d "\n"' sh {} \; -printf "\t" -exec jq -L ./jq_filters -r 'include "jq_bam"; contig_coverage("chrM")' {} \;
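
The find commands in this README take the sample name from the directory two levels above the matched file. A minimal sketch of that step in isolation, assuming the `<sample>/diag/<file>` layout seen in the paths above; `sample_name` and the sample ID `SAMPLE_A` are hypothetical names used only for illustration:

```shell
#!/bin/sh
# Hypothetical helper: print the name of the directory two levels above a file.
# Every expansion is quoted so paths containing spaces are handled correctly.
sample_name() {
    basename "$(dirname "$(dirname "$1")")"
}

# SAMPLE_A is a made-up sample ID, not a real directory.
sample_name "/data/longreads_basic_pipe/SAMPLE_A/diag/SAMPLE_A_diag_hs1_info.json"
```

Passing the path as a positional argument (`"$1"`) rather than substituting `{}` into the script text keeps unusual file names from being interpreted by the shell.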

Using jq to list VEP consequences (cf. https://www.ensembl.org/info/genome/variation/prediction/predicted_data.html)

zcat /data/longreads_basic_pipe/ADJAGBA/diag/somatic_variants.json.gz | jq -L ./jq_filters -C 'include "jq_variants"; consequence("SynonymousVariant")' | bat

Using jq and find to count VEP consequences

find /data/longreads_basic_pipe/ -name "somatic_variants.json.gz" -type f -exec sh -c 'dirname=$(basename "$(dirname "$(dirname "$1")")"); count=$(zcat "$1" | jq -L ./jq_filters -r '\''include "jq_variants"; count_consequence("SynonymousVariant") | [.true_count, .total_count, .proportion] | @tsv'\''); printf "%s\t%s\n" "$dirname" "$count"' sh {} \;
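
The per-sample rows can be pooled into one cohort-wide figure with awk. This is only a sketch: it assumes the command above emits tab-separated rows of sample name, `true_count`, `total_count`, and `proportion`, in that order. `summarise_counts` is a hypothetical helper and the two input rows are made-up values.

```shell
#!/bin/sh
# Hypothetical helper: sum true_count (column 2) and total_count (column 3)
# over all rows on stdin, then print the sums and the pooled proportion.
summarise_counts() {
    awk -F '\t' '
        { hits += $2; total += $3 }
        END { if (total > 0) printf "%d\t%d\t%.4f\n", hits, total, hits / total }
    '
}

# Made-up rows standing in for the output of two samples.
printf 'SAMPLE_A\t12\t300\t0.04\nSAMPLE_B\t8\t100\t0.08\n' | summarise_counts
```

The pooled proportion is recomputed from the summed counts rather than averaged from the per-sample proportions, so samples with more variants weigh more.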

Find recurrence by VEP consequence

find /data/longreads_basic_pipe/ -name "somatic_variants.json.gz" -type f -exec sh -c 'zcat "$1" | jq -L ./jq_filters -r '\''include "jq_variants"; consequence("StopGained") | .[] | select(.has_consequence == true) | [.chr, .position, .ref, .alt] | @tsv'\''' sh {} \; | sort -k1,1V -k2,2n | uniq -c | awk '$1 > 1 {print $2"\t"$3"\t"$4"\t"$5"\t"$1}'
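
The recurrence step is the `sort | uniq -c | awk` tail of the command above: identical chr, position, ref, alt rows from different samples are grouped, counted, and kept only if seen more than once. A small sketch on made-up rows, assuming GNU `sort` (for `-V`); `recurrent` is a hypothetical wrapper name:

```shell
#!/bin/sh
# Hypothetical wrapper: read tab-separated chr, position, ref, alt rows on stdin
# and print the rows seen more than once, with the count as the last column.
recurrent() {
    sort -k1,1V -k2,2n | uniq -c | awk '$1 > 1 {print $2"\t"$3"\t"$4"\t"$5"\t"$1}'
}

# Made-up variants standing in for the pooled output of several samples.
printf 'chr2\t200\tC\tT\nchr1\t100\tG\tA\nchr2\t200\tC\tT\nchr1\t100\tG\tA\nchr2\t200\tC\tT\nchr10\t5\tA\tG\n' | recurrent
```

`uniq -c` only merges adjacent lines, so the sort must come first; the version sort on column 1 orders chr2 before chr10.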

Reading log files

zcat /data/longreads_basic_pipe/ID/log/deepsomatic/deepvariant_e7ed1.log.gz | jq -r '.log'