بدون توضیح

Thomas b793b4a698 N50 and size for BAMStats 10 ماه پیش
jq_filters b793b4a698 N50 and size for BAMStats 10 ماه پیش
src b793b4a698 N50 and size for BAMStats 10 ماه پیش
.gitignore 494efb6235 run 1 سال پیش
Cargo.lock 605ee768fb json somatic stats 10 ماه پیش
Cargo.toml 605ee768fb json somatic stats 10 ماه پیش
README.md b793b4a698 N50 and size for BAMStats 10 ماه پیش

README.md

Install

Dependencies

For building required HTSlib:

sudo apt install cmake libclang-dev

Usage

Use jq for selecting variants

  • Somatic Variants of chrM (25)

    zcat /data/longreads_basic_pipe/*/diag/somatic_variants.json.gz | \
    jq -L ./jq_filters -C 'include "jq_variants"; [.data[] | select(contig("chrM") and n_in_constit <= 1) | format]'
    

Using jq and find to look for chrM norm coverage

find /data/longreads_basic_pipe/ -name "*_diag_hs1_info.json" -type f -exec sh -c 'basename $(dirname $(dirname "{}")) | tr -d "\n"' \; -printf "\t" -exec jq -L ./jq_filters -r 'include "jq_bam"; contig_coverage("chrM")' {} \;

Using jq and find VEP consequences (cf https://www.ensembl.org/info/genome/variation/prediction/predicted_data.html)

zcat /data/longreads_basic_pipe/ADJAGBA/diag/somatic_variants.json.gz | jq -L ./jq_filters -C 'include "jq_variants"; consequence("SynonymousVariant")' | bat

Using jq and find to count VEP consequences

find /data/longreads_basic_pipe/ -name "somatic_variants.json.gz" -type f -exec sh -c 'dirname=$(basename $(dirname $(dirname "$1"))); count=$(zcat "$1" | jq -L ./jq_filters -r '\''include "jq_variants"; count_consequence("SynonymousVariant") | [.true_count, .total_count, .proportion] | @tsv'\''); echo "${dirname}\t${count}"' sh {} \;

Find recurrence by VEP consequence

find /data/longreads_basic_pipe/ -name "somatic_variants.json.gz" -type f -exec sh -c 'dirname=$(basename $(dirname $(dirname "$1"))); count=$(zcat "$1" | jq -L ./jq_filters -r '\''include "jq_variants"; consequence("StopGained") | .[] | select(.has_consequence == true) | [.chr, .position, .ref, .alt] | @tsv'\''); echo "${count}"' sh {} \; | sort -k1,1V -k2,2n | uniq -c |  awk '$1 > 1 {print $2"\t"$3"\t"$4"\t"$5"\t"$1}'

Reading log files

zcat /data/longreads_basic_pipe/ID/log/deepsomatic/deepvariant_e7ed1.log.gz | jq -r '.log'