No description

Thomas b793b4a698 N50 and size for BAMStats 10 months ago
jq_filters b793b4a698 N50 and size for BAMStats 10 months ago
src b793b4a698 N50 and size for BAMStats 10 months ago
.gitignore 494efb6235 run 1 year ago
Cargo.lock 605ee768fb json somatic stats 10 months ago
Cargo.toml 605ee768fb json somatic stats 10 months ago
README.md b793b4a698 N50 and size for BAMStats 10 months ago

README.md

Install

Dependencies

Building HTSlib requires the following packages:

sudo apt install cmake libclang-dev

Usage

Use jq for selecting variants

  • Somatic Variants of chrM (25)

    zcat /data/longreads_basic_pipe/*/diag/somatic_variants.json.gz | \
    jq -L ./jq_filters -C 'include "jq_variants"; [.data[] | select(contig("chrM") and n_in_constit <= 1) | format]'
    

Using jq and find to look for chrM norm coverage

find /data/longreads_basic_pipe/ -name "*_diag_hs1_info.json" -type f -exec sh -c 'printf "%s\t" "$(basename "$(dirname "$(dirname "$1")")")"' sh {} \; -exec jq -L ./jq_filters -r 'include "jq_bam"; contig_coverage("chrM")' {} \;
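
The label printed before each coverage value is derived from the file path: two `dirname` calls strip the file name and its enclosing directory, and `basename` keeps the last remaining component. A minimal sketch of that step, assuming a layout like `<root>/<sample>/diag/<file>`; the helper name `sample_name` and the example path are hypothetical:

```sh
#!/bin/sh
# Hypothetical helper: print the name of the directory two levels above a file.
# Quoting every substitution keeps paths with spaces intact.
sample_name() {
    basename "$(dirname "$(dirname "$1")")"
}

# Example path, made up for illustration.
sample_name "/data/longreads_basic_pipe/SAMPLE_A/diag/SAMPLE_A_diag_hs1_info.json"
# prints: SAMPLE_A
```

Passing the path as a positional argument (`"$1"`) rather than embedding `{}` inside the `sh -c` string avoids surprises when a path contains quotes or spaces.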

Using jq to select variants by VEP consequence (cf. https://www.ensembl.org/info/genome/variation/prediction/predicted_data.html)

zcat /data/longreads_basic_pipe/ADJAGBA/diag/somatic_variants.json.gz | jq -L ./jq_filters -C 'include "jq_variants"; consequence("SynonymousVariant")' | bat

Using jq and find to count VEP consequences

find /data/longreads_basic_pipe/ -name "somatic_variants.json.gz" -type f -exec sh -c 'sample=$(basename "$(dirname "$(dirname "$1")")"); count=$(zcat "$1" | jq -L ./jq_filters -r '\''include "jq_variants"; count_consequence("SynonymousVariant") | [.true_count, .total_count, .proportion] | @tsv'\''); printf "%s\t%s\n" "$sample" "$count"' sh {} \;
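
The per-sample rows can be pooled into a single cohort-wide proportion. A minimal sketch, assuming the output is tab-separated with four columns (sample, true_count, total_count, proportion); the function name `pool_counts` and the sample values are hypothetical:

```sh
#!/bin/sh
# Hypothetical helper: sum true_count (column 2) and total_count (column 3)
# over all rows on stdin, then print: pooled_true, pooled_total, pooled_proportion.
pool_counts() {
    awk -F'\t' '
        { t += $2; n += $3 }
        END { if (n > 0) printf "%d\t%d\t%.4f\n", t, n, t / n }
    '
}

# Made-up rows standing in for the output of the find command.
printf 'S1\t10\t200\t0.05\nS2\t30\t200\t0.15\n' | pool_counts
# prints: 40	400	0.1000
```

Pooling the raw counts, rather than averaging the per-sample proportions, weights each sample by its number of variants.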

Find recurrence by VEP consequence

find /data/longreads_basic_pipe/ -name "somatic_variants.json.gz" -type f -exec sh -c 'zcat "$1" | jq -L ./jq_filters -r '\''include "jq_variants"; consequence("StopGained") | .[] | select(.has_consequence == true) | [.chr, .position, .ref, .alt] | @tsv'\''' sh {} \; | sort -k1,1V -k2,2n | uniq -c | awk '$1 > 1 {print $2"\t"$3"\t"$4"\t"$5"\t"$1}'
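
The recurrence step is the tail of that pipeline: `sort` brings identical variant rows together, `uniq -c` prefixes each distinct row with its count, and `awk` keeps rows seen more than once and moves the count to the last column. A minimal sketch on made-up rows, assuming tab-separated input of chr, position, ref, alt with no spaces inside fields; the function name `recurrent_variants` is hypothetical and `sort -V` assumes GNU coreutils:

```sh
#!/bin/sh
# Hypothetical helper: read "chr<TAB>pos<TAB>ref<TAB>alt" rows on stdin and
# print the variants that occur more than once, with the count appended.
recurrent_variants() {
    sort -k1,1V -k2,2n |
        uniq -c |
        awk '$1 > 1 {print $2"\t"$3"\t"$4"\t"$5"\t"$1}'
}

# Made-up rows: chr1:100 A>T appears in two samples, the others once.
printf 'chr1\t100\tA\tT\nchr2\t50\tG\tC\nchr1\t100\tA\tT\nchr1\t100\tA\tG\n' |
    recurrent_variants
# prints: chr1	100	A	T	2
```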

Reading log files

zcat /data/longreads_basic_pipe/ID/log/deepsomatic/deepvariant_e7ed1.log.gz | jq -r '.log'