WARNING: While bind mounting '/mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18:/mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18': destination is already in the mount point list [COMMAND] /opt/bin/run_clairs --tumor_bam_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/CHAHA_diag_hs1.bam --normal_bam_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/norm/CHAHA_norm_hs1.bam --ref_fn /home/t_steimle/ref/hs1/chm13v2.0.fa --threads 40 --platform ont_r10_dorado_sup_5khz_ssrs --output_dir /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18 --region chr19:1-19687871 --sample_name CHAHA_diag --include_all_ctgs --print_germline_calls --enable_indel_calling --enable_clair3_germline_output --use_longphase_for_intermediate_haplotagging True [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/split_beds [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/candidates [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/pileup_tensor_can [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/fa_tensor_can [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/tmp_vcf_output [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/clair3_log [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/phased_output [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/vcf [INFO] --include_all_ctgs enabled [INFO] Call variants in contigs: chr19 [INFO] Number of chunks for each contig: 13 [INFO] CALLER VERSION: 0.4.4 [INFO] NORMAL BAM FILE PATH: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/norm/CHAHA_norm_hs1.bam [INFO] TUMOR BAM FILE PATH: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/CHAHA_diag_hs1.bam [INFO] REFERENCE FILE PATH: /home/t_steimle/ref/hs1/chm13v2.0.fa [INFO] PLATFORM: ont_r10_dorado_sup_5khz_ssrs [INFO] THREADS: 40 [INFO] OUTPUT FOLDER: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18 [INFO] OUTPUT VCF PATH: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/output.vcf.gz [INFO] PILEUP MODEL PATH: /opt/conda/envs/clairs/bin/clairs_models/ont_r10_dorado_sup_5khz_ssrs/pileup.pkl [INFO] FULL-ALIGNMENT MODEL PATH: /opt/conda/envs/clairs/bin/clairs_models/ont_r10_dorado_sup_5khz_ssrs/full_alignment.pkl [INFO] BED FILE PATH: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/region.bed [INFO] GENOTYPING MODE VCF FILE PATH: None [INFO] HYBRID MODE VCF FILE PATH: None [INFO] REGION FOR CALLING: chr19:1-19687871 [INFO] CONTIGS FOR CALLING: None [INFO] CONDA BINARY PREFIX: /opt/conda/envs/clairs [INFO] SAMTOOLS BINARY PATH: samtools [INFO] PYTHON BINARY PATH: python3 [INFO] PYPY BINARY PATH: pypy3 [INFO] PARALLEL BINARY PATH: parallel [INFO] LONGPHASE BINARY PATH: /opt/conda/envs/clairs/bin/longphase [INFO] CHUNK SIZE: 5000000 [INFO] SNV MINIMUM AF: 0.05 [INFO] SNV MINIMUM QUAL: 8 [INFO] INDEL MINIMUM AF: 0.1 [INFO] INDEL PILEUP MODEL PATH: /opt/conda/envs/clairs/bin/clairs_models/ont_r10_dorado_sup_5khz_ssrs/indel/pileup.pkl [INFO] INDEL FULL-ALIGNMENT MODEL PATH: /opt/conda/envs/clairs/bin/clairs_models/ont_r10_dorado_sup_5khz_ssrs/indel/full_alignment.pkl [INFO] INDEL MINIMUM QUAL: 8 [INFO] NORMAL VCF FILE PATH: None [INFO] DISABLE PHASING: False [INFO] ENABLE DRY RUN: False [INFO] ENABLE INDEL CALLING: True [INFO] ENABLE PRINTING REFERENCE CALLS: False [INFO] ENABLE PRINTING GERMLINE CALLS: True [INFO] ENABLE INCLUDING ALL CTGS FOR CALLING: True [INFO] ENABLE REMOVING INTERMEDIATE FILES: False [INFO] Call Germline Variants in Normal BAM using Clair3 [INFO] RUN THE FOLLOWING COMMAND: ( /opt/conda/envs/clairs/bin/run_clair3.sh --bam_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/norm/CHAHA_norm_hs1.bam --ref_fn /home/t_steimle/ref/hs1/chm13v2.0.fa --model_path /opt/conda/envs/clairs/bin/clairs_models/clair3_models/r1041_e82_400bps_sup_v420 --platform ont --threads 40 --output /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_normal_output --ctg_name=chr19 --samtools=samtools --pypy=pypy3 --python=python3 --min_coverage=4 --snp_min_af=0.08 --indel_min_af=0.15 --longphase_for_phasing ) 2>&1 | tee /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/clair3_log/1_CLAIR3_NORMAL.log [INFO] CLAIR3 VERSION: v1.0.8 [INFO] BAM FILE PATH: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/norm/CHAHA_norm_hs1.bam [INFO] REFERENCE FILE PATH: /home/t_steimle/ref/hs1/chm13v2.0.fa [INFO] MODEL PATH: /opt/conda/envs/clairs/bin/clairs_models/clair3_models/r1041_e82_400bps_sup_v420 [INFO] OUTPUT FOLDER: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_normal_output [INFO] PLATFORM: ont [INFO] THREADS: 40 [INFO] BED FILE PATH: EMPTY [INFO] VCF FILE PATH: EMPTY [INFO] CONTIGS: chr19 [INFO] CONDA PREFIX: [INFO] SAMTOOLS PATH: samtools [INFO] PYTHON PATH: python3 [INFO] PYPY PATH: pypy3 [INFO] PARALLEL PATH: parallel [INFO] WHATSHAP PATH: whatshap [INFO] LONGPHASE PATH: /opt/conda/envs/clairs/bin/longphase [INFO] CHUNK SIZE: 5000000 [INFO] FULL ALIGN PROPORTION: 0.7 [INFO] FULL ALIGN REFERENCE PROPORTION: 0.1 [INFO] PHASING PROPORTION: 0.7 [INFO] MINIMUM MQ: 5 [INFO] MINIMUM COVERAGE: 4 [INFO] SNP AF THRESHOLD: 0.08 [INFO] INDEL AF THRESHOLD: 0.15 [INFO] BASE ERROR IN GVCF: 0.001 [INFO] GQ BIN SIZE IN GVCF: 5 [INFO] ENABLE FILEUP ONLY CALLING: False [INFO] ENABLE FAST MODE CALLING: False [INFO] ENABLE CALLING SNP CANDIDATES ONLY: False [INFO] ENABLE PRINTING REFERENCE CALLS: False [INFO] ENABLE OUTPUT GVCF: False [INFO] ENABLE HAPLOID PRECISE MODE: False [INFO] ENABLE HAPLOID SENSITIVE MODE: False [INFO] ENABLE INCLUDE ALL CTGS CALLING: False [INFO] ENABLE NO PHASING FOR FULL ALIGNMENT: False [INFO] ENABLE REMOVING INTERMEDIATE FILES: False [INFO] ENABLE LONGPHASE FOR INTERMEDIATE VCF PHASING: True [INFO] ENABLE PHASING FINAL VCF OUTPUT USING WHATSHAP: False [INFO] ENABLE PHASING FINAL VCF OUTPUT USING LONGPHASE: False [INFO] ENABLE HAPLOTAGGING FINAL BAM: False [INFO] ENABLE LONG INDEL CALLING: False [INFO] ENABLE C_IMPLEMENT: True + /opt/conda/envs/clairs/bin/scripts/clair3_c_impl.sh --bam_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/norm/CHAHA_norm_hs1.bam --ref_fn /home/t_steimle/ref/hs1/chm13v2.0.fa --threads 40 --model_path /opt/conda/envs/clairs/bin/clairs_models/clair3_models/r1041_e82_400bps_sup_v420 --platform ont --output /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_normal_output --bed_fn=EMPTY --vcf_fn=EMPTY --ctg_name=chr19 --sample_name=SAMPLE --chunk_num=0 --chunk_size=5000000 --samtools=samtools --python=python3 --pypy=pypy3 --parallel=parallel --whatshap=whatshap --qual=2 --var_pct_full=0.7 --ref_pct_full=0.1 --var_pct_phasing=0.7 --snp_min_af=0.08 --indel_min_af=0.15 --min_mq=5 --min_coverage=4 --min_contig_size=0 --pileup_only=False --gvcf=False --base_err=0.001 --gq_bin_size=5 --fast_mode=False --call_snp_only=False --print_ref_calls=False --haploid_precise=False --haploid_sensitive=False --include_all_ctgs=False --no_phasing_for_fa=False --pileup_model_prefix=pileup --fa_model_prefix=full_alignment --remove_intermediate_dir=False --enable_phasing=False --enable_long_indel=False --keep_iupac_bases=False --use_gpu=False --longphase_for_phasing=True --longphase=/opt/conda/envs/clairs/bin/longphase --use_whatshap_for_intermediate_phasing=True --use_longphase_for_intermediate_phasing=True --use_whatshap_for_final_output_phasing=False --use_longphase_for_final_output_phasing=False --use_whatshap_for_final_output_haplotagging=False [INFO] Check environment variables [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_normal_output/log [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_normal_output/tmp/pileup_output [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_normal_output/tmp/merge_output [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_normal_output/tmp/phase_output [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_normal_output/tmp/gvcf_tmp_output [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_normal_output/tmp/full_alignment_output [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_normal_output/tmp/phase_output/phase_vcf [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_normal_output/tmp/phase_output/phase_bam [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_normal_output/tmp/full_alignment_output/candidate_bed [INFO] Call variant in contigs: chr19 [INFO] Chunk number for each contig: 13 [INFO] 1/7 Call variants using pileup model Calling variants ... Total processed positions in chr19 (chunk 6/13) : 37439 Total time elapsed: 89.45 s Calling variants ... Total processed positions in chr19 (chunk 5/13) : 71150 Total time elapsed: 142.17 s Calling variants ... Total processed positions in chr19 (chunk 3/13) : 77903 Total time elapsed: 157.90 s Calling variants ... Total processed positions in chr19 (chunk 10/13) : 82264 Total time elapsed: 160.79 s Calling variants ... Total processed positions in chr19 (chunk 7/13) : 77159 Total time elapsed: 161.93 s Calling variants ... Total processed positions in chr19 (chunk 11/13) : 80011 Total time elapsed: 163.09 s Calling variants ... Total processed positions in chr19 (chunk 8/13) : 81026 Total time elapsed: 165.24 s Calling variants ... Total processed positions in chr19 (chunk 9/13) : 76878 Total time elapsed: 165.94 s Calling variants ... Total processed positions in chr19 (chunk 4/13) : 82363 Total time elapsed: 166.31 s Calling variants ... Total processed positions in chr19 (chunk 12/13) : 83832 Total time elapsed: 168.75 s Calling variants ... Total processed positions in chr19 (chunk 1/13) : 86660 Total time elapsed: 171.00 s Calling variants ... Total processed positions in chr19 (chunk 2/13) : 80468 Total time elapsed: 175.28 s Calling variants ... Total processed positions in chr19 (chunk 13/13) : 90012 Total time elapsed: 179.27 s real 3m7.948s user 34m3.448s sys 1m12.228s [INFO] 2/7 Select heterozygous SNP variants for Whatshap phasing and haplotagging [INFO] Select heterozygous pileup variants exceeding phasing quality cutoff 14 [INFO] Total heterozygous SNP positions selected: chr19: 52254 real 0m2.376s user 0m2.156s sys 0m0.181s [INFO] 3/7 Phase VCF file using LongPhase LongPhase Ver 1.7 --- File Parameter --- SNP File : /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_normal_output/tmp/phase_output/phase_vcf/chr19.vcf SV File : MOD File : REF File : /home/t_steimle/ref/hs1/chm13v2.0.fa Output Prefix : /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_normal_output/tmp/phase_output/phase_vcf/phased_chr19 Generate Dot : False BAM File : /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/norm/CHAHA_norm_hs1.bam --- Phasing Parameter --- Seq Platform : ONT Phase Indel : False Distance Threshold : 300000 Connect Adjacent : 20 Edge Threshold : 0.7 Mapping Quality : 1 Variant Confidence : 0.75 ReadTag Confidence : 0.65 parsing VCF ... 0s parsing SV VCF ... 0s parsing Meth VCF ... 0s reading reference ... 1s (chr19,5s) parsing total: 5s merge results ... 0s writeResult SNP ... 1s total process: 7s real 0m6.950s user 0m20.590s sys 0m1.374s [INFO] 5/7 Select candidates for full-alignment calling [INFO] Set variants quality cutoff 19.0 [INFO] Set reference calls quality cutoff 12.0 [INFO] Low quality reference calls to be processed in chr19: 86848 [INFO] Low quality variants to be processed in chr19: 96058 real 0m2.598s user 0m2.315s sys 0m0.237s [INFO] 6/7 Call low-quality variants using full-alignment model Calling variants ... Total processed positions in chr19 (chunk 19/19) : 2906 Total time elapsed: 45.73 s Calling variants ... Total processed positions in chr19 (chunk 2/19) : 10000 Total time elapsed: 121.54 s Calling variants ... Total processed positions in chr19 (chunk 14/19) : 10000 Total time elapsed: 122.15 s Calling variants ... Total processed positions in chr19 (chunk 18/19) : 10000 Total time elapsed: 122.53 s Calling variants ... Total processed positions in chr19 (chunk 3/19) : 10000 Total time elapsed: 122.68 s Calling variants ... Total processed positions in chr19 (chunk 9/19) : 10000 Total time elapsed: 122.95 s Calling variants ... Total processed positions in chr19 (chunk 5/19) : 10000 Total time elapsed: 124.12 s Calling variants ... Total processed positions in chr19 (chunk 6/19) : 10000 Total time elapsed: 124.04 s Calling variants ... Total processed positions in chr19 (chunk 4/19) : 10000 Total time elapsed: 125.87 s Calling variants ... Total processed positions in chr19 (chunk 7/19) : 10000 Total time elapsed: 125.85 s Calling variants ... Total processed positions in chr19 (chunk 11/19) : 10000 Total time elapsed: 126.55 s Calling variants ... Total processed positions in chr19 (chunk 12/19) : 10000 Total time elapsed: 127.93 s Calling variants ... Total processed positions in chr19 (chunk 15/19) : 10000 Total time elapsed: 127.21 s Calling variants ... Total processed positions in chr19 (chunk 13/19) : 10000 Total time elapsed: 129.72 s Calling variants ... Total processed positions in chr19 (chunk 10/19) : 10000 Total time elapsed: 129.50 s Calling variants ... Total processed positions in chr19 (chunk 16/19) : 10000 Total time elapsed: 130.44 s Calling variants ... Total processed positions in chr19 (chunk 17/19) : 10000 Total time elapsed: 130.86 s Calling variants ... Total processed positions in chr19 (chunk 1/19) : 10000 Total time elapsed: 134.52 s Calling variants ... Total processed positions in chr19 (chunk 8/19) : 10000 Total time elapsed: 134.39 s real 2m18.633s user 37m45.730s sys 1m49.859s [INFO] 7/7 Merge pileup VCF and full-alignment VCF [INFO] Pileup variants processed in chr19: 41215 [INFO] Full-alignment variants processed in chr19: 104681 real 0m3.039s user 0m3.729s sys 0m0.287s [INFO] Finish calling, output file: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_normal_output/merge_output.vcf.gz real 5m57.040s user 72m32.427s sys 3m6.124s [INFO] Call Germline Variant in Tumor BAM using Clair3 [INFO] RUN THE FOLLOWING COMMAND: ( /opt/conda/envs/clairs/bin/run_clair3.sh --bam_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/CHAHA_diag_hs1.bam --ref_fn /home/t_steimle/ref/hs1/chm13v2.0.fa --model_path /opt/conda/envs/clairs/bin/clairs_models/clair3_models/r1041_e82_400bps_sup_v420 --platform ont --threads 40 --output /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output --ctg_name=chr19 --samtools=samtools --pypy=pypy3 --python=python3 --min_coverage=4 --snp_min_af=0.08 --indel_min_af=0.15 --longphase_for_phasing ) 2>&1 | tee /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/clair3_log/2_CLAIR3_TUMOR.log [INFO] CLAIR3 VERSION: v1.0.8 [INFO] BAM FILE PATH: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/CHAHA_diag_hs1.bam [INFO] REFERENCE FILE PATH: /home/t_steimle/ref/hs1/chm13v2.0.fa [INFO] MODEL PATH: /opt/conda/envs/clairs/bin/clairs_models/clair3_models/r1041_e82_400bps_sup_v420 [INFO] OUTPUT FOLDER: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output [INFO] PLATFORM: ont [INFO] THREADS: 40 [INFO] BED FILE PATH: EMPTY [INFO] VCF FILE PATH: EMPTY [INFO] CONTIGS: chr19 [INFO] CONDA PREFIX: [INFO] SAMTOOLS PATH: samtools [INFO] PYTHON PATH: python3 [INFO] PYPY PATH: pypy3 [INFO] PARALLEL PATH: parallel [INFO] WHATSHAP PATH: whatshap [INFO] LONGPHASE PATH: /opt/conda/envs/clairs/bin/longphase [INFO] CHUNK SIZE: 5000000 [INFO] FULL ALIGN PROPORTION: 0.7 [INFO] FULL ALIGN REFERENCE PROPORTION: 0.1 [INFO] PHASING PROPORTION: 0.7 [INFO] MINIMUM MQ: 5 [INFO] MINIMUM COVERAGE: 4 [INFO] SNP AF THRESHOLD: 0.08 [INFO] INDEL AF THRESHOLD: 0.15 [INFO] BASE ERROR IN GVCF: 0.001 [INFO] GQ BIN SIZE IN GVCF: 5 [INFO] ENABLE FILEUP ONLY CALLING: False [INFO] ENABLE FAST MODE CALLING: False [INFO] ENABLE CALLING SNP CANDIDATES ONLY: False [INFO] ENABLE PRINTING REFERENCE CALLS: False [INFO] ENABLE OUTPUT GVCF: False [INFO] ENABLE HAPLOID PRECISE MODE: False [INFO] ENABLE HAPLOID SENSITIVE MODE: False [INFO] ENABLE INCLUDE ALL CTGS CALLING: False [INFO] ENABLE NO PHASING FOR FULL ALIGNMENT: False [INFO] ENABLE REMOVING INTERMEDIATE FILES: False [INFO] ENABLE LONGPHASE FOR INTERMEDIATE VCF PHASING: True [INFO] ENABLE PHASING FINAL VCF OUTPUT USING WHATSHAP: False [INFO] ENABLE PHASING FINAL VCF OUTPUT USING LONGPHASE: False [INFO] ENABLE HAPLOTAGGING FINAL BAM: False [INFO] ENABLE LONG INDEL CALLING: False [INFO] ENABLE C_IMPLEMENT: True + /opt/conda/envs/clairs/bin/scripts/clair3_c_impl.sh --bam_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/CHAHA_diag_hs1.bam --ref_fn /home/t_steimle/ref/hs1/chm13v2.0.fa --threads 40 --model_path /opt/conda/envs/clairs/bin/clairs_models/clair3_models/r1041_e82_400bps_sup_v420 --platform ont --output /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output --bed_fn=EMPTY --vcf_fn=EMPTY --ctg_name=chr19 --sample_name=SAMPLE --chunk_num=0 --chunk_size=5000000 --samtools=samtools --python=python3 --pypy=pypy3 --parallel=parallel --whatshap=whatshap --qual=2 --var_pct_full=0.7 --ref_pct_full=0.1 --var_pct_phasing=0.7 --snp_min_af=0.08 --indel_min_af=0.15 --min_mq=5 --min_coverage=4 --min_contig_size=0 --pileup_only=False --gvcf=False --base_err=0.001 --gq_bin_size=5 --fast_mode=False --call_snp_only=False --print_ref_calls=False --haploid_precise=False --haploid_sensitive=False --include_all_ctgs=False --no_phasing_for_fa=False --pileup_model_prefix=pileup --fa_model_prefix=full_alignment --remove_intermediate_dir=False --enable_phasing=False --enable_long_indel=False --keep_iupac_bases=False --use_gpu=False --longphase_for_phasing=True --longphase=/opt/conda/envs/clairs/bin/longphase --use_whatshap_for_intermediate_phasing=True --use_longphase_for_intermediate_phasing=True --use_whatshap_for_final_output_phasing=False --use_longphase_for_final_output_phasing=False --use_whatshap_for_final_output_haplotagging=False [INFO] Check environment variables [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output/log [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output/tmp/pileup_output [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output/tmp/merge_output [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output/tmp/phase_output [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output/tmp/gvcf_tmp_output [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output/tmp/full_alignment_output [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output/tmp/phase_output/phase_vcf [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output/tmp/phase_output/phase_bam [INFO] Create folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output/tmp/full_alignment_output/candidate_bed [INFO] Call variant in contigs: chr19 [INFO] Chunk number for each contig: 13 [INFO] 1/7 Call variants using pileup model Calling variants ... Total processed positions in chr19 (chunk 10/13) : 10530 Total time elapsed: 57.85 s Calling variants ... Total processed positions in chr19 (chunk 8/13) : 10757 Total time elapsed: 57.86 s Calling variants ... Total processed positions in chr19 (chunk 3/13) : 11930 Total time elapsed: 59.36 s Calling variants ... Total processed positions in chr19 (chunk 5/13) : 13827 Total time elapsed: 64.29 s Calling variants ... Total processed positions in chr19 (chunk 11/13) : 13999 Total time elapsed: 64.41 s Calling variants ... Total processed positions in chr19 (chunk 12/13) : 15431 Total time elapsed: 66.90 s Calling variants ... Total processed positions in chr19 (chunk 4/13) : 15863 Total time elapsed: 67.31 s Calling variants ... Total processed positions in chr19 (chunk 9/13) : 15113 Total time elapsed: 70.26 s Calling variants ... Total processed positions in chr19 (chunk 2/13) : 18004 Total time elapsed: 72.89 s Calling variants ... Total processed positions in chr19 (chunk 1/13) : 18455 Total time elapsed: 75.61 s Calling variants ... Total processed positions in chr19 (chunk 13/13) : 19185 Total time elapsed: 76.35 s Calling variants ... Total processed positions in chr19 (chunk 7/13) : 43919 Total time elapsed: 114.61 s Calling variants ... Total processed positions in chr19 (chunk 6/13) : 53083 Total time elapsed: 127.56 s real 2m11.327s user 13m29.567s sys 1m9.261s [INFO] 2/7 Select heterozygous SNP variants for Whatshap phasing and haplotagging [INFO] Select heterozygous pileup variants exceeding phasing quality cutoff 18 [INFO] Total heterozygous SNP positions selected: chr19: 62515 real 0m1.270s user 0m1.072s sys 0m0.169s [INFO] 3/7 Phase VCF file using LongPhase LongPhase Ver 1.7 --- File Parameter --- SNP File : /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output/tmp/phase_output/phase_vcf/chr19.vcf SV File : MOD File : REF File : /home/t_steimle/ref/hs1/chm13v2.0.fa Output Prefix : /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output/tmp/phase_output/phase_vcf/phased_chr19 Generate Dot : False BAM File : /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/CHAHA_diag_hs1.bam --- Phasing Parameter --- Seq Platform : ONT Phase Indel : False Distance Threshold : 300000 Connect Adjacent : 20 Edge Threshold : 0.7 Mapping Quality : 1 Variant Confidence : 0.75 ReadTag Confidence : 0.65 parsing VCF ... 1s parsing SV VCF ... 0s parsing Meth VCF ... 0s reading reference ... 0s (chr19,23s) parsing total: 23s merge results ... 0s writeResult SNP ... 1s total process: 25s real 0m24.312s user 1m31.017s sys 0m5.226s [INFO] 5/7 Select candidates for full-alignment calling [INFO] Set variants quality cutoff 21.0 [INFO] Set reference calls quality cutoff 7.0 [INFO] Low quality reference calls to be processed in chr19: 10511 [INFO] Low quality variants to be processed in chr19: 108203 real 0m1.396s user 0m1.153s sys 0m0.203s [INFO] 6/7 Call low-quality variants using full-alignment model Calling variants ... Total processed positions in chr19 (chunk 12/12) : 8714 Total time elapsed: 118.40 s Calling variants ... Total processed positions in chr19 (chunk 5/12) : 10000 Total time elapsed: 120.16 s Calling variants ... Total processed positions in chr19 (chunk 7/12) : 10000 Total time elapsed: 120.89 s Calling variants ... Total processed positions in chr19 (chunk 6/12) : 10000 Total time elapsed: 122.03 s Calling variants ... Total processed positions in chr19 (chunk 8/12) : 10000 Total time elapsed: 126.72 s Calling variants ... Total processed positions in chr19 (chunk 4/12) : 10000 Total time elapsed: 131.41 s Calling variants ... Total processed positions in chr19 (chunk 11/12) : 10000 Total time elapsed: 138.95 s Calling variants ... Total processed positions in chr19 (chunk 2/12) : 10000 Total time elapsed: 141.30 s Calling variants ... Total processed positions in chr19 (chunk 3/12) : 10000 Total time elapsed: 144.17 s Calling variants ... Total processed positions in chr19 (chunk 10/12) : 10000 Total time elapsed: 143.84 s Calling variants ... Total processed positions in chr19 (chunk 1/12) : 10000 Total time elapsed: 145.49 s Calling variants ... Total processed positions in chr19 (chunk 9/12) : 10000 Total time elapsed: 154.51 s real 2m37.785s user 25m39.050s sys 1m13.121s [INFO] 7/7 Merge pileup VCF and full-alignment VCF [INFO] Pileup variants processed in chr19: 46569 [INFO] Full-alignment variants processed in chr19: 101152 real 0m2.046s user 0m2.166s sys 0m0.224s [INFO] Finish calling, output file: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output/merge_output.vcf.gz real 5m27.660s user 40m52.431s sys 2m29.321s [INFO] Select Heterozygous SNP for Phasing [INFO] RUN THE FOLLOWING COMMAND: ( parallel --joblog /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/clair3_log/parallel_1_select_hetero_snp_for_phasing.log -j 40 pypy3 /opt/bin/clairs.py select_hetero_snp_for_phasing --tumor_vcf_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output/merge_output.vcf.gz --normal_vcf_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_normal_output/merge_output.vcf.gz --output_folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/vcf --ctg_name {1} --use_heterozygous_snp_in_normal_sample_for_intermediate_phasing True :::: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/CONTIGS ) 2>&1 | tee /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/clair3_log/1_select_hetero_snp_for_phasing.log [INFO] Total HET SNP calls selected: chr19: 84452, not found:765, not match:81, low_qual_count:0. Total normal:76459 Total tumor:85298, pro: 0.9901 [INFO] Phase the Tumor BAM [INFO] RUN THE FOLLOWING COMMAND: ( parallel --joblog /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/clair3_log/parallel_4_phase_tumor.log -j 40 /opt/conda/envs/clairs/bin/longphase phase -s /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/vcf/{1}.vcf -b /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/CHAHA_diag_hs1.bam -r /home/t_steimle/ref/hs1/chm13v2.0.fa -t 40 -o /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/phased_output/tumor_phased_{1} --ont :::: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/CONTIGS && parallel -j 40 bgzip -f /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/phased_output/tumor_phased_{1}.vcf :::: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/CONTIGS ) 2>&1 | tee /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/clair3_log/4_phase_tumor.log && parallel -j 40 tabix -f -p vcf /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/phased_output/tumor_phased_{1}.vcf.gz :::: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/CONTIGS LongPhase Ver 1.7 --- File Parameter --- SNP File : /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/vcf/chr19.vcf SV File : MOD File : REF File : /home/t_steimle/ref/hs1/chm13v2.0.fa Output Prefix : /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/phased_output/tumor_phased_chr19 Generate Dot : False BAM File : /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/CHAHA_diag_hs1.bam --- Phasing Parameter --- Seq Platform : ONT Phase Indel : False Distance Threshold : 300000 Connect Adjacent : 20 Edge Threshold : 0.7 Mapping Quality : 1 Variant Confidence : 0.75 ReadTag Confidence : 0.65 parsing VCF ... 0s parsing SV VCF ... 0s parsing Meth VCF ... 0s reading reference ... 1s (chr19,41s) parsing total: 41s merge results ... 0s writeResult SNP ... 1s total process: 43s [INFO] Haplotag the Tumor BAM [INFO] RUN THE FOLLOWING COMMAND: ( parallel --joblog /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/clair3_log/parallel_5_haplotag_tumor.log -j 40 /opt/conda/envs/clairs/bin/longphase haplotag -o /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/phased_output/tumor_{1} --reference /home/t_steimle/ref/hs1/chm13v2.0.fa --region {1} -s /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/phased_output/tumor_phased_{1}.vcf.gz -b /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/CHAHA_diag_hs1.bam :::: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/CONTIGS ) 2>&1 | tee /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/clair3_log/5_tumor_haplotag.log && parallel -j 40 samtools index -@40 /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/phased_output/tumor_{1}.bam :::: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/CONTIGS phased SNP file: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/phased_output/tumor_phased_chr19.vcf.gz phased SV file: phased MOD file: input bam file: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/CHAHA_diag_hs1.bam input ref file: /home/t_steimle/ref/hs1/chm13v2.0.fa output bam file: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/phased_output/tumor_chr19.bam number of threads: 1 write log file: false log file: ------------------------------------------- tag region: chr19 filter mapping quality below: 1 percentage threshold: 0.6 tag supplementary: false ------------------------------------------- parsing SNP VCF ... 1s tag read start ... chr: chr19 ... 627s tag read 628s ------------------------------------------- total process time: 629s total alignment: 734582 total supplementary: 28447 total secondary: 0 total unmapped: 0 total tag alignment: 390188 total untagged: 344394 [INFO] STEP 1: Extract Variant Candidates from Tumor and Normal BAMs [INFO] RUN THE FOLLOWING COMMAND: ( parallel --joblog /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/parallel_1_extract_tumor_candidates.log -C " " -j 40 pypy3 /opt/bin/clairs.py extract_pair_candidates --tumor_bam_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/CHAHA_diag_hs1.bam --normal_bam_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/norm/CHAHA_norm_hs1.bam --ref_fn /home/t_steimle/ref/hs1/chm13v2.0.fa --samtools samtools --snv_min_af 0.05 --indel_min_af 0.1 --chunk_id {2} --chunk_num {3} --ctg_name {1} --platform ont --min_coverage 4 --bed_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/split_beds/{1} --candidates_folder /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/candidates --output_depth True --select_indel_candidates True --hybrid_mode_vcf_fn None --genotyping_mode_vcf_fn None :::: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/CHUNK_LIST ) 2>&1 | tee /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/1_EC.log && pypy3 /opt/bin/clairs.py concat_files --input_dir /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/candidates --input_prefix CANDIDATES_FILE_ --output_fn CANDIDATES_FILES [INFO] chr19 chunk 10/13: Total snv candidates found: 946, total indel candidates found: 795 [INFO] chr19 chunk 7/13: Total snv candidates found: 350, total indel candidates found: 815 [INFO] chr19 chunk 5/13: Total snv candidates found: 831, total indel candidates found: 847 [INFO] chr19 chunk 3/13: Total snv candidates found: 488, total indel candidates found: 773 [INFO] chr19 chunk 1/13: Total snv candidates found: 701, total indel candidates found: 1001 [INFO] chr19 chunk 9/13: Total snv candidates found: 542, total indel candidates found: 860 [INFO] chr19 chunk 12/13: Total snv candidates found: 570, total indel candidates found: 891 [INFO] chr19 chunk 2/13: Total snv candidates found: 674, total indel candidates found: 995 [INFO] chr19 chunk 0/13: Total snv candidates found: 1711, total indel candidates found: 1191 [INFO] chr19 chunk 8/13: Total snv candidates found: 253, total indel candidates found: 806 [INFO] chr19 chunk 11/13: Total snv candidates found: 437, total indel candidates found: 872 [INFO] chr19 chunk 6/13: Total snv candidates found: 349, total indel candidates found: 715 [INFO] chr19 chunk 4/13: Total snv candidates found: 1154, total indel candidates found: 883 [INFO] STEP 2: Pileup Model Calling [INFO] Create Paired Tensors [INFO] RUN THE FOLLOWING COMMAND: ( parallel --joblog /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/parallel_2-1_create_pair_tensor.log -j 40 pypy3 /opt/bin/clairs.py create_pair_tensor_pileup --normal_bam_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/norm/CHAHA_norm_hs1.bam --tumor_bam_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/CHAHA_diag_hs1.bam --ref_fn /home/t_steimle/ref/hs1/chm13v2.0.fa --ctg_name {1/.} --samtools samtools --candidates_bed_regions {1} --tensor_can_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/pileup_tensor_can/{1/} --platform ont :::: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/candidates/CANDIDATES_FILES ) 2>&1 | tee /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/2-1_CPT.log [INFO] chr19 chunk 1-1/8 Tensors generated: 250 [INFO] chr19 chunk 1-1/1 Tensors generated: 701 [INFO] chr19 chunk 1-1/7 Tensors generated: 350 [INFO] chr19 chunk 1-1/2 Tensors generated: 670 [INFO] chr19 chunk 1-1/6 Tensors generated: 349 [INFO] chr19 chunk 1-1/11 Tensors generated: 425 [INFO] chr19 chunk 1-1/3 Tensors generated: 487 [INFO] chr19 chunk 1-1/9 Tensors generated: 542 [INFO] chr19 chunk 1-1/12 Tensors generated: 570 [INFO] chr19 chunk 1-1/5 Tensors generated: 828 [INFO] chr19 chunk 1-1/4 Tensors generated: 1154 [INFO] chr19 chunk 1-1/10 Tensors generated: 946 [INFO] chr19 chunk 1-1/0 Tensors generated: 1692 [INFO] Pileup Model Prediction [INFO] RUN THE FOLLOWING COMMAND: ( parallel --joblog /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/parallel_2-2_predict.log -j 40 python3 /opt/bin/clairs.py predict --tensor_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/pileup_tensor_can/{1/} --call_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output/p_{1/}.vcf --chkpnt_fn /opt/conda/envs/clairs/bin/clairs_models/ont_r10_dorado_sup_5khz_ssrs/pileup.pkl --use_gpu False --platform ont --ctg_name {1/.} --pileup --show_germline :::: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/candidates/CANDIDATES_FILES ) 2>&1 | tee /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/2-2_PREDICT.log [INFO] chr19 total processed positions: 250, time elapsed: 0.9s [INFO] chr19 total processed positions: 349, time elapsed: 1.1s [INFO] chr19 total processed positions: 350, time elapsed: 1.2s [INFO] chr19 total processed positions: 487, time elapsed: 1.1s [INFO] chr19 total processed positions: 542, time elapsed: 1.3s [INFO] chr19 total processed positions: 425, time elapsed: 1.5s [INFO] chr19 total processed positions: 570, time elapsed: 1.6s [INFO] chr19 total processed positions: 701, time elapsed: 1.8s [INFO] chr19 total processed positions: 670, time elapsed: 2.1s [INFO] chr19 total processed positions: 946, time elapsed: 2.2s [INFO] chr19 total processed positions: 828, time elapsed: 2.4s [INFO] chr19 total processed positions: 1692, time elapsed: 3.6s [INFO] chr19 total processed positions: 1154, time elapsed: 3.5s [INFO] Merge Pileup VCFs [INFO] RUN THE FOLLOWING COMMAND: pypy3 /opt/bin/clairs.py sort_vcf --ref_fn /home/t_steimle/ref/hs1/chm13v2.0.fa --contigs_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/CONTIGS --input_dir /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output --vcf_fn_prefix p_ --output_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output/pileup.vcf [INFO] Sorting VCFs... [INFO] Finished VCF sorting! [INFO] STEP 3: Full-alignment Model Calling [INFO] Create Full-alignment Paired Tensors [INFO] RUN THE FOLLOWING COMMAND: ( parallel --joblog /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/parallel_3-1_create_pair_tensor_fa.log -j 40 pypy3 /opt/bin/clairs.py create_pair_tensor --normal_bam_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/norm/CHAHA_norm_hs1.bam --tumor_bam_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/phased_output/tumor_{1/.}.bam --ref_fn /home/t_steimle/ref/hs1/chm13v2.0.fa --ctg_name {1/.} --samtools samtools --candidates_bed_regions {1} --tensor_can_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/fa_tensor_can/{1/} --platform ont :::: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/candidates/CANDIDATES_FILES ) 2>&1 | tee /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/3-1_CPT.log [INFO] chr19 chunk 1-1/8 Tensors generated: 250 [INFO] chr19 chunk 1-1/7 Tensors generated: 350 [INFO] chr19 chunk 1-1/6 Tensors generated: 349 [INFO] chr19 chunk 1-1/11 Tensors generated: 425 [INFO] chr19 chunk 1-1/3 Tensors generated: 487 [INFO] chr19 chunk 1-1/9 Tensors generated: 542 [INFO] chr19 chunk 1-1/1 Tensors generated: 701 [INFO] chr19 chunk 1-1/12 Tensors generated: 570 [INFO] chr19 chunk 1-1/2 Tensors generated: 670 [INFO] chr19 chunk 1-1/5 Tensors generated: 817 [INFO] chr19 chunk 1-1/10 Tensors generated: 946 [INFO] chr19 chunk 1-1/0 Tensors generated: 1683 [INFO] chr19 chunk 1-1/4 Tensors generated: 1154 [INFO] Full-alignment Model Prediction [INFO] RUN THE FOLLOWING COMMAND: ( parallel --joblog /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/parallel_3-2_predict.log -j 40 python3 /opt/bin/clairs.py predict --tensor_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/fa_tensor_can/{1/} --call_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output/fa_{1/}.vcf --chkpnt_fn /opt/conda/envs/clairs/bin/clairs_models/ont_r10_dorado_sup_5khz_ssrs/full_alignment.pkl --use_gpu False --platform ont --ctg_name {1/.} --show_germline :::: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/candidates/CANDIDATES_FILES ) 2>&1 | tee /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/3-2_PREDICT.log [INFO] chr19 total processed positions: 250, time elapsed: 45.0s [INFO] chr19 total processed positions: 349, time elapsed: 62.9s [INFO] chr19 total processed positions: 350, time elapsed: 63.6s [INFO] chr19 total processed positions: 425, time elapsed: 76.1s [INFO] chr19 total processed positions: 487, time elapsed: 87.2s [INFO] chr19 total processed positions: 542, time elapsed: 100.1s [INFO] chr19 total processed positions: 570, time elapsed: 102.7s [INFO] chr19 total processed positions: 670, time elapsed: 122.7s [INFO] chr19 total processed positions: 701, time elapsed: 130.2s [INFO] chr19 total processed positions: 817, time elapsed: 148.4s [INFO] chr19 total processed positions: 946, time elapsed: 170.6s [INFO] chr19 total processed positions: 1154, time elapsed: 218.6s [INFO] chr19 total processed positions: 1683, time elapsed: 305.6s [INFO] Merge Full-alignment VCFs [INFO] RUN THE FOLLOWING COMMAND: pypy3 /opt/bin/clairs.py sort_vcf --ref_fn /home/t_steimle/ref/hs1/chm13v2.0.fa --contigs_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/CONTIGS --input_dir /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output --vcf_fn_prefix fa_ --output_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output/full_alignment.vcf [INFO] Sorting VCFs... [INFO] Finished VCF sorting! [INFO] STEP 4: Haplotype filtering [INFO] RUN THE FOLLOWING COMMAND: ( pypy3 /opt/bin/clairs.py haplotype_filtering --tumor_bam_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/phased_output/tumor_ --ref_fn /home/t_steimle/ref/hs1/chm13v2.0.fa --germline_vcf_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output/merge_output.vcf.gz --pileup_vcf_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output/pileup.vcf --full_alignment_vcf_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output/full_alignment.vcf --output_dir /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output --samtools samtools --pypy3 pypy3 --parallel parallel --threads 40 ) 2>&1 | tee /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/4_HAP_FILTER.log Total input calls: 2721, filtered by haplotype match 8 [INFO] STEP 5: Merge and sort VCF [INFO] RUN THE FOLLOWING COMMAND: ( pypy3 /opt/bin/clairs.py merge_vcf --ref_fn /home/t_steimle/ref/hs1/chm13v2.0.fa --pileup_vcf_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output/pileup_filter.vcf --full_alignment_vcf_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output/full_alignment_filter.vcf --output_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/output.vcf --platform ont --qual 8 --sample_name CHAHA_diag --enable_indel_calling True --prefer_recall False --cmdline /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/CMD ) 2>&1 | tee /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/5_MV.log [INFO] Full-alignment variants filtered by pileup: 271 [INFO] STEP 6: Indel Pileup Model Calling [INFO] Create Paired Tensors [INFO] RUN THE FOLLOWING COMMAND: pypy3 /opt/bin/clairs.py concat_files --input_dir /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/candidates --input_prefix INDEL_CANDIDATES_FILE_ --output_fn INDEL_CANDIDATES_FILES && ( parallel --joblog /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/parallel_6-1_create_pair_tensor_indel.log -j 40 pypy3 /opt/bin/clairs.py create_pair_tensor_pileup --normal_bam_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/norm/CHAHA_norm_hs1.bam --tumor_bam_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/CHAHA_diag_hs1.bam --ref_fn /home/t_steimle/ref/hs1/chm13v2.0.fa --ctg_name {1/.} --samtools samtools --candidates_bed_regions {1} --tensor_can_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/pileup_tensor_can/indel_{1/} --platform ont :::: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/candidates/INDEL_CANDIDATES_FILES ) 2>&1 | tee /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/6-1_CPTI.log [INFO] chr19 Tensors generated: 881 [INFO] chr19 Tensors generated: 773 [INFO] chr19 Tensors generated: 986 [INFO] chr19 Tensors generated: 1000 [INFO] chr19 Tensors generated: 715 [INFO] chr19 Tensors generated: 1183 [INFO] chr19 Tensors generated: 795 [INFO] chr19 Tensors generated: 801 [INFO] chr19 Tensors generated: 860 [INFO] chr19 Tensors generated: 868 [INFO] chr19 Tensors generated: 815 [INFO] chr19 Tensors generated: 891 [INFO] chr19 Tensors generated: 846 [INFO] Indel Pileup Model Prediction [INFO] RUN THE FOLLOWING COMMAND: ( parallel --joblog /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/parallel_6-2_predict_indel.log -j 40 python3 /opt/bin/clairs.py predict --tensor_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/pileup_tensor_can/indel_{1/} --call_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output/indel_p_{1/}.vcf --chkpnt_fn /opt/conda/envs/clairs/bin/clairs_models/ont_r10_dorado_sup_5khz_ssrs/indel/pileup.pkl --use_gpu False --platform ont --ctg_name {1/.} --pileup --enable_indel_calling True --show_germline :::: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/candidates/INDEL_CANDIDATES_FILES ) 2>&1 | tee /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/6-2_PREDICT_INDEL.log [INFO] chr19 total processed positions: 715, time elapsed: 1.6s [INFO] chr19 total processed positions: 795, time elapsed: 1.6s [INFO] chr19 total processed positions: 773, time elapsed: 1.7s [INFO] chr19 total processed positions: 986, time elapsed: 2.2s [INFO] chr19 total processed positions: 801, time elapsed: 2.2s [INFO] chr19 total processed positions: 868, time elapsed: 2.1s [INFO] chr19 total processed positions: 1000, time elapsed: 2.3s [INFO] chr19 total processed positions: 815, time elapsed: 2.1s [INFO] chr19 total processed positions: 1183, time elapsed: 2.9s [INFO] chr19 total processed positions: 846, time elapsed: 2.6s [INFO] chr19 total processed positions: 860, time elapsed: 2.5s [INFO] chr19 total processed positions: 891, time elapsed: 2.7s [INFO] chr19 total processed positions: 881, time elapsed: 2.7s [INFO] Merge Pileup VCFs [INFO] RUN THE FOLLOWING COMMAND: pypy3 /opt/bin/clairs.py sort_vcf --ref_fn /home/t_steimle/ref/hs1/chm13v2.0.fa --contigs_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/CONTIGS --input_dir /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output --vcf_fn_prefix indel_p_ --output_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output/indel_pileup.vcf [INFO] Sorting VCFs... [INFO] Finished VCF sorting! [INFO] STEP 7: Indel Full-alignment Model Calling [INFO] Create Full-alignment Paired Tensors [INFO] RUN THE FOLLOWING COMMAND: ( parallel --joblog /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/parallel_7-1_create_pair_tensor_fa_indel.log -j 40 pypy3 /opt/bin/clairs.py create_pair_tensor --normal_bam_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/norm/CHAHA_norm_hs1.bam --tumor_bam_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/phased_output/tumor_{1/.}.bam --ref_fn /home/t_steimle/ref/hs1/chm13v2.0.fa --ctg_name {1/.} --samtools samtools --candidates_bed_regions {1} --tensor_can_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/fa_tensor_can/indel_{1/} --platform ont :::: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/candidates/INDEL_CANDIDATES_FILES ) 2>&1 | tee /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/7-1_CPTI.log [INFO] chr19 Tensors generated: 715 [INFO] chr19 Tensors generated: 773 [INFO] chr19 Tensors generated: 801 [INFO] chr19 Tensors generated: 795 [INFO] chr19 Tensors generated: 986 [INFO] chr19 Tensors generated: 815 [INFO] chr19 Tensors generated: 860 [INFO] chr19 Tensors generated: 845 [INFO] chr19 Tensors generated: 1000 [INFO] chr19 Tensors generated: 881 [INFO] chr19 Tensors generated: 868 [INFO] chr19 Tensors generated: 891 [INFO] chr19 Tensors generated: 1181 [INFO] Indel Full-alignment Model Prediction [INFO] RUN THE FOLLOWING COMMAND: ( parallel --joblog /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/parallel_7-2_predict.log -j 40 python3 /opt/bin/clairs.py predict --tensor_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/fa_tensor_can/indel_{1/} --call_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output/indel_fa_{1/}.vcf --chkpnt_fn /opt/conda/envs/clairs/bin/clairs_models/ont_r10_dorado_sup_5khz_ssrs/indel/full_alignment.pkl --use_gpu False --platform ont --ctg_name {1/.} --enable_indel_calling True --show_germline :::: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/candidates/INDEL_CANDIDATES_FILES ) 2>&1 | tee /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/7-2_PREDICT_INDEL.log [INFO] chr19 total processed positions: 715, time elapsed: 127.0s [INFO] chr19 total processed positions: 773, time elapsed: 132.1s [INFO] chr19 total processed positions: 795, time elapsed: 136.5s [INFO] chr19 total processed positions: 801, time elapsed: 137.2s [INFO] chr19 total processed positions: 815, time elapsed: 139.3s [INFO] chr19 total processed positions: 868, time elapsed: 149.0s [INFO] chr19 total processed positions: 860, time elapsed: 149.4s [INFO] chr19 total processed positions: 845, time elapsed: 150.6s [INFO] chr19 total processed positions: 891, time elapsed: 158.6s [INFO] chr19 total processed positions: 881, time elapsed: 161.9s [INFO] chr19 total processed positions: 986, time elapsed: 170.9s [INFO] chr19 total processed positions: 1000, time elapsed: 171.0s [INFO] chr19 total processed positions: 1181, time elapsed: 205.9s [INFO] Merge Full-alignment VCFs [INFO] RUN THE FOLLOWING COMMAND: pypy3 /opt/bin/clairs.py sort_vcf --ref_fn /home/t_steimle/ref/hs1/chm13v2.0.fa --contigs_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/CONTIGS --input_dir /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output --vcf_fn_prefix indel_fa_ --output_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output/indel_full_alignment.vcf [INFO] Sorting VCFs... [INFO] Finished VCF sorting! [INFO] Indel Haplotype filtering [INFO] RUN THE FOLLOWING COMMAND: ( pypy3 /opt/bin/clairs.py haplotype_filtering --tumor_bam_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/phased_output/tumor_ --ref_fn /home/t_steimle/ref/hs1/chm13v2.0.fa --germline_vcf_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output/merge_output.vcf.gz --pileup_vcf_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output/indel_pileup.vcf --full_alignment_vcf_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output/indel_full_alignment.vcf --output_dir /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output --samtools samtools --pypy3 pypy3 --parallel parallel --threads 40 --is_indel ) 2>&1 | tee /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/8_INDEL_HAP_FILTER.log Total input calls: 1032, filtered by haplotype match 12 [INFO] STEP 8: Merge and sort Indel VCF [INFO] RUN THE FOLLOWING COMMAND: ( pypy3 /opt/bin/clairs.py merge_vcf --ref_fn /home/t_steimle/ref/hs1/chm13v2.0.fa --pileup_vcf_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output/indel_pileup_filter.vcf --full_alignment_vcf_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/vcf_output/indel_full_alignment_filter.vcf --output_fn /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/indel.vcf --platform ont --qual 8 --sample_name CHAHA_diag --enable_indel_calling True --indel_calling --prefer_recall False --cmdline /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/CMD ) 2>&1 | tee /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/logs/8_MVI.log [INFO] Full-alignment variants filtered by pileup: 100 [INFO] Total time elapsed: 33m48.00s [INFO] Finish calling, output file: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/output.vcf.gz [INFO] Finish calling, snv output file: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/snv.vcf.gz [INFO] Finish calling, indel output file: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/indel.vcf.gz [INFO] Finish calling, Clair3 tumor BAM germline output file: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/clair3_tumor_germline_output.vcf.gz [INFO] Finish calling, Clair3 normal BAM germline output file: /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/clair3_normal_germline_output.vcf.gz cp /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/tmp/clair3_output/clair3_tumor_output/merge_output.vcf.gz /mnt/beegfs02/scratch/t_steimle/data/wgs/CHAHA/diag/ClairS/part18/clair3_tumor_germline_output.vcf.gz ---------------------------------------------- | SLURM EPILOG | ---------------------------------------------- Job ID: 2556288 Cluster: flamingo User/Group: t_steimle/gs_hpc_biopath Nodelist: n11 Nodes: 1 Cores per node: 40 Job started at: 2025-12-08 16:07:06 Job ended at: 2025-12-08 16:40:57 Job Wall-clock time: 00:33:51 CPU Utilized: 03:49:32 CPU Efficiency: 16.95% of 22:34:00 core-walltime Memory Utilized: 23.59 GB Memory Efficiency: 58.98% of 40.00 GB