updated rule filter (in merge_samples) #64

gmafrafortuna · 2023-11-20T10:49:29Z

so it starts from the file with the most samples

janaobsteter · 2023-11-20T12:02:00Z

Snakemake/workflow/rules/merge_samples_vcf.smk

        resources: cpus=1, mem_mb=64000, time_min=60
        run:
            import os
+
+            for file in input:
+                shell("wc -l {file} >> files_len.txt")


This sorts by the number of lines, which tracks the number of SNPs (plus header lines), not the number of samples. Is this OK?

I guess this is about the number of individuals, correct @gmafrafortuna? We could probably extract the number of individuals using vcf/bcftools. I also found this awk one-liner: `awk '{if ($1 == "#CHROM"){print NF-9; exit}}' input_vcf_file.vcf`. It looks at the #CHROM line of the VCF file, which is the column-header line (and includes the individual names), and subtracts the 9 fixed columns to get the number of individuals.
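Since the `run:` block is already Python, the same header-based count could also be done without shelling out. The following is only a minimal sketch under stated assumptions: the helper names `count_samples` and `order_by_sample_count` are hypothetical and not part of the workflow, and the inputs are assumed to be tab-delimited VCFs, either plain or gzip/bgzip-compressed.

```python
import gzip


def count_samples(vcf_path):
    """Return the number of sample columns in a VCF (plain or gzip-compressed).

    Hypothetical helper: mirrors the awk one-liner by reading only as far as
    the #CHROM header line.
    """
    opener = gzip.open if str(vcf_path).endswith(".gz") else open
    with opener(vcf_path, "rt") as handle:
        for line in handle:
            if line.startswith("#CHROM"):
                fields = line.rstrip("\n").split("\t")
                # The first 9 columns are fixed (CHROM..FORMAT); the rest are samples.
                return max(len(fields) - 9, 0)
            if not line.startswith("#"):
                break  # reached the records without seeing a header line
    raise ValueError(f"no #CHROM header line found in {vcf_path}")


def order_by_sample_count(vcf_paths):
    """Sort VCF paths so that the file with the most samples comes first."""
    return sorted(vcf_paths, key=count_samples, reverse=True)
```

Inside the rule, something like `ordered = order_by_sample_count(list(input))` would then replace the `wc -l` loop. If bcftools is available in the environment, `bcftools query -l file.vcf.gz | wc -l` gives the same count from the command line.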

updated rule filter (in merge_samples) so it starts from the file with the most samples

561fe75

gmafrafortuna requested a review from janaobsteter November 20, 2023 10:49

janaobsteter reviewed Nov 20, 2023

gmafrafortuna and others added 17 commits April 12, 2024 11:26

changed rule all input to be the simplified tree

fd7503d

no changes

f8fa59c

now filtering overlapping samples from vcfs in one step

cfa7c01

simplified rule

4c318c3

changed options after testing to improve accuracy

4389bfb

updated script to add aa to vcf keeping all sites

107192f

updated rules and scripts for ts inference. final output is now simplified tree

cdc238c

script to simplify ts

4b3b914

updated default resources and increased for certain rules (to run on supernodes)

6e700c4

config file to run global cattle demography project

0b84d65

Merge branch 'HighlanderLab:main' into main

e86a93c

updated version to run cattle inferences

19e32cf

Merge branch 'main' into wip

f5f42bb

Merge pull request #1 from gmafrafortuna/wip

87c4a20

Wip

Update Snakefile

5606ece

Update ancestral_info_to_vcf.smk

f4cc4b1

Update cluster.json

a911031


hannesbecher Dec 18, 2023
