Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

wfmash to speed up #403

Open
OZTaekOppa opened this issue Aug 1, 2024 · 1 comment
Open

wfmash to speed up #403

OZTaekOppa opened this issue Aug 1, 2024 · 1 comment

Comments

@OZTaekOppa
Copy link

Dear PGGB team,

I have a few questions about your great program.

Based on the program, it appears that wfmash performs all-vs-all alignment on a single node. From my trials, this is indeed the case.

I am trying to speed up the wfmash process on multiple nodes (PBSpro) by running parallel jobs. My idea is to perform one-vs-all alignments for each node from an input full genome dataset (120 human pangenomes), and then merge the results into a single paf file for further analysis.

  1. Do you have any recommendations for tweaking the wfmash code to achieve this?
  2. If I run one-vs-all alignments on each node, will the merged paf file be equivalent to an all-vs-all alignment? Theoretically, I assume the final outcome should be the same.

Looking forward to your insights.

Kind regards,

Taek

@subwaystation
Copy link
Member

Dear @OZTaekOppa,

we have a short description on how to parallelize wfmash using several nodes. Please take a look at https://github.com/waveygang/wfmash?tab=readme-ov-file#running-wfmash-on-a-cluster.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants