Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Retrieving protein-coding sequence from a giving function #54

Open
Streptomyces1 opened this issue May 2, 2021 · 1 comment
Open

Retrieving protein-coding sequence from a giving function #54

Streptomyces1 opened this issue May 2, 2021 · 1 comment

Comments

@Streptomyces1
Copy link

Streptomyces1 commented May 2, 2021

Hey @elizabethmcd good afternoon!
I have a very newbie issue related to my results: I run the summarize metabolism option on my isolated genomes, and now I´m interested on checking the protein sequence with matches a given functions, carbon monoxide dehydrogenase, let´s say. I se that on the annotation result I have the protein sequences, and on the summary table I know I have at least one copy, so I´d like to know how to identify it among the protein sequences. Should I blast one by one, or there is an easier way?
thank you!

@elizabethmcd
Copy link
Owner

I can't remember if in the out folder if the program saves the output of sequences for each individual hits, or just the HMM outputs. You could search those HMM output files, or use the single-marker-phylogeny with the HMM for the carbon monoxide dehydrogenase, and that will give you the sequence files in both the .faa and alignment files for hits in your genomes. You could just tell it to run with FastTree if you aren't interested in robust phylogenies and just want the sequences for that marker specifically.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants