How do I specify FLNC reads in IsoQuant #226

sanyalab · 2024-08-19T08:23:28Z

Hi,

I have PacBio FLNC reads in FASTQ format. What options should I specify when running the tool? I was thinking of
--data_type pacbio --fl_data. Is this correct?

Thanks
Abhijit


andrewprzh · 2024-08-19T22:29:59Z

Dear @sanyalab

Yes, this set of options is correct.

Best
Andrey
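For reference, a full invocation with the confirmed options might look like the sketch below. It only builds and prints the command line rather than running IsoQuant. The file names `genome.fasta`, `flnc.fastq` and `isoquant_out` are placeholders, and the `--reference`, `--fastq` and `-o` options are assumed from IsoQuant's command-line help rather than taken from this thread, so check them against `isoquant.py --help` for your version.

```python
import shlex


def isoquant_flnc_command(reference, reads, out_dir):
    """Build (but do not run) an IsoQuant command line for PacBio FLNC reads."""
    return [
        "isoquant.py",
        "--reference", reference,  # genome FASTA (placeholder path)
        "--fastq", reads,          # FLNC reads in FASTQ format (placeholder path)
        "--data_type", "pacbio",   # option discussed in this thread
        "--fl_data",               # option discussed in this thread
        "-o", out_dir,             # output directory (placeholder)
    ]


cmd = isoquant_flnc_command("genome.fasta", "flnc.fastq", "isoquant_out")
# Print a shell-ready string that can be copied into a terminal.
print(shlex.join(cmd))
```

The list form can also be passed directly to `subprocess.run` once IsoQuant is installed.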

sanyalab · 2024-08-21T01:30:40Z

Hi Andrey,

A few other questions:

Are you guys sure there is no difference between pacbio and pacbio_ccs? I used 1.2 million IsoSeq FLNC reads and got 7780 transcripts for pacbio_ccs and 5438 for pacbio. I am using the 3.5 version.
What is the difference among default_pacbio, sensitive_pacbio, and fl_pacbio, other than the number of transcripts?
I am working with a fungal genome (<100MB) in a contig state, that has 2 haplotypes.
2a. Do I concatenate the haplotype genomes and use them together for IsoQuant or use these separately as I have done above.
2b. Does this decrease (1.2 mil to ~8000) seem reasonable for a fungal genome? Any suggestions on the optimal number of reads (genome agnostic) for IsoQuant?

Thanks
Abhijit

andrewprzh · 2024-08-22T12:49:56Z

@sanyalab

> Are you guys sure there is no difference between pacbio and pacbio_ccs? I used 1.2 million IsoSeq FLNC reads and got 7780 transcripts for pacbio_ccs and 5438 for pacbio. I am using the 3.5 version.

Yes, they are just aliases. Could you send me the logs for these runs?

> What is the difference among default_pacbio, sensitive_pacbio, and fl_pacbio other than the transcript number.

These are just different option presets. sensitive_pacbio applies slightly lighter filters compared to default_pacbio. fl_pacbio requires known transcripts to be covered by FSM (full splice match) reads in order to be reported. From the user's perspective, the only difference is the number of reported transcripts.
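To compare the presets on the same data, one run per preset is needed. The sketch below builds one command line per preset without executing anything. It assumes the presets are selected with `--model_construction_strategy` (as listed in IsoQuant's help; verify for your version), and the input paths and output directory layout are placeholders.

```python
import shlex

# Preset names as mentioned in this thread.
PRESETS = ["default_pacbio", "sensitive_pacbio", "fl_pacbio"]


def preset_commands(reference, reads, out_root):
    """Return one IsoQuant command line (as a list) per preset."""
    commands = {}
    for preset in PRESETS:
        commands[preset] = [
            "isoquant.py",
            "--reference", reference,
            "--fastq", reads,
            "--data_type", "pacbio",
            "--fl_data",
            # Assumed option name for choosing a preset.
            "--model_construction_strategy", preset,
            # Separate output directory per preset so results are not overwritten.
            "-o", f"{out_root}/{preset}",
        ]
    return commands


for preset, cmd in preset_commands("genome.fasta", "flnc.fastq", "runs").items():
    print(shlex.join(cmd))
```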

> I am working with a fungal genome (<100MB) in a contig state, that has 2 haplotypes.
> 2a. Do I concatenate the haplotype genomes and use them together for IsoQuant or use these separately as I have done above.

I have very little experience with diploid genomes, especially highly diploid ones. I would first try to create a consensus genome, if that is even possible. If not, using them separately could be better, since there can be far too many multimappers when using a concatenated genome.

> 2b. Does this decrease (1.2 mil to ~8000) seem reasonable for a fungal genome? Any suggestions on the optimal number of reads (genome agnostic) for IsoQuant?

It's very hard to predict how many novel transcripts should be detected and what a reasonable number is. It depends on the genome itself, how well it is sequenced, how deep your sequencing is, etc. So the only suggestion I can give is to check related genomes, or to try different settings / tools and compare the output.
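When comparing runs with different settings, transcript counts can be read directly from the output annotation. The sketch below counts `transcript` features in a GTF file; it relies only on the standard GTF column layout, and the tiny example file it writes is made up for illustration. The path of the GTF that IsoQuant writes for a given run is not specified here and should be looked up in the output directory.

```python
import os
import tempfile


def count_transcripts(gtf_path):
    """Count lines whose feature column (3rd tab-separated field) is 'transcript'."""
    count = 0
    with open(gtf_path) as handle:
        for line in handle:
            if line.startswith("#"):
                continue  # skip header/comment lines
            fields = line.rstrip("\n").split("\t")
            if len(fields) >= 3 and fields[2] == "transcript":
                count += 1
    return count


# Made-up two-transcript GTF, used only to demonstrate the function.
example = (
    "# example annotation\n"
    "contig1\ttool\tgene\t1\t900\t.\t+\t.\tgene_id \"g1\";\n"
    "contig1\ttool\ttranscript\t1\t900\t.\t+\t.\tgene_id \"g1\"; transcript_id \"t1\";\n"
    "contig1\ttool\texon\t1\t900\t.\t+\t.\tgene_id \"g1\"; transcript_id \"t1\";\n"
    "contig1\ttool\ttranscript\t1\t500\t.\t+\t.\tgene_id \"g1\"; transcript_id \"t2\";\n"
)
with tempfile.NamedTemporaryFile("w", suffix=".gtf", delete=False) as tmp:
    tmp.write(example)
    example_path = tmp.name

print(count_transcripts(example_path))
os.remove(example_path)
```

Running the same function on the GTF from each run gives numbers that can be compared side by side.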

Best
Andrey

sanyalab · 2024-08-23T16:08:18Z

Hi Andrey,

I'll generate the files again. Since it was a test and I was playing with the hyperparameters, I didn't know what to retain. No worries, I'll generate the files and send you the logs. Thank you for the insightful comments.

-Abhijit

andrewprzh added the "question" (Further information is requested) label on Aug 19, 2024
