Automatic clone_threshold is 'NA'. Consider setting --clonal_threshold manually. #322

Closed
petemeng opened this issue Apr 16, 2024 · 3 comments
Labels
bug Something isn't working

Comments

@petemeng

Description of the bug

I encountered the following error when running the command below:
"Automatic clone_threshold is 'NA'. Consider setting --clonal_threshold manually."
After setting the --clonal_threshold parameter to 0.01, the pipeline ran successfully.

Note: I am using the test data, which I downloaded locally.

Command used and terminal output

nextflow run airrflow/main.nf \
  -profile docker \
  --mode fastq \
  --input data/10x_sc_raw.tsv \
  --library_generation_method sc_10x_genomics \
  --reference_10x /Project/Nextflow/airrflow/data/refdata-cellranger-vdj-GRCh38-alts-ensembl-5.0.0.tar.gz \
  --outdir ./results \
  --imgtdb_base /Project/Nextflow/airrflow/data/imgtdb_base.zip \
  --igblast_base /Project/Nextflow/airrflow/data/igblast_base.zip \
  --clonal_threshold auto

Relevant files

.nextflow.log

System information

Nextflow 23.04.3
Container: Docker
OS: Ubuntu 18.04
Version of nf-core/airrflow: 3.3.0

@petemeng petemeng added the bug Something isn't working label Apr 16, 2024
@ggabernet
Member

Hi @petemeng,
Yes, this is the intended behaviour of airrflow: it notifies the user when the clonal threshold cannot be set automatically.
You can find a report under results/clonal_analysis/find_threshold/index.html. It shows a plot of the Hamming distance distribution, which you can use to choose an appropriate threshold manually for your dataset.

For example, for this analysis:
[Screenshot: distance distribution plot from the find_threshold report]

A threshold of around 0.11 appears to separate sequences that form clones from singletons.
A more detailed explanation of how this method works can be found in the Immcantation SHazaM vignette: https://shazam.readthedocs.io/en/stable/vignettes/DistToNearest-Vignette/
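
As a rough illustration of the idea behind the plot (not the SHazaM implementation), the sketch below computes a length-normalized Hamming distance from each sequence to its nearest neighbour and then places a threshold in the widest gap of that distribution. The function names, the grouping by junction length only, and the largest-gap heuristic are all assumptions made for this example; SHazaM groups sequences more strictly and fits the distribution rather than looking for a gap.

```python
from collections import defaultdict


def hamming_norm(a, b):
    """Length-normalized Hamming distance between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b)) / len(a)


def dist_to_nearest(junctions):
    """Distance from each junction to its nearest non-identical neighbour.

    Simplifying assumption: sequences are only compared when they have the
    same length. A sequence with no comparable neighbour gets None, which
    plays the role of NA in this sketch.
    """
    by_len = defaultdict(list)
    for j in junctions:
        by_len[len(j)].append(j)

    result = []
    for j in junctions:
        # Skip identical sequences (this also skips the sequence itself).
        dists = [hamming_norm(j, k) for k in by_len[len(j)] if k != j]
        result.append(min(dists) if dists else None)
    return result


def largest_gap_threshold(dists):
    """Hypothetical stand-in for automatic threshold detection.

    Returns the midpoint of the widest gap between sorted distances,
    or None when there is nothing to separate (for example when every
    distance is None).
    """
    values = sorted(d for d in dists if d is not None)
    if len(values) < 2:
        return None
    gaps = [(b - a, (a + b) / 2) for a, b in zip(values, values[1:])]
    width, midpoint = max(gaps)
    return midpoint if width > 0 else None


if __name__ == "__main__":
    # Toy junctions: two clone-like pairs, two distant sequences,
    # and one sequence with a unique length (no neighbour).
    toy = [
        "AAAAAAAAAA", "AAAAAAAAAC",
        "CCCCCCCCCC", "CCCCCCCCCG",
        "AAAAATTTTT", "GGGGGTTTTT",
        "ACGT",
    ]
    distances = dist_to_nearest(toy)
    print(distances)
    print(largest_gap_threshold(distances))
```

In this toy setting, a dataset in which no sequence has a comparable neighbour yields only None distances, so no threshold can be derived automatically and one has to be chosen by hand.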

@petemeng
Author

Thank you for your answer.

@zhanyinx

zhanyinx commented Sep 2, 2024

Hi there,

Thanks for setting up this nice workflow. I encountered a similar problem. When I open the HTML report, it says:

## All `dist_nearest` values are NA. Skipping threshold analysis.

How could I set the threshold if all distances are NA?

Thanks for your help.
Best
Zhan
