Build variable suggester tool #26

Open
dougiesquire opened this issue Apr 6, 2023 · 2 comments

@dougiesquire
Collaborator

dougiesquire commented Apr 6, 2023

One current difficulty for new users of the ACCESS-NRI catalog is that variable names are taken directly from the model output, so the same variable can appear under different names in the catalog. The task of translating variables to a common vocabulary is probably too difficult/large and probably would not even be wanted by most users (many users will know the name of the variable they're looking for). Instead, we could build a tool that recommends synonym variables. E.g.

$ synonym_variables("sst")
You might also be interested in variables named: tos, ...

This tool could learn from the datastores (e.g. using standard_names and long_names) in the catalog as they're added.
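
A minimal sketch of how such a tool might group names, assuming each catalog entry can be reduced to a variable name plus optional standard_name and long_name attributes. The record layout, the sample records, and the helper `build_synonym_index` are hypothetical and only illustrate the idea; they are not part of the catalog's API.

```python
from collections import defaultdict


def build_synonym_index(records):
    """Map each variable name to the other names sharing an attribute value.

    `records` is an iterable of dicts with a "name" key and optional
    "standard_name" / "long_name" keys (a hypothetical layout).
    """
    # Group variable names by (attribute, normalised value).
    by_attr = defaultdict(set)
    for rec in records:
        for key in ("standard_name", "long_name"):
            value = rec.get(key)
            if value:
                by_attr[(key, value.strip().lower())].add(rec["name"])

    # Every name in a group is a synonym of every other name in that group.
    synonyms = defaultdict(set)
    for names in by_attr.values():
        for name in names:
            synonyms[name] |= names - {name}
    return synonyms


def synonym_variables(name, index):
    """Return a sorted list of suggested synonyms for `name`."""
    return sorted(index.get(name, set()))


# Made-up records standing in for metadata harvested from datastores.
records = [
    {"name": "sst", "standard_name": "sea_surface_temperature"},
    {"name": "tos", "standard_name": "sea_surface_temperature"},
    {"name": "tas", "standard_name": "air_temperature"},
]
index = build_synonym_index(records)
print(synonym_variables("sst", index))  # ['tos']
```

Because the index is rebuilt from whatever records are passed in, it could be regenerated each time a datastore is added. Exact matching on attribute values is the simplest choice here; fuzzy matching on long_name would be a separate design decision.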

@dougiesquire dougiesquire self-assigned this May 19, 2023
@dougiesquire dougiesquire added the enhancement New feature or request label Jun 13, 2023
@paolap

paolap commented Jul 19, 2024

"The task of translating variables to a common vocabulary is probably too difficult/large and probably would not even be wanted by most users"
This is exactly what ACCESS-MOPPeR does, since we need that mapping to post-process the variables, so it can be reused when creating intake catalogues. I just added an extra "intake" option to MOPPeR, so it can now create a mapping file (needed for the post-processing step) and an intake catalogue that lists the simulation files as a multi-variable catalogue file, with extra lines for each variable that can be mapped and/or has a standard_name. Happy to show a demo.
You wouldn't want the exact same behavior, but I could add an intake-nri template to my tool so that it could also produce an nri-style esm-catalog.

@dougiesquire
Collaborator Author

That sounds neat, @paolap. Having the variable mappings would be very helpful for users.
