Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

request: extract brain area metadata #1985

Open
bendichter opened this issue Jul 22, 2024 · 2 comments
Open

request: extract brain area metadata #1985

bendichter opened this issue Jul 22, 2024 · 2 comments

Comments

@bendichter
Copy link
Member

bendichter commented Jul 22, 2024

Based on usage feedback from NeuroDataReHack and from personal experience, searching for brain area is currently a bottleneck. @neurovium also included search by brain area in his search specification document.

Currently, brain area is required for some modalities, but it is buried. It is not currently extracted into asset or dandiset metadata, and requires reading individual NWB file contents. It is also in different places for different modalities.

Searching for brain area can therefore take a long time, particularly in datasets that have many assets. For the IBL dataset, we have been able to get this time down to 8:26 using LINDI (see discussion), but that's still not great. It would be much better to pre-extract this metadata, and provide it as asset and dandiset metadata, which would make searching much faster on the user side. It would also allow us to register terms against ontologies and controlled vocabularies and run analyses on the types of brain areas recorded.

Specifically:

  1. Modify the Dandiset and Asset metadata to include brain area
  2. Modify the CLI to extract this information from the NWB files
  3. Modify the NWB Inspector to promote including this information
  4. Modify the DLP to display this information
@bendichter
Copy link
Member Author

bendichter commented Jul 23, 2024

  1. Modify the Dandiset and Asset metadata to include brain area

It looks like this is taken care of already with the "Anatomy" entries, which can be added under subject matter for both assets and dandisets. The (potentially) missing piece is that we want an UBERON (or other) uri. Most of these brain areas are Allen Institute Mouse CCFv3 abbreviations. Are there official URIs for these abbreviations? Is there a recommended way to map them to UBERON areas?

@satra
Copy link
Member

satra commented Jul 23, 2024

pinging @lydiang on CCFv3 URIs

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants