Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Trial registration statement missing from xml output #1143

Open
mariadelmarq opened this issue Jul 15, 2024 · 3 comments
Open

Trial registration statement missing from xml output #1143

mariadelmarq opened this issue Jul 15, 2024 · 3 comments
Labels
error cases Some error/test case for future improvements

Comments

@mariadelmarq
Copy link

Hi again,

Another "error case". For the published pdf for https://pubmed.ncbi.nlm.nih.gov/27917460/ (not the freely available author manuscript), the "Trial Registration" located just under the abstract on page 1 is missing from Grobid's xml output. Just checking if there is an easy fix, and happy to chat more or send more info through if it helps.

@lfoppiano lfoppiano added the error cases Some error/test case for future improvements label Jul 21, 2024
@lfoppiano
Copy link
Collaborator

lfoppiano commented Jul 21, 2024

This issue is partially mentioned in the #1142 (point 2), that this information that does not have a specific standardized place in the header is lost, and we should keep it or find a place for it.

@kermitt2
Copy link
Owner

Hello! We could add the trial information in the data availability section. I think the issue is that we don't have text content about trial registration in the training corpus for data availability section currently, so it's ignored or it goes under the funding section. Having better recognition of the information about clinical trials is also an objective of the French Open Science Monitor.

@mariadelmarq
Copy link
Author

That would be amazing! We're interested in general statements about preregistration, which includes clinical trial registration, so it would be great to capture the broader statements if at all possible!

@lfoppiano lfoppiano changed the title Section of pdf missing from xml output Trial registration statement missing from xml output Jul 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
error cases Some error/test case for future improvements
Projects
None yet
Development

No branches or pull requests

3 participants