-
Notifications
You must be signed in to change notification settings - Fork 42
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
scraper directory uses the user-input output directory #95
Comments
Looks like this issue has been reported before, but not fixed yet in the main code: #56 |
yep; these are quite old bugs but since fewer people see to have been interested in quickscrape and work has been going on with updating thresher the code to fix this hasn't made it to the master branch yet. Have a look at tarrow/master for a place where lots of these fixes are. |
ah cool. I shall have a look at tarrow/master then, thx :) |
@tarrow erm... I just realised I don't know how to compile this kind of code from source. tarrow/master is at https://github.com/tarrow/quickscrape right?
won't install your quickscrape will it? How do I install your updated quickscrape? |
npm install --global tarrow/quickscrape should do it :) |
cheers. I know literally nothing about npm 😭 |
I installed
quickscrape
as per the readme instructionsI cloned the example journal scrapers repo.
I tried the first peerj-384 example in the readme, but it didn't work.
The problem is it appears to be looking for the scraper file inside of the specified output folder!
e.g. instead of looking in:
journal-scrapers/scrapers/peerj.json
it looks for the scraper file in:
peerj-384/journal-scrapers/scrapers/peerj.json
A quick workaround is just to specify output folder as .
The text was updated successfully, but these errors were encountered: