Allow for --total (-t) CLI functionality to be run using a Wikidata lexemes dump #520
Comments
We can use the Ref: |
Perfect, @axif0! Let's go this route for sure then :) I remember using this before I believe. Feel free to get started here! |
Another thing is that, if the user gives Do you want to query total from
- Wikidata query service
- Lexeme dump
(W/L/S) if the user gives Or should we totally remove the Wikidata query service feature?
Another suggestion is that, if a parsed lexeme dump in JSON format is available in the directory, then What do you think? @andrewtavis @wkyoshida @henrikth93
I'd say we can provide the options to the user if they don't pass something and then use the |
Terms
Description
This issue will be the first to add dump processing functionality to the Scribe-Data CLI. In it, we'll do the following:
- Add a --wikidata-dump (-wd) argument to the total command
- The check_lexeme_dump_prompt_download functionality from #518 (which adds the function to cli/utils.py) will be used to make sure that a dump is available, or to download one
- The dump processing will live in src/scribe_data/wikidata/parse_dump.py and be called from the CLI
Before starting, we should map out the best way to process the dump, with a specific question being whether we need to uncompress the dump or whether we can work directly from the compressed .json.bz2 file.
Contribution
@axif0 will be working on this as a part of Outreachy! 📶🚤
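On the open question of whether the dump has to be uncompressed first: Python's standard bz2 module can read a .json.bz2 file as a stream, so one option is to never write the uncompressed file to disk. The sketch below is only an illustration of that idea, not the contents of parse_dump.py. It assumes the dump follows the usual Wikidata JSON dump layout (a single JSON array with one entity per line), and the function names iter_dump_entities and count_lexemes are hypothetical.

```python
import bz2
import json
from collections import Counter


def iter_dump_entities(dump_path):
    """Yield one entity dict at a time from a compressed JSON dump.

    Assumes one entity per line inside a surrounding JSON array.
    """
    # Text mode decompresses lazily, so the whole dump is never held in memory.
    with bz2.open(dump_path, mode="rt", encoding="utf-8") as dump_file:
        for line in dump_file:
            line = line.strip().rstrip(",")
            # Skip blank lines and the "[" / "]" that wrap the array.
            if line in ("", "[", "]"):
                continue
            yield json.loads(line)


def count_lexemes(dump_path):
    """Count lexemes per (language QID, lexical category QID) pair."""
    totals = Counter()
    for entity in iter_dump_entities(dump_path):
        # Ignore anything in the file that is not a lexeme.
        if entity.get("type") != "lexeme":
            continue
        totals[(entity.get("language"), entity.get("lexicalCategory"))] += 1
    return totals
```

Streaming trades speed for disk space: decompressing on the fly is slower than reading an already uncompressed file, but it avoids storing a second, much larger copy of the dump. Whether that trade-off suits the total command would need to be measured on a real dump.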