Wikidata:Requests for permissions/Bot/OrophinBot
- The following discussion is closed. Please do not modify it. Subsequent comments should be made in a new section. A summary of the conclusions reached follows.
- Approved--Ymblanter (talk) 09:50, 7 July 2021 (UTC)[reply]
OrophinBot (talk • contribs • new items • new lexemes • SUL • Block log • User rights log • User rights • xtools)
Operator: Bennylin (talk • contribs • logs)
Task/s: Creating 60.000+ Indonesian lexemes. May create more after that batch is finished and I've prepared the next batch
Code: https://leksem-indonesia.toolforge.org, forked without modifications from https://lexeme-forms.toolforge.org by Lucas Werkmeister. I'm just preparing the Indonesian language template (and in the future would try to encompass all languages in Indonesia as well)
Function details: I have prepared the data for 2 months, and it's now ready to be inputted to Wikidata. My bot did the input on id.wiktionary a long time ago. The base data is the same, from the trusted source Great Dictionary of the Indonesian Language (Q4200623), and the first batch of 60.000+ lexemes consist of one word lexemes only. In the future will also work on word compounds.
- User:Bennylin/Leksem Indonesia - the lexeme forms
- User:Bennylin#Leksem - the lexeme count
- User:Bennylin/meng- (or this) - the first test run (100 lexemes out of 4350)
- Wikidata:Lexicographical_data/Documentation/Languages/id - the lexicographical project
- wikt:id:Kategori:Kata_bahasa_Indonesia - for comparison, id.wikt today has 67.056+ Indonesian words (excluding phrases)
--Bennylin (talk) 07:07, 17 June 2021 (UTC)[reply]
- Support Looks good to me! Not that I know any Indonesian. The bot is just adding forms and not senses, correct? ArthurPSmith (talk) 17:06, 17 June 2021 (UTC)[reply]
- Correct. But what if in the future I'm adding senses to the lexeme? Is there a caveat?
- Should I run a couple hundred test first with the bot, or should I wait some more people to weigh-in? Thanks! Bennylin (talk) 18:06, 17 June 2021 (UTC)[reply]
- @Bennylin: there is a possible licence issue regarding the import of senses to the lexeme. The Wikidata licence is CC0 which means we can only import here senses from dictionary released under this licence or in the public domain; otherwise it does not comply with the CC0 licence. About the other question, yes please run your bot with a few examples so that we can check whether it correctly runs. Pamputt (talk) 06:03, 18 June 2021 (UTC)[reply]
- Cool. I'm running 500 now.
- @Pamputt: Done. Please check. 500 verbs Bennylin (talk) 07:38, 18 June 2021 (UTC)[reply]
- @Bennylin: Thanks, could you also run your bot for lexeme other than vers (noun, adjective). Only a few of each is needed. Pamputt (talk) 12:34, 18 June 2021 (UTC)[reply]
- @Pamputt: 100 nouns and 165 adjectives test run. Bennylin (talk) 23:39, 18 June 2021 (UTC) PS: That was apparently my 10.000th edit here.[reply]
- @Bennylin: there is a possible licence issue regarding the import of senses to the lexeme. The Wikidata licence is CC0 which means we can only import here senses from dictionary released under this licence or in the public domain; otherwise it does not comply with the CC0 licence. About the other question, yes please run your bot with a few examples so that we can check whether it correctly runs. Pamputt (talk) 06:03, 18 June 2021 (UTC)[reply]
- Are we ready for approval here?--Ymblanter (talk) 18:08, 1 July 2021 (UTC)[reply]
- I am! Bennylin (talk) 10:43, 3 July 2021 (UTC)[reply]
- @Ymblanter:. It's been another week now. Bennylin (talk) 09:31, 7 July 2021 (UTC)[reply]
- I am! Bennylin (talk) 10:43, 3 July 2021 (UTC)[reply]