Commons:Bots/Requests/BMacZeroBot 6
BMacZeroBot (talk · contribs) 6
Operator: BMacZero (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
Bot's tasks for which permission is being sought: Perform a batch upload from NPGallery, the US National Park Service's photo repository (Commons:Batch uploading/NPGallery). Licensing for these images is all over the place, so I'm being careful to determine public domain status before uploading; I'll be starting with pre-1924 images, images credited only as "NPS Photo", and images credited to people who are whitelisted as NPS employees.
Automatic or manually assisted: Automatic
Edit type (e.g. Continuous, daily, one time run): One-time
Maximum edit rate (e.g. edits per minute): 6 uploads per minute (given the speed of the source site, it will in all likelihood be far slower)
Bot flag requested: (Y/N): Already has.
Programming language(s): C#
BMacZero (talk) 05:33, 10 May 2019 (UTC)
Discussion
Test run can be found in Category:Images from NPGallery to check. For the first few I made hand edits after uploading - in each case, I also updated the bot to do those things automatically. BMacZero (talk) 05:33, 10 May 2019 (UTC)
- Please use language tags for Photographer and Depicted place fields. I also don't think that photos like File:Photo Op (5b2a2b65-0ba5-422a-8b04-dc47d36ee650).jpg and File:Posing (ad003174-6386-48df-a200-59eece4f59ce).jpg fit into Commons:Project scope. --EugeneZelenko (talk) 13:47, 10 May 2019 (UTC)
- @EugeneZelenko: Good catch, will add the language tags. I'm not sure it's possible for me to effectively detect photos that are primarily of specific people and therefore out of scope. I can try looking for phrases like "a man" and "two women" in the text, but I think we'd probably miss some good images that way. Maybe I can add such images to an additional check category for humans to review? BMacZero (talk) 16:02, 10 May 2019 (UTC)
- Users with quibbles are free to nominate any image for deletion. The overall value of the collection outweighs the odd out-of-scope random selfie shot. And images of specific non-notable people doing activities (or in locations) which are poorly illustrated on Commons can have educational use. Users of Wikivoyage or Wikibooks might like to use an image of everyday hikers in Denali National Park, without the inherent distraction in depicting a celebrity. Commons does not exist solely as a servant of Wikipedia, nor is it restricted to only Wikimedia projects. --Animalparty (talk) 17:49, 10 May 2019 (UTC)
- Sorry, but I don't see big value to have pictures of everyday hikers in every possible place to hike. Places are more then enough for Wikivoyage. --EugeneZelenko (talk) 13:44, 11 May 2019 (UTC)
- @EugeneZelenko: I added the logic I described above and put the images it would catch into Category:Images from NPGallery to check for scope. I'll can monitor this as I go and add more conditions as I spot them. If you'd rather we make the decision preemptively and not upload these at all, I can open a wider discussion to see if there is a consensus one way or the other. BMacZero (talk) 17:17, 11 May 2019 (UTC)
- Sure, it's good idea to organize project to post-process files in mass uploads in many respects: reviewing, categorization, adding metadata, etc. --EugeneZelenko (talk) 14:08, 12 May 2019 (UTC)
- @EugeneZelenko: I added the logic I described above and put the images it would catch into Category:Images from NPGallery to check for scope. I'll can monitor this as I go and add more conditions as I spot them. If you'd rather we make the decision preemptively and not upload these at all, I can open a wider discussion to see if there is a consensus one way or the other. BMacZero (talk) 17:17, 11 May 2019 (UTC)
- Sorry, but I don't see big value to have pictures of everyday hikers in every possible place to hike. Places are more then enough for Wikivoyage. --EugeneZelenko (talk) 13:44, 11 May 2019 (UTC)
- Another useful field/addition to include is the NPGallery Album name of the images, to greatly improve categorization. This photo of a squirrel, categorized only at the National Park level (which are prone to crowding anyway), is one of 148 images in the Album: Squirrels in Denali: having the category name machine readable would greatly facilitate placing into Category:Mammals of Denali National Park or new subsidiary categories. --Animalparty (talk) 18:13, 10 May 2019 (UTC)
- @Animalparty: Good catch, I forget to grab those on the download run. I've added an Album(s) line to the uploads. I also added the "NPS Unit Code". I did another 10 uploads with the changes. BMacZero (talk) 04:00, 11 May 2019 (UTC)
- Please also take a look on recent uploads. For example, language tags are missing in some fields in File:Ranger-Led Hike (bb4d7778-d65b-4ba3-8ddc-464d63ae1f4e).jpg, will be good idea to use plain text in description. --EugeneZelenko (talk) 14:08, 12 May 2019 (UTC)
- @EugeneZelenko: I added code to strip the spans and brs, though I am adding the paragraphs to separate the different elements that are going into the Description. I also code to add en tags to any text fields missing them (not the Unit Code, though, since it's a technical code). BMacZero (talk) 19:09, 13 May 2019 (UTC)
Any other problems or thoughts, or should I proceed? @EugeneZelenko: – BMacZero (🗩) 02:48, 27 May 2019 (UTC)
- It'll be reasonable to repeat test run. --EugeneZelenko (talk) 14:10, 27 May 2019 (UTC)
- I started one and noticed some metadata keys that weren't getting downloaded. I'm fixing that up and I'll do a clean test afterwards. – BMacZero (🗩) 18:46, 30 May 2019 (UTC)
- @EugeneZelenko: Clean test finished, see Category:NPGallery Batch Upload Test 3. I did end up excluding the theoretical out-of-scope images; I'll work later on how to deal with those. – BMacZero (🗩) 16:00, 4 June 2019 (UTC)
- Looks OK for me. I could only suggest to improve edit summary: (BOT) could be omitted because account name state this fact; it could refer to particular batch (NPS, it's particular division, etc). --EugeneZelenko (talk) 13:36, 5 June 2019 (UTC)
- @EugeneZelenko: Good ideas; I will do that. – BMacZero (🗩) 02:42, 6 June 2019 (UTC)
- @EugeneZelenko: Do you need anything else, or is this good to go? – BMacZero (🗩) 15:29, 25 June 2019 (UTC)
- You didn't notify about new test run, but latest uploads look OK for me. --EugeneZelenko (talk) 13:45, 26 June 2019 (UTC)
- @EugeneZelenko: Thanks. Sorry - I didn't do a test run because the change was very minor. I'll be sure to check it before I start. – BMacZero (🗩) 17:40, 26 June 2019 (UTC)
- Looks OK for me. I could only suggest to improve edit summary: (BOT) could be omitted because account name state this fact; it could refer to particular batch (NPS, it's particular division, etc). --EugeneZelenko (talk) 13:36, 5 June 2019 (UTC)
- @EugeneZelenko: Clean test finished, see Category:NPGallery Batch Upload Test 3. I did end up excluding the theoretical out-of-scope images; I'll work later on how to deal with those. – BMacZero (🗩) 16:00, 4 June 2019 (UTC)
- I started one and noticed some metadata keys that weren't getting downloaded. I'm fixing that up and I'll do a clean test afterwards. – BMacZero (🗩) 18:46, 30 May 2019 (UTC)
If there are no objections, I think task should be approved. --EugeneZelenko (talk) 13:45, 26 June 2019 (UTC)