Links to erfgoedbot search/statistics pages edit

Hi Multichill, thanks for making this critical tool for WLM. I was looking for the 'Search Monuments' and 'Statistics' pages mentioned toward the bottom of the infographic on Erfgoedbot's userpage, but I haven't been able to find them at a glance. I imagine others might run into this problem. Could you please provide links to those pages? Would it make sense to include the links on the userpage here and on Commons? Best, Emw (talk) 11:04, 25 July 2012 (UTC)Reply

More frequent updates? edit

Morning, Any possibility of running this bot more frequently during WLM? Noticed the bot has reached its max thumb quota the last 2 days for the US NRHP unused images. Thanks! 25or6to4 (talk) 16:19, 4 September 2012 (UTC)Reply

The database contains over 1 million items so I rather not update it more often.
I think once you guys cleared the backlog the limit of 400 won't be reached anymore. Multichill (talk) 21:25, 4 September 2012 (UTC)Reply

Misbehaving at Wikipedia:WikiProject National Register of Historic Places/Unused images edit

The problem is that photos that were added yesterday (maybe going back to last week) are still included in today's update. Not sure why - was there some change a week ago? But it makes the page a lot more time-consuming to use.

Any help appreciated.

Smallbones(smalltalk) 15:47, 24 October 2012 (UTC)Reply

Fixed. The database was not updating because of a typo. Multichill (talk) 18:46, 24 October 2012 (UTC)Reply
There's another problem with this article; File:Adams Mills Lock 28.jpg keeps popping up, even though it's already being used for National Register of Historic Places listings in Muskingum County, Ohio. I realize what the problem is; the image is for the Muskingum River Navigation Historic District, which exists in Coshocton, Muskingum, Morgan, Washington Counties. However, the Morgan County list has a hidden message stating specifically, "Image goes here. Please don't add Triple locks 01.jpg or Adams Mills Lock 28.jpg because they're not in Morgan County." So this image really shouldn't turn up on the list. Other than that, everything seems to be working halfway decent. ---------User:DanTD (talk) 15:10, 12 October 2013 (UTC)Reply
Comments are filtered off so it ends up being empty. Multichill (talk) 15:12, 12 October 2013 (UTC)Reply
So that image is going to keep popping up here until somebody either finds an appropriate image, or misuses this one? That sucks.-------User:DanTD (talk) 14:38, 13 October 2013 (UTC)Reply

Incorrect removal? edit

this edit removed all the images from Wikipedia:WikiProject_Historic_sites/Unused_images_of_heritage_sites_in_South_Africa although many of them are still unused. Can this please be corrected and the unused images page updated? Zaian (talk) 20:46, 11 October 2013 (UTC)Reply

NJR ZA broke it with this edit. I reverted it. This edit broke all the bot functionality. Multichill (talk) 20:56, 11 October 2013 (UTC)Reply
Thank you! Zaian (talk) 15:19, 14 October 2013 (UTC)Reply

Duplicates in HABS uploads edit

Could a bit of weeding be added to do the equivalent of this change? Any TIFF file with a PNG or jpeg of the same filename can be safely assumed to be a duplicate. In the HABS uploads the PNGs have only been created for TIFFs which cannot display a thumbnail, and this is likely to take care of most cases (there is a lag in creating them). -- (talk) 12:20, 21 July 2014 (UTC)Reply

Hi , great to see new images being added! The bot offers all options so people can choose which image they want to use. If you add the image to the lists, these will disappear from this page. I don't plan to introduce special behavior for one of the many of these pages, so I don't plan to introduce any filtering. Multichill (talk) 20:35, 21 July 2014 (UTC)Reply

False positives edit

This bot has flagged invalid coordinates at the following articles (that I watch):

These are all false positives, probably due to the inclusion of a <!-- comment --> in the coords. All the coords are valid. However, I'll make an adjustment in the syntax so these articles won't trigger the bot any more. — Ipoellet (talk) 16:28, 8 May 2016 (UTC)Reply

Hi Ipoellet − sorry for getting back to you so late. Thanks for reporting this. It was actually fixed not long after your message via phab:rTHERebcd48c5.
Cheers, Jean-Fred (talk) 13:48, 1 October 2016 (UTC)Reply

Excluding images edit

There are a number of images at Wikipedia:WikiProject National Register of Historic Places/Images without refnum that properly do not have or need the Commons NRHP template, but are categorized into an NRHP category. The two maps at the head of that page (File:Albany, New York Map NRHP.png and File:Albany, New York Map NRHP.svg) are perhaps the most visible examples of this. Is there a way to exclude them from being considered by the bot for placement on that page? Magic♪piano 13:42, 18 June 2019 (UTC)Reply

@Magicpiano: For clarity, they’re considered by the bot not because they are in the NHRP category tree, but because they bear the NHRP template. One easy solution could be to remove that template.
An ignore list had been requested before, but I probably do not have spare cycles to implement this any time soon :-/ Jean-Fred (talk) 10:18, 10 May 2020 (UTC)Reply

Is the bot still running edit

It has been two weeks since I last saw an update to https://en.wikipedia.org/wiki/Wikipedia:WikiProject_National_Register_of_Historic_Places/Images_without_refnum Einbierbitte (talk) 03:30, 18 January 2020 (UTC)Reply

Yes it is. But every now and then, it tends to be restoring images that were already tagged with their reference numbers. The most recent edit that it made did just that under a minute ago. ---------User:DanTD (talk) 19:40, 4 March 2020 (UTC)Reply
I think you're running into a phenomenon I've seen before. Clearly the bot is doing (1) data gathering and then (2) generating the new page. You're making edits while it's running, which are not accounted for because they happened after (1). Magic♪piano 20:52, 4 March 2020 (UTC)Reply

May 2020 edit

 
You have been blocked indefinitely from editing for running unapproved bot scripts.
Under the bot policy, all automated scripts must be approved by the Bot Approvals Group to ensure that they perform safe and useful functions without stressing system resources.
If you think there are good reasons for being unblocked, please read the guide to appealing blocks, then add the following text below the block notice on your talk page: {{unblock|reason=Your reason here ~~~~}}.  Primefac (talk) 23:32, 9 May 2020 (UTC)Reply

Add/remove loop edit

Can you please check what is happening here, where the bot cycles through adding and removing images from the list page? https://en.wikipedia.org/w/index.php?title=Wikipedia:WikiProject_Historic_sites/Unused_images_of_heritage_sites_in_South_Africa&action=history Zaian (talk) 11:31, 18 May 2020 (UTC)Reply

@Jean-Frédéric? Zaian (talk) 21:08, 9 June 2020 (UTC)Reply

Something's odd edit

@Jean-Frédéric and Lokal Profil: could you check if Erfgoedbot is running smoothly? It gives suspiciously short galleries suddenly in reports.

For example, here: https://en.wikipedia.org/w/index.php?title=Wikipedia:WikiProject_Historic_sites/Unused_images_of_listed_buildings_in_Scotland&diff=next&oldid=960731283&diffmode=source

it removed for example: https://commons.wikimedia.org/wiki/File:Shore_gate.jpg from its list, even though the image was not edited, and 23372 on https://en.wikipedia.org/wiki/List_of_listed_buildings_in_Crail,_Fife is still without image. I see many similar cases. Same on nlwiki. Thanks for checking. effeietsanders 05:03, 5 June 2020 (UTC)Reply

Hah, just noticed this - so happy, for a tiny little while... effeietsanders 06:02, 5 June 2020 (UTC)Reply

July 2020 edit

 
You have been blocked indefinitely from editing certain pages (Wikipedia:WikiProject National Register of Historic Places/Images without refnum) for persistent WP:NFCC#9 violations.
If you think there are good reasons for being unblocked, please read the guide to appealing blocks, then add the following text below the block notice on your talk page: {{unblock|reason=Your reason here ~~~~}}.  — JJMC89(T·C) 01:58, 15 July 2020 (UTC)Reply
It is clear the bot needs work, not just to fix the issue of including non-free images. It routinely includes in the NRHP image list it generates files that shouldn't be there because they are properly tagged. I repeat the request made above that means be added to the bot to exclude images from consideration, which would fix, or allow for the bypassing of, all of these things. Magic♪piano 02:07, 15 July 2020 (UTC)Reply
There are several pictures that are properly tagged - some several years ago - and it is as if it was invisible to the bot, since it repeatedly places them in the list. Einbierbitte (talk) 17:43, 15 July 2020 (UTC)Reply
Quick question. Is it only that page which is causing blocking-level issues? Because if so we can simply disable the job that updates that list. That means the bot can still go on doing everything else it does which isn't blocking-level broken.
The NFCC#9 violation is a direct result of people adding that image to the lists. It only got removed from there after the block here. The bot will assume images in the lists are allowed to be there. It is in fact not even aware that some images are not on Commons, let alone if they are fair use.
There obviously seems to be something weird going on with the NRHP data. The images without id job has some issues with reporting images despite them carrying the template. We have not been able to determine the reason for why.
For information this bot is currently suffering from some larger issues (which largely manifest by lists emptying completely). Right now neither of the developers working on it have the spare time to investigate the underlying issue further. /Lokal_Profil 21:31, 15 July 2020 (UTC)Reply

@Jean-Frédéric and Lokal Profil: It appears that the issue blocking the bot, NFCC#9, was corrected by taking the image out of the lists. Can you restart the bot? It would help with the cleanup of Wikipedia:WikiProject National Register of Historic Places/Images without refnum. The problems with blanking the page last only about one day, and the issues with the pictures already with templates can be worked around. We have made substantial progress in adding templates to pictures and have done well over half from a backlog of >15,000 to ~6,500 images. If there are more NFCC#9 problems, we can remove them from the lists. Thanks Einbierbitte (talk) 17:32, 24 July 2020 (UTC)Reply

@Einbierbitte: If the bot is unblocked the job should restart automatically within a day.
If you see a pattern for the false positives then let me know and we can se if that issue can be fixed /Lokal_Profil 08:23, 26 July 2020 (UTC)Reply
@Lokal Profil: OK Thanks Einbierbitte (talk) 21:53, 27 July 2020 (UTC)Reply
@JJMC89: Did you see the above request to undo your block? If you want the bot not to edit that page (whose whole purpose is for that bot to edit it... so that's odd) then maybe simply talking with the maintainer and asking them to disable that page makes more sense? Just thinking out loud... effeietsanders 17:28, 2 October 2020 (UTC)Reply
When the operator engages and fixes the code so that the violations cannot happen again, then I'll consider it. — JJMC89(T·C) 00:35, 5 October 2020 (UTC)Reply
@JJMC89: Are you aware that Lokal Profil is in a position to do this? I think he offered above to disable it for certain countries, if that is so desired. I'm a little unclear what else you're looking for. (the way I read the response, the issue was caused by manual edits (using a very different script that is wholly unrelated to Erfgoedbot), not by the bot - but to be fair I'm not entirely sure I fully understand your bug report otherwise) effeietsanders 02:09, 15 October 2020 (UTC)Reply
Something else may have made the bot make the edit, but the operator is still responsible for making the bot comply with policy. I have yet to see any indication that Lokal Profil will update the code accordingly. — JJMC89(T·C) 03:07, 22 October 2020 (UTC)Reply
Honestly. There is no way for us to set up the current bot so that it will not include Fair Use images when someone has manually added these to the lists (incorrectly). However as soon as they are taken of the lists they will also drop of the report page with the next update. The repeated re-adding was a result of the underlying fair use violation not having been addressed in-between updates, because there doesn't seem to be a bot which checks for violations in the article namespace. This issue will be true with any list, not only the NRHP ones. The only solution I see, bar writing a specific bot for en.wp to handle fair use, is to remove all Images without id reports being outputted to en.wp. @Jean-Frédéric: do you see any other solution? /Lokal_Profil 21:23, 10 November 2020 (UTC)Reply
Yes, let’s do so. Either we drop the en.wp reports altogether, or we output these on Wikimedia Commons (where the local uploads will not display). Jean-Fred (talk) 10:14, 11 November 2020 (UTC)Reply

@Jean-Frédéric and Lokal Profil:Can you let WP:NRHP know what you decide so we can continue adding ID numbers to the pictures? Einbierbitte (talk) 13:34, 30 November 2020 (UTC)Reply

ErfgoedBot not removing images and categories edit

@Jean-Frédéric: For a while now, ErfgoedBot hasn't been removing images and categories from Wikipedia:WikiProject National Register of Historic Places/Unused images or Wikipedia:WikiProject National Register of Historic Places/Missing commons category links, and those pages are getting hard to navigate because of all the old entries that are no longer relevant. Would it be possible to fix the bot so it removes entries once they've been added again? TheCatalyst31 ReactionCreation 15:41, 13 June 2023 (UTC)Reply

@TheCatalyst31: Thanks for the ping. I had a look at the logs, and indeed it’s been pretty bad for several months now. Filed phab:T338987 for this. Jean-Fred (talk) 18:20, 13 June 2023 (UTC)Reply

Bot war edit

@Jean-Frédéric: Your bot has been reverting itself three times every day on Wikipedia:WikiProject Historic sites/Unused images of Historic Places in Canada (history) since September 2021. Please investigate. —Cryptic 18:23, 26 October 2023 (UTC)Reply

Thanks for flagging this. Looks like we have three different datasets − `ca-prov`, `ca-fed` and `ca-muni` − with that page as reporting target. Digging into the git history, I see some trick « Canada in English 3 times because of the 3 levels in one source table » which was probably lost when chunking the configuration in three parts. The configuration is clearly broken here, but not really sure what’s the best way forward is... Jean-Fred (talk) 21:37, 26 October 2023 (UTC)Reply