Happy New Year!

Send New Year cheer by adding {{subst:Happy New Year}} to user talk pages.

Iron Law of Oligarchy

You mention this on your userpage. I got indeffed and IP-banned from Conservapedia for a single well-sourced post noting that molality and molarity are respectively absolute and relative measures. If I hadn't bragged about it on RationalWiki, it'd probably still be there... Narky Blert (talk) 18:24, 3 January 2021 (UTC)

I have to say

"As a proof, they told me that the Keep vote of [GreenC which you see in the deletion discussion is done by them." If that's true, it's actually pretty cunning. Gråbergs Gråa Sång (talk) 20:47, 25 January 2021 (UTC)

Eh, my guess is a high-end fraud lawyer is too cunning to fall for it and then post tears of regret publicly on Wikipedia. Story doesn't add up. -- GreenC 20:56, 25 January 2021 (UTC)
That feels probable, sure. Gråbergs Gråa Sång (talk) 20:59, 25 January 2021 (UTC)
  The Wikipedia Motivation Barnstar
You are the true motivator :) Sliekid (talk) 07:16, 27 January 2021 (UTC)

Untitled

Why don't you include the import and export figures for 2020, instead of leaving those for 2017? Or is it because the deficit is much higher? — Preceding unsigned comment added by 151.41.56.71 (talk) 19:25, 28 January 2021 (UTC)

Your Opinion Requested at Michael Shellenberger

Hi GreenC,
You've previously weighed-in on the issues of publicity at Michael Shellenberger. I recently tried to clean said page up and add academic literature to the page, and it seems the page's subject has recently taken umbrage with said revisions. If you have the time, do you mind taking a look at the issues that recently occurred at Talk:Michael Shellenberger? --Hobomok (talk) 20:41, 28 January 2021 (UTC)

Edits reverted

Hey GreenC, thanks so much for reviewing Saket Modi. I noticed you reverted all of my edits although they were written in a neutral tone and were supported by reliable third-party sources. I read WP:LEAD, which you highlighted in your comment, and it says "a lead section should contain no more than four well-composed paragraphs and be carefully sourced as appropriate." and "The lead section should briefly summarize the most important points covered in an article." I am not trying to stuff the lead section but rather updating it and adding an award. It was hardly a one-line addition, and I provided citations to back it up. In addition to this, I had made some small changes to the rest of the article with citations and they were also reverted. You seem to be very well versed in policies and guidelines. I would really appreciate your guidance and help with this. Thanks. 2405:204:C:AD2D:18B4:8F41:A22B:98E2 (talk) 16:05, 1 February 2021 (UTC)

Hi Sir, did you have a chance to look at it?

Scripts++ Newsletter – Issue 20

Signature issue on your comment at AfD

It seems like there was a problem with your signature for your comment (Special:Diff/1004919157/1004929875) on the Jack Schlossberg AfD. It was a good comment and you might want to correct this issue. Cheers! - tucoxn\talk 13:42, 5 February 2021 (UTC)

crowd governance

the external link on your user page no longer works :( i dont want to edit your page, but i was able to enjoy the story at this address: https://web.archive.org/web/20200127053758/http://misrc.umn.edu/wise/2014_Papers/110.pdf have a good one Violarulez (talk) 20:39, 11 February 2021 (UTC)

Thank you, added the archive link. Interesting and non-intuitive story. -- GreenC 20:43, 11 February 2021 (UTC)

Speedy deletion nomination of Category:Esperanto literary awards

 

A tag has been placed on Category:Esperanto literary awards requesting that it be speedily deleted from Wikipedia. This has been done under section C1 of the criteria for speedy deletion, because the category has been empty for seven days or more and is not a disambiguation category, a category redirect, a featured topics category, under discussion at Categories for discussion, or a project category that by its nature may become empty on occasion.

If you think this page should not be deleted for this reason, you may contest the nomination by visiting the page and clicking the button labelled "Contest this speedy deletion". This will give you the opportunity to explain why you believe the page should not be deleted. However, be aware that once a page is tagged for speedy deletion, it may be deleted without delay. Please do not remove the speedy deletion tag from the page yourself, but do not hesitate to add information in line with Wikipedia's policies and guidelines. Liz Read! Talk! 15:58, 13 February 2021 (UTC)

@Liz: I didn't create empty categories 11 years ago, I guess whatever was there has been deleted. -- GreenC 16:09, 13 February 2021 (UTC)

TravelMate URL's

Unfortunately I cannot find the exact reference for this, but back in June 2020 you stopped the bot InternetArchiveBot from archiving links for [1] or possibly a shorter name, as the archived versions do not work - the only reference I can now find is Wikipedia:Australian_Wikipedians'_notice_board/Archive_57#premierpostal.com. I have now encountered a similar problem with Red Cliffs, Victoria, where a link to http://www.travelmate.com.au/MapMaker/MapMaker.asp, which is dead, is being archived, but the archived versions do not work as the JavaScript does not operate. The bot has now for the second time archived this link, on both occasions removing the 'dead link' tag. Can you please again help in stopping this bot archiving links for this URL? Fleet Lists (talk) 02:52, 20 February 2021 (UTC)

Fleet Lists, I believe the correct action is to 'whitelist' the URL which means the bot will always consider it 'alive' and will not try to add an archive. I just did this which should stop IABot. It could still be a problem with any other bot trying to save dead links in the future due to the {{dead link}} tag. If the link is dead and no viable archive it might be better to convert these to {{citation}} without a |url=. -- GreenC 03:32, 20 February 2021 (UTC)
Thank you for your reply and update of the Red Cliffs article. However the bot has now revisited and removed the "dead link" tag. So we are back where we started. How can the URL be "whitelisted"? Fleet Lists (talk) 22:00, 23 February 2021 (UTC)
Now I'm not sure what is happening. For the moment, I added {{cbignore}} which tells the bot to stay off the reference. This is fine, except when there are dozens or 100s of citations, as in this case, as each requires the cbignore. I'm going to ask the developer why the whitelist is not working. -- GreenC 22:45, 23 February 2021 (UTC)
Ah now figured it out: at iabot.org set the URL status to blacklist (not whitelist) and also delete the archive URL from the record. This action can only be done by an administrator. Should be set now. -- GreenC 22:51, 23 February 2021 (UTC)

DYK for George Dinning

 On 21 February 2021, Did you know was updated with a fact from the article George Dinning, which you recently created, substantially expanded, or brought to good article status. The fact was ... that in 1897, former slave George Dinning was the first black man to successfully sue a mob of the Ku Klux Klan? The nomination discussion and review may be seen at Template:Did you know nominations/George Dinning. You are welcome to check how many pageviews the nominated article or articles got while on the front page (here's how, George Dinning), and if they received a combined total of at least 416.7 views per hour (ie, 5,000 views in 12 hours or 10,000 in 24), the hook may be added to the statistics page. Finally, if you know of an interesting fact from another recently created article, then please feel free to suggest it on the Did you know talk page.

 — Amakuru (talk) 00:02, 21 February 2021 (UTC)

Disambiguation link notification for February 22

An automated process has detected that when you recently edited Brian Nelson (literature professor), you added a link pointing to the disambiguation page Swann in Love.

(Opt-out instructions.) --DPL bot (talk) 06:14, 22 February 2021 (UTC)

there's a mess...

... in this edit.

Trappist the monk (talk) 23:02, 2 March 2021 (UTC)

Bug that caused this fixed. -- GreenC 16:36, 11 April 2021 (UTC)

Removing archived urls

Hi! I'm sure you're doing great work, but not all of it seems to be going well. I've already posted at User talk:GreenC bot to ask why your bot removed an archived link from Louise Blouin. Why did you then again remove this archived url with this edit? Why should that url not be archived in case it ever ceases to be accessible in the future? Are you aware that, because of the General Data Protection Regulation, many North American websites block access for users from Europe? And that archive.org in many cases provides a way of restoring that access? Of course, if we have a policy that links should not be archived unless unavoidably necessary, do please point me to it. Otherwise, can you unconditionally guarantee that neither you nor your bot will again remove a working archived link from Wikipedia? And that you will, as a matter of priority, identify and repair any instance where either you or the bot has done so in the past? Thanks, Justlettersandnumbers (talk) 22:23, 13 March 2021 (UTC)

We don't use archives with the intention of bypassing policy blocks, be it a paywall or government regulation; that is not what archives are meant for, and there is no community consensus for that. There is no problem adding archive URLs as a precaution against link rot, but in this instance it was added directly into the URL with no citation template or {{webarchive}}, thus in effect making the live URL inaccessible - literally deleting it. Now, the bot in this case was doing a URL move of observer.com because a user requested it - changing a dead URL to a live URL (there was a change in schemes at observer.com). During URL moves it does preserve the archive but only if there is a citation template or {{webarchive}}. I probably could add a feature to add a new {{webarchive}} when it's a square URL with an archive in order to preserve the archive. -- GreenC 22:48, 13 March 2021 (UTC)
The archived link leads directly to the actual source cited when the content was written (see WP:Text-source integrity). That content may have been changed or completely removed from more recent versions of the external page. There is no obligation that I'm aware of to cite a current link to a page if we already have an archived link; nor is there any obligation to use citation templates or webarchive templates (WP:CITEVAR). Anyway, would you kindly either point me to community consensus that a working archived link may be removed without discussion or unconditionally guarantee that neither you nor your bot will again remove a working archived link from Wikipedia, and that you will, as a matter of priority, identify and repair any instance where either you or the bot has done so in the past? Thank you, Justlettersandnumbers (talk) 11:32, 14 March 2021 (UTC)
I already added the feature. I'll take a look about readding old ones. -- GreenC 13:53, 14 March 2021 (UTC)

Incorrect IABot's edit summary in Russian

The current summary "Добавьте № книги для Википедия:Проверяемость" makes no sense in Russian. A correct summary could be "Добавление ссылок на электронные версии книг" or "Добавление ссылок на электронные версии № (plural|книги|книг)". MBH (talk) 14:10, 24 March 2021 (UTC)

@MBH: I don't know which is better so I did the first one. Thank you very much. -- GreenC 14:31, 24 March 2021 (UTC)
Also I advise you not to use machine translation for translating bot messages into any languages you don't know. Maybe machine translation between the big Romance and Germanic languages is not very bad, but machine translation from English to Russian is always terrible due to the big difference in the languages' structure. MBH (talk) 14:42, 24 March 2021 (UTC)

Nomination for deletion

An article you created or have significantly contributed to has been nominated for deletion. The article is being discussed at the deletion discussion, located here. North America1000 11:41, 1 April 2021 (UTC)

Backlinks?

Hi GreenC! I'm enjoying using the Backlinks functionality - it's been about a year now. I didn't receive any emails today - did your process stop for April Fools' Day?  :-) Thanks! GoingBatty (talk) 13:48, 1 April 2021 (UTC)

It's not that clever :) I checked the logs and it appears to have run and sent emails, the data looks normal. I just sent you a test email from the server can you verify it came through? -- GreenC 15:09, 1 April 2021 (UTC)
I did not receive the test email, and have received emails from other senders. @Certes: Did you receive the Backlinks emails today? GoingBatty (talk) 16:03, 1 April 2021 (UTC)
Hmm strange. Certes is using a new system that posts results online instead of email. Do you want to use that instead? For example:
Config page: https://en.wikipedia.org/w/index.php?title=User:Certes/Backlinks
Data page: User:Certes/Backlinks/Report
Otherwise I can try to debug why emails are not coming through. -- GreenC 16:08, 1 April 2021 (UTC)
Yes, I'm interested in having the results posted online instead. I've created User:GoingBatty/Backlinks/Report. For User:GoingBatty/stopbutton, when stopped, does this mean that results are queued on your side, and then all posted once we set Action=RUN again? If so, I'm interested in using that on the days when I'm away from my computer. Thanks! GoingBatty (talk) 16:38, 1 April 2021 (UTC)
Just ran it, and it worked. I forgot to adjust the filters you wanted to keep out Template, Project and some others; those will be in effect next run. The stop button is a hard stop; the program does not cache results. It is useful for extended periods of being disabled. For random days, I recommend viewing the page history, which serves as a cache of prior runs. -- GreenC 19:13, 1 April 2021 (UTC)
There were quite a few links to be fixed in the Template, Project and other spaces, so feel free to keep those coming. Thanks! GoingBatty (talk) 00:53, 2 April 2021 (UTC)
You now have everything except these:
(^Talk:|^Wikipedia:|^Wikipedia talk:|^Template talk:|^Portal talk:|^User:|^User talk:|^File talk:|^MediaWiki:|^MediaWiki talk:|^Help:|^Help talk:|^Category talk:|^Book:|^Book talk:|^Draft:|^Draft talk:|^TimedText:|^TimedText talk:|^Module talk:)
-- GreenC 01:32, 2 April 2021 (UTC)
My Backlinks appeared on the data page as usual at 10:47 UTC today. It has failed to appear a couple of times over the last few months, but worked fine today. I asked to stop receiving Backlinks by e-mail, as my long list produced lots of e-mails. If I'm away for a few days I'll just catch up using the page history. Certes (talk) 23:47, 1 April 2021 (UTC)
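
As an illustration of the exclusion pattern GreenC lists above, here is a minimal Python sketch that applies it to a list of page titles. The pattern is copied from the thread; the function name and the idea of filtering a plain list of titles are assumptions for illustration, since the thread does not show how the Backlinks tool applies the filter.

```python
import re

# Exclusion pattern as posted in the thread: titles starting with any of
# these namespace prefixes are left out of the report.
EXCLUDED_NAMESPACES = re.compile(
    r"(^Talk:|^Wikipedia:|^Wikipedia talk:|^Template talk:|^Portal talk:"
    r"|^User:|^User talk:|^File talk:|^MediaWiki:|^MediaWiki talk:"
    r"|^Help:|^Help talk:|^Category talk:|^Book:|^Book talk:"
    r"|^Draft:|^Draft talk:|^TimedText:|^TimedText talk:|^Module talk:)"
)


def keep_titles(titles):
    """Return the titles that do not match the exclusion pattern.

    Hypothetical helper: the real tool may filter differently.
    """
    return [title for title in titles if not EXCLUDED_NAMESPACES.search(title)]
```

With this pattern, an article title such as "London" and a page such as "Template:Infobox" pass through, while "Talk:London" or "User talk:Example" are dropped.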

IABot bug - "blocked: You have been blocked from editing." despite not being blocked

Hello! I think phab:T274050 is back to bug us again. I'm getting a "blocked: You have been blocked from editing." error when trying to analyse & edit pages despite not being blocked. I can't seem to make the tool report on the exact API message it's getting (e.g. to see if an autoblock of a Toolforge IP is to blame), could you have a look? Thanks! ƒirefly ( t · c ) 15:36, 3 April 2021 (UTC)

Pages using duplicate arguments in template calls

is it possible to remove User:GreenC/test from Category:Pages using duplicate arguments in template calls (easier to see the actual problems when there aren't user pages in there)? thank you. Frietjes (talk) 16:22, 11 April 2021 (UTC)

Done. -- GreenC 16:33, 11 April 2021 (UTC)

Bot functionality request

Hi GreenC, nice to meet you. I found you trawling through the bot status report (User:MajavahBot/Bot status report). I was wondering if I could interest you or request a relatively simple bot task? That task is: periodically go through the entries in this category: Category:Peer review requests not opened.

For each peer review talk page there will be a template like {{Peer review|archive=X}}. There should be a corresponding peer review page called Wikipedia:Peer review/PAGENAME/archiveX, but about once a week someone starts the process but doesn't actually create the page, so the template just hangs there. It would be very useful for a bot to remove the template if the peer review wasn't started for, like, a week after the template was placed, as that probably means no review page will be created.

I've had some problems with single functionality bots before so I thought I might ask you because your bot seems unlikely to randomly become inactive :P. Crossing my fingers, Tom (LT) (talk) 10:28, 12 April 2021 (UTC)

Hi Tom (LT) - I can help with this, though it would be a standalone bot, running on Toolforge from cron, i.e. servers maintained by Wikimedia in their datacenter, with code accessible to anyone with a Toolforge account. I think once a day it could retrieve the list of page names in the tracking category, along with today's date, and add it to a text file in two columns (page name|added (i.e. today's) date). If the page name is already in the text file, don't add it again, but check if it has been more than 7 days since the added date. If so, verify there is a Peer review archive page and if not then remove the Peer review template, and remove the page name from the text file. Likewise if the page name is in the text file but not in the tracking category then remove the page name from the file. Sound good? -- GreenC 02:19, 14 April 2021 (UTC)
That would be wonderful. It is just one of those small thankless tasks that a bot could do, so I'm very appreciative of this. There are a couple of similar tasks lying around, would it be possible to pester you in the future if something similar arises? Tom (LT) (talk) 07:48, 14 April 2021 (UTC)
Alright. Hopefully will get to it this week. It depends on the task how complicated, and how busy I am at the time. There is also BOTREQ. BTW I will need to send this through BRFA which sometimes can take forever but see no trouble in approval given its simplicity and non-controversial nature. -- GreenC 15:30, 14 April 2021 (UTC)
User:GreenC bot/Job 20 & Wikipedia:Bots/Requests for approval/GreenC bot 20 -- GreenC 03:29, 15 April 2021 (UTC)
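
The daily bookkeeping GreenC outlines above can be sketched roughly as follows. This is only an illustration under stated assumptions: the tracking file is modelled as an in-memory dict, `has_review_page` is a hypothetical callback standing in for the check that a Wikipedia:Peer review/PAGENAME/archiveX page exists, and keeping an entry whose review page does exist is a guess, since the thread does not say what happens in that case. It is not the code of the approved bot.

```python
from datetime import date


def update_tracking(tracked, in_category, today, has_review_page, grace_days=7):
    """One daily pass over the tracking category.

    tracked:         dict mapping page name -> date first seen (the "text file")
    in_category:     page names currently in the tracking category
    has_review_page: hypothetical callback, True if the review page exists
    Returns (new_tracked, to_untag), where to_untag lists pages whose
    Peer review template should be removed.
    """
    new_tracked = {}
    to_untag = []
    for page in in_category:
        added = tracked.get(page)
        if added is None:
            # First sighting: record today's date.
            new_tracked[page] = today
        elif (today - added).days <= grace_days:
            # Still within the grace period: keep waiting.
            new_tracked[page] = added
        elif has_review_page(page):
            # Assumption: a review page exists, so leave the template alone.
            new_tracked[page] = added
        else:
            # More than grace_days old and no review page: remove the
            # template and drop the page from the file.
            to_untag.append(page)
    # Pages in the file but no longer in the category are dropped simply by
    # not being copied into new_tracked.
    return new_tracked, to_untag
```

Passing the existence check in as a callback keeps the date logic testable without any network access.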

When you have a moment

Hello Green C. I hope you are well. I asked for a run from Template:Cleanup bare URLs/bot last night that it still hasn't processed. You may already be aware of this but I wanted to let you know just in case. My year and a half long infobox person cleanup project is almost finished so I will have time to use this bot again. Cheers. MarnetteD|Talk 22:24, 17 April 2021 (UTC)

Hello MarnetteD, there was a stuck/zombie process on one of the Toolforge grid computers blocking the spawning of new processes. That can happen, it's beyond my control to prevent but easily fixed by killing the process (done). If by chance it ever happens again and I am not around for a while, you can request help at Village Pump Technical who will point you to the right place (probably a Phab ticket), the stuck process will be called "tagbot.awk". Last resort waiting for the computer to reboot every couple months would also clear it. You take on big projects :) This one is probably infinite but every change is a huge help. -- GreenC 02:46, 18 April 2021 (UTC)
You said it :-) Thanks for the info and the fix! MarnetteD|Talk 02:55, 18 April 2021 (UTC)

MarnetteD, looks like it zombied again. If it keeps happening I might need to make another program that monitors for stuck processes. -- GreenC 17:57, 26 April 2021 (UTC)

I'm glad you noticed. I was waiting a bit to see if it would kick in. It is hard to say when this problem crept up since it wasn't getting used as regularly in the last year or so. Thanks for the update. MarnetteD|Talk 18:12, 26 April 2021 (UTC)

InternetArchiveBot in esWiki

Hi, GreenC. Thanks for taking care of this. Can you assure me that, in addition to fixing the duplicates, the bot won't perform inconsequential edits like this (it's difficult to find, it's just an added space)? That's the other half of the complaint. If that's so, I'll lift the block. Thanks. --Angus (talk) 22:07, 18 April 2021 (UTC)

This bot is small and purpose-built, so it shouldn't make empty edits. With bigger bots that can happen, as they are doing many functions adding and deleting text. -- GreenC 23:28, 18 April 2021 (UTC)
Sorry I misunderstood, you mean ensure IABot does not (was thinking the smaller fixer bot). I contacted Cyberpower678, this should be an easy bug to detect and avoid by removing all whitespace from the original and new article, compare the two strings and if they are equal abort the edit. -- GreenC 00:19, 19 April 2021 (UTC)
GreenC, yes, this will be corrected. I should have a fix for this ready fairly quick. —CYBERPOWER (Message) 02:51, 19 April 2021 (UTC)
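
The check GreenC suggests above (strip all whitespace from both versions, compare, and abort if they are equal) can be sketched in a few lines of Python. The function name is hypothetical and this is not IABot's actual fix, only an illustration of the comparison.

```python
def is_whitespace_only_change(original, revised):
    """Return True if the two texts differ only in whitespace.

    Removes every whitespace character (spaces, tabs, newlines) from both
    strings before comparing, as described in the thread.
    """
    def strip_all(text):
        return "".join(text.split())

    return strip_all(original) == strip_all(revised)
```

A caller would skip saving the page when this returns True. Note that the comparison is deliberately blunt: it also treats a space removed from inside a sentence as a non-change, which is the behaviour the suggestion describes.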

Hi guys, thanks for your cooperation. I unblocked the bot. --Angus (talk) 12:44, 19 April 2021 (UTC) cc user:cyberpower678

Hi Angus, could you recommend wording for a Spanish edit summary equivalent to "Fixing 1 redundant {{wayback}}" and "Fixing 2 redundant {{wayback}}" (plural)? Will also need "Fixing 1 redundant archiveurl/urlarchivo argument" and "Fixing 2 redundant archiveurl/urlarchivo arguments". I've learned not to use Google Translate or guess but ask a native speaker. Thank you! -- GreenC 14:41, 19 April 2021 (UTC)
Here:
  • Arreglo {{wayback}} redundante
  • Arreglo 2 {{wayback}} redundantes
  • Arreglo argumento urlarchivo/archiveurl redundante
  • Arreglo 2 argumentos urlarchivo/archiveurl redundantes
--Angus (talk) 14:52, 19 April 2021 (UTC)
Angus, btw, the bot has a run page so it doesn't need to be blocked to stop it. You can find the run page at https://iabot.toolforge.org/index.php?page=runpages&wiki=eswiki CYBERPOWER (Around) 16:27, 19 April 2021 (UTC)
Cyberpower678, unfortunately the "IABot Management Console" wants me to give it unnecessary access to private information, like my email address and who knows what else, before it will show me that page. So it remains inaccessible to me. --Angus (talk) 16:49, 19 April 2021 (UTC)
Angus, as the designer of the bot and the UI I can assure that not only is your email address not saved anywhere unless you explicitly tell the tool to, your email address is not ever passed to the tool on authorization. I have no idea why it says that. All you are giving the tool is your username and public accessible data like your registration date, permissions, and block status. —CYBERPOWER (Chat) 17:05, 19 April 2021 (UTC)
Angus, user privacy is taken very seriously and is never leaked. Private data is only stored with the users’ permission and critical data is encrypted to prevent unauthorized access. —CYBERPOWER (Chat) 17:06, 19 April 2021 (UTC)

Cyberpower678, it's ok, no worries. Maybe the Mediawiki API (or whatever) should be changed so it doesn't request unneeded data...

GreenC, thanks man! Sorry I wasn't there when needed, I'm glad things are fixed now! --Angus (talk) 22:54, 20 April 2021 (UTC)

esWiki

Hi, I'm not sure if this is the right place to report this bug, but InternetArchiveBot duplicated two articles on esWiki while trying to fix a redundant archive. The first one is es:Anthem Sports (a duplicate of es:Anthem Sports & Entertainment) and the second one is es:Heckler (a duplicate of es:Heckler & Koch MP5). I think these are the only cases so far ([2]). --Soulreaper (talk) 15:08, 20 April 2021 (UTC)

Yes I am aware of this bug in the code and fixed it and had already redirected Heckler but was not aware of Anthem, now also redirected. If you think they should be deleted instead I'll start that process. -- GreenC 16:34, 20 April 2021 (UTC)

GreenC Bot

What does GreenC Bot do? Cookersweet (talk) 11:44, 22 April 2021 (UTC)

Thanks for helping out at peer review!

  The Peer Review Barnstar
For your very helpful bot-related contributions to Wikipedia peer review, I present to you the peer review barnstar. Nice work! Tom (LT) (talk) 07:11, 5 May 2021 (UTC)

No problem! At this rate it will be longest trial period for 25 edits in history :) -- GreenC 01:15, 6 May 2021 (UTC)

Could I interest you in some more...

Could I interest you in one more peer review related task...? (Wikipedia:Bot_requests#Bot_to_repair_broken_peer_review_links)

Summary: a nearly completed bot exists but the owner went away. Old peer reviews didn't contain a fixed link to the peer review page, which means over time as pages are moved, the links get broken. The bot was designed to fix those links. There was one outstanding issue which was that sometimes it would include a link twice in the output. Once that happens it can fix the rest of the 680 outstanding broken links. Would I be able to interest you in picking up and finishing this task...? :D Tom (LT) (talk) 01:31, 9 May 2021 (UTC)

Tom, do you know if the source is available somewhere that I could take a look? -- GreenC 16:59, 9 May 2021 (UTC)
Ah, looks like the owner has resurfaced and there is a new bot RfA in the works (Wikipedia:Bots/Requests for approval/AWMBot 2). Hurray, and ignore my request! Tom (LT) (talk) 04:08, 10 May 2021 (UTC)
Ok good! -- GreenC 15:22, 10 May 2021 (UTC)

Transclusion of deleted template

I've nowiki'ed a transclusion of the now-baleeted {{Wayback}} from a subpage in your userspace, but I will let you know here for the sake of visibility (since I don't know if you're going to see an edit on some random userspace page). jp×g 17:11, 17 May 2021 (UTC)

@JPxG: thank you, I just pre'd the whole page for now. -- GreenC 18:21, 17 May 2021 (UTC)

Wikipedia:Link rot/Templates

Just wanted to tell you about a project I've started recently. Wikipedia:Link rot/Templates is intended to list all our external link templates on one page along with the status of the links to more quickly catch when links go down. I hope to get all templates with over 1000 transclusions on there within a few weeks.

If it would be possible to have a bot assisting with detection of dead links that would be great. If the links were checked to be working weekly by bot that would make the page a lot more useful. Is that plausible or not? I'm sadly completely out of my depth with that kind of bot and can not answer even simple questions like that on my own. --Trialpears (talk) 23:24, 22 May 2021 (UTC)

Disambiguation link notification for May 25

An automated process has detected that when you recently edited Lionel Terray, you added a link pointing to the disambiguation page Mount Huntington.

(Opt-out instructions.) --DPL bot (talk) 06:01, 25 May 2021 (UTC)

Dead link

Hi GreenC, I noticed you marked a link I added as dead. Thanks for pinging me. I added it today, and just checked again, and the link is definitely not dead. ― Tartan357 Talk 21:46, 7 June 2021 (UTC)

Never mind, I figured it out. The link is uniquely-generated and has a short expiration. I'll just link to the index. ― Tartan357 Talk 22:06, 7 June 2021 (UTC)

User:GoingBatty/Backlinks/Report

Hi GreenC! I've been enjoying the daily updates posted at User:GoingBatty/Backlinks/Report and fixing the appropriate articles. I noticed that the bot didn't post an update today. Could you please check on it? Thanks! GoingBatty (talk) 01:55, 10 June 2021 (UTC)

It ran and generated the table, which it keeps on hand, but it didn't post for some reason. Maybe a network transient? I just posted it manually. Good thing you asked as it only keeps it for up to the next batch run. -- GreenC 02:06, 10 June 2021 (UTC)
Thank you for the manual list. The bot worked fine today, as usual. Happy editing! GoingBatty (talk) 22:37, 10 June 2021 (UTC)
Hi again! Unfortunately, your bot did not post a new version of User:GoingBatty/Backlinks/Report today. Could you please check on it? Thanks! GoingBatty (talk) 13:57, 6 July 2021 (UTC)

OK, just added a loop; it will try 10 times with 30-second pauses to account for timeouts. After 10 failures it will email me. I believe this will solve it. -- GreenC 15:33, 6 July 2021 (UTC)
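
A retry loop of the kind described above might look like the following Python sketch. The names `post`, `notify` and `sleep` are hypothetical parameters introduced for illustration (the real bot's language and function names are not given in this thread); `post` is assumed to raise an exception on failure.

```python
import time


def post_with_retries(post, notify, attempts=10, pause_seconds=30, sleep=time.sleep):
    """Call post() up to `attempts` times, pausing between failures.

    post:   hypothetical callable that uploads the report and raises on failure
    notify: hypothetical callable invoked with a message if every attempt
            fails (standing in for "email me")
    sleep:  injectable so the pause can be stubbed out in tests
    Returns True on success, False if all attempts failed.
    """
    for attempt in range(1, attempts + 1):
        try:
            post()
            return True
        except Exception:
            # Pause before the next try, but not after the final failure.
            if attempt < attempts:
                sleep(pause_seconds)
    notify(f"posting failed after {attempts} attempts")
    return False
```

Making the pause and the notification injectable is a design choice for this sketch only; it lets the loop be exercised without waiting five minutes or sending mail.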

Thank you for manually posting an update for today, but that update contains many items not on User:GoingBatty/Backlinks. Did you accidentally provide me someone else's list? Thanks! GoingBatty (talk) 16:56, 6 July 2021 (UTC)
Oi, that's my list! Certes (talk) 17:31, 6 July 2021 (UTC)
lol yeah sorry about that the procs are called "bw" and "bw2" on the server and I got confused which is GoingBatty (bw) and Certes (bw2). Should be corrected now. -- GreenC 17:34, 6 July 2021 (UTC)

Shadows Commons bot

User:GreenC bot/Job 10 hasn't tagged anything as {{ShadowsCommons}} since 4 May. The bot page says it uses Quarry 18894 but that query doesn't work due to [3]. It seems unlikely that absolutely nothing shadowed Commons after 4 May. — Alexis Jazz (talk or ping me) 22:16, 24 June 2021 (UTC)

@Alexis Jazz: Ah. Shoot. It was discussed in this Phab a while back and the WMF sysadmins didn't come up with a viable alternative. I just posted an alternative idea but it would take some time to develop, assuming it can even be made to work. The basic issue is that Commons has 60+ million titles and downloading that list takes a very long time, meanwhile ShadowBot needs to run daily. So my idea was to break the problem down into sub-lists; and scrap using database queries which can't deal with this problem effectively. It's an ugly problem. -- GreenC 00:02, 25 June 2021 (UTC)
GreenC, MGA73, how do I get a list of files on enwiki? As in, without the local description pages. I already have a list that includes those. It seems technically SELECT img_name FROM image should work, but it looks like it'll take about half an hour? — Alexis Jazz (talk or ping me) 09:51, 25 June 2021 (UTC)
@Alexis Jazz: I have no good solution atm. It seems that it does take a long time to run a quarry. --MGA73 (talk) 11:21, 25 June 2021 (UTC)
MGA73, 1298.78 seconds to return 894832 rows to be exact. — Alexis Jazz (talk or ping me) 11:31, 25 June 2021 (UTC)
MGA73, User:Alexis Reggae/The Real Slim ShadyCommons something something — Alexis Jazz (talk or ping me) 14:24, 25 June 2021 (UTC)
Alexis Jazz hi, sorry, not sure what we are looking at, were you able to devise a working query? -- GreenC 18:49, 25 June 2021 (UTC)
Not really, just wanted to get a list once to prevent the backlog from getting bigger. It actually returned more than expected, but many problem files so I'm keeping the list. The Real Slim ShadyCommons is essentially a list of files on enwiki (SELECT img_name FROM image) that shadow a Commons file (index from dump) or redirect and don't have the {{keeplocal}} template. It includes what the bot would have tagged, but also much more, so it's not as simple as this. — Alexis Jazz (talk or ping me) 19:00, 25 June 2021 (UTC)

Who cares?

Moved to Talk:Christopher_C._Horner#Who_cares?

Reuters

Moved to WP:URLREQ#Reuters (again)

Backlinks and common words

I'm thinking of adding selected common words (maybe 20) to my Backlinks list. Of course, a search for "The" would match almost everything and time out, but a search for linksto:"The" is fast. Would these additions be safe, or would they slow things down in an antisocial way? I already have A listed (to catch jokers who link each letter), so you could check whether that runs noticeably slower than less common words. Certes (talk) 01:06, 15 July 2021 (UTC)

Certes, top 50 by size from your list

3449577	china    
2551179	London    
925721	Boston    
899215	Sydney    
822928	National Football League  
787712	Jazz    
742641	Melbourne    
637715	The Daily Telegraph  
602020	Luxembourg    
545562	Athens    
470567	Manchester    
417654	Liverpool    
414820	Birmingham    
358707	Perth    
298299	Naples    
287433	Edmonton    
284325	Hollywood    
262800	New Brunswick   
244875	surrey    
244846	Surrey    
242423	Oxford    
241038	guinea    
227722	Havana    
224517	Blues    
224517	blues    
219266	Wellington    
208890	National League   
198799	Oxygen    
197470	Butterfly    
197470	butterfly    
197311	Cambridge    
197190	Norfolk    
194592	Hyderabad    
191996	Christchurch    
190041	ABC News   
183295	The Observer   
182475	Country    
182475	country    
179265	Madonna    
179265	madonna    
174391	Alexandria    
164651	Portsmouth    
160175	Sculpture    
160175	sculpture    
143481	The Sunday Times  
139840	York    
138493	Hanover    
135255	Company    
135255	company    
134502	Stream    

That's file size in bytes (each file contains a list of article names), but it gives a relative sense of which ones are the largest. The system was never designed with this many in mind but it seems to be holding up fine. One reason it might have trouble is if it takes > 24 hrs to run, at which point we increase the time period between runs. The last run took 2.5 hours so you're 10% of the way there ;) Or if the number of links is in the millions, like the {{cite web}} template, but it's hard to imagine linked terms much more common than china or london. Gandhi? Jesus? China has 133,717 backlinks. 'The' has fewer than a thousand. -- GreenC 02:31, 15 July 2021 (UTC)

Thanks for the list. I'd already removed London, Boston, Sydney, Melbourne, National Football League, Luxembourg, Manchester and others for producing too many false positives. I removed Jazz and Athens last night, so that's most of the top ten gone. china (lower case for pottery) can also go now as it appears rarely. Of the top ten, that just leaves The Daily Telegraph. I still get plenty of links for The Daily Telegraph (Sydney) in semi-automated citations; I think it's linked to the wrong WP article in Trove. However, it would be perfect for a variant which limits the search to articles which also mention Australia (or perhaps NSW, Brisbane, etc.) Here's another Telegraph error today: David Storey (politician). Certes (talk) 13:00, 15 July 2021 (UTC)
User:Certes, you may already know about this, but in case thought you might be interested in Zipf's law (second paragraph of lead section). External links has a Zipf's list for English. They might contain frequent disambiguation problems. -- GreenC 17:15, 19 July 2021 (UTC)
I vaguely remember Zipf's law but it had slipped my mind. I just checked a couple of lists and the only word I'd missed was "information", which seems a legitimate enough target for the false positives to dominate the errors. I've deliberately omitted words such as Be, which are or redirect to dabs and will be picked up by WikiProject Disambiguation. The words are now being checked: just one today; no false positives. Certes (talk) 17:39, 19 July 2021 (UTC)