Open main menu

This is a page for requesting tasks to be done by bots per the bot policy. This is an appropriate place to put ideas for uncontroversial bot tasks, to get early feedback on ideas for bot tasks (controversial or not), and to seek bot operators for bot tasks. Consensus-building discussions requiring large community input (such as request for comments) should normally be held at WP:VPPROP or other relevant pages (such as a WikiProject's talk page).

You can check the "Commonly Requested Bots" box above to see if a suitable bot already exists for the task you have in mind. If you have a question about a particular bot, contact the bot operator directly via their talk page or the bot's talk page. If a bot is acting improperly, follow the guidance outlined in WP:BOTISSUE. For broader issues and general discussion about bots, see the bot noticeboard.

Before making a request, please see the list of frequently denied bots, either because they are too complicated to program, or do not have consensus from the Wikipedia community. If you are requesting that a template (such as a WikiProject banner) is added to all pages in a particular category, please be careful to check the category tree for any unwanted subcategories. It is best to give a complete list of categories that should be worked through individually, rather than one category to be analyzed recursively (see example difference).

Note to bot operators: The {{BOTREQ}} template can be used to give common responses, and make it easier to keep track of the task's current status. If you complete a request, note that you did with {{BOTREQ|done}}, and archive the request after a few days (WP:1CA is useful here).


Please add your bot requests to the bottom of this page.
Make a new request


Populate Selected Anniversaries with Jewish (and possibly Muslim) HolidaysEdit

Right now there is a manual process to go year by year to the last year's date and remove the Jewish holiday and then find the correct Gregorian date for this year and put it in. This is because Jewish and Muslim holidays are not based on the Gregorian calendar. There is a website, http://www.hebcal.org for Jewish holidays that lists all the Gregorian dates for the respective Jewish holidays. I suggest a bot take that list and update the appropriate dates for the current/next year. Sir Joseph (talk) 19:29, 20 July 2018 (UTC)

Which articles are we talking about? Enterprisey (talk!) 20:15, 29 July 2018 (UTC)
Not articles, the Selected Anniversaries pages which then lands up on the main page. For example, Wikipedia:Selected_anniversaries/July_22 has a Jewish holy day for 2018 but in 2017 and 2019.... that day falls on a different date. So the idea is to get a list of days that are worthy of being listed and then go to the page for the prior year and delete the entry and then add the entry for the current year. Sir Joseph (talk) 23:38, 29 July 2018 (UTC)

Hello @Sir Joseph and Howcheng: - I'm thinking about and investigating this as a possible project, if you are still interested. It would help which Jewish/Israeli holidays are tracked on Selected Anniversary pages? -- GreenC 15:25, 22 August 2018 (UTC)

@GreenC:, yes and thank you for your interest. I know that hebcal is open source and has a data feed, but not sure how useful it is. My thinking is to create a subpage listing "Holidays to be posted on the Front Page" and then the bot goes to this year or last year and updates it. I'm sure you might have a better way to do it. Sir Joseph (talk) 15:44, 22 August 2018 (UTC)
http://www.hebcal.org appears to be expired now a click-farm. What I would need is a list of holidays, then I can see what resources are available. -- GreenC 16:22, 22 August 2018 (UTC)
my huge mistake, I meant hebcal.com Sir Joseph (talk) 16:25, 22 August 2018 (UTC)
Great, I'll take a look. There is also Enrico Service 2.0 which is pretty good for an API but it's missing some holidays thus I would need to know which ones are needed before deciding on a data source. -- GreenC 16:30, 22 August 2018 (UTC)
I'd go with the hebcal integration. Many of the holidays posted to the front page are not national holidays so wouldn't be in enrico.Sir Joseph (talk) 16:36, 22 August 2018 (UTC)
You are right it is better. Would need to know which holidays. What happens with multi-day holidays like Chanukah, is an entry made in every Selected Anniversaries page (for 8 days I think?) or just the first day? -- GreenC 16:57, 22 August 2018 (UTC)
@Howcheng: would be able to confirm, but I think only one day is listed, except perhaps for Rosh Hashana. I will see if I can pull together a list of holidays unless Howcheng has one that he uses, this would be my own list and not what has happened in the past. Sir Joseph (talk) 17:11, 22 August 2018 (UTC)
Actually, I was thinking NOT of SA/OTD, but to update the holiday articles themselves. If we can make sure all the articles have holiday infoboxes in them, then the bot can add the date20XX= parameters. Including them in OTD is a manual process because the articles have to be checked to make sure the quality is still good enough for inclusion. howcheng {chat} 17:19, 22 August 2018 (UTC)
I'm OK either way, but wouldn't it be easier for you to not have to search for a holiday, but know a few days in advance as you do to check the SA and then see if that article is good enough? Sir Joseph (talk) 17:58, 22 August 2018 (UTC)
It's a bot, so there's no reason it can't do both, I suppose. howcheng {chat} 07:20, 23 August 2018 (UTC)

After some reflection, I would suggest the way to display Jewish and other moveable holiday dates is with a Lua template. The data could be in an enwiki hosted JSON file during development, later imported into Wikidata once established to share with other projects. The data can be downloaded from hebcal.com, generated with a script using mathematical algos or otherwise manually entered. It only needs to be done one time up to 2050 or 2100 whatever makes sense. It can be for any holiday that uses a non-Gregorian calendar, for translating to Gregorian. For example {{holigreg |holiday=Hanukkah |date=2018 |df=mdy}} would produce December 2, 2018 - December 10, 2018. There can be other options to control display output. It might also have |date=CURRENTYEAR so it's always up to date for use in infoboxes. And support |date=CURRENTYEAR+1 to display dates previous and after. Another advantage is other tools and bots can use the database; so for example if a bot was written to update the SA, it could key off the JSON or Wikidata database in a consistent manner for Jewish, Muslim and I assume there are other non-Gregorian holidays. -- GreenC 13:13, 23 August 2018 (UTC)

Thanks, that sounds like a plan, or I think I understand that it sounds like a plan. I know hebcal has a JSON here: [1] Let me know if you need anything from my end. I am sure we're going to need spelling equalization, etc. Sir Joseph (talk) 13:28, 23 August 2018 (UTC)

This is a really good idea. Just make sure that someone checks which "day" we tend to mark. Jewish festivals begin in the evening. I think we usually note the occurrence on the 'main day', not the day the festival begins, but there may be exceptions. --Dweller (talk) Become old fashioned! 13:51, 23 August 2018 (UTC)

This would also be a good approach to take for other moveable holidays, like Easter and its related holidays (Fat Tuesday, Ash Wednesday, etc), and ones based on the Chinese calendar as well. howcheng {chat} 16:02, 23 August 2018 (UTC)

We should create a page with a table similar to this: User:Sir_Joseph/sandbox and that can be populated by the bot. Sir Joseph (talk) 16:11, 23 August 2018 (UTC)

New template is created. Follow-up at Template_talk:Calendar_date#Template. -- GreenC 19:02, 24 August 2018 (UTC)

Mass category change for location userboxes listsEdit

I created new category Category:Lists of location userboxes to better organize userboxes. I would like to change (and add where it's missing) all entries of

[[Category:Lists of userboxes|.*]] 

to

[[Category:Lists of location userboxes|{{subst:SUBPAGENAME}}]]

in all subpages of following pages:

  1. WP:Userboxes/Life/Citizenship
  2. WP:Userboxes/Life/Origin
  3. WP:Userboxes/Life/Residence
  4. WP:Userboxes/Location
  5. WP:Userboxes/Travel

There are some exceptions, like WP:Userboxes/Location/United States/Cities should be [[Category:Lists of location userboxes|United States]], and not, [[Category:Lists of location userboxes|Cities]]. It would be nice if bot could distinguish such subpages (with titles equal to Cities, Regions, States, Nations), but it would be OK if it didn't — there is only a handful of such subpages — they can be updated later manually.

—⁠andrybak (talk) 12:40, 20 August 2018 (UTC)

It appears that only /Location has subpages, although /Travel has /Travel-2 and /Travel-3, which are a sort of parallel page that might be suitable for this categorization. I might have missed other pages that are not technically subpages. – Jonesey95 (talk) 16:37, 20 August 2018 (UTC)
Thanks. I guess going by page prefix would be better:
  1. Special:PrefixIndex/Wikipedia:Userboxes/Life/Citizenship
  2. Special:PrefixIndex/Wikipedia:Userboxes/Life/Origin
  3. Special:PrefixIndex/Wikipedia:Userboxes/Life/Residence
  4. Special:PrefixIndex/Wikipedia:Userboxes/Location
  5. Special:PrefixIndex/Wikipedia:Userboxes/Travel
—⁠andrybak (talk) 17:03, 20 August 2018 (UTC)

[r] → [ɾ] in IPA for SpanishEdit

(reviving) A consensus was reached at Help talk:IPA/Spanish#About R to change all instances of r that either occur at the end of a word or precede a consonant (i.e. any symbol except a, e, i, o, or u, or j or w) to ɾ inside the first parameter of {{IPA-es}}. There currently appear to be about 1,140 articles in need of this change. Could someone help with this task with a bot? Nardog (talk) 12:55, 20 August 2018 (UTC) – Fixed Nardog (talk) 15:28, 20 August 2018 (UTC)

Hi Nardog - I have a script, but could you confirm how these testcases should look, perhaps add other unusual cases that might come up:
*{{IPA-es|aˈðrβr|lang}}
*{{IPA-es|aˈðr βr|lang}}
*{{IPA-es|aˈðr-βr|lang}}
*{{IPA-es|aˈðr|lang}}
*{{IPA-es|aˈðrer|lang}}
*{{IPA-es|aˈerβ|lang}}
*{{IPA-es|ri|lang}}
*{{IPA-es|r|lang}}
*{{IPA-es|r r r|lang}}
*{{IPA-es|ir er or|lang}}
Thanks, -- GreenC 14:42, 20 August 2018 (UTC)
@GreenC: Thanks for taking a stab at this. In principle, any instance of r that is followed by anything (including a space, | or }) except a, e, i, j, o, u, or w in the first parameter of {{IPA-es}} must be replaced with ɾ, so the first seven would be
*{{IPA-es|aˈðɾβɾ|lang}}
*{{IPA-es|aˈðɾ βɾ|lang}}
*{{IPA-es|aˈðɾ-βɾ|lang}}
*{{IPA-es|aˈðɾ|lang}}
*{{IPA-es|aˈðreɾ|lang}} <!-- the first [r] should also be [ɾ] anyway but that is not relevant here -->
*{{IPA-es|aˈeɾβ|lang}}
*{{IPA-es|ri|lang}}
and the final one *{{IPA-es|iɾ eɾ oɾ|lang}}. *{{IPA-es|r|lang}} and *{{IPA-es|r r r|lang}} should probably be unmodified, but I found no such occurrence by searching hastemplate:IPA-es insource:/IPA-es[^\|]*?\|[^\|\}]*?[ \|]r[ \|\}]/. As a side note, a few transclusions include a comment inside the first parameter, but those have been fixed so you can likely exclude them. I've also fixed some other idiosyncratic cases like this and this. Nardog (talk) 15:28, 20 August 2018 (UTC)
Alright that works. Since it's only 1,140 we can probably do this right away fully supervised. Would you be willing to help verify the edits? I can do 50 or so on the first run to make sure it's working correctly, and then 500 after that. If there are no problems in the 500 the rest can be finished with spot checks (it will be obvious there is a problem by looking at the edit byte count which will be uniform size 0 byte). -- GreenC 16:35, 20 August 2018 (UTC)
Sounds great! (ɾ is 3 2 bytes long so it would be +2 +1 per change, by the way.) Nardog (talk) 16:39, 20 August 2018 (UTC)
Nardog - Alright just did 73, hard to control exact number - see diffs for User:GreenC bot (ignore first 5 a typo in the script caused catastrophic fail). -- GreenC 17:06, 20 August 2018 (UTC)
@GreenC: Sorry, please add j and w to the list of symbols that can follow [r] (see the correction to my OP of this thread, I should have made it clearer). Other than that, the corrections are spot on. (2 bytes it was...) Nardog (talk) 17:31, 20 August 2018 (UTC)

─────────────── 500 done, I'll wait till tomorrow to finish the rest. -- GreenC 19:03, 20 August 2018 (UTC)

@Nardog: No response by anyone from yesterday's run which is a good sign so I went ahead and finished the rest it's a clean sweep (nice search formulation btw). This script is basically a cut and paste "bot complete" for anyone with a bot flag and OAuth credentials; it identifies the article names, downloads the wikisource, makes the changes and uploads the new article with edit summary. I can do it or anyone else in the future as needed. -- GreenC 15:58, 21 August 2018 (UTC)
Awesome ;) Nardog (talk) 16:48, 21 August 2018 (UTC)
Script

Dependencies: GNU awk, wikiget.awk, library.awk. MIT License User:GreenC 2018
./wikiget -a ": hastemplate:IPA-es insource:/IPA-es[^\|\}]*?\|[^\|\}\<]*?r[^aeijouw]/" > ipa; awk -ilibrary '{fp=sys2var("./wikiget -w " shquote($0)); c=patsplit(fp,field,/{{IPA-es[^}]*}}/,sep);for(i=1;i<=c;i++){patsplit(field[i],aa,/[|]/,a); sub(/r$/,"ɾ",a[1]); gsub(/r[ ]/,"ɾ ",a[1]); d=patsplit(a[1],b,/r[^aeioujw]/,bb);for(j=1;j<=d;j++) sub(/^r/,"ɾ",b[j]);a[1]=unpatsplit(b,bb); field[i] = unpatsplit(aa,a) } if(unpatsplit(field,sep) != fp){ print shquote($0); sys2varPipe(unpatsplit(field,sep), "./wikiget -E " shquote($0) " -S " shquote("[r] → [ɾ] in IPA for Spanish [[Help_talk:IPA/Spanish#About_R|per discussion]] and [[Wikipedia:Bot_requests#%5Br%5D_%E2%86%92_%5B%C9%BE%5D_in_IPA_for_Spanish|botreq]]") " -P STDIN")}}' ipa

Cleanup needed in Category:Featured articles needing translation from <language>Edit

Hello! The other day I was looking through Category:Featured articles needing translation from Chinese Wikipedia. This is a hidden category containing articles that have featured status on the Chinese Wikipedia. This is nifty because now we know we have a high quality article in Chinese that can be translated into English.

However, I noticed that a lot of these articles have had their featured status removed long ago (e.g. glove puppetry). Then I looked into other subcategories of Category:Featured articles needing translation from foreign-language Wikipedias and found similar problems. For example, Nationalist Party of Castile and León is in the category Category:Featured articles needing translation from Spanish Wikipedia, even though the corresponding article in Spanish appears to have never been featured.

This looks like a problem a bot could fix without much trouble. Just go each subcategory of Category:Featured articles needing translation from foreign-language Wikipedias, then for each article in that subcategory verify that it is indeed featured on the corresponding language's edition of Wikipedia, otherwise remove it from the category.

While we're at it, we could populate these categories by going through the list of featured articles in each language and adding each one to the appropriate category. This might be desirable because these categories are a bit sparse and many don't seem to be actively maintained. However this might be too big of a change to automate without consensus from the Wikipedia:WikiProject Intertranswiki or elsewhere.

I am relatively new to Wikipedia so sorry if this request is inappropriate! Flurmbo (talk) 21:45, 23 August 2018 (UTC)

  Doing... - @Flurmbo: - I think this is a great idea. I have started a discussion at the following projects Cleanup, Languages, and Categories. If this gains consensus I will be willing to complete the request. Kadane (talk) 18:32, 8 September 2018 (UTC)
Also asked for comments at Wikiproject Intertranswiki. Kadane (talk) 18:40, 8 September 2018 (UTC)

Bot to assist in uploading/linking spoken word audio filesEdit

I have an idea for a bot (well a piece of software actually) to assist in uploading/linking spoken word audio files.

In particular it would help with article title uploads. Eg short recordings, like "Scott Morrison" to be used in the Wikipedia article.

Workflow:

  1. Display to the user an article title. Eg a random page or member of a category.
  2. Record the user saying the title.
  3. Name the recording (with the article title, language, accent)
  4. Upload the recording to Commons (in ogg format).
  5. If no other recording exists for the article, link the recording from a article page (perhaps a subpage like Scott Morrison/spoken - we can work out the details later).
  6. Repeat step one.

If you need clarification let me know.

Does this software already exist, perhaps in other Wikimedia projects?

Potentially I could make the tool myself, but it would take a loooong time. If no one can assist could you point me in the right direction - I was thinking of using Python as the pywikibot would help with some of the tasks.

Thanks for your time, Commander Keane (talk) 08:22, 31 August 2018 (UTC)

  • @Commander Keane: For clarification: you intend that such recordings only contain the title of the article, not a readout of the whole article, right?
That sounds feasible, but I am not sure it is easy (technically) to get the recording via browser. It is probably easier to have one user/client-side tool that (1) picks a list of article titles, (2) records the user saying them, (3) stores the whole thing with relevant metadata on the user's disk; and then to have another tool, either web/Wikipedia-based or client-side again, that (a) reads a directory on the user's disk, (b) uploads ogg files to Commons according to metadata or file naming conventions, (c) put relevant tags and links on article talk pages on en-wp. The first tool could be configured once to save user metadata (accent, preferred saving format, etc.) in every file. The second tool can also be made compatible with existing recordings, e.g. if someone names their files [article title]-title.ogg simple string matching can catch it, and made to upload more than title readouts (think [article title]-title.ogg, [article title]-lead.ogg and [article title]-fullarticle.ogg).
I have zero experience with audio files programming, but I can give a go at the first tool in Python, if nothing similar pops up. For the second tool, you will probably need some consensus first; since it uploads files and potentially writes to talk pages etc. it must go through approval processes (WP:BRFA on en-wp, I dunno what on Commons). There are also security issues, I think (we do not want a vandal to use the tool to upload a.ogg, b.ogg, ... z.ogg, aa.ogg, ... zzzzzz.ogg to Commons). TigraanClick here to contact me 09:15, 31 August 2018 (UTC)
No surprise - it's c:COM:BRFA on commons Ronhjones  (Talk) 19:57, 31 August 2018 (UTC)
  • Yes I am just thinking about recording the title of the article. I think you understand the concept well. Splitting it into two tools is interesting.--Commander Keane (talk) 09:38, 31 August 2018 (UTC)
  • Update: I have created a tool that takes a list of articles, records and saves the audio to disk. The next steps need to be compleetd though.--Commander Keane (talk) 08:57, 14 September 2018 (UTC)

Update GA section of Template:WPVG announcementsEdit

See my post at WT:Lua#Can Lua be used to parse a section of one page, change the contents and be transcluded on another page? for more details. Basically, the bot would take information from WP:GAN#VG, reformat it and add it to a new page that can be transcluded to {{WPVG announcements}} which is currently updated manually. @GamerPro64, TarkusAB, TheJoebro64, ProtoDrake, and Lee Vilenski: You are currently the main users doing those updates, do you think there is a problem to have a bot handle this in future? Regards SoWhy 10:02, 11 September 2018 (UTC)

@SoWhy: Are you requesting a bot to update the entire template, or just the WP:GAN section? If just one section, why not have a bot update the entire thing? Kadane (talk) 14:45, 11 September 2018 (UTC)
The GAN part is the one that sees the most changes. It would probably make sense to update the other parts of the template via bot as well but I don't really know how they are updated at the moment. I'll cross-post this to WT:VG for more input. Regards SoWhy 15:12, 11 September 2018 (UTC)
I think I suggested this a while back but usually the procedure is to get local consensus first before requesting that someone write a bot. Could pull from WP:VG/AA too. Also, a bunch of the WPVG regulars like updating lists manually, fwiw. (not watching, please {{ping}}) czar 01:51, 15 September 2018 (UTC)

Bot to fix capitalization of "Senator" in specific contextsEdit

I'd like to request bot help to downcase "Senator" to "senator" (and the plural) in specific titles and links to those titles, as follows:

This came up at Talk:Dan_Sullivan_(American_senator)#Requested_move_8_September_2018 and I've started an RFC to see if there's any objection, at Wikipedia:Village_pump_(policy)#RFC:_Capitalization_of_Senator. So far, nobody is claiming that senator or senators in these contexts is part of a proper name. Dicklyon (talk) 03:45, 15 September 2018 (UTC)

The problem is that the capitalization of “senator” depends on context, and bots are notoriously bad at determining context. This is something that should be done manually. Blueboar (talk) 11:40, 15 September 2018 (UTC)
If you look at the bot proposal above, you can see I'm only proposing very narrow specific contexts where the bot can easily get it right; 250 specific moves, and the links to them (not messing with any piped text of course). The rest would need to be done by hand, as you note. Dicklyon (talk) 16:13, 15 September 2018 (UTC)
There should be no need to "fix all links to" any redirects that may be created by a page move; this is WP:NOTBROKEN. --Redrose64 🌹 (talk) 21:01, 15 September 2018 (UTC)
Presumably it would make sense to convert United States Senator from Alaska -> United States senator from Alaska because of how those words display on the page. If it's a piped link it wouldn't make sense to convert as you say it's not broken. -- GreenC 21:14, 15 September 2018 (UTC)
Right, if it's a piped link, someone made an explicit choice about how it should appear, and we shouldn't have a bot mess with that. It's optional, unnecessary but harmless, to fix the part before the pipe, however. Dicklyon (talk) 03:37, 23 September 2018 (UTC)
  • Oppose for multiple different reasons. First, I'm not convinced that "United States Senator" is incorrect; though I agree that "American Senator" is incorrect. Pages such as List of United States Representatives from Nebraska would be equally wrong. For categories, any change should encompass all the subcategories of Category:United States Senators. Perhaps a "Members of the United States Senate" formulation would be better. There's also no point to use a bot to update links to redirects; several of these formulations are already redirects and all of them should remain as {{R from other capitalisation}} forever. Finally, a proposal to move articles should be a WP:RM; you can probably bundle all 50 states into a single proposal if the page titles are otherwise the same. power~enwiki (π, ν) 03:45, 16 September 2018 (UTC)
    Can you explain your objections better? How can "United States Senator" be correct in the contexts mentioned? And are you objecting because I didn't go further correct other over-capitalization at the same time, like Representative? I'd be happy to add that on, but no need to do everything at once. As for links to redirects, there is absolutely a point. The redirect links usually appear in article text, over-capitalized; downcasing them corrects this very common style error in articles. Perhaps you didn't understand what corrections I meant; sorry if I was unclear. And yes I can easily generate the multiple-RM requests, but that seems like the wrong approach for such obviously uncontroversial corrections that have been discussed elsewhere; and even if the moves got done it would leave a ton of cleanup work for someone, where a bot would be a huge help. Dicklyon (talk) 03:23, 17 September 2018 (UTC)
    My concern is that in "Alabama senator", the word "Alabama" describes the person. In "United States Senator", "United States" describes the legislative body, not the person. Also, (U.S. senator) simply looks wrong to me. I doubt that consensus will agree with me on this point, so I'm not going to argue it in detail here. power~enwiki (π, ν) 04:05, 17 September 2018 (UTC)
    I don't see the difference. There's an Alabama Senate (I presume) and a United States Senate. But senator is a title whether it's an Alabama senator or U.S. senator, and doesn't need a cap except when attached to an individual's name as MOS:JOBTITLES explains. Dicklyon (talk) 02:50, 18 September 2018 (UTC)
    @Power~enwiki: Let us know if you're still concerned. Let's not worry about your "simply looks wrong to me", but rather focus on Wikipedia guidelines such as MOS:CAPS and MOS:JOBTITLES. Dicklyon (talk) 03:37, 23 September 2018 (UTC)
    Oh, you're only referring to updating un-piped links in articles? I was thrown off by the use of the word "move". I guess that's fine, though I don't see the point of changing template names (and categories should go to CfD regardless; they already have bots for that). It's probably possible to do that with AWB fairly easily; I've never used that so you'll have to ask someone else. power~enwiki (π, ν) 04:05, 17 September 2018 (UTC)
    OK, we can separately do the Categories at CfD when this is all settled. Dicklyon (talk) 02:50, 18 September 2018 (UTC)

Revised plan: downcase Senator, Senators, Representative, Representative, in these contexts, when they exist (most of the added ones don't, but we can try):

And templates:

The objection above by power~enwiki seems to have gone away, as he has not responded to pings about whether his concerned have been answered. Dicklyon (talk) 14:26, 26 September 2018 (UTC)

Bot to update 'Needs infobox'Edit

So I was browsing the WP Backlog and came across a number of categories regarding articles needing Infoboxes. For example, Category:Baseball articles needing infoboxes, Category:Baseball articles needing infoboxes, Category:BBC articles without infoboxes. Did a little clicking around and found that a number of these actually already have had Infoboxes added but the parameter on the talk page was never updated to reflect this. So, my thought was to create a bot that would do the following:

  1. Take a list of categories that are of the basic format <article type> articles needing infobox
  2. Routinely (weekly?) checks those pages for an infobox.
  3. If the page contains an infobox, updates the |needs-infobox=yes on the talk page.

I'm happy to tackle creating this bot myself but wanted to discuss it first and see what others thought? --Zackmann08 (Talk to me/What I been doing) 23:59, 22 September 2018 (UTC)

Sounds like a great idea. Category:Wikipedia articles with an infobox request definitely looks in need of bot help. -- GreenC 00:39, 23 September 2018 (UTC)
Sounds OK to me, Suggest start at Category:Wikipedia backlog, get the list and pick out the "needing infoboxes", "without infoboxes" "needing an infobox" (nothing like consistency...). Come back if you get stuck writing it. Ronhjones  (Talk) 19:47, 23 September 2018 (UTC)
@Ronhjones and GreenC: Wikipedia:Bots/Requests for approval/ZackBot 10 --Zackmann (Talk to me/What I been doing) 20:45, 28 September 2018 (UTC)

BBLd - LinkfixEdit

https://en.wikipedia.org/w/index.php?title=Special:LinkSearch&limit=500&offset=0&target=http%3A%2F%2F%2A.bbl-digital.de

instead of

http://bbl-digital.de/eintrag/$1/
http://www.bbl-digital.de/eintrag/$1/

it should be

https://bbld.de/$1

i.e. new domain, httpS, no "eintrag/" and no final "/". 78.55.121.98 (talk) 02:33, 23 September 2018 (UTC)

It doesn't seem to work with this test:
http://www.bbl-digital.de/eintrag/Adlerberg-Woldemar-Eduard-Ferdinand-v.-1791-1884/
https://bbld.de/Adlerberg-Woldemar-Eduard-Ferdinand-v.-1791-1884
-- GreenC 12:49, 23 September 2018 (UTC)

I was curious if this is a problem in huwiki. So I checked both your link and the huwiki list, tried several links and they all worked. There is an automatic redirect on that site to the correct address without eintrag and trailing /, so IMO it is not worth to change them unless somebody is bored. Not a big deal although. If some of them does not work, another problem may cause the trouble, that's why GreenC's test didn't work. Bináris (talk) 14:47, 27 September 2018 (UTC)

I don't understand how an external link works on huwiki but doesn't work on enwiki. -- GreenC 16:11, 27 September 2018 (UTC)
I clicked on the uppermost link in this section and tried several links randonly, all worked. You may have found one problematic. Bináris (talk) 16:18, 27 September 2018 (UTC)
Do these work? [2][3][4][5][6][7][8][9] .. for me only two of them work (near the end). They were not chosen randomly from the list, they are the first 8 (starting at line #14). -- GreenC 16:25, 27 September 2018 (UTC)
Same for me. But the problem is not with the form of the link or the redirect. These are simply bad or broken links. Bináris (talk) 16:54, 27 September 2018 (UTC)
Got it. Someone would need to manually find the correct links and update the pages. Only about 40 to check. -- GreenC 17:03, 27 September 2018 (UTC)

portalEdit

hi please creat bot to adding portals to articles many articles not any portal example https://en.wikipedia.org/wiki/Billy_James_(footballer) — Preceding unsigned comment added by Amirh123 (talkcontribs) 11:50, 26 September 2018 (UTC)

  Not a good task for a bot per WP:CONTEXTBOT. Headbomb {t · c · p · b} 14:45, 26 September 2018 (UTC)
@Amirh123: Please review the various requests that you have made here in the past, and the replies that we have left. --Redrose64 🌹 (talk) 15:23, 26 September 2018 (UTC)

CAT:MISSFILE botEdit

Hi! In regards to CAT:MISSFILE, we have a significant issue which could be helped greatly by the creation of a few bots. For the past few weeks now, Sam Sailor and I have been patrolling the backlogged category (you can check our documentation of the backlog in Sam Sailor’s documentation log and Katniss’s documentation log.) In helping to lessen the backlog, we noticed a few patterns which could greatly reduce the back of the category if a bot were to preform the tedious manual tasks.

  1. Often, users will attempt to change the name of a file in good faith to correct a perceived typo. There have also been several cases in which even experienced users will change certain incorrect punctuation in a file name. Of course, this causes the image names to link to files are not technically linked or uploaded on the project servers.
  2. Another large issue is users adding file names to the articles before they are uploaded, and then often times, forgetting to upload them completely.
  3. There are also a few issues with the current bots created for this category, User:CommonsDelinker and User:Filedelinkerbot. Even after they are removed from Commons, the bots are sometimes not performing their duties. There were several instances of files (and audio files) deleted from Commons back in July which were still present in the articles when we manually removed them in September.

As an experiment, neither of us patrolled CAT:MISSFILE for a period of 10 days, and the backlog already grew again to 681 articles in the short time that the category was not patrolled (for more information on this, you can read our talk page conversation). Though neither of us are particularly tech savvy (and thus wouldn't know the technical way to describe the commands that would be most efficient for the bots to perform), we believe that the creation of bots to perform these tasks would help to greatly reduce the backlog in that category. If anyone has any suggestions, thoughts, or ideas in regards to creating bots to efficiently completely these tasks, that would be great! Thanks! (Courtesy ping for KylieTastic, who may potentially be interested in this conversation even though she is currently on WikiBreak) Katniss May the odds be ever in your favor 14:05, 27 September 2018 (UTC)

In terms of existing bots @Magnus Manske and Krd:.. Filedelinkerbot is running and CommonsDelinker is running. If they are not removing certain links report the errors as you find them. Also, maybe ask them to investigate new features like above; since they wrote bots designed for this sort of thing a good place to start. -- GreenC 15:57, 27 September 2018 (UTC)
Thanks for your advice, GreenC. I have invited them both to contribute to this discussion. To clarify, the issue goes beyond a few errors. In patrolling CAT:MISSFILE the past five months, I can safely say there have been hundreds of files over that period that are deleted but the bots are not catching them. The reason for creating this discussion (in addition to exploring the other two suggested features) was to hopefully come up with a solution less tedious than reporting every single file that the bots aren't catching, as it's clearly a lot. Thanks, Katniss May the odds be ever in your favor 18:32, 27 September 2018 (UTC)
I appreciate the initiative, but I sadly don't have enough resources to check hundreds of files. Please advise if there is some pattern visible, or please report few examples for further investigation. Thank you. --Krd 18:59, 27 September 2018 (UTC)
Hi Krd, the only pattern I've noticed is that probably 99.9% of the files that aren't being deleted are ones that were deleted from Commons by Explicit, though I have no idea if that means anything or not. Regarding the other feature requests as well, would either of the issues above warrant creating new features on the existing bots (or new bots altogether)? In example, maybe a bot could detect users changing a filename in the mainspace without uploading it/changing the name of the file itself first, and revert those edits? Could a bot (either existing or new) also detect if users are adding images to the article mainspace that aren't uploaded (e.g. a file has never been uploaded to Wikipedia under that name before)? Hopefully I'm explaining myself okay, as I said this would be my first time making a bot request and thus I'm not sure if I'm explaining what's in my head right. Let me know if any clarification is needed, and I would be interested to get people's opinions on this! Best, Katniss May the odds be ever in your favor 13:01, 28 September 2018 (UTC)
Regarding your feature request, I'd say this is definitely more that a simple task, more than a few additional lines of code, but a small new project which I definitely have no resources for, neither for the actual coding nor the later maintenance.
Regarding the actual problem, please choose one example (and if possible please keep it uncorrected until I was able to look). --Krd 14:03, 28 September 2018 (UTC)
User:KatnissEverdeen. Both ideas (#1 and #2) are technically feasible (IMO), and justifiable, but not simple, maybe even kind of hard. Assuming the two features were implemented, estimating in your experience, how many links do you think such a bot would be fixing on a weekly basis? Dozens, hundreds, thousands? -- GreenC 14:54, 28 September 2018 (UTC)
Hi Krd, this one wasn't deleted by Explicit, but here is one example. File:Fyodor_Petrovich_Tolstoy_2.jpg was deleted from Commons on September 24, 2018, but is still showing up as a missing (red-linked) file on List of Russian artists. This was found just by checking two articles at random, the second one being the example I outlined above, so I'm sure there are plenty more given I was able to find one so quickly. From patrolling that category, this is a pretty regular occurrence. Katniss May the odds be ever in your favor 19:08, 28 September 2018 (UTC)
@KatnissEverdeen: As far as I see, Filedelinkerbot currently works for files only that are deleted at Commons, but File:Fyodor Petrovich Tolstoy 2.jpg was deleted locally at enwiki. I don't know why this isn't active, but think I can easily add that, although I'm not sure if additional discussion or approval is required. What do you think? --Krd 07:33, 1 October 2018 (UTC)
Krd, I would say there has been enough discussion. I can't imagine why someone wouldn't agree with you doing it. It's probably worth saying though that there are some files that were actually deleted from Commons (don't have any examples at the moment, but I could probably find one if you need it) that are slipping under the radar of the bots. Most of the issue comes from enwiki files however. Katniss May the odds be ever in your favor 13:23, 1 October 2018 (UTC)
I'm going to active it now, please block the bot if something goes wrong. And please report further examples, if there appear any in the future. --Krd 17:07, 1 October 2018 (UTC)
@Krd: You should probably file a BRFA for the "deleted at en.wp" part. I guess it will be approved quickly. --Izno (talk) 18:01, 1 October 2018 (UTC)
Done. --Krd 06:08, 2 October 2018 (UTC)

─────────────────Thank you, Krd! The bot seems to be working beautifully and has already reduced the backlog by several hundred files! I would say #3 of my list above is   done!   Katniss May the odds be ever in your favor 13:34, 2 October 2018 (UTC)

Hi GreenC, let's see, so in the two weeks (give or take a day) that the category hasn't been patrolled, the category is up to 706 not counting the 10 templates that are also in that category. So let's say roughly 700 in the category at the time of my reply. I would say from experience that about 75% of the pages in that category are there because of the three things I outlined above. So 75% of 700 would be 525 files in two weeks, divided by two for a single week would make ~263 a week. Katniss May the odds be ever in your favor 19:12, 28 September 2018 (UTC)
Bots are terrible at evaluating context, and #1 and #2 are quite different. I think you will struggle with #1 unless you have a bot that is constantly running and uses "recent changes" to check if the file name has been altered and if the resulting file name is bad - otherwise you can end up with a page that could have several edits between the file change and the current version, and having a bot revert a much older edit won't be easy. I wonder if such a system could be added to ClueBot - ping @Cobi:. #2 is not a problem, however if you don't fix #1 then the bot will end up removing links when someone has changed the file name - one could wait a fixed time before doing that, so it it's not fixed in X days, then the link gets removed. Ronhjones  (Talk) 20:14, 30 September 2018 (UTC)

Hi Ronhjones, I really like your suggestion about adding a feature to ClueBot. Often, some of the file name changes are sneaky vandalism anyways, so it would make sense for ClueBot to have a feature which catches that. What would you (or anyone who wants to jump in) say is an appropriate amount of time to wait? A week, a few days, a few hours (as ClueBot seems to always catch vandalism right away)? Another suggestion, would it be possible to adding a warning when someone alters a file name that would pop up as they hit save? For example, "You are about to alter a file name. Changing a file name, even due to typos/grammatical errors, may cause the file name to break. Are you sure you want to do this?" It certainly wouldn't prevent all cases, but it would definitely cut down on the accidental file name changing issue. Katniss May the odds be ever in your favor 23:36, 30 September 2018 (UTC)

@KatnissEverdeen: Hopefully the cluebot operator will comment. Another option, which just occurred to me, is an edit filter - thus you could trap all the #1 with an edit filter (maybe warn and only allow respected editors to edit), then that makes it easy to attack the rest as just needing the image link removed. I'm no edit filter expert, why not suggest at Wikipedia:Edit filter/Requested and see what happens? If you can get a quick stop/revert system to fix #1, then I would think a 1 day wait would do for #2, with say, the bot running once a day. Ronhjones  (Talk) 00:01, 1 October 2018 (UTC)

Good suggestion, Ronhjones! I'll suggest it, but since I'm definitely not an edit filter (or bot request!) expert, what would you say qualifies as a "respected user" in this case? Autoconfirmed? Someone with 100+ edits? Just asking to make sure I'm understanding correctly, as well as the fact that it's not just non-autoconfirmed users that are making this mistake. Katniss May the odds be ever in your favor 01:01, 1 October 2018 (UTC)

@KatnissEverdeen: I "think" the edit filter can accept/deny based on the user edit count. I would think autoconfirmed too low, would let in the determined vandals, 50-100 would be nice. Also they might be able to stop AWB edits - I saw a page you fixed where there was an AWB run fixing "fancy" quotes to normal quotes. Ronhjones  (Talk) 01:06, 1 October 2018 (UTC)

Ronhjones That "fancy" quote issue actually comes up more often than you'd think, in fact I've even seen many experienced users make the mistake. I've gone ahead and made the edit filter request here. Katniss May the odds be ever in your favor 01:32, 1 October 2018 (UTC)

Kat, WP:Edit filter can't revert but they can log, warn or block edits. I see they are also only meant to be used for abusive edits, this may or may not be considered vandalism. Will see what they say. Ronhjones is right that #2 is more feasible than #1, though I think #1 is (theoretically) possible by monitoring EventStreams and comparing diffs, it's just would be a lot of work and resources to setup and run. --GreenC 12:46, 1 October 2018 (UTC)

GreenC I would say that the file name changes fall into three categories. 1) deliberate/blatant vandalism 2) sneaky vandalism, often changing one letter of a filename (i.e. if the file name was "Katniss," a user might vandalize it and change it the K to a C, "Catniss", etc.) 3) good-faith attempts, such as correcting punctuation or spelling, to 'correct' a file name (which breaks the file). I would argue that the first two categories would warrant an edit filter, and there's not really any harm in having the edit filter warn or stop edits in the third category seeing as it's usually a mistake anyways. Katniss May the odds be ever in your favor 14:37, 1 October 2018 (UTC)

I agree with GreenC, in that a bot for #1 can only be done by a continuously running one, monitoring the changes - not easy (and not for me - I only run RonBot from the PC). I commented on the edit filter page - deliberate/blatant vandalism does occur and I have often see many a nice portrait replaced with a large image of some sexual organ - admittedly they do get reverted fairly quickly, but not before half a dozen annoyed readers have posted an e-mail to OTRS! Ronhjones  (Talk) 15:29, 1 October 2018 (UTC)

Regarding "There have also been several cases in which even experienced users will change certain incorrect punctuation in a file name.", what about a bot that does something like what User:DPL bot does, which is to notify users who introduce red linked file names.. Galobtter (pingó mió) 17:19, 1 October 2018 (UTC)

Not sure how DPL bot 2 works and if the strategy would work here - ping the owner for comment @JaGa: Ronhjones  (Talk) 19:37, 1 October 2018 (UTC)

Just to give people a heads up that may not be following this page, the edit filter request I made was denied due to technical limitations. Seeing as an edit filter isn't feasible, is there maybe another way we could have a warning pop up when someone hits save (before the page is offically updated)? Such as, "You are about to alter a file name, which if not properly linked will break the image. Are you sure you want to do this?" or something like that. Katniss May the odds be ever in your favor 18:58, 11 October 2018 (UTC)

@KatnissEverdeen: The only thing to stop a live change in it's tracks is an edit filter. MusikAnimal said We can to some extent detect changes to image link syntax, but unfortunately we can't detect if the image exists - which does not really help you too much - especially the "to some extent" part - if they could have detected all image link changes, at least that might have been useful to pop up a "you are changing an image link" message. I don't see any easy answer now. If we were to check if the link points to a deleted image, and if so remove it, then we are at risk of there being a deleted image that someone has coincidentally changed the file name to - the bot would not be able to work out that it was a bad name change, and not a unlinked deletion. One option is to just remove all image links that do not point to a current image, that I suspect would be rather controversial (but much easy to code!) and would certainly need some discussion and support before any approval. We could have a bot that just sends a message to the last editor to say that there is now a broken image link, and ask if they broke it - but if it's an IP or it's a vandal then it's probably a waste of time. Ronhjones  (Talk) 01:17, 15 October 2018 (UTC)
Thanks for replying, Ronhjones. From my experience, the majority of editors doing this are IPs who haven't necessarily familiarized themselves with the rules, so the bot sending people messages probably wouldn't help unless it is an experienced editor who has accidentally altered an image name. "One option is to just remove all image links that do not point to a current image" - I'm not sure I agree with this idea. I think, like you said, we would be at the risk of removing valid image links which vandals have altered. Is there a way for a bot to sense if the file name has been changed recently? Katniss May the odds be ever in your favor 13:04, 15 October 2018 (UTC)
I think you mentioned that some experienced editors also break file links, so there would be some use to notifying experienced editors that they have broken the image right? Galobtter (pingó mió) 13:40, 15 October 2018 (UTC)
@KatnissEverdeen and Galobtter: I think Galobtter has a valid point, if we are only posting a simple standard template (to be written) to a user talk page, then there is no harm done. Even if said editor was not the culprit, since they have edited the page, they conceivably might have an interest in the subject and might fix it anyway. One could also duplicate the message on the article talk page - editors might have the page in their watch lists - OR, better still I think, a small template to the top of the article page like the cleanup tags - I could write a task for my bot to do all or some of that (and as it's an adminbot, page protection does not get in the way).
As for working out the change of link, that does really need a continuously running system like ClueBot or DPL bot 2 (owners were pinged but never answered), which can monitor the recent changes. To try to do that, say once a day, I think would be a nightmare of trying to compare revisions - easy for a human, difficult for a bot (and not for me, as my bot talks are PC based).Ronhjones  (Talk) 15:31, 15 October 2018 (UTC)
DPL Bot 2 doesn't check continuously but twice a day, seeing what new dab links are there and notifying users (See BRFA). I can see a similar system working for this category too. Galobtter (pingó mió) 15:36, 15 October 2018 (UTC)

────────────Given what we have tried/suggested so far, I would say creating a standard template to post to the user talk page and article talk page would definitely help with the issue, at least a little bit. Katniss May the odds be ever in your favor 15:40, 15 October 2018 (UTC)

  Coding... @KatnissEverdeen:I'll make something up. Of course, it does not stop their being another bot added later, if someone can work out a useful method. Ronhjones  (Talk) 16:36, 15 October 2018 (UTC)
@Ronhjones: Thanks in advance for creating a talk page template for this. I also patrol this category and a template would be very helpful! (Although an update to ClueBot also seems like a great idea for some of the image-related vandalism reversions.) - tucoxn\talk 16:49, 15 October 2018 (UTC)

  BRFA filed Ronhjones  (Talk) 23:37, 15 October 2018 (UTC)

  BRFA filed This is in regard to messaging users a la DPL Bot 2. Ping KatnissEverdeen Galobtter (pingó mió) 10:09, 16 October 2018 (UTC)

fix ping @Tucoxn and Sam Sailor: Galobtter (pingó mió) 10:10, 16 October 2018 (UTC)
Wonderful, thank you both, Ron and Galobtter. Sam Sailor 10:16, 16 October 2018 (UTC)

Request for a list of article-space pages with the most Linter errorsEdit

[I was sent here from Wikipedia talk:Linter.]

While working on Linter table tag errors, I stumbled across Greek football clubs in European competitions, which had over 1,000 Linter errors, mostly missing end tags (crazy diff here!). I was able to fix them with a series of find-and-replace operations, so it wasn't too bad. Is there a way to find pages with many Linter errors? We could reduce our total count more quickly if we could knock out some of the worst offenders.

This is a request for a list of 1,000 pages with the most errors in article space, or a list/table of all article-space pages with 20 or more errors (and how many of each type of error exist on each page). Does anyone know how to create such a list? The list will need to be recreated periodically as gnomes work through it, fixing errors. Thanks. – Jonesey95 (talk) 11:57, 4 October 2018 (UTC)

I don't know what's involved, but wonder if User:Firefly would be interested in adding this to Firefly Tools? -- GreenC 13:06, 4 October 2018 (UTC)
Jonesey95 Done! I realised this could be done through quarry, and after some fiddling around got Quarry:query/30386 working. I made into a table at User:Galobtter/Articles_by_Lint_Errors Galobtter (pingó mió) 19:09, 14 October 2018 (UTC)
Excellent! Thanks Galobtter. Is this something that could be updated periodically, like once a week? I put it on my watchlist and will be working on the articles. – Jonesey95 (talk) 04:42, 15 October 2018 (UTC)
Jonesey95, Yes; I can manually run the query every week and that'll only take a few minutes work; but I'll also see if I can get User:Galobot on it :) Galobtter (pingó mió) 04:49, 15 October 2018 (UTC)

Can someone take over User:Bot0612?Edit

Firefly has been inactive for two months and Bot0612 task 9 no longer updates many WP:VITAL article lists. I haven't looked at other tasks, but can anyone take over the functions currently done by Bot0612? feminist (talk) 09:01, 7 October 2018 (UTC)

@Feminist: Have you tried to contact him? He was around 7 days ago, and he has an e-mail link. User:Bot0612/shutoff/9 suggest the bot is still active. It may be some wiki change that has broken the bot, and he would be better placed to fix it. If he wants to give it up, then the bot code is published. Ronhjones  (Talk) 22:06, 10 October 2018 (UTC)
Never mind, the bot seems to work correctly now, at least for this task. Thanks. feminist (talk) 03:58, 11 October 2018 (UTC)

[r] → [ɾ] in IPA for ItalianEdit

In Italian the letter ⟨r⟩ – usually trilled as [r] – is systematically realised as a flap [ɾ] in certain positions, which I don't think is currently in our IPA entries. Specifically it occurs in any unstressed syllable with a vowel on either side,[1] and in the onset of a mid-word syllable with secondary stress.[2]

Formally: [r] → [ɾ] / [ˈVːɾV, (V/C)(ˌ) ɾV-, Vɾ-, -ɾ(ˈ)C-]

References

  1. ^ Which for this case should just any unstressed syllable I think.
  2. ^ Romano, Antonio. "A preliminary contribution to the study of phonetic variation of /r/ in Italian and Italo-Romance." Rhotics. New data and perspectives (Proc. of’r-atics-3, Libera Università di Bolzano (2011): 213-214.

ReconditeRodent « talk · contribs » 12:50, 7 October 2018 (UTC)

This looks similar to the script made for Wikipedia:Bot_requests#[r]_→_[ɾ]_in_IPA_for_Spanish which might be applicable with some tweaks. But it would need a discussion somewhere, and an explanation what to do as I am not familiar with lexicographic/linguistic terminology (trilled, flap, unstressed syllable, onset of a mid-word syllable with secondary stress). Would also need to develop a search formula like this. @Nardog: -- GreenC 13:10, 7 October 2018 (UTC)

Whoops, I summarised Canepari's analysis which is slightly at odds with the actual conclusion of the review. What I said earlier maybe would be how an Italian speaker hears it, but for our purposes I suppose that means the only confirmed standard realisation of [ɾ] is in the unstressed intervocalic position. So whenever there's an 'r' between any two vowels ('iueoɛɔa'), it should be replaced with an 'ɾ'. There may also be a syllable break '.' and/or vowel lengthening mark (':' or 'ː') between the 'r' and the previous vowel. This should narrow the search down to unstressed syllables automatically as the stress mark (' or ˈ ) would get between 'r' and the previous vowel.
I tried to make a search expression which seems to work but I wouldn't mind someone else checking my logic. ─ ReconditeRodent « talk · contribs » 14:39, 7 October 2018 (UTC)
44 results. Normally that is too small for a bot, but since the code is already done with a few tweaks, and it's so few cases I can run it if you want. -- GreenC 14:53, 7 October 2018 (UTC)
Oh a lot more than 44. Something wrong with my offline script gave a wrong number. -- GreenC 15:32, 7 October 2018 (UTC)
Yeah. I just realised this isn't Wiktionary. Anyway, I messed up the last search expression by putting in the stress marks so use this instead. Thanks. ─ ReconditeRodent « talk · contribs » 15:37, 7 October 2018 (UTC)
502. That's botable but not so many to require botrequest. Can we apply the same consensus from the Spanish to the Italian, do you foresee anyone would object? -- GreenC 16:11, 7 October 2018 (UTC)
Well there was a bit of debate about it back in 2009 at Talk:Italian_phonology#flap_vs_trill but the source I gave is solid, and I doubt anyone would object. For the record, this independent blog post was what put me on to this in first place. ─ ReconditeRodent « talk · contribs » 16:37, 7 October 2018 (UTC)

You couldn't give me a hint on how to write a search expression for Wiktionary, could you? The only difference is that instead of an IPA-it template you have an IPA template with "lang=it" somewhere inside it. ─ ReconditeRodent « talk · contribs » 17:03, 7 October 2018 (UTC)

This maybe, it's a technique I've never used before. Will wait a day or two for comment on the bot. -- GreenC 21:02, 7 October 2018 (UTC)

There is no consensus for this. Please discuss it first at Help talk:IPA/Italian before changing existing transcriptions. Nardog (talk) 02:12, 8 October 2018 (UTC)

We are currently discussing this at Talk:Italian phonology#flap vs trill.   イヴァンスクルージ九十八(会話)  06:38, 8 October 2018 (UTC)

ViewBotEdit

for editing subjects that people would be extremely tedious about. name of bot:ViewBot. Huff slush7264 Chat With Me 22:49, 8 October 2018 (UTC)

@Huff slush7264: So.... What would the bot do? SQLQuery me! 22:52, 8 October 2018 (UTC)
  • what User:タチコマ robot does. or maybe just fix caps on letters Huff slush7264 Chat With Me 22:55, 8 October 2018 (UTC)
  • Are you working on it or do you not care? Can you make it for me? I am not very good at Python so really, It would be too hard for just me to program it. It could fix templates, and work on edits that wpuld take too long for a person to do. Please and thank you (you right now:(Ō_o) <(why please and thank you right next to each other?????) Huff slush7264 Chat With Me 15:48, 9 October 2018 (UTC)
Be specific. You shouldn't expect others to know what you're talking about just by saying "editing subjects that people would be extremely tedious about" or "what User:タチコマ robot does". Nardog (talk) 16:08, 9 October 2018 (UTC)

Remove infobox image requests from WP templates when the article has an infobox imageEdit

This should not be too hard. Many WikiProjects like WP:VG use a cover=yes (or cover=y) switch in their WikiProject banner on an article's talk page to populate a request category (e.g. Category:Video game articles requesting identifying art). However, sometimes editors add a cover to the article and forget / don't know about the WP banner request, leaving the article in the category despite not needing an image anymore. I'd like to request a bot to check all articles in Category:Video game articles requesting identifying art and see if the infobox has an (existing) image defined. If so, the bot should remove the cover=yes (and of course ideally log the removal somewhere so we can check whether it made mistakes). That way, when trying to eliminate the backlog, editors won't have to load articles that were already fixed. This would be a manual bot, run every once in a while. Anyone feeling like coding something like that? Regards SoWhy 07:17, 9 October 2018 (UTC)

There are articles that have screenshots in the infobox - a valid existing image that will usually need to be retained and moved to another location in the prose - but the request for a cover is a genuine one. So this request is not as straight forward as if image present, remove cover required flag. A better solution would be to have a bot run that creates a list of articles that have an image and a request for a cover, users can then manually clear that list first. This means that the category will then be free of articles with images and requests, eliminating the need for a regular bot run. - X201 (talk) 09:27, 9 October 2018 (UTC)
Articles with screenshots in the infobox are indeed a problem but how often will this actually be the case? Of course, the bot could instead compile a list of articles which contain an image but have cover=yes and someone can check them manually and then feed the list sans those false positives back into a bot/tool/AWB. I'd be happy to help check such a list if generated. Regards SoWhy 10:26, 9 October 2018 (UTC)
I'm generating a list now. - X201 (talk) 13:26, 9 October 2018 (UTC)
@SoWhy: Here you go. User:X201/Cover required but image present - X201 (talk) 14:28, 9 October 2018 (UTC)
@X201: Thanks! I'll go through it later and notify you when I'm done. Regards SoWhy 14:38, 9 October 2018 (UTC)

Update Wikipedia:WikiProject Women in Red/Drafts weeklyEdit

Per discussion at Wikipedia_talk:WikiProject_Women_in_Red#How_about_weekly_additions_to_drafts?, could somebody with an established bot take this task on? The code is in User:Ritchie333/afcbios.py - at the moment it prints the page to the standard output, which I redirect to a file and then copy and paste manually into the editing window. It would be much less hassle to bolt this task into an existing bot, rather than me creating one from the ground up. Ritchie333 (talk) (cont) 11:35, 9 October 2018 (UTC)

  Coding... Ronhjones  (Talk) 23:50, 10 October 2018 (UTC)
@Ritchie333: First userspace test at User:Ronhjones/Sandbox2. Ignoring the fact that the numbering went a bit off (It hit the max of 5000 pagenames, and the counter got zeroed when I did the second lot), it don't look too bad. I had to re-write it, as I've yet to get pywikibot to install on my PC - last time I tried, it corrupted the whole python system, and lucky I had a backup... Source is at User:RonBot/11/Source1 Ronhjones  (Talk) 13:41, 11 October 2018 (UTC)
I've installed Pywikibot on Linux, OS X and Windows using Mingw64, and it seems to work okay for all of them. Still, if you want to rewrite with raw APIs (which you have done - crikey, whoever wrote the module mwparserfromhell must have named it from experience), that's not an issue. Also, most of my scripts tend to just write to the standard output rather than chain things together, as other developers can test them quickly and easily. You can then replace print with a lambda argument and encapsulate bits of functionality. Anyway, that's just the way I work.
Another useful feature would be to have a list of false positives that the script generates. For example, it thinks Draft:Harry Edward Vickers (Flannel Foot) is in scope for the project because of the sentence starting "In May 1934 newspapers report that Scotland Yard have circulated a girl’s picture...." - the draft contains information about a woman, but the main subject isn't one. I was going to code that as a straight list, possibly reworking as an internal file "blokes.txt", and any draft whose title is in the list is skipped over. Ritchie333 (talk) (cont) 14:18, 11 October 2018 (UTC)
@Rosiestep: Does the output at user:Ronhjones/Sandbox2 look about right? Ritchie333 (talk) (cont) 14:21, 11 October 2018 (UTC)
@Ritchie333: We can have a subpage of the bot for false positives, then anyone can update them. Never mind the mwparserfromhell - the wikitools page.page was confusing the first time I encountered it! As an aside - as these are Drafts, would adding the date of the last edit date be useful? I know how to get the history dates. Ronhjones  (Talk) 17:06, 11 October 2018 (UTC)
The date might be useful - it would let us know which are in danger of deleted via WP:G13. Also the regex (defined as reText in my script) to work out what might be in scope would be useful as a configurable sub-page (though I'd probably want that protected in some form to stop vandalism). Ritchie333 (talk) (cont) 17:08, 11 October 2018 (UTC)
I'll have a play and see what works. Ronhjones  (Talk) 17:48, 11 October 2018 (UTC)
@Ritchie333 and Rosiestep:Shortened test at User:Ronhjones/Sandbox4. I need to tidy the date field (don't really need the time part). Note the lack of "Draft:Adam Zango" as he has been put in User:RonBot/11/FalsePositives. If that looks OK, I'll file the BRFA. Ronhjones  (Talk) 18:43, 11 October 2018 (UTC)
Yes, Ronhjones, this is an improvement. Question: will the page that Ritchie333 originally created be automatically updated or will there be a constant stream of new subpages with new entries? The latter would be impractical, I think. --Rosiestep (talk) 14:49, 12 October 2018 (UTC)
@Rosiestep: I was planning a weekly update of the one page. Ronhjones  (Talk) 17:04, 12 October 2018 (UTC)
But it can be any interval you like, I use Windows Schedule Tasks to run all my bots. Ronhjones  (Talk) 17:07, 12 October 2018 (UTC)
Ronhjones Perfect; thanks. When you have a moment, would you also please add explain all this on the Women in Red talkpage so that those pagestalkers are in the loop? --Rosiestep (talk) 17:31, 12 October 2018 (UTC)
@Rosiestep: OK, I am running a full length trial (now) to my user sandbox. When complete (and if OK), I'll add a bit to the page and start the WP:BRFA process. Ronhjones  (Talk) 17:54, 12 October 2018 (UTC)
Ronhjones, for transparency... Yesterday, I gave an elevator pitch to TNegrin (WMF) about what I really want (a spreadsheet which looks like other Women in Red Wikidata-SPARQL-QUERY lists, with columns including the name, year of birth, year of death, place of birth, place of death, photo, occupation, nationality, Q number), so he may be following up, too. In the meantime, I think what you and Ritchie have developed will be a good start. --Rosiestep (talk) 18:16, 12 October 2018 (UTC)

  BRFA filedRonhjones  (Talk) 15:32, 15 October 2018 (UTC)

Orphan Template:IncompleteEdit

Please can someone remove all calls to Template:Incomplete in mainspace? Per the Tfd all those with specified reasons have been converted to {{missing information}}. The remainder are not suitable for this template and can simply be removed. There are 5129 transclusions but only those in article space or draft space are important. Thank you — Martin (MSGJ · talk) 19:01, 10 October 2018 (UTC)

  Done — JJMC89(T·C) 08:34, 12 October 2018 (UTC)

Generating a list of a sub-set of articles using a specific templateEdit

Not sure if this is the right place, so sorry in advance if it isn't.

Would it be possible to get a list of all main-space articles (so no talk-, user-, template-pages, etc.) using Template:Infobox television episode or Template:Infobox Television episode that also have parenthesis in their title (basically I want all disambiguated articles using these templates)? --Gonnym (talk) 19:43, 10 October 2018 (UTC)

6k or so. May include some redirects--handling for redirects in search is optimized for readers rather than editors, so sometimes results aren't great. --Izno (talk) 20:41, 10 October 2018 (UTC)
Thanks a lot Izno! --Gonnym (talk) 21:16, 10 October 2018 (UTC)
Here's a Quarry query to get the information. This includes all the redirects to Template:Infobox television episode], not just Template:Infobox Television episode, and specifically selects only titles that end with something looking like a disambiguator. Anomie 12:38, 14 October 2018 (UTC)

Updating links from album articles to AllMusicEdit

Hello! There are many articles about albums that have links to album reviews on the AllMusic website. It seems that some time ago, I'm not sure when, AllMusic changed their URLs. So now, a lot of the links to the AllMusic reviews are to obsolete URLs. It would be great if a bot could find these and change them to the current URLs. But, I'm not sure how hard or easy it would be for a bot to figure out the current URLs. (I've updated some of these manually, and I can generally find the current AllMusic page for an album review by doing a search for the album on the AllMusic site itself.) As a further complication, the references in some of the articles contain the URL for the AllMusic review itself, while others (for example this one) use the {{AllMusic}} template to generate the URL. I don't know how many articles include the outdated links, but I would think there have to be thousands of them. So, how does this sound so far? Mudwater (Talk) 18:29, 11 October 2018 (UTC)

Given this old and new link how would a bot determine r1701846 -> mw0000649874? Link to discussion Wikipedia_talk:WikiProject_Albums#AllMusic_links. -- GreenC 18:50, 11 October 2018 (UTC)
I haven't detected a pattern that can be used to convert programmatically from the old links to the new ones. If there isn't one, then the bot would have to use some kind of search to find the new links. And as I said, I'm not sure how hard or easy that would be to implement. Mudwater (Talk) 19:40, 11 October 2018 (UTC)
This is difficult but one possible solution: checking the old URL at Wayback returns the redirected URL, thus resolving r1701846 -> mw0000649874 [10]. If they all have this I don't know. One could extract the new URL from the redirected Wayback URL. Probably need to go through 10 of 20 samples to see how many work and if the rule holds. If so it might be automated. -- GreenC 20:53, 11 October 2018 (UTC)
Sounds like a promising line of inquiry! Mudwater (Talk) 22:10, 11 October 2018 (UTC)
I'll take this on. Will be busy the next 2-3 weeks might make some progress, if not after that. Number of complications including archive URLs (can't modify a source URL if there is corresponding archive URL already in place), the AllMusic template and probably more than the "/album/" links. A bot probably won't get them all but will narrow the field. -- GreenC 15:00, 12 October 2018 (UTC)
Sounds good to me. Thanks! Mudwater (Talk) 00:15, 13 October 2018 (UTC)

Mass move of election articlesEdit

The naming guideline for election articles was recently amended as a result of an RfC to move the year from the end to the start of the article title. As part of the proposal, I stated that if successful, I would request a bot run to move the thousands of articles affected.

As the RfC was closed in favour of the change, we now need a bot run to move the articles. I have prepared this offline in an Excel file and can also provide it in a txt file to the bot owner by email. If it needs to be on-wiki, I can create a few pages in my sandbox with a full list of the proposed moves. Cheers, Number 57 21:52, 15 October 2018 (UTC)

@Number 57: It's been a while (couple months since I've done bot work), but I'm interested. Just need a bit for some API research and some time to wrap my head around this. Also need to figure out how many articles would be affected and how to find them. --TheSandDoctor Talk 23:03, 15 October 2018 (UTC)
Found them, just need to figure out the "rules" for the bot to rename them by. --TheSandDoctor Talk 23:05, 15 October 2018 (UTC)
Prototyping looks good so far. Email sent as well Number 57. --TheSandDoctor Talk 03:41, 16 October 2018 (UTC)
@TheSandDoctor: No need to find the articles – I've prepared a list of articles that should be changed and the new names (sorry, perhaps should have been clearer above about what I'd prepared offline). Sent by email. Cheers, Number 57 07:25, 16 October 2018 (UTC)