User talk:TedderBot/AOP
- Please contact me at User talk:Tedder for issues related to this project.
tedder (talk) 03:40, 23 January 2010 (UTC)
The following discussion is closed. Please do not modify it. Subsequent comments should be made on the appropriate discussion page. No further edits should be made to this discussion.
Hello world
editIf you don't feel like editing the AOP page, feel free to leave feedback here. tedder (talk) 06:41, 13 August 2009 (UTC)
Early discussion
edit(stolen from User talk:EncMstr's page - tedder)
I'm writing code to automate Wikipedia:WikiProject Oregon/Admin; hope you don't mind. Initial results say there are currently 8237 articles tagged with the Oregon template; can you run it by hand and see if you agree? I'd appreciate any input you have, including "don't do it!" tedder (talk) 02:05, 13 August 2009 (UTC)
- I don't mind at all. It would be a relief to have that done automatically. It'd be nice to summarize the count of articles, images, categories, etc. into a table somewhere too. That could be done much more often (like weekly) than the /Admin page update, which I'd expect every 4 or 8 weeks, something like that.
- I did the category capture in the last 20 minutes. There are a total of 11,159 pages tagged: 1023 categories, 528 media files, 387 portal pages, 9008 articles, 193 templates, 1 user talk page, and 19 Wikipedia pages. —EncMstr (talk) 05:09, 13 August 2009 (UTC)
- Eek! Big green box! Anyway...9008 articles??? Wowie zowie! I'd love to see a graph of our article creation progress. Looks like it's ramped up, not slowed down like you would expect. We need to throw ourselves a party when we reach 10,000. Katr67 (talk) 05:39, 13 August 2009 (UTC)
- Yeah, I was thinking weekly for /admin, because it's easy to do. (AlexNewArtBot/Oregon runs daily, after all). I'll throw the output into a log, which will be relatively easy to parse later. 9008 articles? Lower than the 8237 I pulled, but that might be right. Hmm. I'd like to have those counts match. How do you do the "category capture"?
- Katr- maybe our favorite aunt will bake us a cake? tedder (talk) 05:44, 13 August 2009 (UTC)
- I copy and paste all the entries in Category:WikiProject Oregon pages. The total count there is now 26,576, updated each time this page is viewed. Hmmm, 10,401. I wonder what it doesn't count?
- It might be helpful to compare the lists of article names you generated. Or I could make my list available to you? —EncMstr (talk) 06:08, 13 August 2009 (UTC)
- Yeah, make your list available to me, it's probably easier that way. Want to update /admin, then I'll just write a script to diff them? tedder (talk) 06:23, 13 August 2009 (UTC)
(moving left) Hmm. I was using backlinks to pull anything that has the template. If I use categorymembers and the category, I get almost the same number. Here they are:
- template: 8287
- category: 8286
It'd be nice to find why those ~620 articles aren't on my list. tedder (talk) 07:05, 13 August 2009 (UTC)
- Sent a note to EncMstr, I think his list has duplicates in it. Once I remove them, I get the right number:
egrep "^Talk:" encmstr-wpore.txt | sort | uniq | wc -l 8286
/admin list is up
editHere's my version of the /admin list: User:TedderBot/AOP/admin. tedder (talk) 04:05, 1 October 2009 (UTC)
- Looks good, though I didn't do any detailed checks—that's an exercise left for you.... If you like, I'll update the list the way I've been doing it so you can compare a shorter list of expected discrepancies. —EncMstr (talk) 05:35, 1 October 2009 (UTC)
- I'll be responsible for that, sure :-) Post the updated list and I'll compare them. It was nice 6-7 weeks ago when I last touched it, so hopefully it looks good. tedder (talk) 05:58, 1 October 2009 (UTC)
List comparisons
editI finally found the source of duplicate entries. It was my fault pasting duplicate pages (groups of 200 entries) due to the slow responsiveness of Windows copy operation leading to pasting twice without any visible difference than one paste. Well, okay the target editor changed—for example—from at line 4637 to at line 4837 which escape my attention. The last updated version is carefully checked for duplicates, and I think it's clean. —EncMstr (talk) 03:45, 17 October 2009 (UTC)
- Aha. I ran against yours and had a lot of differences. I finally realized I needed to simply take the lists and sort them the same (using sort (Unix)). Finally they were exactly the same, with three exceptions, which were UTF8 errors on your feed. That means the following are redlinks because of copy/paste issues:
- Ahmad Rasha-d (should be Ahmad Rashād)
- Danijela Krstic' (should be Danijela Krstić)
- Yo-suke Matsuoka (should be Yōsuke Matsuoka)
- So I'm satisfied that our data is the same through both processes. Are you? tedder (talk) 06:17, 17 October 2009 (UTC)
- Yes indeed. —EncMstr (talk) 01:28, 17 November 2009 (UTC)
/admin2 results
editOkay, so I have admin2 up: User:TedderBot/AOP/admin2
I had some trouble getting the Wikipedia: project to match. Part of it is I'm combining the category results and the backlinks ("what links here"). I had to discard the latter results from the "Wikipedia:" and "Wikipedia Talk:" namespaces. It now matches, except I have one more entry: Wikipedia:WikiProject Oregon/Assessment/A class assessments. EncMstr, do you know why that one isn't in your results? tedder (talk) 03:50, 7 November 2009 (UTC)
- 'cos I just tagged it yesterday? Katr67 (talk) 05:16, 7 November 2009 (UTC)
- The history shows it hasn't been edited in 18 months. Eh? tedder (talk) 06:13, 7 November 2009 (UTC)
- I tagged the talk page: [1] with {{WikiProject Oregon}}. That's what makes the pages show up in the searches. Katr67 (talk) 19:02, 10 November 2009 (UTC)
- Aha. I was seeing the banner on the main page, didn't check out the talk page. tedder (talk) 19:33, 10 November 2009 (UTC)
- I tagged the talk page: [1] with {{WikiProject Oregon}}. That's what makes the pages show up in the searches. Katr67 (talk) 19:02, 10 November 2009 (UTC)
- The history shows it hasn't been edited in 18 months. Eh? tedder (talk) 06:13, 7 November 2009 (UTC)
more done
editProgress:
- the /admin2 list is ready except for verification
- the "show on tedderbot" vs "show on live site" code is in
- the bot is updating to a log: User:TedderBot/AOP/log
In short, it's almost ready to go. I need verification from User:EncMstr on the /admin2 section, we may do a synchronized pull so I can compare it. But at least the numbers are within reason, so it's getting close! tedder (talk) 07:13, 7 November 2009 (UTC)
- Okay, let's plan on a synchronized grab soon. Later this evening works for me, as well as tomorrow afternoon. —EncMstr (talk) 01:30, 17 November 2009 (UTC)
- Great. I can do it tonight or tomorrow, give me a time here or on email. tedder (talk) 01:38, 17 November 2009 (UTC)
admin2 comparison
editSo User:EncMstr and I ran our admin2 collections at the same time. Those results are up, so the following will be my comparison of them.
Initial comparison: old=2213 entries, new=2219 entries:
- 477 images (same)
- 1115 categories (same)
- 408 portals (same)
- 193 templates old / 199 new
- 20 projects (same)
A (normalized) diff showed the following entries are on my AOP version, but not on EncMstr's version.
- (r) Template:WP Oregon
- (r) Template:WPOREGON
- Template:WikiProject Oregon/class
- Template:WikiProject Oregon/sandbox
- Template:WikiProjectBannerShell/Compliant banner list
- (r) Template:Wikiproject Oregon
Those marked with (r) are redirects- the others are not. Some finagling with the code fixed that- I was looking for WPOregon backlinks, but needed to exclude those on the templates (as I have on the project pages).
Finally, EncMstr updated the text in /admin2 header after I developed it. So I made a quick little update of the text, committed the changes, and now things are identical. Woohoo!
EncMstr, want to give your A-OK before I run it through WP:BRFA and have it update the live page? tedder (talk) 05:20, 22 November 2009 (UTC)
- Looks good. I compared Wikipedia:WikiProject Oregon/Admin2 which was last updated 2009-11-17T15:49:34 with User:TedderBot/AOP/admin2 last updated 2009-11-21T21:08:53. Those 4.22 days difference easily account for the difference observed (reformatted slightly for viewing pleasure here):
55d54 < |[ [:Category:Book publishing companies of Oregon]]||category 525a525 > |[ [:Category:Oregon Ducks football culture and lore]]||category 554a555 > |[ [:Category:Oregon State Beavers football culture and lore]]||category 619d619 < |[ [:Category:Oregon journalists]]||category 1163d1162 < |[ [:File:DeschutesBrewery.png]]||file
- To get a sensible difference, I removed the header and trailer so that only table row entries remained, sorted the wikitext, removed the first half file consisting of |- and replaced [[:file:* with [[:File:*.
- I apologize for the delay finishing this. Much thanks for seeing to this, and removing a bookkeeping load from my task list! —EncMstr (talk) 17:38, 16 January 2010 (UTC)
- Thanks for the blessing, Enc. That's similar to my results, which was they were very close. Here's the breakdown:
- Category:Book publishing companies of Oregon - created on 20091118
- Category:Oregon Ducks football culture and lore - created on 20091103, deleted on 20091119
- Category:Oregon State Beavers football culture and lore - created on 20091103, deleted on 20091119
- File:DeschutesBrewery.png - added to project on 20091121
- So at least the differences have logical reasons. I've created the BRFA for this: Wikipedia:Bots/Requests for approval/TedderBot 3. tedder (talk) 23:26, 16 January 2010 (UTC)
This is alive
editThe bot was 'speedy approved' today. Further questions should be directed to my user talk page to make sure I actually see them. tedder (talk) 03:40, 23 January 2010 (UTC)