User talk:TedderBot/AOP

Latest comment: 14 years ago by Tedder in topic This is alive
Please contact me at User talk:Tedder for issues related to this project.

tedder (talk) 03:40, 23 January 2010 (UTC)Reply



The following discussion is closed. Please do not modify it. Subsequent comments should be made on the appropriate discussion page. No further edits should be made to this discussion.


Hello world

edit

If you don't feel like editing the AOP page, feel free to leave feedback here. tedder (talk) 06:41, 13 August 2009 (UTC)Reply

Early discussion

edit

(stolen from User talk:EncMstr's page - tedder)

I'm writing code to automate Wikipedia:WikiProject Oregon/Admin; hope you don't mind. Initial results say there are currently 8237 articles tagged with the Oregon template; can you run it by hand and see if you agree? I'd appreciate any input you have, including "don't do it!" tedder (talk) 02:05, 13 August 2009 (UTC)Reply

I don't mind at all. It would be a relief to have that done automatically. It'd be nice to summarize the count of articles, images, categories, etc. into a table somewhere too. That could be done much more often (like weekly) than the /Admin page update, which I'd expect every 4 or 8 weeks, something like that.
I did the category capture in the last 20 minutes. There are a total of 11,159 pages tagged: 1023 categories, 528 media files, 387 portal pages, 9008 articles, 193 templates, 1 user talk page, and 19 Wikipedia pages. —EncMstr (talk) 05:09, 13 August 2009 (UTC)Reply
Eek! Big green box! Anyway...9008 articles??? Wowie zowie! I'd love to see a graph of our article creation progress. Looks like it's ramped up, not slowed down like you would expect. We need to throw ourselves a party when we reach 10,000. Katr67 (talk) 05:39, 13 August 2009 (UTC)Reply
Yeah, I was thinking weekly for /admin, because it's easy to do. (AlexNewArtBot/Oregon runs daily, after all). I'll throw the output into a log, which will be relatively easy to parse later. 9008 articles? Lower than the 8237 I pulled, but that might be right. Hmm. I'd like to have those counts match. How do you do the "category capture"?
Katr- maybe our favorite aunt will bake us a cake? tedder (talk) 05:44, 13 August 2009 (UTC)Reply
I copy and paste all the entries in Category:WikiProject Oregon pages. The total count there is now 26,576, updated each time this page is viewed. Hmmm, 10,401. I wonder what it doesn't count?
It might be helpful to compare the lists of article names you generated. Or I could make my list available to you? —EncMstr (talk) 06:08, 13 August 2009 (UTC)Reply
Yeah, make your list available to me, it's probably easier that way. Want to update /admin, then I'll just write a script to diff them? tedder (talk) 06:23, 13 August 2009 (UTC)Reply

(moving left) Hmm. I was using backlinks to pull anything that has the template. If I use categorymembers and the category, I get almost the same number. Here they are:

  • template: 8287
  • category: 8286

It'd be nice to find why those ~620 articles aren't on my list. tedder (talk) 07:05, 13 August 2009 (UTC)Reply

Sent a note to EncMstr, I think his list has duplicates in it. Once I remove them, I get the right number:
egrep "^Talk:" encmstr-wpore.txt | sort | uniq | wc -l
8286
tedder (talk) 20:31, 13 August 2009 (UTC)Reply

/admin list is up

edit

Here's my version of the /admin list: User:TedderBot/AOP/admin. tedder (talk) 04:05, 1 October 2009 (UTC)Reply

Looks good, though I didn't do any detailed checks—that's an exercise left for you.... If you like, I'll update the list the way I've been doing it so you can compare a shorter list of expected discrepancies. —EncMstr (talk) 05:35, 1 October 2009 (UTC)Reply
I'll be responsible for that, sure :-) Post the updated list and I'll compare them. It was nice 6-7 weeks ago when I last touched it, so hopefully it looks good. tedder (talk) 05:58, 1 October 2009 (UTC)Reply

List comparisons

edit

I finally found the source of duplicate entries. It was my fault pasting duplicate pages (groups of 200 entries) due to the slow responsiveness of Windows copy operation leading to pasting twice without any visible difference than one paste. Well, okay the target editor changed—for example—from at line 4637 to at line 4837 which escape my attention. The last updated version is carefully checked for duplicates, and I think it's clean. —EncMstr (talk) 03:45, 17 October 2009 (UTC)Reply

Aha. I ran against yours and had a lot of differences. I finally realized I needed to simply take the lists and sort them the same (using sort (Unix)). Finally they were exactly the same, with three exceptions, which were UTF8 errors on your feed. That means the following are redlinks because of copy/paste issues:
So I'm satisfied that our data is the same through both processes. Are you? tedder (talk) 06:17, 17 October 2009 (UTC)Reply
Yes indeed. —EncMstr (talk) 01:28, 17 November 2009 (UTC)Reply

/admin2 results

edit

Okay, so I have admin2 up: User:TedderBot/AOP/admin2

I had some trouble getting the Wikipedia: project to match. Part of it is I'm combining the category results and the backlinks ("what links here"). I had to discard the latter results from the "Wikipedia:" and "Wikipedia Talk:" namespaces. It now matches, except I have one more entry: Wikipedia:WikiProject Oregon/Assessment/A class assessments. EncMstr, do you know why that one isn't in your results? tedder (talk) 03:50, 7 November 2009 (UTC)Reply

'cos I just tagged it yesterday? Katr67 (talk) 05:16, 7 November 2009 (UTC)Reply
The history shows it hasn't been edited in 18 months. Eh? tedder (talk) 06:13, 7 November 2009 (UTC)Reply
I tagged the talk page: [1] with {{WikiProject Oregon}}. That's what makes the pages show up in the searches. Katr67 (talk) 19:02, 10 November 2009 (UTC)Reply
Aha. I was seeing the banner on the main page, didn't check out the talk page. tedder (talk) 19:33, 10 November 2009 (UTC)Reply

more done

edit

Progress:

In short, it's almost ready to go. I need verification from User:EncMstr on the /admin2 section, we may do a synchronized pull so I can compare it. But at least the numbers are within reason, so it's getting close! tedder (talk) 07:13, 7 November 2009 (UTC)Reply

Okay, let's plan on a synchronized grab soon. Later this evening works for me, as well as tomorrow afternoon. —EncMstr (talk) 01:30, 17 November 2009 (UTC)Reply
Great. I can do it tonight or tomorrow, give me a time here or on email. tedder (talk) 01:38, 17 November 2009 (UTC)Reply

admin2 comparison

edit

So User:EncMstr and I ran our admin2 collections at the same time. Those results are up, so the following will be my comparison of them.

Initial comparison: old=2213 entries, new=2219 entries:

  • 477 images (same)
  • 1115 categories (same)
  • 408 portals (same)
  • 193 templates old / 199 new
  • 20 projects (same)

A (normalized) diff showed the following entries are on my AOP version, but not on EncMstr's version.

Those marked with (r) are redirects- the others are not. Some finagling with the code fixed that- I was looking for WPOregon backlinks, but needed to exclude those on the templates (as I have on the project pages).

Finally, EncMstr updated the text in /admin2 header after I developed it. So I made a quick little update of the text, committed the changes, and now things are identical. Woohoo!

EncMstr, want to give your A-OK before I run it through WP:BRFA and have it update the live page? tedder (talk) 05:20, 22 November 2009 (UTC)Reply

Looks good. I compared Wikipedia:WikiProject Oregon/Admin2 which was last updated 2009-11-17T15:49:34 with User:TedderBot/AOP/admin2 last updated 2009-11-21T21:08:53. Those 4.22 days difference easily account for the difference observed (reformatted slightly for viewing pleasure here):
55d54
< |[ [:Category:Book publishing companies of Oregon]]||category
525a525
> |[ [:Category:Oregon Ducks football culture and lore]]||category
554a555
> |[ [:Category:Oregon State Beavers football culture and lore]]||category
619d619
< |[ [:Category:Oregon journalists]]||category
1163d1162
< |[ [:File:DeschutesBrewery.png]]||file
To get a sensible difference, I removed the header and trailer so that only table row entries remained, sorted the wikitext, removed the first half file consisting of |- and replaced [[:file:* with [[:File:*.
I apologize for the delay finishing this. Much thanks for seeing to this, and removing a bookkeeping load from my task list! —EncMstr (talk) 17:38, 16 January 2010 (UTC)Reply
Thanks for the blessing, Enc. That's similar to my results, which was they were very close. Here's the breakdown:
So at least the differences have logical reasons. I've created the BRFA for this: Wikipedia:Bots/Requests for approval/TedderBot 3. tedder (talk) 23:26, 16 January 2010 (UTC)Reply

This is alive

edit

The bot was 'speedy approved' today. Further questions should be directed to my user talk page to make sure I actually see them. tedder (talk) 03:40, 23 January 2010 (UTC)Reply

The discussion above is closed. Please do not modify it. Subsequent comments should be made on the appropriate discussion page. No further edits should be made to this discussion.