User talk:Citation bot/Archive 12

Latest comment: 5 years ago by AManWithNoPlan in topic Timeout at Edward M. Fram
Archive 5 Archive 10 Archive 11 Archive 12 Archive 13 Archive 14 Archive 15

Request: Strip commas at the end of parameters

Status
{{fixed}}
Reported by
Headbomb {t · c · p · b} 23:43, 22 August 2018 (UTC)
What happens
Nothing
What should happen
[1]
We can't proceed until
Feedback from maintainers


This should apply to all parameters. There's no reason for any parameter to end with a comma. Headbomb {t · c · p · b} 23:43, 22 August 2018 (UTC)

Except perhaps author-sep? Martin (Smith609 – Talk) 07:45, 25 August 2018 (UTC)
Yes, that would be the exception. Headbomb {t · c · p · b} 13:31, 25 August 2018 (UTC)
author-separator author-name-separator separator AManWithNoPlan (talk) 16:46, 1 November 2018 (UTC)
https://github.com/ms609/citation-bot/pull/1011 AManWithNoPlan (talk) 16:51, 1 November 2018 (UTC)
Presumably you mean parameter values. And Martin's special case would |author-sep=,, where the comma is the value, not some trailing cruft. ♦ J. Johnson (JJ) (talk) 20:32, 2 November 2018 (UTC)
cs1|2 does not have or support |author-sep=, |author-name-separator=, or |separator=.
{{cite book |title=Title |last=Last1 |first=First1 |last2=Last2 |first2=First2 |publisher=Publisher |author-sep=, |author-name-sep=, |separator=,}}
Last1, First1; Last2, First2. Title. Publisher. {{cite book}}: Unknown parameter |author-name-sep= ignored (help); Unknown parameter |author-sep= ignored (help); Unknown parameter |separator= ignored (help)CS1 maint: numeric names: authors list (link)
Those parameters were deprecated and removed when |mode= was instituted.
One might legitimately set |postscript=,:
{{cite book |title=Title |last=Last1 |first=First1 |last2=Last2 |first2=First2 |publisher=Publisher |postscript=,}}
Last1, First1; Last2, First2. Title. Publisher,{{cite book}}: CS1 maint: numeric names: authors list (link)
Trappist the monk (talk) 20:43, 2 November 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1020 AManWithNoPlan (talk) 20:59, 2 November 2018 (UTC)

Request: chapter-format

Status
{{fixed}}
Reported by
(tJosve05a (c) 09:30, 2 November 2018 (UTC)
What should happen
If removing |chapter-url=, also remove |chapter-format=
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1017 AManWithNoPlan (talk) 14:06, 2 November 2018 (UTC)

Bug: Spaces in ref-tags

Status
{{fixed}}
Reported by
(tJosve05a (c) 09:47, 2 November 2018 (UTC)
What happens
The bot doesn't expand bare URLs in ref tags if there is a space in the </ref >
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=866905443
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1018 AManWithNoPlan (talk) 17:00, 2 November 2018 (UTC)

Interesting new ability for Citoid

phab:T124610#4715397 or rather phab: T198567 might be of interest for the bot as well. (tJosve05a (c) 16:14, 2 November 2018 (UTC)

Please keep an eye on it. When it gets really reliable; then we might add it. For now {{notabug}}. AManWithNoPlan (talk) 14:10, 5 November 2018 (UTC)

PMID low numers

Status
{{fixed}}
Reported by
(tJosve05a (c) 15:20, 3 November 2018 (UTC)
What happens
The bot does not remove |url=https://www.ncbi.nlm.nih.gov/pubmed/1 despite |pmid=1 exists
What should happen
Replace |url=https://www.ncbi.nlm.nih.gov/pubmed/1 with |pmid=1
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Lolita_Lebrón&diff=866697532&oldid=866671113
We can't proceed until
Feedback from maintainers


In https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=867085176 the bot did not add |pmid=1 or |pmid= 11442. (tJosve05a (c) 15:20, 3 November 2018 (UTC)
https://github.com/ms609/citation-bot/pull/1021 AManWithNoPlan (talk) 18:52, 3 November 2018 (UTC)

Request: Expand doi citations in {{cite news}} as well

Status
mostly {{fixed}}
Reported by
(tJosve05a (c) 17:41, 8 October 2018 (UTC)
What should happen
The bot should expand {{cite news|url=http://pubs.acs.org/doi/abs/10.1021/cen-v076n048.p024}}
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1019 AManWithNoPlan (talk) 17:09, 2 November 2018 (UTC)

Do not override specific page with page range

Status
{{fixed}}
Reported by
(tJosve05a (c) 11:03, 5 November 2018 (UTC)
What happens
Bot overrides specific |at=p. 29 with |pages=21–40
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=867384576
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1026 AManWithNoPlan (talk) 15:36, 5 November 2018 (UTC)

Remove doi.org if adding doi parameter

Status
{{fixed}}
Reported by
(tJosve05a (c) 11:09, 5 November 2018 (UTC)
What happens
The bot adds |doi=10.4224/PhysRevA.96.042113.data
What should happen
The bot should remove |url=https://doi.org/10.4224/PhysRevA.96.042113.data since the doi resolves
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=867385059
We can't proceed until
Feedback from maintainers


we only remove the URL if the doi is in CrossRef. Probably should make an execption for doi.org. AManWithNoPlan (talk) 14:29, 5 November 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1025 AManWithNoPlan (talk) 15:33, 5 November 2018 (UTC)

Emergency blacklist

zenodo.org has been blacklisted on English Wikipedia for copyright reasons (at least for now). Please disable the addition of it (and allow other edits to be made; the bot currently fails on Radon). See Special:PermanentLink/867438103#zenodo.org. (Courtesy ping Nemo bis and JzG) (tJosve05a (c) 21:42, 5 November 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1029 (tJosve05a (c) 21:51, 5 November 2018 (UTC)
and yet researchgate is cool. AManWithNoPlan (talk) 22:20, 5 November 2018 (UTC)
i am glad we currently clean up those urls, so we convert pdf links to landing pages. AManWithNoPlan (talk) 22:22, 5 November 2018 (UTC)
Well, even cleaning up https://zenodo.org/record/1000677/files/article.pdf to https://zenodo.org/record/1000677 is blacklisted (/me mumbles something angrily) (tJosve05a (c) 22:34, 5 November 2018 (UTC)
Yes. A friend cannot upload her papers to ResearchGate but can upload them to Zenodo. I think that may be telling. Guy (Help!) 23:40, 5 November 2018 (UTC)
Blocked and {{fixed}}. Also, a second pull is in place to turn it back off, if that is possible. You are correct, it is one thing to violate your own papers' copyright; but it is another thing to violate everyone's papers copyrights. AManWithNoPlan (talk) 18:25, 6 November 2018 (UTC)

Bug: Publisher weirdness

Status
{{fixed}}
Reported by
Headbomb {t · c · p · b} 14:25, 13 October 2018 (UTC)
Relevant diffs/links
[2]
We can't proceed until
Feedback from maintainers


Did you verify that there are not whites that got changed? i cannot look right now. AManWithNoPlan (talk) 15:44, 13 October 2018 (UTC)
I'm not sure what 'changing whites' would be here, but it did something similar in the previous edit [3], where normally it removes publisher in cite journals. Headbomb {t · c · p · b} 18:14, 13 October 2018 (UTC)
If it does than that's another bug. It shouldn't remove the publisher parameter in cite journal templates unless the publisher value would be the same as the journal value. (And, actually, for optimal meta data it shouldn't even remove it then for as long as it is correct, so that both meta data entries journal and publisher can be populated. Instead, seemingly duplicate values should be detected in the cite template and one of the values suppressed in the output, but not in meta data.)
--Matthiaspaul (talk) 11:44, 15 October 2018 (UTC)
i got auto corrected. whitespaces not whites. AManWithNoPlan (talk) 22:37, 13 October 2018 (UTC)
it drops publisher then google books adds it back AManWithNoPlan (talk) 03:55, 14 October 2018 (UTC)
I just saw it remove a publisher from a "journal" that is really a newsletter whose publisher should not have been removed: Special:Diff/866664956. For major well-established academic journals, removal of publisher may be a good thing, but blindly doing it to all journal citations is not. Citation bot absolutely should not be making this kind of decision, and should not even be suggesting it to human editors (as they too-often fail to exercise any judgement of their own). —David Eppstein (talk) 21:02, 31 October 2018 (UTC)

{{cite magazine}} seems like the proper template is what i am hearing from you. AManWithNoPlan (talk) 21:32, 31 October 2018 (UTC)

This should stop the drop/add cycle https://github.com/ms609/citation-bot/pull/1024 AManWithNoPlan (talk) 15:27, 5 November 2018 (UTC)

Adding publishers to journal

Status
{{fixed}}
Reported by
Headbomb {t · c · p · b} 13:58, 5 November 2018 (UTC)
What happens
https://en.wikipedia.org/w/index.php?title=List_of_Eumolpinae_genera&diff=prev&oldid=867400935
What should happen
The opposite!
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1024 AManWithNoPlan (talk) 15:27, 5 November 2018 (UTC)

Bug: bot has to be run twice

Status
{{fixed}}
Reported by
(tJosve05a (c) 20:17, 20 October 2018 (UTC)
What happens
In edit 1 it adds |chapter=. In edit 2 it changed |url= to |chapter-url=
What should happen
In edit 1: If adding |chapter=, also change |url= to |chapter-url= in same edit.
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=864969377
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1028 AManWithNoPlan (talk) 15:48, 5 November 2018 (UTC)

Better europepmc.org support

Try running the bot on <ref>http://europepmc.org/scanned?pageindex=1&articles=PMC1697073</ref>. You would expect to get something like:

{{Cite journal |last=Barker |first=C. A. |year=1975 |title=The Ontario Veterinary College: Temperance Street era |journal=The Canadian Veterinary Journal = la Revue Veterinaire Canadienne |volume=16 |issue=11 |pages=319–328 |pmc=1697073 |pmid=1104138}}

but instead you will get

{{Cite journal | url=http://europepmc.org/scanned?pageindex=1&articles=PMC1697073 | title=The Ontario Veterinary College: Temperance Street era| journal=The Canadian Veterinary Journal| volume=16| issue=11| pages=319| date=November 1975| last1=Pmc| first1=Europe}}

Completely missing the authors and IDs. (tJosve05a (c) 23:49, 4 November 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1027 AManWithNoPlan (talk) 15:40, 5 November 2018 (UTC)
{{fixed}}

CAPS-insensitive URLs

Status
{{fixed}}
Reported by
(tJosve05a (c) 23:01, 5 November 2018 (UTC)
What happens
https://en.wikipedia.org/w/index.php?title=User:Josve05a/cite-sandbox&diff=867472498&oldid=867472485
What should happen
|url=//JSTOR.org/stable/1733521 > |jstor=1733521
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1030 AManWithNoPlan (talk) 23:21, 5 November 2018 (UTC)

Remove Pubmed URL without www.

Status
{{fixed}}
Reported by
(tJosve05a (c) 00:58, 6 November 2018 (UTC)
What happens
|url=https://ncbi.nlm.nih.gov/pubmed/12357708 is not removed
What should happen
Treat it the same way as |url=https://www.ncbi.nlm.nih.gov/pubmed/12357708
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=867484728
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1030/files AManWithNoPlan (talk) 04:23, 6 November 2018 (UTC)

Do not add URL if PMC exists

Status
{{fixed}}
Reported by
(tJosve05a (c) 10:12, 6 November 2018 (UTC)
What happens
On
{{Cite journal|last=Espy|first=Kimberly Andrews|last2=Fang|first2=Hua|last3=Johnson|first3=Craig|last4=Stopp|first4=Christian|last5=Wiebe|first5=Sandra A.|last6=Respass|first6=Jennifer|date=2011|title=Prenatal tobacco exposure: Developmental outcomes in the neonatal period.|journal=Developmental Psychology|volume=47|issue=1|pages=153–169|doi=10.1037/a0020724|pmid=21038943|pmc=3057676|issn=1939-0599}}
it wants to add |url=http://digitalcommons.unl.edu/cgi/viewcontent.cgi?article=1501&context=psychfacpub
What should happen
Do not add OA-links if |pmc= exists, since PMC is already OA. See bullet point #2 on Wikipedia:OABOT#What kinds of links won't the bot add?
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1035 AManWithNoPlan (talk) 13:56, 6 November 2018 (UTC)

removed archive url

Status
{{fixed}}
Reported by
(tJosve05a (c) 10:02, 8 November 2018 (UTC)
What happens
The bot removes |archiveurl= when changing |url= to |chapter-url=
What should happen
Leave |archiveurl= (at least if only one kind of URL parameter exists)
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User:Josve05a/cite-sandbox&diff=867839758&oldid=867839747
We can't proceed until
Feedback from maintainers


Request: Do not add OA url if |foo-access=free

Status
{{fixed}}
Reported by
(tJosve05a (c) 21:42, 8 November 2018 (UTC)
What should happen
Do not add OA url if |doi-access=free, |jstor-access=free, |ol-access=free, or |osti-access=free is true
We can't proceed until
Feedback from maintainers


Request: The New York Times

Status
{{fixed}}
Reported by
(tJosve05a (c) 23:47, 5 November 2018 (UTC)
What happens
Bot adds |newspaper=The New York Times
What should happen
Also remove |via=www.nytimes.com (and derivations)
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=American_nationalism&diff=prev&oldid=867477333
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1043 AManWithNoPlan (talk) 23:56, 7 November 2018 (UTC)

Reuters

Status
{{fixed}}
Reported by
(tJosve05a (c) 12:36, 1 November 2018 (UTC)
What happens
The bot added |newspaper=Reuters when |publisher=[[Reuters]] existed
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Loudness_war&diff=prev&oldid=866766938
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1044 AManWithNoPlan (talk) 00:03, 8 November 2018 (UTC)

Bug: Failed to remove doi.org URL

Status
{{fixed}}
Reported by
(tJosve05a (c) 21:59, 8 November 2018 (UTC)
What should happen
Remove |url=
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=867928878
We can't proceed until
Feedback from maintainers


If running the bot again, it removes it. Perhaps a "run bot muliple times, until no changes is attemeted, before saving the edit" rule should be implemented. (tJosve05a (c) 22:01, 8 November 2018 (UTC)

That is a horrible idea. Although I too have considered it. AManWithNoPlan (talk) 23:06, 8 November 2018 (UTC)
Yeah (hence it being in small). I can just imagine the bot edit warring with it self back-and-forth...however, it logically feels as if "all possible edits should be made" before saving the change. (tJosve05a (c) 23:09, 8 November 2018 (UTC)
And the bot gets banned from database access for repeats and edits take two to three times longer..... AManWithNoPlan (talk) 23:11, 8 November 2018 (UTC)
And during periods of high use, big edits fail since the bot is too busy double checking itself.... AManWithNoPlan (talk) 23:12, 8 November 2018 (UTC)
A (short) time-out for "second round" could be added, or only do it for "small" articles (i.e. if running it manually on a short section), or only run twice if there is not high-use (if that could be "tracked"). Not advocating this be implemented here, though. The issue at hand can (hopefully) be patched this time. Just a thought.(tJosve05a (c) 23:15, 8 November 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1049 AManWithNoPlan (talk) 23:53, 8 November 2018 (UTC)

Request: More caps

Status
{{fixed}}
Reported by
(tJosve05a (c) 22:50, 8 November 2018 (UTC)
What happens
|journal=Bjpsych International
|journal=Ieee Transactions on Computers
|journal=Papers from the Workship Within the Framework of the XIII International Congress of Celtic Studies
What should happen
|journal=BJPsych International
|journal=IEEE Transactions on Computers
|journal=Papers from the Workship within the Framework of the XIII International Congress of Celtic Studies
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1042 AManWithNoPlan (talk) 23:13, 8 November 2018 (UTC)

Bug: DOI broken, or not broken - that is the question

Status
new bug
Reported by
(tJosve05a (c) 12:37, 10 November 2018 (UTC)
What happens
In edit 1 the bot marked 10.1525/nr.2017.20.4.13 as broken. In edit 2, mere seconds later, it marked it as functioning.
We can't proceed until
Feedback from maintainers


Impossible to know, since I do not know what your sent to the bot. AManWithNoPlan (talk) 14:59, 10 November 2018 (UTC)

{{notabug}} probably is one time fluke AManWithNoPlan (talk) 19:01, 10 November 2018 (UTC)

Bug: Unlinking

Status
{{fixed}}
Reported by
wumbolo ^^^ 22:28, 28 October 2018 (UTC)
What happens
newspaper title is unlinked when {{cite web}} is changed to {{cite news}}
What should happen
I don't exactly know, but certainly not this
Relevant diffs/links
[4]
We can't proceed until
Feedback from maintainers


drops one them adds other. weird.AManWithNoPlan (talk) 04:39, 31 October 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1050 AManWithNoPlan (talk) 05:21, 10 November 2018 (UTC)

Encyclopedia

Status
{{fixed}}
Reported by
(tJosve05a (c) 23:53, 11 November 2018 (UTC)
What happens
It adds |title=The Encyclopedia of Libertarianism despite |encyclopedia=The Encyclopedia of Libertarianism exists
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=868402210
We can't proceed until
Feedback from maintainers


Follow-up from User talk:Citation bot/Archive 11#Request: Encyclopedia

CAPS: Oecd

OECD should always be capitalized. I've seen it both in |last1= and |publisher=. <ref>https://dx.doi.org/10.1787/9789264239012-en</ref> adds |last1=Oecd (tJosve05a (c) 10:47, 12 November 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1053 AManWithNoPlan (talk) 16:31, 12 November 2018 (UTC)

{{fixed}}

Ultra slow

Status
new bug
Reported by
Headbomb {t · c · p · b} 16:38, 13 November 2018 (UTC)
What happens
Bot takes forever to run. Hours+, times out all the time, etc...
Replication instructions
Click [5] to run the bot
We can't proceed until
Feedback from maintainers


It has been that way for about a day. Sometimes you will get lucky and get a 500 error instead of timeout. AManWithNoPlan (talk) 16:56, 13 November 2018 (UTC)

Yup. Either, this is very annoying. Headbomb {t · c · p · b} 17:04, 13 November 2018 (UTC)
Although frustrating, these very slow runs do often perform the requested edits even if they never return to display a result. Lithopsian (talk) 20:19, 13 November 2018 (UTC)
Status
new bug
Reported by
  —Chris Capoccia TC 18:45, 13 November 2018 (UTC)
What happens
Citation bot does not finish any page and just times out
We can't proceed until
Feedback from maintainers


{{fixed}}

Request: Single quotes misused as arrows should be unchanged

Status
{{fixed}}
Reported by
(tJosve05a (c) 11:58, 14 November 2018 (UTC)
What happens
''
What should happen
nothing
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=868782545
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1059 AManWithNoPlan (talk) 16:27, 14 November 2018 (UTC)

Similar names in newspaper/via/publisher are redundant

Status
{{fixed}}
Reported by
(tJosve05a (c) 11:32, 14 November 2018 (UTC)
What happens
adds |newspaper=The New York Times but doesn't remove |publisher=New York Times
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=868780380
We can't proceed until
Feedback from maintainers


This could be generalized to anything that differs only by a leading 'the'. Headbomb {t · c · p · b} 16:25, 14 November 2018 (UTC)
Good idea The Headbomb. I might want to create a case-intensive str_is_basically_the_same() function. AManWithNoPlan (talk) 16:32, 14 November 2018 (UTC)
https://github.com/ms609/citation-bot/pull/1066 AManWithNoPlan (talk) 21:12, 14 November 2018 (UTC)

Bug: doi.library.ubc.ca is a DOI resolver, not a website

Status
{{fixed}}
Reported by
(tJosve05a (c) 14:38, 14 November 2018 (UTC)
What happens
Bot added |url=https://doi.library.ubc.ca/10.14288/1.0071732 as a OA-link.
What should happen
|doi=10.14288/1.0071732 and |url=https://doi.library.ubc.ca/10.14288/1.0071732 both lnks to the same place. And it has a recognized doi in the path, and should be removed. We should not add such links.
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Stainless_steel&diff=868800317&oldid=868799341
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1058 AManWithNoPlan (talk) 16:08, 14 November 2018 (UTC)

and https://github.com/ms609/citation-bot/pull/1072 AManWithNoPlan (talk) 18:16, 16 November 2018 (UTC)

Bug: Google Maps publisher is Google

Status
{{fixed}}
Reported by
(tJosve05a (c) 12:05, 15 November 2018 (UTC)
What happens
Removes |publisher=Google for citations to Google Maps
Relevant diffs/links
Don't
We can't proceed until
Feedback from maintainers


* {{Citation | publisher = Google | url = https://maps.google.com/maps/ms?ie=UTF8&hl=en&msa=0&msid=210554752554258740073.00045675b996d14eb6c3a&ll=6.839971,28.205177&spn=170.959424,24.609375&z=1 | type = map (non-exhaustive) | title = Participatory budgeting initiatives around the world}}.

https://github.com/ms609/citation-bot/pull/1070 AManWithNoPlan (talk) 16:19, 15 November 2018 (UTC)

Series/Title

Status
{{fixed}}
Reported by
(tJosve05a (c) 13:23, 15 November 2018 (UTC)
What should happen
Don't add (identical) |series=Handbook of Development Economics if |title=Handbook of Development Economics already exists (without somehow removing one of them)
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=868951621
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1073 (at this moment there is a backlog of 18 pull requests, since I am working half-days and the bot operator is not). AManWithNoPlan (talk) 18:22, 15 November 2018 (UTC)

similar data in title/journal/publisher/etc

Will this fix this as well? (tJosve05a (c) 00:34, 16 November 2018 (UTC)

That can't be fixed by the bot, no. Headbomb {t · c · p · b} 02:19, 16 November 2018 (UTC)
{{wontfix}} AManWithNoPlan (talk) 17:29, 16 November 2018 (UTC)

Change cite book to Cite book

Status
{{fixed}}
Reported by
Headbomb {t · c · p · b} 19:56, 15 November 2018 (UTC)
What happens
[6] [See last diff line]
What should happen
leave caps alone
We can't proceed until
Feedback from maintainers


Huh? ♦ J. Johnson (JJ) (talk) 20:32, 15 November 2018 (UTC)

Obviously the type of template gets changed more than once as the bot does its thing. I think we can fix. AManWithNoPlan (talk) 20:50, 15 November 2018 (UTC)
Also added |journal= to a {{cite book}}. That sounds odd...(tJosve05a (c) 21:00, 15 November 2018 (UTC)
https://github.com/ms609/citation-bot/pull/1074 AManWithNoPlan (talk) 21:30, 15 November 2018 (UTC)

Request: Minimal cite conference support (doi url)

Status
{{fixed}}
Reported by
(tJosve05a (c) 21:30, 15 November 2018 (UTC)
What should happen
In {{cite conference}} Replace |url=https://dx.doi.org/10.1016/j.eswa.2012.02.120 with |doi=10.1016/j.eswa.2012.02.120
Relevant diffs/links
Should do: https://en.wikipedia.org/w/index.php?title=Health_care&diff=869013786&oldid=869013715
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1075 AManWithNoPlan (talk) 21:33, 15 November 2018 (UTC)

wrong message

In: https://en.wikipedia.org/w/index.php?title=Quarter-inch_cartridge&curid=895102&diff=869312464&oldid=866498559 it changed from ISBN-10 to ISBN-13, which is fine, but the message says that it removed the access-date. That seems a different question, so should have appropriate message. Gah4 (talk) 21:23, 17 November 2018 (UTC)

The full message is "Alter: isbn. Removed accessdate with no specified URL". It covers both, but admittedly the amount of texted changed appears to be inversely proportional to the length of the message text. AManWithNoPlan (talk) 22:22, 17 November 2018 (UTC)
{{notabug}} AManWithNoPlan (talk) 19:34, 18 November 2018 (UTC)

Weird doi and hdl formats which works

What should be done with
. doi:10013/epic.10107.d001. hdl:10013/epic.10107.d001. {{cite journal}}: Check |doi= value (help); Cite journal requires |journal= (help); Missing or empty |title= (help)
The links works, but it is not allowed formats, and the bot does not expand from them. (tJosve05a (c) 21:15, 19 November 2018 (UTC)

Probably nothing? Let users fix the errors themselves? Headbomb {t · c · p · b} 21:51, 19 November 2018 (UTC)
The valid identifiers, btw, are doi:10.2312/BzP_0107_1992 and hdl:10013/epic.10107. Headbomb {t · c · p · b} 21:52, 19 November 2018 (UTC)
The doi link works since dx.doi.org will resolve non-doi hdl. AManWithNoPlan (talk) 22:55, 19 November 2018 (UTC)
Maybe so, but it's still not a valid DOI. Headbomb {t · c · p · b} 23:02, 19 November 2018 (UTC)
What should be done is that the DOI should be fixed by a human to conform with the DOI specifications. DOI.org is under no obligation to support non-conforming DOI values, and they could remove their de facto support at any time. – Jonesey95 (talk) 05:42, 20 November 2018 (UTC)

{{wontfix}} and will leave for humans to fix. AManWithNoPlan (talk) 21:30, 20 November 2018 (UTC)

Bug: SSRN Electronic Journal

Status
new bug
Reported by
(tJosve05a (c) 06:27, 20 November 2018 (UTC)
What happens
|journal=SSRN Electronic Journal despite https://github.com/ms609/citation-bot/pull/1057
What should happen
|journal=Middle East Review of International Affairs
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=869766750
We can't proceed until
Feedback from maintainers


yeah. that accidently got fixed in a case-sensitive way. waiting for another pull. AManWithNoPlan (talk) 13:24, 20 November 2018 (UTC)

{{fixed}} AManWithNoPlan (talk) 15:29, 20 November 2018 (UTC)

Request: clean up google search so-called references

While I'm not sure why a citation to Google Search should ever appear in an article, they do quite a lot. It would be good if the bot would remove unnecessary parameters for such URLs as well, as it does with Google Books.

Unnecessary parameters in https://www.google.com/search?q=%22institute+for+sustainable+weight+loss%22&oq=%22institute+for+sustainable+weight+loss%22&aqs=chrome..69i57j69i59.14823j0j7&sourceid=chrome&ie=UTF-8

  • aqs=chrome..69i57j69i59.14823j0j7 Assisted Query Stats - used for logging purposes only
  • sourceid=chrome Where the search originated from - used for logging purposes only
  • ie=UTF-8 input encoding; default is UTF-8

Unnecessary parameters in http://www.google.com/search?hl=en&safe=off&client=firefox-a&rls=com.ubuntu%3Aen-US%3Aunofficial&q=%22west+coast+hotel+co.+v.+parrish%22+(site%3Anewsweek.com+OR+site%3Apost-gazette.com+OR+site%3Ausatoday.com+OR+site%3Awashingtonpost.com+OR+site%3Atime.com+OR+site%3Areuters.com+OR+site%3Aeconomist.com+OR+site%3Amiamiherald.com+OR+site%3Alatimes.com+OR+site%3Asfgate.com+OR+site%3Achicagotribune.com+OR+site%3Anytimes.com+OR+site%3Awsj.com+OR+site%3Ausnews.com+OR+site%3Amsnbc.com+OR+site%3Anj.com+OR+site%3Atheatlantic.com)&aq=o&oq=&aqi=

  • client=firefox-a Where the search originated from - used for logging purposes only
  • rls=com.ubuntu%3Aen-US%3Aunofficial
  • hl=en (Not sure if we want to keep this and point to the English version specifically of the search result or not)
  • aq=o version of the client - used for loggig purposes
  • oq= Original query - i.e. previous search query
  • aqi=

Unnecessary parameters in https://www.google.com/search?q=roosevelt+in+hoxsey%27s+plane (not mentioned earlier):

  • tbm=isch
  • sa=X
  • ved=0ahUKEwie-7Cq4snbAhWxpFkKHYhmDvAQ_AUICigB
  • biw=837
  • bih=607

(tJosve05a (c) 19:37, 7 October 2018 (UTC)

On VERY rare occasions they are valid (example: the term xyz is more popular/common than zyx on the Internet). Almost all the time, it would be more honest to just say <ref>Look it up yourself loser</ref> AManWithNoPlan (talk) 20:01, 7 October 2018 (UTC)
While I don't disagree with you (at all), I still feel we (read: the bot) should act as if they are all valid, and clean them, and hope that someone else comes along and finds (any) better references. (tJosve05a (c) 20:05, 7 October 2018 (UTC)
   aqs=chrome..69i57j69i59.14823j0j7 Assisted Query Stats - used for logging purposes only
   sourceid=chrome Where the search originated from - used for logging purposes only
   ie=UTF-8 input encoding; default is UTF-8

Unnecessary parameters in http://www.google.com/search?hl=en&safe=off&client=firefox-a&rls=com.ubuntu%3Aen-US%3Aunofficial&q=%22west+coast+hotel+co.+v.+parrish%22+(site%3Anewsweek.com+OR+site%3Apost-gazette.com+OR+site%3Ausatoday.com+OR+site%3Awashingtonpost.com+OR+site%3Atime.com+OR+site%3Areuters.com+OR+site%3Aeconomist.com+OR+site%3Amiamiherald.com+OR+site%3Alatimes.com+OR+site%3Asfgate.com+OR+site%3Achicagotribune.com+OR+site%3Anytimes.com+OR+site%3Awsj.com+OR+site%3Ausnews.com+OR+site%3Amsnbc.com+OR+site%3Anj.com+OR+site%3Atheatlantic.com)&aq=o&oq=&aqi=

So, these are the ones to lose client (assume language= is actually important)
   rls=
   aq=
   oq=
   aqi=
   tbm=
   sa=
   ved=
   biw=
   bih=
   aqs=
   sourceid=
   client=
   ie=UTF-8  (Since this is default, but keep any other)
AManWithNoPlan (talk) 02:24, 20 October 2018 (UTC)
https://github.com/ms609/citation-bot/pull/1079 and https://github.com/ms609/citation-bot/pull/1080 AManWithNoPlan (talk) 05:09, 16 November 2018 (UTC)
{{fixed}} AManWithNoPlan (talk) 17:18, 21 November 2018 (UTC)

Bad Data detection: SSRN Electronic Journal

Status
{{fixed}}
Reported by
(tJosve05a (c) 09:31, 14 November 2018 (UTC)
What happens
|journal=SSRN Electronic Journal
What should happen
This is in many cases incorrect, despite Crossref stating this. For e.g. this edit it should be "Middle East Review of International Affairs, Vol. 20, No. 1, pp. 35-59". Perhaps |journal=SSRN Electronic Journal should be forbidden, since there seem to be a lot of misattribution to the real source.
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=868769405
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1057 AManWithNoPlan (talk) 15:10, 14 November 2018 (UTC)

And https://github.com/ms609/citation-bot/pull/1081 AManWithNoPlan (talk) 19:37, 16 November 2018 (UTC)

existing wrong information is not fixed

Status
new bug
Reported by
JRSpriggs (talk) 20:59, 21 November 2018 (UTC)
We can't proceed until
Feedback from maintainers


Please see [7]. It seems extremely improbable that the issue number and page number would both be 061102. JRSpriggs (talk) 20:59, 21 November 2018 (UTC)

{{notabug}} the bot generates perfect output and leaves user input fields alone. AManWithNoPlan (talk) 21:22, 21 November 2018 (UTC)

Request: Cite paper support

Status
{{fixed}}
Reported by
(tJosve05a (c) 21:35, 21 November 2018 (UTC)
What should happen
{{cite paper}} is an alias of {{cite journal}} and should be supported in the same ways as {{cite journal}} is (only let the template name stay the same)
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=870018089#Test
We can't proceed until
Feedback from maintainers


Removal of reference link not justified

Status
new bug
Reported by
Pierre cb (talk) 14:35, 25 November 2018 (UTC)
What happens
Your bot has done this editing https://en.wikipedia.org/w/index.php?title=King_City_weather_radar_station&diff=870540302&oldid=849398472 but the link to the pdf paper is working fine (http://www.tandfonline.com/doi/pdf/10.1080/07055900.1991.9649414), one has to wait a bit longer. Furthermore, the bot removed the Publisher for no good reason as "Atmosphere-Ocean" is the official journal of the Canadian Meteorological and Oceanographic Society. Please adjust it.
We can't proceed until
Feedback from maintainers


The style guides are very clear on not including publishers for Journals. 99% of the time the pdf links to publisher pdfs do not work, and even when they do, they often do not last for long. Anyway, it adds nothing that the doi already provides. AManWithNoPlan (talk) 16:30, 25 November 2018 (UTC)

the correct publisher is tandy anyway. As usual, it was wrong. AManWithNoPlan (talk) 16:40, 25 November 2018 (UTC)
{{wontfix}}

Citing magazine

Status
new bug
Reported by
wumbolo ^^^ 14:17, 24 November 2018 (UTC)
What happens
uses {{cite journal}} for Wired (magazine)
What should happen
use {{cite web}} or {{cite magazine}}
Relevant diffs/links
[8]
We can't proceed until
Feedback from maintainers


I do not think this fixable, since the only way is to maintain a list of 10,000 magazines. Also, the template are actually exactly the same. AManWithNoPlan (talk) 15:40, 24 November 2018 (UTC)

Why did it convert from cite web to cite journal? -- GreenC 15:47, 24 November 2018 (UTC)
They are not exactly the same. The rendering of |issue= and |number= differs, and you cannot set |title=none in cite magazine (there may be other differences). --Izno (talk) 18:41, 24 November 2018 (UTC)
That's news to me. I see that this is fairly new change. AManWithNoPlan (talk) 19:09, 24 November 2018 (UTC)
This pull will help a lot https://github.com/ms609/citation-bot/pull/1104 AManWithNoPlan (talk) 03:33, 27 November 2018 (UTC)
Much better and mostly {{fixed}}

Publisher

Status
new bug
Reported by
Katolophyromai (talk) 13:02, 27 November 2018 (UTC)
What happens
For some odd reason, the bot keeps removing all information from citations about the publisher of the source and the location of the publisher. I have noticed it doing this for a while now and have had to keep cleaning up after it. I do not know if these removals are intentional or accidental, but I see no reason why the bot should be removing publishers from citations, considering that the publisher is a fairly essential piece of information about the source.
Relevant diffs/links
Recent examples of this include the bot's activity here and here. It has done it before, but I cannot find the other examples right away and would have to go looking for them.
We can't proceed until
Feedback from maintainers


All style guides reject including that information for journals. Also, it is often incorrect. The bot has been doing this for over a decade, so I am sure there are other examples. AManWithNoPlan (talk) 15:34, 27 November 2018 (UTC)

Flagging as {{notabug}} until debate is over and this is finalized once and for all. AManWithNoPlan (talk) 17:27, 27 November 2018 (UTC)

CAPS: A.I.E.E.

Status
{{fixed}}
Reported by
(tJosve05a (c) 00:18, 14 November 2018 (UTC)
What happens
|journal=Journal of the A.i.e.e
What should happen
|journal=Journal of the A.I.E.E. (note the intentional trailing dot)
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1060 AManWithNoPlan (talk) 16:46, 14 November 2018 (UTC)

"Om" vs "om"

Status
{{fixed}}
Reported by
(tJosve05a (c) 15:18, 14 November 2018 (UTC)
What should happen
om should not be capitalized in Scandinavian languages (but if it is, it shouldn't automatically be de-capitilized)
We can't proceed until
Feedback from maintainers


Example |journal=Meddelelser om Grønland (tJosve05a (c) 16:59, 14 November 2018 (UTC)

I have have to think about Om vs. om. For now, adding journal titles is good first step. https://github.com/ms609/citation-bot/pull/1060 AManWithNoPlan (talk) 17:03, 14 November 2018 (UTC)
Cf. "A" used uppercase for initialism in [9]. Nemo 07:48, 15 November 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1060 AManWithNoPlan (talk) 16:34, 15 November 2018 (UTC)

Request: JMIR mHealth and uHealth

Status
{{fixed}}
Reported by
(tJosve05a (c) 23:42, 23 November 2018 (UTC)
What happens
|journal=JMIR MHealth and UHealth
What should happen
|journal=JMIR mHealth and uHealth
We can't proceed until
Feedback from maintainers


The Feedback from the Maintainers is that publishers need to be less self-important. 😀😬😜😂 AManWithNoPlan (talk) 23:56, 23 November 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1060 AManWithNoPlan (talk) 00:00, 24 November 2018 (UTC)

More caps: PeerJ

Status
{{fixed}}
Reported by
(tJosve05a (c) 09:23, 25 November 2018 (UTC)
What happens
|journal=Peerj
What should happen
|journal=PeerJ
We can't proceed until
Feedback from maintainers


Added to https://github.com/ms609/citation-bot/pull/1060 AManWithNoPlan (talk) 19:34, 25 November 2018 (UTC)

Request: Bot adds - and . as names

Status
{{fixed}}
Reported by
(tJosve05a (c) 14:26, 26 November 2018 (UTC)
What happens
Bot adds |last1=-|first1=-.
What should happen
Do not add names with no latin letter characters (or some simialar rule)
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=870702100
We can't proceed until
Feedback from maintainers


Authors: - -. is and interesting author. AManWithNoPlan (talk) 14:52, 26 November 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1103 AManWithNoPlan (talk) 01:20, 27 November 2018 (UTC)
and now https://github.com/ms609/citation-bot/pull/1107 AManWithNoPlan (talk) 16:40, 28 November 2018 (UTC)

Fails to remove PMC url

Status
{{fixed}}
Reported by
Headbomb {t · c · p · b} 19:24, 27 November 2018 (UTC)
What happens
Keeps |url=https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3737244/
What should happen
Should remove it, like so [10]
Relevant diffs/links
[11]
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1106 AManWithNoPlan (talk) 16:40, 28 November 2018 (UTC)

Bug: JSTOR API is a cookie monster and wants his cookies

Status
{{fixed}}
Reported by
(tJosve05a (c) 23:27, 5 November 2018 (UTC)
What happens
JSTOR API returned nothing for JSTOR 40238871
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User:Josve05a/cite-sandbox&oldid=867475487
We can't proceed until
Feedback from maintainers


What gives? Temporary issues, or a bug somewhere? (tJosve05a (c) 23:27, 5 November 2018 (UTC)

About seven hours ago the test suite suddenly stopped working. AManWithNoPlan (talk) 23:29, 5 November 2018 (UTC)
We are blocked. AManWithNoPlan (talk) 23:57, 5 November 2018 (UTC)

Caps: J Gerontol A Biol Sci Med Sci

Status
{{fixed}}
Reported by
Headbomb {t · c · p · b} 13:45, 6 December 2018 (UTC)
What happens
[12]
What should happen
[13]
We can't proceed until
Feedback from maintainers


Request: Better UvA-DARE (Digital Academic Repository) support

Status
new bug
Reported by
(tJosve05a (c) 13:10, 22 November 2018 (UTC)
What happens
Bot adds |url=http://dare.uva.nl/personal/pure/en/publications/deaf-discourse-the-social-construction-of-deafness-in-a-bedouin-community(eb0cb2ac-7045-40fe-8bcf-a40b2c534003).html
What should happen
Bot should add |hdl=11245/1.345005 instead
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=870110654
We can't proceed until
Feedback from maintainers


Is there an API or a way to "find this"? Or is it too much work? (tJosve05a (c) 13:10, 22 November 2018 (UTC)

Many users prefer direct links to PDF files rather than records (although librarians and website owners prefer links to HTML pages so that they can track the users more easily). That said, this repository attempts to provide the handle in its HTML metadata, but is misconfigured: <meta name="DC.identifier" content="http://hdl.handle.net11245/1.345005"> (slash missing). I suggest to warn the repository administrators. Their records on BASE are also all broken, some OAI-PMH fixes are in order. Nemo 14:47, 22 November 2018 (UTC)

{{wontfix}} at this time. AManWithNoPlan (talk) 22:38, 7 December 2018 (UTC)

Slow

Status
new bug
Reported by
94.234.40.218 (talk) 12:33, 7 December 2018 (UTC)
What happens
the bot is uploading new data slowly. I can not get it to work at the moment.
We can't proceed until
Feedback from maintainers


Also not working for me. I asked it to check The Bill, so far 25 minutes and it's done nothing.-- 5 albert square (talk) 13:43, December 2018 (UTC)

{{wontfix}} shared server and sadly when it gets slow people often just start trying again and again thus making it worse (similar to shooting someone because they are bleeding and hoping it will help) AManWithNoPlan (talk) 15:32, 7 December 2018 (UTC)

Would displaying an error message of some kind be possible here? Something like "<server> is at capacity, try again in <ammount of time depending on server load>"? Headbomb {t · c · p · b} 23:38, 7 December 2018 (UTC)
I have an idea. AManWithNoPlan (talk) 00:34, 8 December 2018 (UTC)

Bug: Do not add OA link if adding free ID

Status
{{fixed}}
Reported by
(tJosve05a (c) 09:36, 23 November 2018 (UTC)
What happens
Bot adds both |pmc=5818020 which is open access, and |url=http://discovery.ucl.ac.uk/10045938/1/Antoniades_Verbal%20learning%20and%20hippocampal.pdf. A link should not be added if a free ID is present. This is regression, or not completely done last time (check needs to be done after adding IDs.
What should happen
Only add |pmc=5818020
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Hippocampus&diff=870226157&oldid=870046014
We can't proceed until
Feedback from maintainers


This is not a regression. The URL is added before the PMC is present. Will have to think about this. Perhaps move adding Open URL to the end would be best. AManWithNoPlan (talk) 16:46, 23 November 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1099 AManWithNoPlan (talk) 22:49, 23 November 2018 (UTC)

Added additional alias of "work" paramter

Status
{{fixed}}
Reported by
DferDaisy (talk) 01:50, 30 November 2018 (UTC)
What happens
Bot added "journal" parameter when "magazine" parameter was already present, creating a duplicate parameter error (since both are aliases of "work"). This is similar to the error which renames parameters to create aliases of "work", but in this case new parameters are being created.
Relevant diffs/links
Wired article
We can't proceed until
Feedback from maintainers


very rare. fix https://github.com/ms609/citation-bot/pull/1108 AManWithNoPlan (talk) 04:02, 30 November 2018 (UTC)

Sackur–Tetrode equation

Status
{{notabug}}
Reported by
81.153.242.15 (talk) 15:42, 30 November 2018 (UTC)
We can't proceed until
Feedback from maintainers


On November 19th, you removed two wikilinks from Sackur–Tetrode equation. Both wikilinks seem to be useful; so I restored them. To me, the removal of the wikilinks indicates a bug. 81.153.242.15 (talk) 15:41, 30 November 2018 (UTC)

removal of partial wikilinks is not a bug. you need to wikilink the entire journal name or it will be removed by the bot. AManWithNoPlan (talk) 16:27, 30 November 2018 (UTC)

Weird Citation bot bug

Status
{{fixed}}
Reported by
AdrianWikiEditor (talk) 12:08, 1 December 2018 (UTC)
What happens
It put the title as "Your NED search results"
What should happen
It should say just NED or something like that.
Relevant diffs/links
Special:Diff/871488348
Replication instructions
I don't know if it'll do the same thing again but my guess is any NED websites.
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1109 once implemented this will stop bot from trying to expand this website. AManWithNoPlan (talk) 18:27, 1 December 2018 (UTC)

Non-names

Status
{{fixed}}
Reported by
wumbolo ^^^ 22:07, 5 December 2018 (UTC)
What happens
generates first and last name "Telegraph Reporters"
What should happen
blacklist these, I don't think there's many such words, e.g. reporters, journalists, correspondents, anchors
Relevant diffs/links
[14]
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1110 AManWithNoPlan (talk) 15:57, 6 December 2018 (UTC)

Bug: doi's with plus signs

In this edit:

(tJosve05a (c) 09:30, 12 November 2018 (UTC)

it is interesting that Wiley cannot handle the doi either. plus signs are a horrible choice. AManWithNoPlan (talk) 19:06, 12 November 2018 (UTC)
Anyway to get the cite template to enclode the url better so Wiley can resolve it, or is this up to crossref/Wiley to fix? (tJosve05a (c) 22:56, 15 November 2018 (UTC)
waiting for bot to come alive to debug AManWithNoPlan (talk) 03:21, 13 November 2018 (UTC)
https://github.com/ms609/citation-bot/pull/1054 AManWithNoPlan (talk) 16:47, 14 November 2018 (UTC)
Not only conveting existing doi's, but also adding bad doi's :/ (tJosve05a (c) 22:53, 15 November 2018 (UTC)
That is no surprise. AManWithNoPlan (talk) 23:30, 15 November 2018 (UTC)
No, but still sad. A bit surprised though that it didn't add |doi-broken-date=, but I guess it tests if broken before parsing what to write. (tJosve05a (c) 23:47, 15 November 2018 (UTC)
when it gets url encoded, the space becomes a plus sign. When people start using doi with spaces and emojis it is going to suck AManWithNoPlan (talk) 00:02, 16 November 2018 (UTC)
Ugggh! Horrible thoughts! Burn them before they end up in doi's! (tJosve05a (c) 11:38, 16 November 2018 (UTC)

{{fixed}} AManWithNoPlan (talk) 14:30, 10 December 2018 (UTC)

Request: Process website dates more

Status
{{fixed}}
Reported by
(tJosve05a (c) 09:29, 23 November 2018 (UTC)
What happens
|date=30/11/2011
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=870225631
We can't proceed until
Feedback from maintainers


https://en.wikipedia.org/w/index.php?title=School-to-prison_pipeline&diff=872325415&oldid=872325081

https://en.wikipedia.org/w/index.php?title=Standard_of_living_in_Israel&diff=870417470&oldid=868413825

https://github.com/ms609/citation-bot/pull/1098 AManWithNoPlan (talk) 23:08, 23 November 2018 (UTC)

Reuters

Status
new bug
Reported by
wumbolo ^^^ 14:45, 24 November 2018 (UTC)
What happens
adds |newspaper=Reuters when |agency=Reuters is already present
What should happen
nothing; Reuters is a news agency
Relevant diffs/links
[15]
We can't proceed until
Feedback from maintainers


When the actuall website is Reuters.com, it whould be the work (such as |newspaper=), but while Reuters is the author of an article on another website (such as theguardian/nytimes) it should be |agency=. In this case |agency=Reuters be removed. Both |agency=Reuters and |newspaper=Reuters should not be present. (tJosve05a (c) 14:59, 24 November 2018 (UTC)
Same proble as with assocaited press AManWithNoPlan (talk) 17:54, 24 November 2018 (UTC)
https://github.com/ms609/citation-bot/pull/1102 AManWithNoPlan (talk) 19:31, 24 November 2018 (UTC)

{{fixed}}

be less exact with agency

  • In the same edit, it did not add an extra parameter for the Associated Press of Pakistan and for Agence France-Presse. All these agencies can often be called a couple of different names (e.g. AP, the Associated Press, or Associated Press), so that might be an issue. wumbolo ^^^ 19:44, 24 November 2018 (UTC)
I have added to pull 1102 some code to make it less exact. AManWithNoPlan (talk) 23:16, 24 November 2018 (UTC)

{{fixed}}

Date format

Status
new bug
Reported by
SusanLesch (talk) 16:12, 10 December 2018 (UTC)
What happens
Date added in wrong format (for example, "date=2016-08-03" was added right after "accessdate=16 September 2018")
What should happen
Should determine the page's format and copy that format
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Minneapolis&type=revision&diff=872994460&oldid=871424039
We can't proceed until
Feedback from maintainers


{{notabug}} the accessdate is formatted wrong, not what we did. The page says {{tl:Use mdy dates}} AManWithNoPlan (talk) 16:47, 10 December 2018 (UTC)

Request: handle non-escaped dx.doi.org URL

Status
new bug
Reported by
(tJosve05a (c) 10:26, 15 November 2018 (UTC)
What happens
Adler, Robert F.; et al. (December 2003). <1147:TVGPCP>2.0.CO;2 "The Version-2 Global Precipitation Climatology Project (GPCP) Monthly Precipitation Analysis (1979–Present)". Journal of Hydrometeorology. 4 (6): 1147–1167. Bibcode:2003JHyMe...4.1147A. CiteSeerX 10.1.1.1018.6263. doi:10.1175/1525-7541(2003)004<1147:TVGPCP>2.0.CO;2. {{cite journal}}: Explicit use of et al. in: |first1= (help)
What should happen
The bot shoudl remove the |url=
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User:Josve05a/cite-sandbox&oldid=868937431
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1072 AManWithNoPlan (talk) 16:44, 15 November 2018 (UTC)


Status
new bug
Reported by
(tJosve05a (c) 11:36, 16 November 2018 (UTC)
What should happen
Remove |url=
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User:Josve05a/cite-sandbox&diff=869097807&oldid=869097784
We can't proceed until
Feedback from maintainers



and https://en.wikipedia.org/w/index.php?title=Instar&type=revision&diff=872352613&oldid=872351168


{{fixed}}

Cite arXiv should have capital X

Status
new bug
Reported by
(tJosve05a (c) 12:52, 21 November 2018 (UTC)
What happens
The bot converts <ref>https://arxiv.org/pdf/quant-ph/0512078.pdf</ref> to {{Cite arxiv}}.
What should happen
It should be {{Cite arXiv}} (capital X)
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Consciousness&diff=869954364&oldid=869954280
We can't proceed until
Feedback from maintainers


{{fixed}}

Bot renames parameters to create duplicate alias of existing "work" parameter

Status
new bug
Reported by
DferDaisy (talk) 19:26, 23 November 2018 (UTC)
What happens
Bot renames "publisher" parameter to "newspaper". However, "website" parameter is already present. This creates a duplicate parameter error since both "website" and "newspaper" are aliases of "work".
What should happen
Don't convert any parameter to any alias of "work" if any alias of "work" (e.g., journal, newspaper, magazine, periodical, website) is already present.
Relevant diffs/links
Robert Stephens diff and Frank Williams (actor) diff and Spinal disease diff
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1100 AManWithNoPlan (talk) 23:07, 23 November 2018 (UTC)
Not sure how to tell if this is fixed, but if it was, it didn't work: edit at 19:57, 29 November 2018, see citation with title beginning "USA cyclist Tejay van Garderen". DferDaisy (talk) 01:31, 30 November 2018 (UTC)

{{fixed}}

Running bot twice (again)

Status
new bug
Reported by
(tJosve05a (c) 15:14, 24 November 2018 (UTC)
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Malnutrition&diff=prev&oldid=870400649
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1101 AManWithNoPlan (talk) 17:13, 24 November 2018 (UTC)

{{fixed}}

The tool seever is down

Status
new bug
Reported by
5 albert square (talk) 00:58, 12 December 2018 (UTC)
What happens
I'm getting a 500 internal service error when trying to run the bot to check the page Jay Brown
Relevant diffs/links
https://tools.wmflabs.org/citations/process_page.php
Replication instructions
I go to Citation bot/use and select to activate the bot. In the page name I have typed in Jay Brown and select "process page". Then this error appears
We can't proceed until
Feedback from maintainers


{{fixed}}

Server got Nuuk'd?

Status
new bug
Reported by
 — Mr. Guye (talk) (contribs)  20:35, 13 December 2018 (UTC)
What happens
500 Internal Server Error
Relevant diffs/links
[In the process of uploading file]  
Replication instructions
Go to its main page tools.wmflabs.org/citations/, Thorough mode = yes, Commit edits = yes, and insert "Nuuk" into the input box next to "Process page". Then hit "Process page" and the error will occur almost immediately.
We can't proceed until
Feedback from maintainers


{{fixed}}

Unexpected data found in parse_plain_text_reference. Citation bot cannot parse.

Status
{{fixed}}
Reported by
Lithopsian (talk) 14:18, 21 November 2018 (UTC)
What happens
Message Unexpected data found in parse_plain_text_reference. Citation bot cannot parse. Please report. A&A 619, A49 (2018)
Relevant diffs/links
[16], will that work?
Replication instructions
Running CitationBot against Hyperion proto-supercluster should give the message.
We can't proceed until
Feedback from maintainers


Thank you for the report. This comes from arXiv data. We support about a dozen formats that they use. This helps us decode new ones (or in some cases detect and not decode). AManWithNoPlan (talk) 15:33, 21 November 2018 (UTC)
https://github.com/ms609/citation-bot/pull/1118 AManWithNoPlan (talk) 19:19, 15 December 2018 (UTC)

chapter= added to Cite encyclopedia without removing title=

Status
{{fixed}} mostly
Reported by
Jonesey95 (talk) 18:53, 8 December 2018 (UTC)
What happens
chapter= was added to Cite encyclopedia without removing title=, causing there to be one quoted version of the chapter name and one italicized version.
What should happen
Bot should not operate on a citation formatted in this way
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJonesey95%2Fsandbox3&diff=prev&oldid=872714555
We can't proceed until
Feedback from maintainers


|chapter= is not a documented parameter in {{cite encyclopedia}}. |title= is supposed to be used for the encyclopedia entry. The bot should probably not add chapter at all when title is present, and it definitely should not add chapter and leave title in place. – Jonesey95 (talk) 18:53, 8 December 2018 (UTC)

The bot's edit summary was also partially incorrect in this edit, in that it claimed to have "Removed parameters", but it did not do so. – Jonesey95 (talk) 18:54, 8 December 2018 (UTC)
This should help a lot https://github.com/ms609/citation-bot/pull/1121 AManWithNoPlan (talk) 00:59, 16 December 2018 (UTC)

Convert worlcat.org urls with titles in urls to oclc also

Status
{{fixed}}
Reported by
Nessie (talk) 17:08, 11 December 2018 (UTC)
What happens
user enters a worldcat page for the url parameter and Citationbot ignores it
What should happen
worldcat urls should be removed and replaced with the oclc parameter, the same as with pmids and dois that are in the equivalent urls entered and swapped by the bot. In the case below, it should replace
|url=http://www.worldcat.org/title/dictionary-of-entomology/oclc/1038886039
with
|oclc=1038886039
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Glossary_of_entomology_terms&curid=5072111&diff=873157988&oldid=873147962
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1114 AManWithNoPlan (talk) 04:00, 14 December 2018 (UTC)

GBIF

This edit (since reverted) had a number of issues, not least that no journal is involved. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 16:46, 14 December 2018 (UTC)

Debugging now. AManWithNoPlan (talk) 18:59, 15 December 2018 (UTC)
Weird. i cannot get it to reproduce AManWithNoPlan (talk) 00:46, 16 December 2018 (UTC)
will just flag {{wontfix}}

Removal of valid URL from {{Cite}} tag

Status
{{notabug}}
Reported by
Shmuel (Seymour J.) Metz Username:Chatul (talk) 19:41, 15 December 2018 (UTC)
What happens
URL parameter removed from {{Cite|url=https://link.springer.com/chapter/10.1007/978-3-642-38928-3_11}} tag in article Internationalization and localization
What should happen
URL parameter should be left intact or moved after other parameters.
Relevant diffs/links
Internationalization and localization
We can't proceed until
Feedback from maintainers


The doi link points to the exact same page and is not prone to breaking as publisher links are. also, this case the pdf file is actually free which is a very unusual for a publisher website. AManWithNoPlan (talk) 21:04, 15 December 2018 (UTC)

Fails on Probiotic

Status
new bug
Reported by
Headbomb {t · c · p · b} 19:05, 19 December 2018 (UTC)
What happens
Doesn't finish the run
Relevant diffs/links
https://tools.wmflabs.org/citations/process_page.php?edit=toolbar&slow=1&page=Probiotic
We can't proceed until
Feedback from maintainers


{{notabug}} banned urls cannot be in modified text. z e n o d o. AManWithNoPlan (talk) 22:49, 19 December 2018 (UTC)

eScholarship

Status
{{fixed}}
Reported by
Headbomb {t · c · p · b} 17:52, 19 December 2018 (UTC)
What should happen
[17]
We can't proceed until
Feedback from maintainers


code change submitted AManWithNoPlan (talk) 01:49, 20 December 2018 (UTC)

Wish list

Wish list {{wontfix}} they said no

Discussion: non-functional jstor dois

Any thoughts on which of these is better:

Cartwright, Jane (1999). "Early and Medieval Literature". The Year's Work in Modern Language Studies. 61: 556–60. doi:10.2307/25833172 (inactive 2018-10-20). JSTOR 25833172.{{cite journal}}:  CS1 maint: DOI inactive as of October 2018 (link)
Cartwright, Jane (1999). "Early and Medieval Literature". The Year's Work in Modern Language Studies. 61: 556–60. JSTOR 25833172.

Should that bot remove the non-functional doi when it the same as the jstor link with 10.2307 added in front of it? AManWithNoPlan (talk) 16:30, 18 November 2018 (UTC)

I prefer the second version only, or at least not displaying inactive doi's if other IDs exists. (tJosve05a (c) 21:16, 19 November 2018 (UTC)
Non-functional DOI links of the form 10.2307/<JSTORID> can be removed if they are broken. Working JSTOR dois, or JSTOR dois of a different form should be left alone. I believe JSTOR used to have internal redirects, but no longer do, so that's why we've got a bunch of crap 10.2307/<JSTORID> DOIs laying around. Headbomb {t · c · p · b} 21:49, 19 November 2018 (UTC)
Anecdotally, sometimes the works where the JSTOR ID doesn't correspond to a working DOI actually have another DOI from a publisher. I'm not sure if these DOIs were never issued or what. Nemo 23:04, 20 November 2018 (UTC)
That is correct, some do not actually have the doi issued. Some have one from the publisher and one from jstor (and maybe one from researchgate and and who knows who else. AManWithNoPlan (talk) 01:22, 21 November 2018 (UTC)
https://github.com/ms609/citation-bot/pull/1127 AManWithNoPlan (talk) 17:20, 20 December 2018 (UTC)

{{fixed}}

Request: clean up sciencedirect URLs

Status
{{fixed}}
Reported by
(tJosve05a (c) 13:08, 21 November 2018 (UTC)
What should happen
Remove ?via%3Dihub from sciencedirect URLs
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User:Josve05a/cite-sandbox&diff=next&oldid=869955914
We can't proceed until
Feedback from maintainers


Search results for ?via%3Dihub . Perhaps remove all "via" URL-parameters for that link. (tJosve05a (c) 13:08, 21 November 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1128 AManWithNoPlan (talk) 17:34, 20 December 2018 (UTC)

Newspapers with multiple names

Status
mostly {{fixed}}
Reported by
wumbolo ^^^ 22:03, 5 December 2018 (UTC)
What happens
changes |website=[[The Daily Telegraph]] to |newspaper=The Telegraph
What should happen
don't change the newspaper name if it's wikilinked
Relevant diffs/links
[18]
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1129 AManWithNoPlan (talk) 22:10, 20 December 2018 (UTC)

and https://github.com/ms609/citation-bot/pull/1131 AManWithNoPlan (talk) 17:02, 21 December 2018 (UTC)

BBC Sport

Status
{{fixed}}
Reported by
Mattythewhite (talk) 16:01, 6 December 2018 (UTC)
What happens
changes |publisher=BBC Sport to |newspaper=BBC Sport
What should happen
Nothing; |publisher=BBC Sport is the preferred format
Relevant diffs/links
[19]
We can't proceed until
Feedback from maintainers


Why do you say that? AManWithNoPlan (talk) 15:33, 7 December 2018 (UTC)

Please refer to this discussion. Mattythewhite (talk) 13:51, 8 December 2018 (UTC)
I will think about the solution since bbc (not bbc sports) is the publisher. Newspaper is one of the many work aliases. AManWithNoPlan (talk) 21:16, 8 December 2018 (UTC)
See also this discussion. The use of |publisher=BBC Sport is a well-established norm and there is consensus for it. Nzd (talk) 08:46, 13 December 2018 (UTC)
https://github.com/ms609/citation-bot/pull/1130 AManWithNoPlan (talk) 02:16, 21 December 2018 (UTC)

Fails to convert urls with library proxies

Status
{{fixed}}
Reported by
Headbomb {t · c · p · b} 13:22, 7 December 2018 (UTC)
What happens
[20]
What should happen
[21]
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1115 AManWithNoPlan (talk) 04:24, 14 December 2018 (UTC)

Two jstor areas in code. Now need to do part right after plants. AManWithNoPlan (talk) 16:58, 21 December 2018 (UTC)

please add more entrez support

Status
{{fixed}}
Reported by
Headbomb {t · c · p · b} 06:00, 13 December 2018 (UTC)
What should happen
[22]
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1116 AManWithNoPlan (talk) 19:06, 15 December 2018 (UTC)

API: Silent/Verbose mode for category

Add a 'silent' mode. This would simplify the output to simply

--------------------------------------------------------------------------
[12:13:02] Processing page '[[2018 FFA Cup preliminary rounds]]' – [[edit]] – [[history]]
# No changes required.

when there is no changes made and

--------------------------------------------------------------------------
[12:13:02] Processing page '[[2018 FFA Cup preliminary rounds]]' – [[edit]] – [[history]]
# Updating the page ([[diff]]).

when there is a change made. This could probably made 'default' for categories, with &silent=0 to disable it. Or alternatively, &verbose=1 to enable verbose logs. Headbomb {t · c · p · b} 12:25, 21 August 2018 (UTC)

  • difficult to fix: pages that take a while to process will cause an HTTP disconect. AManWithNoPlan (talk) 13:13, 31 October 2018 (UTC)
    • @AManWithNoPlan: not sure what's that got to do with a simplified output in general? Headbomb {t · c · p · b} 13:36, 31 October 2018 (UTC)
      • perhaps output dots as the bot runs. let me think about it. AManWithNoPlan (talk) 13:50, 31 October 2018 (UTC)

way to many places in the code would need changed. also likley to drop connection while running. {{wontfix}} AManWithNoPlan (talk) 17:03, 22 December 2018 (UTC)

Do not touch any parameter with comments

In this edit.

  • Removed/touched a parameter with a comment <!-- some readers have trouble with the link generated by the doi= field? -->, which should "block out" the bot from touching it. (tJosve05a (c) 09:30, 12 November 2018 (UTC)

I think more {{fixed}} now. AManWithNoPlan (talk) 16:50, 22 December 2018 (UTC)

Spanish title case

Status
new bug
Reported by
Simon Burchell (talk) 11:12, 21 December 2018 (UTC)
What happens
Inappropriate capitalisation of foreign language titles - Spanish does not use title case, it uses first letter only capitalisation of titles.
What should happen
For Spanish titles, first letter only capitalisation (i.e. where language=es (and potentially other languages), don't apply English-language capitalisation rules.
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Paris_Codex&type=revision&diff=874680100&oldid=837840965
We can't proceed until
Feedback from maintainers


Note that the journal themselves capitalize, e.g. "Tomado de Joaquín Galarza, “Los códices mexicanos”, Arqueología Mexicana, Edición especial núm. 31, Códices prehispánicos y coloniales tempranos. Catálogo, pp. 6 - 9.". The main issue here is that you're trying to add the issue title to the journal. It really should just be

  • "Códices Mayas" [Maya codices]. Arqueología Mexicana (Special issue 31): 10–23. August 2009. {{cite journal}}: Cite uses deprecated parameter |authors= (help) or similar.

Headbomb {t · c · p · b} 14:09, 21 December 2018 (UTC)

    • Just because the original source has one style does bot mean we follow it. Thouhts? AManWithNoPlan (talk) 17:14, 21 December 2018 (UTC)


{{notabug}} Jounal titles in many styles are capitalized. No the bots fault that the template was used wrong. AManWithNoPlan (talk) 16:56, 22 December 2018 (UTC)

API: add &via= option (also what does &edit= do?)

In a call like https://tools.wmflabs.org/citations/process_page.php?edit=toolbar&user=Headbomb&page=Steve_Bieda, does edit=toolbar do anything? Because I'd like to have some ways to tell the bot that it was triggered via {{Draft article}} or citation expander, or similar. We might want to rename the parameter to allow something like

  • https://tools.wmflabs.org/citations/process_page.php?via=User%3AHeadbomb%2Fcitation.js&user=Headbomb&page=Steve_Bieda
  • https://tools.wmflabs.org/citations/process_page.php?via=the+%5B%5BWikipedia%3ACitation+expander%7Ccitation+expander%5D%5D&user=Headbomb&page=Steve_Bieda
  • https://tools.wmflabs.org/citations/process_page.php?via=%5B%5BTemplate%3ADraft_article%5D%5D&user=Headbomb&page=Steve_Bieda.

This way we could give a summary like

Headbomb {t · c · p · b} 03:43, 26 August 2018 (UTC)

I wonder what the audience of this additional message would be? To most users, what is important is the content and motivation of an edit, rather than the circumstances in which an editor came to make it. If I have a clear understanding of the motivation for this change, I'll be able to consider the best way to implement it. Martin (Smith609 – Talk) 08:56, 27 August 2018 (UTC)
The goal is mostly to have a way to see where Citation bot is used from. How many of those edits were triggered by the web interface? How many were from user scripts and from which userscript, or how many from templates and which templates (and do any need updating)? How many were done via the Citation Expander gadget? It's not necessarily to have 'official' stats (it would be nice though), but knowing where the bot is used from is nice, and could let us give help to newbies that run into issues with the bot. Headbomb {t · c · p · b} 10:32, 27 August 2018 (UTC)
For example, [23] was most likely triggered from {{Draft article}}, present on Draft:Lil ginger ale (we sadly can't feed who used the Template from the template because we don't have a {{CURRENTUSERNAME}} magicword/variable), but knowing it was triggered from the template means it has a fairly high chance of being used by a newbie, and was probably triggered by one of these people. So that lets us (or at least me) customize feedback to people. If I see someone doing something weird/unusual with the bot from {{Draft article}} vs Web Interface vs Gadget vs User Scripts, well you more or less have a continuum of likely noob vs likely noob/intermediate vs likely intermediate vs likely advanced user dealing with the bot. And you'd have an idea of who could have triggered the bot in that scenario. Headbomb {t · c · p · b} 10:45, 27 August 2018 (UTC)

mostly {{fixed}} up. Draft actually comes from several templates deep in https://en.wikipedia.org/w/index.php?title=Template:Automated_tools/core&action=edit AManWithNoPlan (talk) 21:52, 23 December 2018 (UTC)

@AManWithNoPlan: what's the syntax? Headbomb {t · c · p · b} 22:25, 23 December 2018 (UTC)

Blacklisted URL

If the bot tries to "reformat" a blacklisted link (e.g. https://zenodo.org/record/1223952/files/article.pdf to https://zenodo.org/record/1223952 the bot will not be able to save the edit. We should stop to reformat these URLs, in order to be able to edit such pages. Editing pages with existing links aren't stopped, but formatting them turns them in to new links - which are blacklisted. (tJosve05a (c) 07:06, 28 December 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1142 AManWithNoPlan (talk) 14:05, 28 December 2018 (UTC)
A better approach would be to find what causes this blacklisting, and see if edit filters can't be tweaked to let Citation Bot work around them. Headbomb {t · c · p · b} 14:44, 28 December 2018 (UTC)
Given that there are multiple ever changing black lists that would be hard. Awesome, but hard. AManWithNoPlan (talk) 14:54, 28 December 2018 (UTC)

{{fixed}} for now. AManWithNoPlan (talk) 05:25, 29 December 2018 (UTC)

time out on Life extension

Status
{{fixed}}
Reported by
Headbomb {t · c · p · b} 15:35, 28 December 2018 (UTC)
What happens
Times out
What should happen
No timeout?
Relevant diffs/links
[24]
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1145 and https://github.com/ms609/citation-bot/pull/1144 and https://github.com/ms609/citation-bot/pull/1143 AManWithNoPlan (talk) 18:03, 28 December 2018 (UTC)

Update naldc.nal.usda.gov URLs

Status
feature request (maybe better {{fixed}} by an AWB user?)
Reported by
Nemo 12:25, 28 December 2018 (UTC)
What happens
A broken URL like https://naldc.nal.usda.gov/naldc/download.xhtml?id=42375&content=PDF is left undisturbed: special:diff/875701440.
What should happen
URL gets replaced with the functioning new equivalent https://naldc-legacy.nal.usda.gov/naldc/download.xhtml?id=42375&content=PDF
Replication instructions
Special:Search/insource:"naldc.nal.usda.gov/naldc/download.xhtml"
We can't proceed until
Feedback from maintainers


Make a WP:BOTREQ and someone can take care of this with AWB. Headbomb {t · c · p · b} 16:02, 29 December 2018 (UTC)
Yes please use BOTREQ for URL updates, but be careful using AWB it typically breaks archive URLs and/or doesn't undo previous archivals of the broken URL. -- GreenC 16:08, 29 December 2018 (UTC)
Example what is required. Job done. -- GreenC 18:00, 29 December 2018 (UTC)
one time focused tasks like this are not optimal for the citation bot. AManWithNoPlan (talk) 23:54, 29 December 2018 (UTC)
I meant, the job is done. It has been completed. Special:Search/insource:"naldc.nal.usda.gov/naldc/download.xhtml" shows zero hits. I posted the diff to illustrate for Headbomb what is required when modifying URLs - it's not a search-replace with AWB because that causes problems with archives. -- GreenC 00:16, 30 December 2018 (UTC)
Thank you for the fix and for the wayback "medication". Nemo 00:24, 30 December 2018 (UTC)

Request: Shove "additional information" stuff after the pipe in edit summaries

It would probably make more sense to shove "additional information" stuff after the pipe

or

if &via= and category mentions are implemented. Headbomb {t · c · p · b} 04:10, 26 August 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1134 AManWithNoPlan (talk) 21:05, 23 December 2018 (UTC)
Via= can't really be implemented in any useful manner, since edit= is currently unused and all usages use edit=toolbar and the draft article uses edit=toolbar and draft does not directly include it (it is two templates deeper, so we can't even tag it as from draft). We have done what we can for now. AManWithNoPlan (talk) 23:03, 23 December 2018 (UTC)
How about the way I suggest above? Headbomb {t · c · p · b} 23:34, 23 December 2018 (UTC)
There’s no way to easily detect how it was run. We already specify category as different. We have improved user detection though. AManWithNoPlan (talk) 23:57, 23 December 2018 (UTC)
Yes, but there is a way to recognize what is fed in &via=, or if a &via= is declared. Also, since it got archived, what's the syntax for via? Headbomb {t · c · p · b} 00:15, 24 December 2018 (UTC)
it does not exist. there’s no reliable way to do it. We can detect category vs toolbar, but nothing else. That is why edit= is not used. AManWithNoPlan (talk) 00:28, 24 December 2018 (UTC)
Well that's what the request in via was about. To add support for &via=. Headbomb {t · c · p · b} 03:54, 24 December 2018 (UTC)
I know and we’ve done all we really can. Unless we have some way of actually getting reliable information (which we do not) there’s really no point to adding it. AManWithNoPlan (talk) 04:08, 24 December 2018 (UTC)
What do you mean 'reliable information'? what's wrong with just displaying the information that's passed in &via=! That'd be the whole point of via. Headbomb {t · c · p · b} 04:54, 24 December 2018 (UTC)
we would need an approved list of options to choose from and not just accept random strings. AManWithNoPlan (talk) 04:59, 24 December 2018 (UTC)
I honestly doubt anyone would set it, since the toolbar and the citation toolset core that draft pulls information from both set toolbar. AManWithNoPlan (talk) 05:01, 24 December 2018 (UTC)
Why would we need a list of options / pre-approved stringers? 99%+ of usages would be from templates and scripts. Headbomb {t · c · p · b} 06:19, 24 December 2018 (UTC)
I think the pre-approved strings would serve as a kind of input sanitisation. Otherwise at some point you may need to check that you're not inserting junk or spam in edit summaries (where it's hard to remove). I don't know how important a concern this is, but it's not unreasonable to keep it mind. Nemo 10:08, 27 December 2018 (UTC)

{{fixed}} pipe added. AManWithNoPlan (talk) 19:38, 30 December 2018 (UTC)

Fix spacing in page/issue rages

Status
{{fixed}}
Reported by
Headbomb {t · c · p · b} 21:50, 23 December 2018 (UTC)
What should happen
[25]
We can't proceed until
Feedback from maintainers


If it is only long dashes and numbers and spaces then remove spaces.  Correct? AManWithNoPlan (talk) 21:54, 23 December 2018 (UTC)

Could be letters too, like A23 - A48. Convert/fix that to A23–A48.Headbomb {t · c · p · b} 22:02, 23 December 2018 (UTC)
that get dangerous could be junk like ii - iii, 5-7 or the evil look at pages 5 to seven and browse around pages in the early teens..... I will think about how many letters to allow. AManWithNoPlan (talk) 22:06, 23 December 2018 (UTC)
I can start simple and move on from there. AManWithNoPlan (talk) 22:53, 23 December 2018 (UTC)
https://github.com/ms609/citation-bot/pull/1135 AManWithNoPlan (talk) 23:01, 23 December 2018 (UTC)

miscellaneous

Status
{{fixed}}
Reported by
Michael Goodyear   18:30, 24 December 2018 (UTC)
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Hannah_Arendt&type=revision&diff=875145591&oldid=875142730
We can't proceed until
Feedback from maintainers


1. French journals like Archives de sciences sociales des religions do NOT capitalise

2. Why change page= to pages=, when the article is one page only?

3. The name of this website is HannahArendt.net

many style guides actually specify capitalization of Foreign journals independent of the what the journal itself is called. It’s an odd thing. Specific journals can be submitted for capitalization as needed. AManWithNoPlan (talk) 19:16, 24 December 2018 (UTC)
pages vs. pages is odd. Jstor gives us a range and then we fix that and so it is temporarily a range of pages AManWithNoPlan (talk) 19:16, 24 December 2018 (UTC)
websites are not case-sensitve, but I can add a capitalization exception. the initial reference being a mix of a journal and a website confused the bot. AManWithNoPlan (talk) 19:16, 24 December 2018 (UTC)
https://github.com/ms609/citation-bot/pull/1136 AManWithNoPlan (talk) 19:39, 24 December 2018 (UTC)

Do not add null

Status
{{fixed}}
Reported by
(tJosve05a (c) 22:55, 26 December 2018 (UTC)
What happens
|issn=null
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=875487557
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1137 and https://github.com/ms609/citation-bot/pull/1138 AManWithNoPlan (talk) 00:13, 27 December 2018 (UTC)

When journals ends in 'Des', don't uncapitalize

Status
{{fixed}}
Reported by
Headbomb {t · c · p · b} 17:57, 27 December 2018 (UTC)
What happens
[26]
What should happen
A final 'Des' will always be the undotted ISO 4 abbreviation for 'Design'
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1146 AManWithNoPlan (talk) 21:36, 28 December 2018 (UTC)

Prioritize DOI > Bibcode > Arxiv when fetching journal information

Status
{{fixed}}
Reported by
Headbomb {t · c · p · b} 18:04, 27 December 2018 (UTC)
What happens
Adds J.phys.a from arXiv
What should happen
Add Journal of Physics A from DOI
We can't proceed until
Feedback from maintainers


If arXiv has a doi then process that before the rest of the record AManWithNoPlan (talk) 18:52, 27 December 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1140 AManWithNoPlan (talk) 19:04, 27 December 2018 (UTC)

Bot adds deprecated class= to Template:Citation

Status
{{fixed}}
Reported by
Jonesey95 (talk) 15:48, 29 December 2018 (UTC)
What happens
Bot adds deprecated class= to Template:Citation
What should happen
Bot should not add |class= to anything except {{Cite arxiv}}
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Incidence_coloring&type=revision&diff=875657208&oldid=874114333
We can't proceed until
Feedback from maintainers


I see that the template documentation has changed again. AManWithNoPlan (talk) 18:25, 29 December 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1148 AManWithNoPlan (talk) 18:51, 29 December 2018 (UTC)

Messes with a pubmed it shouldn't mess with

Status
{{fixed}}
Reported by
Headbomb {t · c · p · b} 04:13, 30 December 2018 (UTC)
What happens
[27] first ref it touches
What should happen
keep it like this [28]
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1149 AManWithNoPlan (talk) 04:40, 30 December 2018 (UTC)

Capitalize after (

Status
{{fixed}} just for new series
Reported by
Headbomb {t · c · p · b} 19:54, 27 December 2018 (UTC)
What happens
[29]
What should happen
(New Series), not (new Series)
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1150 AManWithNoPlan (talk) 17:11, 30 December 2018 (UTC)

More JSTOR support

Status
{{fixed}}
Reported by
(tJosve05a (c) 00:20, 27 December 2018 (UTC)
What should happen
Replace |url=https://www.jstor.org/stable/j.ctt1q1xrts with |jstor=j.ctt1q1xrts
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=875487557
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1154 AManWithNoPlan (talk) 04:24, 1 January 2019 (UTC)

ePrint

Status
{{fixed}}
Reported by
Headbomb {t · c · p · b} 17:35, 31 December 2018 (UTC)
What happens
Not [30]
What should happen
[31]
We can't proceed until
Feedback from maintainers


See also User_talk:Citation_bot/Archive_10#eJournal_of... (e?Prints should read e-?Prints?)Headbomb {t · c · p · b} 18:03, 31 December 2018 (UTC)

will add ePrint, e-Prints, and e-Print to the existing ePrints. AManWithNoPlan (talk) 18:22, 31 December 2018 (UTC)

Bot generates template errors

Status
{{fixed}}
Reported by
Headbomb {t · c · p · b} 17:55, 31 December 2018 (UTC)
What happens
[32]
What should happen
Not that
We can't proceed until
Feedback from maintainers


Don’t convert to cite arXiv when incompatible parameters exist. AManWithNoPlan (talk) 18:26, 31 December 2018 (UTC)

https://github.com/ms609/citation-bot/pull/1153 AManWithNoPlan (talk) 23:47, 31 December 2018 (UTC)

Request: Better Citoid like capabilities

Status
{{fixed}}
Reported by
(tJosve05a (c) 21:10, 5 November 2018 (UTC)
What happens
Citation bot edit
What should happen
Citoid/Zotero edit
We can't proceed until
Feedback from maintainers


Investigating. AManWithNoPlan (talk) 21:41, 23 December 2018 (UTC)

If the Citation bot can easily read & process the HTML Meta tags for the page, it could sometimes do better for these cases.
For the example Nature URL, prism.doi, dc.identifier, DOI meta tags all provide the doi, though the first two give doi:....
I think the prism. tags may also work for some newspapers.
Wikipedia:Scripts/Perl scripts/hdump-head.pl was my ugly attempt at this. Hope this is useful. RDBrown (talk) 08:13, 28 December 2018 (UTC)
can you point to a single page where prism works better? Also, we need php, not perl generally. AManWithNoPlan (talk) 19:42, 30 December 2018 (UTC)
I needed to fix the perl script for https after that, this is the output for the Nature url Josve05a referenced
#    https://www.nature.com/articles/546031a#bk4
Page title: 
<title> @0.0.0
  "Books in brief | Nature"
msapplication-tilecolor #940720
dc.identifier   doi:10.1038/546031a
wt.z_primary_atype      Books and Arts
robots  noarchive
dc.publisher    Nature Publishing Group
wt.cg_s Article
access_endpoint https://www.nature.com/platform/readcube-access
theme-color     #940720
twitter:description     Barbara Kiser reviews five of the week's best science picks.
wt.cg_n Nature
dc.date 2017-05-31
doi     10.1038/546031a
wt.template     oscar
twitter:card    summary
twitter:title   Books in brief
prism.startingpage      31
twitter:site    @naturenews
dc.creator      Barbara Kiser
prism.rightsagent       permissions@nature.com
dc.rights       ©2019 Macmillan Publishers Limited. All Rights Reserved.
prism.url       https://www.nature.com/articles/546031a
description     Books & Arts
wt.page_categorisation  Article_HTML
wt.z_bandiera_abtest    a
prism.issn      1476-4687
prism.issn      1476-4687
dc.subject      Climate sciences
dc.subject      Engineering
dc.subject      Medical research
dc.subject      Nuclear physics
dc.description  Barbara Kiser reviews five of the week's best science picks.
dc.type Books and Arts
wt.z_cg_type    Nature Research Journals
wt.z_subject_term       Climate sciences;Engineering;Medical research;Nuclear physics
dc.copyright    2017 Nature Publishing Group
prism.copyright 2017 Nature Publishing Group
dc.language     En
dc.title        Books in brief
viewport        width=device-width,initial-scale=1.0,maximum-scale=2.5,user-scal
able=yes
dc.rightsagent  permissions@nature.com
prism.volume    546
wt.z_subject_term_id    climate-sciences;engineering;medical-research;nuclear-physics
twitter:image   https://media.springernature.com/full/nature-static/assets/v1/image-assets/546031a-i1.jpg
dc.source       Nature 2017 546:7656
msapplication-tileimage /static/images/favicons/nature/favicon-144x144.3e61d1f755.png
prism.number    7656
prism.publicationdate   2017-05-31
journal_id      nature
access  Yes
prism.doi       doi:10.1038/546031a
prism.section   Books and Arts
dc.format       text/html
prism.publicationname   Nature

|author=Barbara Kiser |date=2017-05-31 |title=Books in brief |journal=Nature |volume=546 |issue=7656 |pages=31 |doi=10.1038/546031a 
Barbara Kiser[au] 2017-05-31[dp] Books in brief[ti] Nature[ta] 546[vi] 7656[ip] 31[pg]

 ::That is it is dumping all the page meta tags, then cite journal parameters, then a PubMed query. I'm not a PHP programmer, but this StackOverflow answer may be useful, if you're not already retrieving the meta tag data. I think PRISM may include the Dublin Core dc. tags as a subset, but the BMJ & maybe the Oxford journals also add useful citation_ tags.

dc.contributor  Gordon C S Smith
dc.contributor  Jill P Pell
dc.identifier   10.1136/bmj.327.7429.1459
citation_title  Parachute use to prevent death and major trauma related to gravitational challenge: systematic review of randomised controlled trials
citation_public_url     https://www.bmj.com/content/327/7429/1459
citation_mjid   bmj;327/7429/1459
citation_lastpage       1461
citation_doi    10.1136/bmj.327.7429.1459
citation_section        Hazardous journeys
citation_article_type   Other
citation_pmid   14684649
Hope this is useful RDBrown (talk) 12:34, 1 January 2019 (UTC)

I have submitted code to find the doi. https://github.com/ms609/citation-bot/pull/1156 AManWithNoPlan (talk) 00:58, 2 January 2019 (UTC)

Smarter duplicate handling

Status
{{fixed}}
Reported by
Headbomb {t · c · p · b} 01:12, 3 January 2019 (UTC)
What happens
Whenever there is a |foobar=<SOMETHING> |barfoo=... |foobar=<NOTHING>, the bot changes that to |DUPLICATE_foobar=<SOMETHING> |barfoo=... |foobar=<NOTHING>
What should happen
Whenever there is a |foobar=<SOMETHING> |barfoo=... |foobar=<NOTHING>, get rid of the empty parameter and keep the full one. E.g. [33]
Exception: Keep the handling of author/editor parameters the same (last/first, editor-last/editor-first, etc...) since people often mangle the order by accident.
Relevant diffs/links
[34] (see DUPLICATE_location)
We can't proceed until
Feedback from maintainers


This is not actually a bug. The bot leaves the citation unchanged in its current method. AManWithNoPlan (talk) 02:38, 3 January 2019 (UTC)

Not saying it's a bug, but it would be an improvement. Headbomb {t · c · p · b} 13:40, 3 January 2019 (UTC)

weird ResearchGate thing

Status
new bug
Reported by
Headbomb {t · c · p · b} 17:42, 5 January 2019 (UTC)
What happens
[35]
What should happen
not that
We can't proceed until
Feedback from maintainers


That is actually the correct information. Hard to deal with people who do not know how to spell when inputting data! AManWithNoPlan (talk) 19:01, 5 January 2019 (UTC)

Correct information? There is no journal named 'peprint' out there, and that doesn't seem to be anywhere on the RG page either. Is this GIGO? Headbomb {t · c · p · b} 19:05, 5 January 2019 (UTC)
yes indeed it is correct. That’s the journal the author entered. Obviously GIGO. AManWithNoPlan (talk) 19:14, 5 January 2019 (UTC)
TY  - BOOK
AU  - Petit, Jean-Pierre
PY  - 2016/07/04
SP  - 
T1  - Schwarzschild 1916 seminal paper revisited : A virtual singularity
JO  - peprint
ER  - 

Anyway, close this one then. No need to code an exception for such uncommon GIGO. Headbomb {t · c · p · b} 19:33, 5 January 2019 (UTC)

{{wontfix}} as you said. Would have been a lot funnier if they had spelled it will one more e. AManWithNoPlan (talk) 19:34, 5 January 2019 (UTC)
PEE PINTS FOAR EVRYONE!! Headbomb {t · c · p · b} 19:38, 5 January 2019 (UTC)

Timeout on Red imported fire ant

Status
{{fixed}}
Reported by
Headbomb {t · c · p · b} 21:57, 5 January 2019 (UTC)
We can't proceed until
Feedback from maintainers


Blocked zenodo dot org url is trying to be added. Not sure how or why. AManWithNoPlan (talk) 23:29, 5 January 2019 (UTC)

https://github.com/ms609/citation-bot/pull/1171 AManWithNoPlan (talk) 00:51, 6 January 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1175 AManWithNoPlan (talk) 02:39, 6 January 2019 (UTC)

CAPS: Fluids and Barriers of the CNS

Status
{{fixed}}
Reported by
(tJosve05a (c) 01:33, 6 January 2019 (UTC)
What happens
|journal=Fluids and Barriers of the Cns
What should happen
|journal=Fluids and Barriers of the CNS
We can't proceed until
Feedback from maintainers


https://github.com/ms609/citation-bot/pull/1173 AManWithNoPlan (talk) 02:40, 6 January 2019 (UTC)

URLs containing an ISSN-DOI replaced with link to incorrect article in a different journal

Status
{{fixed}}
Reported by
Randykitty (talk) 09:48, 6 January 2019 (UTC)
What happens
The bot replaced some valid references by a reference to a completely different article (a book review published 10 years before this journal was established...) Worse, it inserted this faulty reference multiple times but as different references (probably because they were named and got different names). The apparent reason for this is that the URLs of the references had changed although (with 1 exception) they still redirected to the correct page. I have corrected this manually (see article history). I do find it weird that "cite web" references were replaced by "cite journal" ones that were completely inappropriate. Although the bot indicated that it was "user activated", there was no indication about who this user was, who clearly failed to check the edits made by the bot.
Relevant diffs/links
https://en.wikipedia.org/w/index.php?title=Genes,_Brain_and_Behavior&curid=14177857&diff=877031301&oldid=873065992
We can't proceed until
Feedback from maintainers


This mostly happens with Wiley's "fake DOI" ISSN links (which are often rather spammy by the way, as in this article) and can be conclusively solved only by actually resolving DOI links. Nemo 10:32, 6 January 2019 (UTC)
Look where the incorrect reference goes. Even though the bot put "journal=Genes, Brain and Behavior", it was to an article in a completely different journal that had "Genes, Brain and Behavior" as title. It didn't go to one of Wiley's URLs at all. Wiley doesn't use these fake DOI URLs any more, although these generally are still functional but redirect to the new (non-DOI) URLs. All that the bot should have done was replace the "fake DOI URL" with the new URL. --Randykitty (talk) 10:45, 6 January 2019 (UTC)
That's a problem with fake DOIs that resolve. AManWithNoPlan (talk) 15:14, 6 January 2019 (UTC)
I could probably add code to detect DOIs in the form of 10.xxxxx/(ISSN)xxxx-xxxx which are obviously just an ISSN. AManWithNoPlan (talk) 15:21, 6 January 2019 (UTC)
Thanks for maintaining this invaluable tool. BTW, I'm still curious why the bot took those fake DOI links and arrived at an old book review, mixing up the review title and the journal name... --Randykitty (talk) 15:35, 6 January 2019 (UTC)
The Bot took the journal title which was in the title parameter and did a PMC search and found an exact match and went with it. We do have rare false positives like this. AManWithNoPlan (talk) 15:55, 6 January 2019 (UTC)
I see. Yes, that must be rare :-) Thanks again. --Randykitty (talk) 16:06, 6 January 2019 (UTC)

Once this is accepted and pushed to wikipedia, the bug will be gone. https://github.com/ms609/citation-bot/pull/1177 AManWithNoPlan (talk) 19:55, 6 January 2019 (UTC)

Timeout at Edward M. Fram

Status
new bug
Reported by
Headbomb {t · c · p · b} 00:46, 7 January 2019 (UTC)
We can't proceed until
Feedback from maintainers


{{notabug}} I ran it successfully and there were no changes to make. AManWithNoPlan (talk) 04:31, 7 January 2019 (UTC)