User:Proteins/Prosesize script acid test

A pathological article for testing prose-size scripts. Note extra line below, which should not be counted.


Bad heading level H4 edit

Stetsonville is located at 45°4′35″N 90°18′50″W / 45.07639°N 90.31389°W / 45.07639; -90.31389 (45.076413, -90.313952)[1].

According to the United States Census Bureau, the village has a total area of 0.4 square miles (1.0 km²).

Oak Park Mall is open from 10 a.m. to 9 p.m. Monday through Saturday and 11 a.m. to 6 p.m. on Sundays [1]. The mall contains nearly 200 stores and the mall area is 1,500,000 sq ft (140,000 m2), making it the largest mall in the Kansas City Metro Area.

Section 1: Blank lines: edit

Make this section harder.


Really, really hard.

Section 2: Lists edit

Sometimes an unordered list is useful...

  • in giving various cases
    • and some subcases
    • such as these
      • which may have subcases
  • in discussing cases with complicated logic

An ordered list

  1. Here's the first item.
  2. Here's the second.
  3. And here's the third.

A discursive list

First term
Describes the first item.
Second item
Describes the second item.
Third term
Describes the third item.

Section 3: Punctuation and special characters edit

Em-dashes—which are very common—are used in many hyphen–hungry texts, but – sparingly. Two words separated by an unspaced em-dash should count as two words, not one. However, two things separated by an unspaced en-dash should generally be counted as a single compound word, as in blood–brain barrier; see WP:MOS#Dashes.

& & & These ampersands should count as one character each, not five.

One Two Three Four Five These are five words separated by non-breaking spaces, not one word.


Other special characters should count as one character each:

÷ < > ? & & / " ” € £ § ↑ . ( ) ¿ ♠ = : ; ©

& & ¢ © À ñ ® ø — – µ † ‡ @


The last six characters of this line are divided into two words of three characters each by a #32 space character:

[ \ ] ^ _ "”€ £§↑

Section 4: Blockquotes, cquotes and tables edit

whether 'tis nobler in the mind


The Al/air battery system can generate enough energy and power for driving ranges and acceleration similar to gasoline powered cars...the cost of aluminum as an anode can be as low as US$ 1.1/kg as long as the reaction product is recycled...Only the Al/air EVs can be projected to have a travel range comparable to ICEs. From this analysis, Al/air EVs are the most promising candidates compared to ICEs in terms of travel range, purchase price, fuel cost, and life-cycle cost.

However, tables should not be counted, such as this one from

Risk Category Abnormality 5-year survival Relapse rate
Favorable t(8;21), t(15;17), inv(16) 70% 33%
Intermediate Normal, +8, +21, +22, del(7q), del(9q), Abnormal 11q23, all other structural or numerical changes 48% 50%
Adverse -5, -7, del(5q), Abnormal 3q, Complex cytogenetics 15% 78%

the Wikipedia article on acute myeloid leukemia.

Section 5: Poems and indented text edit

I've never seen a purple cow
and hope I never see one

But I will tell you this right now
I'd rather see than be one!
Ogden Nash
That poem is called "The Purple Cow".

The previous indented lines had no linespaces between them.

This line is indented once and has one line space ahead of itself
This line is indented twice and has one line space ahead of itself
This line is again indented twice and has one line space ahead of itself
This line is indented three times and has one line space ahead of itself
This line is yet again indented twice and has one line space ahead of itself


This line is indented four times and has two line spaces ahead of itself

Section 6: Refmark text edit

This is a citation,[2] but this is a superscript x2.

This[citation needed] is a template test.[citation needed]

Section 7: Bad heading levels edit

H4 heading edit

H3 heading edit

H5 heading edit

H1 heading edit

Section 8: Images edit

 
Test image that shouldn't be counted.
 
Another test image that shouldn't be counted.

Section 9: Unusual formatting of text edit

We begin with a space at the beginning of a line; in Wiki-markup, this corresponds to the <PRE> tag in HTML.

For I am the very model of a modern major general...
'Twas blighted affection that made him exclaim...

See also edit

Here are some great articles related to this topic.

References edit

Here are some really good references.

  1. ^ "US Gazetteer files: 2010, 2000, and 1990". United States Census Bureau. 2011-02-12. Retrieved 2011-04-23.
  2. ^ Random reference

Correct results edit

The correct results of a prose-size script on this article should be:


This article has three bad jumps in heading level, one in the lead and two in section 7; there's also an illegal H1 heading in section 7. The MediaWiki software appears to have a bug in formatting articles with such H1 sections and lead sections with a jump; please note the table of contents.

Known bugs and conventions in articlestructure.js edit

The reference script articlestructure.js has a few bugs and/or conventions.

Text conventions/bugs
  • Each <PRE>-tagged text is counted as a new paragraph.
  • Sections that come after a "closing" section such as "See also" are not counted, such as this one. Another example is 2003 German Grand Prix.
  • Closing sections may have main-article text, as in the "See also" section of Operations Specialist (US Navy) or in the "Notes" section of the Legend of the White Cowl. This text is not counted.
  • The heading "Source" or "Sources" is treated as equivalent to "References", making it a closing section. For example, see Yellowtail flounder, Nothropus or John Bell. However, "Source" could be a section heading of an article, as in the "source of the Nile".
  • The heading "Literature" is used in Heroin to mean "Further reading", but is not included in the script. That doesn't affect the prose-size counting for Heroin, but could for other articles.
  • The "Memoir" section of Martin Clemens may mean "Further reading", but is counted by this script.
  • The boxed author-abbreviation sentence in Louis van Houtte is not counted.
  • Disambiguation, See also, Further, Main article, etc. messages are counted as text if they are written out in italics. As templates, however, they are not counted. For an example, see Lieutenant-General (United Kingdom), Vital Signs (pop band), Hak Ja Han, Nevada State Route 427, or 2005 LPGA Tour. An anomalous example is the "Source" messages found in List of Sabre and Fury units in US military.
  • Wikipedians may disagree whether two words joined by an unspaced en-dash or hyphen should count as one word or two. This script treats them as a single compound word. Therefore, an unspaced en-dash used (improperly) as a parenthetical em-dash gives the wrong word count, but the correct character (byte) count.
  • The text floating at right in W. Eugene McCombs within a DIV tag is not counted.
Image conventions/bugs
  • Images are counted only if they are at least 80 pixels in both width and height. This eliminates flag icons and other such images.
  • Doesn't count very narrow but otherwise informative images, such as Universal College Application or 1-Tetradecanol.
  • Misses images composed of many smaller images, such as the uniforms in Anagennisi Dherynia.
  • Misses small mugshots in Louis Buchalter. Perhaps check for captions or an "Enlarge" button if the size test fails.