# Wikipedia talk:WikiProject Statistics

Active discussions
(Redirected from Wikipedia talk:WPSTATS)
Main page WikiProject Statistics

## Probability distributions: we need a standard for topics

The different articles for distributions include different sub-headers. For example:

• Binomial_distribution: Specification, Example, Expected value and variance, ..., Estimating parameters, ...
• Chi-squared distribution: Definition, Introduction, Characteristics, Relation to other distributions, ...
• Normal distribution: Definition, Properties, Cumulative distribution function, ..., Estimation of parameters,...

etc.

I'm planning to carve out a standard of what an article should include, and will track what I do here. If others wish to join, please leave a comment :)

Tal Galili (talk) 14:05, 27 November 2019 (UTC)

Great idea. Limit-theorem (talk) 14:39, 27 November 2019 (UTC)
Yes, good idea. NB the chi-squared *test* page is a disaster. The many different tests sometimes called chi-square test because the user is told to use a chi-square distribution to get their p-values are all mixed up; the distinction between theoretical, sometimes approximate, sometimes asymptotic, sampling distributions needs to be better explained. Richard Gill (talk) 12:24, 4 January 2020 (UTC)

### proposed structure

Based on some ideas from: https://en.wikipedia.org/wiki/Wikipedia:Featured_articles#Mathematics And also from some distribution articles, I propose the following arc-type of a structure (some might be missing):

• Definitions (this section will include sub-sections on introduction or maybe examples, PDF and CDF, as they are inherent to all distributions)
• Properties (move discussions about parameter, moments, "central limit theorem", etc. - here)
• Related distributions (this is a special case of properties, so I give it its own section)
• Statistical Inference (includes Estimation of parameters, confidence intervals, statistical tests, etc.)
• Occurrence and applications
• Computational methods (for things like how to create random numbers, numerical approximations, and software implementations)
• History (I'm putting this at the end since for most distribution this is where this section was placed, and since the main focus for readers, I assume, will likely be to understand how to use a distribution and less so about its origin)

I'll modify a few distributions and will add them here. Please review and comment:

## Opportunity for causal research on Wikipedia research missing it

The "Recent research" column in the current Signpost discusses Murić et al. (2019) "Collaboration Drives Individual Productivity" in Proceedings of the ACM on Human-Computer Interaction. This paper establishes a clear correlation for both Wikipedia articles and GitHub repositories, that, in the case of the former, editors who edit pages edited by greater numbers of editors tend to edit more pages than those who don't. The authors assume that causation flows in the direction as stated in the title and repeated the third paragraph of their conclusion, but they perform no causal research ruling out the alternative hypothesis that causation flows in the opposite direction, which is more likely in my opinion, that individual productivity drives the extent of collaboration. Or to put it a different way, I think it's more likely that editing more pages leads to editing pages edited by greater numbers of editors. This is a great opportunity for critical authors of undergraduate though postdoctorate levels to explain the importance of analyzing causation in our familiar subject matter on open data sets. If anyone wants to pursue this, please ping me on my talk page so I can give you some more ideas I have on approaching it. EllenCT (talk) 00:12, 28 January 2020 (UTC)

I would be very interested to open a discussion with someone willing to work on this & potentially collaborating.
cc: EllenCT. = paul2520 (talk) 18:44, 28 January 2020 (UTC)

## Statistics question at Talk:2019–20 coronavirus pandemic#Epidemic curve graphics

Some attention from statistics-inclined Wikipedians would be helpful for resolving the question at Talk:2019–20 coronavirus pandemic#Epidemic curve graphics. Thanks! {{u|Sdkb}}talk 22:49, 15 April 2020 (UTC)