Talk:Skew normal distribution

Latest comment: 2 years ago by 2001:14BA:5054:D300:D09C:99DD:9096:8C3 in topic Defining delta and gamma.

Creation edit

I created the article on skew normal distributions today... It is highly useful empirically when you have a distribution that is skewed away from normal -- it gives you a parametric model, which can be then used for confidence intervals, hypothesis testing, etc. Velocidex (talk) 01:29, 13 March 2008 (UTC)Reply

Nice work! I was going to ask you about the pdf... but you fixed it so fast! Josuechan (talk) 07:03, 13 March 2008 (UTC)Reply
BTW, do you happen to know how to generate a draw from this distribution? any references? Josuechan (talk) 07:06, 13 March 2008 (UTC)Reply
Same way you can draw from any other distribution. Check out rejection sampling. Velocidex (talk) 23:54, 2 April 2008 (UTC)Reply

I have two remarks to the above discussion and the text of the article. (1) Random number generation: this is possible _without_ rejection of a portion of any generated values as in rejection sampling, see http://azzalini.stat.unipd.it/SN/faq-r.html (2) Early appearances of this distribution: the information provided is not appropriate, see http://azzalini.stat.unipd.it/SN/faq-h.html --147.162.23.31 (talk) 12:54, 12 March 2010 (UTC)Reply

Inflection points? edit

In a standard normal distribution, the inflection points are at ±σ. Where are they in this distribution? —Ben FrantzDale (talk) 20:58, 26 August 2008 (UTC)Reply

Reply: I have done a bit of calculating on this question (after a brief search of the literature available on the net on this distribution) and I think I'm halfway to the answer. I doubt I'll take it much further. To not have this go to waste I'm gonna give my derivation here in case someone wants to take it further.

Starting with f(x) as defined:

 

We need to find the f(x), set it equal to 0 and solve for x. Thus we find f'(x) which is: (by the product rule for derivatives)

 

which we can rewrite as follows using the fundamental theorem of calculus (FTC), and the fact that  :

Applying the FTC to   proceeds as follows:

 

[NOTE: The FTC is defined as valid over any closed interval. The integral here is defined over a right-closed interval, which seemingly precludes the application of the FTC. The part of the integral, however, has such a small value (approaching 0) that the FTC can quite validly be applied. This statement will need better qualification before it is put on the page]

 

We also see that the first term can be rewritten in terms of f(x), giving:

 

We now determine f"(x), again using the product rule for derivatives:

 

which we can rewrite using the chain rule for  :

 

which we rewrite by substituting f'(x) as earlier computed

 

grouping the terms...

 

setting equal to 0, we now get

 

which we can rewrite as

 

getting f(x) alone on the left...

 

We now substitute f(x) from the definition, giving:

 

we cancel the   on both sides:

 

differentiate:

 

apply FTC and the product rule:

 

using the quotient rule: (and canceling a   both sides)

 
 

cancel   and multiply through with  

 

obtain an equation in x:

 
 
 

which can be rewritten as a quadratic equation by setting  :

 

This equation in v can be solved using the quadratic formula.

This should suffice to get you close to an answer. I'm not 100% sure what I did above was correct, so CHECK IT please. I found the discriminant of the quadratic and found that it can be negative, and indeed is for the case a=0. This tells me something might be seriously wrong.

I know this is original work, so do with it what you will, I didn't put it on the main page for just that reason. --Tjips (talk) 17:50, 10 April 2010 (UTC)Reply

When you get the equation  , you should not differentiate, because the equality after the differentiation holds only if this equation is an equivalence (i.e. true for all x) which is not the case. You search for those specific x values (probably two) for which this equality is true. Since the two values are unique in their small surroundings, the derivative of the two sides of this equation won't be equal. Also, as the left side doesn't have a closed form (it is an   function), we can't solve this equation analytically. 84.1.98.66 (talk) 23:38, 31 May 2020 (UTC)Reply

Median/mode edit

Are the median and mode left off for a reason? Honestly I don't know a lot about this distribution but even if these are difficult to calculate for some reason it would seem important to specify them somehow.

--Mcorazao (talk) 15:05, 4 May 2010 (UTC)Reply

As far as I can discover, there are no closed expressions for the mode and median.

I propose to add this, and that they can be computed

  • median from quantile(1/2)
  • mode by finding the maximum of the pdf

Paul A Bristow (talk) 14:43, 31 January 2012 (UTC)Reply

I don't think the approximate expression for the mode   can be correct. For one thing, the approximation given predicts the mode to the left of   for positive   so there is at least a sign error somewhere. Also, as  , the approximation do no go to zero. There is also no citation explaining where this approximation coming from.

Jt68 (talk) 12:01, 30 May 2018 (UTC)Reply

The formula for the PDF, beneath the graphs, appears wrong edit

In the main article it says the PDF is given by:

 

When we substitute in   for x and expand this, we get:

 

But the formula just below the graphs show:

 

The formula beneath the graphs needs to be multiplied by a factor of:

 

Numerical methods confirm this (the following is Maxima code):

pdf(x,mu,sigma,alpha):= sqrt(2/%pi)/sigma * %e^( -(x-mu)^2/(2*sigma^2)) * quad_qagi(%e^(-t^2/2)/(sqrt(2)*sqrt(%pi)),t,minf, alpha*(x-mu)/sigma )[1];

quad_qags(pdf(x,-1,1,10),x,-20,20); —Preceding unsigned comment added by [[User:{{{1}}}|{{{1}}}]] ([[User talk:{{{1}}}|talk]] • [[Special:Contributions/{{{1}}}|contribs]])

It seems to me from numerical calculation - for a normal distribution - that there's a 1/omega factor missing in the pdf with omega and xi, i.e. it should be:
 
i get this from using 'normpdf' and 'normcdf' in octave 3.0.1. Boud (talk) 20:11, 5 October 2010 (UTC)Reply

Agreed, a factor   is missing. —Preceding unsigned comment added by 84.160.163.108 (talk) 13:28, 13 October 2010 (UTC)Reply

I agree with the current main text, but I think the factor   from the original comment is still missing from the formula under the graphs. Bmelinden (talk) 13:12, 29 August 2016 (UTC)Reply

alpha not really in (-1,1) or is it?! edit

I just got confused with the part that "the skewness of the distribution is limited to the interval (-1,1)". This contradicts with the plots where α has values 4 and -4 and also with information that the first external link points to, where Adelchi Azzalini writes what happens if α goes to infinity. I think that part on the page is either written in a confusing way or simply wrong. Herdtien (talk) 15:16, 11 October 2010 (UTC)Reply

Notice, the difference between the skewness of the distribution and its shape parameter. —Preceding unsigned comment added by 84.160.163.108 (talk) 13:07, 13 October 2010 (UTC)Reply

Skew left vs skew right: Contradiction in text versus figure edit

The statement in the text that "The distribution is left skewed if α > 0 and is right skewed if α < 0" is contradicted by the probability density graph (at top right of article), which shows a heavier left tail if alpha<0. I have not taken the time to work through the math and obtain a full understanding of this distribution. But since this sentence seems to contradict the graph, I will shortly be so bold as to alter the sentence to read: "The distribution is right skewed if α > 0 and is left skewed if α < 0". My thanks to those who have developed and maintained these pages. If my changes are erroneous, please sort out the issue and make the corrections needed to bring both the text and the figure in line with the truth about the distribution. Thank you. --Presearch (talk) 22:52, 31 October 2018 (UTC)Reply

Defining delta and gamma. edit

This article has a circular "definition" or absence thereof for   and   (sample skew? why the 1?). In the column on the right, there is a formula for   and a link to another article, but the formula involves   and the linked article has no reference to either of them. There are multiple definitions of skew, so which one is   referring to? The statement maximum theoretical skewness does elude to yet another formula for  , but again with no references or formality.

In its current state, trying to compute alpha for any real data set is not possible. — Preceding unsigned comment added by 2001:14BA:5054:D300:D09C:99DD:9096:8C3 (talk) 09:32, 21 February 2022 (UTC)Reply