Talk:Data binning

Latest comment: 3 years ago by Barcodeplane

I'm no expert on data binning, although an academic statistician. It seems to me that at least the comparison with a histogram is wrong. They form completely different notions. Data binning seems to me to be a kind of smoothing method. Nijdam (talk) 21:56, 22 July 2010 (UTC)Reply

I am currently studying data binning in the context of PET image reconstruction and it seems that it is extensively used (SSRB Single Slice Rebinning, MSRB, Fourier Rebinning (FORE), etc). Seems to be too much in-depth for such an article, at least without first explaining some key concepts. Will add anyway unless told otherwise. --Garuh knight (talk) 23:09, 28 June 2011 (UTC)Reply
So, normally ``binning" is just making a histogram, but there are more advanced methods which involve some type of data smoothing. One simple way (which I've seen called ``Lever smoothing" is to split a data point between two adjacent bins based on how far it is from the midpoint of each bin. Right now I'm trying to find information on how to implement ``Gaussian binning" (my term), which is where you bin data into bins weighted by a Gaussian. It is definitely a technique used, but all the information on it seems to be buried in technical publications. If that could be described in the article it would be a very valuable contribution. Danski14(talk) 18:43, 28 October 2014 (UTC)Reply
LightGBM by Microsoft and its scikit-learn similar Histogram-based Gradient Boosting algorithm are applications of binning too. --Dnavarro (talk) 15:31, 18 December 2019 (UTC)Reply

Without a supporting reference, I am not so sure about the comments regarding binning as being useful for GC/MS or NMR spectra. Barcodeplane (talk) 16:03, 7 May 2020 (UTC)Reply