Modelling the Distribution of Anthropometrics: Gaussian Distributions, LMS Distributions, and Probability Plots
dc.contributor.author | Thornburg, Jonathan | |
dc.contributor.author | Vitzthum, Virginia J. | |
dc.date.accessioned | 2024-05-20T21:09:01Z | |
dc.date.available | 2024-05-20T21:09:01Z | |
dc.date.issued | 2024-05-20 | |
dc.description | An earlier version of this poster was presented (online) at the 49th annual meeting of the Human Biology Association, held at Los Angeles, USA, 2024 March 20-23. | |
dc.description.abstract | In anthropologists' studies of growth and development, particularly in non-industrialized populations, sample sizes are often small and span a range of ages. These attributes can hamper analyses and hypothesis testing, and make comparisons to other populations difficult. $z$-scores are a commonly used statistic to mitigate these challenges. A $z$-score is the value of an individual's anthropometric ($y$) expressed in units of the standard deviation (SD) of the anthropometric for a suitable sex- and age-specific reference sample. That is, $z = (y - \mu)\big/\sigma$ where $\mu$ and $\sigma$ are the mean and SD of the reference sample, respectively. Depending on the research question, commonly used reference samples (e.g., WHO, CDC) are not necessarily suitable for all populations. Therefore, there are increasing efforts to construct population-specific growth references. If a growth reference provides the mean and SD for each age/sex bin, it's easy to compute the $z$-score corresponding to any individual's measurement by assuming a Gaussian (normal) distribution. $z$-scores may be computed by either using the mean/SD for the individual's sex/age bin, or (for improved accuracy) interpolating tabulated means/SDs to the individual's age. However, if the anthropometric has an asymmetric (skewed) distribution (as do weight, BMI, and many skinfolds), this approach results in systematically biased $z$-scores. Cole's LMS distribution can accurately represent the distributions of these and many other anthropometrics, avoiding this bias. But sometimes only percentiles are provided for each age/sex bin. We describe how to: (a) determine the mean/SD of the Gaussian distribution, or the coefficients of the LMS distribution, that best fit the published percentiles; (b) visually assess the quality of such a fit; and (c) extrapolate the distribution beyond the range of the published percentiles and visually assess the quality of such an extrapolation. We describe doing this fitting with common open-source software (Gnuplot, R, or SciPy (Python)), or with Microsoft Excel\texttrademark. The fitted coefficients can then be used to compute $z$-scores. We also describe how to extrapolate parameters (and thus compute $z$-scores) for an individual who is outside the tabulated age range, and we present a graphical assessment of any given extrapolation's quality. | |
dc.identifier.citation | American Journal of Human Biology, volume 36 supplement (February 2024), page 51 | |
dc.identifier.uri | https://hdl.handle.net/2022/29773 | |
dc.language.iso | en_US | |
dc.publisher | Human Biology Association | |
dc.relation.isversionof | https://doi.org/10.1002/ajhb.24049 | |
dc.rights | CC BY-NC | |
dc.rights.uri | https://creativecommons.org/licenses/by-nc/4.0/ | |
dc.subject | anthropometrics | |
dc.subject | reference sample | |
dc.subject | z-score | |
dc.subject | Gaussian distribution | |
dc.subject | LMS distribution | |
dc.subject | probability plot | |
dc.subject | fitting | |
dc.subject | age interpolation | |
dc.subject | age extrapolation | |
dc.subject | growth reference | |
dc.title | Modelling the Distribution of Anthropometrics: Gaussian Distributions, LMS Distributions, and Probability Plots | |
dc.type | Presentation |
Files
Original bundle
1 - 1 of 1
Collections
Can’t use the file because of accessibility barriers? Contact us with the title of the item, permanent link, and specifics of your accommodation need.