Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

A validation of the PAWPER XL-MAC tape for total body weight estimation in preschool children from low- and middle-income countries

Abstract

Importance

The PAWPER tape system is one of the three most accurate paediatric weight estimation systems in the world. The latest version of the tape, which does not rely on a subjective assessment of habitus, is the PAWPER XL-MAC method which uses length and mid-arm circumference (MAC) to estimate weight. It was derived and validated in a population in the USA and has not yet been fully validated in a population from a resource-limited setting.

Objective

The objective of this study was to evaluate the performance of the PAWPER XL-MAC tape weight estimation system in a large dataset sample of children from resource-limited settings.

Methods

This was a “virtual” study in which weight estimates were generated using the PAWPER XL-MAC tape and Broselow tape 2007B and 2011A editions in a very large open access dataset. The dataset contained anthropometric information of children aged 6 to 59 months from standardised nutritional surveys in 51 low- and middle-income countries. The performance of PAWPER XL-MAC method was compared with the Broselow tape and a new length- and habitus-based tape, the Ralston method.

Main outcomes and measures

The bias of the weight estimation methods was assessed using the mean percentage error (MPE) and precision using the 95% limits of agreement (LOA) of the MPE. The overall accuracy was denoted by the percentage of weight estimates falling within 10% and 20% of actual weight (abbreviated as p10 and p20 respectively).

Results

The MPE (LOA) for the PAWPER XL-MAC tape, the Broselow 2007B and 2011A and Ralston method were 1.9 (-15.3, 19.2), 5.4 (-15.9, 26.7), 7.7 (-13.3, 30.5) and -0.7 (-20.2, 19.3) respectively. The p10 and p20 for each method were 79.3% and 96.9% for the PAWPER XL-MAC tape, 64.3% and 91.0% for the Broselow tape 2007B, 55.5% and 85.9% for the Broselow tape 2011A and 67.4 and 94.0% for the Ralston method respectively. The PAWPER XL-MAC system was statistically significantly more accurate than the Broselow tape 2011A, the Broselow tape 2007B and the Ralston method. The relative difference in accuracy (p10) was 43% (odds ratio 4.4 (4.4, 4.5), p<0.001), 23% (odds ratio 2.9 (2.8, 2.9), p<0.001) and 18% (odds ratio 1.8 (1.8, 1.8), p<0.001) compared to each method, respectively.

Conclusions and relevance

The PAWPER XL-MAC tape performed well in this study and was statistically significantly more accurate than both the Broselow tape editions and the Ralston method. This difference was substantial and clinically important. The tape did not perform as well at extremes of habitus-type, however, and might benefit from recalibration.

Introduction

The importance of weight estimation

There are two sets of circumstances under which a child’s bodyweight must be estimated: during emergency care when a child cannot be weighed even if a scale is available and in resource-limited settings where a scale might not be available at all [1, 2]. In both these examples, a weight estimation system that could estimate weight accurately would be ideal to allow for the correct calculation of drug doses [3].

The PAWPER tape is one of the three most accurate weight estimation systems in the world today, along with the Mercy method and parental estimates of weight [4, 5]. The PAWPER tape system and the Mercy method are both dual length- and habitus-based methods which have consistently been shown to be more accurate than one-dimensional systems. The PAWPER tape has also proven to be accurate both in high-income countries as well as low- and middle-income countries [5, 6].

The PAWPER tape systems

There are three versions of the PAWPER tape system: the original PAWPER tape (developed in 2009), the PAWPER XL tape (developed in 2014) and the PAWPER XL-MAC tape (developed in 2016)—see Fig 1 for a description of the PAWPER XL-MAC tape system [3, 7, 8]. The original PAWPER tape and the PAWPER XL tape both make use of body length and a visual assessment of habitus to allow a weight estimate to be read directly off the tape [7]. The PAWPER XL tape is longer than the original tape (180cm vs 145cm) and has additional capabilities to produce weight estimates in obese children (seven habitus categories vs five) [3]. The PAWPER XL-MAC tape is a completely objective system which makes use of mid-arm circumference (MAC) to define habitus instead of relying on a visual assessment of habitus [8]. Each length-segment of the tape contains MAC cut-off values which define the habitus category and allow the weight to be read directly off the tape.

thumbnail
Fig 1. The PAWPER XL-MAC tape system.

There are three steps to the use of the PAWPER XL-MAC tape. The first step is to measure the length of the child from his/her head to his/her heel (Panels A and B). The user thus identifies into which length segment the child falls. The second step is to measure the child’s mid-arm circumference using the tape (Panel C). The final step is to read off the estimated weight in the appropriate length segment based on the measured mid-arm circumference (Panel D). The user should be familiar and well-practised with the tape before using it in a resuscitation situation.

https://doi.org/10.1371/journal.pone.0210332.g001

The PAWPER XL-MAC tape was developed and validated using a combined National Health and Nutrition Examination Survey (NHANES) survey dataset from the USA [8]. Although the tape performed very well in children of all weight categories, it would be of great value to further evaluate the system in a population of much younger children from low- and middle-income countries.

The Ralston tape was designed following a study using this dataset [9]. It is essentially a (still theoretical) device with three side-to-side tapes: one to estimate the weight of “normal” children (MAC>125mm), one for “moderately wasted” children (MAC from 115 to 125mm) and one for “severely wasted” children (MAC<115mm). It was shown to be more accurate than the Broselow tape but has not been compared to other contemporary dual length- and habitus-based systems.

Objectives

The primary objective of this study was to evaluate the accuracy of the PAWPER XL-MAC system in this large dataset of children from low- and middle-income countries. The secondary objective was to compare the performance of the PAWPER XL-MAC system with that of the Broselow tape (the existing “gold standard”) and the tape proposed by Ralston et al [9].

Methods

Study design

This was a retrospective, observational, “virtual” weight estimation study (a virtual study is one in which weight estimations are calculated from a database of anthropometric measurements, rather than a study in which a weight estimation system is directly applied to an individual child). The performance of the PAWPER XL-MAC system was evaluated using data from a very large, recently-published, open-access dataset [9]. This dataset is comprised of anthropometric data for children aged 6 to 59 months derived from growth studies conducted in multiple low- and middle-income countries.

Dataset

The dataset is available at https://doi.org/10.6084/m9.figshare.6026504.v1. A full description of the dataset can be found in the article by Ralston et al from which it was obtained [9]. The dataset contains data from 2,434 nutritional anthropometric surveys from 51 low- and middle-income countries. The data was collected over a period of 25 years, from August 1992 to May 2017 and contains information on 1,717,172 children aged from 6 to 59 months. The number of data points differed slightly from the study by Ralston et al as some of the shared data required permissions from the United Nations High Commissioner for Refugees and was not included in this study (83,150 children). The data included the country of origin, year of collection, sex, measured weight, length (or height), mid-arm circumference, weight-for-age Z-score, height-for-age Z-score and weight-for-age Z-score. The data collection methodology was consistent across the surveys. Weight, height, and MAC measurements were standardized and performed by teams of trained researchers.

Ethics

Ethical approval and informed consent was obtained for each individual surveys as described by Ralston et al [9]. The source data were only collected after ethical approval was obtained from locally responsible ethics committees. When no such ethics committees were functioning, ethical approval was acquired from the institutional review bodies of the non-governmental organization which collected the data. In addition, permissions were obtained from the local ministries of health and, where appropriate, other governmental organizations.

Participation in the surveys was strictly voluntary. In all surveys, the consent procedure was approved by institutional review boards and informed consent was obtained from the primary caregiver of the child. Identifying data were removed before the dataset was made available online.

The image of the child model in Fig 1 was included with written informed consent (as outlined in PLOS consent form) to publish this illustration.

Data generation

Although the weight-for-age, height-for-age and weight-for-height z-scores were available in the dataset, the BMI and BMI-for-age Z-scores were not. These were calculated for each child using a custom-designed spreadsheet formula based on the WHO BMI-for-age growth chart data.

PAWPER XL-MAC weight estimations were generated from the dataset using the available measurements of length (or height) and mid-arm circumference for each child. The cut-off values of mid-arm circumference for each length-division of the PAWPER XL-MAC tape have been published previously [8]. These values were incorporated into an excel formula which then automatically calculated the estimated weights. The excel formula can be found in the Supplementary material (S1 file). In order to provide a standard against which to compare the PAWPER XL-MAC system, a similar process was followed to obtain weight estimations using two versions of the Broselow tape (the 2007B and 2011A editions), using the length data only. The length-segment data for the Broselow tape was obtained from a recent systematic review on the tape [10]. The data on the performance of the Ralston method was obtained from the paper in which it was described [9].

Data analysis

The data analysis followed the recommended methods for weight estimation studies which focus on evaluating the bias, precision and overall accuracy of the weight estimation systems [5]. Bias was determined using the mean percentage error (MPE), shown in Formula 1 below.

(1)

A negative value would thus be indicative of an underestimation of weight. The precision was determined using the Bland & Altman 95% limits of agreement of the MPE as well as the root mean square percentage error (RMSPE), shown in Formulae 2 and 3 below. (2) where MPE represents mean percentage error. (3) where PE represents percentage error and n the sample size.

Percentage error calculations were preferred over absolute values (in kg). The true, measured weights ranged from 3.3kg to 25kg in the dataset, almost a 10-fold difference in weight. The implication of an error of 1kg is thus significantly different for a 3kg infant compared to a 25kg 5-year-old child. Therefore, the mean bias and limits of agreement (in kg) of untransformed or unscaled data is not statistically or clinically useful. Either logarithmically transformed data or percentage error data is more useful but percentage error data is more intuitive and easier to interpret. The overall accuracy of each weight estimation system was determined by calculating the proportion of weight estimations falling within 10% and 20% of actual, measured weight (p10 and p20 respectively).

Outcome measures

The overall accuracy (represented by the p10 and p20 data) was used as the primary outcome measure as it best reflects the global performance of the weight estimation systems. It also most closely reflects the implications of the weight estimation on drug dosing accuracy. An acceptable outcome, based on previous studies, was a p10 of 70% and a p20 of 95% [5]. While the measures of bias and precision were evaluated, they are more statistically useful for refining and calibrating a weight estimation system and offer less intuitive information than overall accuracy.

Subgroup analyses

The data was analysed according to subgroups of age, weight and BMI-for-age z-score. The age subgroups used were 6 to 12 months, 13 to 24 months, 25 to 36 months, 37 to 48 months and 49 to 59 months. The weight subgroups were ≤10kg, 10.1 to 15kg and >15kg. The BMI-for-age subgroups used were Z≤-2.0 (underweight), -2.0<Z≤-1.4 (thin), -1.4<Z<1.4 (“normal” weight), 1.4≤Z<2.0 (overweight) and Z≥2.0 (obese). Since BMI-for-age subgroup data was not available for the Ralston method, subgroup analyses and comparisons were performed using their published weight-for-height data. The consistency of the outcomes was also evaluated across the regions represented in the dataset.

Comparisons between systems

In order to facilitate comparisons of accuracy between the PAWPER and Ralston methods, especially in subgroups, it was necessary to impute p10 and p20 data for the Ralston method. This was done by calculating these data using Formula 4 below. This formula is accurate in normally distributed data, as was true in this case. (4) where p10 represents the proportion of weight estimates falling within 10% of actual weight, x is the mean actual weight of the sample, μ is the mean estimate error and σ is the standard deviation of the mean estimate error.

Where statistical comparisons between the weight estimation systems were considered necessary, the paired t-test was used for comparisons of MPE and RMSPE and the McNemar test was used for paired comparisons of p10 and p20. A difference of more than 10% between any parameter was considered to be clinically or operationally important. Given the very large size of the dataset, and the use of multiple statistical tests to compare weight estimation methods, a 0.1% significance level (p<0.001) was used throughout to denote statistical significance and reduce the likelihood of type I error. The effect sizes were quantified using odds ratios with 95% confidence intervals.

Software

Microsoft excel (Microsoft Excel for Mac version 16.14.1) and Graphpad Prism (GraphPad Prism version 8.00 for Mac, GraphPad Software, La Jolla California USA, www.graphpad.com) were used for all data management and statistical analysis.

Results

A total of 1,717,172 children in the dataset had data available to produce a weight estimation by the PAWPER XL-MAC system as well as two Broselow tape editions and were included in the study.

Demographic data

A description of the demographic data of the sample population from the dataset can be found in Table 1.

thumbnail
Table 1. Basic demographic information of the children in the dataset.

https://doi.org/10.1371/journal.pone.0210332.t001

The distribution of children according to WHO weight-for-height, height-for-age and weight-for-age Z-score classification is shown in Table 2. The majority of children were classified as “normal” (i.e. without severe malnutrition, major wasting or stunting), but there were sufficient numbers of children in each category to test the weight estimation systems. According to the BMI-for-age classification, 23.1% of the children were underweight (Z≤-2.0), 13.2% were “thin” (-2.0<Z≤-1.4), 59.7% were of normal weight (-1.4<Z<1.4), 2.5% were overweight (1.4≤Z<2.0) and 1.5% were obese (Z≥2.0).

thumbnail
Table 2. Children classified according to normal and abnormal growth or body weight.

https://doi.org/10.1371/journal.pone.0210332.t002

The overall results of the assessment of the performance of the three weight estimation systems are shown in Figs 2 and 3. The PAWPER XL-MAC system was statistically significantly more accurate than the Broselow tape 2011A, the Broselow tape 2007B and the Ralston method. The absolute and relative differences in accuracy (p10) were 24% and 43% (odds ratio 4.4 (4.4, 4.5), p<0.001) compared to the Broselow tape 2011A, 15% and 23% (odds ratio 2.9 (2.8, 2.9), p<0.001) compared to the Broselow tape 2007B and 12% and 18% (odds ratio 1.8 (1.8, 1.8), p<0.001) compared to the Ralston method. These differences were all clinically relevant. The details of the statistical analyses for the subgroup comparisons can be found in the Supplementary material (S1 Table).

thumbnail
Fig 2. Analyses of the overall accuracy of the weight estimation systems.

The chart shows the proportion of weight estimations falling within 10% and 20% of actual weight (p10 and p20 respectively) as well as the proportion of critical weight estimation errors (>20% error). The data for the Ralston method was obtained from the published study [9]. The McNemar test was significant at the p<0.001 level for every comparison of p10 and p20. The PAWPER XL-MAC method’s p10 beached the 10% improvement criterion when compared with the other methods. The p10 of the Broselow 2011A was clinically inferior to all other methods.

https://doi.org/10.1371/journal.pone.0210332.g002

thumbnail
Fig 3. Analyses of the bias and precision of the weight estimation systems.

The forest plot shows the overall bias (the black circles indicate the mean percentage error), as well as the precision (the whiskers indicate the Bland & Altman 95% limits of agreement) for each system. The green shaded area denotes an acceptable MPE (within ±5%), while the dashed lines indicate an acceptable range for the 95% LOA (within ±20%). The paired t-test was significant at the p<0.001 level for every comparison of MPE and RMSPE. The MPE of the PAWPER XL-MAC and Ralston methods were clinically superior to the Broselow tape methods. The precision (quantified using the RMSPE) of the PAWPER XL-MAC method was clinically superior to all other methods.

https://doi.org/10.1371/journal.pone.0210332.g003

Figs 4, 5 and 6 show the accuracy outcome data for each system according to BMI-for-age weight status (normal weight, underweight and overweight/obese).

thumbnail
Fig 4. Accuracy of the three systems in “normal weight” children.

The chart shows the proportion of weight estimations falling within 10% and 20% of actual weight (p10 and p20 respectively) as well as the proportion of critical weight estimation errors (>20% error) for children with a BMI-for-age Z-score between -1.4 and 1.4. This subgroup data was not available for the Ralston method as it information was not presented in the original publication. The McNemar test was significant at the p<0.001 level for every comparison of p10 and p20. However, the differences were not clinically important.

https://doi.org/10.1371/journal.pone.0210332.g004

thumbnail
Fig 5. Accuracy of the three systems in underweight children.

The chart shows the proportion of weight estimations falling within 10% and 20% of actual weight (p10 and p20 respectively) as well as the proportion of critical weight estimation errors (>20% error) for children with a BMI-for-age Z-score less than -1.4. This subgroup data was not available for the Ralston method as it information was not presented in the original publication. The McNemar test was significant at the p<0.001 level for every comparison of p10 and p20. The PAWPER XL-MAC method p10 and p20 were clinically superior to both versions of the Broselow tape and the Broselow 2007B was clinically superior to the Broselow 2011A.

https://doi.org/10.1371/journal.pone.0210332.g005

thumbnail
Fig 6. Accuracy of the three systems in overweight and obese children.

The chart shows the proportion of weight estimations falling within 10% and 20% of actual weight (p10 and p20 respectively) as well as the proportion of critical weight estimation errors (>20% error) for children with a BMI-for-age Z-score greater than 1.4. This subgroup data was not available for the Ralston method as it information was not presented in the original publication. The McNemar test was significant at the p<0.001 level for every comparison of p10 and p20. The PAWPER XL-MAC method p10 and p20 were clinically superior to both versions of the Broselow tape. The p10 of the Broselow 2011A was clinically superior to the Broselow 2007B, but the p20 of the Broselow 2007B was clinically superior to the Broselow 2011A.

https://doi.org/10.1371/journal.pone.0210332.g006

Tables 36 show the results of the subgroup analyses of the performance of the weight estimation systems according to sex, age, weight and weight status.

thumbnail
Table 3. Weight estimation performance by subgroups of sex.

https://doi.org/10.1371/journal.pone.0210332.t003

thumbnail
Table 4. Weight estimation performance by subgroups of age.

https://doi.org/10.1371/journal.pone.0210332.t004

thumbnail
Table 5. Weight estimation performance by subgroups of weight.

https://doi.org/10.1371/journal.pone.0210332.t005

thumbnail
Table 6. Weight estimation performance by subgroups of weight status.

https://doi.org/10.1371/journal.pone.0210332.t006

The subgroup analyses of accuracy using weight-for-height cutoffs for the PAWPER XL-MAC and the Ralston method are shown in Fig 7.

thumbnail
Fig 7. Accuracy outcomes by subgroups of weight-for-height.

The p10 data is shown in the upper panel and the p20 data in the lower panel. Children with a weight-for-height Z-score of ≥-2.0 were categorised as “normal”, between -2.0 and -3.0 as “moderately wasted” and below -3.0 as “severely wasted”. The p10 data for the Ralston method was imputed from the MPE data. The McNemar test was significant at the p<0.001 level for every comparison of p10 and p20. We considered the PAWPER XL-MAC method p10 to be clinically superior to all the other methods in all subgroups. The Ralston method p10 was clinically superior to the Broselow tapes in wasted children. The PAWPER XL-MAC method p20 was clinically superior to all other methods in all wasted children. The Ralston method p20 was also clinically superior to the Broselow tapes in all wasted children.

https://doi.org/10.1371/journal.pone.0210332.g007

When comparing the most and least accurate systems (the PAWPER XL-MAC tape and the Broselow tape 2011 edition A), there was between a 1.4- and 80.4-fold difference in accuracy. This difference was substantially less between the PAWPER XL-MAC tape and the Ralston method ranging between 1.2- and 1.4-fold differences. The full analyses can be found in the Supplementary material (S1 Table).

There were no clinically important differences in the accuracy outcomes between the children from the geographical regions represented in the dataset (Africa, Americas, South-East Asia, Eastern Europe, Middle East and the Western Pacific).

Discussion

Performance of the PAWPER XL-MAC system

The PAWPER XL-MAC system performed well in this study and surpassed the predetermined acceptable outcome criteria. It was substantially more accurate than both editions of the Broselow tape as well as the Ralston method (which was originally developed and validated from this dataset). It was also consistently accurate across the geographical regions represented in the dataset, between boys and girls and across the spectrum of age groups and weight categories represented. The PAWPER XL-MAC system was also accurate in all habitus groups except severely underweight and obese children. Despite the lower accuracy at the extremes of habitus, it was substantially and significantly more accurate than the other methods evaluated.

With respect to its accuracy at extremes of habitus, the performance of the PAWPER XL-MAC system was not quite as good in this study as it was in the original validation study [8]. The NHANES dataset used in that study, from a USA population, was markedly different to the one used in this study, however [11]. The children were older (median 120 vs 30 months), heavier (median 40.5 vs 11.1kg) and more overweight (median BMI-for-age Z-score 0.6 vs -0.9). The proportion of “normal” or “average” weight children was also higher (70.6% vs 59.7%). These differences may account for the difference in outcomes. For this system to become truly globally accurate across the age and habitus spectrum, it will need to undergo recalibration or fine-tuning and revalidation to maximise its accuracy (see below). It will also need to be evaluated in prospective studies to determine the impact of human-factor errors and inter-rater reliability.

Performance of the Broselow tapes

The performance of the Broselow tapes in this dataset was similar to that reported in two recent meta-analyses, which found a p10 range of 50 to 60% to be common [5, 10]. In this study the Broselow tapes were accurate in children with “normal” or “average” weight but were very inaccurate in underweight and overweight/obese children. Since the Broselow tape produces an estimate that more closely approximates ideal body weight (IBW) than total body weight (TBW) this was to be expected [12]. Since IBW may far exceed TBW in underweight children, the use of IBW could result in large drug overdoses in these children [13]. An accurate estimation of TBW is required in all children, irrespective of habitus, to allow accurate drug dosing [13, 14].

The Broselow tape 2007B edition was actually more accurate than the more recent 2011A edition. This was not surprising as the 2011A was the end-result of modifications to the 2007B to reduce its underestimation of weight in children from well-nourished populations with a high prevalence of obesity. This resulted in a worsening of overestimation of weight in children from resource-limited settings. This has also been shown in previous studies from low- and middle-income countries and emphasises the concerns about using the Broselow tape in these settings [1517].

These findings further highlight the value of the modern two-dimensional (length- and habitus-based) weight estimation systems over one-dimensional (length- or age-based) systems [4]. The Broselow tape should no longer be considered as the “gold standard” in weight estimation. The presence of drug dosing information on the tape is not sufficient to counter its inaccuracy, as this information is incomplete and has not been shown to be beneficial without additional resources [10].

Performance of the Ralston method

The Ralston method performed reasonably well and was more accurate than the Broselow tapes in the whole sample. It was less accurate than the Broselow tape 2007B in children with “average” weight, but substantially more accurate in underweight children.

The Ralston method was not as accurate as the PAWPER XL-MAC method, including in the subgroups of wasted (underweight) children. Both the PAWPER XL-MAC system and the Ralston method have two categories representing underweight children, although the PAWPER uses the 15th and 5th weight-for-height centiles to define weight categories while the Ralston method uses the 1st and the 5th centiles. More importantly, the PAWPER system uses MAC cut-off values within each length-segment to define habitus while the Ralston method uses the WHO cut-off values for wasting for all children. This enables the PAWPER system to fine tune the weight estimations to a greater degree.

There was no available data on the accuracy of the Ralston method in overweight children. It is likely to be less accurate than both the Broselow 2011A tape and the PAWPER XL-MAC method as it has no mechanism for habitus modification in these children and they are grouped with the “normal” habitus. To be fair, the Ralston method was specifically designed for resource-limited settings, but it must also be remembered that there can still be a significant prevalence of overweight and obese children in low- and middle-income countries [18]. A system that can provide an accurate weight estimation for a wide range of children is therefore required. Further work on the Ralston method may produce useful information to advance length- and habitus-based weight estimation.

Other methods

The Mercy method makes use of humeral length (as a surrogate for body length) and MAC to generate a weight estimate [19]. Some previous studies have shown the Mercy method to be very accurate in populations with a high prevalence of young and underweight children [20, 21], while others have found it to be less accurate in these children [17]. Given its high level of accuracy, and the fact that it has been promoted for use in resource-limited areas where scales are not available or might be poorly calibrated, it would have been useful to compare it with the PAWPER XL-MAC and Ralston methods in this study [1]. Unfortunately, it could not be evaluated in this dataset as humeral length was not available.

The “sophistication of simplicity”

In order for a weight estimation system to be successful, it must be accurate and easy to use: the “sophistication of simplicity” [12]. The PAWPER XL-MAC system fulfils these requirements as it makes use of straightforward, reliable anthropometric parameters from which a weight estimation can be read directly off the tape, without requiring any calculations. Both the PAWPER XL tape and Mercy methods have been tested in simulated paediatric resuscitations under adverse, clinically realistic circumstances and have been shown to be resilient (i.e. maintain their accuracy) [22]. The use of MAC measurements is therefore reasonable and appropriate as part of the weight estimation process, whether it is during an emergency when a child cannot be weighed or in an environment where a scale is not available.

The need for recalibration

Although the PAWPER XL-MAC system achieved the predetermined acceptable outcome criteria, its performance in severely underweight and obese children was not as good as in other habitus types. A previous study with the original PAWPER tape in an obese USA population had similar findings, even though the tape was still the most accurate of the methods evaluated. It is not yet clear whether the inherent biological variability in the relationship between MAC and body habitus will allow an increase in weight estimation accuracy using this methodology. Nonetheless, the possibility of recalibrating the corrected weights or the MAC cut-off values in the upper habitus categories needs to be explored in future versions of the tape.

Limitations

One of the major limitations of every weight estimation study is that it is not known what degree of accuracy is required by a weight estimation system to prevent patient harm resulting from drug dosing errors. Although we regard a p10 of 70% and a p20 of 95% as an indicator of acceptable outcome, this is based on expert speculation only.

The second limitation of this study is that this was a “virtual” study with anthropometric data obtained from a database rather than from the tape actually being used in clinical practice. This provides very useful information on the potential accuracy of the weight estimation systems but does not provide evidence on human- and patient-factor errors and inter-user reliability. However, the difference between “virtual” and “in real life” testing has never been evaluated and it is not clear whether there would be a substantial difference in outcomes.

Finally, the need to impute some of the subgroup accuracy data for the Ralston method may have influenced the statistical analysis. This was not likely to have altered any of the findings to any significant degree, however.

Conclusions

The PAWPER XL-MAC tape was the most accurate of the weight estimation systems evaluated. It achieved the acceptable outcome criteria in the dataset of children from low- and middle-income countries as a whole and in all subgroups except children at the extremes of habitus. Even in these subsets it outperformed the other systems. The tape needs to be validated in prospective studies to establish whether this accuracy can be maintained in clinical practice.

The Ralston method showed promise, especially in profoundly wasted children, but needs to be evaluated further, especially to see whether its methodology can be sustained in real-world situations.

Both editions of the Broselow tape performed worse than the two-dimensional methods. The Broselow tape should no longer be regarded as a “gold standard” method.

Supporting information

S1 File. The PAWPER XL-MAC formula.

This Microsoft Excel file contains the formula that can generate a weight estimate using recumbent length and mid-arm circumference measurements based on the PAWPER XL-MAC method.

https://doi.org/10.1371/journal.pone.0210332.s001

(XLSX)

S1 Table. Outcomes of comparisons between the PAWPER XL-MAC method p10 and the other weight estimation systems.

This tables shows the details of the comparisons in accuracy (p10) between the PAWPER XL-MAC tape and the other methods evaluated.

https://doi.org/10.1371/journal.pone.0210332.s002

(DOCX)

References

  1. 1. Abdel-Rahman SM, Ridge A, Kearns GL. Estimation of body weight in children in the absence of scales: a necessary measurement to insure accurate drug dosing. Archives of Disease in Childhood. 2014;99(6):570–574. pmid:24573883
  2. 2. Luten R, Wears RL, Broselow J, Croskerry P, Joseph MM, Frush K. Managing the unique size-related issues of pediatric resuscitation: reducing cognitive load with resuscitation aids. Academic Emergency Medicine. 2002;9(8):840–7. pmid:12153892
  3. 3. Wells M, Goldstein L, Bentley A. A validation study of the PAWPER XL tape: accurate estimation of both total and ideal body weight in children up to 16 years of age. Trauma and Emergency Care. 2017;2(4):1–8.
  4. 4. Young KD, Korotzer NC. Weight estimation methods in children: a systematic review. Annals of Emergency Medicine. 2016;68(4):441–451 e10. pmid:27105839
  5. 5. Wells M, Goldstein L, Bentley A. The accuracy of emergency weight estimation systems in children—a systematic review and meta-analysis. International Journal of Emergency Medicine. 2017;10(29):1–43. https://doi.org/10.1186/s12245-017-0156-5.
  6. 6. Wells M, Goldstein L, Bentley A. A systematic review and meta-analysis of the accuracy of weight estimation systems used in paediatric emergency care in developing countries. African Journal of Emergency Medicine. 2017;7:S36–S54. pmid:30505673
  7. 7. Wells M, Coovadia A, Kramer E, Goldstein L. The PAWPER tape: A new concept tape-based device that increases the accuracy of weight estimation in children through the inclusion of a modifier based on body habitus. Resuscitation. 2013;84(2):227–232. pmid:22796544
  8. 8. Wells M, Goldstein L, Bentley A. Development and validation of a method to estimate bodyweight in critically ill children using length and mid-arm circumference measurements—the PAWPER XL-MAC system. South African Medical Journal. 2017;107(11):1015–1021. pmid:29262946
  9. 9. Ralston ME, Myatt MA. Weight estimation for children aged 6 to 59 months in limited-resource settings: A proposal for a tape using height and mid-upper arm circumference. PLoS One. 2018;13(6):e0197769. pmid:29879127
  10. 10. Wells M, Goldstein L, Bentley A, Basnett S, Monteith I. The accuracy of the Broselow tape as a weight estimation tool and a drug-dosing guide—a systematic review and meta-analysis. Resuscitation. 2017;121:9–17. http://dx.doi.org/10.1016/j.resuscitation.2017.09.026. pmid:28958796
  11. 11. Centers for Disease Control and Prevention (CDC). National Center for Health Statistics (NCHS). Hyattsville, MD: U.S. Department of Health and Human Services, Centers for Disease Control and Prevention, 1999–2016.: National Health and Nutrition Examination Survey Data; 2016. Available from: https://wwwn.cdc.gov/nchs/nhanes/continuousnhanes/default.aspx
  12. 12. Luten R, Zaritsky A. The sophistication of simplicity … optimizing emergency dosing. Academic Emergency Medicine. 2008;15(5):461–465. pmid:18439202
  13. 13. Wells M, Goldstein L. Optimising emergency weight estimation in underweight and obese children: the accuracy of estimations of total body weight and ideal body weight. European Journal of Emergency Medicine. 2018; In Press.
  14. 14. Rowe S, Siegel D, Benjamin DK Jr., Best Pharmaceuticals for Children Act—Pediatric Trials Network Administrative Core C. Gaps in drug dosing for obese children: a systematic review of commonly prescribed emergency care medications. Clinical Therapeutics. 2015;37(9):1924–1932. pmid:26323523
  15. 15. Clark MC, Lewis RJ, Fleischman RJ, Ogunniyi AA, Patel DS, Donaldson RI. Accuracy of the Broselow Tape in South Sudan, "The Hungriest Place on Earth". Academic Emergency Medicine. 2016;23(1):21–28. pmid:26671318
  16. 16. Ralston ME, Myatt MA. Weight estimation tool for children aged 6 to 59 months in limited-resource settings. PLoS One. 2016;11(8):e0159260. pmid:27529816
  17. 17. Georgoulas V, Wells M. The PAWPER tape and the Mercy Method outperform other methods of weight estimation in children in South Africa. South African Medical Journal. 2016;106(9):933–939. pmid:27601122
  18. 18. Global Burden of Disease Study 2013. Global Burden of Disease Study 2013 (GBD 2013) Obesity Prevalence 1990–2013. Seattle, United States: Institute for Health Metrics and Evaluation (IHME); 2014.
  19. 19. Abdel-Rahman SM, Ridge AL. An improved pediatric weight estimation strategy. Open Medical Devices Journal. 2012;4:87–97.
  20. 20. Dicko A, Alhousseini ML, Sidibé B, Traoré M, Abdel-Rahman SM. Evaluation of the Mercy weight estimation method in Ouelessebougou, Mali. BMC Public Health. 2014;14:270. pmid:24650051
  21. 21. Batmanabane G, Kumar Jena P, Dikshit R, Abdel-Rahman SM. Using the Mercy Method for weight estimation in Indian children. Global Pediatric Health. 2015; Jan-Dec:1–7.
  22. 22. Wells M, Goldstein L, Bentley A. The accuracy of paediatric weight estimation during simulated emergencies: the effects of patient position, patient cooperation and human errors. African Journal of Emergency Medicine. 2018;8(2):43–50. pmid:30456146