Published on in Vol 15, No 4 (2013): April

How Valid are Web-Based Self-Reports of Weight?

How Valid are Web-Based Self-Reports of Weight?

How Valid are Web-Based Self-Reports of Weight?

Original Paper

1Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden

2Clinical Epidemiology Unit, Department of Medicine, Karolinska Institutet, Stockholm, Sweden

Corresponding Author:

Stephanie Erika Bonn, MSc

Department of Medical Epidemiology and Biostatistics

Karolinska Institutet

Nobels väg 12a

Stockholm, SE-17177

Sweden

Phone: 46 852482298

Fax:46 8314975

Email: stephanie.bonn@ki.se


Background: Many studies rely on self-reported anthropometric data. While paper-based self-reports have been the standard collection mode, the number of studies collecting self-reported data via the Web is increasing rapidly. Although numerous studies have shown good agreement between self-reported and measured weight using paper-based questionnaires, the validity of using the Web to inquire about weight is unknown.

Objective: The objective of this study was to validate Web-based self-reports of bodyweight compared to weight measured at the study center.

Methods: The validity of weight self-reported via the Web was assessed by comparing self-reports against measurements of weight in a convenience sample of 149 individuals (77.2% women, 115/149), aged 20-65 years. Study participants self-reported their weight via a Web-based questionnaire and thereafter had their weight measured in the research center.

Results: The Spearman correlation coefficient between self-reported and measured weight was 0.98 (P<.001). The mean difference between self-reported and measured weight was -1.2 (SD 2.6) kg. There was a statistically significant difference between self-reported and measured weight with the self-reported being lower (P<.001). Subjects with a body mass index (BMI) ≥25 kg/m2, and subjects ≥30 years of age, under-reported their weight statistically significantly more than subjects with a BMI <25 kg/m2, and subjects <30 years of age, respectively.

Conclusions: Our results show that self-reported weight via the Web can be a valid method of data collection.

J Med Internet Res 2013;15(4):e52

doi:10.2196/jmir.2393

Keywords



Many studies rely on self-reported data on anthropometric variables. This is more practical and less expensive to collect, than measured data, especially when handling a large sample size. Nonetheless, self-reported data is limited by information bias [1]. Considering the increasing number of studies collecting Web-based data, there is a need to determine the validity of Web-based self-reports of body weight.

It has been suggested that using the Internet creates a distance between the investigator and the subject, as a self-reported questionnaire increases the anonymity of the subject responding. This makes it easier for the subject to answer personal questions truthfully and encourages reporting of uncensored personal information [2]. Sensitive or stigmatized behaviors have been reported more truthfully in self-administered surveys compared to interview-conducted surveys [3,4]. A previous study investigating the validity of self-reported weight by telephone interview and a paper questionnaire found that overweight and obese women reported weight more accurately in the questionnaire [5]. Heavy drinking of alcohol is another example of a stigmatized behavior where a Web-based questionnaire captured more heavy drinkers compared to telephone interviews [6].

While many studies have evaluated paper-based self-reports of weight, it is unclear how valid Web-based self-reports are. The aim of this study was to validate Web-based self-reports of weight against weight measurements made by the study personnel at a research center.


Study Design

Individuals between the ages of 20 and 65 years were recruited for participation during the spring of 2009 by public advertisements in Stockholm County, Sweden. The primary aim of the study was to validate methods assessing diet and physical activity. Access to the Internet and an email address were the requirements for participation. Subjects who were on any form of weight alteration diet, pregnant, or had given birth during the 10 months prior to the start of the study were excluded from participation. Participants were provided with information about the study and gave their written informed consent prior to the start of the study. The study design was previously described in detail by our research group [7]. The study was approved by the research ethics committee at Karolinska Institutet, Stockholm, Sweden.

In total, 179 subjects were included for participation in the study. Out of them, 150 subjects applied via the Web by filling in a questionnaire with personal information, including height and current weight. The questionnaire was downloaded from the Web, filled out by applicants on the computer, and thereafter sent via email to the researchers. There were no instructions regarding clothing or of subjects having to weigh themselves specifically for the questionnaire. Subjects were not aware that their weight was going to be measured at the start of the study when applying for participation. The remaining 29 subjects applied via personal contact or regular paper mail and were not included in the analysis of the present study. Approximately 2 months after applying for participation, subjects attended an introductory meeting and had their weight measured by study personnel. Subjects wore indoor clothing and no shoes during weight measurements. Measurements were made using a digital scale displaying weight to a tenth of a kg. All measurements were made by the same study personnel using the same scale. One subject out of the 150 who applied via the Web had no data on measured weight at the study center and was excluded from further analyses. Thus, 149 subjects were included in statistical analyses.

Statistical Methods

Characteristics of the study participants are presented as descriptive statistics. Results of self-reported weight and height, measured weight, and differences between assessments, are reported as mean values and standard deviations. Potential differences between men and women, subjects <30 and ≥30 years of age, and subjects with a body mass index (BMI) <25 and ≥25 kg/m2, were assessed using paired t tests for continuous variables and chi-square tests for categorical variables. We also performed multivariate linear regression controlling for sex, age (<30 or ≥30 years), BMI (<25 or ≥25 kg/m2), education, and smoking. The degree of association between self-reported and measured weight was assessed using Spearman and intraclass correlation coefficients [8]. Because systematic differences between assessments cannot be detected using Spearman correlation coefficients, absolute agreement between the assessments was determined by using the Bland-Altman technique [9]. The difference between self-reported and measured weight (y-axis) was plotted against the mean of the two assessments (x-axis). The significance level was set to P<.05. All analyses were performed using STATA 11.1


Characteristics of subjects included in analyses are presented in Table 1. The majority of subjects were female (77.2%, 115/149), <30 years of age (51.7%, 77/149), and had a BMI <25 kg/m2 (76.5%, 114/149). Men reported using Swedish snuff, a moist form of snuff, to a higher extent than women (P=.000). No other statistically significant differences were found between men and women, or between subjects with a BMI <25 or ≥25 kg/m2.

The Spearman correlation coefficient between self-reported weight via the Web and weight measured by study personnel was 0.98 (P<.001) for all subjects (Figure 1). The Spearman correlation coefficients for men and women were 0.89 (P<.001) and 0.97 (P<.001) respectively. For subjects with a BMI <25 kg/m2 and ≥25 kg/m2, the Spearman correlation coefficients between self-reported and measured weight were 0.97 (P<.001) and 0.98 (P<.001) respectively. Intraclass correlation coefficients were almost identical to the Spearman correlation coefficients (results not shown). There was a statistically significant difference between self-reported and measured weight in the whole group (P<.001), and in women (P<.001). There was a borderline statistical difference in men (P=.066). Further, there were statistically significant differences between self-reported and measured weight independent of BMI (<25 kg/m2, P<.001 and ≥25 kg/m2, P=.002).

Figure 2 shows a Bland-Altman plot graphically illustrating differences between self-reported and measured weight among all subjects, the mean difference between self-reported and measured weight was -1.2 kg (SD 2.64). The plot shows good agreement between the two methods. However, the trend in the plot indicates increased under-reporting of self-reported weight with increasing body weight.

Table 2 shows self-reported weight, height and BMI, measured weight, and differences between self-reported and measured weight. Self-reported weight was under-reported compared to measured weight for all subjects (P<.001). The difference between assessments remained highly significant among women, -1.3 kg (P<.001), but not among men, -0.9 kg (P=.07). The difference between self-reported and measured weight was not statistically significantly different between men and women. For subjects with a BMI <25 kg/m2 and ≥25 kg/m2, there was a statistically significant difference between the groups (P=.02), with heavier subjects under-reporting on average -1.2 kg more compared to leaner subjects. Years of education did not appear to affect self-reports of weight while subjects ≥30 years of age under-reported their weight more than subjects <30 years of age (P=.02).

Results from the multivariate regression controlling for sex, age (<30 or ≥30 years), BMI (<25 or ≥25 kg/m2), education, and smoking were similar to the results described above. There were statistically significantly higher under-reporting among subjects ≥30 years and subjects with a BMI ≥25 kg/m2 compared to subjects <30 years of age and a BMI of <25 kg/m2, respectively. Sex, education, and smoking did not affect the difference between self-reported and measured weight (data not shown).

Table 1. Characteristics of study participants.


AllMenWomenP valuea


n=149n=34n=115


n%n%n%
Age (years)





.45

<307751.71852.95951.3

30-392818.8720.62118.3

40-492214.8720.61513.0

50-591812.125.91613.9

>6042.700.043.5
Education (years)





.46

9-12b3221.5926.52320.0

>1211476.52573.58977.4

Missing data32.000.032.6
Smoker





.12

Current117.4514.765.2

Previous3825.5617.63227.8

Never9664.42367.67363.5

Missing data42.700.043.5
Swedish snuffc user





.000

Current74.7617.610.9

Previous1610.7720.697.8

Never12281.92161.810187.8

Missing data42.700.043.5
BMI (kg/m2)d





.11

<2511476.52779.48775.7

≥253523.5720.62824.3

acomparing men and women using chi-square tests

bprimary school

ca moist form of snuff

dBMI based on self-reported weight and height

Table 2. Weight, height, and BMI based on self-reported and measured data.


Self-report
Mean (SD)
Measured
Mean (SD)
Differencea
Mean (SD)

nWeight (kg)Height (cm)BMI (kg/m2)Weight (kg)Weight (kg)
All







14968.7 (12.6)170.6 (9.5)23.6 (3.8)69.9 (13.2)-1.2 (2.6)
Gender






Men3480.0 (7.8)183.2 (7.3)23.9 (2.2)80.9 (8.5)-0.9 (2.7)

Women11565.3 (11.7)166.9 (6.4)23.5 (4.1)66.6 (12.5)-1.3 (2.6)

P value
<.001<.001.55<.001.44
BMI (kg/m2)






<2511464.9 (9.7)171.4 (9.6)22.0 (1.7)65.8 (10.0)-0.9 (2.1)

≥253580.8 (13.5)168.3 (8.8)28.5 (4.3)83.0 (13.9)-2.1 (3.8)

P value
<.001.09<.001<.001.02
Age (years)






<307766.6 (10.8)171.7 (9.7)22.5 (2.6)67.3 (11.5)-0.7 (2.5)

≥307270.9 (14.0)169.5 (9.2)24.7 (4.4)72.6 (14.3)-1.7 (2.7)

P value
.04.14<.0010.01.03

adifference between measured and self-reported weight

Figure 1. Scatter plot showing the correlation between self-reported (y-axis) and measured (x-axis) weight (kg). The Spearman correlation coefficient was 0.98 (P<.001). Each data point represents one subject. n=149.
View this figure
Figure 2. Bland-Altman plot showing the difference between self-reported and measured weight (y-axis) in relation to measured weights (kg). The mean difference was -1.2 kg. The 95% limits of agreement for the observations are represented as ±2 SD of the mean difference. Each data point represents one subject. n=149.
View this figure

General Discussion

Results from the present study show that Web-based self-reports of weight are highly correlated, although somewhat under-reported, with measured weight in a Swedish population. Although there were small differences between subgroups, overweight and obese subjects with a BMI ≥25 kg/m2 under-reported their weight on average by 1.2 kg more than normal weight subjects. Further, we found that subjects ≥30 years of age, under-reported their weight on average by 1.0 kg more than younger subjects. Under-reporting was associated with both increasing BMI and age independently. No differences were seen between men and women. To the best of our knowledge, the present study is among the first to examine the validity of Web-based self-reports of weight.

Our results are in line with previous studies validating paper-based self-reports of weight against measurements of weight, which have shown high correlations and high validity [5,10-14]. Nevertheless, self-reported weight is under-estimated in most populations [15], and by most individuals [5,12,16], although under-weight subjects have been seen to over-report weight [5]. Consistent with the results of previous studies where increasing body mass was associated with increased under-reporting [15,17,18], our results demonstrated increased under-reporting among subjects with a high BMI.

As was also shown in a previous study [19], we saw an indication of gender difference, with women under-reporting their weight to a greater degree than men. The lack of statistical significance between the sexes’ degree of under-reporting might be due to our small sample size and fewer participating men. The greater under-reporting detected among older subjects may partly be explained by the positive association between BMI and age and the increasing under-reporting seen with increasing BMI. Increased under-reporting in older subjects could also partly be explained by memory bias due to older subjects not keeping track of their current weight to the same extent as younger subjects. Previous studies have also shown age to have an effect on under-reporting [15].

The observed under-reporting of self-reported weight compared to measured weight, may partly be explained by true differences between the time of self-reported and measured weight. However, the time period between the self-report and the measurement was fairly short and participants were healthy and not on any weight changing regimen. Thus, we believe that large changes in weight during this period were unlikely to have occurred, and does not fully explain the difference. Nonetheless, a narrower time period between the self-reported and measured weights would be ideal to avoid fluctuations in weight over time. Some of the under-estimation might be explained by subjects self-reporting their weight without clothes, while the measurements of weight were made while wearing clothing. Although this may explain part of the under-estimation, it is not likely to have caused the entire difference. Another explanation for differences might be the fact that subjects self-reported their weight in different manners as no specific instructions for self-reports were given, causing random errors. Errors in the measurement of weight were unlikely, given that the same scale was used by the study personnel for all measurements.

Furthermore, the lack of measured height was also a limitation to the current study, and may have had an effect on estimates of BMI. However, previous studies have shown good validity of categorization according to BMI assessed from self-reported data compared to measured data [5]. Therefore, we believe that our estimates of BMI assessed from self-reported weight and height were fairly accurate.

The use of Web-based self-reports of data has numerous advantages compared to collecting data using paper-based questionnaires. Data quality can be improved by implementing automatic controls for missing data and answers out of a reasonable range. The quality of anthropometric data, with regard to missing and plausible answers, collected using a Web-based questionnaire, has been shown to be equal to, or better than, that of data collected using a paper version of the questionnaire [20]. While Web-based questionnaires may be superior to their paper-based predecessors when it comes to completeness of data, concerns have been raised regarding response rates and selection bias introduced by using online surveys [21]. We do not believe, however, that these concerns will be a problem when collecting data in future studies. Access to, and use of, the Internet in Sweden has increased considerably during the last decade with more than 90% of the adult population having access today [22].

Like many other convenience samples, the present study population comprised of men and women of different ages, but the majority of subjects were nonetheless young females residing in Stockholm County, limiting the generalizability of the results. A truly population-based study would improve generalizability, but was clearly unfeasible for practical reasons. In addition, the study subjects who applied for participation in the study knew that the aim of the study was to evaluate methods assessing diet and physical activity. They may therefore be more health conscious and motivated to participate than the average population, and thus prone to self-reporting their weight in a more truthful manner. Nonetheless, the study participants were not initially aware of the comparisons between self-reported and measured weight.

The self-selection of participants might have created a sampling bias yielding a stronger correlation and a decreased difference between assessment methods than in the general population. Of note is that the study subjects had a mean BMI comparable to the general Swedish population [23]. Nonetheless, future studies should focus on validity of self-reports by other sub-groups, such as elderly or obese. Despite these potential limitations, our results may be helpful when using self-reported weights in studies in young and normal weight populations.

Conclusions

We have found Web-based self-reports of weight to be as good as paper-based self-reports of weight found in previous studies. Although more validation studies may be needed, our results showed that self-reported weight via the Internet is a suitable method of data collection for use in research and clinical work.

Acknowledgments

This work was supported by grants from the Swedish Cancer Foundation as well as grants to LifeGene from AFA Insurance, Torsten and Ragnar Söderbergs Foundation, Karolinska Institutet, Stockholm's County Council, and the Swedish Research Council. The authors thank Sara Christensen and Elisabeth Möller for their invaluable contributions during the collection of data.

Conflicts of Interest

None declared.

  1. Tripepi G, Jager KJ, Dekker FW, Zoccali C. Selection bias and information bias in clinical research. Nephron Clin Pract 2010;115(2):c94-c99 [FREE Full text] [CrossRef] [Medline]
  2. Joinson AN. Knowing me, knowing you: reciprocal self-disclosure in Internet-based surveys. Cyberpsychol Behav 2001 Oct;4(5):587-591. [Medline]
  3. Couper M. Web surveys: a review of issues and approaches. Public Opin Q 2000;64(4):464-494. [Medline]
  4. Okamoto K, Ohsuka K, Shiraishi T, Hukazawa E, Wakasugi S, Furuta K. Comparability of epidemiological information between self- and interviewer-administered questionnaires. J Clin Epidemiol 2002 May;55(5):505-511. [Medline]
  5. Lin CJ, DeRoo LA, Jacobs SR, Sandler DP. Accuracy and reliability of self-reported weight and height in the Sister Study. Public Health Nutr 2012 Jun;15(6):989-999 [FREE Full text] [CrossRef] [Medline]
  6. Link MW, Mokdad AH. Effects of survey mode on self-reports of adult alcohol consumption: a comparison of mail, web and telephone approaches. J Stud Alcohol 2005 Mar;66(2):239-245. [Medline]
  7. Bonn SE, Trolle Lagerros Y, Christensen SE, Möller E, Wright A, Sjölander A, et al. Active-Q: validation of the web-based physical activity questionnaire using doubly labeled water. J Med Internet Res 2012;14(1):e29 [FREE Full text] [CrossRef] [Medline]
  8. Lee J, Koh D, Ong CN. Statistical evaluation of agreement between two methods for measuring a quantitative variable. Comput Biol Med 1989;19(1):61-70. [Medline]
  9. Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1986 Feb 8;1(8476):307-310. [Medline]
  10. Dahl AK, Hassing LB, Fransson EI, Pedersen NL. Agreement between self-reported and measured height, weight and body mass index in old age--a longitudinal study with 20 years of follow-up. Age Ageing 2010 Jul;39(4):445-451 [FREE Full text] [CrossRef] [Medline]
  11. Lawlor DA, Bedford C, Taylor M, Ebrahim S. Agreement between measured and self-reported weight in older women. Results from the British Women's Heart and Health Study. Age Ageing 2002 May;31(3):169-174 [FREE Full text] [Medline]
  12. Bes-Rastrollo M, Sabaté J, Jaceldo-Siegl K, Fraser GE. Validation of self-reported anthropometrics in the Adventist Health Study 2. BMC Public Health 2011;11:213 [FREE Full text] [CrossRef] [Medline]
  13. Burton NW, Brown W, Dobson A. Accuracy of body mass index estimated from self-reported height and weight in mid-aged Australian women. Aust N Z J Public Health 2010 Dec;34(6):620-623. [CrossRef] [Medline]
  14. Spencer EA, Appleby PN, Davey GK, Key TJ. Validity of self-reported height and weight in 4808 EPIC-Oxford participants. Public Health Nutr 2002 Aug;5(4):561-565. [CrossRef] [Medline]
  15. Connor Gorber S, Tremblay M, Moher D, Gorber B. A comparison of direct vs. self-report measures for assessing height, weight and body mass index: a systematic review. Obes Rev 2007 Jul;8(4):307-326. [CrossRef] [Medline]
  16. Engstrom JL, Paterson SA, Doherty A, Trabulsi M, Speer KL. Accuracy of self-reported height and weight in women: an integrative review of the literature. J Midwifery Womens Health 2003;48(5):338-345. [Medline]
  17. Larsen JK, Ouwens M, Engels RC, Eisinga R, van Strien T. Validity of self-reported weight and height and predictors of weight bias in female college students. Appetite 2008;50(2-3):386-389. [CrossRef] [Medline]
  18. Nyholm M, Gullberg B, Merlo J, Lundqvist-Persson C, Råstam L, Lindblad U. The validity of obesity based on self-reported weight and height: Implications for population studies. Obesity (Silver Spring) 2007 Jan;15(1):197-208. [CrossRef] [Medline]
  19. Merrill RM, Richardson JS. Validity of self-reported height, weight, and body mass index: findings from the National Health and Nutrition Examination Survey, 2001-2006. Prev Chronic Dis 2009 Oct;6(4):A121 [FREE Full text] [Medline]
  20. Touvier M, Méjean C, Kesse-Guyot E, Pollet C, Malon A, Castetbon K, et al. Comparison between web-based and paper versions of a self-administered anthropometric questionnaire. Eur J Epidemiol 2010 May;25(5):287-296. [CrossRef] [Medline]
  21. Kongsved SM, Basnov M, Holm-Christensen K, Hjollund NH. Response rate and completeness of questionnaires: a randomized study of Internet versus paper-and-pencil versions. J Med Internet Res 2007;9(3):e25 [FREE Full text] [CrossRef] [Medline]
  22. Arrhenius C, Ewerdahl D. Statistics Sweden. 2011. Use of computers and the Internet by private persons in 2010   URL: http://www.scb.se/statistik/_publikationer/LE0108_2010A01_BR_IT01BR1101.pdf [accessed 2011-10-18] [WebCite Cache]
  23. Sundquist J, Johansson SE, Sundquist K. Levelling off of prevalence of obesity in the adult population of Sweden between 2000/01 and 2004/05. BMC Public Health 2010;10:119 [FREE Full text] [CrossRef] [Medline]


BMI: body mass index


Edited by G Eysenbach; submitted 12.10.12; peer-reviewed by J Kraschnewski, W You, J Harvey-Berino, M Bes-Rastrollo, S Lim Choi Keung, B Brumen; comments to author 09.11.12; revised version received 20.12.12; accepted 08.02.13; published 09.04.13

Copyright

©Stephanie Erika Bonn, Ylva Trolle Lagerros, Katarina Bälter. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 09.04.2013.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.