Abstract
Online surveys have become a key tool of modern health research, offering a fast, cost-effective, and convenient means of data collection. They enable researchers to access diverse populations, including those underrepresented in traditional studies, and facilitate the collection of data on stigmatized or sensitive behaviors through greater anonymity. However, the ease of participation also introduces significant challenges, particularly around data integrity and rigor. As fraudulent responses, whether from bots, repeat responders, or individuals misrepresenting themselves, become more sophisticated and pervasive, ensuring the rigor of online surveys has never been more crucial. This article provides a comprehensive synthesis of practical strategies that help to increase the rigor of online surveys through the detection and removal of fraudulent data. Drawing on recent literature and case studies, we outline options that address the full research cycle, from predata collection strategies to postdata collection validation. We emphasize the integration of automated screening techniques (eg, CAPTCHAs and honeypot questions) and attention checks (eg, trap questions) into purposeful survey design. Robust recruitment procedures (eg, concealed eligibility criteria and 2-stage screening) and a proper incentive or compensation structure can also help to deter fraudulent participation. We examine the merits and limitations of different sampling methodologies, including river sampling, online panels, and crowdsourcing platforms, offering guidance on how to select samples based on specific research objectives. For postdata collection, we discuss metadata-based techniques to detect fraudulent data (eg, duplicate email or IP addresses and response time analysis), alongside methods to screen for low-quality responses (eg, inconsistent response patterns and improbable qualitative responses). The escalating sophistication of fraud tactics, particularly with the growth of artificial intelligence (AI), demands that researchers continuously adapt and stay vigilant. We propose the use of dynamic protocols that combine multiple strategies into a multipronged approach, which can better filter out fraudulent data and evolve with the types of responses received across the data collection process. However, there is still significant room for these strategies to develop, and this should be a key focus of upcoming research. As online surveys become increasingly integral to health research, investing in robust strategies to screen for fraudulent data and increase the rigor of studies is key to upholding scientific integrity.
J Med Internet Res 2025;27:e68092. doi: 10.2196/68092
Introduction
Surveys are a key tool in research, offering insight into participants’ minds and allowing researchers to better understand their motivations and preferences []. Traditionally, survey data were collected primarily through paper questionnaires, distributed by mail or face-to-face, and the repertoire eventually grew to include telephone surveys [,]. However, as use of the internet has grown and response rates for traditional survey modes have dropped, the primary choice has shifted to online surveys, also known as web-based surveys, in which the survey form or instrument is hosted on the internet, even though distribution methods can range from traditional channels such as mail to newer methods such as social media advertising [,].
The use of online surveys in internet-based health research has become more common, largely because of their ability to efficiently reach a large audience in a cost-effective manner and their low barriers to participation [,]. For example, online surveys eliminate the need for face-to-face interaction and remove transportation and logistical barriers, offering participants the chance to complete surveys at their own convenience and comfort, thereby reducing the burden on them [,]. Furthermore, researchers can more easily reach underrepresented populations and create samples that are not constrained by geographic boundaries [,]. Depending on the design, online surveys also give participants the ability to remain anonymous and the freedom to respond honestly without stigma or judgment []. Online surveys can elicit significantly more reports of socially undesirable behaviors than surveys administered in person, highlighting their benefit when it comes to sensitive topics []. The COVID-19 pandemic has also accelerated the use of digital methods for research purposes, prompting the transition of surveys from in-person to online formats [,].
However, the rise of online surveys comes with an increased risk of fraudulent behavior [], which can be defined as “individuals, groups or computer processes (ie, bots) participating in online, internet or web-based data collection methods at a statistically significant level, such that data are or would be measurably distorted” []. Fraudulent behavior comes in many forms, which further complicates screening for it. For example, incentives are a popular method to encourage participation, but they increase both authentic response rates and repeat responding []. Individuals may misrepresent themselves to match eligibility criteria (alias scammers) or submit multiple surveys in order to receive more incentives (repeat responders) []. Incentives are not the only motivational factor, however, as response distorters may spam responses in an effort to misinform results because of their own agenda or political motivation [,]. There has also been an increase in easily obtainable, sophisticated software applications, otherwise known as bots, which are designed to fill out surveys automatically, allowing individuals to quickly flood surveys with responses []. Finally, there are careless responders: individuals who do not give sufficient attention to questions or fail to read them comprehensively, leading to answers that may not accurately reflect reality [].
Another consideration when crafting an online survey is the sampling method, which can introduce substantial error and bias into data collection, reducing the representativeness and generalizability of results []. When carrying out any research, it is important to clearly define the population of interest and tailor the sampling method to effectively engage that target population []. Sampling methods can largely be split into 2 categories: (1) probability samples, in which elements from a population are randomly selected and each element has a known, nonzero probability of being selected, and (2) nonprobability samples, which are nonrandom and in which the probability of each element being selected may be unknown []. Probability sampling is generally a more accurate method for establishing the characteristics of an entire population. In comparison, nonprobability sampling is typically used when access to the full population is limited or not needed, such as when researchers want to focus only on a specific subgroup. If an inappropriate sampling method is chosen, researchers run the risk of collecting data that do not accurately represent the target population [].
Despite the difficulty of doing so, ensuring that the data collected are accurate is critical, as the inclusion of even a small amount of fraudulent data can create or mask statistically significant differences, decreasing the rigor of the study [,]. The validity and integrity of the results then come into question, and fraudulent data may even artificially create relationships between uncorrelated factors []. For example, one study found that of 1281 respondents, only 197 were real, and the inclusion of the fake respondents significantly altered the findings, both creating and masking relationships between factors []. Without a sufficiently rigorous data screening process, bias and noise can be introduced into the dataset through the inclusion of fraudulent data, leading to incorrect conclusions and potentially biasing future research or policy recommendations []. Furthermore, fraudulent behavior not only compromises the quality of the research but also increases research costs, through the disbursement of incentives to individuals who had already completed the study or were ineligible and through the additional time spent identifying fraudulent responses [,].
Data quality comprises many different attributes, such as reliability, accuracy, and integrity; yet, different disciplines and platforms place different levels of importance on each attribute when conducting research []. One of the key threats to online surveys is the rise in fraudulent responses; yet, there is a lack of articles exploring the impact of fraudulent data on online surveys and how best to guard against them []. The purpose of this article is to examine how researchers can increase the rigor of online surveys by taking an integrated approach to recruitment, survey instrument design, and metadata checks in order to reduce the inclusion of fraudulent data. The measures commonly used by researchers can be largely split across 2 phases of research: predata collection and postdata collection. Within these phases, there are 4 main areas of focus: (1) survey design and (2) recruitment strategies for predata collection, and (3) metadata checks and (4) data quality for postdata collection. A summary of the methods mentioned in this article can be found in .

Predata Collection: Guarding Against Fraudulent Data
Presurvey methods that help to guard against fraudulent data can be implemented through either survey design or recruitment strategies. Survey design strategies primarily aim to make screening for fraudulent data easier, whereas recruitment strategies aim to prevent fraudulent responders from accessing the survey.
Survey Design: Automatic Checks
One basic automatic check to protect against bots is the CAPTCHA, which is based on user actions such as checkbox clicks and image challenges []. Newer versions such as Google’s reCAPTCHA v3 use sophisticated algorithms to evaluate user interactions against various criteria, such as typing speed and IP address, producing a score that can be interpreted against the study’s expectations to identify possible bots []. Honeypot questions are an additional automated defense against bots that can easily be included within surveys alongside CAPTCHAs; these are questions designed specifically for bot detection and hidden from actual participants using custom code []. Thus, receiving answers to a honeypot question would indicate bot activity.
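To make the honeypot concept concrete, the minimal sketch below (in Python, with hypothetical field and record names) flags any submission in which a hidden field, invisible to human participants but visible to scripts, comes back filled:

```python
# Minimal sketch: server-side screening against a hidden "honeypot" field.
# The field name "website" and the example records are illustrative only.
def flag_honeypot(submission: dict, honeypot_field: str = "website") -> bool:
    """Return True if the hidden field was filled in, suggesting bot activity."""
    value = submission.get(honeypot_field, "")
    return bool(str(value).strip())

# A human never sees the hidden field, so it stays empty; a scripted bot fills it
human = {"age": "34", "website": ""}
bot = {"age": "29", "website": "http://spam.example"}
print(flag_honeypot(human), flag_honeypot(bot))  # False True
```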
Survey Design: Survey Questions
Another widely used strategy to identify and exclude inattentive respondents is attention checks. The most commonly suggested method is to ask for a specific response, otherwise known as a trap question or instructional manipulation check []. For example, participants could be asked to pick the italicized option [] or a specific choice from a list (eg, “please select the option ‘strongly agree’”) []. Current literature advises that attention checks be used throughout the survey to better identify responders who are giving low-quality responses []. The wording and quantity of trap questions should strike a balance between filtering out low-quality responses and ensuring that genuine respondents are not accidentally filtered out []. An alternative is the speed bump question, which requires participants to properly read the question and reason their way to an answer, filtering out bots or respondents who answer hastily or automatically []. A speed bump question can look like this: “The man couldn’t lift his son because he was so weak. Who was weak, the man or his son?” However, researchers should consider the type of research they are doing before using attention checks, as such checks have been found to increase deliberation, potentially affecting cognitive task performance []. On the other hand, other research has found that they do not affect participants’ responses on scales or scale validity []. Researchers should thus monitor responses or pilot their survey questions to see whether responses are significantly affected before rolling out surveys officially.
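As a simple, hedged illustration of how such checks could be scored after data collection, the sketch below counts failed trap questions per respondent and flags only those who miss more than one, so that a single lapse by a genuine respondent is tolerated; the column names, expected answers, and tolerance are all assumptions:

```python
# Illustrative sketch: score trap-question failures per respondent.
import pandas as pd

responses = pd.DataFrame({
    "id": [1, 2, 3],
    "trap_q1": ["strongly agree", "agree", "strongly agree"],   # expected: "strongly agree"
    "trap_q2": ["option b", "option c", "option b"],            # expected: "option b"
})
expected = {"trap_q1": "strongly agree", "trap_q2": "option b"}

# One point per missed trap question; flag only respondents missing more than 1
failures = sum((responses[q] != ans).astype(int) for q, ans in expected.items())
responses["attention_flag"] = failures > 1
print(responses[["id", "attention_flag"]])
```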
Another strategy researchers can consider is leveraging domain-specific knowledge to detect respondents who are not from the intended target audience. This is especially effective in fields with many industry-specific terms, and researchers can take advantage of it by including qualitative response formats [,]. For instance, one survey targeting experienced programmers included simple programming tasks and questions as assessments [], while another survey used specific terminology and jargon as soft checks to verify expertise []. Similarly, identifying responses that are inconsistent with or suspicious relative to established norms, such as unlikely health symptoms for a given demographic, can identify unfit respondents [].
Researchers have also found success in using duplicate questions presented in diverse formats, for example, requesting the same data point in multiple formats [] or incorporating items that essentially ask the same question more than once []. One study detected bot-based responses by including duplicate gender identity questions, which triggered subsequent logic checks and revealed bots that followed code rather than survey logic []. Inconsistent responses to duplicate questions serve as indicators of low-quality responses, so researchers should take note of mismatched answers. Researchers should also disguise these checks by distributing the questions across different sections of the questionnaire and phrasing them differently to prevent easy identification.
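A hedged sketch of how such a duplicate-question consistency check might be scripted after data collection is shown below; it assumes the duplicated item is gender identity and that the column names are illustrative:

```python
# Illustrative sketch: compare answers to the same item asked in two sections.
import pandas as pd

df = pd.DataFrame({
    "gender_section1": ["woman", "man", "man"],
    "gender_section5": ["woman", "woman", "man"],
})

def norm(s: pd.Series) -> pd.Series:
    """Normalize wording before comparison so trivial differences are ignored."""
    return s.str.strip().str.lower()

# Mismatched answers are flagged for manual review rather than auto-exclusion
df["gender_mismatch"] = norm(df["gender_section1"]) != norm(df["gender_section5"])
print(df)
```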
Recruitment Strategies: Eligibility Screening
To ensure the integrity of survey responses, researchers can implement rigorous eligibility screening processes and streamline recruitment procedures. For example, researchers can consider 2-stage recruitment, in which participants first complete a public eligibility screener and, if they meet the eligibility criteria, receive a second personalized link to the main survey []. The eligibility screener can further implement features such as CAPTCHAs or browser cookies to prevent duplicate responses []. Eligibility screeners can also request contact information so that researchers can contact participants personally to confirm their identity or ask for documents that confirm information critical to eligibility []. To prevent individuals from tailoring their responses to meet eligibility criteria, researchers can recruit participants without explicitly disclosing the eligibility criteria upfront []. Furthermore, duplicating questions in the eligibility screener and main survey allows researchers to look for inconsistencies in responses, which is another way to expose suspicious respondents []. Researchers can also require participants to go through steps before accessing the survey, such as using specialized links or passwords, but must ensure that this process is as straightforward and user-friendly as possible; otherwise, it risks increasing dropouts [].
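As an illustration of the second stage, the sketch below generates an unguessable, single-use token for each screened-eligible participant and accepts a main-survey entry only once per token; the base URL, parameter name, and in-memory token store are assumptions rather than a prescribed implementation:

```python
# Hypothetical sketch of personalized, single-use survey links for 2-stage recruitment.
import secrets

issued_tokens: set[str] = set()

def personalized_link(base_url: str = "https://survey.example.org/main") -> str:
    """Issue an unguessable token and embed it in the participant's personal link."""
    token = secrets.token_urlsafe(16)
    issued_tokens.add(token)
    return f"{base_url}?token={token}"

def is_valid_entry(submitted_token: str) -> bool:
    """Accept a main-survey entry only once per issued token."""
    if submitted_token in issued_tokens:
        issued_tokens.discard(submitted_token)  # consume the token so duplicates are rejected
        return True
    return False

link = personalized_link()
token = link.split("token=")[1]
print(is_valid_entry(token), is_valid_entry(token))  # True False
```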
Recruitment Strategies: Compensation
While incentives can promote higher numbers of survey responses, they also carry an increased risk of attracting fraudulent responses []. To mitigate such risks, researchers can structure participant reimbursements in a way that reduces the financial rewards of fraudulent responses without undermining overall study participation. For example, framing the incentive as a raffle instead of a guaranteed reimbursement reduced the number of bot responses from 633 to 23 in one study []. This may be explained by bots being programmed to seek out surveys with guaranteed financial incentives and thus avoiding surveys that offer incentives by random chance []. Furthermore, researchers should match incentives appropriately to study requirements and local payment norms, as incentives perceived as higher than usual attracted more fraudulent responses []. Lastly, hosting the reimbursement form separately and directing participants to it only after they finish the main survey can help ensure that only genuine participants receive the incentive [].
Recruitment Strategies: Sampling Methods
The way in which researchers derive their sample is key, as it is the basis for the entire study. Probability sampling is not typically used with online surveys, simply because it is difficult to establish a probability sample of the whole population on the internet; the population of interest may not necessarily be online, and even those who are online may not use the internet frequently enough to be captured [,]. However, there are workarounds if a probability sample is required, such as randomly sampling and contacting participants via another method (eg, telephone or mail) while administering the actual survey online, or defining a target population (eg, health care workers in a specific geographic area) and inviting all of them to complete the online survey []. While probability sampling often yields a more representative sample of the whole population, it may not always be the most suitable method, as not all research requires the entire population, and it is difficult to obtain representative numbers of smaller subpopulations []. Instead, researchers can consider nonprobability sampling, which is ideal for sampling hard-to-reach or hidden populations whose members are unknown and make up a small proportion of the whole population, meaning that it is difficult to obtain a sufficiently large sample through probability sampling []. Regardless of whether probability or nonprobability sampling is used, the majority of those selected should complete the survey; if not, the sample may be skewed in an unpredictable direction because of potential selection bias and the introduction of large amounts of sampling error [].
One of the most common sampling methods used in online surveys is river sampling, a nonprobability method based on convenience sampling. In river sampling, researchers place a survey link somewhere it is likely to be seen by members of the target population, such as a web page or an email []. While cost-effective and convenient, it suffers from coverage bias: not every subgroup is represented equally on digital platforms, so results are most representative of active users of the chosen platform rather than of the target population. It is a great method for quickly recruiting large numbers of respondents but is particularly vulnerable to attacks by fraudulent respondents because of the easy access to the survey []. Despite these limitations, river sampling can be very beneficial if the study focuses on a narrower population, such as undergraduate students at a university [].
Another common sampling method used in online surveys is commercial panels, which can be either probability based (eg, KnowledgePanel or AmeriSpeak) or nonprobability based (eg, Qualtrics Panels and Dynata) []. Commercial online panel providers include participants who are prerecruited and have agreed to take part in research, and providers differ in how their participants are recruited. Probability-based panels tend to rely on “offline” recruitment, such as random digit dialing or address-based sampling, to invite participants to join, while nonprobability-based panels run on a volunteer or opt-in basis []. In general, commercial online panels offer respondent pools that better mirror the demographic composition of the general population compared with river sampling, but at a greater monetary cost []. Commercial panel providers ensure quality by verifying their respondents beforehand, placing the onus of respondent verification on the provider rather than the researcher []. Furthermore, researchers can impose quotas on specific demographic criteria, forcing the sample to better approximate the target population. Researchers must keep in mind that even with quotas, online panel providers may still be unable to fully reach hidden populations, as subpopulations within the participant pool may differ in how frequently they access the panel and complete surveys, yielding results that reflect those who are most online rather than being fully representative of the target population [,]. The quality of commercial online panels also depends on how they are managed, and researchers should check how panel participants are recruited and managed before committing to a specific panel in order to maintain a high level of accuracy in the data collected [].
Lastly, crowdsourcing platforms such as Amazon’s Mechanical Turk and Prolific serve as nonprobability-based online marketplaces where potential participants are “hired” to participate in research; they differ from panel providers in that their participant pools are not necessarily curated [,]. This means that researchers need to expend extra effort in screening participants for specific target populations, and larger sample sizes are required to sufficiently power analyses []. A benefit, however, is that large samples can be generated quickly compared with other methods. While not always ideal for research requiring a sample representative of the general population, crowdsourcing platforms have unique strengths for studying nondemographic or hard-to-reach subpopulations []. Such platforms are suitable for experimental work, allowing researchers to establish relationships and effect sizes between different conceptual variables even if samples are not representative []. Although the conclusions that can be drawn are limited, the information is still useful for beginning to test hypotheses in diverse samples, making inferences about subpopulation characteristics, and showing that specific phenomena exist [,].
A summary of the sampling platforms and methods for online surveys with their respective potential advantages and disadvantages can be found in .
| Sampling method and survey platforms | Potential advantages | Potential disadvantages | Implications for research |
| --- | --- | --- | --- |
| Probability sampling | | | |
| Random sampling: recruitment through an online sampling frame and randomization of participants | More representative of the whole population; supports population-level estimates | Difficult to establish an online sampling frame; the population of interest may not be online, or online often enough, to be captured; hard to obtain representative numbers of small subpopulations | Best suited to research that requires estimates of the characteristics of an entire population |
| Nonprobability sampling | | | |
| River sampling: recruitment through sites where target populations visit | Cost-effective and convenient; quickly recruits large numbers of respondents | Coverage bias toward active users of the chosen platform; easy survey access makes it particularly vulnerable to fraudulent respondents | Beneficial when the study focuses on a narrower population (eg, undergraduate students at a university) |
| Crowdsourcing platforms: recruitment through online marketplaces where potential participants are “hired” to participate in research | Large samples generated quickly; access to nondemographic and hard-to-reach subpopulations | Participant pools are not necessarily curated; extra screening effort and larger sample sizes are required | Suitable for experimental work, early hypothesis testing, and showing that specific phenomena exist |
| Either probability or nonprobability sampling depending on recruitment | | | |
| Online panels: recruitment through prerecruited individuals who have agreed to take part in research | Respondent pools better mirror the demographic composition of the general population; providers verify respondents; quotas can be imposed | Greater monetary cost; may still fail to reach hidden populations; quality depends on how the panel is managed | Appropriate when demographic representativeness is a priority; researchers should check how panel participants are recruited and managed |
Postdata Collection: Strategies to Identify Fraudulent Data
Measures to reduce the inclusion of fraudulent data in online surveys postdata collection involve analyzing metadata to flag suspicious responses and removing low-quality data to ensure the final dataset is more reliable and accurate.
Metadata Checks: Duplicate Emails
One of the primary metadata checks is to flag responses that share identical email addresses with previously enrolled participants, or similar email addresses with slight variations in the order of letters or numbers [,]. In one example, any email address containing a number longer than 4 digits was removed, as this can be a sign of a bot-generated email []. To further ensure the quality of such checks, a third researcher can review the email addresses for discrepancies after the first 2 rounds of removal (protocol based and duplicate based) have been conducted []. Reimbursements should be disbursed only after all checks have been completed, and in case a genuine participant is excluded erroneously, the research team’s contact details should be available for the participant to reach out.
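A minimal sketch of these email checks, assuming illustrative addresses and thresholds (the more-than-4-digit rule follows the example cited above, and the similarity cutoff is an assumption), could look like this:

```python
# Illustrative sketch: flag exact duplicates, long digit runs, and near-duplicate emails.
import pandas as pd
from difflib import SequenceMatcher

emails = pd.Series([
    "jane.doe@mail.com",
    "jane.doe@mail.com",    # exact duplicate
    "user48213@mail.com",   # contains a number longer than 4 digits
    "j.doe@mail.com",
])

exact_dupe = emails.str.lower().duplicated(keep=False)   # flag every occurrence for review
long_digits = emails.str.contains(r"\d{5,}")             # 5 or more consecutive digits

def near_duplicate(a: str, b: str, threshold: float = 0.9) -> bool:
    """Rough similarity check for slight reorderings or added characters."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio() >= threshold

print(pd.DataFrame({"email": emails, "exact_dupe": exact_dupe, "long_digits": long_digits}))
print(near_duplicate("jane.doe@mail.com", "jane.doe1@mail.com"))  # True
```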
Metadata Checks: IP Addresses and VPN
While IP addresses can be shared among legitimate respondents in communal spaces or households, patterns of identical IP addresses across multiple submissions can indicate fraud. Researchers can consider implementing a feature that flags multiple submissions from the same IP address []. However, this alone should not be used as an automatic rejection criterion; rather, it should aid the review of responses for potential data quality issues. Although an IP address can act as a proxy for the legitimacy of a response, such as its geographic location, it is not completely accurate, as participants can spoof their IP address or share a computer [].
Furthermore, participants may use a virtual private network (VPN) or a virtual private server to change their IP address [,]. The use of VPNs and proxies to mask respondents’ true locations poses a significant challenge to ensuring data validity. This challenge can be addressed by designing tests that detect discrepancies between the time zone reported by the participant’s browser and the time zone inferred from the IP address []. If the time zones differ significantly, the response can be flagged as potentially suspicious. However, researchers must consider that the average internet user is now more likely to use a VPN, meaning that IP addresses may not necessarily signal a low-quality response []. Despite these complications, IP address duplication checks are among the most common data quality checks [].
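The sketch below illustrates how duplicate IP addresses and mismatches between the browser-reported and IP-inferred time zones might be flagged as aids to manual review rather than grounds for automatic rejection; the IP-to-time-zone lookup is represented by a placeholder dictionary, whereas in practice it would come from a geolocation service:

```python
# Hedged sketch: IP duplication and time zone mismatch flags for manual review.
import pandas as pd

df = pd.DataFrame({
    "ip": ["203.0.113.5", "203.0.113.5", "198.51.100.7"],
    "browser_tz": ["America/Chicago", "Asia/Manila", "America/New_York"],
})

# Placeholder for a geolocation lookup of each IP address's inferred time zone
ip_inferred_tz = {"203.0.113.5": "America/Chicago", "198.51.100.7": "America/New_York"}

df["shared_ip"] = df["ip"].duplicated(keep=False)
df["tz_mismatch"] = df["ip"].map(ip_inferred_tz) != df["browser_tz"]
df["needs_review"] = df["shared_ip"] | df["tz_mismatch"]
print(df)
```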
Metadata Checks: Survey Timing and Duration
Another aspect of metadata checks is the analysis of survey timing and duration. Rapid survey submissions, where multiple surveys are completed within an unrealistically short timeframe, can often indicate bot activity [,,]. Suspicious submissions can be identified by comparing the start and stop times of surveys and flagging those completed within a minute of each other []. Survey duration can also be a key indicator of fraudulent responses, as overly rapid completion times are unrealistic for genuine participants []. Studies have shown that valid respondents typically spend more time on surveys, with response time distributions skewed to the right on a histogram []. Pretests or soft launches can help establish realistic time boundaries for survey completion, further enhancing the detection of improbable durations [,].
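A hedged sketch of these timing checks is shown below; the minimum duration and the 60-second clustering window are assumptions that would be calibrated from a pretest or soft launch:

```python
# Illustrative sketch: flag implausibly fast completions and clustered start times.
import pandas as pd

df = pd.DataFrame({
    "start": pd.to_datetime(["2025-01-10 09:00:05", "2025-01-10 09:00:40", "2025-01-10 10:15:00"]),
    "end":   pd.to_datetime(["2025-01-10 09:02:05", "2025-01-10 09:03:10", "2025-01-10 10:27:00"]),
})
df["duration_s"] = (df["end"] - df["start"]).dt.total_seconds()

MIN_DURATION_S = 180  # floor derived from a pretest or soft launch (assumption)
df["too_fast"] = df["duration_s"] < MIN_DURATION_S

# Submissions starting within 60 seconds of the previous one may be batched bot entries
df = df.sort_values("start")
df["clustered_start"] = df["start"].diff().dt.total_seconds() < 60
print(df)
```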
Data Quality: Duplicate and Improbable Free-Text Responses
By incorporating multiple qualitative questions and making at least one a requirement for survey submission, researchers can more easily identify automated responses []. One possible protocol treats a free-text response as a suspicious duplicate if it is repeated 100 times or more for single words, 10 times or more for 2-word entries, and 3 times or more for entries of 3 or more words []. In one study, researchers found that qualitative survey questions, despite being optional, were instrumental in identifying 88 (13.3%) bot responses through exact duplicate answers unlikely to occur by chance []. The effectiveness of qualitative questions in detecting bots is supported by research finding that bots struggled with open-ended questions requiring a minimum response length, suggesting that incorporating multiple such questions can enhance bot detection [].
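The duplication thresholds described above could be operationalized as in the sketch below, where the allowed number of identical free-text answers shrinks as answers grow longer; the example answers are invented:

```python
# Sketch of the length-dependent duplication thresholds (100+ single words,
# 10+ two-word entries, 3+ entries of three or more words).
from collections import Counter

def flag_duplicate_text(answers: list[str]) -> set[str]:
    counts = Counter(a.strip().lower() for a in answers if a.strip())
    flagged = set()
    for text, n in counts.items():
        words = len(text.split())
        limit = 100 if words == 1 else 10 if words == 2 else 3
        if n >= limit:
            flagged.add(text)
    return flagged

answers = ["good"] * 120 + ["very good"] * 12 + ["I enjoyed the clinic visit"] * 3
print(flag_duplicate_text(answers))
```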
Data Quality: Suspicious Response Patterns and Data Inconsistencies
Detecting straight-lining and other patterns in a participant’s responses across a series of questions is crucial for identifying low-effort or automated responses. It is therefore important to check for uniform answers across grid questions or obvious patterns, such as sequential responses [,]. These patterns can indicate a lack of genuine engagement with the survey content. Similarly, verifying selection patterns in multiple-response questions and detecting low differentiation in answers also identify low-quality responses []. Furthermore, detecting inconsistencies in responses helps identify respondents who may not be providing truthful or accurate information []. It is important to exclude participants with inconsistent responses, such as mismatched ages and birth dates, to ensure that survey data accurately represent the target population []. For example, one study excluded participants whose answers to questions about sex, gender, and sexuality were contradictory (eg, “I have had insertive vaginal sex with multiple female partners,” yet “none of my partners have vaginas”) []. Implementing comprehensive checks for straight-lining, response patterns, and internal consistency helps filter out low-quality data, ensuring that the final dataset is reliable and valid.
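As an illustrative sketch, the code below screens a grid of Likert items for straight-lining, sequential patterns, and low differentiation; the item names, example data, and the low-differentiation cutoff are assumptions:

```python
# Hedged sketch: pattern checks across a grid of Likert items.
import pandas as pd

items = ["q1", "q2", "q3", "q4", "q5"]
df = pd.DataFrame({
    "q1": [3, 1, 2], "q2": [3, 2, 5], "q3": [3, 3, 1], "q4": [3, 4, 4], "q5": [3, 5, 2],
})

df["straight_line"] = df[items].nunique(axis=1) == 1        # identical answer to every item
diffs = df[items].diff(axis=1).iloc[:, 1:]
df["sequential"] = diffs.eq(1).all(axis=1)                  # 1, 2, 3, 4, 5-style pattern
df["low_differentiation"] = df[items].std(axis=1) < 0.5     # near-uniform responding (assumed cutoff)
print(df)
```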
Fraudulent data can arise in a variety of ways, each posing distinct challenges for prevention and detection []. It is important to consider which methods are best suited to each study, and using multiple methods in a multipronged approach can help account for the shortcomings of any single method [,]. Researchers can also take a dynamic approach, evolving their screening protocols based on the types of responses received across the study in order to keep up with ever-changing fraud attempts []. In addition, when deciding the threshold required to identify a response as fraudulent or low quality, researchers can assign points to indicators of suspicious activity, which reduces the likelihood of incorrectly excluding valid data [,,]. A point system allows specific responses to be flagged for further investigation and establishes a cutoff score for invalid responses []. This also follows the best practice of removing responses based on multiple issues rather than a single failure.
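A point-based protocol of this kind might be sketched as below, combining indicators like those illustrated earlier; the weights and cutoff are assumptions to be calibrated for each study, and scores below the cutoff but above zero are routed to manual review rather than excluded:

```python
# Illustrative sketch: weighted point system so no single failure excludes a record.
import pandas as pd

WEIGHTS = {"shared_ip": 1, "tz_mismatch": 1, "too_fast": 2,
           "duplicate_text": 2, "straight_line": 1, "attention_flag": 2}
CUTOFF = 4  # assumed exclusion threshold

flags = pd.DataFrame({
    "shared_ip": [True, False], "tz_mismatch": [True, False], "too_fast": [True, False],
    "duplicate_text": [False, True], "straight_line": [False, False], "attention_flag": [False, False],
})
flags["score"] = sum(flags[col] * w for col, w in WEIGHTS.items())
flags["exclude"] = flags["score"] >= CUTOFF
flags["manual_review"] = (flags["score"] > 0) & ~flags["exclude"]
print(flags[["score", "exclude", "manual_review"]])
```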
Case Study
An example of a study that maintained a high level of rigor throughout its run is the Restore study by Dewitt et al []. The study aimed to recruit gay, bisexual, and other men who have sex with men who had been treated for prostate cancer. Recruitment was challenging because the target group was a minority within a minority and largely comprised an older demographic, which meant that the recruitment process had to be as streamlined as possible given the uncertain level of internet familiarity among the target population. Ultimately, the researchers focused recruitment on the email list of a community partner that provided support for cancer survivors, as well as Facebook groups of prostate cancer community organizations, a form of targeted river sampling. They hosted their survey on Qualtrics, a web-based survey platform, and used built-in protection features such as “Prevent Ballot Box Stuffing,” which prevents multiple submissions based on browser cookies, and “Prevent Indexing,” which blocks the survey from being indexed and found on search engines. In addition, they used multiple data quality control measures, such as eligibility screeners and manual data validation, with both automated and hybrid protocols that flagged suspicious survey entries to the researchers. For example, survey metadata were checked (eg, short response times and IP addresses that did not match geographic locations), and open-ended responses were evaluated for plausibility. The team ran into a spam attack through the Facebook link and noticed it quickly because of a sudden influx of low-quality survey attempts. A further problem was that the answer patterns began to evolve: initial spam attempts were clearly fraudulent, but later entries provided more plausible responses and were only identifiable by manually checking the process analytics and interpreting each survey response’s data. After review, the researchers created a new copy of the survey, imposed a stricter recruitment protocol with more validation steps, and focused recruitment through community partners’ email lists. They did not reimburse surveys identified as invalid but provided a way for participants to contact them to verify their answers and receive their compensation if they had been mistakenly identified as invalid; not a single invalid respondent followed up on this. The Restore study is a strong example of a dynamic protocol that responded to shifting patterns of fraudulent responses and successfully maximized the rigor of the study.
Challenges and Opportunities of Artificial Intelligence
One issue of concern is that bots are becoming increasingly sophisticated and are better able to replicate human behavior, allowing them to potentially overwhelm the measures put in place and more easily gain access to online surveys [,]. CAPTCHAs and open-ended questions can now be solved by high-end bots, and the rise of artificial intelligence (AI) software such as ChatGPT means that open-ended questions can also be answered automatically [,]. In a study examining whether AI-generated texts could be differentiated from those of actual people, AI-generated texts were accurately recognized only 40.45% of the time, meaning that even manual evaluation of answers may not be sufficient to weed out invalid responses []. The level of sophistication that fraudulent responders can achieve using AI and bots continues to grow, and they pose a substantial threat to data quality in online surveys, as they can easily submit large numbers of responses while masquerading as multiple valid respondents []. As such, ensuring that surveys do not fall into the wrong hands is even more critical. Researchers should embrace the fact that different sampling methods reach different demographics and even consider multimodal and blended samples, in which multiple mediums of recruitment and communication complement each other to obtain higher-quality samples []. Researchers themselves can leverage technological growth, such as by using automated fraud detection systems on web-based survey platforms. For example, Qualtrics includes RelevantID, which uses machine learning to evaluate behavioral data, browser interactions, and other metadata to estimate the likelihood of fraud []. However, these automated systems should be integrated with manual review or other fraud detection methods to maximize effectiveness.
Ethical Considerations
Researchers also need to consider ethical issues when deciding what data should be collected for screening measures. For example, one of the most common methods of assessing data quality is to collect IP addresses or personal details such as email addresses and phone numbers; yet, these undermine one of the key advantages of online surveys, which is anonymity []. If a study is particularly sensitive or collects details of criminalized behavior, collecting IP addresses and personal identifiers may introduce additional complications and concerns about data breaches or loss of confidentiality and anonymity. One possible way to overcome this is to collect personal details in a separate survey, so that key identifiers cannot be tied to a specific response. A further limitation is that the methods reported in the literature are intrinsically limited because studies are unable to definitively categorize which responses are valid or invalid, so the precision and effectiveness of these techniques cannot be truly evaluated. Furthermore, the methods are often not verified in the manner of randomized controlled trials and are mostly developed empirically when researchers encounter fraudulent data in their studies. It would be valuable for research to compare different measures against a gold-standard assessment of real data in order to accurately determine their effectiveness. There is still significant room for strategies to develop, and this should be a key focus of upcoming research as online surveys become a premier choice for survey-based research.
Conclusion
Methodological rigor is key for any study regardless of its modality, but online surveys require researchers to weigh broader methodological considerations than offline research []. Although this article presents a myriad of ways in which researchers can increase the rigor of their online surveys, there is no singular “best” method, as every research method has its own limitations []. Online research appears superficially similar to offline research, but its characteristics affect the entire research process, from initial design to reporting the findings, in ways that influence how these strategies are implemented [,]. Ultimately, researchers must consider the wide range of factors unique to the requirements of their own studies and make use of a combination of resources and methods to increase the quality of the research published.
Conflicts of Interest
None declared.
References
- Stantcheva S. How to run surveys: A guide to creating your own identifying variation and revealing the invisible. Annu Rev Econom. Sep 13, 2023;15(1):205-234. [CrossRef]
- Alam I, Khusro S, Rauf A, Zaman Q. Conducting surveys and data collection: From traditional to mobile and SMS-based surveys. Pak J Stat Oper Res. 2014;10(2):169. [CrossRef]
- Ebert JF, Huibers L, Christensen B, Christensen MB. Paper- or web-based questionnaire invitations as a method for data collection: Cross-sectional comparative study of differences in response rate, completeness of data, and financial cost. J Med Internet Res. Jan 23, 2018;20(1):e24. [CrossRef] [Medline]
- Andrade C. The limitations of online surveys. Indian J Psychol Med. Nov 2020;42(6):575-576. [CrossRef] [Medline]
- Evans JR, Mathur A. The value of online surveys: a look back and a look ahead. INTR. Aug 6, 2018;28(4):854-887. [CrossRef]
- Bybee S, Cloyes K, Ellington L, Baucom B, Supiano K, Mooney K. Bots and nots: Safeguarding online survey research with underrepresented and diverse populations. Psychol Sex. 2022;13(4):901-911. [CrossRef] [Medline]
- Bauermeister J, Pingel E, Zimmerman M, Couper M, Carballo-Diéguez A, Strecher VJ. Data quality in web-based HIV/AIDS research: Handling invalid and suspicious data. Field methods. Aug 1, 2012;24(3):272-291. [CrossRef] [Medline]
- Bonett S, Lin W, Sexton Topper P, et al. Assessing and improving data integrity in web-based surveys: Comparison of fraud detection systems in a COVID-19 study. JMIR Form Res. Jan 12, 2024;8(1):e47091. [CrossRef] [Medline]
- Watson NL, Mull KE, Heffner JL, McClure JB, Bricker JB. Participant recruitment and retention in remote eHealth Intervention trials: Methods and lessons learned from a large randomized controlled trial of two web-based smoking interventions. J Med Internet Res. Aug 24, 2018;20(8):e10351. [CrossRef] [Medline]
- McInroy LB, Beer OWJ. Adapting vignettes for internet-based research: eliciting realistic responses to the digital milieu. Int J Soc Res Methodol. May 4, 2022;25(3):335-347. [CrossRef]
- Teitcher JEF, Bockting WO, Bauermeister JA, Hoefer CJ, Miner MH, Klitzman RL. Detecting, preventing, and responding to “fraudsters” in internet research: ethics and tradeoffs. J Law Med Ethics. 2015;43(1):116-133. [CrossRef] [Medline]
- Gnambs T, Kaspar K. Disclosure of sensitive behaviors across self-administered survey modes: a meta-analysis. Behav Res Methods. Dec 2015;47(4):1237-1259. [CrossRef] [Medline]
- Bratan T, Aichinger H, Brkic N, et al. Impact of the COVID-19 pandemic on ongoing health research: an ad hoc survey among investigators in Germany. BMJ Open. Dec 6, 2021;11(12):e049086. [CrossRef] [Medline]
- Adi Syani P, Rahiem MDH, Subchi I, Suryani R, Kurniawan F. COVID-19: accelerating digital transformation for university’s research administration. Presented at: 2020 8th International Conference on Cyber and IT Service Management (CITSM):1-6. [CrossRef]
- Johnson MS, Adams VM, Byrne J. Addressing fraudulent responses in online surveys: Insights from a web‐based participatory mapping study. People and Nature. Feb 2024;6(1):147-164. [CrossRef]
- Bowen AM, Daniel CM, Williams ML, Baird GL. Identifying multiple submissions in Internet research: preserving data integrity. AIDS Behav. Nov 2008;12(6):964-973. [CrossRef] [Medline]
- Chandler JJ, Paolacci G. Lie for a dime: when most prescreening responses are honest but most study participants are impostors. Soc Psychol Personal Sci. 2017;8(5):500-508. [CrossRef]
- Cimpian JR, Timmer JD, Birkett MA, Marro RL, Turner BC, Phillips GL II. Bias from potentially mischievous responders on large-scale estimates of lesbian, gay, bisexual, or questioning (LGBQ)–heterosexual youth health disparities. Am J Public Health. Nov 2018;108(S4):S258-S265. [CrossRef]
- Grey JA, Konstan J, Iantaffi A, Wilkerson JM, Galos D, Rosser BRS. An updated protocol to detect invalid entries in an online survey of men who have sex with men (MSM): how do valid and invalid submissions compare? AIDS Behav. Oct 2015;19(10):1928-1937. [CrossRef] [Medline]
- Storozuk A, Ashley M, Delage V, Maloney EA. Got bots? Practical recommendations to protect online survey data from bot attacks. TQMP. 2020;16(5):472-481. [CrossRef]
- Ward MK, Meade AW. Dealing with careless responding in survey data: Prevention, identification, and recommended best practices. Annu Rev Psychol. Jan 18, 2023;74:577-596. [CrossRef] [Medline]
- Berndt AE. Sampling methods. J Hum Lact. May 2020;36(2):224-226. [CrossRef] [Medline]
- Tyrer S, Heyman B. Sampling in epidemiological research: issues, hazards and pitfalls. BJPsych Bull. Apr 2016;40(2):57-60. [CrossRef] [Medline]
- Lehdonvirta V, Oksanen A, Räsänen P, Blank G. Social media, web, and panel surveys: Using non‐probability samples in social and policy research. Policy & Internet. Mar 2021;13(1):134-155. [CrossRef]
- Arias VB, Garrido LE, Jenaro C, Martínez-Molina A, Arias B. A little garbage in, lots of garbage out: Assessing the impact of careless responding in personality survey data. Behav Res Methods. Dec 2020;52(6):2489-2505. [CrossRef] [Medline]
- Huang JL, Liu M, Bowling NA. Insufficient effort responding: examining an insidious confound in survey data. J Appl Psychol. May 2015;100(3):828-845. [CrossRef] [Medline]
- Agans JP, Schade SA, Hanna SR, Chiang SC, Shirzad K, Bai S. The inaccuracy of data from online surveys: A cautionary analysis. Qual Quant. Jun 2024;58(3):2065-2086. [CrossRef]
- Peer E, Rothschild D, Gordon A, Evernden Z, Damer E. Data quality of platforms and panels for online behavioral research. Behav Res Methods. Aug 2022;54(4):1643-1662. [CrossRef] [Medline]
- Griffin M, Martino RJ, LoSchiavo C, et al. Ensuring survey research data integrity in the era of internet bots. Qual Quant. 2022;56(4):2841-2852. [CrossRef] [Medline]
- Brazhkin V. “I have just returned from the moon:” online survey fraud. SCM. Mar 27, 2020;25(4):489-503. [CrossRef]
- Zhang Z, Zhu S, Mink J, Xiong A, Song L, Wang G. Beyond bot detection: combating fraudulent online survey takers. Association for Computing Machinery; 2022. Presented at: WWW ’22:699-709; Virtual Event, Lyon France. URL: https://dl.acm.org/doi/proceedings/10.1145/3485447 [Accessed 2025-08-08] [CrossRef]
- Meade AW, Craig SB. Identifying careless responses in survey data. Psychol Methods. Sep 2012;17(3):437-455. [CrossRef] [Medline]
- Wang J, Calderon G, Hager ER, et al. Identifying and preventing fraudulent responses in online public health surveys: Lessons learned during the COVID-19 pandemic. PLOS Glob Public Health. 2023;3(8):e0001452. [CrossRef] [Medline]
- Hauser DJ, Schwarz N. It’s a trap! Instructional manipulation checks prompt systematic thinking on “tricky” tasks. Sage Open. Apr 1, 2015;5(2):2158244015584617. [CrossRef]
- Kung FYH, Kwok N, Brown DJ. Are attention check questions a threat to scale validity? Applied Psychology. Apr 2018;67(2):264-283. [CrossRef]
- Pratt-Chapman M, Moses J, Arem H. Strategies for the identification and prevention of survey fraud: Data analysis of a web-based survey. JMIR Cancer. Jul 16, 2021;7(3):e30730. [CrossRef] [Medline]
- Roehl J, Harland D. Imposter participants: Overcoming methodological challenges related to balancing participant privacy with data quality when using online recruitment and data collection. TQR. 2022;10. [CrossRef]
- Dewitt J, Capistrant B, Kohli N, et al. Addressing participant validity in a small internet health survey (the restore study): Protocol and recommendations for survey response validation. JMIR Res Protoc. Apr 24, 2018;7(4):e96. [CrossRef] [Medline]
- Nur AA, Leibbrand C, Curran SR, Votruba-Drzal E, Gibson-Davis C. Managing and minimizing online survey questionnaire fraud: Lessons from the Triple C Project. Int J Soc Res Methodol. 2024;27(5):613-619. [CrossRef] [Medline]
- Ball HL. Conducting online surveys. J Hum Lact. Aug 2019;35(3):413-417. [CrossRef] [Medline]
- Van Selm M, Jankowski NW. Conducting online surveys. Qual Quant. Jun 2006;40(3):435-456. [CrossRef]
- Kocar S, Kaczmirek L. A meta-analysis of worldwide recruitment rates in 23 probability-based online panels, between 2007 and 2019. Int J Soc Res Methodol. Sep 2, 2024;27(5):589-604. [CrossRef]
- Im EO, Chee W. Quota sampling in internet research: practical issues. Comput Inform Nurs. Jul 2011;29(7):381-385. [CrossRef] [Medline]
- Behrend TS, Sharek DJ, Meade AW, Wiebe EN. The viability of crowdsourcing for survey research. Behav Res Methods. Sep 2011;43(3):800-813. [CrossRef] [Medline]
- Bentley JW. Improving the statistical power and reliability of research using Amazon Mechanical Turk. Accounting Horizons. Dec 1, 2021;35(4):45-62. [CrossRef]
- Smith NA, Sabat IE, Martinez LR, Weaver K, Xu S. A convenient solution: Using MTurk To sample from hard-to-reach populations. Ind Organ Psychol. Jun 2015;8(2):220-228. [CrossRef]
- Jerit J, Barabas J. Are nonprobability surveys fit for purpose? Public Opin Q. Nov 21, 2023;87(3):816-840. [CrossRef]
- Aguinis H, Villamor I, Ramani RS. MTurk research: Review and recommendations. J Manage. Apr 2021;47(4):823-837. [CrossRef]
- DePalma MT, Rizzotti MC, Branneman M. Assessing diabetes-relevant data provided by undergraduate and crowdsourced web-based survey participants for honesty and accuracy. JMIR Diabetes. Jul 12, 2017;2(2):e11. [CrossRef] [Medline]
- Xu Y, Pace S, Kim J, et al. Threats to online surveys: Recognizing, detecting, and preventing survey bots. Soc Work Res. Nov 14, 2022;46(4):343-350. [CrossRef]
- Lebrun B, Temtsin S, Vonasch A, Bartneck C. Detecting the corruption of online questionnaires by artificial intelligence. Front Robot AI. 2023;10:1277635. [CrossRef] [Medline]
- Hämäläinen P, Tavast M, Kunnari A. Evaluating large language models in generating synthetic HCI research data: a case study. Association for Computing Machinery; 2023. Presented at: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems:1-19; Hamburg, Germany. [CrossRef]
- Lawlor J, Thomas C, Guhin AT, et al. Suspicious and fraudulent online survey participation: Introducing the REAL framework. Methodological Innovations. Sep 2021;14(3). [CrossRef]
- Eckerd S, DuHadway S, Bendoly E, Carter CR, Kaufmann L. On making experimental design choices: Discussions on the use and challenges of demand effects, incentives, deception, samples, and vignettes. J of Ops Management. Mar 2021;67(2):261-275. [CrossRef]
- Roberts JK, Pavlakis AE, Richards MP. It’s more complicated than it seems: Virtual qualitative research in the COVID-19 era. Int J Qual Methods. Jan 2021;20:16094069211002959. [CrossRef]
- Regmi PR, Waithaka E, Paudyal A, Simkhada P, van Teijlingen E. Guide to the design and application of online questionnaire surveys. Nepal J Epidemiol. Dec 2016;6(4):640-644. [CrossRef] [Medline]
Abbreviations
AI: artificial intelligence
VPN: virtual private network
Edited by Naomi Cahill; submitted 28.10.24; peer-reviewed by Stephen Blumberg, Vedamurthy Gejjegondanahalli Yogeshappa; final revised version received 26.06.25; accepted 26.06.25; published 21.08.25.
Copyright©Wen Zhi Ng, Sundarimaa Erdembileg, Jean C J Liu, Joseph D Tucker, Rayner Kay Jin Tan. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 21.8.2025.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research (ISSN 1438-8871), is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.

