Original Paper
Abstract
Background: Methylphenidate is an effective first-line treatment for attention-deficit/hyperactivity disorder (ADHD). However, many adverse effects of methylphenidate have been recorded from randomized clinical trials and patient-reported outcomes, but it is difficult to determine abuse from them. In the context of COVID-19, it is important to determine how drug use evaluation, as well as misuse of drugs, have been affected by the pandemic. As people share their reasons for using medication, patient sentiments, and the effects of medicine on social networking services (SNSs), the application of machine learning and SNS data can be a method to overcome the limitations. Proper machine learning models could be evaluated to validate the effects of the COVID-19 pandemic on drug use.
Objective: To analyze the effect of the COVID-19 pandemic on the use of methylphenidate, this study analyzed the adverse effects and nonmedical use of methylphenidate and evaluated the change in frequency of nonmedical use based on SNS data before and after the outbreak of COVID-19. Moreover, the performance of 4 machine learning models for classifying methylphenidate use based on SNS data was compared.
Methods: In this cross-sectional study, SNS data on methylphenidate from Twitter, Facebook, and Instagram from January 2019 to December 2020 were collected. The frequency of adverse effects, nonmedical use, and drug use before and after the COVID-19 pandemic were compared and analyzed. Interrupted time series analysis about the frequency and trends of nonmedical use of methylphenidate was conducted for 24 months from January 2019 to December 2020. Using the labeled training data set and features, the following 4 machine learning models were built using the data, and their performance was evaluated using F-1 scores: naïve Bayes classifier, random forest, support vector machine, and long short-term memory.
Results: This study collected 146,352 data points and detected that 4.3% (6340/146,352) were firsthand experience data. Psychiatric problems (521/1683, 31%) had the highest frequency among the adverse effects. The highest frequency of nonmedical use was for studies or work (741/2016, 36.8%). While the frequency of nonmedical use before and after the outbreak of COVID-19 has been similar (odds ratio [OR] 1.02 95% CI 0.91-1.15), its trend has changed significantly due to the pandemic (95% CI 2.36-22.20). Among the machine learning models, RF had the highest performance of 0.75.
Conclusions: The trend of nonmedical use of methylphenidate has changed significantly due to the COVID-19 pandemic. Among the machine learning models using SNS data to analyze the adverse effects and nonmedical use of methylphenidate, the random forest model had the highest performance.
doi:10.2196/45146
Keywords
Introduction
Attention-deficit/hyperactivity disorder (ADHD) is a widely diagnosed childhood psychiatric disorder [
, ]. Recently, ADHD has been increasingly diagnosed in adults as well as children and adolescents [ , ]. In the United States, about 10% of children aged between 4 and 17 years have been diagnosed with ADHD [ ]. In 2007, the worldwide prevalence of ADHD was 5.29% [ ]. Behavior therapy is the first-line treatment for ADHD in children 5 years of age or younger. Pharmacology therapy using methylphenidate is recommended if behavioral approaches do not give a significant outcome. From 6 years of age, methylphenidate is used as the first-line agent for ADHD [ ]. Methylphenidate is a sympathomimetic drug that works by affecting the dopaminergic and noradrenergic systems. Noradrenaline and dopamine concentrations increase in the brainstem, midbrain, and frontal cortex. These effects are responsible for increased attention span and concentration [ ]. However, through its pharmacology mechanism, methylphenidate can cause several adverse effects, including sleep problems, loss of appetite, and anxiety disorders. Furthermore, hallucinations and delusions may be caused by methylphenidate use [ , ]. A previous study showed that 1.2% of patients who used methylphenidate had serious adverse effects, including psychotic disorders. Moreover, 51.2% of patients experienced nonserious adverse effects, including sleep problems and loss of appetite. This study additionally noted that 7.3% of patients withdraw from methylphenidate use due to its adverse effects [ ]. Methylphenidate has a potential risk of nonmedical use and is listed as a Schedule II substance in the United States [ , , ]. It is likely to have effects resembling model illicit drugs, with a similar mechanism of action to cocaine and methamphetamine. The effect occurs because of the drug’s pharmacological action as a dopamine reuptake inhibitor and the subsequent synaptic dopamine increase observed after administration [ ]. Therefore, methylphenidate has a potential risk of abuse. As it is a Schedule II substance, methylphenidate abuse can cause severe psychological or physical dependence. According to the National Survey on Drug Use and Health in the United States, 913,000 people aged >12 years abused methylphenidate in 2017. Moreover, 6.5% of the population aged between 18 and 25 years abused methylphenidate and amphetamine in 2018. The reason for methylphenidate abuse was to enhance focus while studying [ ].Drug use may have been affected by the spread of COVID-19. The outbreak of COVID-19 began in December 2019, and the World Health Organization soon declared it a pandemic. During the pandemic, people could not continue daily life, and industries were shut down. As a result, drug use was affected by these situations [
, ]. Hair analysis in Italy showed decreasing consumption of cannabis, benzodiazepine, and cocaine during and until the end of the lockdown period [ ]. Self-reported surveys in the Netherlands also indicated declining trends of tobacco, alcohol, cocaine, and prescribed drug consumption [ ]. Therefore, drug use evaluation as affected by the pandemic, as well as misuse of drugs, is important. However, it is difficult to determine abuse from randomized clinical trials and patient-reported outcomes. Moreover, most studies are conducted using surveys, but surveys have a response bias toward concealing abuse due to fear of detection [ ]. As people share their purposes for using medication, patient sentiment, and the effects of medicine on social networking services (SNSs), the application of machine learning and SNS data can be a method to overcome the limitations [ - ]. A previous study analyzed the nonmedical use and adverse effects of methylphenidate on Twitter by applying the support vector machine (SVM) model. The SVM model had F-1 scores of 0.547 for nonmedical use and 0.733 for adverse effects [ ]. However, the other SNS machine learning models could be applied to validate the effects of the COVID-19 pandemic on drug use. This study aims to determine the adverse effects and nonmedical use of methylphenidate and the change in frequency of nonmedical use depending on COVID-19 transmission. It also aims to evaluate the performance of several machine learning models for classifying methylphenidate use based on SNS data.Methods
Step 1: Analysis of Adverse Effects and Nonmedical Use of Methylphenidate
Data Collection
This cross-sectional study collected and analyzed data on methylphenidate use from Twitter, Facebook, and Instagram between January 2019 and December 2020 (
A). These SNSs were used because they have many users and are representative of SNSs worldwide [ - ].A web crawler was developed in Python software (version 3.7; Python Software Foundation) to collect the methylphenidate use data set because the Twitter application programming interface (API) policy was changed to require approval for crawling in 2021, and the SNS APIs have limitations on the collection period and information collected [
]. The following keywords were used to crawl for methylphenidate use: “Aptensio,” “Biphentin,” “Concerta,” “Daytrana,” “Equasym,” “Jornay,” “Medikinet,” “metadata,” “Methylin,” “QuilliChew,” “Quillivant,” “Ritalin,” “Rubifen,” and “Adhansia.” Data from all languages were collected and universally translated into English using the Google Translate program. Instagram and Twitter data mentioning the keywords were collected using the web crawler. However, as Facebook did not have sufficient keyword data, the Facebook data were manually collected.Data Annotation
Collected data were annotated to analyze methylphenidate use by annotators manually. The collected data were identified as either (1) firsthand experience or (2) non–firsthand experience, and only firsthand experience data were used in the analysis. Firsthand experience was defined to mean that the writer of the SNS post had personally used methylphenidate [
, ]. Content annotated as non–firsthand experience were news, humor, research results, another person’s experience, other drugs, song lyrics, or other content based on the previous studies [ , ]. The data annotated as firsthand experience were additionally labeled as follows: (1) medical use, (2) adverse effects, and (3) nonmedical use. Medical use was defined as the use of a medication prescribed by a physician or pharmacist. All adverse effects were also collected. It was possible for medical/nonmedical use and adverse effects to appear in a single tweet. In these cases, the data could be classified as pertaining to both side effects and nonmedical use. In addition, drug abuse slang was considered as an abuse term for labeling because methylphenidate has the potential to produce effects similar to cocaine; thus, its abuse can be a gateway to illicit drugs [ , ]. The interannotator agreement was measured by Cohen Kappa, and disagreements were discussed with experienced clinical pharmacists [ , ].Data Analysis Collected by Crawler From Twitter, Facebook, and Instagram
Initially, among the firsthand experience data, the frequency of side effects and nonmedical use of methylphenidate in the SNSs were analyzed monthly from January 2019 to December 2020. Interrupted time series analysis about nonmedical use of methylphenidate was conducted on “frequency” for a total of 24 months from January 2019 to December 2020. As a result of confirming the trend of “frequency” via a time plot, “frequency” showed a quadratic trend. To find out whether the trend of “frequency” changed after COVID-19, the period from January 2020 onward was designated as “corona outbreak,” and the trend of “frequency” before and after January 2020 was compared. For model identification using the Auto Regressive Integrated Moving Average model (ARIMA), the autocorrelation function (ACF) and the partial autocorrelation function (PACF) were obtained. As a result, the trend of “frequency” was determined by the autoregression (AR) model, written as AR(1). Therefore, since “frequency” is a data point with autocorrelation, regression analysis was performed using the generalized least squares method. Firsthand experience data were analyzed using Microsoft Excel (Microsoft Corp) spreadsheets, R software (R Foundation for Statistical Computing), and RStudio (version 4.1.2). Microsoft Excel was used to analyze the frequencies, percentages, and means of adverse effects and nonmedical use. The frequency of adverse effects and nonmedical use was analyzed by allowing duplication according to the subtopic. The EpiTools and gmodels packages in R and RStudio were applied to analyze the odds ratios (ORs) with a 95% CI, and a chi-square test was used to compare data before and after the COVID-19 pandemic. Coingestion in the subtopic of nonmedical use was the defined intake of methylphenidate with alcohol-related drinks or other stimulants, excluding drugs of abuse, such as cocaine and marijuana, as reported on the SNSs. Data regarding the use of cocaine and marijuana with methylphenidate or data with slang terms were identified as drug abuse. Recreational use was also defined as using methylphenidate to party or play games.
Step 2: Performance Evaluation of the Machine Learning Models
Data Preprocessing
To improve the performance of the machine learning models, the collected SNS data were preprocessed (
B) [ - ]. Next, to perform the supervised learning, several features were generated, such as medical and nonmedical use terms, personal nouns, side effect terms, and so on. The selection of terms regarding adverse effects and nonmedical use were labeled according to the features reported by previous studies or were added if they appeared in the training data set [ , ]. For medical and nonmedical use terms, before generating the features of nonmedical use terms, natural language processing (NLP) was performed. Preprocessing was performed (1) to remove duplicate and null data, usernames, links, punctuation marks, numeric values, and stop words; (2) to convert terms to lowercase; and (3) for stemming. In detail, duplicate and null data, usernames, links, punctuation marks, and numeric values that did not contribute to classification and could hinder machine learning were removed [ ]. Stop words, including “a,” “an,” “the,” and “with” were also removed. After removing these data, all words were converted to lowercase to reduce confusion and increase accuracy because machine learning recognizes lowercase and uppercase words as different. Finally, the Porter stemmer algorithm, which removes suffixes, was applied to reduce the text complexity. For example, “take” and “taking” contain the same information, but machine learning considers them different features [ - ]. Additionally, the long short-term memory (LSTM) model must unify the sentence length because of the model characteristics. As 99.05% (6280/6340) of the sentences had lengths ≤40 words, the sentence length in the LSTM model was unified to 40.Machine Learning Models
This study compared the performance of 4 machine learning models using the sklearn and TensorFlow packages in Python: a naïve Bayes (NB) classifier, random forest (RF), SVM, and LSTM. The NB classifier, RF, and SVM are traditional models that are well-suited for classification [
]. LSTM was also applied because it effectively classifies and analyzes text data [ , ].A set of hyperparameters was selected for each model for feature extraction techniques and structures to increase the performance and effectively control the complexity of the models [
]. The feature extraction techniques applied were bag-of-words (BoW) and term frequency-inverse document frequency (TF-IDF) [ , ]. Using a radial basis function kernel, SVM was trained to classify nonmedical use and side effects. To achieve a better performance, 2 hyperparameters of the SVM (ie, cost, gamma) were tuned. The hyperparameters were set for each model, and the remaining hyperparameters were set as default values. LSTM, a deep learning model, was structured in layers. The accuracy, precision, recall, and F-1 scores were calculated based on previous studies to evaluate the performance of the models [ , ]. True positives (TPs), false positives (FPs), true negatives (TNs), and false negatives (FNs) were calculated by comparing annotated results and predicted results according to the following equations (1)-(4):Precision = TP/(TP + FP) (1)
Recall = TP/(TP + FN) (2)
Accuracy = (TP + TN)/(TP + TN + FP + FN) (3)
F-1 = (2× Precision × Recall)/(Precision + Recall) (4)
Due to the imbalance of these categorical data, the performance of the classifiers was evaluated using the F-1 score (ie, harmonic mean of precision and recall) instead of accuracy. The data set was randomly divided into 85% of the data set to train the models and 15% to evaluate their performance. In addition, the training data set had a very large number of negative samples. To compensate for this issue, 10-fold cross-validation was performed on the training data, and inverse weights were assigned to positive and negative samples [
, ].Ethical Considerations
Twitter, Facebook, and Instagram are major social networks. Under strict policies, they try to regulate the ethical use of their data and maintain the privacy of their users. The information needed for the processing steps has been collected and saved following their guidelines. All sensitive data were discarded after analysis.
Results
Overview
From January 2019 to December 2020, a total of 146,352 data points were collected from Twitter, Facebook, and Instagram (
). Data annotated as firsthand experience comprised 4.3% (6340/146,352) of the data set, including medical use (2641/6340, 41.7%), adverse effects (1683/6340, 26.5%), and nonmedical use (2016/6340, 31.8%). The Cohen Kappa was 0.92, which means that the agreement was substantial. The word cloud indicates that people post about using methylphenidate for ADHD treatment or study and share their experiences of adverse effects, such as anxiety, depression, and loss of appetite (Figure S1 of ).Data | Twitter (N=145,520), n (%) | Instagram (N=672), n (%) | Facebook (N=160), n (%) | Total SNSsa (N=146,352), n (%) | ||||
Non–firsthand experience | 139,255 (95.7) | 599 (89.1) | 158 (98.8) | 140,012 (95.7) | ||||
Firsthand experience | 6265 (4.3) | 73 (10.9) | 2 (1.2) | 6340 (4.3) | ||||
Medical use | 2590 (41.3) | 50 (68.5) | 1 (50) | 2641 (41.7) | ||||
Side effects | 1660 (26.5) | 22 (30.1) | 1 (50) | 1683 (26.5) | ||||
Nonmedical use | 2015 (32.2) | 1 (1.4) | 0 (0) | 2016 (31.8) |
aSNS: social networking service.
Analysis of the Adverse Effects and Nonmedical Use of Methylphenidate
Frequency of Adverse Effects and Nonmedical Use
The data on adverse effects and nonmedical use were analyzed by allowing duplication (
).Psychiatric problems, including anxiety, depression, jitters, nervousness, panic, restlessness, tension, and worry, were the most frequent adverse effects of methylphenidate (521/1683, 31%). Sleep problems and loss of appetite accounted for a significant proportion of the adverse effects of methylphenidate at 18.7% (314/1683) and 14.8% (250/1683), respectively. Gastrointestinal and neurological problems were also frequent adverse effects at 7.7% (129/1683) and 5.4% (91/1683), respectively. In particular, a few hallucinations and delusions were discovered in the SNSs.
The frequency of nonmedical use of methylphenidate was also analyzed (
). Study or work was the most frequent reason for nonmedical use (741/2016, 36.8%). Coingestion and drug abuse were additional nonmedical uses at 19.9% (401/2016) and 13% (262/2016), respectively. Methylphenidate was also occasionally abused for recreational purposes (3.5%, 70/2016). The data on recreational use indicated that the users took methylphenidate to party or play games for longer amounts of time.Topics and subtopics | Data frequencyb, n (%) | |
Side effects (N=1683) | ||
Psychiatric problems | 521 (31) | |
Sleep problems | 314 (18.7) | |
Loss of appetite | 250 (14.8) | |
Heart problems | 138 (8.2) | |
Gastrointestinal problems | 129 (7.7) | |
Neurological problems | 91 (5.4) | |
Sweating | 21 (1.2) | |
Eye problems | 12 (0.7) | |
Other | 402 (23.9) | |
Nonmedical use (N=2016) | ||
Study or work | 741 (36.8) | |
Coingestion | 401 (19.9) | |
Drug abuse | 262 (13) | |
Overdose | 186 (9.2) | |
Seeking or obtaining | 118 (5.8) | |
Recreational use | 70 (3.5) | |
Loss of weight | 44 (2.2) | |
Other | 265 (13.1) |
aSNS: social networking service.
bData frequency of side effects and nonmedical use of methylphenidate was analyzed by allowing duplication.
Comparison of Nonmedical Use of Methylphenidate Before and After the COVID-19 Pandemic
The frequency of nonmedical use of methylphenidate was compared before and after the COVID-19 pandemic, as drug use has been affected by the COVID-19 pandemic. The OR for nonmedical use of methylphenidate after the pandemic was 1.02 (95% CI 0.91-1.15) compared to before the pandemic, indicating insignificant results. This indicates no difference in the frequency of nonmedical use before and after the pandemic. However, coingestion and drug abuse in nonmedical use have significantly changed (Table S1 of
). While the incidence of coingestion increased from 17.7% to 22.9% (P<.01), that of drug abuse decreased from 16.5% to 8.3% (P<.001) after the pandemic.The distribution of nonmedical use of methylphenidate differed before and after the COVID-19 pandemic (
). The frequency of nonmedical use before the pandemic was the highest in May 2019 (182/2016, 9%). However, the frequency of nonmedical use reached its lowest point in February 2020 (29/2016, 1.4%), which was immediately after the COVID-19 outbreak. It increased in May 2020 (80/2016, 4%) and showed an increasing trend until October 2020. Interrupted time series analysis was conducted on “frequency” for a total of 24 months from January 2019 to December 2020. When evaluated by time plot, “frequency” showed a quadratic trend. To check whether the trend of “frequency” changed after COVID-19, the trend of “frequency” before and after January 2020 was compared by setting the point of January 2020 as “corona outbreak.” As a result of obtaining the ACF and PACF to identify the model using the ARIMA model, the trend of “frequency” was determined by the AR(1) model. Therefore, since “frequency” is a data point with autocorrelation, regression analysis was performed using the generalized least squares method. As a result of the analysis, before the outbreak of COVID-19 (2019), the frequency of nonmedical use of methylphenidate tended to decrease by 5.52 per month, but it was not significant (95% CI −12.08 to −1.04). However, after the outbreak of COVID-19 (2020), it was significant with a monthly increase of 12.279 (95% CI 2.36-22.20). Therefore, the trend of the frequency of methylphenidate nonmedical use has changed significantly due to the COVID-19 pandemic.shows the hyperparameters of the machine learning models.
The RF model with BoW and TF-IDF and the SVM model with TF-IDF produced the highest F-1 score of 0.75 (
). In contrast, LSTM had the lowest F-1 score of 0.69. Because the adverse effects and nonmedical use of methylphenidate vary and these text patterns were randomly established, LSTM performed worse than the other models. The low recall for adverse effects was also responsible for their low F-1 scores. It is suggested that the models were less effective at filtering for false negatives. The performance of the RF model and the NB classifier was the same with BoW and TF-IDF, and the NB classifier and SVM performed slightly better with TF-IDF than BoW.Models and analysis type | Hyperparameters | |
With BoWa | ||
NBb | N/Ac | |
RFd | n_estimates = 500 | |
SVMe | C = 10, gamma = 0.1 | |
With TF-IDFf | ||
NB | TF-IDF = character N-gram (range = (3, 4)), alpha = 0.1 | |
RF | TF-IDF = count, n_estimates = 500, max_depth = 300, random_state = 3 | |
SVM | TF-IDF = count | |
Deep learning | ||
LSTMg | embedding layer = dimensions (40, 100), neurons of LSTM layer = 128, activation function of LSTM layer = “sigmoid,” neurons of dense layer = 3, activation function of dense layer = “softmax,” loss function = “sparse categorical crossentropy,” optimizer = “adam” |
aBoW: bag-of-words.
bNB: Naïve Bayes.
cN/A: not applicable.
dRF: random forest.
eSVM: support vector machine.
fTF-IDF: term frequency-inverse document frequency.
gLSTM: long short-term memory.
Model and class | Accuracy | Precision | Recall | F-1 score | ||||||
WithBoWa | ||||||||||
NBb | 0.71 | |||||||||
0c | 0.69 | 0.82 | 0.75 | |||||||
1d | 0.65 | 0.60 | 0.63 | |||||||
2e | 0.80 | 0.66 | 0.72 | |||||||
Macroaverage | 0.72 | 0.69 | 0.70 | |||||||
Weighted average | 0.72 | 0.71 | 0.71 | |||||||
RFf | 0.76 | |||||||||
0 | 0.75 | 0.89 | 0.81 | |||||||
1 | 0.75 | 0.54 | 0.63 | |||||||
2 | 0.79 | 0.78 | 0.79 | |||||||
Macroaverage | 0.76 | 0.74 | 0.74 | |||||||
Weighted average | 0.76 | 0.76 | 0.75 | |||||||
SVMg | 0.72 | |||||||||
0 | 0.66 | 0.93 | 0.77 | |||||||
1 | 0.77 | 0.44 | 0.56 | |||||||
2 | 0.82 | 0.68 | 0.74 | |||||||
Macroaverage | 0.75 | 0.68 | 0.69 | |||||||
Weighted average | 0.74 | 0.72 | 0.71 | |||||||
With TF-IDFh | ||||||||||
NB | 0.73 | |||||||||
0 | 0.70 | 0.84 | 0.76 | |||||||
1 | 0.72 | 0.60 | 0.65 | |||||||
2 | 0.80 | 0.68 | 0.73 | |||||||
Macroaverage | 0.74 | 0.71 | 0.72 | |||||||
Weighted average | 0.73 | 0.73 | 0.72 | |||||||
RF | 0.76 | |||||||||
0 | 0.75 | 0.89 | 0.81 | |||||||
1 | 0.75 | 0.51 | 0.61 | |||||||
2 | 0.79 | 0.79 | 0.79 | |||||||
Macroaverage | 0.76 | 0.73 | 0.74 | |||||||
Weighted average | 0.76 | 0.76 | 0.75 | |||||||
SVM | 0.76 | |||||||||
0 | 0.76 | 0.85 | 0.80 | |||||||
1 | 0.75 | 0.57 | 0.65 | |||||||
2 | 0.76 | 0.80 | 0.78 | |||||||
Macroaverage | 0.76 | 0.74 | 0.74 | |||||||
Weighted average | 0.76 | 0.76 | 0.75 | |||||||
Deep learning | ||||||||||
LSTMi | 0.69 | |||||||||
0 | 0.75 | 0.72 | 0.74 | |||||||
1 | 0.62 | 0.58 | 0.60 | |||||||
2 | 0.68 | 0.75 | 0.71 | |||||||
Macroaverage | 0.68 | 0.68 | 0.68 | |||||||
Weighted average | 0.69 | 0.69 | 0.69 |
aBoW: bag-of-words.
bNB: Naïve Bayes.
cClass 0 refers to medical use.
dClass 1 refers to side effects.
eClass 2 refers to nonmedical use.
fRF: random forest.
gSVM: support vector machine.
hTF-IDF: term frequency-inverse document frequency.
iLSTM: long short-term memory.
Discussion
Principal Findings
This study analyzed the adverse effects and nonmedical use of methylphenidate and evaluated the change in frequency of nonmedical use before and after COVID-19 using data from Twitter, Facebook, and Instagram. It also evaluated the performance of machine learning to effectively classify methylphenidate use from SNS data by comparing the results of the machine learning models.
Psychiatric problems were the most frequent adverse effects. The clinical concern is that treatment with methylphenidate may increase the risk of psychosis. The incidence rate of psychosis was 1.78 episodes per 1000 person-years of drug exposure in the methylphenidate group in a cohort study using 2 databases in the United States [
]. Psychosis might camouflage ADHD symptoms, such as anxiety and mood disorders because the same brain region and neurotransmitter system are involved in ADHD [ ]. Moreover, 2 studies demonstrated that psychiatric problems are still considered clinical concerns during methylphenidate consumption [ , ].Sleep problems were found to be common adverse effects of methylphenidate in this study. Because they impact daytime attention and mood disorders, sleep problems are important adverse effects and must be managed continuously [
]. A review of the safety and tolerability of ADHD medications noted that insomnia was one of the most commonly reported adverse events associated with psychostimulant treatment [ ].Furthermore, gastrointestinal and neurological problems were also common adverse effects. While most adverse effects occur in similar proportions as in the methylphenidate clinical trial results in the United States, the proportion of gastrointestinal and neurological problems was lower than that in clinical trials [
]. A previous survey also indicated that the adverse effects of methylphenidate were different from clinical trials [ ]. This may indicate that results from self-reporting, such as surveys, differ from clinical trial results. However, the difference between adverse effects data from clinical trials and surveys is acceptable because drugs can cause different responses within a large population. A pharmacovigilance system was applied to detect rare adverse effects, especially in postmarketing surveillance studies and spontaneous adverse drug reaction reporting [ ].The results of this study showed that nonmedical use for study or work had the highest frequency, followed by coingestion and drug abuse. Methylphenidate is widely used for academic achievement without a prescription by a physician [
- ]. The increasing motivation in patients with methylphenidate consumption was caused by increasing dopamine transporters and D2/D3 receptor availability [ ]. In particular, the illicit use of methylphenidate to increase concentration is strongly associated with drug abuse (eg, cocaine abuse) [ ]. Abusing, seeking, or obtaining methylphenidate, behaviors that are difficult to analyze via surveys, were significantly detected in the SNS data. This indicates that people use methylphenidate as a gateway for abuse and obtain the drug from peers or through doctor shopping [ , ]. Moreover, this study showed that some people use methylphenidate for recreational purposes. In a previous survey-based study, 107 of 164 students said they used methylphenidate to party for a longer time, which suggested that education should be provided to students who abuse methylphenidate [ , ].Nonmedical use of methylphenidate was compared before and after the COVID-19 pandemic. The frequency of nonmedical use was not shown to be different before and after the pandemic. However, coingestion in nonmedical use was significantly higher after the pandemic because coingestion can be easier to abuse than other nonmedical uses. While drug abuse during the pandemic also generally increased because people stayed at home, as required due to social distancing protocols, stimulant abuse was reported to decrease after the COVID-19 pandemic in previous studies [
, , ]. Similarly, this study indicated that the abuse of methylphenidate decreased after the COVID-19 pandemic. Drug abuse decreased because it became difficult to import the drug substance for the stimulant because of the disruption of raw material production [ ]. The pandemic may also have contributed to the decrease in drug abuse because people could not get together for parties. A study in the Netherlands showed that the consumption of ecstasy, amphetamines, cocaine, nitrous oxide, ketamine, LSD, psychedelic mushrooms or truffles, GHB, 2C-B, 3-MMC, and 4-MMC (other than prescription drugs) decreased during the pandemic because of fewer social occasions (eg, going out, appointments, visits, parties, and other activities) [ ].The frequency of nonmedical use decreased from May 2019 to May 2020, which is the traditional examination period, and immediately after the COVID-19 outbreak [
, ]. The main reason for the decline in the nonmedical use of methylphenidate was the closure of schools and universities ordered due to COVID-19, which made typical academic classes impossible [ , ]. School closures may have directly affected the amount of nonmedical use. Moreover, the COVID-19 pandemic contributed to a 75% decrease in outpatient visits [ ]. Patients had difficulty purchasing medicines because of the pandemic and social distancing. However, after the outbreak of COVID-19 in 2020, it was significant, with a monthly increase of 12.279 (95% CI 2.36-22.20). Therefore, although the frequency of methylphenidate nonmedical use has not changed significantly due to the COVID-19 pandemic, its trend has changed significantly.This study had better F-1 scores than that of Kim and colleagues [
], in which the F-1 score for nonmedical use of methylphenidate was 0.547, and that for adverse effects of methylphenidate was 0.733. However, the precision scores were not better in this study than in the study by Kim and colleagues, which were 0.920 for adverse effects and 0.926 for nonmedical use. This suggests that this study was not more effective at filtering false positives.The F-1 scores of the deep learning models in the studies by Li et al [
] and Hu et al [ ] were higher than in this study at 0.93 and 0.86, respectively. This is due to similar text patterns and words in the data. As previous studies had specific subjects showing the detection of illicit drug dealers and stimulant abuse, their performance was higher than in this study. However, in this study, as in previous studies, RF outperformed the other models because the class planes that were linearly separated could not be used for the text data [ , , ].Limitations
Most data collected from SNSs were obtained from Twitter (145,520/146,352, 99.4%), and Facebook and Instagram data accounted for only 0.6% (832/146,352). Thus, the results were biased only toward Twitter. These data proportions were caused by the policies of each SNS. Facebook and Instagram changed their policies to block searches for specific medicines and suspended offending accounts in April 2018 [
]. Searches and text may have been limited on Facebook and Instagram, and the search results may also have been hidden or deleted by the SNS policies. Moreover, about 9% of the teenagers in previous studies who potentially took methylphenidate use Facebook in the United States, and Instagram is not based primarily on text [ - ]. These factors may affect the volume of the collected data. Therefore, it is necessary to study data from other SNSs that are unmanaged and frequently used by students. Nevertheless, NLP techniques continue to develop rapidly, and new methods emerge one after another. However, the NLP method applied in this study was the method used in previous studies to reconfirm whether there was a similar pattern during a similar period compared to previous studies and to confirm the impact of COVID-19 [ ]. In the future, results from new NLP methods are expected to generate advanced outcomes. Moreover, because we performed traditional supervised learning, each instance is associated with only a single class label. Although binary classification based on text structure is a relatively simple type of text classification, real-world SNS instances might have multiple semantic meanings simultaneously. If such multilabel learning is applied in future studies, we expect that it will reflect the real environment well for each instance [ , ].Another limitation is that this study could not analyze changes in the nonmedical use of methylphenidate in 2021 when social restrictions and public alarm around the COVID-19 pandemic decreased, and it became a part of daily life. The distribution of nonmedical use in this study clearly indicates a change before and after COVID-19 transmission. However, as the analysis period was limited between January 2019 and December 2020, more recent nonmedical use could not be analyzed. Therefore, additional studies are needed to analyze how the status of nonmedical use has changed since 2021.
The results of this study are not generalizable and do not necessarily apply to the actual adverse effects and nonmedical use of methylphenidate. Most studies that rely on SNS data face this limitation [
]. The reason is that Twitter, Facebook, and Instagram restrict users aged <13 years, and this population is adept at using SNSs. Moreover, it cannot verify if users actually use the medications and experience adverse effects or if they use them for nonmedical purposes. However, despite this limitation, studies using SNSs can easily and effectively analyze the status of stimulant use.Conclusions
This study analyzed the adverse effects and nonmedical use of methylphenidate and evaluated the frequency and changes in the trend of nonmedical use frequency before and after the COVID-19 pandemic on Twitter, Facebook, and Instagram. This study also evaluated the performance of machine learning to classify methylphenidate use from SNS data by comparing the results of 4 different machine learning models. The results of this study will contribute to determining the status of methylphenidate use, and machine learning using SNS data will be useful in applications for automatic monitoring tasks. Because stimulants are potentially abused or used as a gateway to drug abuse, their use must be continuously managed. Therefore, future studies should seek to apply machine learning to other stimulants and other SNSs that are unmanaged and frequently used by students.
Acknowledgments
This research was supported by a grant (NRF-2021R1F1A1062044) from the Korean Ministry of Science and Information and Communication Technology and by the Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (2021R1A6A1A03044296). The funder had no further role in the study design; collection, analysis, and interpretation of data; writing of the manuscript; or decision to submit the paper for publication.
Data Availability
The data sets generated during and/or analyzed during this study are available from the corresponding author upon reasonable request.
Conflicts of Interest
None declared.
Supplemental table and figure.
DOCX File , 844 KBReferences
- Storebo O, Ramstad E, Krogh H, Nilausen T, Skoog M, Homskov M. Methylphenidate for children and adolescents with attention deficit hyperactivity disorder (ADHD). Cochrane Database Syst Rev. 2015;11:1-698. [FREE Full text] [CrossRef]
- Morton W, Stockton G. Methylphenidate abuse and psychiatric side effects. Prim Care Companion J Clin Psychiatry. Oct 2000;2(5):159-164. [FREE Full text] [CrossRef] [Medline]
- Jensen L, Pagsberg A, Dalhoff K. Methylphenidate misuse in adult patients and the impact of therapeutic use. Hum Exp Toxicol. May 2015;34(5):460-467. [FREE Full text] [CrossRef] [Medline]
- Zalsman G, Shilton T. Adult ADHD: A new disease? Int J Psychiatry Clin Pract. Apr 07, 2016;20(2):70-76. [CrossRef] [Medline]
- Pastor P, Reuben C, Duran C, Hawkins L. Association between diagnosed ADHD and selected characteristics among children aged 4–17 years: United States, 2011–2013. National Center for Health Statistics. May 2015. URL: https://www.cdc.gov/nchs/data/databriefs/db201.pdf [accessed 2023-07-20]
- Polanczyk G, de Lima MS, Horta BL, Biederman J, Rohde LA. The worldwide prevalence of ADHD: a systematic review and metaregression analysis. Am J Psychiatry. Jun 2007;164(6):942-948. [CrossRef] [Medline]
- Wolraich M, Hagan J, Allan C. Clinical practice guideline for the diagnosis, evaluation, and treatment of attention-deficit/hyperactivity disorder in children and adolescents. Pediatrics. Mar 2020;145(3):e20192528. [FREE Full text] [CrossRef] [Medline]
- Bodey C. Effectiveness and tolerability of methylphenidate in children and adolescents with attention deficit hyperactivity disorder. Clin Med Insights Ther. Aug 09, 2011;3:353-363. [CrossRef]
- Storebo O, Pedersen N, Ramstad E, Kielsholm M, Nielsen S, Krogh H. Methylphenidate for attention deficit hyperactivity disorder (ADHD) in children and adolescents: Assessment of adverse events in non-randomised studies. Cochrane Database Syst Rev. 2018;5(5):1. [FREE Full text] [CrossRef]
- Ramstad E, Storebø OJ, Gerner T, Krogh H, Holmskov M, Magnusson F, et al. Hallucinations and other psychotic symptoms in response to methylphenidate in children and adolescents with attention-deficit/hyperactivity disorder: a Cochrane systematic review with meta-analysis and trial sequential analysis. Scand J Child Adolesc Psychiatr Psychol. 2018;6(1):52-71. [FREE Full text] [CrossRef] [Medline]
- Clemow D. Misuse of methylphenidate. Curr Top Behav Neurosci. 2017;34:99-124. [CrossRef] [Medline]
- Shellenberg TP, Stoops WW, Lile JA, Rush CR. An update on the clinical pharmacology of methylphenidate: therapeutic efficacy, abuse potential and future considerations. Expert Rev Clin Pharmacol. Aug 25, 2020;13(8):825-833. [CrossRef] [Medline]
- Di Trana A, Carlier J, Berretta P, Zaami S, Ricci G. Consequences of COVID-19 lockdown on the misuse and marketing of addictive substances and new psychoactive substances. Front Psychiatry. Oct 23, 2020;11:584462. [FREE Full text] [CrossRef] [Medline]
- Borrelli E, Grosso D, Vella G, Sacconi R, Querques L, Zucchiatti I, et al. Impact of COVID-19 on outpatient visits and intravitreal treatments in a referral retina unit: let's be ready for a plausible "rebound effect". Graefes Arch Clin Exp Ophthalmol. Dec 22, 2020;258(12):2655-2660. [FREE Full text] [CrossRef] [Medline]
- Gili A, Bacci M, Aroni K, Nicoletti A, Gambelunghe A, Mercurio I, et al. Changes in drug use patterns during the COVID-19 pandemic in Italy: monitoring a vulnerable group by hair analysis. Int J Environ Res Public Health. Feb 18, 2021;18(4):1967. [FREE Full text] [CrossRef] [Medline]
- Benschop A, van Bakkum F, Noijen J. Changing patterns of substance use during the coronavirus pandemic: self-reported use of tobacco, alcohol, cannabis, and other drugs. Front Psychiatry. May 26, 2021;12:633551. [FREE Full text] [CrossRef] [Medline]
- Kim M, Kim J, Kim S, Jeong J. Twitter analysis of the nonmedical use and side effects of methylphenidate: machine learning study. J Med Internet Res. Feb 24, 2020;22(2):e16466. [FREE Full text] [CrossRef] [Medline]
- Hu H, Phan N, Geller J, Iezzi S, Vo H, Dou D, et al. An ensemble deep learning model for drug abuse detection in sparse Twitter-sphere. Stud Health Technol Inform. Aug 21, 2019;264:163-167. [CrossRef] [Medline]
- Barenholtz E, Fitzgerald ND, Hahn WE. Machine-learning approaches to substance-abuse research: emerging trends and their implications. Curr Opin Psychiatry. Jul 2020;33(4):334-342. [CrossRef] [Medline]
- Edara D, Vanukuri L, Sistla V, Kolli V. Sentiment analysis and text categorization of cancer medical records with LSTM. J Ambient Intell Humaniz Comput. 2023:5309-5325. [FREE Full text] [CrossRef]
- Gentina E, Chen R, Yang Z. Development of theory of mind on online social networks: Evidence from Facebook, Twitter, Instagram, and Snapchat. J Bus Res. Jan 2021;124:652-666. [CrossRef]
- Pavelle S, Wilkinson C. Into the digital wild: utilizing Twitter, Instagram, YouTube, and Facebook for effective science and environmental communication. Front Commun. Oct 16, 2020;5 [CrossRef]
- Alhabash S, Ma M. A tale of four platforms: motivations and uses of Facebook, Twitter, Instagram, and Snapchat among college students? Soc Media Soc. Feb 01, 2017;3(1):205630511769154. [CrossRef]
- Li J, Xu Q, Shah N, Mackey TK. A machine learning approach for the detection and characterization of illicit drug dealers on Instagram: Model evaluation study. J Med Internet Res. Jun 15, 2019;21(6):e13803. [FREE Full text] [CrossRef] [Medline]
- Hanson CL, Burton SH, Giraud-Carrier C, West JH, Barnes MD, Hansen B. Tweaking and tweeting: exploring Twitter for nonmedical use of a psychostimulant drug (Adderall) among college students. J Med Internet Res. Apr 17, 2013;15(4):e62. [FREE Full text] [CrossRef] [Medline]
- Slang terms and code words: A reference for law enforcement personnel. Drug Enforcement Administration. 2018. URL: https://www.dea.gov/sites/default/files/2018-07/DIR-022-18.pdf%20 [accessed 2023-06-01]
- Carletta J. Assessing agreement on classification tasks: the kappa statstic. Comput Linguist. 1996;22(2):249-254. [FREE Full text]
- Sarker A, O'Connor K, Ginn R, Scotch M, Smith K, Malone D, et al. Social media mining for toxicovigilance: automatic monitoring of prescription medication abuse from Twitter. Drug Saf. Mar 2016;39(3):231-240. [FREE Full text] [CrossRef] [Medline]
- Rustam F, Khalid M, Aslam W, Rupapara V, Mehmood A, Choi GS. A performance comparison of supervised machine learning models for COVID-19 tweets sentiment analysis. PLoS One. Feb 25, 2021;16(2):e0245909. [FREE Full text] [CrossRef] [Medline]
- Singh B, Desai R, Ashar H, Tank P, Katre N. A trade-off between ML and DL Techniques in natural language processing. J Phys Conf Ser. Mar 01, 2021;1831(1):012025. [CrossRef]
- Kumar A, Yun K, Gebregzabiher T, Tesfay B, Adane S. COVID19 tweeter data set sentiment analysis. Presented at: Fourth International Conference on Computational Intelligence and Communication Technologies (CCICT); July 3, 2021;110-115; Sonepat, India. [CrossRef]
- Zhang W, Yoshida T, Tang X. A comparative study of TF*IDF, LSI and multi-words for text classification. Expert Syst Appl. Mar 2011;38(3):2758-2765. [CrossRef]
- Aydogan A, An M, Yilmaz M. A new approach to social engineering with natural language processing: RAKE. Presented at: 9th International Symposium on Digital Forensics and Security (ISDFS); June 28-29, 2021;1-7; Elaz, Turkey. [CrossRef]
- Rao M, Sindhu C. Detection of sarcasm on Amazon product reviews using machine learning algorithms under sentiment analysis. Presented at: 2021 Sixth International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET); March 25-27, 2021;196-199; Chennai, India. [CrossRef]
- Xu B, Guo X, Ye Y, Cheng J. An improved random forest classifier for text categorization. J Comput Phys. Dec 01, 2012;7(12) [CrossRef]
- Mikolov T, Karafiat M, Burget L, Cernocky J, Khudanpur S. Recurrent neural network based language model. Presented at: International Speech Communication Association (INTERSPEECH) 2011; August 27-31, 2011;1048; Florence, Italy. [CrossRef]
- Chen Z, Watanabe S, Erdogan H, Hershey J. Speech enhancement and recognition using multi-task learning of long short-term memory recurrent neural networks. Presented at: International Speech Communication Association (INTERSPEECH) 2015; 6-10 September, 2015;3274-3278; Dresden, Germany. [CrossRef]
- Moran LV, Ongur D, Hsu J, Castro VM, Perlis RH, Schneeweiss S. Psychosis with methylphenidate or amphetamine in patients with ADHD. N Engl J Med. Mar 21, 2019;380(12):1128-1138. [FREE Full text] [CrossRef] [Medline]
- Faraone SV. The pharmacology of amphetamine and methylphenidate: Relevance to the neurobiology of attention-deficit/hyperactivity disorder and other psychiatric comorbidities. Neurosci Biobehav Rev. Apr 2018;87:255-270. [FREE Full text] [CrossRef] [Medline]
- Hollis C, Chen Q, Chang Z, Quinn PD, Viktorin A, Lichtenstein P, et al. Methylphenidate and the risk of psychosis in adolescents and young adults: a population-based cohort study. Lancet Psychiatry. Aug 2019;6(8):651-658. [FREE Full text] [CrossRef] [Medline]
- Yoo JH, Sharma V, Kim J, McMakin DL, Hong S, Zalesky A, et al. Prediction of sleep side effects following methylphenidate treatment in ADHD youth. Neuroimage Clin. 2020;26:102030. [FREE Full text] [CrossRef] [Medline]
- Labels for Concerta (methylphenidate HCl). Food and Drug Administration. URL: https://www.accessdata.fda.gov/drugsatfda_docs/label/2021/021121s044lbl.pdf [accessed 2023-07-07]
- Roche M, Mircher C, Toulas J, Prioux E, Conte M, Ravel A, et al. Efficacy and safety of methylphenidate on attention deficit hyperactivity disorder in children with Down syndrome. J Intellect Disabil Res. Aug 20, 2021;65(8):795-800. [CrossRef] [Medline]
- Alomar M, Tawfiq AM, Hassan N, Palaian S. Post marketing surveillance of suspected adverse drug reactions through spontaneous reporting: current status, challenges and the future. Ther Adv Drug Saf. 2020;11:2042098620938595. [FREE Full text] [CrossRef] [Medline]
- Koren G, Korn L. The use of methylphenidate for cognitive enhancement in young healthy adults. J Clin Psychopharmacol. Jan 12, 2021;41(2):100-102. [CrossRef]
- Chan W, Dargan P, Haynes C, Green J, Black J, Dart R, et al. Misuse of prescription medicines is as prevalent as the use of recreational drugs and novel psychoactive substances in Singapore: an unrecognised public health issue? Singapore Med J. Oct 2022;63(10):572-576. [FREE Full text] [CrossRef] [Medline]
- Soeiro T, Frauger ?, Pradel V, Micallef J. Doctor shopping for methylphenidate as a proxy for misuse and potential abuse in the 67 million inhabitants in France. Fundam Clin Pharmacol. Aug 04, 2021;35(4):751-761. [CrossRef] [Medline]
- Volkow ND, Wang G, Newcorn JH, Kollins SH, Wigal TL, Telang F, et al. Motivation deficit in ADHD is associated with dysfunction of the dopamine reward pathway. Mol Psychiatry. Nov 21, 2011;16(11):1147-1154. [FREE Full text] [CrossRef] [Medline]
- Teter CJ, McCabe SE, Cranford JA, Boyd CJ, Guthrie SK. Prevalence and motives for illicit use of prescription stimulants in an undergraduate student sample. J Am Coll Health. 2005;53(6):253-262. [CrossRef] [Medline]
- White BP, Becker-Blease KA, Grace-Bishop K. Stimulant medication use, misuse, and abuse in an undergraduate and graduate student sample. J Am Coll Health. 2006;54(5):261-268. [CrossRef] [Medline]
- Arria AM, Wish ED. Nonmedical use of prescription stimulants among students. Pediatr Ann. Aug 2006;35(8):565-571. [FREE Full text] [CrossRef] [Medline]
- Taylor S, Paluszek MM, Rachor GS, McKay D, Asmundson GJ. Substance use and abuse, COVID-19-related distress, and disregard for social distancing: A network analysis. Addict Behav. Mar 2021;114:106754. [FREE Full text] [CrossRef] [Medline]
- Iob E, Steptoe A, Fancourt D. Abuse, self-harm and suicidal ideation in the UK during the COVID-19 pandemic. Br J Psychiatry. Oct 13, 2020;217(4):543-546. [FREE Full text] [CrossRef] [Medline]
- Donohue JM, Miller E. COVID-19 and school closures. JAMA. Sep 01, 2020;324(9):845-847. [CrossRef] [Medline]
- Masonbrink A, Hurley E. Advocating for children during the COVID-19 school closures. Pediatrics. Sep 2020;146(3):e20201440. [CrossRef] [Medline]
- Li J, Li P, Hu X, Yu K. Learning common and label-specific features for multi-Label classification with correlation information. Pattern Recognit. Jan 2022;121:108259. [CrossRef]
- Deniz E, Erbay H, Coşar M. Multi-label classification of e-commerce customer reviews via machine learning. Axioms. Aug 26, 2022;11(9):436. [CrossRef]
Abbreviations
ACF: autocorrelation function |
ADHD: attention-deficit/hyperactivity disorder |
API: application programming interface |
AR: autoregression |
ARIMA: Auto Regressive Integrated Moving Average |
BoW: bag-of-words |
FN: false negative |
FP: false positive |
LSTM: long short-term memory |
NLP: natural language processing |
OR: odds ratio |
PACF: partial autocorrelation function |
SNS: social networking service |
SVM: support vector machine |
TF-IDF: term frequency-inverse document frequency |
TN: true negative |
TP: true positive |
Edited by A Mavragani; submitted 18.12.22; peer-reviewed by MG Kim, J Kim; comments to author 17.01.23; revised version received 09.05.23; accepted 28.06.23; published 16.08.23.
Copyright©Hocheol Shin, Cindra Tri Yuniar, SuA Oh, Sujata Purja, Sera Park, Haeun Lee, Eunyoung Kim. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 16.08.2023.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.