Introduction

Delayed ejaculation (DE) is a psychological and/or medical condition not associated with other types of psychiatric diagnoses. DE is defined as requiring one of two symptoms: significantly DE, failure to ejaculate, or its infrequency during intercourse at a rate of 75–100% persisting for at least 6 months and causing personal distress [1]. The prevalence of DE in sexually active men is reported to be 3% [2, 3].

Causes of DE can be organic (e.g., penile nerve damage or a spinal cord lesion), psychological, or pharmacological (e.g., antipsychotics, antihypertensive drugs, or selective serotonin re-uptake inhibitors) [4, 5]. Pharmacologic agents used to treat DE have various success. However, due to the lack of randomized, blinded, placebo-controlled studies, no drug has been proven to be superior or effective regarding DE treatment [6]. Furthermore, there is no European Medicines Agency- or Food and Drug Administration-approved drug for treating DE. There is very little literature on the psychological aspects of DE and also little empirical evidence on psychological treatment efficacy. People thus may tend to seek information on the internet.

As access to online health information becomes easier, the internet is increasingly used as a health information resource [7,8,9]. YouTube, the media and video sharing website with over 1 billion users worldwide, is an important information source that people can gather information about various illnesses and even surgical procedures [10, 11]. Recently, several studies have evaluated the quality of YouTube videos on topics such as bladder pain syndrome [12], bladder cancer [13], erectile dysfunction [14], penile prosthesis [15], male infertility [16], Peyronie’s disease [17], and even COVID-19 pandemic [18] hoping to illuminate YouTube’s effectiveness for patient education. However, there is no study evaluating the quality of YouTube videos regarding DE. We aimed to evaluate the quality of YouTube videos related to DE and to define the characteristics of the videos from the users’ point of view. Thus, we tried to determine whether accurate and reliable information about DE could be reached on Youtube.

Materials and methods

Ethics committee approval was not necessary because the videos were open to the public and there were no animal or human study participants. Similar studies in the literature were not ethics-approved either [19, 20]. The studies evaluating video quality on YouTube generally evaluated the top 100 or 200 videos [14, 21]. In addition, it is known that the vast majority of internet users tend to review videos on the first three pages of their search results [22]. Therefore, we decided to evaluate the first 400 videos on DE according to relevance that would cover most of the YouTube videos most likely to be viewed by internet users.

YouTube search

On October 10, 2021, a video search was performed with the term “delayed and retarded ejaculation” in the internet address https://www.youtube.com/. The video search was performed by two independent urologists (TT and MY) using two separate YouTube accounts. In terms of ranking the same video list by two independent urologists and so that the previous search history does not affect the new search, we deleted the search history, when the YouTube search was performed from different computers. In addition, we have also canceled the memberships of previously subscribed medical YouTube channels so that they do not affect the search process. Since we aimed to evaluate the videos on delayed and retarded ejaculation, it was essential for the results of the study to find the videos most related to delayed and retarded ejaculation. Therefore, it was listed the first 400 videos by relevance (YouTube’s default option). The search results were saved in the playlist, and the first 400 videos were evaluated by two independent urologists (TT and MY). Repeated, off-topic, non-English videos and videos with no audio were excluded from the study. Figure 1 shows the flowchart of the video selection process.

Fig. 1: The flowchart of the video selection process.
figure 1

After search on Youtube, 400 videos were assessed for eligibility and 349 videos were excluded. 51 videos included in the study.

Video features and quality analysis

Video length, number of comments, comments rate (comments per day), number of ratings (views/day), the total number of “likes” and “dislikes”, time since the upload date and the video source were recorded. The videos containing animation was also recorded. Video sources were categorized as physician, health-related websites, independent user, non-physician health personnel, and sex therapist. The video power index (VPI) to evaluate the popularity of videos was calculated using the formula as follow: [(total likes/total likes + total dislikes) × 100]. Global Quality Scale (GQS) (Table 1) and modified DISCERN tool (Table 2) were used for quality and reliability analyses.

Table 1 DISCERN reliability tool.
Table 2 Global quality scale (GQS).

GQS is a five-point scale with the lowest 1 point and the highest 5 points. It is a scale for rating of internet videos that measures the videos’ quality, ease of use and flow. A video score of 1 or 2 points is considered low quality, 3 points medium quality, and 4 or 5 points high quality [23].

Modified DISCERN tool is a five-point evaluation tool [24, 25]. We relied on it to assess the reliability of YouTube videos. There are 5 yes and no questions in this scale, and each yes answer counts as 1 point.

Characteristics and quality assessments of videos and quality scale correlations according to source and content of the video were reviewed. Inter-observer agreement between the DISCERN and GQS scores was also evaluated.

The procedures were conducted in accordance with the Helsinki Declarations of 2004.

Statistical analysis

The Statistical Package for Social Sciences 22.0 (SPSS 22.0) software was used for statistical analysis. To understand the distribution of data, Shapiro–Wilk test was used. Median, minimum, maximum, mean, standard deviation, and frequency were used as descriptive methods. Kruskal–Wallis test was used to determine differences between more than two groups of the independent variable. Dunn-Bonferroni post hoc method was used for paired comparison after a significant Kruskal–Wallis test. For correlations Spearman test was used. Mann–Whitney U was used to compare independent groups without normal distribution. For inter-rater agreement, Kappa coefficient was used. p < 0.05 was considered as statistically significant. Bonferroni adjustment was performed by multiplying Dunn’s p value by the number of comparisons.

Results

A total of 349 videos were excluded from the study. Repeated (n = 17), off-topic (n = 279), non-English videos (n = 37) and videos with no audio (n = 16) were excluded from the study. The remaining 51 videos were evaluated. Table 3 summarizes the videos’ features. Video contents were mostly about symptoms (37.2%). Most of the videos (37.2%) were uploaded by independent users. The median DISCERN and GQS scores were 1 and 2, respectively. 23.5% of the videos contained animation, 76.5% of the videos contained real images. According to the GQS, 62.7% of the videos were low quality, 23.5% were intermediate and 13.7% were high quality. DISCERN and GQS scores by the two urologists were highly consistent with each other (0.86 and 0.85, respectively). DISCERN and GQS scores were statistically significantly associated with video durations (r = 0.329, P = 0.018 and r = 0.349, P = 0.012; respectively). A statistically significant association was also observed between and DISCERN and GQS scores with VPI values (r = 0.466, P = 0.001 and r = 0.422, P = 0.002; respectively). No significant relationship between DISCERN and GQS scales and other video features was detected (Table 4). Although the DISCERN and GQS scores of animated videos were higher than those containing real images, there was no statistically significant difference between whether or not the videos contained animation.

Table 3 Characteristics and quality assessments of YouTube videos.
Table 4 The correlations between quality scales and video features.

Discussion

People are increasingly using the internet to learn about their illnesses and often choose YouTube as it contains visual material to get information about illnesses thanks its low cost and easy 24 h access. Since the knowledge on DE is limited, people use the internet, especially Youtube, to get information on DE and its treatment as well as a myriad of others.

Although YouTube videos are easy and free to access, there is no mechanism by which to check their quality and accuracy. Anyone who is a member of YouTube can upload videos. These uploaded videos may be of poor quality and full of misleading information. On the other hand, 75 percent of internet users worry about whether such internet information is reliable [26, 27]. Many studies in the literature have assessed the quality of YouTube videos presenting information on a wide range of diseases [10, 21, 28]. However, no study to date has evaluated the quality of YouTube videos related to DE. In the present study, we evaluated DE videos on Youtube and our study showed that DE videos currently on YouTube are of low quality.

The DISCERN and GQS tools have been applied in the studies evaluating video reliability and quality [29, 30]. In the present study, according to the GQS, 13.7% of the videos were of high quality. Similar to our study, Rittberg et al. [22] reported rates of high-quality videos as 19.6% and 18.4%, respectively. However, on the contrary, Singh et al. [24] and Tolu et al. [31] reported that close to 50% of the videos were of high quality. Differences in the numbers of videos evaluated, the researchers’ evaluation of YouTube videos about various conditions and diseases, and the subjectivity of assessing YouTube videos may be reasons for the diverging results in these investigations.

Whan evaluating video quality in terms of their sources, we noted that the higher quality videos had been uploaded by physicians, while the primary source of low-quality videos was usually independent users (Table 1). Other studies have found that high-quality videos are predominantly uploaded by physicians, academic sources, and universities [32, 33]. Our results show that it is important to consider video sources when using YouTube as a source of health information. Furthermore, it was shown that video content may be inadequate or inconsistent despite having been uploaded by healthcare professionals [34]. Physicians were also an important source of low-quality videos in our study (Table 1). Table 5 shows the quality ratings of YouTube videos according to upload sources.

Table 5 Quality assessments of YouTube videos on delayed ejaculation according to sources of upload.

In the present study, we found that video lengths correlated positively with both the DISCERN and GQS scores. The longer the video is, the more information it can convey and be understood. In line with our study, other researchers have found that high-quality videos last longer than low-quality videos [35]. On the other hand, despite the longer-lasting video content in these studies, there is evidence that users gradually lose interest over time [35]. Videos should therefore provide high-quality information within a reasonable period of time. VPI values correlated positively with GQS and DISCERN scores in our study. This is an indication that the higher a video’s quality is, the more “likes” it attracts. We identified no correlation between DISCERN, GQS and the number of likes, dislikes, comments or views. These parameters did not prove to be indicators of high-quality videos in our study.

Whether the video is animated or not seems immaterial in terms of quality. The study by Gokcen and Gumussuyu [36] showed that the presence of animation does not affect video quality. Independent users uploaded 37.2% of the videos in our study, which may be one reason why many videos were low quality. The inclusion and exclusion criteria in the present study resemble those of other investigations in the literature [37, 38]. Instead of evaluating all the information about DE, we aimed to analyse the information by creating an instant search model from the patients’ perspective. The effect of knowledge on the physician’s thinking is a controversial issue that requires a different perspective.

Study limitations

One of the major limitations of our study was that we interpreted videos in English only, which can create a bias, because the number of excluded non-English videos approximated that of English videos (37 and 51, respectively). However, if we had examined those non-English videos, we might have obtained similar rating—a factor that should not be ignored. A second limitation is that when entering a search keyword on YouTube, an instant data is obtained. The dynamic nature of YouTube caised by ongoing video uploads may make results vary.

Conclusions

Our study demonstrates that DE videos currently on YouTube are of low quality, and that many of those videos were uploaded by independent users. We consider our study important as it is the first to analyse the quality of YouTube videos regarding DE. We hope that our study will serve as a stimulus for the academic community interested in this subject/problem to ensure that more accurate and higher quality information be provided to society. For example, a channel on YouTube licensed by experts interested in this subject could be created: if YouTube videos fulfilled the evaluation criteria we applied in this investigation, the quality of health videos on the website would improve. We thus maintain that health-related videos on YouTube should meet certain quality criteria as a guideline for viewers. Such a solution would help solve the low-quality health video problem on social media.