eHTrust: Model for Trust Evaluation in Content-Driven Health Websites

Sarika Agarwal, Himani Bansal
DOI: https://doi.org/10.1007/s11042-024-19621-x
IF: 2.577
2024-06-20
Multimedia Tools and Applications
Abstract: Health information is spread across many websites, which makes it necessary to assess its credibility. Users typically reach this content through a search engine and rarely have time to check how credible a website is. The risk is greater when the search concerns health, because acting on wrong information can endanger lives. The eHTrust (Electronic Health Trust) model is designed and implemented to automate credibility assessment for content-driven health websites. The model first uses the XGBoost algorithm, which achieves 93.8% accuracy, to filter out phishing URLs. For the remaining legitimate websites, the values of credibility-determining features are extracted from their content: response time, expert reviews, last-updated date, responsiveness, and SSL certification. A data set is prepared from these feature values. The Multi-Criteria Decision Analysis (MCDA) method PAPRIKA (Potentially All Pairwise Rankings of All Possible Alternatives) assigns weights to the scale values of each feature, representing the relative importance of the attributes. The websites are then clustered into three groups using the KMeans algorithm. The mean credibility score of each cluster is calculated and labeled with a trust value. These trust values are compared with the cluster-wise mean credibility scores from WOT (Web of Trust); the correlation is 0.8976, which supports the model's validity.
computer science, information systems, theory & methods, engineering, electrical & electronic, software engineering
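The scoring and validation steps described in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the weight values stand in for PAPRIKA output, the six sites and their WOT scores are invented, and a small hand-written one-dimensional k-means replaces whatever KMeans implementation the authors used. The phishing filter is omitted.

```python
import math

# Hypothetical PAPRIKA-style weights (illustrative values, not from the paper).
WEIGHTS = {"response_time": 0.15, "expert_reviews": 0.35, "updated_date": 0.20,
           "responsiveness": 0.10, "ssl_certified": 0.20}

def credibility_score(features):
    """Weighted sum of feature values, each assumed to be scaled to [0, 1]."""
    return sum(WEIGHTS[name] * features[name] for name in WEIGHTS)

def kmeans_1d(values, k=3, iterations=50):
    """Plain Lloyd's k-means on scalars; returns one cluster label per value."""
    ordered = sorted(values)
    # Deterministic start: spread the initial centroids across the sorted values.
    centroids = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]
    labels = [0] * len(values)
    for _ in range(iterations):
        labels = [min(range(k), key=lambda c: abs(v - centroids[c])) for v in values]
        updated = []
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            updated.append(sum(members) / len(members) if members else centroids[c])
        if updated == centroids:
            break
        centroids = updated
    return labels

def cluster_means(values, labels):
    """Mean of `values` per non-empty cluster, keyed by cluster label."""
    return {c: sum(v for v, lab in zip(values, labels) if lab == c) / labels.count(c)
            for c in sorted(set(labels))}

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

# Invented feature rows in WEIGHTS key order, with invented WOT scores.
rows = [(0.9, 1.0, 0.9, 1.0, 1.0), (0.8, 0.9, 1.0, 1.0, 1.0),
        (0.6, 0.5, 0.5, 1.0, 1.0), (0.5, 0.5, 0.6, 0.0, 1.0),
        (0.3, 0.0, 0.2, 0.0, 0.0), (0.2, 0.1, 0.1, 1.0, 0.0)]
wot_scores = [92, 88, 61, 55, 20, 30]

sites = [dict(zip(WEIGHTS, row)) for row in rows]
scores = [credibility_score(site) for site in sites]
labels = kmeans_1d(scores, k=3)

model_means = cluster_means(scores, labels)
wot_means = cluster_means(wot_scores, labels)

# Label clusters by rank of their mean credibility score.
ranked = sorted(model_means, key=model_means.get)
trust = dict(zip(ranked, ["low", "medium", "high"]))

# Correlate the cluster-wise means of the model with those of WOT.
r = pearson([model_means[c] for c in ranked], [wot_means[c] for c in ranked])
print([trust[lab] for lab in labels], round(r, 4))
```

With only three clusters the correlation is computed over three points, so it is a coarse agreement check rather than a strong statistical test. That is worth keeping in mind when reading a figure such as the reported 0.8976.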