Transparent Hate Speech Detection in Norwegian Using Explainable AI

By:

Ehtesham Hashmi
Sule Yildirim Yayilgan
Moham
Rajendra Akerkar

Technology and society

Big data and Emerging Technologies

Article

Id:

July 2025

Publisher:

Springer Nature Link

Journal:

Book: Intelligent and Fuzzy Systems

Text classification is widely used in tasks such as sentiment analysis, spam detection, and hate speech detection, but performance often suffers when models rely on only linguistic features or semantic embeddings. Single-representation methods struggle to capture both syntactic structure and deep semantic meaning, limiting robustness and generalization across tasks. This study proposes a hybrid feature fusion framework that combines interpretable linguistic features with semantic embeddings from Doc2Vec and transformer-based models. The approach is evaluated on five benchmark datasets covering fake news detection, Bloom’s taxonomy classification, and hate speech detection. Experiments with multiple classifiers show that the fused features consistently outperform single-feature baselines. The best results, using a BERT-based fusion approach, achieve accuracies of 81% for fake news detection, 67% for Bloom’s taxonomy classification, and 72% for hate speech detection, with improved precision, recall, and F1-score. Overall, the results demonstrate that integrating linguistic and semantic features provides a robust, domain-agnostic solution for improved text classification.

Link:

Cinference paper: Transparent Hate Speech Detection in Norwegian Using Explaina…

Transparent Hate Speech Detection in Norwegian Using Explainable AI

Violence-Inducing Behaviour Prevention in Social-Cyber Space of Local Communities