Hate Speech Prediction October 2023

Developed a robust hate speech detection algorithm aimed at classifying speech as normal or blocked to facilitate content filtration.

Technical stack Used in the Project -

Utilized the OxAISH-AL-LLM/wiki_toxic dataset from Hugging Face for model training, taking advantage of a pre-existing BERT model. Fine-tuned the last layer and added an additional output layer with two neurons for classification.
Delivered outstanding results with an exceptional accuracy of 91.95%, a testament to the rigorous training and evaluation processes involved.
Implemented model deployment through Flask and Docker, ensuring scalability and ease of integration for content filtration solutions.

The Github code is here