We developed a machine-learning-based method to detect video game players that harass teammates or opponents in chat earlier in the conversation. This real-time technology would allow gaming companies to intervene during games, such as issue warnings or muting or banning a player. In a proof-of-concept experiment on League of Legends data we compute and visualize evaluation metrics for a machine learning classifier as conversations unfold, and observe that the optimal precision and recall of detecting toxic players at each moment in the conversation depends on the confidence threshold of the classifier: the threshold should start low, and increase as the conversation unfolds. How fast this sliding threshold should increase depends on the training set size.
Stoop, W., Kunneman, F., van den Bosch, A., & Miller, B. (2019). Detecting harassment in real-time as conversations develop. In Proceedings of the Third Workshop on Abusive Language Online (pp. 19-24). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/W19-3503