
Edgar Cervantes / Android Authority
TL;DR
- An Android Authority teardown has revealed that Reddit will use an AI mannequin for detecting harassment.
- The mannequin is educated on content material that was beforehand flagged for violating Reddit’s phrases.
We’ve seen giant language fashions (LLMs) used for a wide range of options within the final yr or so, from textual content/picture technology to digital assistants and past. Now, it appears to be like like we will add yet another use case to the checklist because of Reddit.
An APK teardown helps predict options that will arrive on a service sooner or later based mostly on work-in-progress code. Nevertheless, it’s potential that such predicted options could not make it to a public launch.
A teardown of model 2024.10.0 of the Reddit app for Android has revealed that Reddit is now utilizing an LLM to detect harassment on the platform. You may view the related strings under.
Code
<string title="hcf_answer_how_model_trained">The harassment mannequin is an giant language mannequin (LLM) that's educated on content material that our enforcement groups have discovered to be violating. Moderator actions are additionally an enter in how the mannequin is educated.</string>
<string title="hcf_faq_how_model_trained">How is the harassment mannequin educated?</string>
Reddit additionally up to date its assist web page per week in the past to say using an AI mannequin as a part of its harassment filter.
“The filter is powered by a Giant Language Mannequin (LLM) that’s educated on moderator actions and content material eliminated by Reddit’s inner instruments and enforcement groups,” reads an excerpt from the web page.
Both manner, it appears to be like like moderators have one other instrument of their arsenal to combat objectionable content material on Reddit. Will this really do an excellent job of flagging content material, although? We’ll simply have to attend and see.
