Some of the following articles may not be open access.
ShieldGemma: Generative AI Content Moderation Based on Gemma
arXiv.org
We present ShieldGemma, a comprehensive suite of LLM-based safety content moderation models built upon Gemma2. These models provide robust, state-of-the-art predictions of safety risks across key harm types (sexually explicit, dangerous content ...
Wenjun Zeng +11 more
semanticscholar +1 more source
Policy-as-Prompt: Rethinking Content Moderation in the Age of Large Language Models
Conference on Fairness, Accountability and Transparency
Content moderation plays a critical role in shaping safe and inclusive online environments, balancing platform standards, user expectations, and regulatory frameworks.
Konstantina Palla +7 more
semanticscholar +1 more source
AEGIS: Online Adaptive AI Content Safety Moderation with Ensemble of LLM Experts
arXiv.org
As Large Language Models (LLMs) and generative AI become more widespread, the content safety risks associated with their use also increase. We find a notable deficiency in high-quality content safety datasets and benchmarks that comprehensively cover a ...
Shaona Ghosh +3 more
semanticscholar +1 more source
Evaluating the Effectiveness of Deplatforming as a Moderation Strategy on Twitter
Proc. ACM Hum. Comput. Interact., 2021
Deplatforming refers to the permanent ban of controversial public figures with large followings on social media sites. In recent years, platforms like Facebook, Twitter and YouTube have deplatformed many influencers to curb the spread of offensive speech.
Shagun Jhaver +3 more
semanticscholar +1 more source
Philosophical Studies, 2015
Typical presentism asserts that whatever exists is present. Moderate presentism more modestly claims that all events are present and thus acknowledges past and future times understood in a substantivalist sense, and past objects understood, following Williamson, as “ex-concrete.” It is argued that moderate presentism retains the most valuable features ...
openaire +2 more sources
Jailbreaking Large Language Models Against Moderation Guardrails via Cipher Characters
Neural Information Processing Systems
Large Language Models (LLMs) are typically harmless but remain vulnerable to carefully crafted prompts known as "jailbreaks", which can bypass protective measures and induce harmful behavior.
Haibo Jin +3 more
semanticscholar +1 more source
Recent Advances in Online Hate Speech Moderation: Multimodality and the Role of Large Models
Conference on Empirical Methods in Natural Language Processing
In the evolving landscape of online communication, moderating hate speech (HS) presents an intricate challenge, compounded by the multimodal nature of digital content.
Ming Shan Hee +6 more
semanticscholar +1 more source
Content moderation by LLM: from accuracy to legitimacy
Artificial Intelligence Review
One trending application of LLMs (large language models) is content moderation on online platforms. Most current studies on this application have focused on the metric of accuracy: the extent to which LLMs make correct decisions about content.
Tao Huang
semanticscholar +1 more source
Proceedings of the XX International Conference on Human Computer Interaction, 2019
We demonstrate an interactive prototype where a moded interaction technique can be evaluated more comprehensively by exposing it to significantly longer and more diverse sets of mode-switching sequences. The tablet-based app is designed to abstract a particular type of user-interface feature in such a way that captures the commonly-used moded ...
Katherine Fennedy, Hyowon Lee
openaire +1 more source
MODERATE MORALISM VERSUS MODERATE AUTONOMISM
The British Journal of Aesthetics, 1998
In response to the article by J. Anderson and J. Dean (in The British Journal of Aesthetics, 38, 2, 1998) refuting the thesis of moderate moralism that the author defended in a recent article (ibid., 36, 3, 1996), the author distinguishes this view from Gaut's ethicism, on the one hand, and sets out the author's own position, according to which a moral defect or a moral virtue in a work ...
openaire +1 more source

