Results 341 to 350 of about 488,753 (383)
Some of the next articles are maybe not open access.

ShieldGemma: Generative AI Content Moderation Based on Gemma

arXiv.org
We present ShieldGemma, a comprehensive suite of LLM-based safety content moderation models built upon Gemma2. These models provide robust, state-of-the-art predictions of safety risks across key harm types (sexually explicit, dangerous content ...
Wenjun Zeng   +11 more
semanticscholar   +1 more source

Policy-as-Prompt: Rethinking Content Moderation in the Age of Large Language Models

Conference on Fairness, Accountability and Transparency
Content moderation plays a critical role in shaping safe and inclusive online environments, balancing platform standards, user expectations, and regulatory frameworks.
Konstantina Palla   +7 more
semanticscholar   +1 more source

AEGIS: Online Adaptive AI Content Safety Moderation with Ensemble of LLM Experts

arXiv.org
As Large Language Models (LLMs) and generative AI become more widespread, the content safety risks associated with their use also increase. We find a notable deficiency in high-quality content safety datasets and benchmarks that comprehensively cover a ...
Shaona Ghosh   +3 more
semanticscholar   +1 more source

Evaluating the Effectiveness of Deplatforming as a Moderation Strategy on Twitter

Proc. ACM Hum. Comput. Interact., 2021
Deplatforming refers to the permanent ban of controversial public figures with large followings on social media sites. In recent years, platforms like Facebook, Twitter and YouTube have deplatformed many influencers to curb the spread of offensive speech.
Shagun Jhaver   +3 more
semanticscholar   +1 more source

Moderate presentism

Philosophical Studies, 2015
Typical presentism asserts that whatever exists is present. Moderate presentism more modestly claims that all events are present and thus acknowledges past and future times understood in a substantivalist sense, and past objects understood, following Williamson, as “ex-concrete.” It is argued that moderate presentism retains the most valuable features ...
openaire   +2 more sources

Jailbreaking Large Language Models Against Moderation Guardrails via Cipher Characters

Neural Information Processing Systems
Large Language Models (LLMs) are typically harmless but remain vulnerable to carefully crafted prompts known as ``jailbreaks'', which can bypass protective measures and induce harmful behavior.
Haibo Jin   +3 more
semanticscholar   +1 more source

Recent Advances in Online Hate Speech Moderation: Multimodality and the Role of Large Models

Conference on Empirical Methods in Natural Language Processing
In the evolving landscape of online communication, moderating hate speech (HS) presents an intricate challenge, compounded by the multimodal nature of digital content.
Ming Shan Hee   +6 more
semanticscholar   +1 more source

Content moderation by LLM: from accuracy to legitimacy

Artificial Intelligence Review
One trending application of LLM (large language model) is to use it for content moderation in online platforms. Most current studies on this application have focused on the metric of accuracy—the extent to which LLMs make correct decisions about content.
Tao Huang
semanticscholar   +1 more source

ModeRate

Proceedings of the XX International Conference on Human Computer Interaction, 2019
We demonstrate an interactive prototype where a moded interaction technique can be evaluated more comprehensively by exposing it to significantly longer and more diverse sets of mode-switching sequences. The tablet-based app is designed to abstract a particular type of user-interface feature in such a way that captures the commonly-used moded ...
Katherine Fennedy, Hyowon Lee
openaire   +1 more source

MODERATE MORALISM VERSUS MODERATE AUTONOMISM

The British Journal of Aesthetics, 1998
En reponse a l'article de J. Anderson et J. Dean (in «The British journal of aesthetics», 38, 2, 1998) refutant la these du moralisme modere defendue par l'A. dans un article recent (ibid., 36, 3, 1996), l'A. se distingue de l'ethicisme de Gaut, d'une part, et expose sa propre position selon laquelle un defaut moral ou une vertu morale dans une oeuvre ...
openaire   +1 more source

Home - About - Disclaimer - Privacy