Actually, it now has a flag to moderate on the conversation endpoint as well. I found a fix for the CGPT-demod script you're talking about: just set the flag to false, lmao.
But realistically they could force moderation on their end if they really wanted to; the only issue is that API use could run into situations where a legitimate request ends up getting stomped by moderation.
That's why it's honestly just better for them to make moderation optional; there should be a toggle for it in the CGPT interface, just as Google has SafeSearch on/off.
Because of the way it works, they fundamentally cannot prevent it from producing explicit, violent, or adversarial output when someone is focused on getting it to do so, not without removing the very magic that makes it so good for everything else. So they should stop trying already, like damn.
I disagree. In my opinion those tools are the bare minimum for effective moderation, and while I love that Discord gives developers an API that allows them to implement those systems, I think it's something that should be handled by Discord themselves.
The website also calls the moderation API from the client side, passing it the response returned from its conversation API. So if you simply block the request to the moderation endpoint in dev tools, do they still have any additional built-in monitoring?
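For contrast, here's a rough sketch of what a purely server-side check against OpenAI's /v1/moderations endpoint might look like, which a user can't block in dev tools because it runs before the reply ever reaches the browser (the helper name and error handling are just illustrative assumptions):

    # Illustrative sketch: a server-side moderation check that the client
    # can't intercept, since it runs before the response is sent back.
    import os
    import requests

    def is_flagged(text: str) -> bool:
        resp = requests.post(
            "https://api.openai.com/v1/moderations",
            headers={"Authorization": f"Bearer {os.environ['OPENAI_API_KEY']}"},
            json={"input": text},
            timeout=10,
        )
        resp.raise_for_status()
        # One result per input, with a boolean "flagged" field plus
        # per-category scores.
        return resp.json()["results"][0]["flagged"]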
I think there's a bright line between "content moderation," where speech and behavior are being moderated, and spam and/or bug abuse, which has nothing to do with the content of speech whatsoever. Rate-limiting inputs is not the same as picking and choosing who is allowed to say what based on arbitrary standards of speech content.
OK, but if they allowed users to do any moderation at all, they would be putting themselves in legal jeopardy. And let's be real, 99% of moderation is done by users.
Working on several different consumer tools that use Claude and ChatGPT, I frequently see moderation blocks, especially related to anything that could be perceived as violent or sexual in nature. I wonder if that poses a problem when working on law enforcement or similar projects.
Here’s a powerful use: content moderation. Today we literally traumatize content moderators with the dregs of the human mind. ChatGPT is fairly good at classifying content along many dimensions, including the ones it actively screens for. Regardless of how you personally feel about content moderation, I would be happy to see humans not have to be actively involved in it and live with the traumas it inflicts, wherever the moderation happens. I’m sure it’ll get things wrong, but humans do too.
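As a toy illustration of that idea, you could ask a chat model to label content against a fixed set of categories; the model name, category list, and prompt below are assumptions made for the example, not a production setup:

    # Illustrative sketch: using a chat model as a content classifier.
    import os
    import requests

    CATEGORIES = ["harassment", "violence", "sexual", "self-harm", "none"]

    def classify(text: str) -> str:
        resp = requests.post(
            "https://api.openai.com/v1/chat/completions",
            headers={"Authorization": f"Bearer {os.environ['OPENAI_API_KEY']}"},
            json={
                "model": "gpt-4o-mini",  # placeholder model choice
                "messages": [
                    {"role": "system",
                     "content": "Classify the user's text into exactly one of: "
                                + ", ".join(CATEGORIES) + ". Reply with the label only."},
                    {"role": "user", "content": text},
                ],
                "temperature": 0,
            },
            timeout=30,
        )
        resp.raise_for_status()
        return resp.json()["choices"][0]["message"]["content"].strip()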
One problem with moderating content in this way is that it makes it clearer and clearer that they no longer need the protections provided by section 230 of the Communications Decency Act.
It's not injurious to them to moderate content, clearly, since they are doing it.
By pulling this crap, and especially by doing it algorithmically, they are pushing the internet in a difficult direction.
Keyword moderation is terrible and only covers the language(s) you know about. It doesn't actually prevent the content (the goal of these types of filters) from being served. It'd be like a virus scanner preventing a program from running because it has 'virus' in its name ... which would prevent the scanner itself from running, probably.
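A toy example of the problem (the keyword list is made up for illustration):

    # Naive keyword filter: it only catches strings you thought to list,
    # in languages you know about, and hits false positives along the way.
    BLOCKED_KEYWORDS = {"virus", "attack"}  # made-up list

    def keyword_filter(text: str) -> bool:
        lowered = text.lower()
        return any(word in lowered for word in BLOCKED_KEYWORDS)

    print(keyword_filter("how to write a virus"))  # True: caught
    print(keyword_filter("antivirus research"))    # True: false positive
    print(keyword_filter("ウイルスの書き方"))        # False: same request in Japanese, missed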
> Moderation tools fall under the “reach” layer: you take all of that speech, but provide a way to limit the reach of stuff you don’t care to see yourself.
Sometimes, people say that BlueSky is “all about free speech” or “doesn’t do moderation.” This is simply inaccurate. Moderation tooling is encoded into the protocol itself, so that it can work with all content on the network, even non-BlueSky applications. Moreover, it gives you the ability to choose your own moderators, so that you aren’t beholden to anyone else’s choice of moderation or lack thereof.
Ah yes, the famous "if you don't want to see this content, just close your eyes" approach to moderation. I know this philosophy is well liked in Silicon Valley, but I think it's fundamentally flawed: there are legitimate situations in which you want to prevent other, unrelated people from talking about a certain thing or acquiring certain content.
Classic examples are cyberbullying, doxxing, and revenge porn: two or more people discussing how to hurt a third person, publishing private, unflattering, or false information about them, and so on. Removing this information from the victim's feed is completely useless (in fact, it likely won't appear in their feed in the first place), because the harm comes from other people viewing the content or engaging in the discussion. Nevertheless, the harm is real.
In a system with traditional moderation, a moderator could stop this kind of behaviour by deleting the posts for everyone and/or banning the perpetrators. None of this is possible in a "just hide the content" system.
Shared blocklists or "labels" won't work either, as the consumers of the content have no motivation to block it; indeed, they want to see the revenge porn. The one who wants to block it is the victim, but they have no power to force everyone else to use a particular blocklist. (The whole idea behind this system is that no one can force a blocklist on someone else.)
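To make the structural point concrete, here is a toy sketch (not the actual AT Protocol API; the data shapes are invented for illustration) of how opt-in label filtering behaves:

    # Hypothetical opt-in "reach layer" filtering: each reader hides only
    # what their *own* chosen labelers have flagged. A labeler the reader
    # never subscribes to has no effect on what they see.
    def visible_posts(feed, subscribed_labelers):
        hidden = set()
        for labeler in subscribed_labelers:
            hidden |= labeler["flagged_uris"]
        return [post for post in feed if post["uri"] not in hidden]

    victim_labeler = {"flagged_uris": {"at://bad-post-1"}}
    feed = [{"uri": "at://bad-post-1"}, {"uri": "at://fine-post-2"}]

    print(visible_posts(feed, []))                # non-subscribers still see everything
    print(visible_posts(feed, [victim_labeler]))  # hidden only for those who opted in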
We really can’t let OpenAI get away with calling “content moderation” “safety”. Making sure it isn’t offensive isn’t a safety measure.
Everyone agrees that safety from AI acting autonomously and maliciously is good. But that’s not really a threat right now. I don’t think we need to make it “safe” by making it inoffensive. It’s a tool. It should do what I want it to.