@underisk

underisk@lemmy.ml · edit-2 2 years ago

Yeah, exactly. Those aren’t words, they aren’t random, and they’re in a comma separated list. Try asking it to produce something like this:

Green five the scoured very fasting to lightness air bog.

Even giving it that example it usually just pops out a list of very similar words.

underisk@lemmy.ml · 2 years ago

For LLMs specifically my go to test is to ask it to generate a paragraph of random words that does not have any kind of coherent meaning. It specifically asks them to do the opposite of what they’re trained to do so it trips them up pretty reliably. Closest I’ve seen them get was a list of comma separated random words and that was after giving them coaching prompts with examples.

underisk@lemmy.ml · 2 years ago

Invites work in the short term but once the bots get a foothold it quickly falls apart. Back when Gmail was invite only it took only a few months for websites to pop up that automated invite distribution.

underisk@lemmy.ml · edit-2 2 years ago

There will never be any kind of permanent solution to this. Botting is an arms race and as long as you are a large enough target someone is going to figure out the 11ft ladder for your 10ft wall.

That said, generally when coming up with a captcha challenge you need to figure out a way to subvert the common approach just enough that people can’t just pull some off the shelf solution. For example instead of just typing out the letters in an image, ask the potential bot to give the results of a math problem stored in the image. This means the attacker needs more than just a drop in OCR to break it, and OCR is mostly trained on words so its likely going to struggle at math notation. It’s not that difficult to work around but it does require them to write a custom approach for your captcha which can deter most casual attempts for some time.

underisk@lemmy.ml · 2 years ago

For video, I think the best you’re likely to get is embedded players from popular video hosts. The costs and challenges of hosting video content are just not worth it for the people hosting the instances.

GIF as a format is garbage. Terrible compression, poor quality, and weird quirks. They’re so bad that most platforms that host GIFs are just transparently converting them to MP4 videos because they actually take up fewer resources. If they get added the only way I see it happening is as extremely short form video with strict file size limits.

If I’m being honest though I don’t miss ithem at all.

underisk@lemmy.ml · 2 years ago

Stress the fact that federation isnt a new or confusing concept. They already engage with federated services without realizing it. Stuff like email, dns, Usenet, etc are all “federated” they just haven’t been described that way because they existed before that term was used to describe it.

underisk@lemmy.ml · 2 years ago

Telling jokes in a text medium isn’t new and sarcasm is frequently used without hackish writers rushing to reassure everyone that they were only kidding. If you can’t do a sarcasm without an ‘/s’ then just don’t do one.

underisk@lemmy.ml · 2 years ago

Part of the humor in sarcasm is feigned sincerity.

It’s like explaining the joke immediately after telling it. If you have to tell everyone its sarcasm, then you’ve done a bad job at deploying sarcasm.

underisk@lemmy.ml · edit-2 2 years ago

Kill Six Billion Demons

Second place goes to Unsounded

underisk@lemmy.ml · 2 years ago

I would’ve just blocked Louisiana instead of taking responsibility and liability for a bunch of sensitive personal information.