• chuckleslord@lemmy.world
      link
      fedilink
      arrow-up
      38
      ·
      1 day ago

      Bruh, I’m calling shenanigans on this tool. Yes, post is probably ai, but there is no tool that can accurately tell ai text from human text. “The most accurate ai detector” is like me calling a stick “the most accurate god detector”. It’s a meaningless distinction when the detection isn’t possible to do accurately.

      • plutopos@lemmy.zip
        link
        fedilink
        arrow-up
        16
        arrow-down
        1
        ·
        23 hours ago

        But it’s useful when you need to start a witch hunt on a post you dislike!

    • Sturgist@lemmy.ca
      link
      fedilink
      arrow-up
      63
      arrow-down
      1
      ·
      1 day ago

      It’s that em-dash at the end that really solidified it for me. Was kinda on the fence most of the way through.

      • moopet@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        2
        ·
        5 hours ago

        My other half writes a lot of formal documentation as part of her work. She uses bullet lists and em-dashes a lot, and her work comes up as >90% AI. Even the stuff she wrote a decade ago. A while back, she took one of her documents and gave it to ChatGPT and said, “make this pass an AI test” and it went down to about 20% but it was no longer the house style.

      • Rhaedas@fedia.io
        link
        fedilink
        arrow-up
        39
        ·
        1 day ago

        My wife is a professional writer and uses em dash a lot, usually as --, including in her casual messages, as it’s common for her to use.

        It’s the formatting style of the whole thing that sounds AI to me. “Honestly” phrases really jumps out at me now, as well as the “But…” fragments. Not that they’re bad, hell, I type out things that way too. But for it to be all together, it sounds AI after you’ve seen it a lot.

        The em dash is fine here, emphasizing the final point. Although I would have probably used a comma myself for a post and not a formal manuscript.

        Funny thing is, you can get AI to reduce a lot of these tells with a decent system prompt and staging of the writing process. So I’m surprised we’re still seeing it a lot and it hasn’t been weaned out of the latest versions.

        • Pommes_für_dein_Balg@feddit.org
          link
          fedilink
          arrow-up
          16
          ·
          edit-2
          1 day ago

          It’s really hard to get rid of things caused by systematic bias in the training data.

          After inhaling the entire internet, LLMs started being trained on publically available books.
          And due to copyright, those were older ones from a time when em-dashes were used more.
          The training results were tested by humans, which needed to be cheap, but also English language natives.
          So they used workers in English-speaking African countries. Where the English taught in school is also more traditional with a focus on older literature, so the answers coming from the old literature were rated higher by the testers.

          • stormdelay@sh.itjust.works
            link
            fedilink
            arrow-up
            15
            arrow-down
            1
            ·
            1 day ago

            “Due to copyright” did they not all illegally download every book they could, copyrighted or not, to train their LLMs?

        • Sturgist@lemmy.ca
          link
          fedilink
          arrow-up
          4
          ·
          1 day ago

          Fair enough on the em-dash, hadn’t actually considered that LLMs use it extensively because it’s actually used in the wild.

        • Cris_Citrus@piefed.zip
          link
          fedilink
          English
          arrow-up
          4
          ·
          1 day ago

          The use of “–” is interesting, I use dashes to convey pacing constantly because I type as I speak, and so punctuation to me is largely about trying to write the delivery I want the reader to percieve, and I always just use “-” knowing it’s incorrect, but I don’t exactly wanna make myself seem even more like ai by switching lol

          I may try using “–”, thanks for sharing that!

          • Rhaedas@fedia.io
            link
            fedilink
            arrow-up
            5
            ·
            1 day ago

            I actually used to do the same thing with the -, and by em dash becoming a thing I dived into the usage and history of it all, including ; and en dash. And now I’m using - less. But I don’t use em dash more, just tend to throw a comma in.

            Another weird one I learned. em dash spacing. The spacing AI tends to use is not preferred by publishers now, but is more AP style, perhaps picking it up from when it was more popular to have space between the letters. Europe tends to prefer spacing but with a en dash (and I kind of like how that looks too, but it doesn’t fly if you publish in the US).

            • Cris_Citrus@piefed.zip
              link
              fedilink
              English
              arrow-up
              3
              ·
              1 day ago

              Wait, so what spacing is generally preferred for the em dash? Thats interesting, I never formally learned how to use one so I’m curious (I’ve not been to college, I have no idea if its part of typical higher education curriculum if you take any english courses)

              I abuse the fuck out of commas so I reach for dashes or ; when I want a longer pause that isn’t a logical end point for a thought. But a semicolon feels somehow a bit more formal to me, so I use it less for general online chatting

      • lostbit@feddit.nl
        link
        fedilink
        English
        arrow-up
        1
        ·
        23 hours ago

        where you really on the fence? i just cant imagine that. every single line is classic AI.

    • Rain World: Slugcat Game@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      arrow-down
      9
      ·
      1 day ago

      that batman image,, uhh,,, recite it from memory,,, batman says "this is the tool of the enemy, we shall not use it" or something,,, idk,,,
      (...why?? does lemmy collapse the commas into one??? eww
      and uses the unicode ellipsis??? ugh
      tip: be generous with backslashes! if it is not what you typed, then do not sit there and take it! defend yourself with the slashes!! <slash>! <slash>!!)

      • fushuan@piefed.blahaj.zone
        link
        fedilink
        English
        arrow-up
        8
        arrow-down
        1
        ·
        1 day ago

        Lemmy uses markdown, you can choose to learn it instead of getting frustrated about it lol. ¯\_(ツ)_/¯

        Also, why are you putting question marks in the middle of the sentence? Reads horrible.

        • Rain World: Slugcat Game@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          arrow-down
          19
          ·
          1 day ago

          xml is better than markdown, you will not change my mind, but i would like to see you try
          when they were writing the code to convert markdown to xml, because no, sane or insane, web browser supports markdown, did, at no point, did they think "gosh, there's got to be a better way"???

          • Dave.@aussie.zone
            link
            fedilink
            arrow-up
            10
            ·
            1 day ago

            xml is better than markdown, you will not change my mind

            Markdown is for humans who are writing their prose on the fly and don’t want to be bothered with the cognitive load associated with the extra verbiage and rulesets of XML.

            XML is for computers which are great with verbiage and rulesets.

            It’s perfectly fine to have a converter between the two to bridge the gap.

            • Rain World: Slugcat Game@lemmy.world
              link
              fedilink
              English
              arrow-up
              1
              arrow-down
              11
              ·
              1 day ago

              forcing a markup language for humans is so therianphobic!!!
              and, it is <em>so easy</em> to write xml. markdown is way harder. escaping markdown, you have to remember to put a backslash before every character that markdown deems special, with xml, you just use &lt;, &gt;, and &amp;.
              it is both silly and saddening how many harmful stereotypes you’ve used in such a short span
              also, xml is an interchange format, markdown is not!!! using markdown in federated software means that it's going to break!!! you could have the editor use markdown and the federation use xml...