• hendrik@palaver.p3x.de
    link
    fedilink
    English
    arrow-up
    10
    ·
    1 day ago

    Nice. Though a bit of context would be helpful. What is a z-score? Where is this from, and where can I read more?

      • Hamartiogonic@sopuli.xyz
        link
        fedilink
        English
        arrow-up
        6
        ·
        1 day ago

        In many cases, a z-score between -2 and +2 is common. Anything outside that range is rare. Further away from zero you go, the rarer it gets.

        In real life though, not all things follow the normal distribution, so z-scores aren’t always so easy to interpret. If the distribution is far from normal, z-scores are only marginally useful.

        • taiyang@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          1 day ago

          To add, about 2.5% of cases are below -2 and another 2.5% above +2 (specifically 1.96), so long as the original metric is normally distributed (bell shaped). About 68% of cases actually fall between -1 and 1, as most things tend to be close to average. That’s true of pretty much any true normal distribution, btw.

          It’s a neat way to basically say, this isn’t a normal amount of phrase usage. It’s not a statistical test, though those concepts use a similar approach to probabilities.

          • Hamartiogonic@sopuli.xyz
            link
            fedilink
            English
            arrow-up
            3
            ·
            20 hours ago

            When dealing with real world data, it’s always a good idea to check how normal it is. If that assumption is grossly violated, you have to work a bit harder and use more robust methods.

            When it’s reasonably close to normal, you can take all sorts of nice shortcuts like z-scores and t-tests. I haven’t looked into this case yet, but I assume they knew what they were doing.

            • taiyang@lemmy.world
              link
              fedilink
              English
              arrow-up
              3
              ·
              19 hours ago

              Yup, although technically you can still run z-tests and t-tests for statistical validity because samples are always normally distributed around a population mean (as long as it’s a large enough sample), even for skewed or uniform distributions. Some tests work better than others, though. The graphs z might be a sample, which might make sense is population data is available for these phrases, but otherwise I’m a little confused why that’d use it, lol.

              To clarify for the commenter, though, Z-scores themselves can be applied to individuals; it’s just a simple conversation setting mean to 0 and standard deviation to 1. In that case, it’s strictly a relative case, that 2.5% isn’t a probability so much as the percentile – % cases that fall below that score. It’s a nifty metric to compare things that aren’t on the same scale, for instance.

  • BroBot9000@lemmy.world
    link
    fedilink
    English
    arrow-up
    5
    arrow-down
    4
    ·
    edit-2
    1 day ago

    A totalitarian regime using Ai to push their agenda. Big surprise. How about y’all get out and vote better before you can’t even access websites anymore. Oh wait…

    • KaChilde@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      8
      ·
      1 day ago

      If my representative was using a robot to decide how to promote their ideas, I’d be pissed.

      These people were elected to represent the people based (presumably) on their own words and opinions. If they can’t muster the energy to support their constituents themselves, without relying on a robot to spew up generic bile, then they should step down and let someone who cares do it.

      • Melvin_Ferd@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        arrow-down
        5
        ·
        1 day ago

        They’re using a glorified spell checker. I don’t elect them so they can sit around writing speeches and day