• rbn@sopuli.xyz
    arrow-up
    7
    ·
    4 months ago

    Couldn’t you just measure against the general occurrence of a word in online texts? Whether a word is rare or not shouldn’t depend on people’s answers, should it?

    • Zagorath@aussie.zone
      English
      arrow-up
      5
      ·
      4 months ago

      Yeah that’s basically what Ngrams are.

      an online search engine that charts the frequencies of any set of search strings using a yearly count of n-grams found in printed sources published between 1500 and 2019

      Basically: how often this word or phrase was used in all the recorded text available to the search engine (including scanned old books), charted over time.
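To make the idea concrete, here is a rough sketch of measuring a word's relative frequency per year. It is not how any real n-gram service is implemented; the `CORPUS` data and the `yearly_frequency` helper are invented for illustration, and a real corpus would need proper tokenization.

```python
from collections import Counter

# Toy corpus of (year, text) pairs, made up purely for illustration.
CORPUS = [
    (1900, "the cat sat on the mat"),
    (1900, "the dog sat"),
    (2000, "the cat ran"),
]


def yearly_frequency(corpus, word):
    """Return {year: share of that year's tokens that equal `word`}."""
    hits, totals = Counter(), Counter()
    target = word.lower()
    for year, text in corpus:
        # Naive whitespace tokenization (an assumption, not a real tokenizer).
        tokens = text.lower().split()
        totals[year] += len(tokens)
        hits[year] += tokens.count(target)
    return {year: hits[year] / totals[year] for year in sorted(totals)}


if __name__ == "__main__":
    for year, share in yearly_frequency(CORPUS, "cat").items():
        print(year, round(share, 3))
```

A word whose share stays low across every year would count as rare by this measure, regardless of how survey respondents answer.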