Briggin the Gap: oorNews Scrievin Formal Scots for tae Train AI

Briggin the Gap: oorNews.co.uk Champions Formal Scots in AI Language Models

Briggin the Gap: oorNews Scrievin Formal Scots for tae Lairn AI

In a recent study cryed “The Sociolinguistic Foundations of Language Modeling,” researchers Jack Grieve an his colleagues delve intae the crucial role o linguistic diversity in artificial intelligence. Published in Frontiers in Artificial Intelligence, the airticle unnerscores hoo language models reflect the varieties o leid they are trained on. The authors argue that the societal value an performance o thae models hing on the quality an diversity o their trainin data.

Ane o the pressin issues pynted oot in the study is the representation o the Scots leid ithin AI systems. Scots, aften wrangly viewed as a dialect o English, requires carefu haundlin tae prevent AI ootpits fae soondin informal or oot o place. Sic misrepresentation can be baith discriminatory an offensive tae Scots speikers, reinforcin hairmfu stereotypes o law status an law education.

In response tae thae challenges, oorNews is makin significant strides by providin a muckle corpus o formal register Scots. This initiative aims tae ensure that AI ootpits in Scots are no ainly accurate but respectfu o the leid’s status an yaise in formal prose. By curatin content that reflects formal Scots, oorNews helps prevent AI fae lockin intae damagin stereotypes associated wi informal yaise o the leid.

The integration o formal Scots in AI trainin data is essential for tae create language models that honour an accurately represent Scotland’s linguistic diversity. This approach aligns wi the recommendations fae Grieve et al., advocatin for trainin corpora that genuinely reflect the specific varieties o leid bein modeled. Aw Scots speikers are walcome tae contribute tae oorNews by suggestin ony chynges they waant tae see in an airticle via the comments section aneath.