Corpas na Gaeilge Scríofa (CGS) is a corpus of written Irish containing 131 million words. CGS will be a good fit for anyone who wants to examine edited written texts. In general, such tects will be more standardised than more informal written data such as social media posts, which are abundantly available in the National Corpus for Irish.
It is expected that this corpus will be more useful to the initial learner or to those writing or translating in accordance with the Official Standard and others who wish to check how a particular word or phrase is used.
This corpus supersedes the Corpus of Contemporary Irish which was based on the same principles and had a high number of loyal users. It is worth mentioning that CGS includes translated works, something that was not the case with the Corpus of Contemporary Irish.
CGS is an unbalanced corpus which will be expanded annually to keep it up to date.