Data collection

Permanent URI for this collection

Browse

Recent Submissions

Showing 1 - 2 out of 2 results
  • Item
    CroSentiLex
    (TakeLab: Text Analysis and Knowledge Engineering Lab, University of Zagreb, Faculty of Electrical Engineering and Computing, 2012) Glavaš, Goran; Šnajder, Jan; Dalbelo Bašić, Bojana
  • Item
    HR-GPT Beta Data Collection
    (2024-11) Štefanec, Vanja; Thakkar, Gaurish; Tadić, Marko; Farkaš, Daša; Filko, Matea
    This dataset contains deduplicated text used for pretraining HR-GPT Beta Large Language Models.