Data collection

Permanent URI for this collection

Browse

Recent Submissions

Showing 1 - 1 out of 1 results
  • Item
    HR-GPT Beta Data Collection
    (2024-11) Štefanec, Vanja; Thakkar, Gaurish; Tadić, Marko; Farkaš, Daša; Filko, Matea
    This dataset contains deduplicated text used for pretraining HR-GPT Beta Large Language Models.