RomCro v.2.0 - Parallel corpus of Romance languages and Croatian

Please use the following text to cite this item:
Mikelenić, Bojana; Bikić-Carić, Gorana and Bezlaj, Metka, 2023, RomCro v.2.0 - Parallel corpus of Romance languages and Croatian, HR-CLARIN, http://hdl.handle.net/20.500.14615/2-16
Date issued
2023-11-15
Size
19.4 million tokens
Description
The corpus contains originals and translations in all seven languages, and the order of the segments has been changed. The first version (RomCro v.1.0) was published in 2022. RomCro v.2.0 contains 33 original texts, 213 texts in total, 166,742 translation units and 19.4 million words, which is 3.7 million words more than the previous version.
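As a rough way to check the translation-unit figure against a downloaded copy, the sketch below counts `<tu>` elements in a TMX file. It assumes the file follows the usual TMX layout (`<tu>` units holding `<tuv>` variants), which this page does not confirm in detail; the function name `count_translation_units` is hypothetical.

```python
import xml.etree.ElementTree as ET


def count_translation_units(path):
    """Count <tu> elements in a TMX file, streaming instead of loading it whole.

    Assumption: translation units are stored as <tu> elements, as in
    standard TMX. A namespace prefix on the tag, if any, is ignored.
    """
    count = 0
    for _event, elem in ET.iterparse(path, events=("end",)):
        local_name = elem.tag.rsplit("}", 1)[-1]  # drop "{namespace}" if present
        if local_name == "tu":
            count += 1
            elem.clear()  # release the segment text once the unit is counted
    return count


# Example call (the path is wherever the file was saved locally):
# count_translation_units("RomCro2.0.tmx")
```

Streaming with `iterparse` keeps memory use modest for a file of this size; whether the result matches the figure quoted above depends on how the corpus maps translation units onto `<tu>` elements.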
This item is Publicly Available
and licensed under:
 Files in this item
Name
RomCro2.0.tmx
Size
172.61 MB
Format
application/octet-stream
Description
Unknown
MD5
2b461e8b443b9f8d8f351f166bbb4e35
Name
RomCro2.0.tsv
Size
128.43 MB
Format
application/octet-stream
Description
Unknown
MD5
9a4b23b0fe5bd2b2f8d5bec847f6a16f
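The MD5 values listed for each file can be used to confirm that a download is intact. The following is a minimal sketch, assuming the files were saved under the names shown above; the helper names `md5_of_file` and `verify` are hypothetical and not part of any repository tooling.

```python
import hashlib

# Checksums as listed on this page.
EXPECTED_MD5 = {
    "RomCro2.0.tmx": "2b461e8b443b9f8d8f351f166bbb4e35",
    "RomCro2.0.tsv": "9a4b23b0fe5bd2b2f8d5bec847f6a16f",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected_md5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected_md5.lower()


# Example call (assumes the file sits in the current directory):
# verify("RomCro2.0.tmx", EXPECTED_MD5["RomCro2.0.tmx"])
```

Reading in chunks avoids holding a file of more than 100 MB in memory at once. A mismatch most likely points to an incomplete or corrupted download.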