Please use the following text to cite this item:
Mikelenić, Bojana; Bikić-Carić, Gorana; Bezlaj, Metka; Oliver, Antoni and Tadić, Marko, 2025, RomCro v.2.0 - Parallel corpus of Romance languages and Croatian, HR-CLARIN, http://hdl.handle.net/20.500.14615/2-16
dc.contributor.author | Mikelenić, Bojana |
dc.contributor.author | Bikić-Carić, Gorana |
dc.contributor.author | Bezlaj, Metka |
dc.contributor.author | Oliver, Antoni |
dc.contributor.author | Tadić, Marko |
dc.date.accessioned | 2025-02-04T13:37:35Z |
dc.date.available | 2025-02-04T13:37:35Z |
dc.date.issued | 2025-01-15 |
dc.description | The corpus contains originals and translations in all seven languages, and the order of the segments has been changed. The first version (RomCro v.1.0) was published in 2022. RomCro v.2.0 contains 33 original texts, 213 texts in total, 166,738 translation units and 19.4 million words, an increase of 3.7 million compared to the previous version. In comparison to v.1.0, v.2.0 also contains texts in Catalan. |
dc.description.abstract | RomCro v.2.0 is a parallel, multilingual and multidirectional corpus of literary texts in six Romance languages (French, Portuguese, Romanian, Italian, Spanish and Catalan) and Croatian. |
dc.identifier.uri | http://hdl.handle.net/20.500.14615/2-16 |
dc.language.iso | es |
dc.publisher | Faculty of Humanities and Social Sciences |
dc.rights | The MIT Licence |
dc.rights.label | PUB |
dc.rights.uri | https://zzl-ffzg.mit-license.org/ |
dc.subject | Parallel corpus |
dc.subject | Catalan language |
dc.subject | Croatian language |
dc.subject | Literary texts |
dc.subject | HUMANITIES and RELIGION::Languages and linguistics::Romance languages::French language |
dc.subject | Italian language |
dc.subject | Portuguese language |
dc.subject | HUMANITIES and RELIGION::Languages and linguistics::Romance languages::Romanian language |
dc.subject | HUMANITIES and RELIGION::Languages and linguistics::Romance languages::Spanish language |
dc.subject | hrvatski jezik |
dc.subject | usporedni korpus |
dc.subject | književni tekstovi |
dc.subject | francuski jezik |
dc.subject | katalonski jezik |
dc.subject | talijanski jezik |
dc.subject | rumunjski jezik |
dc.subject | španjolski jezik |
dc.subject | portugalski jezik |
dc.title | RomCro v.2.0 - Parallel corpus of Romance languages and Croatian |
dc.type | corpus |
local.contact.person | Bojana Mikelenić; bmikelen@ffzg.unizg.hr; Faculty of Humanities and Social Sciences, University of Zagreb |
local.files.count | 2 |
local.files.size | 315628164 |
local.has.files | yes |
local.size.info | 19.4 million tokens |
local.sponsor | nationalFunds; MOBODL-2023-08-9511; Croatian Science Foundation; NextGenerationEU |
metashare.ResourceInfo#ContentInfo.mediaType | text |