CroSentiLex

Please use the following text to cite this item:
Glavaš, Goran; Šnajder, Jan and Dalbelo Bašić, Bojana, 2012, CroSentiLex, HR-CLARIN, http://hdl.handle.net/20.500.14615/2-24
Date issued
2012
Size
37000 tokens
Language(s)
Croatian
Description
CroSentiLex is a sentiment lexicon for Croatian. It consists of two files, crosentilex-positives.txt and crosentilex-negatives.txt, each containing 37K Croatian lemmas ranked by positivity and negativity, respectively, together with their PageRank scores. The rankings were created automatically with the PageRank algorithm, based on small positive and negative seed sets and co-occurrence frequencies. In addition to the automatically extracted lexicon, human (gold-standard) sentiment annotations for 1200 Croatian lemmas are provided in gs-sentiment-annotations.txt.
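To make the ranking idea concrete, the following is a minimal sketch of a seed-biased (personalized) PageRank over a lemma co-occurrence graph. It illustrates the general technique only and should not be read as the procedure actually used to build CroSentiLex: the function name seeded_pagerank, the damping factor, the iteration count, and the toy co-occurrence counts are all assumptions made for this example.

```python
from collections import defaultdict


def seeded_pagerank(cooc, seeds, damping=0.85, iterations=50):
    """Rank lemmas by random-walk proximity to a seed set.

    cooc:  dict mapping (lemma_a, lemma_b) -> positive co-occurrence count,
           treated as an undirected weighted edge (hypothetical input format)
    seeds: lemmas that receive all of the teleport probability
    """
    # Build a weighted, undirected adjacency map from the pair counts.
    neighbors = defaultdict(dict)
    for (a, b), count in cooc.items():
        neighbors[a][b] = neighbors[a].get(b, 0) + count
        neighbors[b][a] = neighbors[b].get(a, 0) + count

    nodes = sorted(neighbors)
    seed_set = {s for s in seeds if s in neighbors}
    if not seed_set:
        raise ValueError("no seed lemma occurs in the co-occurrence graph")

    # Teleport distribution: uniform over the seeds, zero elsewhere.
    teleport = {n: (1.0 / len(seed_set) if n in seed_set else 0.0) for n in nodes}
    out_weight = {n: sum(neighbors[n].values()) for n in nodes}

    scores = dict(teleport)
    for _ in range(iterations):
        nxt = {n: (1.0 - damping) * teleport[n] for n in nodes}
        for n in nodes:
            for m, weight in neighbors[n].items():
                # Each lemma passes its score on in proportion to edge weight.
                nxt[m] += damping * scores[n] * weight / out_weight[n]
        scores = nxt

    # Highest score first, i.e. closest to the seeds.
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Toy counts invented for illustration; they are not taken from the lexicon.
toy_cooc = {
    ("dobar", "lijep"): 5,
    ("dobar", "sretan"): 3,
    ("lijep", "sretan"): 2,
    ("loš", "tužan"): 4,
    ("sretan", "tužan"): 1,
}
positive_ranking = seeded_pagerank(toy_cooc, seeds=["dobar"])
```

Running the same function with a negative seed set would give the corresponding negativity ranking. Lemmas that co-occur strongly with the seeds, directly or through intermediate lemmas, end up with higher scores.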
Collections
This item is Publicly Available
and licensed under:
 Files in this item
Name
CROSENTILEX.zip
Size
476.5 KB
Format
application/zip
Description
zip
MD5
c2e660fb2bed423e32d58dbcf5c116e7
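The MD5 value listed above can be used to check that a downloaded copy of the archive is intact. The sketch below assumes the archive has been saved locally; the helper names md5_of_file and verify_archive and the local path in the usage comment are hypothetical.

```python
import hashlib

# Checksum as listed for CROSENTILEX.zip in this record.
EXPECTED_MD5 = "c2e660fb2bed423e32d58dbcf5c116e7"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_archive(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest matches the expected value."""
    return md5_of_file(path) == expected.lower()


# Usage (the path is an assumption about where the download was saved):
# verify_archive("CROSENTILEX.zip")
```

A mismatch usually points to an incomplete or corrupted download. MD5 is adequate for this kind of integrity check but is not suitable as a security guarantee.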