- CLC_EVC – English-Vietnamese bilingual corpus.
- CLC_FVC – French-Vietnamese bilingual corpus.
- CLC_KVC – Korean-Vietnamese bilingual corpus.
- CLC_LVC – Lao-Vietnamese bilingual corpus.
- CLC_VCC – Vietnamese-Chinese bilingual corpus.
- CLC_VTB – Vietnamese treebank corpus.
- CLC_BTEC – Basic Travel Expression Corpus.
  A multilingual speech corpus of tourism-related sentences, similar to those usually found in phrasebooks for tourists going abroad.
- Specification documents
  - Vietnamese word segmentation
  - Vietnamese POS Tagset
  - Vietnamese NER Tagset