Abstract |
The main goal of this work is to build resources and a part-of-speech (POS) tagger for the Mizo language. Mizo, also known as Lushai, is the official language of the Indian state of Mizoram and belongs to the Kukish branch of the Sino-Tibetan language family. The paper describes the development of a Mizo-to-English dictionary and a part-of-speech tagger. The dictionary contains 26,407 entries, collected both automatically and manually. We began by studying the Mizo parts of speech and, from that analysis, derived a tag set of 24 POS tags. The dictionary and the POS tag set will be used to build an automatic POS tagger for the Mizo language. |