Identification and Extraction of Replicated Punjabi Multi Word Expressions
| Vol-3 | Issue-12 | December 2018 | Published Online: 10 December 2018 | ||
| Author(s) | ||
| Kapil Dev Goyal 1 | ||
|
1 Assistant Professor, SBAS Khalsa College, Sandaur, Sangrur (India) |
||
| Abstract | ||
Multiword Expressions (MWEs) play an important role in Natural Language Processing. A Multiword Expression is a combination of two or more words that is treated as a single word. MWEs in Punjabi are quite varied, and many are of types not encountered in English. In this paper, we examine the different types of MWEs encountered in Punjabi. Many of these have not received adequate attention from investigators. For example, 'vaalaa' constructs, doublets (word pairs), replication, and a variety of verb group forms have not been explored as MWEs. We examine these MWEs from a machine translation viewpoint. Many of them are used frequently in day-to-day conversation and informal communication but are encountered less often in a formal textual corpus. Most conventional statistical methods for MWE identification rely on a corpus with limited linguistic cues and are found to be inadequate for detecting all the types of MWEs that occur in real use. We present a methodology for the identification and extraction of Punjabi MWEs using linguistic knowledge. The interpretation and representation of some of these MWEs from a machine translation perspective are also explored. |
||
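To make the notion of replication concrete, here is a minimal sketch that flags immediately repeated tokens as candidate replicated MWEs. It is not the paper's method: the function name `find_replicated_pairs`, the whitespace-style token list, and the romanized sample tokens are all assumptions made for illustration, and a real system would add the linguistic knowledge the abstract refers to.

```python
def find_replicated_pairs(tokens):
    """Return (index, token) for every token that is immediately repeated.

    Hypothetical helper for illustration only; exact string equality is
    used as a stand-in for a linguistically informed match.
    """
    pairs = []
    i = 0
    while i < len(tokens) - 1:
        if tokens[i] == tokens[i + 1]:
            pairs.append((i, tokens[i]))
            i += 2  # skip the second half of the replicated pair
        else:
            i += 1
    return pairs


# Romanized tokens chosen only to exercise the function (assumed data).
sample = ["hauli", "hauli", "turna", "ghar", "ghar"]
print(find_replicated_pairs(sample))
```

Exact matching only catches full replication of the same surface form; partial or inflected repetitions would need morphological rules on top of this.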
| Keywords | ||
| Multiword Expressions (MWEs), Natural Language Processing | ||
||

