Extraction of multiword expressions from Hindi text document (Record no. 10140)

MARC details
000 -LEADER
fixed length control field 03130nam a22001937a 4500
003 - CONTROL NUMBER IDENTIFIER
control field BML
082 ## - DEWEY DECIMAL CLASSIFICATION NUMBER
Classification number 006.3
Item number MIS
100 ## - MAIN ENTRY--PERSONAL NAME
Personal name Mishra, Atul
245 ## - TITLE STATEMENT
Title Extraction of multiword expressions from Hindi text document
260 ## - PUBLICATION, DISTRIBUTION, ETC. (IMPRINT)
Place of publication, distribution, etc Gurgaon
Name of publisher, distributor, etc BML Munjal University
Date of publication, distribution, etc 2022
300 ## - PHYSICAL DESCRIPTION
Extent 109p.
502 ## - DISSERTATION NOTE
Dissertation note Thesis submitted in fulfillment of the requirements for the degree of Doctor of Philosophy by Atul Mishra, under the supervision of Dr. Soharab Hossain Shaikh and Prof. (Dr.) Ratna Sanyal
Degree type Doctor of Philosophy
Year degree granted 2022
520 ## - SUMMARY, ETC.
Summary, etc. Multiword expressions (MWEs) are a significant challenge in many fields of language technology, and extracting them from arbitrary text has attracted growing interest in the NLP community. This research topic is strongly connected to statistical analysis and artificial intelligence. This thesis presents a detailed literature review and several strategies for building an automated MWE extraction system. The overall contribution of the thesis is divided into six parts. The study proposes a method for extracting Hindi MWEs and examines the significance of boundary threshold calculation. The main objective of the dissertation is to develop a generalized mechanism for extracting Hindi MWEs using their syntactic and statistical idiosyncrasy (i.e., the structure of linguistic patterns and word association) and the contextual connection between their constituent words. Various strategies for combining different classifiers based on these properties can be applied to build an extraction mechanism; hence, identifying the best-performing combination strategy is also an objective of the dissertation. Designing a method around these properties poses several hurdles. In statistical filtering, calculating the boundary threshold is a challenging task. Combining multiple filters is another issue: several combination strategies are possible, so recognizing the best one is itself a challenge. The Hybrid method uses semantic similarity. The study also developed a web application, built with the Flask framework, that automatically extracts Hindi MWEs using the Association-based and Hybrid methods. The methods, evaluation results, and findings of each contribution are presented in separate chapters.
The proposed technique is evaluated using the HDTB Treebank and the TDIL dataset, which is freely available. The experimental results support the validity and viability of the method and provide a blueprint for how it can work alongside existing procedures. A comparative study of the performance of previous work and the proposed methods is also given. The thesis closes with the conclusions of the whole dissertation.
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Engineering and Technology
650 ## - SUBJECT ADDED ENTRY--TOPICAL TERM
Topical term or geographic name as entry element Computer Science Artificial Intelligence
856 ## - ELECTRONIC LOCATION AND ACCESS
Uniform Resource Identifier https://shodhganga.inflibnet.ac.in/handle/10603/411302
856 ## - ELECTRONIC LOCATION AND ACCESS
Uniform Resource Identifier http://drc.bml.edu.in:8080/jspui/handle/123456789/2835
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Source of classification or shelving scheme Dewey Decimal Classification
Koha item type Thesis
Holdings
Withdrawn status
Lost status
Source of classification or shelving scheme Dewey Decimal Classification
Damaged status
Not for loan Not For Loan
Collection code Reference
Home library BMU Library
Current library BMU Library
Shelving location Display-1
Date acquired 12/01/2023
Source of acquisition BML Munjal University
Full call number 006.3 MIS
Barcode TH06
Date last seen 26/11/2023
Price effective from 12/01/2023
Koha item type Thesis
Public note School of Engineering & Technology
