Increasing Retrieval Efficiency in Noisy Corpora.
Date of Submission
December 2018
Date of Award
Winter 12-12-2019
Institute Name (Publisher)
Indian Statistical Institute
Document Type
Master's Dissertation
Degree Name
Master of Technology
Subject Name
Computer Science
Department
Computer Vision and Pattern Recognition Unit (CVPR-Kolkata)
Supervisor
Mitra, Mandar (CVPR-Kolkata; ISI)
Abstract (Summary of the Work)
In this thesis we tried to catch the word variations in the noisy corpus. Initially we tried to solve the problem using string similarity and context similarity in the Generalized Language Model. But then this model was unable to improve the retrieval performance as seen experimentally. On delving into the depth of the problem as to why the model was not performing well we came up with a simple and effective approach to solve the problem. This is a simple Query Expansion based method which is used to increase the retrieval performance.
Control Number
ISI-DISS-2018-389
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.
DOI
http://dspace.isical.ac.in:8080/jspui/handle/10263/6955
Recommended Citation
Roy, Riya, "Increasing Retrieval Efficiency in Noisy Corpora." (2019). Master’s Dissertations. 258.
https://digitalcommons.isical.ac.in/masters-dissertations/258
Comments
ProQuest Collection ID: http://gateway.proquest.com/openurl?url_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation&res_dat=xri:pqm&rft_dat=xri:pqdiss:28843282