Putting News Articles in Context.
Date of Submission
December 2020
Date of Award
Winter 12-12-2021
Institute Name (Publisher)
Indian Statistical Institute
Document Type
Master's Dissertation
Degree Name
Master of Technology
Subject Name
Computer Science
Department
Computer Vision and Pattern Recognition Unit (CVPR-Kolkata)
Supervisor
Mitra, Mandar (CVPR-Kolkata; ISI)
Abstract (Summary of the Work)
The work of this dissertation has been done along the lines of TREC News Track Background Linking task. The task is, given a news article suggest other news articles that provide context and background to the current article. As we know, context and background are highly subjective terms. Here they are measured by comparing the system retrieved documents with a set of documents already marked relevant according to a panel of experts. The entire task is done on the Washington Post data set, a collection of 591537 news articles that appeared in Washington Post from 2012 to 2017. In this dissertation we explore six methods used to solve this task. These techniques are based on standard Information Retrieval methods and Natural Language Processing techniques. We compare them with each other and pit them against the best performing methods. We use JAVA as the main programming language for data parsing, indexing and searching. Python is also used for data exploration in some limited cases.
Control Number
ISI-DISS-2020-21
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.
DOI
http://dspace.isical.ac.in:8080/jspui/handle/10263/7175
Recommended Citation
Gautam, Rahul, "Putting News Articles in Context." (2021). Master’s Dissertations. 4.
https://digitalcommons.isical.ac.in/masters-dissertations/4
Comments
ProQuest Collection ID: http://gateway.proquest.com/openurl?url_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation&res_dat=xri:pqm&rft_dat=xri:pqdiss:28842686