Identification of Reader Specific Difficult Words by Analyzing Eye Gaze and Document Content
Document Type
Conference Article
Publication Title
Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
Abstract
This paper presents an approach for identifying reader specific difficult words while someone is reading a textual document. The work is motivated by the need of developing human-document interaction systems, in general and creating person-specific online educational content, in particular. Eye gaze information gives person specific behavior whereas textual content is analyzed to get general linguistic aspect of the document content. These two pieces of information are fused together through machine learning algorithms to identify the set of difficult words for a particular reader reading a particular document. An annotated dataset has been created where each word in a document is marked with its bounding box information and each reader identifies a set of difficult words while reading the document. The dataset consists of sixteen documents and each document is read by five subjects. The method is evaluated through recall-precision analysis. The impressive precision at high recall attests the feasibility of building a practical application based on this research. The experiment further brings out several interesting facts about human reading behaviour.
First Page
1346
Last Page
1351
DOI
10.1109/ICDAR.2017.221
Publication Date
7-2-2017
Recommended Citation
Garain, Utpal; Pandit, Onkar; Augereau, Olivier; Okoso, Ayano; and Kise, Koichi, "Identification of Reader Specific Difficult Words by Analyzing Eye Gaze and Document Content" (2017). Conference Articles. 230.
https://digitalcommons.isical.ac.in/conf-articles/230