Text segmentation using named entity recognition and co-reference
resolution in Greek texts

dc.contributor.author	Φράγκου, Παυλίνα	el
dc.contributor.author	Fragkou, Pavlina	en
dc.date.accessioned	2015-04-25T09:08:52Z
dc.date.available	2015-04-25T09:08:52Z
dc.date.issued	2015-04-25
dc.identifier.uri	http://hdl.handle.net/11400/8909
dc.rights	Default License
dc.source	http://history.icininfo.net/2011/	el
dc.source	http://history.icininfo.net/2011/FileStore/procs_INFO_2011.pdf	el
dc.subject	Text segmentation
dc.subject	Κατάτμηση κειμένου
dc.subject	Named entity recognition
dc.subject	Αναγνώριση οντότητας
dc.subject	Co-reference resolution
dc.subject	Συν-αναφορά ψήφισμα
dc.subject	Information extraction
dc.subject	Εξαγωγή πληροφορίας
dc.title	Text segmentation using named entity recognition and co-reference resolution in Greek texts	en
heal.type	conferenceItem
heal.classification	Information sciences
heal.classification	Library science
heal.classification	Πληροφόρηση, Επιστήμη της Πληροφόρησης
heal.classification	Βιβλιοθηκονομία
heal.classificationURI	http://skos.um.es/unescothes/C01988
heal.classificationURI	http://skos.um.es/unescothes/C02286
heal.classificationURI	N/A-Πληροφόρηση, Επιστήμη της Πληροφόρησης
heal.classificationURI	N/A-Βιβλιοθηκονομία
heal.contributorName	Γιαννακόπουλος, Γεώργιος Α. (συντ.)	el
heal.contributorName	Σακκάς, Δαμιανός Π. (συντ.)	el
heal.language	en
heal.access	free
heal.recordProvider	Technological Educational Institute of Athens. Faculty of Management and Economics. Department of Library Science and Information Systems	en
heal.publicationDate	2011-09
heal.bibliographicCitation	Fragkou, P. (2011) Text segmentation using named entity recognition and co-reference resolution in Greek texts. In "International Conference on Integrated Information (IC-ININFO 2011)" Kos Island, Greece, 29 September - 3 October 2011. pp. 34-41. Available from: http://history.icininfo.net/2011/FileStore/procs_INFO_2011.pdf [Accessed: 24/04/2015]	en
heal.abstract	In this paper we examine the benefit of performing named entity recognition and co-reference resolution on a Greek corpus used for text segmentation. Each segment is a portion of one of 300 documents published by ten different authors in the Greek newspaper "To Vima". The aim is to examine whether combining text segmentation with information extraction (specifically, the named entity recognition and co-reference resolution steps) benefits the identification of the various topics that appear in a document. Named entity recognition was performed using an existing tool trained on a similar corpus. The produced annotations were manually corrected and enriched to cover four types of named entities (person name, organization, location and time). Co-reference resolution, specifically the substitution of every reference to the same instance with the same named entity identifier, was performed in a subsequent step. Evaluation using three well-known text segmentation algorithms leads to the conclusion that the benefit depends strongly on the segment's topic, the number of named entity instances appearing in it, and the segment's length.	en
heal.fullTextAvailability	true
heal.conferenceName	International Conference on Integrated Information (IC-ININFO 2011)	el
heal.conferenceItemType	full paper

Files in this item

Name: 8.pdf

Size: 3.097Mb

Format: PDF
