dc.contributor.author |
Φράγκου, Παυλίνα |
el |
dc.contributor.author |
Fragkou, Pavlina |
en |
dc.date.accessioned |
2015-04-25T09:08:52Z |
|
dc.date.available |
2015-04-25T09:08:52Z |
|
dc.date.issued |
2015-04-25 |
|
dc.identifier.uri |
http://hdl.handle.net/11400/8909 |
|
dc.rights |
Default License |
|
dc.source |
http://history.icininfo.net/2011/ |
el |
dc.source |
http://history.icininfo.net/2011/FileStore/procs_INFO_2011.pdf |
el |
dc.subject |
Text segmentation |
|
dc.subject |
Κατάτμηση κειμένου |
|
dc.subject |
Named entity recognition |
|
dc.subject |
Αναγνώριση οντότητας |
|
dc.subject |
Co-reference resolution |
|
dc.subject |
Συν-αναφορά ψήφισμα |
|
dc.subject |
Information extraction |
|
dc.subject |
Εξαγωγή πληροφορίας |
|
dc.title |
Text segmentation using named entity recognition and co-reference
resolution in greek texts |
en |
heal.type |
conferenceItem |
|
heal.classification |
Information sciences |
|
heal.classification |
Library science |
|
heal.classification |
Πληροφόρηση, Επιστήμη της Πληροφόρησης |
|
heal.classification |
Βιβλιοθηκονομία |
|
heal.classificationURI |
http://skos.um.es/unescothes/C01988 |
|
heal.classificationURI |
http://skos.um.es/unescothes/C02286 |
|
heal.classificationURI |
**N/A**-Πληροφόρηση, Επιστήμη της Πληροφόρησης |
|
heal.classificationURI |
**N/A**-Βιβλιοθηκονομία |
|
heal.contributorName |
Γιαννακόπουλος, Γεώργιος Α. (συντ.) |
el |
heal.contributorName |
Σακκάς, Δαμιανός Π. (συντ.) |
el |
heal.language |
en |
|
heal.access |
free |
|
heal.recordProvider |
Τεχνολογικό Εκπαιδευτικό Ίδρυμα Αθήνας. Σχολή Διοίκησης και Οικονομίας. Τμήμα Βιβλιοθηκονομίας και Συστημάτων Πληροφόρησης |
el |
heal.publicationDate |
2011-09 |
|
heal.bibliographicCitation |
Fragkou, P. (2011) Text segmentation using named entity recognition and co-reference resoluion in greek texts. In "International Conference on Integrated Information (IC-ININFO 2011)" Kos Island, Greece, 29 September - 3 Octomber 2011. pp. 34-41. Available from: http://history.icininfo.net/2011/FileStore/procs_INFO_2011.pdf [Accessed: 24/04/20105] |
en |
heal.abstract |
In this paper we examine the benefit of
performing named entity recognition and co-reference
resolution to a Greek corpus used for text segmentation.
Segments consist of portions among one of the 300
documents published by ten different authors in the
Greek newspaper "To Vima". The aim here is to
examine whether the combination of text segmentation
and information extraction (and most specifically the
named entity recognition and co-reference resolution
steps) can prove to be beneficial for the identification of
the various topics that appear in a document. Named
entity recognition was performed using an already
existing tool which was trained on a similar corpus. The
produced annotations were manually corrected and
enriched in order to cover four types of named entities
(i.e. person name, organization, location and time). Coreference
resolution and most specifically substitution
of every reference of the same instance with the same
named entity identifier was performed in a subsequent
step. The evaluation using three well known text
segmentation algorithms leads to the conclusion that,
the benefit highly depends on the segment's topic, the
number of named entity instances appearing in it, as
well as the segment's length. |
en |
heal.fullTextAvailability |
true |
|
heal.conferenceName |
International Conference on Integrated Information (IC-ININFO 2011) |
el |
heal.conferenceItemType |
full paper |
|