dc.contributor.author | Κεσίδης, Αναστάσιος Λ. | el |
dc.contributor.author | Γαλιώτου, Ελένη | el |
dc.contributor.author | Γάτος, Βασίλειος | el |
dc.contributor.author | Πρατικάκης, Ιωάννης Ε. | el |
dc.date.accessioned | 2015-05-24T20:06:11Z | |
dc.date.available | 2015-05-24T20:06:11Z | |
dc.date.issued | 2015-05-24 | |
dc.identifier.uri | http://hdl.handle.net/11400/11084 | |
dc.rights | Αναφορά Δημιουργού-Μη Εμπορική Χρήση-Όχι Παράγωγα Έργα 3.0 Ηνωμένες Πολιτείες | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/us/ | * |
dc.source | http://link.springer.com/article/10.1007%2Fs10032-010-0134-4 | el |
dc.subject | Natural language processing (Computer science) | |
dc.subject | Computational morphology | |
dc.subject | Υπολογιστική μορφολογία | |
dc.subject | Historical document indexing | |
dc.subject | Ιστορική ευρετηρίαση έγγραφου | |
dc.subject | Επεξεργασία φυσικής γλώσσας | |
dc.subject | Word spotting | |
dc.subject | Εντοπισμός λέξεων | |
dc.title | A word spotting framework for historical machine-printed documents | en |
heal.type | journalArticle | |
heal.classification | Computer science | |
heal.classification | Information systems | |
heal.classification | Πληροφορική | |
heal.classification | Πληροφοριακά συστήματα | |
heal.classificationURI | http://skos.um.es/unescothes/C00750 | |
heal.classificationURI | http://skos.um.es/unescothes/C01993 | |
heal.classificationURI | **N/A**-Πληροφορική | |
heal.classificationURI | **N/A**-Πληροφοριακά συστήματα | |
heal.keywordURI | http://id.loc.gov/authorities/subjects/sh88002425 | |
heal.identifier.secondary | ISSN: 14332833 | |
heal.identifier.secondary | DOI: 10.1007/s10032-010-0134-4 | |
heal.language | en | |
heal.access | campus | |
heal.recordProvider | Τεχνολογικό Εκπαιδευτικό Ίδρυμα Αθήνας. Σχολή Τεχνολογικών Εφαρμογών. Τμήμα Μηχανικών Πληροφορικής Τ.Ε. | el |
heal.publicationDate | 2011-06 | |
heal.bibliographicCitation | Kesidis, A. L., Galiotou, E., Gatos, B. and Pratikakis, I. E. (2011). A word spotting framework for historical machine-printed documents. "International Journal on Document Analysis and Recognition", 14(2). June 2011. pp 131-144. Available from: http://link.springer.com/article/10.1007%2Fs10032-010-0134-4. | en |
heal.abstract | In this paper, we propose a word spotting framework for accessing the content of historical machine-printed documents without the use of an optical character recognition engine. A preprocessing step is performed in order to improve the quality of the document images, while word segmentation is accomplished with the use of two complementary segmentation methodologies. In the proposed methodology, synthetic word images are created from keywords, and these images are compared to all the words in the digitized documents. A user feedback process is used in order to refine the search procedure. The methodology has been evaluated in early Modern Greek documents printed during the seventeenth and eighteenth century. In order to improve the efficiency of accessing and search, natural language processing techniques have been addressed that comprise a morphological generator that enables searching in documents using only a base word-form for locating all the corresponding inflected word-forms and a synonym dictionary that further facilitates access to the semantic context of documents. | en |
heal.publisher | Springer-Verlag | en |
heal.journalName | International Journal on Document Analysis and Recognition | en |
heal.journalType | peer-reviewed | |
heal.fullTextAvailability | true |
Οι παρακάτω άδειες σχετίζονται με αυτό το τεκμήριο: