High Performance Software in Multidimensional Reduction Methods for Image Processing with Application to Ancient Manuscript

Arsene, Corneliu TC and Church, Stephen and Dickinson, Mark (2018) High Performance Software in Multidimensional Reduction Methods for Image Processing with Application to Ancient Manuscript. Manuscript Cultures, 11 (11). pp. 73-96. ISSN 1867-9617

[img] Text
mc11_arsene_church_dickinson.pdf

Download (12MB)
Official URL: http://www.manuscript-cultures.uni-hamburg.de

Abstract

Multispectral imaging is an important technique for improving the readability of written or printed text where the letters have faded, either due to deliberate erasing or the ravages of time. Often the text can be read by illumination under a single wavelength of light, but in some cases the multispectral images need enhancement to improve the text clarity. There are many possible enhancement techniques: this paper compares an extended set of dimensionality reduction methods for image processing. We assess 15 dimensionality reduction methods applied to two different manuscripts. This assessment was performed subjectively, by asking the opinions of scholars who were experts in the languages used in the manuscripts, and also by using the Davies-Bouldin and Dunn indexes for evaluating the quality of the resultant image clusters. We found that the Canonical Variates Analysis (CVA) method, implemented in Matlab was superior to all the other tested methods. However, the other approaches may be more suitable in specific circumstances, so we would still recommend that a variety are tried. For example, CVA is a supervised clustering technique and therefore it requires considerably more user time and effort than a non-supervised technique such as the Principle Component Analysis approach (PCA). If the results from PCA are adequate to allow a text to be read then the added effort required for CVA may not be justified. For the purposes of comparing the computational times and the image results, a CVA method is also implemented in the C programming language and using the GNU (GNU’s Not Unix) Scientific Library (GSL) and the OpenCV (OPEN source Computer Vision) computer vision programming library. Therefore high performance software was developed using the GNU GSL library, which drastically reduced the computational complexity and time for the CVA-GNU GSL method. For the CVA-Matlab technique, vectorization was used in order to reduce the respective computational times (i.e. matrix and vector operations instead of loop-based).

Item Type: Article
Subjects: MSC 2010, the AMS's Mathematics Subject Classification > 01 History and biography
MSC 2010, the AMS's Mathematics Subject Classification > 68 Computer science
Divisions: Manchester Institute for the Mathematical Sciences
Depositing User: Dr Corneliu TC Arsene
Date Deposited: 22 Dec 2018 18:23
Last Modified: 22 Dec 2018 18:23
URI: http://eprints.maths.manchester.ac.uk/id/eprint/2679

Actions (login required)

View Item View Item