Information can include text, pictures and signatures that can be scanned into a document format, such as the Portable Document Format (PDF), and easily emailed to recipients around the world. Upon the document’s arrival, the receiver can open and view it using a vast array of different PDF viewing applications such as Adobe Reader and Apple Preview. Hence, today the use of the PDF has become pervasive. Since the scanned PDF is an image format, it is inaccessible to assistive technologies such as a screen reader. Therefore, the retrieval of the information needs Optical Character Recognition (OCR). The OCR software scans the scanned PDF file and through text extraction generates an editable text formatted document. This text document can then be edited, formatted, searched and indexed as well as translated or converted to speech. A problem that the OCR software does not solve is the accurate regeneration of the full text layout. This paper presents a technology that addresses this issue by closely preserving the original textual layout of the scanned PDF using the open source document analysis and OCR system (OCRopus) based on geometric layout and positioning information. The main issues considered in this research are the preservation of the correct reading order, and the representation of common logical structured elements such as section headings, line breaks, paragraphs, captions, and sidebars, foot-bars, running headers, embedded images, graphics, tables and mathematical expressions.
This dateset contain (1) List of articles resulted from the original search (2) List of articles selected for full text review (3) List of selected articles with key data extracted
The ability to accurately perform human gait evaluation is critical for orthopedic foot and ankle surgeons in tracking the recovery process of their patients. The assessment of gait in an objective and accurate manner can lead to improvement in diagnoses, treatments, and recovery. Currently, visual inspection is the most common clinical method for evaluating the gait, but this method can be subjective and inaccurate. The aim of this study is to evaluate the foot drop condition in an accurate and clinically applicable manner. The gait data were collected from 56 patients suffering from foot drop with L5 origin gathered via a system based on inertial measurement unit sensors at different stages of surgical treatment. Various machine learning (ML) algorithms were applied to categorize the data into specific groups associated with the recovery stages. The results revealed that the random forest algorithm performed best out of the selected ML algorithms, with an overall 84.89% classification accuracy and 0.3785 mean absolute error for regression.
This paper describes the problems involved with learning and understanding math for vision impaired students and developing a computer system approach for rendering mathematical formulae into audio form. Access to mathematics is an obstacle for blind students. The lack of easy access to mathematical resources is a barrier to higher education for many blind students and puts them at an unfair disadvantage in school, academia, and industry [1]. Results from the National Assessment of Educational Progress show that there is great disparity between the math skills of students with disabilities and students without disabilities [2]. A methodology for rendering technical documents, in particular, complex mathematical formula, in an audio descriptive form (Mathspeak) is presented in this paper.
Wireless headsets are a great asset to Vision Impaired Persons (VIP's) as they prove to be much easier to use and reliable than wired equivalents. Radio based wireless headsets are the most common and have many favorable characteristics, however for environments where there may be numerous users with wireless headsets, radio channels easily become congested compromising audio quality and reliable operation. The research undertaken in this project attempts to sidestep the radio channel congestion problem and also produce a wireless headset tailored to the requirements of VIP's.
Image stabilization is very important in vision based indoor/outdoor navigation systems. Blurring is one main cause of poor image quality, which can be caused by a movement of the camera at the time of taking the image, a movement of objects in front, atmospheric turbulence or out-of-focus. Out of these factors, camera movement is dominant in navigation systems as the camera is continuously moving. This paper presents the preliminary results of deblurring performed using point spread function (PSF) computed using synchronized inertial sensor data. It uses data of the accelerometer and gyroscope to derive a motion vector calculated from the motion of the smartphone during the image capturing period. This motion vector is applied to the captured image so that the effect of motion is reversed during the debrurring process. This work is a part of an indoor navigation project that aims to assist people with vision impairment. Image processing form a significant part of the proposed system and as such clearly defined edges are essential for path and obstruction identification. Different deblurring methods are compared for their performance in reversing the effect of camera movement. Results indicated that deblurring can be successfully performed using the motion vector and that the resulting images can be used as a readily approach to object and path identification in vision based navigation systems, especially for blind and vision impaired indoor/outdoor navigation. The paper also proposes a novel deblurring algorithm that uses PSF computed for different portions of the image to deblur that portion of the image.
Vision-impaired students face tremendous obstacles in their quest to access learning materials delivered in web-based and other electronic formats. The predominance of visual prompts, use of flash and animation and the inability of screen reading applications to interpret images all contribute to make much of the current e-learning materials associated with computing studies inaccessible by blind or vision-impaired students. This paper describes a university research project undertaken to improve the accessibility of Cisco e-learning materials for vision-impaired computing students.allThe network architecture which supports the delivery of the Cisco courses to both local and remote vision-impaired students is also presented.
There is an increased need for scalable high performance computing systems as the amount of generated data grows. Traditional High Performance Computing (HPC) clusters built to handle big data processing have inherent weaknesses that can be overcome by migrating to a more flexible cloud computing environment. This article discusses high performance computing and the paradigm shift from traditional onsite computing clusters to using the cloud for the same tasks. This article also proposes a solution called HPC+Cloud that enables enterprises to migrate to, and subsequently manage, high performance computing on the cloud. HPC+Cloud manages multiple, disparate, nodes on the hybrid cloud over software defined networks to complete tasks in a queue.
Cooperative research has been conducted to improve navigation services for vision impaired when they are moving through an indoor environment. Hence, to facilitate vision impaired individual in indoor environment requires a formal modelling approach for map generation and decision making about navigation pathways avoiding obstacles. The proposed data model consists "AccessBIM" database (DB) and API functions. The "AccessBIM" features such as, database connection initiation, function call, function return and database connection termination are organized as a series of function objects that meet the various needs of the database to maintain the relational schema. The proposed API primarily consists of two components, namely; DB connector and the API functions. Proposed "AccessBIM" DB will be implemented using real-time database such as "PostgreSQL". The weighted focus will be given to the areas such as Queries, Indexes and Transactions in relation to tuning the DB and the queries. The performance of the proposed API will be evaluated based on the time required to parse and Data insert rate and retrieval rates.