Read https://sourceforge.net/projects/tesseracthindi/files/OCRHindi_using_VietOCR_and_Tesseract.pdf/download for instructions on using the VietOCR GUI to OCR Hindi and Sanskrit texts with tesseract-ocr.
*****
Please see the imagessan and imageshin repositories at https://github.com/Shreeshrii/ for newer box/tiff pairs, traineddata files, OCR evaluation statistics, and ground truth files with images for Sanskrit and Hindi.
*****
The following is OLD information, kept only for archival purposes.
Tesseract OCR 3.02 provides hin.traineddata for recognizing text in the Devanagari script. However, the Hindi training texts, images, and box files are not provided, so it is difficult to improve accuracy by refining the traineddata further. ...