Self-hosting a reCAPTCHA-like system to digitise my uploaded books
From Wikipedia:
reCAPTCHA is a system ... that uses CAPTCHA to help digitize
the text of books while protecting websites from bots
I have a lot of scanned documents that I'd like to convert, and would like to introduce a captcha on my website, so why not kill two birds with one stone?
The reCAPTCHA project has its own agenda, though, focusing on the archives of The New York Times and books from Google Books.
Does a similar project exist that I could host and thereby dictate the books/docs that are digitised?
1 Comment
Use Google's OCR to digitize those books. As for using your own books as the source of the CAPTCHA words, there isn't currently any third-party software available for that. As a further argument against rolling your own, here is an excerpt from the CAPTCHA site:
Should I Make My Own CAPTCHA?
In general, making your own CAPTCHA script (e.g., using PHP, Perl or .Net) is a bad idea, as there are many failure modes. We recommend that you use a well-tested implementation such as reCAPTCHA.
Further, reCAPTCHA's creator, Luis von Ahn, spoke at a TED conference on the subject of reCAPTCHA. If you do in fact intend to make your own, you might as well study up.
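If you do go down that road, the core idea is roughly this: show each visitor two words, one whose text you already know (the control) and one your OCR could not read, pass or fail them on the control word only, and treat what they typed for the other word as a vote. Below is a minimal Python sketch of just that bookkeeping. Everything in it (`check_answer`, `accepted_transcription`, the in-memory `votes` store, the threshold of 3) is a made-up name or value for illustration, not anything taken from reCAPTCHA itself, and it does nothing about image distortion, rate limiting or the other failure modes the excerpt warns about.

```python
from collections import Counter, defaultdict

# Hypothetical in-memory store: unknown word image id -> tally of typed answers.
# A real site would keep this in a database.
votes = defaultdict(Counter)

# Assumed number of matching answers needed before a transcription is trusted.
AGREEMENT_THRESHOLD = 3


def normalise(text):
    """Compare answers case-insensitively and ignore surrounding whitespace."""
    return text.strip().lower()


def check_answer(control_truth, control_typed, unknown_id, unknown_typed):
    """Return True if the visitor passes the challenge.

    Only the control word decides pass/fail. The answer for the unknown
    word is recorded as a vote, and only when the control word was right.
    """
    if normalise(control_typed) != normalise(control_truth):
        return False
    answer = normalise(unknown_typed)
    if answer:
        votes[unknown_id][answer] += 1
    return True


def accepted_transcription(unknown_id):
    """Return the leading answer once enough visitors agree, else None."""
    tally = votes.get(unknown_id)
    if not tally:
        return None
    answer, count = tally.most_common(1)[0]
    return answer if count >= AGREEMENT_THRESHOLD else None
```

You would call `check_answer` from your form handler and periodically sweep the unknown words with `accepted_transcription` to write agreed text back to your documents. Ignoring votes from visitors who got the control word wrong is what stops bots and careless typists from polluting the transcriptions.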