Jump to content
peter206

ANSWERED Is Premium required for Scannable OCR support?

Recommended Posts

I could not find this mentioned in the Scannable product info page. It appears Scannable PDFs are just images. Does Premium unlock OCR capabilities? Is there a way to demo the performance first?

Share this post


Link to post

Thanks for your reply.

 

This Evernote blog post:

 

https://blog.evernote.com/tech/2013/07/18/how-evernotes-image-recognition-works/

 

says that OCR is performed on uploaded images, and most PDFs.

 

While PDFs from Scannable do not OCR, I've discovered that if I send a jpg from Scannable to my Camera Roll, and then send that image to Evernote, I do get OCR. PDFs which I bring in from sources other than Scannable are OCRing as well.

 

Can you help me understand why PDFs coming from Scannable do not have parity with this functionality? Evernote already possesses this capability, so is it just concern over the capacity of the infrastructure to handle the (presumably) higher load of documents needing to be analyzed?

 

Peter

Share this post


Link to post

Good questions, all.  It is our intention that PDFs uploaded from Scannable follow the same rules as PDFs uploaded from other sources in Evernote.  If that's not the case, it could be one of two things:

 

1. There is a restriction for the file size of PDFs that are queued for OCR - I believe it's 25MB

2. Something isn't working correctly

 

If the case is #2, I would recommend that you open a support ticket and refer to this thread and we'll see what we can do.

 

P.J.

Share this post


Link to post

I just submitted a support ticket precisely for this reason: documents (PDF) I scan with Scanabble should be indexed and OCRd within Evernote after it has been imported (for premium accounts, like mine) but this is not occurring.

Searching for content in any of these documents that come from Scannable in Evernote returns zero results, even after allowing Evernote 2 whole days to perform its background indexing processes.

Checkout an example note I uploaded.

post-134118-0-48548300-1422943410_thumb.

Share this post


Link to post

Hi Amil

 

Those documents should be returned as search results. I've messaged you privately to try to figure out what's happening here. 

  • Like 2

Share this post


Link to post

I continue to see the results I mentioned above; should I submit a ticket too, klang?

 

Also, the "Best Answer" above says that OCR is not supported for PDFs. It seems this tag should be removed for now, both because this is not resolved and because I believe the answer is inconsistent with expected behavior.

 

Peter

  • Like 1

Share this post


Link to post

Some clarifications:

 

Currently Evernote iOS does not support search-within-PDF if the PDF is an image-based PDF, like the ones Scannable produces. I'll forward this chat to that team. 

 

However searches across all your documents will find a PDF from Scannable if it contains a word that OCR has detected. For example in this case, if you search for Volkswagen, even if this *only* appears in the PDF image, then this will currently be found. 

 

Note that this OCR can take some time, because it's done on the server. It's typical to get back results in minutes for PDFs of a few pages. 

Share this post


Link to post

What is the latest on this issue? I just scanned the same doc and one is pdf and the the other is jpeg. The jpeg ocr search is working but the same document as a pdf the ocr search is not working.

Share this post


Link to post

Would be great to have OCR for every image based pdf! Works great for Post-it and I would love to image pdf's OCR'ed the same way.

Share this post


Link to post

Hi, my first post : ) I scann every bill and invoice, which is great. The specific invoice number is searchable - perfect! But only on Evernote my laptop...? The same invoice/invoice number is not found on my Evernote account on my mobile devices...? (!) Why is that? They are stored in the same place in Evernote.  

Share this post


Link to post

Just took me a few hours to find out that scannable differentiates between 1 and 1+ pages, thereby creating two types of content, which result in OCR and non-OCR. 

Having hundreds of scannable PDF documents in my Evernote...

Here's the billion-dollar-question: If I upgrade to premium NOW, does EN

1. recursively scan my existing documents/content for PDFs and OCR's them (recursively also), so that they become searchable from this point forward? Or, 

2. is OCR-searchability limited to documents I introduce/upload to EN from this point (of signup to premiums) forward? 

Share this post


Link to post

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now

×
×
  • Create New...