Jump to content

Welcome! You're currently a Guest.

If you'd like to join in the Discussion, or access additional features in our forums, please sign in with your Evernote Account here. Have an Evernote Account but forgot your password? Reset it! Don't have an account yet? Create One! You'll need to set your Display Name before your first post.

Photo
Mac

How does Evernote handle OCR?

ocr indexing scanning evernote

  • Please log in to reply
24 replies to this topic

#1 grekko

grekko

  • Pip
  • Title: Member
  • Group: Members
  • 4 posts

Posted 06 December 2011 - 12:56 PM

Hey guys,

I am using Evernote as a document trunk for my private analog mail. I started this by using OCRx and manually Indexing the scanned PDF documents. Since I am now a premium member I wanted to know about EN capabilities for PDF OCR/indexing.

Does Evernote execute the PCR process on the server side?
(Here) spg SCOTT points out, that the OCR is done on the server and it needs a magical span of time until its synced back to the client?

Is the result of OCR/Indexing put back inside the source PDF?

(Here) jbenson2 says "The Evernote OCR process only works in Evernote. Once the PDF is pulled and sent elsewhere it is no longer searchable." Is that true?


Sincerely,

Gregory

#2 jbenson2

jbenson2

  • PipPipPipPipPip
  • Title: User # 142,683
  • Group: Members
  • 5,589 posts

Posted 06 December 2011 - 01:12 PM

#1
It's not "magical". It is based on reality. Premium users get priority over the free users. And yes, it does need to be sync'd back.

Evernote: "Once a file has been recognized, you'll need to sync the results back to your account to see them in your local client."
https://support.ever...t=31&docID=2769



#2
There is an easy to prove my point. Pull out a PDF that Evernote performed the OCR on. Try searching the PDF outside of Evernote.

Evernote: "Once the PDF has been deemed valid for processing, the PDF is run through our best-of-breed OCR engine which generates a searchable form of the same PDF. This version is synced back down to the user's desktop and mobile client applications."

https://support.ever...2 47&docID=2807



#3 BurgersNFries

BurgersNFries

  • Title: Moderator
  • Group: Evernote Evangelist
  • 12,153 posts

Posted 06 December 2011 - 01:46 PM

#1 - yes

#2 - a different, searchable PDF is created that can be saved to your computer, should you want/need it. (Right click the PDF in the Evernote & save searchable.)
I'm not affiliated with Evernote. Evernote is an integral part of my life.

Submit support requests toward the bottom of the help/support page here. If you do not receive an auto reply email with a case #, it did NOT get submitted. Premium users will receive a reply within one business day, California time. Free users receive a reply as time permits.

#4 grekko

grekko

  • Pip
  • Title: Member
  • Group: Members
  • 4 posts

Posted 06 December 2011 - 03:52 PM

#1 - yes

#2 - a different, searchable PDF is created that can be saved to your computer, should you want/need it. (Right click the PDF in the Evernote & save searchable.)



#1
It's not "magical". It is based on reality. Premium users get priority over the free users. And yes, it does need to be sync'd back.

Evernote: "Once a file has been recognized, you'll need to sync the results back to your account to see them in your local client."
https://support.ever...t=31&docID=2769



#2
There is an easy to prove my point. Pull out a PDF that Evernote performed the OCR on. Try searching the PDF outside of Evernote.

Evernote: "Once the PDF has been deemed valid for processing, the PDF is run through our best-of-breed OCR engine which generates a searchable form of the same PDF. This version is synced back down to the user's desktop and mobile client applications."

https://support.ever...2 47&docID=2807


First of all: thank you guys for the quick answer.

To #1: I just wonder how long it takes, because I dropped a pdf into EN this morning and It still got no OCR.
I have a wild theory: Since I am not a friend of premium subscriptions I canceled the premium account to the end of this month, BUT right now I am a paying premium member. Could it be that accounts that are not "unlimited" premium members are treated that way?

To #2: Since I dont even get my PDFs/Notes inside EN indexed I can not check that right now.


So still my documents are not being indexed. Any hints what I can do?


Sincerely,
Gregory

#5 BurgersNFries

BurgersNFries

  • Title: Moderator
  • Group: Evernote Evangelist
  • 12,153 posts

Posted 06 December 2011 - 04:08 PM

There is no partiality toward premium accounts. Be sure you've sync'd, since after the PDF is OCR'd, it needs to be sync'd down to your desktop. Also,

We only attempt to process an Image-based PDF if all of the following conditions are met:

The raw PDF is 25 megabytes or less.
The scan contains no more than 100 pages.
The raw PDF doesn't already contain "searchable" text that you can select and copy.
The PDF isn't encrypted or protected with a passphrase.
The PDF is not of an handwritten document.

If you believe your PDFs meet these criteria, please click the link in my signature to file a support request and we'll examine further.


I'm not affiliated with Evernote. Evernote is an integral part of my life.

Submit support requests toward the bottom of the help/support page here. If you do not receive an auto reply email with a case #, it did NOT get submitted. Premium users will receive a reply within one business day, California time. Free users receive a reply as time permits.

#6 jefito

jefito

  • Title: Evangelist / Moderator
  • Group: Evernote Evangelist
  • 10,756 posts

Posted 06 December 2011 - 04:33 PM

Just as there is no way to be "a little bit pregnant", you're either a premium subscriber, or your're not, as far as I can tell.
~Jeff
EVERNOTE: Getting Started | Support Page | Knowledge Base | Support Requests | Best Practices for submitting a support request
If someone helped you, or you like or agree with someone's post, let them (and us) know by clicking their post's "Like" button.

#7 BurgersNFries

BurgersNFries

  • Title: Moderator
  • Group: Evernote Evangelist
  • 12,153 posts

Posted 06 December 2011 - 04:56 PM

Yeah, guess I should rephrase my answer. There is partiality toward premium accounts vs free accounts. But no partiality toward a premium account that will not be renewed vs one that will. :)
I'm not affiliated with Evernote. Evernote is an integral part of my life.

Submit support requests toward the bottom of the help/support page here. If you do not receive an auto reply email with a case #, it did NOT get submitted. Premium users will receive a reply within one business day, California time. Free users receive a reply as time permits.

#8 grekko

grekko

  • Pip
  • Title: Member
  • Group: Members
  • 4 posts

Posted 06 December 2011 - 08:16 PM

Just as there is no way to be "a little bit pregnant", you're either a premium subscriber, or your're not, as far as I can tell.

Yeah, guess I should rephrase my answer. There is partiality toward premium accounts vs free accounts. But no partiality toward a premium account that will not be renewed vs one that will. :)


I have a premium account that (right now) won't be renewed in january.

@BurgersNFrieds: All these criteria meet my situation, so I'll be posting an issue on your support link.

#9 grekko

grekko

  • Pip
  • Title: Member
  • Group: Members
  • 4 posts

Posted 07 December 2011 - 08:30 AM

Thanks to the support of Giovanna from EN Support I figured my problems out. I thought EN gives me a searchable PDF that I'd be able to select text from. jbenson2 pointed that out.

I must say that I am pretty unhappy with that and am going back to my old workflow (like jbenson2) and index my PDFs locally.

#10 BurgersNFries

BurgersNFries

  • Title: Moderator
  • Group: Evernote Evangelist
  • 12,153 posts

Posted 07 December 2011 - 11:08 AM

#2 - a different, searchable PDF is created that can be saved to your computer, should you want/need it. (Right click the PDF in the Evernote & save searchable.)


I'm not affiliated with Evernote. Evernote is an integral part of my life.

Submit support requests toward the bottom of the help/support page here. If you do not receive an auto reply email with a case #, it did NOT get submitted. Premium users will receive a reply within one business day, California time. Free users receive a reply as time permits.

#11 Nashvillian

Nashvillian

  • Pip
  • Title: Member
  • Group: Members
  • 25 posts

Posted 16 December 2011 - 08:19 PM

I take a screen shot of some text. I put that screen shot in a note. After Evernote syncs, I can search for a word within that screen shot and Evernote finds it. Evernote has converted the picture of the words into searchable text.

Is there a way to copy some or all of that searchable text and paste it into another document as editable text?

#12 Metrodon

Metrodon

  • PipPipPipPipPip
  • Title: Bankrobber
  • Group: Members
  • 4,786 posts

Posted 16 December 2011 - 08:35 PM

Not currently

#13 jefito

jefito

  • Title: Evangelist / Moderator
  • Group: Evernote Evangelist
  • 10,756 posts

Posted 16 December 2011 - 08:40 PM

Is there a way to copy some or all of that searchable text and paste it into another document as editable text?

Not easily, no. Evernote does not create a stream of contiguous words to match whatever text it finds. Rather, it creates a set of guesses at what words exist in the image, each with its location and extents. These word guesses are stored individually along with the note. You can see what they are by exporting the note to Evernote format and pulling the resultant .enex file into a text editor. You'd see them as character data in <t> items under the <recoIndex> section, down towards the end of the note.
~Jeff
EVERNOTE: Getting Started | Support Page | Knowledge Base | Support Requests | Best Practices for submitting a support request
If someone helped you, or you like or agree with someone's post, let them (and us) know by clicking their post's "Like" button.

#14 Metrodon

Metrodon

  • PipPipPipPipPip
  • Title: Bankrobber
  • Group: Members
  • 4,786 posts

Posted 16 December 2011 - 08:51 PM

Ooops my bad, I didn't know that

#15 jefito

jefito

  • Title: Evangelist / Moderator
  • Group: Evernote Evangelist
  • 10,756 posts

Posted 16 December 2011 - 09:33 PM

It's still not particularly useful for what the poster is looking for.
~Jeff
EVERNOTE: Getting Started | Support Page | Knowledge Base | Support Requests | Best Practices for submitting a support request
If someone helped you, or you like or agree with someone's post, let them (and us) know by clicking their post's "Like" button.

#16 kkc0123

kkc0123

  • Pip
  • Title: Member
  • Group: Members
  • 3 posts

Posted 14 June 2012 - 09:06 PM

Google docs has OCR and will "create a stream of contiguous words to match whatever text it finds" in the jpg you upload. I have found that it will not do a negative (white on black) text pic though.

#17 GrumpyMonkey

GrumpyMonkey

  • Title: 不機嫌な猿
  • Group: Evernote Evangelist
  • 10,191 posts

Posted 14 June 2012 - 09:40 PM

Google docs has OCR and will "create a stream of contiguous words to match whatever text it finds" in the jpg you upload. I have found that it will not do a negative (white on black) text pic though.


Hi. Welcome to the forums.

I just found out that Google Drive will not index anything after the first 10 pages of a PDF. I was a bit disappointed to find out how limited it is. I have the PDFs OCR'd, so you'd think it would be no big deal to index it all. What I plan to do as a workaround is to extract the text myself and stick it into a document.

#18 mdh98368

mdh98368

  • Pip
  • Title: Member
  • Group: Members
  • 13 posts

Posted 20 February 2013 - 04:06 AM

I just upgraded to Premium after many years of use. I have lots of PDFs attached to notes. Now that I am premium, will the EN servers automatically start running OCR against all of the PDFs in my existing notes, or do I somehow have to trigger that process. (I'm hoping I don't have to recreate the note!)

 

Thanks.



#19 Jackolicious

Jackolicious

  • Title: Desktop Products
  • Group: Evernote Employee
  • 1,507 posts

Posted 21 February 2013 - 01:50 AM

mdh, give the service some time. It should index everything for you automatically.


My TwitterCheck out our Knowledgebase  our guides here: Mac or Windows. Still need help? Submit a support request here.


#20 mdh98368

mdh98368

  • Pip
  • Title: Member
  • Group: Members
  • 13 posts

Posted 02 March 2013 - 05:23 PM

Thanks for the encouragement... It's been over a week since I upgraded to Premium and the test case file I'm watching still hasn't been indexed. Is that still a reasonable amount of time? Also, is there anyone to force an individual note to get indexed?







Also tagged with one or more of these keywords: mac, ocr, indexing, scanning, evernote

0 user(s) are reading this topic

0 members, 0 guests, 0 anonymous users

Clip to Evernote