Jump to content
  • 0

Web Clipper loses author identity


Joseph Zitt

Idea

I have a problem with the Web Clipper that is making it unusable as a complete research tool. When I clip articles, it frequently loses the attribution within the page. That means that I can't tell who wrote the article. Not useful.

Here are some links to articles for which this happened within the past few days:

https://www.nytimes.com/2017/07/08/opinion/sunday/why-does-donald-trump-keep-dissing-jews.html

http://www.bbc.com/culture/story/20170616-the-1000-year-old-lost-arab-poetry-that-lives-on-in-hebrew

http://welcometotwinpeaks.com/news/reflections-twin-peaks-brad-dukes-interview/

http://lifehacker.com/the-truth-about-cat-allergies-1796464913

https://www.nytimes.com/2017/06/27/magazine/why-the-far-right-wants-to-be-the-new-alternative-culture.html

http://www.hollywoodreporter.com/live-feed/twin-peaks-star-kyle-maclachlan-promises-everything-will-make-sense-1019076

http://nypost.com/2017/07/04/heroin-has-infiltrated-the-hasidic-community/

https://medium.com/the-wtf-economy/do-more-what-amazon-teaches-us-about-ai-and-the-jobless-future-8051b19a66af

https://www.theatlantic.com/international/archive/2017/07/trump-speech-poland/532866/

http://www.cleveland.com/metro/index.ssf/2017/07/churches_and_synagogues_leave.html

Clearly, Evernote should not be deleting this information when it saves the articles. To be really useful, it should be emphasized as a special, searchable field.

This has been true for a long time. I'm kind of amazed that a tool that is so close to useful continues to have this bug.

Link to comment

1 reply to this idea

Recommended Posts

  • Level 5*
On 7/10/2017 at 11:06 AM, Joseph Zitt said:

I have a problem with the Web Clipper that is making it unusable as a complete research tool. When I clip articles, it frequently loses the attribution within the page. That means that I can't tell who wrote the article. Not useful.

I agree, this is a major issue for really web clipping for any purpose, classical research or otherwise.

It should be obvious to all that when you capture text from a source, it must include, author, date published, and URL to be useful for later use/analysis.

I am working on a Mac AppleScript that will capture this info from a web page, IF it is available.  The issue is that there is no standard form/format used by all web pages for this info, although the better sources, like NYT and WSJ, usually include this data in the HTML metadata of the page.  The key parts of my AppleScript are really JavaScript, which could be used by any tool (Mac or Windows) that can do JavaScript injection of a web page.  

I then have another AppleScript which uses this info to insert the following type of info at the top of the EN Mac Note of the captured web page:

59669cb2a6016_2017-07-12_17-02-05(1).png.a0655d233b39ddfa483fbd396f218323.png

Unfortunately, my script/process does NOT work on all web pages, or even a large percentage of them.  This is mostly due to my lack of JavaScript skills.  I would  hope that the experts at Evernote could make this work much better.

If anyone is interested in this script, please reply here, and I'll notify you when the script is ready.

 

Attn: @Evernote ( @gbarry, @rubenb, @Chantal Leonard ):  Would you please look at providing us with this data at the top of all clipped web pages.

Link to comment

Archived

This topic is now archived and is closed to further replies.

×
×
  • Create New...