WARNING: THIS SITE IS A MIRROR OF GITHUB.COM / IT CANNOT LOGIN OR REGISTER ACCOUNTS / THE CONTENTS ARE PROVIDED AS-IS / THIS SITE ASSUMES NO RESPONSIBILITY FOR ANY DISPLAYED CONTENT OR LINKS / IF YOU FOUND SOMETHING MAY NOT GOOD FOR EVERYONE, CONTACT ADMIN AT ilovescratch@foxmail.com
Skip to content
This repository was archived by the owner on May 11, 2020. It is now read-only.

Conversation

@alexeib
Copy link

@alexeib alexeib commented Jul 1, 2015

NReadability has some hard coded settings for being able to extract content from specific sites where automatic extraction fails. They do this by having a dictionary of site url -> html element selector containing the content of the article.

Unfortunately they do not list all possible sites that have this problem and there is no way to inject additional sites. This changeset allows one to optionally pass the same url -> selector dictionary down to nreadability to be added to their hardcoded list. Also adds slashdot to the hardcoded list while we're at it :)

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant