76 commits

Author SHA1 Message Date
Samuel Clay
7aeb1fdab8 Adding two error classes to requests exception handling. 2012-01-04 18:14:28 -08:00
Samuel Clay
6606c2d015 Adding two error classes to requests exception handling. 2012-01-04 18:09:15 -08:00
Samuel Clay
716b34c5ac Adding more advanced exception handling for new requests module, with fallback to urllib2. 2011-12-21 17:11:33 -08:00
Samuel Clay
259bbca9d9 Adding urllib fallback to requests in page fetching. 2011-12-21 09:11:39 -08:00
Samuel Clay
6637c05adf Adding a little bit more leeway in page importer by falling back to urllib2 when requests fail. 2011-12-20 22:30:55 -08:00
Samuel Clay
b112717d95 Revert "Revert "Switching to requests from urllib2/httplib. Bring on the page errors.""
This reverts commit a827b2c864.
2011-11-27 02:41:12 -05:00
Samuel Clay
a827b2c864 Revert "Switching to requests from urllib2/httplib. Bring on the page errors."
This reverts commit 617dc3bb56.

Conflicts:

	local_settings.py.template
2011-11-27 02:40:44 -05:00
Samuel Clay
d07b03a7a4 Revert "Revert "Switching to requests from urllib2/httplib. Bring on the page errors.""
This reverts commit 7972d0bb5c.
2011-11-26 11:15:01 -05:00
Samuel Clay
7972d0bb5c Revert "Switching to requests from urllib2/httplib. Bring on the page errors."
This reverts commit 617dc3bb56.
2011-11-26 02:16:14 -05:00
Samuel Clay
617dc3bb56 Switching to requests from urllib2/httplib. Bring on the page errors. 2011-11-26 02:07:31 -05:00
Samuel Clay
870b428253 Fixing issue around broken pages that have urn:uuid: in their URL. WTF, Google? 2011-09-07 22:35:03 -07:00
Samuel Clay
d2bcce4b6f Better error handling around broken feed pages. 2011-09-04 14:47:47 -07:00
Samuel Clay
acfd9e7614 Adding feed.has_page to weed out feeds with no original page. This should've been in 2 years ago. 2011-09-04 10:59:29 -07:00
Samuel Clay
96d8283ed0 Updating unit tests, fixing datetime issue in json serialization. 2011-08-31 09:41:34 -07:00
Samuel Clay
e9b88c8cc9 Fixing a few issues here and there around feed fetching. 2011-08-27 14:13:28 -07:00
Samuel Clay
dadef6c7b8 Fixing issue around original pages not including a <HEAD> section, so all URLs are relative to NewsBlur. Thanks to Cramer http://getsatisfaction.com/newsblur/topics/problem_with_relative_urls_in_for_example_rarestblog?utm_medium=widget&utm_source=widget_newsblur 2011-08-07 21:44:57 -07:00
Samuel Clay
e88d78a39c Fixing minor bug around incompletereads on page fetch. 2011-04-05 10:52:49 -04:00
Samuel Clay
41b70feff4 Incomplete Read doesn't have a msg. 2011-04-04 21:02:43 -04:00
Samuel Clay
a757b9f88f Fixing a few feed fetching bugs. 2011-04-04 20:53:12 -04:00
Samuel Clay
bb4e81b278 Fixing a whole lot of feed fetch bugs. 2011-02-15 21:08:40 -05:00
Samuel Clay
ea15aa1175 Major improvement to feed fetching and parsing by upgrading feedparser to v5.0. Also sending correctly formatted feed information to debug email. And handling invalid page URLs gracefully. 2011-02-05 20:49:52 -05:00
Samuel Clay
ff3a078d30 Fixing feed error on page importer. Whoops. 2011-02-05 15:47:56 -05:00
Samuel Clay
3723160155 A huge number of misc fixes: fixing potential errors on paypal return screen, mailing all feed/page parse errors, and swithcing to mapreduce item_frequencies in mongoengine. 2011-02-05 15:34:43 -05:00
Samuel Clay
22cf5ede6c Crazy refactor of feedfinder to extract timelimits, so they can be caught by a lower-level api instead of messing up the feed fetcher. 2011-02-02 13:07:12 -05:00
Samuel Clay
75fead0a27 Cleaning up logging, adding several errors and timeout checks on page and icon fetching. Standardizing fetcher headers. 2011-01-29 22:01:09 -05:00
Samuel Clay
a182bb1216 Finally saving proper base64 encoded favicons, along with their URLs, autodetecting when not found at /favicon.ico, and now all thats left is serving icons and their colors. Woohoo! 2011-01-27 19:05:50 -05:00
Renamed from apps/rss_feeds/importer.py