Commit graph

1452 commits

Author SHA1 Message Date
Samuel Clay
d748a41d85 Trimming > 1000 stories from feeds. 2010-01-26 19:59:43 -05:00
Samuel Clay
b1c72c5c9f Huge update: Hooking up slider to hide/show stories based on their score. Also added new story counts, score calculations, and real-time updating of unread counts as you read. 2010-01-21 13:12:29 -05:00
Samuel Clay
35df77cdf6 Adding classification types for publishers vs. stories. 2010-01-12 01:19:37 +00:00
Samuel Clay
8891a859fe Finishing up tags, and the classifier modal in the Reader. Now shows all sorts of classifiers. 2010-01-04 22:26:53 +00:00
Samuel Clay
ef70c288ed Adding a classifier to all stories. Classifies on:
- Story title (highlighted phrase)
- Story author
- Feed
2010-01-04 04:49:59 +00:00
Samuel Clay
838a57baa8 Fixing up sessions so they don't cache. Also adding Authors to admin. 2009-12-18 21:59:14 +00:00
Samuel Clay
8bf5fc8dca Fixing the out-of-date dupe bug. 2009-12-18 20:47:44 +00:00
Samuel Clay
06e94610dc Adding Phrase Filter to Analyzer. Just need to hook up the phrase filter to the story classifier and then save the results to be used for later processing. 2009-11-15 18:57:53 +00:00
Samuel Clay
fb6d0692ce Naive Fisher Bayesian Classifier. Stubbed in some boilerplate for making the analyzer do its job. Still have to connect it to feeds and train on real data, but it's looking good as a starting point. 2009-11-03 03:52:03 +00:00
Samuel Clay
0edbb7ed74 I just don't feel comfortable with Django's thread-unsafe ORM. Putting locks back in place around all database .save() calls. It's The Right Thing To Do. 2009-09-16 04:00:37 +00:00
Samuel Clay
7b08770b3a Getting rid of Queues entirely. Replaced with values being distributed directly to Processes. One day this will change. Until then, all feeds fit in memory. 2009-09-16 03:54:33 +00:00
Samuel Clay
4aa0271047 Adding semaphore locks on all mysql db requests in threads/processes. 2009-09-16 02:34:04 +00:00
Samuel Clay
a7888c2aae Switching from threads to processes. Sure, no big deal. Django is not thread-safe, so I can't use threads. I can, however, use multiprocessing to pass a Queue around between Processes. 2009-09-16 02:22:27 +00:00
Samuel Clay
0446ff2ee3 Adding locks to feed processing. 2009-09-12 20:42:38 +00:00
Samuel Clay
bec2f00979 Exiting after it's over. 2009-09-10 04:24:51 +00:00
Samuel Clay
0b9b778382 Queues not working. Reverting back to 1 feed at a time. Ugh. 2009-09-10 04:16:50 +00:00
Samuel Clay
8437ef29f1 Handling empty queues and increasing thread count 2009-09-10 03:52:43 +00:00
Samuel Clay
67ab0d6c66 Let's try changing the load. 2009-09-10 03:48:22 +00:00
Samuel Clay
02a4d292e8 Moving from threadpool to a Queue. Much simpler. 2009-09-10 03:33:05 +00:00
Samuel Clay
0caf49e9a4 OK. Turning on threading for feed fetching. Enough logging to catch the bugs. Fixed the segmentation fault on exit. 2009-09-10 02:31:55 +00:00
Samuel Clay
2ac39549ae The "Hacker News" bug is fixed. Weirdo HN site doesn't publish guids or timestamps. Now bubbling priorities on ids. Also added 'j' and 'k' keyboard shortcuts. And fixed the escaping url bug on the Reader. 2009-09-08 00:13:49 +00:00
Samuel Clay
c0ca9277b8 - Added specialized unit tests for crappy RSS input (like Slashdot).
- Rewrote the feed view in the reader. It is now a long list and looks a lot better.
- Big update to story title logic. Can now handle many more edge cases. Also uses smarter heading finder.
- Fixed scrollTo bug, so now multiple clicks do not queue awkwardly.
- Moved the story titles pane to the bottom.
2009-09-07 01:56:52 +00:00
Samuel Clay
27e539ba57 What happens if the id/permalink are blank? Same story! No, bad idea. 2009-09-05 02:37:31 +00:00
Samuel Clay
acd464af86 Using story GUIDs on top of permalinks. 2009-09-05 02:22:25 +00:00
Samuel Clay
9e522ca528 Woo - Checking for content and title differences to resolve duplicate entries. Integration tests prove a number of differences are successfully resolved. 2009-08-30 00:43:13 +00:00
Samuel Clay
bdd91d714d Refactored the feed fetcher. Added integration tests for dupe matching. Also stubbed in AI score calculations so I can do something with the output soon. 2009-08-29 19:34:42 +00:00
Samuel Clay
3bd43a8cb4 Updating algorithm to insert the necessary base information into the feed's page. 2009-08-26 03:12:55 +00:00
Samuel Clay
d24b56fbf4 Adding new "Return to Page" button. When the user visits other pages in a feed, security prevents NB from reading the iframe, so we provide a convenient hook to bring the iframe back. Call it SnapBack, except, don't. 2009-08-26 02:31:35 +00:00
Samuel Clay
ce9ec58d00 Some feeds don't have pages. Who knew. 2009-08-23 00:07:37 +00:00
Samuel Clay
9c522416a1 Gotta get rid of threading until I can figure out how to thread-safe the Django ORM. It's corrupting my tables! 2009-08-23 00:00:55 +00:00
Samuel Clay
fa123e7f3d Updating feeds to crash with noise. 2009-08-21 13:14:44 +00:00
Samuel Clay
6cf04497ee Checking for threadpool 2009-08-20 02:53:41 +00:00
Samuel Clay
8545e09d9a Using threadpool module and feedjack to import feeds. Largely the same functionality, but now more debugging info, as well as better error handling and threads. 2009-08-20 02:43:01 +00:00
Samuel Clay
492d1fb229 Better imports through error handling. 2009-08-15 15:10:21 +00:00
Samuel Clay
11a9921c07 2009-08-14 02:32:30 +00:00
Samuel Clay
2728ca5722 Making story pane iframe permanent. 2009-08-14 02:27:49 +00:00
Samuel Clay
001ffc942b Fixing failure on bad address. 2009-08-14 01:54:22 +00:00
Samuel Clay
88ccfa7e77 Created admin commands to run refresh_feed/s without invoking Apache. Finally! 2009-08-14 01:48:21 +00:00
Samuel Clay
d97c5ff0c3 Catching the proper exception. 2009-08-13 04:05:49 +00:00
Samuel Clay
0f4cfdffb7 Importing feed's original page along with RSS stories. Wowzers. 2009-08-13 03:26:12 +00:00
Samuel Clay
0582b052bf Tests can now be run from any server/environment. 2009-08-09 19:22:09 +00:00
Samuel Clay
9626b4695a Updating integration tests to include updated feeds. Wahoo! Now to figure out the dupe issues through testing. 2009-08-08 16:52:11 +00:00
Samuel Clay
555b1e3bda Removing a few debug messages for cleaner debugging in production. 2009-08-07 12:21:39 +00:00
Samuel Clay
bd0099ad81 Rewrote ingestion and acquiring of feeds. This needs some serious TLC soon. 2009-08-01 04:26:57 +00:00
Samuel Clay
709beb79b7 Cleaning up tests, reload OPML pane, and minor refactorings for errors. 2009-07-28 22:51:16 +00:00
Samuel Clay
5d185bae74 Optimizing and refactoring a ton of queries and calls. Really, I touched almost everything, from the feed ingestion system, to marking stories as read, to merging ReadStories and Opinions. 2009-07-28 02:27:27 +00:00
Samuel Clay
d4b25e80d2 It's the start of an actual interface. Menu buttons added (still need to be styled and tweaked to auto-close). 2009-07-21 03:18:29 +00:00
Samuel Clay
8d3bc4ec9b Reducing logging. 2009-07-20 05:12:11 +00:00
Samuel Clay
0a8449db50 Numerous memory-saving operations. Chilled out some hoggish sql queries. 2009-07-18 23:39:16 +00:00
Samuel Clay
db09737792 Adding integration tests and fixtures for the RSS Feeds app. In hindsight, it should've been called 'FeedInjestion'. 2009-07-08 02:05:45 +00:00