Commit graph

1158 commits

Author SHA1 Message Date
Samuel Clay
646fcb0819 Moving to 3 hour min for premium sub feed fetch intervals (from 6 hours). This is for feeds with no activity. 2019-05-28 10:23:42 -04:00
Samuel Clay
416ac32536 Interactive debug feed 2019-05-12 16:47:20 -04:00
Samuel Clay
4083d5dec3 Adding logging 2019-05-12 16:44:40 -04:00
Samuel Clay
74f46686e3 Double checking for image urls. 2019-02-18 15:38:44 -05:00
Samuel Clay
3e06b96221 Moving extract image urls into saving for all new stories. 2019-02-04 11:49:01 -05:00
Samuel Clay
9d7fc6827a Enforcing secure image urls when replacing image from text extraction. 2019-02-01 11:55:10 -05:00
Samuel Clay
5865c23e7c Moving feedburner images to the back of the image list to highlight original images where possible. 2019-01-20 14:53:39 -05:00
Samuel Clay
80ee4fd814 Securing images through Text view. Now need to fix shared stories and saved stories. 2019-01-20 13:55:40 -05:00
Samuel Clay
da969804fb Upgrading nginx, adding secure_image_urls to all stories. 2019-01-19 15:37:20 -05:00
Samuel Clay
800d56bb0b Merge branch 'master' into camo
* master:
  Proxy all story view.
  iOS: #1137 (full screen reading)
  Forget the original page for saved stories.
  Fetch original page for saved stories.
  Fixing saved stories inline.
  Adding logging
  Wrong split story hash
  Fidning duplicate feeds in saved stories.
  iOS: #1137 (full screen reading)
  Fix: build postgres container without error
2019-01-19 14:15:23 -05:00
Samuel Clay
a47a026435 Forget the original page for saved stories. 2019-01-18 13:45:32 -05:00
Samuel Clay
52a8e1e2e5 Fetch original page for saved stories. 2019-01-18 13:44:32 -05:00
Samuel Clay
e5c4726932 Creating sgined urls for camo. 2019-01-17 15:05:32 -05:00
Samuel Clay
c6abff0492 Adding to easily make it so feeds never get trimmed of stories. Fixes #1141. 2018-11-04 12:00:14 -05:00
Jordan
bd702371f8 Attempt to fix fetching tumblr rss feeds. 2018-10-26 09:12:00 +00:00
Samuel Clay
7c6594f820 Increasing timeout on feedfinder, just in case (see https://blog.dbrgn.ch/). 2018-08-21 09:58:11 -04:00
Samuel Clay
4a695fd710 Boosting feed fetches due to faster feed ingestion. 2018-08-02 09:59:43 -04:00
Samuel Clay
54a0da95bd On stories with unmarked dates, add a random number of seconds (under a minute) to give stories their own timestamp. Makes feed timelines more deterministic once the stories are injested. 2018-07-26 10:45:09 -04:00
Samuel Clay
692d3f3085 Incorrectly skipping title-less stories. 2018-07-16 10:10:31 -04:00
Samuel Clay
cc615e6d04 Fixing handling of invalid feed url on share story feed finding. 2018-07-13 11:23:56 -04:00
Samuel Clay
ab939f72fe Adding a 'story_title_blank' field to stories to signify that the story title was originally ommitted (even though NewsBlur added one in). Thanks to @jbrayton for the suggestion. 2018-07-12 14:19:14 -04:00
Samuel Clay
9223000139 Adding timeout to feed finder on initial feed add. 2018-06-28 13:38:56 -04:00
Samuel Clay
da4741a3fa Adding videos to facebook posts. Seems to work pretty well. 2018-03-26 17:31:12 -07:00
Samuel Clay
3d31ed2507 Handling connection error when finding feed. 2018-01-18 16:15:17 -08:00
Samuel Clay
8421f667d7 Fixing broken image handling from Mercury Reader that was causing image urls with a srcset to be concat'd together. This one's for @yesthatjwz. 2018-01-17 16:51:06 -08:00
Samuel Clay
b99ccd7045 YouTube feeds now have an auto extracted title. Thank you Bruno! 2017-12-18 21:48:26 -08:00
Samuel Clay
ef96f59c2c Only count months that have stories for the average count. 2017-12-15 17:12:14 -08:00
Samuel Clay
f2ab8145c5 Adding options to control infrequency of infrequent site stories feed. 2017-11-05 14:01:25 -08:00
Samuel Clay
6a27023e12 Merge branch 'master' into infrequent
* master:
  Handling no original doc returning in text importer.
  Assets for icons.
  No longer finding the largest image in a story if the text view already successfully found one. Also using Mercury's builtin image finder.
  Fixing warnings.
  Removing ESPN from original pages.
2017-11-03 14:46:02 -07:00
Samuel Clay
27688c7593 Handling no original doc returning in text importer. 2017-11-03 13:48:44 -07:00
Samuel Clay
2d05dc9222 No longer finding the largest image in a story if the text view already successfully found one. Also using Mercury's builtin image finder. 2017-11-03 13:47:17 -07:00
Samuel Clay
b7574a1ff7 No longer finding the largest image in a story if the text view already successfully found one. Also using Mercury's builtin image finder. 2017-11-02 22:09:37 -07:00
Samuel Clay
f543e408e9 Attempting new Infrequent Site Stories river. 2017-10-16 14:22:37 -07:00
Samuel Clay
920e4be4bd Handling case where every story is the same time. 2017-06-28 17:19:54 -07:00
Samuel Clay
c1834703d9 Adding support for JSON Feeds. 2017-05-22 16:46:56 -07:00
Samuel Clay
6bc9a55bfc Rewriting twitter fetching. Now fetching truncated text, embedding quoted tweets, and fixing URLs so that they show the display url (and not t.co) and link to the expanded url. 2017-05-06 19:38:36 -07:00
Samuel Clay
262d67abf9 Whoops, zrevrange is high to low, which means newest to oldest. 2017-05-01 12:08:56 -07:00
Samuel Clay
44de405195 Only fetch as many guidas as there are stories in the feed. No need to get everything assuming the feed has the latest N stories. 2017-05-01 12:06:56 -07:00
Samuel Clay
08c897e5a1 This is it, the big kahuna of fixes. This corrects for messed up guids that are causing lots of read stories to become unread. 2017-05-01 11:39:24 -07:00
Samuel Clay
c9326a6f02 Perhaps this is the way to find the missing story hashes. Shouldn't cause an issue, but logic for dates may be backwards. 2017-05-01 09:27:31 -07:00
Samuel Clay
ffeeb170e0 Finally have a test case for the Google Blog duping. 2017-04-30 18:47:10 -07:00
Samuel Clay
d84e2af636 New experimental data collector for debugging feeds over time. 2017-04-12 19:13:33 -07:00
Samuel Clay
461c1c4b65 Changing feed log format to include id at the beginning. Also normalizing all feed titles in logs for better searchability. 2017-03-31 19:52:24 -07:00
Samuel Clay
930abb9adb Merge branch 'master' into saved_searches
* master: (22 commits)
  Turn original text caching back on.
  Extracting images from original text's noscript.
  Fetcing the original text now extracts the image url for others.
  Monkey patching SSL for new python, since hostnames don't match with S3.
  Converting videos in email notifications to images.
  Upping quota to 100 shared stories a day.
  Bumping premium shares to 50 per day.
  Android v5.1.0
  Only 20 stories may be shared per day for premiums, 3 for free users. Also hits IFTTT sharing.
  Improving messaging on emails that have OPML backups. Thanks to @frenetic for bringing this up. Closes #1003
  Fixing up postgresql backup.
  Hiding cookie lost message.
  Parallel pgbouncer kill.
  Fixing attribution in twitter RTs.
  Downgrading to elasticsearch 2.4.4, since pyes isn't ready for ES 5.
  Adding support for native RTs in Twitter.
  Automatically disbaling transparent huge pages (THP) on mongo and redis. Also upgrading to elasticsearch 5.2.2, although its untested.
  Add two buttons to get the app
  Fixing broken getsatisfaction community feedback.
  Goodbye Turn Touch campaign.
  ...
2017-03-23 16:54:37 -07:00
Samuel Clay
fdfcc8e798 Turn original text caching back on. 2017-03-23 16:29:15 -07:00
Samuel Clay
82cdae1e4d Extracting images from original text's noscript. 2017-03-23 16:28:47 -07:00
Samuel Clay
2c195cde2a Fetcing the original text now extracts the image url for others. 2017-03-23 16:06:06 -07:00
Samuel Clay
e1bd42612f Adding unique index. 2017-03-07 12:28:21 -08:00
Samuel Clay
2c0bf76e20 Saved search for feeds now works. Need to fix scroll to selected feed (and keyboard shortcuts). Also need to hook up other search types. 2017-03-06 19:55:18 -08:00
Samuel Clay
3435bac504 Showing saved searches. Titles need work. And they don't show the feed yet. 2017-03-03 18:12:27 -05:00