Hacker Read

praptak · 2021-05-26 11:08:18

I know. GPs solution was to point them to a URL which you control, with a suggestion that FN owners would go as far as to scrape these.

Edit: they are obviously already filtering the topics themselves.

reply

barbazoo | karma 7012 | avg karma 2.4 · | 2022-08-30 23:26:08

Thank you for mentioning this. It's so convenient to manage that kind of filtering through a service. Works for all platforms too. I can finally bteak my reddit addiction again.

thirdsun | karma 2349 | avg karma 1.61 · | 2019-06-24 02:28:51

Isn't that solved by using the more specific filter site:reddit.com?

naanalla | karma 12 | avg karma 1.2 · | 2013-03-13 07:50:15+00:00

Seems like reddit has some thoughts how to handle huge pile of information which is 'out there'. As ardent reddit lurker this was most needed feature of reddit.

astrange | karma 14132 | avg karma 1.32 · | 2022-08-22 00:24:36

If this solves the problem where their results are useless unless you add "reddit", it's the best thing they could be doing.

ehsankia | karma 11753 | avg karma 3.62 · | 2021-03-22 14:52:54

Absolutely, I loved seeing spikes in my traffic and going to the reddit/HN thread that caused it. I guess you still can manually figure it out using a search engine maybe, but like you said the smaller weird ones will go under the cover probably.

orangethirty | karma 3518 | avg karma 1.87 · | 2013-02-08 18:22:45

Reddit does have some subreddits that deal with this. And Nuuton will have that functionality once it gets out of ALPHA (in about 6 months).

dsir | karma 111 | avg karma 1.59 · | 2023-06-15 15:17:55

Yeah, that was a key design decision from the start. All posts and comments made on the platform are indexable by search engines.

unixhero | karma 4535 | avg karma 1.33 · | 2021-07-05 02:55:30

The https://reddit.com/r/datahoarder community is working on it

mjr00 | karma 4362 | avg karma 7.75 · | 2022-01-03 10:47:45

My current solution for this is to just tag `site:reddit.com` to the beginning of Google searches. A Google search for `site:reddit.com best miter saw` has a lot of relevant results.

Marketers/SEO people are starting to infiltrate this as well, but since they can't control and SEO the content on Reddit nearly as much, this still works pretty well for now.

reply

NicoJuicy | karma 10294 | avg karma 1.47 · | 2021-01-11 15:54:38+00:00

I crawled reddit in several topics.

It's supported through their api.

reply

fpgaminer | karma 6313 | avg karma 6.56 · | 2015-08-03 01:23:26+00:00

Great idea! I immediately thought "Why didn't I think of that!?"

With regards to the privacy concerns of Research mode, there may be a way solution. For sites like Reddit, it should be possible to build a bloom filter. Have the metafruit server actively spidering Reddit for new, popular threads and add them to a bloom filter. The plugin would download the bloom filter from the metafruit server at some regular interval. That way checking whether any particular URL has an associated conversation is just a local operation. Plus, it's faster than pinging an API, and burns less of the target API's resources.

That would also provide a way to monetize, by giving out the metafruit bloom filter to subscribers only. Or perhaps the free plugin can update its bloom filter once a day, but subscribers can update once per hour.

reply

tdoggette | karma 1118 | avg karma 2.69 · | 2009-08-04 17:39:07+00:00

I'd like to be able to enter a URL and see every place on the internet that people have talked about it.

emodendroket | karma 21781 | avg karma 1.83 · | 2022-12-23 19:25:00

Reddit is pretty well indexed.

spladug | karma 158 | avg karma 5.85 · | 2018-01-16 16:22:02+00:00

Yup! And this sort of tracking is exactly why.

https://github.com/reddit/reddit/blob/master/r2/r2/lib/cssfi...

https://www.reddit.com/r/cssnews/comments/24anzb/css_change_...

reply

eulers_secret | karma 747 | avg karma 3.93 · | 2023-06-09 17:44:53

Just append .json to any Reddit URL and you'll get a full dump of that page, we'll see if they get rid of this feature as well. Way easier than scraping.

def_true_false | karma 295 | avg karma 0.82 · | 2017-11-24 21:38:46+00:00

Something like what keybase does with e.g. reddit should work.

saurabhnanda | karma 240 | avg karma 2.14 · | 2013-06-17 04:22:22

How do you ensure that Google is able to index it? Do you sniff the user-agent and serve it a pre-rendered view of the discussion thread?

Raphmedia | karma 5930 | avg karma 2.63 · | 2019-05-23 17:48:50+00:00

You can already simply click a domain on reddit and it shows you all the posts that linked it.

phdelightful | karma 376 | avg karma 4.13 · | 2022-04-15 07:42:10

I wonder to what extent this is an early step toward walling off Reddit's content from third-party search engines. They probably recognize that without a functional search they'd take a big hit in such a scenario, but with search under their own control they can better influence how people end up seeing various content.