Hacker Read

staunch · 2012-02-21 04:26:59+00:00

It seems reasonable to me that taking meta-data would fall under fair-use but taking the actual media wouldn't.

For example: Scraping <title> tags off Netflix would be legit, but copying Netflix video files wouldn't.

kragen | karma 31428 | avg karma 2.09 · | 2024-05-08 21:58:34

fair use applies when there's a copyright infringement to defend against, but the cc license on stackoverflow clearly permits scraping the content

JoshTriplett | karma 44606 | avg karma 4.76 · | 2019-06-12 09:20:24

> Fair use does not give you the right to wholesale scrape content

Yes, it potentially does. There are court cases establishing precedent that copying something in its entirety can still be fair use, as well as law and court cases establishing specific allowances for archives/libraries/etc.

reply

mthoms | karma 4270 | avg karma 2.02 · | 2016-08-24 19:37:05+00:00

What you've failed to mention is the criteria used to determine if a usage is indeed "fair". There are 4 basic criteria[0] but can be summarized as "If the usage doesn't affect the market for the original work, is substantially transformative, is proportionally insignificant or is used for critique/parody then it is fair". Or, at the risk of over simplifying it: "Does the usage grant a net public benefit without significantly hurting the copyright holders ability to make money?".

>Can I send an email to Netflix and tell them "Hey, if you don't want me to copy your shows, please add this in your page's HEAD element: <meta name='please-dont-download-my-shows-sir'>"?

Actually, under fair use you certainly can make a personal copy (see Betamax case). If you distribute the work you would likely run afoul of the criteria summarized above.

The robots.txt relevancy is being over stated in your argument. The main criteria used in this case is summarized above. The fact that Google provides an opt-out mechanism is a secondary, supporting argument.

>What if I started indexing and rehosting thumbnails? I can assure you that I would get C&D'd almost immediately

A determination of infringement would depend entirely on the context as related to the afore mentioned criteria. The fact that someone might try to sue is a product of the terrible system in general and you're absolutely right - as with any legal matter the entity with the deeper pockets can often bully the other guy into submission.

>In Craigslist v. 3Taps, while primarily a CFAA case, 3Taps was found to be infringing copyrights

My understanding is that the copyright part of the case was thrown out [1] and thus was settled solely around CFAA matters.

>In Ticketmaster v. RMG Technologies , RMG was found to infringe just by parsing a page.

I agree that the logic used for the judgement is absurd (for reasons that are plainly obvious to any HN user). But it's less clear whether the case would meet fair use criteria outlined above should it have come to that. My guess is that it wouldn't qualify since the usage affects the copyright holders ability to make money on the work and doesn't meet any of the other criteria for Fair Use.

>Facebook v. Power Ventures

This is not a case involving a defense of fair use (as far as I can tell). Facebook even acknowledged the users owned the data and had a right to it. The defendant was actually found to be violating CFAA and CAN-SPAM acts.

>It seems Google is the only entity capable of making unauthorized copies and then getting courts to agree that it's fair use. For the rest of us, it's infringement

Provably false [2]. It sounds like perhaps your personal experience has soured your opinion on the matter? That's understandable. But none of the evidence you've cited supports the argument that Google is infringing copyrights in its core activities nor that Google is the only entity where copyright laws and fair use legislation don't apply.

PS: To be clear, my argument revolves specifically around copyright infringement and fair use. I don't have enough understanding of other, separate legislation like CFAA to comment on that except to say that it seems overly broad and unrealistic. But that's another topic. I'm specifically arguing against calling Google a copyright infringer in a broad sense which is what you've done. That's not been proven.

[0] https://en.wikipedia.org/wiki/Fair_use#U.S._fair_use_factors [1] https://techcrunch.com/2013/04/30/craigslist-3taps-lawsuit-d... [2] http://fairuse.stanford.edu/overview/fair-use/cases/

reply

cedsav | karma 788 | avg karma 3.58 · | 2007-08-21 17:04:20+00:00

Fair use certainly doesn't apply. You're using Google search technology (which btw, involves a bit more than 'scraping' the web) and stripping out the ads (their source of revenue). Expect a cease and desist letter soon.

giantrobot | karma 5286 | avg karma 2.0 · | 2023-11-25 10:50:05

Downloading publicly available content for preservation purposes? There's no sane argument that this doesn't squarely fall under Fair Use.

megaman821 | karma 3069 | avg karma 2.7 · | 2024-01-08 12:53:27

That is left to be decided, but I don't think that is what fair use says.

Should it be illegal to gather certain bits of meta data on NYT articles like:

  * word counts
  * word frequency
  * sentiment analysis
  * grammar and spelling
  * facts about the world

paulryanrogers | karma 9107 | avg karma 1.68 · | 2021-08-27 18:26:30

What are the legal implications of grabbing from arbitrary video and profiting off derivative works? Is it fair use?

Have you had to moderate to avoid illegal images from getting into the system?

reply

chii | karma 16512 | avg karma 1.96 · | 2012-09-25 04:35:08+00:00

Indeed - and i believe scraping data off a publically accessible webpage (e.g., one where you do not require registration and login/password) falls under fair use, provided you do not take up more bandwidth/resources than the average user of that site.

I recall there being some precedent for this sort of fair use - something like a phone directory - the information is not copyrighted, but the arrangement and layout is. So hence, you can't just iframe a site and present it, but obtaining the data, and deriving a new work from it should fall under fair use.

reply

kwamenum86 | karma 1731 | avg karma 2.36 · | 2008-12-29 01:56:53+00:00

What you have come up with is a fuzzy case, not clear cut. Even with fair use, the way search engines use content can easily be considered infringement.

I still stand by my statement and the only reason search engines are allowed to copy petabytes of COPYRIGHTED material is because they are so darn useful. I don't know of any other service that sidesteps intellectual property rights (whether through fair use or not) and makes a good amount of money that has been allowed to exist and thrive.

reply

rtpg | karma 18369 | avg karma 3.39 · | 2014-10-04 10:15:13

This example might be fair use, though.

RHSeeger | karma 5929 | avg karma 3.1 · | 2022-06-23 07:53:21

It seems fairly similar, at least to me, to a search engine copying snippets of other people's web sites and displaying them on a page. Admittedly, there's still some discussion as to whether or not _that_ is fair use, but I think enough of the population think it is (with many news organizations disagreeing).

emporas | karma 290 | avg karma 0.78 · | 2023-03-31 09:04:41

They don't copy and reproduce the data. They change it sufficiently for the licence to have any say. Fair use it's called.

realusername | karma 7849 | avg karma 2.39 · | 2017-04-16 13:10:31+00:00

I personally don't believe copyright as a concept is reasonable but even if you do, viewing content you downloaded as you wish seems fair to me.

doki_pen | karma 1686 | avg karma 2.75 · | 2011-03-24 13:36:24

Wouldn't that be fair use? Do you have issue with the content being posted, or the fact that it's scanned?

nl | karma 29762 | avg karma 2.49 · | 2011-11-18 11:09:10+00:00

That may be true, but there is still fair-use.

Sparkyte | karma 210 | avg karma 0.38 · | 2023-08-13 23:45:49

I didn't think either was fair use. I think fair use applies to the utilization not the acquisition of. For example if I bought a movie and I used it with the intention to discuss it with clips, I can recode it because I physically own the medium. The fair use chimes in when the clips are in use. However when I download something with a tool from Google (YouTube) I'm violating their ToS. I might also be violating something in the middle that opens up some legal issues. So when you go to use a video only for fair use, you should ask the poster or the source for an unencrypted one. Also they should do the responsible thing and provide it.

stavros | karma 66636 | avg karma 10.05 · | 2023-10-05 13:47:07

Well, yes, otherwise you wouldn't need fair use, you'd just use it.

crote | karma 7004 | avg karma 4.39 · | 2023-12-09 05:37:48

I also have the right to copy and extract parts of the content under Fair Use. If content providers are making this technically impossible - thus depriving us of the possibility of using it for teaching, research, news reporting, or criticism - how are they not violating the social contract?

jrm4 | karma 6471 | avg karma 2.58 · | 2022-09-10 09:04:19

Ah, I can step in here.

Fair use might work but maybe not? If I were to argue against it, I'd probably compare something like a recording of music vs. a MIDI file. Same raw data scaling.

reply