
I would say that the logic is more like:

Proposition: "They either do not use private code or they did something very very stupid."

Proof: "Not using private code is very easy (for example google does not train its models on workspace users' data, which is why they get inferior features) and they promised multiple time not to use private code so doing in would be hard to justify"




> It means no one ever ran those instructions and checked the result.

That's not really true. It means they produced wrong values for some input. They may have tested but never hit the pathological input. No one is going to sweep the entirety of the domain; you really need a formal proof to avoid these bugs.
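
As a made-up illustration (this function and these inputs are hypothetical, not from the code under discussion): a routine can pass every test anyone bothered to run and still be wrong on inputs nobody thought to try.

    def toward_zero_half(n):
        """Intended: halve n, rounding toward zero (like C's n / 2)."""
        return n // 2

    # The tests someone actually ran all pass:
    assert toward_zero_half(10) == 5
    assert toward_zero_half(7) == 3
    assert toward_zero_half(0) == 0

    # The pathological input nobody tried: Python's // floors toward negative
    # infinity, so negative odd inputs are off by one.
    assert toward_zero_half(-7) == -3   # fails: the function returns -4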


If we suppose that his conclusion is using boolean logic, then what you're saying is a strawman because of his last claim; namely, protobufs are bad if "[...] && !Google":

> They're clearly written by amateurs, unbelievably ad-hoc, mired in gotchas, tricky to compile, and solve a problem that nobody but Google really has.

This dovetails with other arguments that I've seen recently that are becoming more frequent:

Have we entered a new world where the lessons of companies working at massive scales are not only generally superfluous for smaller scales, but are actively harmful?


The reason they don't train the model on their code is specifically because they don't want it accidentally spitting out snippets of their proprietary code, not because the code is "extremely hard to work with."

I'm amazed you called that argument silly while countering with this.


> is it not reasonable to assume that the number of security flaws just reflects how insecure most public code is?

It sounds to me like that's not an inference that can easily be drawn. Copilot was trained to predict code; it doesn't understand the code it produces beyond the syntactic level. Security issues can be highly context dependent. For example, in most cases it's fine to log a variable, but when it happens to contain a password, it's a security issue. This is a flawed example, as the algorithm may be able to learn that variables whose names or contexts suggest they're secrets should not be logged, but I can imagine much more subtle issues cropping up.
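
A minimal sketch of that kind of context dependence (the names and logger setup here are invented for illustration, not taken from Copilot output):

    import logging

    logging.basicConfig(level=logging.INFO)
    log = logging.getLogger("auth")

    def login(username: str, password: str) -> None:
        # Logging a variable is usually harmless...
        log.info("login attempt for user=%s", username)

        # ...but the exact same pattern becomes a security issue when the
        # variable happens to hold a secret: the password lands in plaintext logs.
        log.info("received credentials user=%s password=%s", username, password)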


> He clearly hasn't proven anything about his code.

Is that a bad thing? It seems like many people are able to write software that works well and gets the job done without "proving anything" about their code in the way you're describing. Personally, I'm fine with not proving anything if I can deliver quickly and everything works, but I'd like to be convinced otherwise if there's real value there.


Since tomp objects to trusting code, I agree with Manishearth: if you're never willing to trust code, then you can never have guarantees. Even the proof engine is trusted code.

>If we found bug x, y and z will necessarily be found, therefore VeraCrypt is insecure.

I did not make that argument, I made a probabilistic argument. Trying to apply deductive logic fallacies to probabilistic arguments is a type error.

>if you can't identify or find them, you can't claim they exist.

I made no absolute claim about the existence or nonexistence of bugs, only about the expected number of bugs, that is, a probabilistic estimate.

> There's no such thing as a bug "hit rate".

Of course there is, it is, trivially, the number of bugs found by the audit / the total number of bugs.

Look at it this way: I have two auditors, Alice and Charles. Charles is known to be pretty sloppy: he looks over code quickly, skips parts, etc., but the bugs he finds are usually actual bugs. Alice is extremely rigorous and misses very few bugs.

I have 2 pieces of software, SuperCrypt and UltraCrypt.

Charles audits SuperCrypt and finds 137 critical bugs. He audits UltraCrypt and finds only one.

I know Alice will review both code bases next week. Where do you think she'll find more bugs?
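
To make the estimate concrete, here is the same reasoning as back-of-the-envelope arithmetic; the 20% hit rate is an invented number purely for illustration:

    # Assumed for illustration: sloppy Charles still catches ~20% of the real
    # bugs, and the bugs he reports are usually genuine.
    charles_hit_rate = 0.2

    supercrypt_found = 137
    ultracrypt_found = 1

    # Probabilistic estimate of the totals, not a claim about any specific bug:
    # expected total ≈ bugs found / hit rate.
    print(supercrypt_found / charles_hit_rate)  # ≈ 685 expected bugs
    print(ultracrypt_found / charles_hit_rate)  # ≈ 5 expected bugs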


> no discernible reason

What if the "no discernible reason" is just poorly coded software?


Yeah, people fixated on the meaning of "makes no sense" to evade accepting that the proofs LLMs output are not useful at all.

In a similar fashion, almost all LLM-created tests have negative value. They are just easier to verify than proofs, but even the bias toward creating more tests (taken from the LLM fire hose) is already harmful.

I am almost confident enough to make a similarly wide claim about code. But I'm still collecting more data.


> How did software get so reliable without proof?

Because most software is just moving data from one place to another, combined with some very simple business logic.


The overwhelming majority of the population doesn't have the capacity to understand what you wrote or why that is true. Even the majority of those who understand why it may be true, cannot say for certain that it is true without inspecting the codebase and its operation. That's a major problem.

Where did I assert something similar to your example? It seems like a false analogy. The criticism wasn't even that my logic was flawed but that the same argument could somehow be made to justify any user-hostile action.

If the problem, in your view, is that I didn't /prove/ how big a problem sideloading is, then yes, I didn't even attempt to do that. There's a separate subthread on that question.


> safe code is impossible

> Humans cannot write safe software. Ever. No matter what.

Formally proven code does what it says on the box? Do we have different definitions of safe perhaps?


> Nirvana fallacy. The point is that it can be much better and eliminate ALL non-design bugs.

Serious question: do you really believe that?

> That's just stupid. Show me the studies.

There is a kind of HAL-9000 quality to many of these arguments. Formal verification is perfect by definition. The fact that it hasn't had very much impact in the real world is all the more evidence of the world being full of wicked people.


It's not that it's unclear per se; it's just an untenable position to take.

Lazy example: a hardware-level side-channel attack is not rendered impossible simply because the software was made with formal verification methods.
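
For the flavour of the problem (a timing leak here, standing in for the hardware case; the function names are invented):

    import hmac

    def check_token_naive(supplied: bytes, expected: bytes) -> bool:
        # Functionally correct - a proof of functional correctness would be
        # satisfied - but == bails out at the first mismatching byte, so the
        # response time leaks how long the matching prefix is.
        return supplied == expected

    def check_token_constant_time(supplied: bytes, expected: bytes) -> bool:
        # Same specification, compared in constant time.
        return hmac.compare_digest(supplied, expected)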


You don't understand my point. I was picking at how you claimed that X was bad because it can't be used for applications designed for Y.

I also doubt your understanding of "negative proof"... but that's off-topic.


> Also empirically disproven, even on HN.

Can you provide an example of someone who claims "FP guarantees good code"?


> The author of the article implicitly equates "statically verified code" with "bug-free code".

Not at all. First, the statements as put here are discrete (boolean, even), while I present both "statically verified code" and "bug-freedom" as living on a continuum. Second, I don't equate them. If anything, I assume a monotonic, positive relationship between them (strictly speaking, not even that: I make it pretty clear that the curves could have whatever shape, though I concede I am very suggestive about it because I do strongly believe it to be the case). In fact, one of the main points of the argument is that the two are not equal - otherwise, the blue curves I drew would all be straight lines from (0,0) to (1,1). And lastly, none of this is done implicitly. I mention all of this pretty explicitly :)
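
A quick sketch of that picture (not the article's actual figure; the curve shapes below are arbitrary placeholders):

    import numpy as np
    import matplotlib.pyplot as plt

    x = np.linspace(0, 1, 200)  # "how statically verified", on a 0..1 continuum

    # If verified and bug-free were equated, the only possible curve would be
    # the straight line from (0,0) to (1,1):
    plt.plot(x, x, "k--", label="verified == bug-free")

    # What is actually assumed: monotone, positive curves whose shape is left open.
    plt.plot(x, x**2, label="one possible shape")
    plt.plot(x, np.sqrt(x), label="another possible shape")

    plt.xlabel("degree of static verification")
    plt.ylabel("degree of bug-freedom")
    plt.legend()
    plt.show()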


1) verifying code is harder than writing it and

2) verifying code requires domain knowledge, which implies that the utility of these models is limited to things I could write myself if I weren't too lazy. That's hugely constricting.

