> although computers themselves are mostly infallible
What do you mean? Hardware is fallible too, just less often than software. This can cause problems on its own, e.g. bit flips in non-ECC memory, or HDDs that lie (report that the cache was flushed before the data is actually written). Hardware can also trigger software errors: hardware can fail at a random moment, and the software may not be designed to handle that properly.
> "you must have ECC in all your computers otherwise they will constantly and silently corrupt your data + crash" and "statistically speaking, most computers in the world do not use ECC". How can both of these things be true?
What makes you say that? Both those things being true simply means that most computers silently corrupt your data and crash. That matches my experience. My programs occasionally crash. My pictures and videos are occasionally corrupted.
Do I know that those events are caused by memory failures? No. Most of them are probably other sorts of software or hardware failures, but some could be memory errors.
That's not a good excuse. The same memory gets corrupted on machines processing our online money transfers, and yet we don't see problems when buying things online. Why? Because there are layers upon layers of data-integrity validation, so that even if something gets corrupted, no invalid payment is made.
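To illustrate the "layers of validation" idea, here's a minimal sketch (purely illustrative, not how any real payment system works): attach a digest to every message and refuse to act on anything that doesn't verify.

```python
import hashlib

def with_checksum(payload: bytes) -> bytes:
    # Append a SHA-256 digest so any later corruption is detectable
    return payload + hashlib.sha256(payload).digest()

def verify(message: bytes) -> bytes:
    # Split off the 32-byte digest and recompute it over the payload
    payload, digest = message[:-32], message[-32:]
    if hashlib.sha256(payload).digest() != digest:
        raise ValueError("integrity check failed, refusing to process")
    return payload

msg = with_checksum(b'{"amount": 100, "to": "acct-42"}')
assert verify(msg) == b'{"amount": 100, "to": "acct-42"}'

# Simulate a bit flip in the payload: verification rejects the message
corrupted = bytearray(msg)
corrupted[5] ^= 0x40
try:
    verify(bytes(corrupted))
    print("corruption went undetected")
except ValueError:
    print("corruption detected, payment not made")
```

A plain hash only catches accidental corruption; real payment systems additionally use cryptographic MACs or signatures against tampering, but the layering principle is the same.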
Crashes might not matter, but silent data corruption does. The owner/user of that data will care when they eventually discover that it at some point mysteriously got corrupted.
> Software under normal circumstances is remarkably resilient to having its memory corrupted
Not really? Consider anything that uses hashing of some sort: there, by design, a single bit flip produces a completely different output, which is the opposite of resilience.
And "resilience" isn't a precise enough term anyway. Is it just recovering after errors due to flips? Or is it guaranteeing that the operation yields correct output (which implies not crashing)? The latter is far harder.
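To make the hashing point concrete, here's a quick demonstration of the avalanche effect: flipping a single input bit yields a completely different SHA-256 digest.

```python
import hashlib

data = bytearray(b"some important record")
before = hashlib.sha256(bytes(data)).hexdigest()

data[0] ^= 0x01  # flip a single bit of the input
after = hashlib.sha256(bytes(data)).hexdigest()

print(before)
print(after)
# One flipped input bit changes roughly half the output bits
assert before != after
```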
> haven't personally seen any kind of data corruption in motion
Ever had a program crash, hang, or act oddly? That's how data corruption in memory surfaces.
Of course, non-perfect programs (i.e. all of them) act the same way, which means that differentiating memory corruption from misbehaving programs is hard.
Fixing the memory errors will result in a more stable system, but it still won't be perfect.
>In an embedded system such a crash may be as bad as the incorrect access itself.
I don't agree on this point. An incorrect access on an embedded system has the potential to cause all kinds of horribly subtle bugs involving memory corruption. A simple crash is generally much better.
Hardware can misbehave. If the kernel can detect that, it is reasonable to shut down the machine and prevent data corruption. I can't think of a kernel developer who would laugh at that.
> True, but hardware failures are known to happen due to Linux misbehavior
[citation needed]
In the past, Linux was blamed for memory failure because it exposed bad memory when trying to make use of it, and Windows on the same machine didn't. But it was not Linux's fault.
If you edit images or videos, maybe you notice a small corruption in an image. If you use databases or do data analysis, there may be one number that is wrong, or a string with one byte of garbage. Sometimes an application may crash.
All this is very rare. It only matters if you need data integrity and do work where data has value.
> haven't personally seen any kind of data corruption
How would you know? Unless your computer use has been literally trouble-free (and all your archived data has been verified for correctness somehow), you can't know that none of your glitches over the past 17 years has been due to memory errors.
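One way to actually know would be to record checksums of your archives and re-verify them later. A minimal sketch (the file layout and names in the test run are hypothetical):

```python
import hashlib
import pathlib

def sha256_file(path: pathlib.Path) -> str:
    # Hash a file in chunks so large archives don't need to fit in memory
    h = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(1 << 16), b""):
            h.update(chunk)
    return h.hexdigest()

def snapshot(root: pathlib.Path) -> dict:
    # Record a digest for every file under `root`
    return {str(p.relative_to(root)): sha256_file(p)
            for p in sorted(root.rglob("*")) if p.is_file()}

def changed_files(old: dict, new: dict) -> list:
    # Files whose digest differs (or which disappeared) since the old snapshot
    return [name for name in old if old.get(name) != new.get(name)]
```

Store the snapshot somewhere separate from the data itself, re-run it periodically, and investigate anything in `changed_files`. This detects corruption; it can't tell you whether a bit flip, a disk error, or a buggy program caused it.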
Desktop computers are sometimes used for actual work where data integrity is important.
Only a small minority of main-memory data corruptions lead to OS crashes; mostly, the in-memory application or filesystem data just gets silently corrupted.
While a nifty idea, corruption of this sort is so rare and so unbounded (that is, there's no reason to believe it'll strike your incoming data; it could just as well strike the CPU instructions themselves, or who knows where) that there's not much you can do about it from inside the code. It's all but impossible to deal with corruption rates on the order of 1 in 10^18 instructions (or better! properly functioning hardware is obscenely reliable at doing what it was designed to do [1]) on properly functioning hardware, and all but impossible to deal with failing instructions at a much higher rate on malfunctioning hardware, except to replace it with functioning hardware.
[1]: If anyone wants to pop up with complaints about that statement, remember that properly functioning hardware is also doing a lot of things very quickly, so it has a lot of chances to fail. ECC RAM is important, for instance, because something that only happens every few billion accesses may still happen several times a day. But this is still an absolutely obscene degree of reliability. Most disciplines would laugh at worrying about something at that rate of occurrence... they wouldn't even be able to detect it.
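To put rough numbers on that footnote (every figure below is an illustrative assumption, not a measurement):

```python
# Back-of-envelope: "rare per access" can still be "frequent per day".
# All numbers are assumed for illustration only.
accesses_per_second = 1e9     # assumed effective memory access rate
failures_per_access = 1e-13   # assumed: one failure per ten trillion accesses
seconds_per_day = 86_400

failures_per_day = accesses_per_second * failures_per_access * seconds_per_day
print(failures_per_day)  # several events per day despite the tiny per-access rate
```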
> There's simply no structural reason for power failure (or application crashes) to be able to put hard disk data into an inconsistent, corrupt state.
No structural reason no, but a lot of other reasons. You can install your favourite atomic file operations library in your PL of choice and run some benchmarks to identify reason #1.
This one does pervasive damage to software quality though. You get a bug report, it doesn't reproduce easily. The thought is always there: "maybe their hardware had a transient error?". If it did you have no bug to find. That's attractive.
If storage drives reported writes accurately and memory didn't occasionally silently corrupt, software falling over would be more likely to imply an error the developer is empowered to fix.
I've seen a ton of it in the field. In general, when RAM stability is that bad, large operating systems fail to boot because of the corruption, so most of the time it never gets to the data-corruption phase.
Can you make that statement with any certainty? My personal and family computers have crashed quite a few times, and have corrupted photos and files, some of them valuable (taxes, healthcare records, and so on; personal computers hold valuable data these days).
I couldn't tell, as a user, which of those corruptions and crashes were caused by bit flips. Could you?