Brevenotas Linus Torvals carga contra Intel por la ECC en las memorias ram

linus-eff-you-ram-800x454.png

me encantan estas polémicas


Antes que nada, pongámonos en contexto. Los módulos de memorias RAM que utilizan los servidores y sistemas "empresariales" son más caros que los que puedes comprar para los sistemas de escritorio (tu computador personal, tu notebook), además de que probablemente no sean compatibles. La razón de eso es que los módulos de memoria RAM para servidores incluyen un bit extra de corrección de errores y eventualmente avisar si el módulo de memoria RAM está defectuoso. Además, queridos amigos, las memorias con ECC son algo más caras que las memorias normales y no soportan overclocking, pero de que ayudan a la estabilidad y confiabilidad de los sistemas, ayudan. Buenísimo, ¿no?

Pues bien, Linus Torvalds, una prócer rebelde en el mundo de la informática, despotrica en contra de Intel por no homologar su arquitectura de procesadores para que los sistemas de escritorio puedan utilizar menoria RAM con ECC puesto que dicen que "no es necesario, la actual ram es sufucientemente buena para el hogar". También dice que los argumentos de "la economía" de parte de los fabricantes de hardware para no masificar las memorias ECC no tienen sustento técnico.

Como para meterle más caña a la queja, dice que AMD ya la incorporó (AMD con sus Ryzen soporta ECC), ¿por qué no Intel?

No olvidemos que su opinión tiene muchísimo peso pues es padre del kernel de Linux y por lo tanto ve debajo del agua.



ECC absolutely matters.

ECC availability matters a lot - exactly because Intel has been instrumental in killing the whole ECC industry with it's horribly bad market segmentation.

Go out and search for ECC DIMMs - it's really hard to find. Yes - probably entirely thanks to AMD - it may have been gotten slightly better lately, but that's exactly my point.

Intel has been detrimental to the whole industry and to users because of their bad and misguided policies wrt ECC. Seriously.

And if you don't believe me, then just look at multiple generations of rowhammer, where each time Intel and memory manufacturers bleated about how it's going to be fixed next time.

Narrator: "No it wasn't".

And yes, that was - again - entirely about the misguided and arse-backwards policy of "consumers don't need ECC", which made the market for ECC memory go away.

The arguments against ECC were always complete and utter garbage. Now even the memory manufacturers are starting do do ECC internally because they finally owned up to the fact that they absolutely have to.

And the memory manufacturers claim it's because of economics and lower power. And they are lying bastards - let me once again point to row-hammer about how those problems have existed for several generations already, but these f*ckers happily sold broken hardware to consumers and claimed it was an "attack", when it always was "we're cutting corners".

How many times has a row-hammer like bit-flip happened just by pure bad luck on real non-attack loads? We will never know. Because Intel was pushing shit to consumers.

And I absolutely guarantee they happened. The "modern DRAM is so reliable that it doesn't need ECC" was always a bedtime story for children that had been dropped on their heads a bit too many times.

We have decades of odd random kernel oopses that could never be explained and were likely due to bad memory. And if it causes a kernel oops, I can guarantee that there are several orders of magnitude more cases where it just caused a bit-flip that just never ended up being so critical.

Yes, I'm pissed off about it. You can find me complaining about this literally for decades now. I don't want to say "I was right". I want this fixed, and I want ECC.

And AMD did it. Intel didn't.

> I don't really see AMD's unofficial ECC support being a big deal.

I disagree. The difference between "the market for working memory actually exists" and "screw consumers over by selling them subtly unreliable hardware" is an absolutely enormous one.

And the fact that it's "unofficial" for AMD doesn't matter. It works. And it allows the markets to - admittedly probably very slowly - start fixing themselves.

But I blame Intel, because they were the big fish in the pond, and they were the ones that caused the ECC market to basically implode over a couple of decades.

ECC DRAM (or just parity) used to be standard and easily accessible back when. ECC and parity isn't a new thing. It was literally killed by bad Intel policies.

And don't let people tell you that DRAM got so reliable that it wasn't needed. That was never ever really true. See above.

Linus
 

bighead

Cabro Cooleao
Me acuerdo que los PC viejos, asi como con 8088/8086, venían con el bit extra porque la placa madre era tan callampa que no aguantaba ni un solo día sin perder un bit. Ahora podría volver a notarse, no por el hardware malo, sino porque se ha vuelto más "doméstico" mantener encendido el PC dias y dias. De hecho el apagado hibrido de windows no es más que una suspensión a RAM con algunos trucos.
Ahora, no es tan "trivial" esto de corregir errores. Castiga harto dependiendo del polinomio que se ocupe para corregir y ciertamente, si no vas a estar dispuesto a sacrificar velocidad por esta confiabilidad extra, "suspend to disk" parece ser una alternativa mas aceptable.
 

Vorphalack

100tifiko
linus tiene algo ya personal contra intel, pero con buenos argumentos, hace rato también cuestionó las instrucciones AVX-512 diciendo que es puro tollo, en ciencias hay algunas herramientas que usan esta instrucción pero la mejora no es como pa justificar gasto solo por esto

 
Subir