Linus blaming Intel for lack of ECC in consumer systems

This is something I quite agree with. I gave up complaining about 20 years ago but would welcome a change: <https://arstechnica.com/gadgets/2021/01/linus-torvalds-blames-intel-for-lack-of-ecc-ram-in-consumer-pcs/> <https://www.realworldtech.com/forum/?threadid=198497&curpostid=198647>

On 2021-01-16 10:33 p.m., D. Hugh Redelmeier via talk wrote:
This is something I quite agree with. I gave up complaining about 20 years ago but would welcome a change:
As a computer tech, having worked with computers such as the VAX 11/780 and Data General Eclipse, before I ever touched a PC, I'd have to agree.

On 1/16/21 10:41 PM, James Knott via talk wrote:
On 2021-01-16 10:33 p.m., D. Hugh Redelmeier via talk wrote:
This is something I quite agree with. I gave up complaining about 20 years ago but would welcome a change:
As a computer tech, having worked with computers such as the VAX 11/780 and Data General Eclipse, before I ever touched a PC, I'd have to agree.
The other two that annoy me are trying to lock down the number of PCI-E lanes or amount of memory. At least AMD fixed that with Threadripper Pro. Intel really needs to stop letting their marketing team create artificial limits in the product stack. My predication is if ECC were more mainstream actually prices for it would probably stop being 2-3x or even more like they are currently. A lot of people I know if they can use ECC choose not to due to this issue and it being mainstream would help with this. Unfortunately, Intel isn't the only company that creates artificial limits in their hardware stack. It's a good example of why having your marketing team run the product stack is a bad idea, sign. Linus should be complaining about these limits and it's a wonder he hasn't been more vocal about it. At least AMD tries their best to not limit the stack if possible for example board compatibility is far better so long as you flash a new BIOs for the next generation chips and even listened when users wanted more support but they were going to stop first/second generation Ryzen if I recall correctly with new chips, Nick
--- Post to this mailing list talk@gtalug.org Unsubscribe from this mailing list https://gtalug.org/mailman/listinfo/talk

| From: Nicholas Krause via talk <talk@gtalug.org> | The other two that annoy me are trying to lock down the number of PCI-E | lanes or amount of memory. At least AMD fixed that with Threadripper | Pro. Intel really needs to stop letting their marketing team create | artificial limits in the product stack. If the vendor chooses to segment their market by leaving out zero-cost features for the lower priced product, that's a demonstration of monopoly power in the market. Intel has done that a lot. There is a plausible excuse in some cases. If all the cost is in engineering and not in manufacturing, it may make sense to only recoup the engineering costs from those who need it. AMD, when it has been the underdog, has often thrown a lot of features in without extra charge. This has sometimes appeared to drive Intel to move broadly provide the feature (i.e. when Intel appeared to be threatened by competition with AMD). An example that irked me was virtualization hardware. For the first few years, Intel only provided the feature on an odd subset of chips that were expensive. Whenever you were considering a system, you had to consult the spec sheet for the CPU. AMD provided it in all new processors after a they had engineered it. Other examples that come to mind: - Instructions to accelerate cryptography (eg. AES). - PCI lanes, as Nicholas mentioned. - physical address bits leaving the CPU bus - 64-bit mode on x86 Interestingly, there are examples of the opposite behaviour that also indicate to me monopoly power. - Intel engineered a fairly powerful and intricate MMU for the 80286. They essentially forced all customers to pay for it, even though almost nobody used it during the service life of the processors bought in the first years. Motorola spent very roughly the same amount of silicon to implement a much more elegant 32-bit processor (the mc68000). - Intel, a generation later, made customers buy a 32-bit processor (the 80386) when all that they'd use was the 16-bit subset. In both those cases, the evidence is muddied because it was Microsoft's monopoly behaviour that left hardware capability unused. Famously, it was the latent power of the 80386 which incentivized Linus to produce Linux. In general, UNIX could and did exploit both the MMU of the 286 and the 32-bit architecture of the 386 long before Microsoft software. | My predication is if ECC were | more mainstream actually prices for it would probably stop being 2-3x | or even more like they are currently. ECC is generally provided by one extra bit per byte of memory. So the cost premium should be no more than a factor of 9/8. It should be even less since the cost of the board should not increase by that much, only the RAM. | At least AMD tries their best to not limit the stack if possible for | example board compatibility is far better so long as you flash a new | BIOs for the next generation chips and even listened when users | wanted more support but they were going to stop first/second generation | Ryzen if I recall correctly with new chips, I imagine that they are incentivized into this behaviour by their market position. There is a pattern of underdogs changing if they become dominant. It's not driven by morality, it's driven by game theory. That's why anti-trust enforcement is so important for healthy free markets. Something that has faded away in the last fifty years.

On Sun, Jan 17, 2021 at 10:26:07AM -0500, D. Hugh Redelmeier via talk wrote:
If the vendor chooses to segment their market by leaving out zero-cost features for the lower priced product, that's a demonstration of monopoly power in the market.
Intel has done that a lot.
Oh boy have they ever. :)
There is a plausible excuse in some cases. If all the cost is in engineering and not in manufacturing, it may make sense to only recoup the engineering costs from those who need it.
AMD, when it has been the underdog, has often thrown a lot of features in without extra charge. This has sometimes appeared to drive Intel to move broadly provide the feature (i.e. when Intel appeared to be threatened by competition with AMD).
An example that irked me was virtualization hardware. For the first few years, Intel only provided the feature on an odd subset of chips that were expensive. Whenever you were considering a system, you had to consult the spec sheet for the CPU. AMD provided it in all new processors after a they had engineered it.
I didn't help how many companies left the feature disabled in the BIOS and in some cases didn't even provide an option to enable it. Supposedly because it might cause someo compatibility issues, although I don't believe it ever has for anyone.
Other examples that come to mind:
- Instructions to accelerate cryptography (eg. AES).
- PCI lanes, as Nicholas mentioned.
Hmm, I can't recall cases of this, but I may just have missed it.
- physical address bits leaving the CPU bus
- 64-bit mode on x86
I have seen some intel CPUs that the official specs say don't have 64 bit support, but every actual chip I have ever seen had 64 bit support. So is the chip manufactured wrong because they didn't turn it off or was the spec writen wrong? Can you use it and trust the chip you buy next month will still have it or not?
Interestingly, there are examples of the opposite behaviour that also indicate to me monopoly power.
- Intel engineered a fairly powerful and intricate MMU for the 80286. They essentially forced all customers to pay for it, even though almost nobody used it during the service life of the processors bought in the first years.
It's design flaws hurt it a lot. Not being able to leave proteceted mode once you entered was apparently a big problem for operating systems.
Motorola spent very roughly the same amount of silicon to implement a much more elegant 32-bit processor (the mc68000).
Well they had the benefit of not being backwards compatible with anything else. Starting from scratch can be a big advantage.
- Intel, a generation later, made customers buy a 32-bit processor (the 80386) when all that they'd use was the 16-bit subset.
At least the 386 had a much better protected mode design, although ram was probably too expensive still for most people to have much use for the 386's features. And it was able to help out dos in 16 bit mode using the MMU to allow TSRs and such to be loaded outside 640k and was able to pretend extra ram was EMS or XMS which was likely much cheaper than the special EMS memory boards people had been using before.
In both those cases, the evidence is muddied because it was Microsoft's monopoly behaviour that left hardware capability unused.
Famously, it was the latent power of the 80386 which incentivized Linus to produce Linux.
In general, UNIX could and did exploit both the MMU of the 286 and the 32-bit architecture of the 386 long before Microsoft software.
Yes unix on a 286 didn't have to care about going back to real mode so it was able to use it. Microsoft Xenix 286 came out in 1984 which is only 2 years after intel introduced the 286 to the market. That's pretty quick support from Microsoft.
ECC is generally provided by one extra bit per byte of memory. So the cost premium should be no more than a factor of 9/8. It should be even less since the cost of the board should not increase by that much, only the RAM.
Yeah 10% extra at most would seem reasonable.
I imagine that they are incentivized into this behaviour by their market position.
There is a pattern of underdogs changing if they become dominant.
It's not driven by morality, it's driven by game theory.
That's why anti-trust enforcement is so important for healthy free markets. Something that has faded away in the last fifty years.
-- Len Sorensen

On 2021-01-16 10:33 p.m., D. Hugh Redelmeier via talk wrote:
This is something I quite agree with. I gave up complaining about 20 years ago but would welcome a change:
<https://arstechnica.com/gadgets/2021/01/linus-torvalds-blames-intel-for-lack-of-ecc-ram-in-consumer-pcs/> <https://www.realworldtech.com/forum/?threadid=198497&curpostid=198647> ---
Letting a cost accountant into the building is often a business risk (;-)) A disgusted tech support person said "Whenever our app fails, we run a checksum of it, and usually end up telling the customer to delete it and reinstall". I remember when ZFS shipped, and customers were discovering bit-rot because it checksummed /everything/. --dave -- David Collier-Brown, | Always do right. This will gratify System Programmer and Author | some people and astonish the rest davecb@spamcop.net | -- Mark Twain
participants (5)
-
D. Hugh Redelmeier
-
David Collier-Brown
-
James Knott
-
lsorense@csclub.uwaterloo.ca
-
Nicholas Krause