| Author |
Message |
Nathan Bates
Guest
|
Posted:
Wed Oct 12, 2005 12:15 am Post subject:
Pentium M to become THE CPU |
|
|
Pentium M has all the right ingredients for total world domination:
low power consumption, short pipeline stages, hi-performance.
Pentium M will kill its brother Pentium 4 and its bastard cousin
Athlon.
PowerPC is a Neanderthal that's nearing its end (Jobs figured that
out).
But ARM will survive due to its ultra-low power consumption and
elegance. |
|
|
 |
Casper H.S. Dik
Guest
|
Posted:
Wed Oct 12, 2005 12:15 am Post subject:
Re: Pentium M to become THE CPU |
|
|
"Nathan Bates" <nathanbates99@yahoo.com> writes:
[quote]Pentium M has all the right ingredients for total world domination:
low power consumption, short pipeline stages, hi-performance.
[/quote]
FSB?
Casper
--
Expressed in this posting are my opinions. They are in no way related
to opinions held by my employer, Sun Microsystems.
Statements on Sun products included here are not gospel and may
be fiction rather than truth. |
|
|
 |
daytripper
Guest
|
Posted:
Wed Oct 12, 2005 6:00 am Post subject:
Re: Pentium M to become THE CPU |
|
|
On 11 Oct 2005 22:01:25 GMT, Casper H.S. Dik <Casper.Dik@Sun.COM> wrote:
[quote]"Nathan Bates" <nathanbates99@yahoo.com> writes:
Pentium M has all the right ingredients for total world domination:
low power consumption, short pipeline stages, hi-performance.
FSB?
[/quote]
Picks up at the same point the "Netburst" chips left off, and then pushes
higher... |
|
|
 |
Oliver S.
Guest
|
Posted:
Wed Oct 12, 2005 6:04 am Post subject:
Re: Pentium M to become THE CPU |
|
|
[quote]Pentium M has all the right ingredients for total world domination:
low power consumption, short pipeline stages, hi-performance.
Pentium M will kill its brother Pentium 4 and its bastard cousin
Athlon.
PowerPC is a Neanderthal that's nearing its end (Jobs figured that
out).
But ARM will survive due to its ultra-low power consumption and
elegance.
[/quote]
That seems to be a religious issue for you; I think you should
consider visiting a therapist. |
|
|
 |
Rob Stow
Guest
|
Posted:
Wed Oct 12, 2005 8:10 am Post subject:
Re: Pentium M to become THE CPU |
|
|
Nathan Bates wrote:
[quote]Pentium M has all the right ingredients for total world domination:
low power consumption, short pipeline stages, hi-performance.
Pentium M will kill its brother Pentium 4 and its bastard cousin
Athlon.
PowerPC is a Neanderthal that's nearing its end (Jobs figured that
out).
But ARM will survive due to its ultra-low power consumption and
elegance.
[/quote]
Sigh. Another troll to plonk. |
|
|
 |
Ketil Malde
Guest
|
Posted:
Wed Oct 12, 2005 8:15 am Post subject:
Re: Pentium M to become THE CPU |
|
|
"Nathan Bates" <nathanbates99@yahoo.com> writes:
[quote]Pentium M has all the right ingredients for total world domination:
low power consumption, short pipeline stages, hi-performance.
[/quote]
Mediocre FP performance, few available motherboards, high price, no
SMP support?
For the price of a high end Pentium M, I can get a dual core AMD where
each core has equivalent integer performance and much better FP. Sure
Pentium M is attractive for some purposes, but total world domination
is still a way off, IMO.
-k
--
If I haven't seen further, it is by standing in the footprints of giants |
|
|
 |
nobody@nowhere.net
Guest
|
Posted:
Wed Oct 12, 2005 8:15 am Post subject:
Re: Pentium M to become THE CPU |
|
|
On Tue, 11 Oct 2005 21:00:01 -0400, daytripper
<day_trippr@REMOVEyahoo.com> wrote:
[quote]On 11 Oct 2005 22:01:25 GMT, Casper H.S. Dik <Casper.Dik@Sun.COM> wrote:
"Nathan Bates" <nathanbates99@yahoo.com> writes:
Pentium M has all the right ingredients for total world domination:
low power consumption, short pipeline stages, hi-performance.
FSB?
Picks up at the same point the "Netburst" chips left off, and then pushes
higher...
[/quote]
No matter how far you push a Ford, you'll get out of it a Mercury or
at best a Lincoln, but still not even close to BMW (OK, everyone may
have personal preferences, but MSRP speaks for itself - $50,525
Lincoln Town Car vs. $121,295 BMW 760Li - both top trim full size
sedans, data from Edmunds.com)
The whole point is that A64/Opteron has _no_ FSB. No matter how fast
the FSB is, it can't beat on-chip memory controller. And in SMP the
fastest Intel FSB doesn't scale up as well as Opteron's point-to-point
HT links.
NNN |
|
|
 |
Casper H.S. Dik
Guest
|
Posted:
Wed Oct 12, 2005 2:26 pm Post subject:
Re: Pentium M to become THE CPU |
|
|
"nobody@nowhere.net" <mygarbage2000@hotmail.com> writes:
[quote]The whole point is that A64/Opteron has _no_ FSB. No matter how fast
the FSB is, it can't beat on-chip memory controller. And in SMP the
fastest Intel FSB doesn't scale up as well as Opteron's point-to-point
HT links.
[/quote]
Indeed; the FSB is just about fast enough for one core; it becomes
a bottleneck at two cores.
Casper
--
Expressed in this posting are my opinions. They are in no way related
to opinions held by my employer, Sun Microsystems.
Statements on Sun products included here are not gospel and may
be fiction rather than truth. |
|
|
 |
Guest
|
Posted:
Wed Oct 12, 2005 4:15 pm Post subject:
Re: Pentium M to become THE CPU |
|
|
Andi Kleen wrote:
[quote]First the memory controller no matter if integrated or not is a
bottleneck for any CPU given a sufficiently fast workload. That's
simply because the DIMMs cannot keep up with the CPU.
I would say in practice for a normal desktop machine or a laptop
the limit is how much bandwidth two DIMMs can deliver.
[/quote]
90%+ of the time, the problem is NOT bandwidth, but Latency. The on-die
memory controller gets rid of all of the FSB (latency adding) cycles.
In Opteron, for example, the address associated with an L2 miss can
arrive at the memory controller in less than 2ns, and data arriving at
the pins from the DIMMs can arrive back at the CPU in a similar number.
On an FSB system, the L2 miss has to get synchronized to the FSB,
travel over that bus, get registered in the memory controller, get
scheduled, and have the address driven out to the DIMMs. A similar
process occurs on the way back. But the clincher is that the memory
controller is implemented in ASIC technology (think 500 MHz) rather
than CPU technology (think 3 GHz); so every little step of memory
controller processing is correspondingly slower.
[quote]For bandwidth a sufficiently fast FSB could supply enough bandwidth
to easily keep up with these two DIMMs.
Where it mainly loses against the integrated IMC+separate link is when there
is a lot of additional IO traffic too (but that tends to be small
compared to memory traffic except perhaps for 3d).
And in latency it is slower, of course. That is the big win
of the integrated memory controller. Even that can vary though -
e.g. if the FSB has enough bandwidth and the chipset a good memory
controller it could look reasonable again under high load (compared
to idle latency)
[/quote]
If the processor waits at any point because DRAM data has not arrived
and the CPU has nothing left to try to do, then you are in a latency
bound situation and the FSB loses. More bandwidth does not speed up
latency bound problems.
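
A toy model may make the latency-bound versus bandwidth-bound distinction concrete. This is only a sketch: the function name `miss_time_ns` and every number in it (100 ns latency, 64-byte lines, 6.4 and 12.8 GB/s) are illustrative assumptions, not figures from this thread or from any real system.

```python
def miss_time_ns(n_misses, latency_ns, line_bytes, bandwidth_gb_s, dependent):
    """Rough time to service n_misses cache-line misses (toy model).

    All parameters are illustrative assumptions, not measurements.
    """
    # 1 GB/s equals 1 byte/ns, so bytes / (GB/s) gives nanoseconds.
    transfer_ns = line_bytes / bandwidth_gb_s
    if dependent:
        # Pointer chasing: each miss needs the previous result,
        # so every miss pays the full latency plus its transfer.
        return n_misses * (latency_ns + transfer_ns)
    # Independent misses: assume perfect overlap, so latency is paid
    # once and the rest is limited by bandwidth.
    return latency_ns + n_misses * transfer_ns


if __name__ == "__main__":
    for dependent in (True, False):
        base = miss_time_ns(1000, 100.0, 64, 6.4, dependent)
        doubled = miss_time_ns(1000, 100.0, 64, 12.8, dependent)
        print(f"dependent={dependent}: "
              f"speedup from 2x bandwidth = {base / doubled:.2f}")
```

Under these assumptions, doubling the bandwidth barely changes the dependent (latency-bound) case, while the independent case speeds up by almost the full factor of two.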
In addition the on-die approach with the HyperTransport fabric
interconnect gives you the property that as you add CPUs, you also add
DRAM bandwidth and bisection bandwidth. A 4 Node Opteron system has ~4
times as much DRAM bandwidth as a 4 node Pentium (single) FSB system
and plenty of chip-to-chip bandwidth to route the data to where it is
needed.
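
The "~4 times" figure follows from simple counting, sketched here. The per-controller bandwidth of 6.4 GB/s is a made-up placeholder, and the model assumes the same controller bandwidth in both designs and ignores chip-to-chip routing overhead.

```python
def aggregate_dram_bandwidth(nodes, per_controller_gb_s, shared_fsb):
    """Total DRAM bandwidth of a multi-CPU system (toy model).

    per_controller_gb_s is a hypothetical value, identical for both designs.
    """
    if shared_fsb:
        # One chipset memory controller behind one shared bus:
        # adding CPUs adds no DRAM bandwidth.
        return per_controller_gb_s
    # One on-die controller per node: bandwidth grows with node count.
    return nodes * per_controller_gb_s


if __name__ == "__main__":
    numa = aggregate_dram_bandwidth(4, 6.4, shared_fsb=False)
    fsb = aggregate_dram_bandwidth(4, 6.4, shared_fsb=True)
    print(f"ratio for 4 nodes: {numa / fsb:.1f}")
```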
Mitch |
|
|
 |
Andi Kleen
Guest
|
Posted:
Wed Oct 12, 2005 4:15 pm Post subject:
Re: Pentium M to become THE CPU |
|
|
Casper H.S. Dik <Casper.Dik@Sun.COM> writes:
[quote]"nobody@nowhere.net" <mygarbage2000@hotmail.com> writes:
The whole point is that A64/Opteron has _no_ FSB. No matter how fast
the FSB is, it can't beat on-chip memory controller. And in SMP the
fastest Intel FSB doesn't scale up as well as Opteron's point-to-point
HT links.
Indeed; the FSB is just about fast enough for one core; it becomes
a bottleneck at two cores.
[/quote]
It's a bit more complicated I think:
First the memory controller no matter if integrated or not is a
bottleneck for any CPU given a sufficiently fast workload. That's
simply because the DIMMs cannot keep up with the CPU.
I would say in practice for a normal desktop machine or a laptop
the limit is how much bandwidth two DIMMs can deliver.
For bandwidth a sufficiently fast FSB could supply enough bandwidth
to easily keep up with these two DIMMs.
Where it mainly loses against the integrated IMC+separate link is when there
is a lot of additional IO traffic too (but that tends to be small
compared to memory traffic except perhaps for 3d).
And in latency it is slower, of course. That is the big win
of the integrated memory controller. Even that can vary though -
e.g. if the FSB has enough bandwidth and the chipset a good memory
controller it could look reasonable again under high load (compared
to idle latency)
For servers with multiple sockets, better IO and typically more DIMMs
that can deliver data in parallel it's a different chapter of course.
First sharing the FSB between multiple sockets is of course a
bottleneck, especially when the FSB isn't fast enough for even a
single dual core. And it also needs to carry additional processor
synchronization traffic. But then there is no rule that the FSB
has to be shared between multiple CPUs.
This only works for relatively small systems of course.
Given enough tweaks (higher frequency, split FSBs for multi socket
systems or even multiple cores on one socket) it might be some time
until the FSB setup runs really out of steam.
-Andi |
|
|
 |
Oliver S.
Guest
|
Posted:
Wed Oct 12, 2005 4:15 pm Post subject:
Re: Pentium M to become THE CPU |
|
|
[quote]Indeed; the FSB is just about fast enough for one core;
it becomes a bottleneck at two cores.
[/quote]
I'm afraid you're a victim of a common misconception about the advantages
of a ccNUMA architecture like that of the Opteron. The Opteron's NUMA is
less scalable than it looks, because for every cache-line load from
main memory, the Opteron has to broadcast a snoop message to *all* the
processors in the cc-domain (hopefully in parallel with the speculative load
from memory) to check whether one of the processors has a more recent
version of this cache line! With a shared FSB, every processor simply snoops
the cache-line loads of the other CPUs, and a processor satisfies another CPU's
burst request for a certain cache line before the chipset satisfies this load!
"Only" writes have a performance advantage on ccNUMA architectures like
that of the Opteron. If you want to avoid these snoop broadcasts, you would
have to connect all CPUs to a central crossbar that has duplicate tags for
every other CPU's cache; but that's a rather expensive technology. |
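
The scaling argument in this post can be put as a counting sketch. It only counts messages under the post's own description (one probe to every other CPU per cache-line load); it is not a description of the actual Opteron coherence protocol, and the function names and miss counts are hypothetical.

```python
def broadcast_probe_messages(n_cpus, misses_per_cpu):
    """Probes sent if every miss is broadcast to all other CPUs."""
    # Each miss on each CPU sends one probe to each of the other CPUs.
    return n_cpus * misses_per_cpu * (n_cpus - 1)


def shared_bus_transactions(n_cpus, misses_per_cpu):
    """Bus transactions if all CPUs snoop one shared bus."""
    # Each miss is a single bus transaction that every CPU observes,
    # but all of them are serialized on the same bus.
    return n_cpus * misses_per_cpu


if __name__ == "__main__":
    for n in (2, 4, 8):
        print(n, broadcast_probe_messages(n, 1000),
              shared_bus_transactions(n, 1000))
```

In this model the probe count grows with n * (n - 1) while the bus transaction count grows with n; the sketch says nothing about how long either kind of message takes.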
|
|
 |
Kelly Hall
Guest
|
Posted:
Wed Oct 12, 2005 4:15 pm Post subject:
Re: Pentium M to become THE CPU |
|
|
Nathan Bates wrote:
[quote]Pentium M has all the right ingredients for total world domination:
low power consumption, short pipeline stages, hi-performance.
[/quote]
I'm still banking on the 8051 - that thing just won't go down.
Kelly |
|
|
 |
Oliver S.
Guest
|
Posted:
Wed Oct 12, 2005 4:15 pm Post subject:
Re: Pentium M to become THE CPU |
|
|
[quote]First the memory controller no matter if integrated or not is a
bottleneck for any CPU given a sufficiently fast workload. That's
simply because the DIMMs cannot keep up with the CPU.
[/quote]
Right, but you should be aware that there are two flavours of this
aspect: bandwidth and latency.
[quote]I would say in practice for a normal desktop machine or a
laptop the limit is how much bandwidth two DIMMs can deliver.
[/quote]
The problem is in most cases the latency; not the bandwidth.
[quote]And in latency it is slower, of course.
That is the big win of the integrated memory controller.
[/quote]
Yes, that's the main-advantage of an integrated memory-controller.
[quote]Even that can vary though - e.g. if the FSB has enough bandwidth and
the chipset a good memory controller it could look reasonable again
under high load (compared to idle latency).
[/quote]
I don't think that a chipset memory-controller can keep up with an
integrated memory-controller in terms of latency.
[quote]First sharing the FSB between multiple sockets is of course a
bottleneck, ...
[/quote]
That's not as obvious as one might think:
<434d2742$0$33556$892e7fe2@authen.white.readfreenews.net>
[quote]Given enough tweaks (higher frequency, split FSBs for multi socket
systems or even multiple cores on one socket) it might be some time
until the FSB setup runs really out of steam.
[/quote]
I think that a technology which is common in large ccNUMA multiprocessor
systems will gain importance in PC-SMP-systems in the future: duplicate
tags in the chipset or attached to the chipset. |
|
|
 |
Oliver S.
Guest
|
Posted:
Wed Oct 12, 2005 4:15 pm Post subject:
Re: Pentium M to become THE CPU |
|
|
[quote]90%+ of the time, the problem is NOT bandwidth, but Latency. The on-die
memory controller gets rid of all of the FSB (latency adding) cycles.
[/quote]
Right! On my simple Athlon-XP 1400+ with a SiS-745-chipset, loading a
cacheline, from the request until the data is available in a CPU
register, takes about 320 clock-cycles!!!
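
To compare cycle counts like this with the nanosecond figures quoted in the thread, a small conversion helper is enough. The clock frequencies below are assumptions for illustration: the post does not state the actual core clock, so 1.6 GHz is a placeholder, and 3 GHz is taken from the "think 3 GHz" remark quoted in this thread.

```python
def cycles_to_ns(cycles, clock_ghz):
    """Convert a CPU cycle count to nanoseconds at the given clock."""
    return cycles / clock_ghz


def ns_to_cycles(ns, clock_ghz):
    """Convert nanoseconds to CPU cycles at the given clock."""
    return ns * clock_ghz


if __name__ == "__main__":
    # Hypothetical 1.6 GHz core clock for the 320-cycle figure.
    print(f"320 cycles at 1.6 GHz: {cycles_to_ns(320, 1.6):.0f} ns")
    # The 2 ns figure seen from a 3 GHz core.
    print(f"2 ns at 3 GHz: {ns_to_cycles(2, 3.0):.0f} cycles")
```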
[quote]In Opteron, for example, the address associated with an L2 miss can
arrive at the memory controller in less than 2ns, and data arriving at
the pins from the DIMMs can arrive back at the CPU in a similar number.
[/quote]
Yes, but you have to consider the speculative snoops to other CPUs in
the ccNUMA domain also!
[quote]But the clincher is that the memory controller is implemented in ASIC
technology (think 500 MHz) rather than CPU technology (think 3 GHz);
[/quote]
I don't believe that this is the major latency-factor here.
[quote]A 4 Node Opteron system has ~4 times as much DRAM bandwidth as a 4
node Pentium (single) FSB system and plenty of chip-to-chip bandwidth
to route the data to where it is needed.
[/quote]
It has about four times the store-bandwidth - but not the load-bandwidth
due to speculative snoops. |
|
|
 |
Jens Meyer
Guest
|
Posted:
Wed Oct 12, 2005 4:15 pm Post subject:
Re: Pentium M to become THE CPU |
|
|
You forgot to consider a major latency-factor: the cacheline-size. The
P4 has a stupid cacheline-size of 128 bytes (16 times the bus-width!)
in the L2- and L2-caches, whereas the P3, the Pentium-M and all Athlons
have a more reasonable cacheline-size of 64 bytes on all cache-levels. |
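
The "16 times the bus-width" remark implies an 8-byte (64-bit) data bus, and the effect on fill time can be sketched as below. The transfer rate of 800 MT/s is a hypothetical value chosen only to make the arithmetic visible; the helper names are likewise made up for this example.

```python
def beats_per_line(line_bytes, bus_bytes):
    """Number of bus transfers needed to move one cache line."""
    if line_bytes % bus_bytes != 0:
        raise ValueError("line size must be a multiple of the bus width")
    return line_bytes // bus_bytes


def line_transfer_ns(line_bytes, bus_bytes, mega_transfers_per_s):
    """Time to stream a whole line over the bus, ignoring access latency."""
    # MT/s divided by 1000 gives transfers per nanosecond.
    return beats_per_line(line_bytes, bus_bytes) * 1000 / mega_transfers_per_s


if __name__ == "__main__":
    for line in (64, 128):
        print(line, beats_per_line(line, 8), line_transfer_ns(line, 8, 800))
```

With any fixed transfer rate, a 128-byte line takes twice as long to stream as a 64-byte line, which adds to the time until the full line is in the cache.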
|
|
 |