PA-Semi
CASTalk.com Forum Index CASTalk.com
Discussion of DSP, FPGA, storage and embedded system.
 
 FAQFAQ   MemberlistMemberlist     RegisterRegister 
 ProfileProfile   Log in to check your private messagesLog in to check your private messages   Log inLog in 
 
Google
 
Web castalk.com
PA-Semi

 
Post new topic   Reply to topic    CASTalk.com Forum Index -> Computer Architecture
Author Message
Rupert Pigott
Guest





Posted: Wed Oct 26, 2005 12:15 am    Post subject: PA-Semi Reply with quote

Anyone else somewhat gobsmacked by the SPECfp/Watt *per socket*
that the Inquirer claims for the PA Semi chip ? Specifically ~2000
SPECfp per core, two cores per chip. PA Semi claim 5-13W power
consumption for a chip. That appears to include a whole chunk of
stuff like PCIe and 10GigE ports too...

Cigarette packet math indicates that is roughly a factor of 10
improvement over the current high-end cores as far as FLOPS/Watts
goes. Pretty bold claims, it will be interesting to see what the
silicon actually does. If it comes up to scratch I wouldn't be
surprised to see those things getting slotted into a BG/L chassis.

Anyone smelling rats or are you guys quiet because you're checking
your grapevines ? :)

Cheers,
Rupert
Back to top
Felger Carbon
Guest





Posted: Wed Oct 26, 2005 12:15 am    Post subject: Re: PA-Semi Reply with quote

"Rupert Pigott" <darkboong@hotmail.com> wrote in message
news:1130282261.517707.241570@g14g2000cwa.googlegroups.com...
Quote:
Anyone else somewhat gobsmacked by the SPECfp/Watt *per socket*
that the Inquirer claims for the PA Semi chip ? Specifically ~2000
SPECfp per core, two cores per chip. PA Semi claim 5-13W power
consumption for a chip. That appears to include a whole chunk of
stuff like PCIe and 10GigE ports too...

Cigarette packet math indicates that is roughly a factor of 10
improvement over the current high-end cores as far as FLOPS/Watts
goes. Pretty bold claims, it will be interesting to see what the
silicon actually does. If it comes up to scratch I wouldn't be
surprised to see those things getting slotted into a BG/L chassis.

Anyone smelling rats or are you guys quiet because you're checking
your grapevines ? :)

SP or DP FLOPS?
Back to top
Jason Lee Eckhardt
Guest





Posted: Wed Oct 26, 2005 2:25 am    Post subject: Re: PA-Semi Reply with quote

In article <1130282261.517707.241570@g14g2000cwa.googlegroups.com>,
Rupert Pigott <darkboong@hotmail.com> wrote:
Quote:
Anyone else somewhat gobsmacked by the SPECfp/Watt *per socket*
that the Inquirer claims for the PA Semi chip ? Specifically ~2000
SPECfp per core, two cores per chip. PA Semi claim 5-13W power
consumption for a chip. That appears to include a whole chunk of
stuff like PCIe and 10GigE ports too...

Assuming they actually execute well on their development
plan (a big assumption), will these numbers be impressive at
production relative to competitors at the same time? They
won't even sample until Q306:

http://www.eetimes.com/news/semi/showArticle.jhtml?articleID=172303390

"The first implementation of the companys architecture, a dual-core
version, is expected to sample in the third calendar quarter of 2006,
with single-core and quad-core versions due in early and late 2007,
respectively, and an eight-core version planned for 2008. However,
the company did not say what manufacturing process technology the
company is targeting and how much of the performance increase is
expected to come from moving down the miniaturization roadmap, a
strategy open to rival processor vendors."

Quote:

Cigarette packet math indicates that is roughly a factor of 10
improvement over the current high-end cores as far as FLOPS/Watts
goes. Pretty bold claims, it will be interesting to see what the
silicon actually does. If it comes up to scratch I wouldn't be
surprised to see those things getting slotted into a BG/L chassis.

Anyone smelling rats or are you guys quiet because you're checking
your grapevines ? :)

Cheers,
Rupert
Back to top
Greg Lindahl
Guest





Posted: Wed Oct 26, 2005 5:09 am    Post subject: Re: PA-Semi Reply with quote

In article <1130282261.517707.241570@g14g2000cwa.googlegroups.com>,
Rupert Pigott <darkboong@hotmail.com> wrote:

Quote:
Anyone else somewhat gobsmacked by the SPECfp/Watt *per socket*
that the Inquirer claims for the PA Semi chip ? Specifically ~2000
SPECfp per core, two cores per chip.

If true, it would be a nice improvement over the Broadcom 1250/1480,
but nowhere near a factor of 10. Why would you be surprised that a
design optimized for low power is much better per watt than ones which
view high power as a necessary evil?

Interestingly enough, the founders of PA Semi overlap with the
founders of Sibyte, which was purchased by Broadcom.

-- greg
Back to top
John Mashey
Guest





Posted: Wed Oct 26, 2005 5:20 am    Post subject: Re: PA-Semi Reply with quote

Greg Lindahl wrote:

Quote:
Interestingly enough, the founders of PA Semi overlap with the
founders of Sibyte, which was purchased by Broadcom.

Well, some of them also did various DEC Alphas, and perhaps even more
to the point, StrongARM...
Back to top
Rupert Pigott
Guest





Posted: Wed Oct 26, 2005 4:01 pm    Post subject: Re: PA-Semi Reply with quote

Felger Carbon wrote:
Quote:
"Rupert Pigott" <darkboong@hotmail.com> wrote in message
news:1130282261.517707.241570@g14g2000cwa.googlegroups.com...
Anyone else somewhat gobsmacked by the SPECfp/Watt *per socket*

[SNIP]

Quote:
SP or DP FLOPS?

Fair question, I was looking at the SPECfp estimates. A visit
to SPEC.org can answer your question better than I. :)

Cheers,
Rupert
Back to top
Rupert Pigott
Guest





Posted: Wed Oct 26, 2005 4:15 pm    Post subject: Re: PA-Semi Reply with quote

Greg Lindahl wrote:
Quote:
In article <1130282261.517707.241570@g14g2000cwa.googlegroups.com>,
Rupert Pigott <darkboong@hotmail.com> wrote:

Anyone else somewhat gobsmacked by the SPECfp/Watt *per socket*
that the Inquirer claims for the PA Semi chip ? Specifically ~2000
SPECfp per core, two cores per chip.

If true, it would be a nice improvement over the Broadcom 1250/1480,
but nowhere near a factor of 10. Why would you be surprised that a
design optimized for low power is much better per watt than ones which
view high power as a necessary evil?

I am surprised because they are quoting *SPECfp* rather
than peak FLOPs. That suggests to me that they will are
aiming to deliver a very quick general purpose chip rather
than a DSP style core.

It appears that they claim 5W with the I/O subsytem idle.
To my mind 4000SPECfp in 5W *per socket* will be tough to
beat in 06 and 1H07.

Cheers,
Rupert
Back to top
Niels Jørgen Kruse
Guest





Posted: Wed Oct 26, 2005 4:15 pm    Post subject: Re: PA-Semi Reply with quote

Rupert Pigott <darkboong@hotmail.com> wrote:

Quote:
Anyone else somewhat gobsmacked by the SPECfp/Watt *per socket*
that the Inquirer claims for the PA Semi chip ? Specifically ~2000
SPECfp per core, two cores per chip.

The SPECfp is better than a PPC970 at the same clock, so they must be
getting a lot out of the IMC. I suppose they use a 128 byte cacheline?

--
Mvh./Regards, Niels Jørgen Kruse, Vanløse, Denmark
Back to top
Joe Seigh
Guest





Posted: Wed Oct 26, 2005 4:15 pm    Post subject: Re: PA-Semi Reply with quote

Rupert Pigott wrote:
Quote:
Anyone else somewhat gobsmacked by the SPECfp/Watt *per socket*
that the Inquirer claims for the PA Semi chip ? Specifically ~2000
SPECfp per core, two cores per chip. PA Semi claim 5-13W power
consumption for a chip. That appears to include a whole chunk of
stuff like PCIe and 10GigE ports too...
[...]


More information about a processor most of us will never see in a
general purpose computer.

http://www.theinquirer.net/?article=27233

Interesting comment about making the memory model strongly ordered.

--
Joe Seigh

When you get lemons, you make lemonade.
When you get hardware, you make software.
Back to top
Iain McClatchie
Guest





Posted: Wed Oct 26, 2005 10:13 pm    Post subject: Re: PA-Semi Reply with quote

Rupert> Anyone else somewhat gobsmacked by the SPECfp/Watt *per
Rupert> socket*

Yep, but somewhat less gobsmacked by the SpecInt2000 per core
("> 1000"). The Pentium M at 2260 MHz gets 1812, that's roughly
the same frequency, screwier instruction set, similar power
dissipation, shipping now.

I was at MPF yesterday, and I think they said they were on
TSMC 65nm, in which case 2 GHz is roughly a 20 inverter-3 delay
machine. The Pentium M has the advantage of running on a much
faster process, which explains the MHz advantage, but not the
per-MHz advantage.

My guess is that the per-MHz advantage comes from better branch
prediction, maybe better hardware memory prefetch prediction,
and definitely a closer L2 cache. This last issue is quite
curious.

On a dual-core Athlon, each core has it's own L2. The good
thing here is that, since there is no crossbar to cross (no
arbitration), the latency is lower. The annoying bit is that
half the L2 is on the other side of the crossbar (and you take
a miss latency before you see it) and that coherency is more
difficult. Despite longer latency to half the L2, the lower
latency to the half used more often is still better
performance. I doubt it would give them the extra 300
SpecInt2000, but it's probably a strong effect (I'd expect
more than 50 SpecInt2000).

So why didn't they do seperate L2s? It might be a power issue.
To check for a hit in the caches, you have to check both of
their tags. Checking two caches instead of one is sort of
like checking a 4-way associative cache instead of a 2-way.
Tag fetch width doubles, but the bit line loading on each
drops, so the power goes up, um, less than double. Hmm...
Here's a clue: in their presentation, Jim Keller said they
chose 2-way associative caches over 4-way as a tradeoff to
get less power.

The interesting thing here is that more associativity doesn't
*have* to cost more power. If you can predict which way is
most likely to hit, you can check the ways serially, and save
power that way. This works if the prediction is free. Free
predictions are available for most of the PA-Semi design
though. In the case of a core, it's most likely to hit in
the half of the L2 that it fills. In the case of an I/O,
it's most likely to hit in the same side of the L2 that the
I/O stream hit last time.

The more I think about this thing, the more I think AMD has
to get a better core done before Intel ships a Pentium M
with an integrated memory controller into the desktop market.
Back to top
Iain McClatchie
Guest





Posted: Thu Oct 27, 2005 12:15 am    Post subject: Re: PA-Semi Reply with quote

Iain> (I'd expect more than 50 SpecInt2000).

Hmph. A little more calculation and I got 0.07 CPI for their 22-cycle
L2, and a 1.0 CPI overall to make 1000 SpecInt2000. So if they could
shave 8 cycles by avoiding the crossbar, they'd see a 0.025 CPI
improvement, good for an extra 25 SpecInt2000, or about half what I
was guessing above.
Back to top
David Kanter
Guest





Posted: Thu Oct 27, 2005 8:15 am    Post subject: Re: PA-Semi Reply with quote

Joe Seigh wrote:
Quote:
Rupert Pigott wrote:
Anyone else somewhat gobsmacked by the SPECfp/Watt *per socket*
that the Inquirer claims for the PA Semi chip ? Specifically ~2000
SPECfp per core, two cores per chip. PA Semi claim 5-13W power
consumption for a chip. That appears to include a whole chunk of
stuff like PCIe and 10GigE ports too...
[...]

More information about a processor most of us will never see in a
general purpose computer.

http://www.theinquirer.net/?article=27233

Interesting comment about making the memory model strongly ordered.

I'll toot my own horn here:

http://www.realworldtech.com/page.cfm?ArticleID=RWT102405055354

DK
Back to top
Andrew Reilly
Guest





Posted: Thu Oct 27, 2005 1:36 pm    Post subject: Re: PA-Semi Reply with quote

On Wed, 26 Oct 2005 21:02:03 -0700, David Kanter wrote:

Quote:

Joe Seigh wrote:
Rupert Pigott wrote:
Anyone else somewhat gobsmacked by the SPECfp/Watt *per socket*
that the Inquirer claims for the PA Semi chip ? Specifically ~2000
SPECfp per core, two cores per chip. PA Semi claim 5-13W power
consumption for a chip. That appears to include a whole chunk of
stuff like PCIe and 10GigE ports too...
[...]

More information about a processor most of us will never see in a
general purpose computer.

http://www.theinquirer.net/?article=27233

Interesting comment about making the memory model strongly ordered.

I'll toot my own horn here:

http://www.realworldtech.com/page.cfm?ArticleID=RWT102405055354

Any thoughts on how that compares to the Freescale MPC6841D?
http://www.freescale.com/webapp/sps/site/overview.jsp?nodeId=0162468rH3bTdG7249

Is "slightly scaled up in all dimensions" a fair description?

What's the target market(s)?

--
Andrew
Back to top
Anton Ertl
Guest





Posted: Thu Oct 27, 2005 4:15 pm    Post subject: Re: PA-Semi Reply with quote

Andrew Reilly <andrew-newspost@areilly.bpc-users.org> writes:
Quote:
Any thoughts on how that compares to the Freescale MPC6841D?
http://www.freescale.com/webapp/sps/site/overview.jsp?nodeId=0162468rH3bTdG7249

Well, the e600 core of the MPC8641D is apparently the same as the core
used in various recent G4 Macs (MacMini, PowerBook, iBook). So, the
e600's properties compared to the PA6T are:

- It exists (at least in single-core chips, not sure about the 8641D).

- No 64-bit support.

- lower clock rate (currently 1.67GHz; I don't expect much speedup
from 65nm).

- much shorter pipeline (7 stages vs. 14).

- less instruction reordering.

- anton
--
M. Anton Ertl Some things have to be seen to be believed
anton@mips.complang.tuwien.ac.at Most things have to be believed to be seen
http://www.complang.tuwien.ac.at/anton/home.html
Back to top
Niels Jørgen Kruse
Guest





Posted: Fri Oct 28, 2005 12:15 am    Post subject: Re: PA-Semi Reply with quote

Anton Ertl <anton@mips.complang.tuwien.ac.at> wrote:

Quote:
Well, the e600 core of the MPC8641D is apparently the same as the core
used in various recent G4 Macs (MacMini, PowerBook, iBook). So, the
e600's properties compared to the PA6T are:

- It exists (at least in single-core chips, not sure about the 8641D).

- No 64-bit support.

- lower clock rate (currently 1.67GHz; I don't expect much speedup
from 65nm).

There are upgrade cards based on the 7447a clocked up to 2.0 GHz. The
7448 (e600 core) is coming out on 90nm, not 65nm.

Quote:
- much shorter pipeline (7 stages vs. 14).

- less instruction reordering.

- anton


--
Mvh./Regards, Niels Jørgen Kruse, Vanløse, Denmark
Back to top
 
Post new topic   Reply to topic    CASTalk.com Forum Index -> Computer Architecture All times are GMT
Page 1 of 1

 
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum




VoIP Electronics Powered by phpBB