PanFS?
CASTalk.com Forum Index CASTalk.com
Discussion of DSP, FPGA, storage and embedded system.
 
 FAQFAQ   MemberlistMemberlist     RegisterRegister 
 ProfileProfile   Log in to check your private messagesLog in to check your private messages   Log inLog in 
 
Google
 
Web castalk.com
PanFS?
Goto page 1, 2  Next
 
Post new topic   Reply to topic    CASTalk.com Forum Index -> Storage System
Author Message
Ernest Siu
Guest





Posted: Wed Dec 01, 2004 11:20 pm    Post subject: PanFS? Reply with quote

Any know know much about the solutions from Panasas? The PanFS looks
similar to clustered filesystem like CxFS, GFS, StorNext,
Lustre...etc. except it is object-based. Their solution seems to
include the backend storage, control blades, and some NFS/CIFS heads
for interop. What is the benefit of using them versus getting Lustre
and a regular SAN? Any idea?

Regards,
Ernest
Back to top
Faeandar
Guest





Posted: Thu Dec 02, 2004 12:50 am    Post subject: Re: PanFS? Reply with quote

On 1 Dec 2004 10:20:06 -0800, ernestsiu@yahoo.com (Ernest Siu) wrote:

Quote:
Any know know much about the solutions from Panasas? The PanFS looks
similar to clustered filesystem like CxFS, GFS, StorNext,
Lustre...etc. except it is object-based. Their solution seems to
include the backend storage, control blades, and some NFS/CIFS heads
for interop. What is the benefit of using them versus getting Lustre
and a regular SAN? Any idea?

Regards,
Ernest

Well first thing is don't think about Lustre, it's a beast still and
not ready for primetime. The others you mentioned are viable but the
leaders today seem to be Polyserve, Ibrix, GPFS, and GFS. Personally
I like Polyserve personally but Ibrix and GPFS both look good as well.

PanFS is very similar to Polyserve except they do it turnkey and in a
box. The box is proprietary (not necessarily bad) but from what I
understand also fairly expensive. Otherwise I've only heard good
things about them.

I would go for something a little more open. Any of the software
solutions plus my pick of hardware. I like the options. So if I want
to use HDS disk with qlogic controllers and Dell blades, I can. Not
so with PanFS as I understand it.

But if you want something all-in-one then they are likely the best in
town right now.

~F
Back to top
Wolf
Guest





Posted: Thu Dec 02, 2004 5:38 pm    Post subject: Re: PanFS? Reply with quote

"Faeandar" <mr_castalot@yahoo.com> wrote in message
news:l26sq0dbkfraja25cikhoc7qmara8mma6j@4ax.com...
Quote:
On 1 Dec 2004 10:20:06 -0800, ernestsiu@yahoo.com (Ernest Siu) wrote:

Any know know much about the solutions from Panasas? The PanFS looks
similar to clustered filesystem like CxFS, GFS, StorNext,
Lustre...etc. except it is object-based. Their solution seems to
include the backend storage, control blades, and some NFS/CIFS heads
for interop. What is the benefit of using them versus getting Lustre
and a regular SAN? Any idea?

Regards,
Ernest

Hey Guys. I run a large cluster with a lot of storage so that is my
perspective.

Quote:
Well first thing is don't think about Lustre, it's a beast still and
not ready for primetime.

Lustre is a lot of work-there is no doubt about it. There are no
real tools (GUIs etc) to help. But they make their money in
customization I figure. What is really a good thing about Lustre
is that it is very scaleable. Most of the GFSes I have seen like
SANs and most of our boxes are NAS boxes. Lustre will let
us manage all our boxes (OSTs).

Quote:
The others you mentioned are viable but the
leaders today seem to be Polyserve, Ibrix, GPFS, and GFS. Personally
I like Polyserve personally but Ibrix and GPFS both look good as well.

We have a small CX500 w/ 18TB and IBM 2.5GHZ dual Opteron/QLogic
heads for testing. We will be testing IBrix (GPFS and Lustre. I am going to
install RedHat's next week if all goes well with my other work. :)

IBrix has some really good features to help us manage our disks-including
a nice GUI. There are some sophisticated technical features for
fault-tolerance
and recovery you may want to look at.

Quote:
PanFS is very similar to Polyserve except they do it turnkey and in a
box. The box is proprietary (not necessarily bad) but from what I
understand also fairly expensive. Otherwise I've only heard good
things about them.

We have a fair bit of experience with the Panases product. It does not
work for us and we are looking elsewhere-but they are making serious
in-roads. One of my issues is getting as much of my disk servers off NFS
as I can. With it is that I have 150TB (we just ordered another 30+) I
won't be able to get of NFS with.

Performance was really good.

Quote:
I would go for something a little more open. Any of the software
solutions plus my pick of hardware. I like the options. So if I want
to use HDS disk with qlogic controllers and Dell blades, I can. Not
so with PanFS as I understand it.

Pretty much, but then if it works who cares?

Quote:

But if you want something all-in-one then they are likely the best in
town right now.

I would recommend you test, test, test before plucking down $$$. Also
you may want to look at Terragrid from Terrascale. Performance is also
really good.

Anyway, that's my two cents.
--
Wolf
----------------------------------------------------------------
Please post all responses to UseNet. All email cheerfully and automagically
routed to Dave Null
Back to top
Keith Michaels
Guest





Posted: Fri Dec 03, 2004 12:46 am    Post subject: Distributed filesystems (was: Re: PanFS?) Reply with quote

In article <Z0Erd.35310$bP2.18751@newssvr12.news.prodigy.com>,
"Wolf" <wolfdotcom@hotmail.com> writes:
|> "Faeandar" <mr_castalot@yahoo.com> wrote in message
|> news:l26sq0dbkfraja25cikhoc7qmara8mma6j@4ax.com...
|> > On 1 Dec 2004 10:20:06 -0800, ernestsiu@yahoo.com (Ernest Siu) wrote:
|> >
|> > >Any know know much about the solutions from Panasas? The PanFS looks
|> > >similar to clustered filesystem like CxFS, GFS, StorNext,
|> > >Lustre...etc. except it is object-based. Their solution seems to
|> > >include the backend storage, control blades, and some NFS/CIFS heads
|> > >for interop. What is the benefit of using them versus getting Lustre
|> > >and a regular SAN? Any idea?
|> > >
|> > >Regards,
|> > >Ernest
|>
|> Hey Guys. I run a large cluster with a lot of storage so that is my
|> perspective.
|>
|> > Well first thing is don't think about Lustre, it's a beast still and
|> > not ready for primetime.
|>
|> Lustre is a lot of work-there is no doubt about it. There are no
|> real tools (GUIs etc) to help. But they make their money in
|> customization I figure. What is really a good thing about Lustre
|> is that it is very scaleable. Most of the GFSes I have seen like
|> SANs and most of our boxes are NAS boxes. Lustre will let
|> us manage all our boxes (OSTs).
|>
|> > The others you mentioned are viable but the
|> > leaders today seem to be Polyserve, Ibrix, GPFS, and GFS. Personally
|> > I like Polyserve personally but Ibrix and GPFS both look good as well.
....


There is something I don't understand about these distributed
filesystems: how is the namespace mapped to the devices in a fully
distributed environment? If there is complete independence between
the name layer and the storage layer then there is an n-squared
locking problem, if devices are tied to the namespace then there
is a load-leveling/utilization problem.
Back to top
Faeandar
Guest





Posted: Fri Dec 03, 2004 4:51 am    Post subject: Re: Distributed filesystems (was: Re: PanFS?) Reply with quote

On Thu, 2 Dec 2004 19:46:08 GMT, krm@sdc.cs.boeing.com (Keith
Michaels) wrote:

Quote:
In article <Z0Erd.35310$bP2.18751@newssvr12.news.prodigy.com>,
"Wolf" <wolfdotcom@hotmail.com> writes:
|> "Faeandar" <mr_castalot@yahoo.com> wrote in message
|> news:l26sq0dbkfraja25cikhoc7qmara8mma6j@4ax.com...
|> > On 1 Dec 2004 10:20:06 -0800, ernestsiu@yahoo.com (Ernest Siu) wrote:
|
|> > >Any know know much about the solutions from Panasas? The PanFS looks
|> > >similar to clustered filesystem like CxFS, GFS, StorNext,
|> > >Lustre...etc. except it is object-based. Their solution seems to
|> > >include the backend storage, control blades, and some NFS/CIFS heads
|> > >for interop. What is the benefit of using them versus getting Lustre
|> > >and a regular SAN? Any idea?
|
|> > >Regards,
|> > >Ernest
|
|> Hey Guys. I run a large cluster with a lot of storage so that is my
|> perspective.
|
|> > Well first thing is don't think about Lustre, it's a beast still and
|> > not ready for primetime.
|
|> Lustre is a lot of work-there is no doubt about it. There are no
|> real tools (GUIs etc) to help. But they make their money in
|> customization I figure. What is really a good thing about Lustre
|> is that it is very scaleable. Most of the GFSes I have seen like
|> SANs and most of our boxes are NAS boxes. Lustre will let
|> us manage all our boxes (OSTs).
|
|> > The others you mentioned are viable but the
|> > leaders today seem to be Polyserve, Ibrix, GPFS, and GFS. Personally
|> > I like Polyserve personally but Ibrix and GPFS both look good as well.
...


There is something I don't understand about these distributed
filesystems: how is the namespace mapped to the devices in a fully
distributed environment? If there is complete independence between
the name layer and the storage layer then there is an n-squared
locking problem, if devices are tied to the namespace then there
is a load-leveling/utilization problem.

Yes, and yes (though maybe not squared). In most cases the products
use a VIP that clients mount. In cases like Polyserve and GFS the
client requests are round-robin'd among the node servers. Not exactly
load balancing but over time and enough node servers the balance finds
itself.

Products like Acopia and Panasas have internal algorithms that they
user for latency testing and then write to the node server that
returns the fastest. Good to some extent but in the case of Acopia,
not being a clustered file system, it means a potential for one system
to house alot more data than the others. Maybe a problem, maybe not.

The only time you have a potential locking issue is with a clustered
file system and a dedicated metadata server. This can become a
serious bottleneck. Some products use distributed metadata which,
while not alleviating the problem completely, greatly reduces it.

~F
Back to top
Faeandar
Guest





Posted: Fri Dec 03, 2004 4:58 am    Post subject: Re: PanFS? Reply with quote

On Thu, 02 Dec 2004 12:38:17 GMT, "Wolf" <wolfdotcom@hotmail.com>
wrote:

Quote:
"Faeandar" <mr_castalot@yahoo.com> wrote in message
news:l26sq0dbkfraja25cikhoc7qmara8mma6j@4ax.com...
On 1 Dec 2004 10:20:06 -0800, ernestsiu@yahoo.com (Ernest Siu) wrote:

Any know know much about the solutions from Panasas? The PanFS looks
similar to clustered filesystem like CxFS, GFS, StorNext,
Lustre...etc. except it is object-based. Their solution seems to
include the backend storage, control blades, and some NFS/CIFS heads
for interop. What is the benefit of using them versus getting Lustre
and a regular SAN? Any idea?

Regards,
Ernest

Hey Guys. I run a large cluster with a lot of storage so that is my
perspective.

Well first thing is don't think about Lustre, it's a beast still and
not ready for primetime.

Lustre is a lot of work-there is no doubt about it. There are no
real tools (GUIs etc) to help. But they make their money in
customization I figure. What is really a good thing about Lustre
is that it is very scaleable. Most of the GFSes I have seen like
SANs and most of our boxes are NAS boxes. Lustre will let
us manage all our boxes (OSTs).

It's scaleable on the nodes but not on performance. You can have 1000
nodes or more but performance will begin to suffer seriously after
about 30 or 40. Read performance is scaleble, but not the write. And
my guess is most people looking at these types of solutions need write
performance as well as read.

Quote:

The others you mentioned are viable but the
leaders today seem to be Polyserve, Ibrix, GPFS, and GFS. Personally
I like Polyserve personally but Ibrix and GPFS both look good as well.

We have a small CX500 w/ 18TB and IBM 2.5GHZ dual Opteron/QLogic
heads for testing. We will be testing IBrix (GPFS and Lustre. I am going to
install RedHat's next week if all goes well with my other work. :)

Polyserve beat out GFS by about 30% for us. Plus it was immensely
more simple to install and configure. I have yet to get a real
low-down on GPFS but it sounds about the same as the others, not sure
what they may have that the others don't.
Ibrix is extremely attractive if you already have alot of DAS since
each one can be it's own segment server and be part of the namespace
as well. Nice re-use of existing hardware and storage, no one else
offers this in software (Acopia can take advantage of that but it's a
hw solution).

Quote:

I would go for something a little more open. Any of the software
solutions plus my pick of hardware. I like the options. So if I want
to use HDS disk with qlogic controllers and Dell blades, I can. Not
so with PanFS as I understand it.

Pretty much, but then if it works who cares?

I'm thinking in terms of expansion. Depending ont he company you may
or may not get a serious discount on enterprise grade hardware. If
you do then you'd want to take advantage of that since Panasas is
pretty expensive.

Quote:


But if you want something all-in-one then they are likely the best in
town right now.

I would recommend you test, test, test before plucking down $$$. Also
you may want to look at Terragrid from Terrascale. Performance is also
really good.

I'll have to look at terragrid. Heard of it but that's it.

~F
Back to top
Keith Michaels
Guest





Posted: Fri Dec 03, 2004 10:04 pm    Post subject: Re: Distributed filesystems (was: Re: PanFS?) Reply with quote

In article <h8avq0dhipa8l2p7moaf3fbga9bs6h3j77@4ax.com>,
Faeandar <mr_castalot@yahoo.com> writes:
....

|> >There is something I don't understand about these distributed
|> >filesystems: how is the namespace mapped to the devices in a fully
|> >distributed environment? If there is complete independence between
|> >the name layer and the storage layer then there is an n-squared
|> >locking problem, if devices are tied to the namespace then there
|> >is a load-leveling/utilization problem.
|>
|> Yes, and yes (though maybe not squared). In most cases the products
|> use a VIP that clients mount. In cases like Polyserve and GFS the
|> client requests are round-robin'd among the node servers. Not exactly
|> load balancing but over time and enough node servers the balance finds
|> itself.
|>
|> Products like Acopia and Panasas have internal algorithms that they
|> user for latency testing and then write to the node server that
|> returns the fastest. Good to some extent but in the case of Acopia,
|> not being a clustered file system, it means a potential for one system
|> to house alot more data than the others. Maybe a problem, maybe not.
|>
|> The only time you have a potential locking issue is with a clustered
|> file system and a dedicated metadata server. This can become a
|> serious bottleneck. Some products use distributed metadata which,
|> while not alleviating the problem completely, greatly reduces it.
|>
|> ~F

If all nodes are equal (distributed metadata case) then there needs
to be a mapping from filesystem semantics to the underlying
consistency model implemented by the storage system e.g., if
filesystems guarantee strict write ordering (reads always return
the latest write) then the storage system must implement equally
strong cache coherency between all nodes, which doesn't scale.
So there's a tradeoff between performance and utilization. I was
just wondering how modern DFSs deal with this fundamental
limitation. SMPs have hardware support for cache management;
do we need similar functionality in the SAN?
Back to top
Bill Todd
Guest





Posted: Sat Dec 04, 2004 12:29 am    Post subject: Re: Distributed filesystems (was: Re: PanFS?) Reply with quote

"Keith Michaels" <krm@sdc.cs.boeing.com> wrote in message
news:I85orB.A8u@news.boeing.com...
Quote:
In article <h8avq0dhipa8l2p7moaf3fbga9bs6h3j77@4ax.com>,
Faeandar <mr_castalot@yahoo.com> writes:
...

|> >There is something I don't understand about these distributed
|> >filesystems: how is the namespace mapped to the devices in a fully
|> >distributed environment? If there is complete independence between
|> >the name layer and the storage layer then there is an n-squared
|> >locking problem, if devices are tied to the namespace then there
|> >is a load-leveling/utilization problem.
|
|> Yes, and yes (though maybe not squared). In most cases the products
|> use a VIP that clients mount. In cases like Polyserve and GFS the
|> client requests are round-robin'd among the node servers. Not exactly
|> load balancing but over time and enough node servers the balance finds
|> itself.
|
|> Products like Acopia and Panasas have internal algorithms that they
|> user for latency testing and then write to the node server that
|> returns the fastest. Good to some extent but in the case of Acopia,
|> not being a clustered file system, it means a potential for one system
|> to house alot more data than the others. Maybe a problem, maybe not.
|
|> The only time you have a potential locking issue is with a clustered
|> file system and a dedicated metadata server. This can become a
|> serious bottleneck. Some products use distributed metadata which,
|> while not alleviating the problem completely, greatly reduces it.
|
|> ~F

If all nodes are equal (distributed metadata case) then there needs
to be a mapping from filesystem semantics to the underlying
consistency model implemented by the storage system e.g., if
filesystems guarantee strict write ordering (reads always return
the latest write) then the storage system must implement equally
strong cache coherency between all nodes, which doesn't scale.
So there's a tradeoff between performance and utilization. I was
just wondering how modern DFSs deal with this fundamental
limitation. SMPs have hardware support for cache management;
do we need similar functionality in the SAN?

'Distributed metadata' means different things in different contexts.

On VMS clusters, it refers to distributed management of metadata stored on
shared devices by cooperating hosts. But even there only a single host
manages a given metadatum at any given time (using the distributed locking
facility). This approach scales to hundreds of host nodes (the nominal
supported limit on cluster size is 96, but sizes close to 200 have been used
in practice), though contention varies with the degree of access locality to
a given datum (the worst case obviously being if all hosts access it
equally, with intense update activity).

In implementations like Lustre, it refers to low-level local metadata stored
on all storage units, plus file-level metadata stored in a coordinating
central metadata server (or metadata server cluster). The low-level local
metadata is completely partitioned (i.e., of only local significance) and
hence has no scaling problem - but by offloading that portion of metadata
management from the central metadata server helps it scale too.

Another alternative is to partition higher-level metadata as well (e.g.,
localizing the management of metadata for any given file - or even
directory - to a single server (plus perhaps a mirror partner), even if the
file data may be spread more widely). This certainly qualifies as
distributed metadata, even though management of any single metadatum is not
distributed, and scales well at the file system level as long as the
partitioning granularity manages to spread out metadata loads reasonably
(e.g., you'd have a problem if all system activity concentrated on one
humongous database file - though with some ingenuity subdividing at least
some of the metadata management even within a single file is possible).

Until SAN bandwidths and latencies approach those of local RAM much more
closely than they do today, there will likely be little reason to use
hardware management of distributed file caches - especially since the most
effective location for such caching is very often (save in cases of intense
contention) at the client itself, where special hardware is least likely to
be found.

- bill
Back to top
Faeandar
Guest





Posted: Sat Dec 04, 2004 12:53 am    Post subject: Re: Distributed filesystems (was: Re: PanFS?) Reply with quote

On Fri, 3 Dec 2004 14:29:21 -0500, "Bill Todd"
<billtodd@metrocast.net> wrote:

Quote:

"Keith Michaels" <krm@sdc.cs.boeing.com> wrote in message
news:I85orB.A8u@news.boeing.com...
In article <h8avq0dhipa8l2p7moaf3fbga9bs6h3j77@4ax.com>,
Faeandar <mr_castalot@yahoo.com> writes:
...

|> >There is something I don't understand about these distributed
|> >filesystems: how is the namespace mapped to the devices in a fully
|> >distributed environment? If there is complete independence between
|> >the name layer and the storage layer then there is an n-squared
|> >locking problem, if devices are tied to the namespace then there
|> >is a load-leveling/utilization problem.
|
|> Yes, and yes (though maybe not squared). In most cases the products
|> use a VIP that clients mount. In cases like Polyserve and GFS the
|> client requests are round-robin'd among the node servers. Not exactly
|> load balancing but over time and enough node servers the balance finds
|> itself.
|
|> Products like Acopia and Panasas have internal algorithms that they
|> user for latency testing and then write to the node server that
|> returns the fastest. Good to some extent but in the case of Acopia,
|> not being a clustered file system, it means a potential for one system
|> to house alot more data than the others. Maybe a problem, maybe not.
|
|> The only time you have a potential locking issue is with a clustered
|> file system and a dedicated metadata server. This can become a
|> serious bottleneck. Some products use distributed metadata which,
|> while not alleviating the problem completely, greatly reduces it.
|
|> ~F

If all nodes are equal (distributed metadata case) then there needs
to be a mapping from filesystem semantics to the underlying
consistency model implemented by the storage system e.g., if
filesystems guarantee strict write ordering (reads always return
the latest write) then the storage system must implement equally
strong cache coherency between all nodes, which doesn't scale.
So there's a tradeoff between performance and utilization. I was
just wondering how modern DFSs deal with this fundamental
limitation. SMPs have hardware support for cache management;
do we need similar functionality in the SAN?

'Distributed metadata' means different things in different contexts.

On VMS clusters, it refers to distributed management of metadata stored on
shared devices by cooperating hosts. But even there only a single host
manages a given metadatum at any given time (using the distributed locking
facility). This approach scales to hundreds of host nodes (the nominal
supported limit on cluster size is 96, but sizes close to 200 have been used
in practice), though contention varies with the degree of access locality to
a given datum (the worst case obviously being if all hosts access it
equally, with intense update activity).

In implementations like Lustre, it refers to low-level local metadata stored
on all storage units, plus file-level metadata stored in a coordinating
central metadata server (or metadata server cluster). The low-level local
metadata is completely partitioned (i.e., of only local significance) and
hence has no scaling problem - but by offloading that portion of metadata
management from the central metadata server helps it scale too.

Help me understand this. If file level metadata is coordinated by a
single server how is that not a large bottleneck? For reads I can see
why it's not, but for writes (in the 10,00 node cluster they talk
about) this would be like trying to stuff the Atlantic through a hose.
At least in my understanding.

Quote:

Another alternative is to partition higher-level metadata as well (e.g.,
localizing the management of metadata for any given file - or even
directory - to a single server (plus perhaps a mirror partner), even if the
file data may be spread more widely). This certainly qualifies as
distributed metadata, even though management of any single metadatum is not
distributed, and scales well at the file system level as long as the
partitioning granularity manages to spread out metadata loads reasonably
(e.g., you'd have a problem if all system activity concentrated on one
humongous database file - though with some ingenuity subdividing at least
some of the metadata management even within a single file is possible).

Until SAN bandwidths and latencies approach those of local RAM much more
closely than they do today, there will likely be little reason to use
hardware management of distributed file caches - especially since the most
effective location for such caching is very often (save in cases of intense
contention) at the client itself, where special hardware is least likely to
be found.

My guess is we will see something like NVRAM cards inside the hosts
all connected via IB or some other high bandwidth low latency
interconnect. For now though gigabit seems to do the trick primarily
for the reason you mentioned.
Although I gotta say, it is extremely appealing to use NVRAM in thos
hosts anyway just for the added write performance boost. If only they
could be clustered by themselves and not require host-based
clustering.... Ah well, someday.

~F
Back to top
Keith Michaels
Guest





Posted: Sat Dec 04, 2004 1:45 am    Post subject: Re: Distributed filesystems (was: Re: PanFS?) Reply with quote

In article <18ydndgNdq8EIy3cRVn-oQ@metrocastcablevision.com>,
"Bill Todd" <billtodd@metrocast.net> writes:
|>
|> "Keith Michaels" <krm@sdc.cs.boeing.com> wrote in message
|> news:I85orB.A8u@news.boeing.com...
|> > In article <h8avq0dhipa8l2p7moaf3fbga9bs6h3j77@4ax.com>,
|> > Faeandar <mr_castalot@yahoo.com> writes:
|> > ...
|> >
|> > |> >There is something I don't understand about these distributed
|> > |> >filesystems: how is the namespace mapped to the devices in a fully
|> > |> >distributed environment? If there is complete independence between
|> > |> >the name layer and the storage layer then there is an n-squared
|> > |> >locking problem, if devices are tied to the namespace then there
|> > |> >is a load-leveling/utilization problem.
|> > |>
|> > |> Yes, and yes (though maybe not squared). In most cases the products
|> > |> use a VIP that clients mount. In cases like Polyserve and GFS the
|> > |> client requests are round-robin'd among the node servers. Not exactly
|> > |> load balancing but over time and enough node servers the balance finds
|> > |> itself.
|> > |>
|> > |> Products like Acopia and Panasas have internal algorithms that they
|> > |> user for latency testing and then write to the node server that
|> > |> returns the fastest. Good to some extent but in the case of Acopia,
|> > |> not being a clustered file system, it means a potential for one system
|> > |> to house alot more data than the others. Maybe a problem, maybe not.
|> > |>
|> > |> The only time you have a potential locking issue is with a clustered
|> > |> file system and a dedicated metadata server. This can become a
|> > |> serious bottleneck. Some products use distributed metadata which,
|> > |> while not alleviating the problem completely, greatly reduces it.
|> > |>
|> > |> ~F
|> >
|> > If all nodes are equal (distributed metadata case) then there needs
|> > to be a mapping from filesystem semantics to the underlying
|> > consistency model implemented by the storage system e.g., if
|> > filesystems guarantee strict write ordering (reads always return
|> > the latest write) then the storage system must implement equally
|> > strong cache coherency between all nodes, which doesn't scale.
|> > So there's a tradeoff between performance and utilization. I was
|> > just wondering how modern DFSs deal with this fundamental
|> > limitation. SMPs have hardware support for cache management;
|> > do we need similar functionality in the SAN?
|>
|> 'Distributed metadata' means different things in different contexts.
|>
|> On VMS clusters, it refers to distributed management of metadata stored on
|> shared devices by cooperating hosts. But even there only a single host
|> manages a given metadatum at any given time (using the distributed locking
|> facility). This approach scales to hundreds of host nodes (the nominal
|> supported limit on cluster size is 96, but sizes close to 200 have been used
|> in practice), though contention varies with the degree of access locality to
|> a given datum (the worst case obviously being if all hosts access it
|> equally, with intense update activity).
|>
|> In implementations like Lustre, it refers to low-level local metadata stored
|> on all storage units, plus file-level metadata stored in a coordinating
|> central metadata server (or metadata server cluster). The low-level local
|> metadata is completely partitioned (i.e., of only local significance) and
|> hence has no scaling problem - but by offloading that portion of metadata
|> management from the central metadata server helps it scale too.
|>
|> Another alternative is to partition higher-level metadata as well (e.g.,
|> localizing the management of metadata for any given file - or even
|> directory - to a single server (plus perhaps a mirror partner), even if the
|> file data may be spread more widely). This certainly qualifies as
|> distributed metadata, even though management of any single metadatum is not
|> distributed, and scales well at the file system level as long as the
|> partitioning granularity manages to spread out metadata loads reasonably
|> (e.g., you'd have a problem if all system activity concentrated on one
|> humongous database file - though with some ingenuity subdividing at least
|> some of the metadata management even within a single file is possible).
|>
|> Until SAN bandwidths and latencies approach those of local RAM much more
|> closely than they do today, there will likely be little reason to use
|> hardware management of distributed file caches - especially since the most
|> effective location for such caching is very often (save in cases of intense
|> contention) at the client itself, where special hardware is least likely to
|> be found.
|>
|> - bill

Thanks Bill, that's a very interesting assessment. It seems to me
that the promise of grid-based/object storage is scalability; if
we lose that we're back where we started. Lustre cleverly exploits
the difference between local and global attributes but in the end
all of these DFSs lose scalability when trying to emulate strictly
ordered filesystem semantics and centralized resource management
without making assumptions about sharing and access patterns.
Maybe there's a perfect technology coming that will do it all; in
the meantime what is the business case for DFS deployment? Maybe
a better question is how do we move beyond the filesystem model
so that these technologies fit our applications better?
Back to top
Guest






Posted: Sat Dec 04, 2004 2:37 am    Post subject: Re: Distributed filesystems (was: Re: PanFS?) Reply with quote

In article <6hg1r0h8hide29uohvkj5kbdaaa3i19ifp@4ax.com>,
Faeandar <mr_castalot@yahoo.com> wrote:
Quote:
In implementations like Lustre, it refers to low-level local metadata stored
on all storage units, plus file-level metadata stored in a coordinating
central metadata server (or metadata server cluster). The low-level local
metadata is completely partitioned (i.e., of only local significance) and
hence has no scaling problem - but by offloading that portion of metadata
management from the central metadata server helps it scale too.

Help me understand this. If file level metadata is coordinated by a
single server how is that not a large bottleneck? For reads I can see
why it's not, but for writes (in the 10,00 node cluster they talk
about) this would be like trying to stuff the Atlantic through a hose.
At least in my understanding.

Depends. On the definition of "metadata", on the exact mode of
caching, on how the consistency locking is implemented, on the
workload, and most importantly, whether the actual workload matches
the expectation that the implementors of the shared file system had.
There are existance proofs of clustered / distributed filesystems that
centralize metadata handling, yet work extremely well, even when
writing.

This topic is very well studied. There are at least dozens of papers
on it in the open research literature. The number of published
patents about it must be in the hundreds. Many of the experts in the
field could hold talks lasting several hours apiece on the subject. I
suggest that you review the literature on the subject. I don't
offhand know a good overview or introductory textbook; just do a
google search for things like GFS (Minnesota), Lustre, Panasas or
PanFS (also look for CMU NASD, its predecessor), IBM StorageTank or
SAN-FS, GFS (the Google file system), IBM GPFS, Digital
Petal/Frangipani, Berkeley xFS, SGI CXFS, parallel NFSv4, and many
academic predecessors (Sprite, Slice, whatevers).

Quote:
My guess is we will see something like NVRAM cards inside the hosts
all connected via IB or some other high bandwidth low latency
interconnect. For now though gigabit seems to do the trick primarily
for the reason you mentioned.
Although I gotta say, it is extremely appealing to use NVRAM in thos
hosts anyway just for the added write performance boost. If only they
could be clustered by themselves and not require host-based
clustering.... Ah well, someday.

If you had NVRAM, and IB (or even better, something like SCI, which
has the cache-coherency built in), and the NVRAM is fault-tolerant
(meaning: remains accessible even if the host has failed), then
implementing a clustered file system becomes trivial. Or to put it
differently: In an environment where the hardware is plentiful, even
really bad architectural choices will perform adequately. But today,
we don't have hardware NVRAM in every host, even less do we have
fault-tolerant NVRAM, and we have high-latency networking. Or to be
exact: Cost considerations prevent us from deploying
NVRAM/IB/... universally (they certainly do exist, but they are
expensive and not available in commodity computers). This means that
implementing a cluster or distributed file system becomes an art form,
and architectural choices must be well thought out, and matched to the
intended usage pattern and to the customers expectations.

--
The address in the header is invalid for obvious reasons. Please
reconstruct the address from the information below (look for _).
Ralph Becker-Szendy _firstname_@lr_dot_los-gatos_dot_ca.us
Back to top
Faeandar
Guest





Posted: Sat Dec 04, 2004 3:23 am    Post subject: Re: Distributed filesystems (was: Re: PanFS?) Reply with quote

On Fri, 03 Dec 2004 21:37:48 -0000,
_firstname_@lr_dot_los-gatos_dot_ca.us wrote:

Quote:
In article <6hg1r0h8hide29uohvkj5kbdaaa3i19ifp@4ax.com>,
Faeandar <mr_castalot@yahoo.com> wrote:
In implementations like Lustre, it refers to low-level local metadata stored
on all storage units, plus file-level metadata stored in a coordinating
central metadata server (or metadata server cluster). The low-level local
metadata is completely partitioned (i.e., of only local significance) and
hence has no scaling problem - but by offloading that portion of metadata
management from the central metadata server helps it scale too.

Help me understand this. If file level metadata is coordinated by a
single server how is that not a large bottleneck? For reads I can see
why it's not, but for writes (in the 10,00 node cluster they talk
about) this would be like trying to stuff the Atlantic through a hose.
At least in my understanding.

Depends. On the definition of "metadata", on the exact mode of
caching, on how the consistency locking is implemented, on the
workload, and most importantly, whether the actual workload matches
the expectation that the implementors of the shared file system had.
There are existance proofs of clustered / distributed filesystems that
centralize metadata handling, yet work extremely well, even when
writing.

This topic is very well studied. There are at least dozens of papers
on it in the open research literature. The number of published
patents about it must be in the hundreds. Many of the experts in the
field could hold talks lasting several hours apiece on the subject. I
suggest that you review the literature on the subject. I don't
offhand know a good overview or introductory textbook; just do a
google search for things like GFS (Minnesota), Lustre, Panasas or
PanFS (also look for CMU NASD, its predecessor), IBM StorageTank or
SAN-FS, GFS (the Google file system), IBM GPFS, Digital
Petal/Frangipani, Berkeley xFS, SGI CXFS, parallel NFSv4, and many
academic predecessors (Sprite, Slice, whatevers).

Well, so far I've read architectural and white papers on, and talked
to the vendors of (engineers not just sales), GFS, Polyserve, Ibrix,
Lustre, Panasas, Acopia, and soon to be GPFS. pNFS doesn't really fit
this space, at least not for what I want.
In each case the solutions with a single metadata server will see a
significant difference between read and write performance after 30-odd
node servers. Once you get to 60+ the write performance blows chunks,
albeit the read continues to rocket in most cases.

Quote:

My guess is we will see something like NVRAM cards inside the hosts
all connected via IB or some other high bandwidth low latency
interconnect. For now though gigabit seems to do the trick primarily
for the reason you mentioned.
Although I gotta say, it is extremely appealing to use NVRAM in thos
hosts anyway just for the added write performance boost. If only they
could be clustered by themselves and not require host-based
clustering.... Ah well, someday.

If you had NVRAM, and IB (or even better, something like SCI, which
has the cache-coherency built in), and the NVRAM is fault-tolerant
(meaning: remains accessible even if the host has failed), then
implementing a clustered file system becomes trivial. Or to put it
differently: In an environment where the hardware is plentiful, even
really bad architectural choices will perform adequately. But today,
we don't have hardware NVRAM in every host, even less do we have
fault-tolerant NVRAM, and we have high-latency networking. Or to be
exact: Cost considerations prevent us from deploying
NVRAM/IB/... universally (they certainly do exist, but they are
expensive and not available in commodity computers). This means that
implementing a cluster or distributed file system becomes an art form,
and architectural choices must be well thought out, and matched to the
intended usage pattern and to the customers expectations.

NVRAM cards are already avaialable for off-the-shelf servers and they
work quite well for the most part. The big problem in a high
performance file system is that host-based clustering is still
required for the NVRAM failover, it's not builtin directly. I would
love to throw a MicroMemory 4gb NVRAM card in each of my node servers
for enhanced write performance but I would need something on the host
to handle failover and coherency, a layer of complexity I avoid in
this environment because it's not needed and generally not wanted.
That's why I have the file system and 30 node servers to begin with...

~F
Back to top
Faeandar
Guest





Posted: Sat Dec 04, 2004 3:30 am    Post subject: Re: Distributed filesystems (was: Re: PanFS?) Reply with quote

On Fri, 3 Dec 2004 20:45:16 GMT, krm@sdc.cs.boeing.com (Keith
Michaels) wrote:

Quote:
In article <18ydndgNdq8EIy3cRVn-oQ@metrocastcablevision.com>,
"Bill Todd" <billtodd@metrocast.net> writes:
|
|> "Keith Michaels" <krm@sdc.cs.boeing.com> wrote in message
|> news:I85orB.A8u@news.boeing.com...
|> > In article <h8avq0dhipa8l2p7moaf3fbga9bs6h3j77@4ax.com>,
|> > Faeandar <mr_castalot@yahoo.com> writes:
|> > ...
|
|> > |> >There is something I don't understand about these distributed
|> > |> >filesystems: how is the namespace mapped to the devices in a fully
|> > |> >distributed environment? If there is complete independence between
|> > |> >the name layer and the storage layer then there is an n-squared
|> > |> >locking problem, if devices are tied to the namespace then there
|> > |> >is a load-leveling/utilization problem.
|> > |
|> > |> Yes, and yes (though maybe not squared). In most cases the products
|> > |> use a VIP that clients mount. In cases like Polyserve and GFS the
|> > |> client requests are round-robin'd among the node servers. Not exactly
|> > |> load balancing but over time and enough node servers the balance finds
|> > |> itself.
|> > |
|> > |> Products like Acopia and Panasas have internal algorithms that they
|> > |> user for latency testing and then write to the node server that
|> > |> returns the fastest. Good to some extent but in the case of Acopia,
|> > |> not being a clustered file system, it means a potential for one system
|> > |> to house alot more data than the others. Maybe a problem, maybe not.
|> > |
|> > |> The only time you have a potential locking issue is with a clustered
|> > |> file system and a dedicated metadata server. This can become a
|> > |> serious bottleneck. Some products use distributed metadata which,
|> > |> while not alleviating the problem completely, greatly reduces it.
|> > |
|> > |> ~F
|
|> > If all nodes are equal (distributed metadata case) then there needs
|> > to be a mapping from filesystem semantics to the underlying
|> > consistency model implemented by the storage system e.g., if
|> > filesystems guarantee strict write ordering (reads always return
|> > the latest write) then the storage system must implement equally
|> > strong cache coherency between all nodes, which doesn't scale.
|> > So there's a tradeoff between performance and utilization. I was
|> > just wondering how modern DFSs deal with this fundamental
|> > limitation. SMPs have hardware support for cache management;
|> > do we need similar functionality in the SAN?
|
|> 'Distributed metadata' means different things in different contexts.
|
|> On VMS clusters, it refers to distributed management of metadata stored on
|> shared devices by cooperating hosts. But even there only a single host
|> manages a given metadatum at any given time (using the distributed locking
|> facility). This approach scales to hundreds of host nodes (the nominal
|> supported limit on cluster size is 96, but sizes close to 200 have been used
|> in practice), though contention varies with the degree of access locality to
|> a given datum (the worst case obviously being if all hosts access it
|> equally, with intense update activity).
|
|> In implementations like Lustre, it refers to low-level local metadata stored
|> on all storage units, plus file-level metadata stored in a coordinating
|> central metadata server (or metadata server cluster). The low-level local
|> metadata is completely partitioned (i.e., of only local significance) and
|> hence has no scaling problem - but by offloading that portion of metadata
|> management from the central metadata server helps it scale too.
|
|> Another alternative is to partition higher-level metadata as well (e.g.,
|> localizing the management of metadata for any given file - or even
|> directory - to a single server (plus perhaps a mirror partner), even if the
|> file data may be spread more widely). This certainly qualifies as
|> distributed metadata, even though management of any single metadatum is not
|> distributed, and scales well at the file system level as long as the
|> partitioning granularity manages to spread out metadata loads reasonably
|> (e.g., you'd have a problem if all system activity concentrated on one
|> humongous database file - though with some ingenuity subdividing at least
|> some of the metadata management even within a single file is possible).
|
|> Until SAN bandwidths and latencies approach those of local RAM much more
|> closely than they do today, there will likely be little reason to use
|> hardware management of distributed file caches - especially since the most
|> effective location for such caching is very often (save in cases of intense
|> contention) at the client itself, where special hardware is least likely to
|> be found.
|
|> - bill

Thanks Bill, that's a very interesting assessment. It seems to me
that the promise of grid-based/object storage is scalability; if
we lose that we're back where we started. Lustre cleverly exploits
the difference between local and global attributes but in the end
all of these DFSs lose scalability when trying to emulate strictly
ordered filesystem semantics and centralized resource management
without making assumptions about sharing and access patterns.
Maybe there's a perfect technology coming that will do it all; in
the meantime what is the business case for DFS deployment? Maybe
a better question is how do we move beyond the filesystem model
so that these technologies fit our applications better?

I don't think that's strictly true, that we lose scalability. A
single file system by today's norm is over a single host, so right out
of the gate the CFS's beat that, DFS's too for that matter. The
purpose of these file systems came out of high performance computing
needs, basically striping writes/reads across multiple hosts to gain
(in theory) linear performance increases per node addition. Obviously
that's not the case but it definitely improves performance
dramatically. So scalability is already there.

Can it scale to 10,000 nodes? Well not currently (read-only not
withstanding) but 10 years ago scaling this type of file system to
anything beyond 1 server was voodoo in the open systems world.

~F
Back to top
Bill Todd
Guest





Posted: Sat Dec 04, 2004 4:25 am    Post subject: Re: Distributed filesystems (was: Re: PanFS?) Reply with quote

"Faeandar" <mr_castalot@yahoo.com> wrote in message
news:6hg1r0h8hide29uohvkj5kbdaaa3i19ifp@4ax.com...
Quote:
On Fri, 3 Dec 2004 14:29:21 -0500, "Bill Todd"
billtodd@metrocast.net> wrote:

....

Quote:
In implementations like Lustre, it refers to low-level local metadata
stored
on all storage units, plus file-level metadata stored in a coordinating
central metadata server (or metadata server cluster). The low-level
local
metadata is completely partitioned (i.e., of only local significance) and
hence has no scaling problem - but by offloading that portion of metadata
management from the central metadata server helps it scale too.

Help me understand this. If file level metadata is coordinated by a
single server how is that not a large bottleneck?

By making the server suitably hefty. Or, as noted above, by using a
metadata server cluster to divvy up the load (though even a relatively
modest SMP system can handle impressive levels of metadata management before
clustering it is necessary for anything save to increase availability via
fail-over mechanisms).

For reads I can see
Quote:
why it's not, but for writes (in the 10,00 node cluster they talk
about) this would be like trying to stuff the Atlantic through a hose.
At least in my understanding.

The actual user data doesn't pass through the metadata server, just the
metadata operations. So in a product like Lustre (or, IIUC, Storage Tank),
the metadata server merely need be made aware of the write operation to
update things like last-modified date and EOF mark, while the grunt-level
data servers handle the actual movement to disk (and at least potentially
any interlocking involved, though the interaction of byte-range locks with
full-file locks on a file large enough to span many data servers becomes an
interesting challenge then).

Directory operations may often place a heavier load on metadata servers than
even write-intensive file activity.

....

Quote:
Until SAN bandwidths and latencies approach those of local RAM much more
closely than they do today, there will likely be little reason to use
hardware management of distributed file caches - especially since the
most
effective location for such caching is very often (save in cases of
intense
contention) at the client itself, where special hardware is least likely
to
be found.

My guess is we will see something like NVRAM cards inside the hosts
all connected via IB or some other high bandwidth low latency
interconnect. For now though gigabit seems to do the trick primarily
for the reason you mentioned.
Although I gotta say, it is extremely appealing to use NVRAM in thos
hosts anyway just for the added write performance boost. If only they
could be clustered by themselves and not require host-based
clustering.... Ah well, someday.

Use of NVRAM is eminently feasible today, and has (or at least with suitable
design need have) relatively little connection to distributed caching (let
alone with hardware support for same) - especially if clients aren't
completely trustworthy.

- bill
Back to top
Keith Michaels
Guest





Posted: Sat Dec 04, 2004 4:43 am    Post subject: Re: Distributed filesystems (was: Re: PanFS?) Reply with quote

In article <jrp1r0hj9qjckuiq0vr7i6guj8adeqamrf@4ax.com>,
Faeandar <mr_castalot@yahoo.com> writes:
|> On Fri, 3 Dec 2004 20:45:16 GMT, krm@sdc.cs.boeing.com (Keith
|> Michaels) wrote:
|>
|> >In article <18ydndgNdq8EIy3cRVn-oQ@metrocastcablevision.com>,
|> > "Bill Todd" <billtodd@metrocast.net> writes:
|> >|>
|> >|> "Keith Michaels" <krm@sdc.cs.boeing.com> wrote in message
|> >|> news:I85orB.A8u@news.boeing.com...
|> >|> > In article <h8avq0dhipa8l2p7moaf3fbga9bs6h3j77@4ax.com>,
|> >|> > Faeandar <mr_castalot@yahoo.com> writes:
|> >|> > ...
|> >|> >
|> >|> > |> >There is something I don't understand about these distributed
|> >|> > |> >filesystems: how is the namespace mapped to the devices in a fully
|> >|> > |> >distributed environment? If there is complete independence between
|> >|> > |> >the name layer and the storage layer then there is an n-squared
|> >|> > |> >locking problem, if devices are tied to the namespace then there
|> >|> > |> >is a load-leveling/utilization problem.
|> >|> > |>
|> >|> > |> Yes, and yes (though maybe not squared). In most cases the products
|> >|> > |> use a VIP that clients mount. In cases like Polyserve and GFS the
|> >|> > |> client requests are round-robin'd among the node servers. Not exactly
|> >|> > |> load balancing but over time and enough node servers the balance finds
|> >|> > |> itself.
|> >|> > |>
|> >|> > |> Products like Acopia and Panasas have internal algorithms that they
|> >|> > |> user for latency testing and then write to the node server that
|> >|> > |> returns the fastest. Good to some extent but in the case of Acopia,
|> >|> > |> not being a clustered file system, it means a potential for one system
|> >|> > |> to house alot more data than the others. Maybe a problem, maybe not.
|> >|> > |>
|> >|> > |> The only time you have a potential locking issue is with a clustered
|> >|> > |> file system and a dedicated metadata server. This can become a
|> >|> > |> serious bottleneck. Some products use distributed metadata which,
|> >|> > |> while not alleviating the problem completely, greatly reduces it.
|> >|> > |>
|> >|> > |> ~F
|> >|> >
|> >|> > If all nodes are equal (distributed metadata case) then there needs
|> >|> > to be a mapping from filesystem semantics to the underlying
|> >|> > consistency model implemented by the storage system e.g., if
|> >|> > filesystems guarantee strict write ordering (reads always return
|> >|> > the latest write) then the storage system must implement equally
|> >|> > strong cache coherency between all nodes, which doesn't scale.
|> >|> > So there's a tradeoff between performance and utilization. I was
|> >|> > just wondering how modern DFSs deal with this fundamental
|> >|> > limitation. SMPs have hardware support for cache management;
|> >|> > do we need similar functionality in the SAN?
|> >|>
|> >|> 'Distributed metadata' means different things in different contexts.
|> >|>
|> >|> On VMS clusters, it refers to distributed management of metadata stored on
|> >|> shared devices by cooperating hosts. But even there only a single host
|> >|> manages a given metadatum at any given time (using the distributed locking
|> >|> facility). This approach scales to hundreds of host nodes (the nominal
|> >|> supported limit on cluster size is 96, but sizes close to 200 have been used
|> >|> in practice), though contention varies with the degree of access locality to
|> >|> a given datum (the worst case obviously being if all hosts access it
|> >|> equally, with intense update activity).
|> >|>
|> >|> In implementations like Lustre, it refers to low-level local metadata stored
|> >|> on all storage units, plus file-level metadata stored in a coordinating
|> >|> central metadata server (or metadata server cluster). The low-level local
|> >|> metadata is completely partitioned (i.e., of only local significance) and
|> >|> hence has no scaling problem - but by offloading that portion of metadata
|> >|> management from the central metadata server helps it scale too.
|> >|>
|> >|> Another alternative is to partition higher-level metadata as well (e.g.,
|> >|> localizing the management of metadata for any given file - or even
|> >|> directory - to a single server (plus perhaps a mirror partner), even if the
|> >|> file data may be spread more widely). This certainly qualifies as
|> >|> distributed metadata, even though management of any single metadatum is not
|> >|> distributed, and scales well at the file system level as long as the
|> >|> partitioning granularity manages to spread out metadata loads reasonably
|> >|> (e.g., you'd have a problem if all system activity concentrated on one
|> >|> humongous database file - though with some ingenuity subdividing at least
|> >|> some of the metadata management even within a single file is possible).
|> >|>
|> >|> Until SAN bandwidths and latencies approach those of local RAM much more
|> >|> closely than they do today, there will likely be little reason to use
|> >|> hardware management of distributed file caches - especially since the most
|> >|> effective location for such caching is very often (save in cases of intense
|> >|> contention) at the client itself, where special hardware is least likely to
|> >|> be found.
|> >|>
|> >|> - bill
|> >
|> >Thanks Bill, that's a very interesting assessment. It seems to me
|> >that the promise of grid-based/object storage is scalability; if
|> >we lose that we're back where we started. Lustre cleverly exploits
|> >the difference between local and global attributes but in the end
|> >all of these DFSs lose scalability when trying to emulate strictly
|> >ordered filesystem semantics and centralized resource management
|> >without making assumptions about sharing and access patterns.
|> >Maybe there's a perfect technology coming that will do it all; in
|> >the meantime what is the business case for DFS deployment? Maybe
|> >a better question is how do we move beyond the filesystem model
|> >so that these technologies fit our applications better?
|>
|> I don't think that's strictly true, that we lose scalability. A
|> single file system by today's norm is over a single host, so right out
|> of the gate the CFS's beat that, DFS's too for that matter. The
|> purpose of these file systems came out of high performance computing
|> needs, basically striping writes/reads across multiple hosts to gain
|> (in theory) linear performance increases per node addition. Obviously
|> that's not the case but it definitely improves performance
|> dramatically. So scalability is already there.
|>
|> Can it scale to 10,000 nodes? Well not currently (read-only not
|> withstanding) but 10 years ago scaling this type of file system to
|> anything beyond 1 server was voodoo in the open systems world.
|>
|> ~F

The way to preserve scalability is to stripe, which improves data
bandwidth to a host, and to partition the metadata, which controls
the cost of allocation and other metadata updates. Both of these
strategies reduce utilization; striping across a SAN gobbles
up switch bandwidth geometrically, partitioning the metadata
either suffers latency (metaserver far away), or uneven allocation
(nearby storage is full). We need global load-leveling
to achieve true grid-based storage and managing the namespace
outside the hosts is part of that; so we put the namespace and
global attributes in LDAP, let it handle metadata locality and let
OSDs handle the rest. Just my $.02
Back to top
 
Post new topic   Reply to topic    CASTalk.com Forum Index -> Storage System All times are GMT
Goto page 1, 2  Next
Page 1 of 2

 
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum




VoIP Electronics Powered by phpBB