ILM and Full Text Search
Web Server forum
Back To The Forum Home!Search!Private Messaging System

Web Server Talk Web Server Talk > WebserverTalk Community > Data Storage > ILM and Full Text Search




Pages (2): [1] 2 »   Last Thread   Next Thread Next
  Show Printable Version Email this Page Subscribe to this Thread      Post New Thread    Post A Reply      

    ILM and Full Text Search  
ron.lindman@gmail.com


View Ip Address Report This Message To A Moderator Edit/Delete Message


 
01-31-07 12:15 AM

Hello,

I'm looking into various ILM products such as those from Kazeon, EMC,
NeoPath, etc. One question that comes up is how these products behave
when a client does a full-text search against a volume that contains
data that's been migrated away.

>From what I understand, a file access causes many of these products to
bring the file back from a secondary tier. I know that some ILM API's
allow for redirection, which would seemingly avoid this issue.
However, others do not have redirection. Wouldn't this mean that a
full-text search causes the entire set of data to be brought back onto
the primary tier? Doesn't this cause capacity issues?

What am I missing? Your help is greatly appreciated.

Thanks,
Ron






[ Post a follow-up to this message ]



    Re: ILM and Full Text Search  
Nik Simpson


View Ip Address Report This Message To A Moderator Edit/Delete Message


 
01-31-07 12:15 AM

ron.lindman@gmail.com wrote:
> Hello,
>
> I'm looking into various ILM products such as those from Kazeon, EMC,
> NeoPath, etc. One question that comes up is how these products behave
> when a client does a full-text search against a volume that contains
> data that's been migrated away.
> 
> bring the file back from a secondary tier. I know that some ILM API's
> allow for redirection, which would seemingly avoid this issue.
> However, others do not have redirection. Wouldn't this mean that a
> full-text search causes the entire set of data to be brought back onto
> the primary tier? Doesn't this cause capacity issues?
>
> What am I missing? Your help is greatly appreciated.

Typically, a content search is performed against a content index, not
against the original file, so the search doesn't touch the file at all.
The file is read during the indexing process, if that occurs before
migration then the file will not be hit after migration.


PS. If you looking at this space you should also take a look at Scentric
(FTR I work for Scentric, well at least for another ten days :-)

--
Nik Simpson





[ Post a follow-up to this message ]



    Re: ILM and Full Text Search  
dvymiller@yahoo.com


View Ip Address Report This Message To A Moderator Edit/Delete Message


 
02-01-07 06:13 PM

On Jan 30, 2:47 pm, Nik Simpson <n_simp...@bellsouth.net> wrote:
> ron.lind...@gmail.com wrote: 
> 
> 
> 
>
> Typically, a content search is performed against a content index, not
> against the original file, so the search doesn't touch the file at all.
> The file is read during the indexing process, if that occurs before
> migration then the file will not be hit after migration.
>
> PS. If you looking at this space you should also take a look at Scentric
> (FTR I work for Scentric, well at least for another ten days :-)
>
> --
> Nik Simpson

What happens when someone opens Windows file explorer and performs a
search through it's search tool?  Wont it try and read all the files
off of the NAS and to the OPs point, wont it cause all the files to be
moved from tier II to tier I again?

Dvy






[ Post a follow-up to this message ]



    Re: ILM and Full Text Search  
Nik Simpson


View Ip Address Report This Message To A Moderator Edit/Delete Message


 
02-02-07 12:13 AM

dvymiller@yahoo.com wrote:
>
>
> What happens when someone opens Windows file explorer and performs a
> search through it's search tool?  Wont it try and read all the files
> off of the NAS and to the OPs point, wont it cause all the files to be
> moved from tier II to tier I again?
>

ON XP, Think the answer would be yes, unless the ILM solution is smart
and recognizes the type of access as being something it should not
migrate for. In Vista (or if you are using something like Google Desktop
search which maintains an index this should not be such a big problem.
--
Nik Simpson





[ Post a follow-up to this message ]



    Re: ILM and Full Text Search  
bcwalrus


View Ip Address Report This Message To A Moderator Edit/Delete Message


 
02-02-07 12:13 AM

On Jan 30, 2:01 pm, "ron.lind...@gmail.com" <ron.lind...@gmail.com>
wrote:
> Hello,
>
> I'm looking into various ILM products such as those from Kazeon, EMC,
> NeoPath, etc. One question that comes up is how these products behave
> when a client does a full-text search against a volume that contains
> data that's been migrated away.
> 
>
> bring the file back from a secondary tier. I know that some ILM API's
> allow for redirection, which would seemingly avoid this issue.
> However, others do not have redirection. Wouldn't this mean that a
> full-text search causes the entire set of data to be brought back onto
> the primary tier? Doesn't this cause capacity issues?
>
> What am I missing? Your help is greatly appreciated.
>
> Thanks,
> Ron

Not for the NeoPath FileDirector. They redirect access traffic to the
migration destination. If you access it frequent enough, then
depending on how you set up the placement policy, data may be migrated
back to the primary tier. Or you can set up your policy not to do
that. In other words, data access and data placement policy are
independent.

(I happen to be the NFS guy at NeoPath.)

Cheers,
bc






[ Post a follow-up to this message ]



    Re: ILM and Full Text Search  
Faeandar


View Ip Address Report This Message To A Moderator Edit/Delete Message


 
02-02-07 12:13 AM

On 30 Jan 2007 14:01:52 -0800, "ron.lindman@gmail.com"
<ron.lindman@gmail.com> wrote:

>Hello,
>
>I'm looking into various ILM products such as those from Kazeon, EMC,
>NeoPath, etc. One question that comes up is how these products behave
>when a client does a full-text search against a volume that contains
>data that's been migrated away.
> 
>bring the file back from a secondary tier. I know that some ILM API's
>allow for redirection, which would seemingly avoid this issue.
>However, others do not have redirection. Wouldn't this mean that a
>full-text search causes the entire set of data to be brought back onto
>the primary tier? Doesn't this cause capacity issues?
>
>What am I missing? Your help is greatly appreciated.
>
>Thanks,
>Ron


So, since we have two people from companies in this space I'd like to
pose the competitive question:

What are your thoughts on Index Engines?

Thanks.

~F





[ Post a follow-up to this message ]



    Re: ILM and Full Text Search  
Nik Simpson


View Ip Address Report This Message To A Moderator Edit/Delete Message


 
02-02-07 06:16 AM

Faeandar wrote:
>
> So, since we have two people from companies in this space I'd like to
> pose the competitive question:
>
> What are your thoughts on Index Engines?
>


First, right now I would not see Index Engines as a direct competitor,
they are purely a search application and don't offer much in the way of
classification or policy-based data management which is needed for ILM.

Second for enterprise wide search the problem is that when I'm looking
for document X, I'd rather find it on disk than buried on a backup tape.
If I can't find it online, then I'd go backup tape. So other than as an
application for helping me keep better track of what I've backed up I
don't see much of a future for it.

Interesting technology that I suspect will get embedded in things like
VTLs and D2D disk backup appliances. I don't see it as a standalone
technology. Good acquisition candidate for somebody in that space.

--
Nik Simpson





[ Post a follow-up to this message ]



    Re: ILM and Full Text Search  
dvymiller@yahoo.com


View Ip Address Report This Message To A Moderator Edit/Delete Message


 
02-02-07 06:16 AM

On Feb 1, 5:54 pm, Nik Simpson <n_simp...@bellsouth.net> wrote:
> Faeandar wrote:
> 
> 
>
> First, right now I would not see Index Engines as a direct competitor,
> they are purely a search application and don't offer much in the way of
> classification or policy-based data management which is needed for ILM.
>
> Second for enterprise wide search the problem is that when I'm looking
> for document X, I'd rather find it on disk than buried on a backup tape.
> If I can't find it online, then I'd go backup tape. So other than as an
> application for helping me keep better track of what I've backed up I
> don't see much of a future for it.
>
> Interesting technology that I suspect will get embedded in things like
> VTLs and D2D disk backup appliances. I don't see it as a standalone
> technology. Good acquisition candidate for somebody in that space.
>
> --
> Nik Simpson

Where does the google search appliance fit into this?

Dvy






[ Post a follow-up to this message ]



    Re: ILM and Full Text Search  
Faeandar


View Ip Address Report This Message To A Moderator Edit/Delete Message


 
02-02-07 06:16 AM

On Thu, 01 Feb 2007 20:54:20 -0500, Nik Simpson
<n_simpson@bellsouth.net> wrote:

>Faeandar wrote: 
>
>
>First, right now I would not see Index Engines as a direct competitor,
>they are purely a search application and don't offer much in the way of
>classification or policy-based data management which is needed for ILM.
>
>Second for enterprise wide search the problem is that when I'm looking
>for document X, I'd rather find it on disk than buried on a backup tape.
>If I can't find it online, then I'd go backup tape. So other than as an
>application for helping me keep better track of what I've backed up I
>don't see much of a future for it.
>
>Interesting technology that I suspect will get embedded in things like
>VTLs and D2D disk backup appliances. I don't see it as a standalone
>technology. Good acquisition candidate for somebody in that space.


They can get metadata directly from NDMP dumps.  If someone figures
out how to flag the dump to only pass the metadata then they will be
able to get an entire storage array's metadata in a matter of hours
instead of days that file crawlers will take.
Even without the flag they still get data far faster than any file
crawler.

I may have been asking far too open ended a question.  My needs are
fairly simple; tell me what, where, how big, how frequently accessed,
what type of file, etc.  I've no need for a deep dive of content.

I'm looking for typical SRM stats, but on a fair scale.

Hopefully this provides more to go on.

Thanks.

~F





[ Post a follow-up to this message ]



    Re: ILM and Full Text Search  
Nik Simpson


View Ip Address Report This Message To A Moderator Edit/Delete Message


 
02-03-07 12:13 PM

Faeandar wrote:
>
>
> They can get metadata directly from NDMP dumps.  If someone figures
> out how to flag the dump to only pass the metadata then they will be
> able to get an entire storage array's metadata in a matter of hours
> instead of days that file crawlers will take.
> Even without the flag they still get data far faster than any file
> crawler.
>

Yes, they could do that, but then so could every other competitor, NDMP
is available to anybody, not just Index Engines. EMC does something
similar, though probably proprietary with it's classification product
which gets a "dump" of metadata from Celerra file servers rather walking
the file system over the network.

> I may have been asking far too open ended a question.  My needs are
> fairly simple; tell me what, where, how big, how frequently accessed,
> what type of file, etc.  I've no need for a deep dive of content.

Index Engines wouldn't be a solution then, since to the best of my
knowledge it's all about content indexing & search. However, both
Scentric and Kazeon can do what you want without having to generate a
content index.

>
> I'm looking for typical SRM stats, but on a fair scale.

So you don't actually want to take any actions like migrating little
used stuff to tier2? Anyway, both Scentric and Kazeon offer extensive
SRM reporting, though if reporting is all you want, you might want to
take a look at Monosphere which has a pure file SRM solution. How big is
a "fair scale" to you, 10s, 100s, 1000s of TB?

If you do want to take actions, then a policy engine is something you
want to look at. I can't speak for Kazeon's policy engine, but Scentric
lets you build policies with classification rules. For example "find all
OFFICE files larger than 50MB, & not accessed in 30days" which can be
combined with one or more actions that work on the results of the
filter. Actions include move, copy, delete, script, archive with
retention, etc.

You can schedule these policies on a calendar or event trigger (i.e.
once a week, or when file system has less than 20% free), you can also
trigger them from external scripts.



--
Nik Simpson





[ Post a follow-up to this message ]



    Sponsored Links  




 





   All times are GMT. The time now is 09:08 PM.      Post New Thread    Post A Reply      
Pages (2): [1] 2 »   Last Thread   Next Thread Next


Most Popular forums 

Forum Jump:
Rate This Thread:

Forum Rules:
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts
HTML code is OFF
vB code is ON
Smilies are ON
[IMG] code is OFF
 
Medical and Health forum | Computer Games Reviews | Graphics design forum

Back To The Top
Home | Usercp | Faq | Register