Problem crawling CMS2002 site with SPS2003

    Problem crawling CMS2002 site with SPS2003  
Hugo Apolónio




 
09-07-04 12:48 PM

Hi.

I've created a very simple site in CMS2002 to try searching it using
SPS2003 services. It has only one channel, one template and one post -
the homepage.

In SPS I've set a "non portal content source" to
"http://localhost/my_cms_site", and added
"http://localhost/my_cms_site/*" to "Exclude and Include Content" with
my domain account (which is an admin in the server, and is the same
user used in all the installations and configurations performed).

The problem is that when I run a "full update" on this source, nothing
gets indexed (and a search through the portal site returns nothing). In
the "gatherer log" I get the following messages:

----
9/6/2004 5:11:25 PM Add Content source URL:
<a>http://localhost/my_cms_site</a>
The Full update is complete

9/6/2004 5:11:23 PM Add <a>http://localhost/other_site</a>
The address could not be found, (0x80041201 - The object was not
found. )

9/6/2004 5:11:14 PM Add <a>Sts2://localhost/other_site/webid=008/</a>
The address could not be found, (0x80041201 - The object was not
found. )

9/6/2004 5:11:10 PM Add
<a>Sts2://localhost/my_cms_site/webid=009/</a>
The address could not be found, (0x80041201 - The object was not
found. )

9/6/2004 5:11:05 PM Add Content source URL:
<a>http://localhost/my_cms_site</a>
The Full update was started
----

When I click on the link "Sts2://localhost/my_cms_site/webid=009/" it
redirects to "http://localhost/my_cms_site" and shows the site ok.

What could be happening? What am I missing here?


I tried indexing the WoodgroveIntranet sample site and it went fine,
except that the links returned in the search results were in the form
"http://localhost/NR/exeres/54940CA0-879C-4F01-927B-F7E18C175B8E,frameless.htm"
(where they should be "http://localhost/WoodgroveIntranet/About+Us/").

I can't find any difference between the configuration of this
Woodgrove site and that of my CMS test site.

By the way, all apps are on the same box: IIS 6, SQL Server 2000 SP3,
SPS2003, MCMS2002 SP1a, Connector, VS.NET 2003; and they all use the same
"default web site".

Thanks in advance for any pointers.

Hugo








    Re: Problem crawling CMS2002 site with SPS2003  
Stefan [MSFT]




 
09-07-04 12:48 PM

Hi Hugo,

Did you check the IIS log on the MCMS server machine?
What is logged there for the crawler's requests?
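
One way to pull the crawler's requests out of an IIS log is sketched below. It assumes the log is in W3C extended format with a "#Fields:" line that includes cs-uri-stem and cs(User-Agent). The "MS+Search" user-agent marker and the sample log lines are assumptions made up for illustration, not taken from this thread; adjust the marker to whatever the crawler actually sends.

```python
from typing import Dict, Iterable, Iterator, List


def parse_w3c_log(lines: Iterable[str]) -> Iterator[Dict[str, str]]:
    """Yield one dict per request line of a W3C extended log."""
    fields: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith("#Fields:"):
            # The #Fields directive names the columns of the lines that follow.
            fields = line[len("#Fields:"):].split()
            continue
        if line.startswith("#"):
            continue  # other directives (#Software, #Version, #Date)
        values = line.split()
        if fields and len(values) == len(fields):
            yield dict(zip(fields, values))


def crawler_requests(
    lines: Iterable[str],
    path_prefix: str = "/my_cms_site",
    agent_marker: str = "MS+Search",  # assumed marker, not confirmed in the thread
) -> Iterator[Dict[str, str]]:
    """Keep only requests under path_prefix whose user agent contains agent_marker."""
    for entry in parse_w3c_log(lines):
        uri = entry.get("cs-uri-stem", "").lower()
        agent = entry.get("cs(User-Agent)", "").lower()
        if uri.startswith(path_prefix.lower()) and agent_marker.lower() in agent:
            yield entry


# Invented sample lines, only to show the expected shape of the input.
sample = [
    "#Software: Microsoft Internet Information Services 6.0",
    "#Fields: date time cs-method cs-uri-stem sc-status cs(User-Agent)",
    "2004-09-06 17:11:10 GET /my_cms_site/ 200 Mozilla/4.0+(compatible;+MS+Search+Robot)",
    "2004-09-06 17:12:00 GET /my_cms_site/ 200 Mozilla/4.0+(compatible;+MSIE+6.0)",
]

for entry in crawler_requests(sample):
    print(entry["time"], entry["cs-method"], entry["cs-uri-stem"], entry["sc-status"])
```

To run it against a real log, pass an open file object instead of the sample list. If no matching lines turn up at all, the crawler probably never sent a plain HTTP request to the site.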

Cheers,
Stefan

--
This posting is provided "AS IS" with no warranties, and confers no rights.

MCMS FAQ:
http://download.microsoft.com/downl...>
MCMS+2002+-+(complete)+FAQ.htm
MCMS Blog: http://blogs.msdn.com/stefan_gossner/category/4983.aspx
MCMS Sample Code:
http://www.gotdotnet.com/community/...nagement+Server
MCMS Whitepapers and other docs:
http://blogs.msdn.com/stefan_gossne...2/07/41859.aspx



    Re: Problem crawling CMS2002 site with SPS2003  
Hugo Apolónio




 
09-08-04 12:48 PM

Hi there.

Found the problem source!

After removing the HTTP Header "MicrosoftSharePointTeamServices" from
the my_cms_site web app, the crawler started working fine.

This is described in the "MS CMS2002 Connector for SP Technologies Help"
(http://go.microsoft.com/fwlink/?LinkId=22907), under Operations >
Configuring your site for MCMS search > Indexing your MCMS site.

According to the docs, the excludeCMS.exe tool should have removed
that HTTP header, but it apparently did not in my case.
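
A quick way to see whether a site still returns that header is sketched below. The header name comes from this thread; the function names are hypothetical, and the example URL in the comment is the test site mentioned above. It assumes the site can be fetched without credentials; on an error status such as 401 it inspects the headers of the error response instead.

```python
from typing import Mapping
from urllib.error import HTTPError
from urllib.request import urlopen

HEADER = "MicrosoftSharePointTeamServices"


def has_sharepoint_header(headers: Mapping[str, str]) -> bool:
    """Return True if the header is present (HTTP header names are case-insensitive)."""
    wanted = HEADER.lower()
    return any(name.lower() == wanted for name in headers)


def check_url(url: str, timeout: float = 10.0) -> bool:
    """Fetch url and report whether the response carries the header."""
    try:
        with urlopen(url, timeout=timeout) as response:
            return has_sharepoint_header(dict(response.headers.items()))
    except HTTPError as err:
        # Error responses (for example 401) still carry response headers.
        return has_sharepoint_header(dict(err.headers.items()))


# Example call, to be run on the server itself:
#   check_url("http://localhost/my_cms_site")
```

If check_url returns True for the MCMS site, the header is still being sent and has to be removed by hand from the HTTP Headers settings of that web application in IIS.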

See u.




