Problem crawling CMS2002 site with SPS2003
|
|
| Hugo Apolónio 2004-09-07, 7:48 am |
| Hi.
I've created a very simple site in CMS2002 to try searching it using
SPS2003 services. It has only one channel, one template and one post -
the homepage.
In SPS I've set a "non portal content source" to
"http://localhost/my_cms_site", and added
"http://localhost/my_cms_site/*" to "Exclude and Include Content" with
my domain account (which is an admin in the server, and is the same
user used in all the installations and configurations performed).
The problem is that when I run a "full update" on this source, nothing
gets indexed (and searching through the portal site returns nothing). In
the "gatherer log" I get the following messages:
----
9/6/2004 5:11:25 PM Add Content source URL: http://localhost/my_cms_site
The Full update is complete

9/6/2004 5:11:23 PM Add http://localhost/other_site
The address could not be found, (0x80041201 - The object was not found. )

9/6/2004 5:11:14 PM Add Sts2://localhost/other_site/webid=008/
The address could not be found, (0x80041201 - The object was not found. )

9/6/2004 5:11:10 PM Add Sts2://localhost/my_cms_site/webid=009/
The address could not be found, (0x80041201 - The object was not found. )

9/6/2004 5:11:05 PM Add Content source URL: http://localhost/my_cms_site
The Full update was started
----
When I click on the link "Sts2://localhost/my_cms_site/webid=009/", it
redirects to "http://localhost/my_cms_site" and shows the site correctly.
What could be happening? What am I missing here?
I tried indexing the WoodgroveIntranet sample site and it went fine,
except that the links returned in the search results were in the
form "http://localhost/NR/exeres/54940CA0-879C-4F01-927B-F7E18C175B8E,frameless.htm"
(where it should be "http://localhost/WoodgroveIntranet/About+Us/").
I can't find any difference between the configuration of the
Woodgrove site and that of my CMS test site.
By the way, all apps are on the same box: IIS 6, SQL Server 2000 SP3,
SPS2003, MCMS2002 SP1a, Connector, VS.NET 2003; and they all use the same
"default web site".
Thanks in advance for any pointers.
Hugo
| |
| Stefan [MSFT] 2004-09-07, 7:48 am |
| Hi Hugo,
Did you check the IIS log on the MCMS server machine?
What is logged there for the crawl requests?
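
One way to do that check is to filter the IIS log for requests under the site's path. The following is only a rough sketch, assuming the log is in the W3C Extended Log File Format with a "#Fields:" directive that includes cs-uri-stem and sc-status; the function names are made up for illustration and which fields are actually logged depends on the IIS logging configuration.

```python
def parse_w3c_log(lines):
    """Yield one dict per request line, keyed by the names in the #Fields directive."""
    fields = []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith("#"):
            # Directive lines: only #Fields matters for parsing the entries.
            if line.startswith("#Fields:"):
                fields = line[len("#Fields:"):].split()
            continue
        values = line.split()
        # Skip entries that do not match the most recent field list.
        if fields and len(values) == len(fields):
            yield dict(zip(fields, values))


def requests_under(lines, path_prefix):
    """Return (cs-uri-stem, sc-status) pairs for requests below path_prefix."""
    prefix = path_prefix.lower()
    return [
        (entry.get("cs-uri-stem", ""), entry.get("sc-status", ""))
        for entry in parse_w3c_log(lines)
        if entry.get("cs-uri-stem", "").lower().startswith(prefix)
    ]
```

Calling requests_under(open(log_path), "/my_cms_site") on a log file of your choosing would list what the crawler (and anyone else) requested under the site, along with the status code IIS returned for each request.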
Cheers,
Stefan
--
This posting is provided "AS IS" with no warranties, and confers no rights.
MCMS FAQ:
http://download.microsoft.com/downl...6a/MCMS+2002+-+(complete)+FAQ.htm
MCMS Blog: http://blogs.msdn.com/stefan_gossner/category/4983.aspx
MCMS Sample Code:
http://www.gotdotnet.com/community/...t+Server
MCMS Whitepapers and other docs:
http://blogs.msdn.com/stefan_gossne...2/07/41859.aspx
| |
| Hugo Apolónio 2004-09-08, 7:48 am |
| Hi there.
Found the source of the problem!
After removing the HTTP header "MicrosoftSharePointTeamServices" from
the my_cms_site web app, the crawler started working fine.
This is covered in the "MS CMS2002 Connector for SP Technologies Help"
(http://go.microsoft.com/fwlink/?LinkId=22907), under Operations >
Configuring your site for MCMS search > Indexing your MCMS site.
According to the docs, the excludeCMS.exe tool should have removed
that HTTP header, but that didn't quite work here.
See you.
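
To see whether a site still sends that header, one can inspect the response headers directly. This is a minimal sketch using only the Python standard library; the helper names are hypothetical, and it assumes the URL can be requested without extra authentication handling (on an error response it simply falls back to the headers of that error).

```python
import urllib.error
import urllib.request

# Header name taken from the post above.
HEADER_NAME = "MicrosoftSharePointTeamServices"


def find_header(headers, name=HEADER_NAME):
    """Return the value of the named header (case-insensitive), or None if absent."""
    wanted = name.lower()
    for key, value in headers:
        if key.lower() == wanted:
            return value
    return None


def fetch_headers(url):
    """Return the response headers for url as a list of (name, value) pairs."""
    try:
        with urllib.request.urlopen(url) as resp:
            return list(resp.headers.items())
    except urllib.error.HTTPError as err:
        # Error responses (401, 404, ...) still carry headers worth inspecting.
        return list(err.headers.items())
```

For example, find_header(fetch_headers("http://localhost/my_cms_site")) returns the header's value if the site still sends it, and None once it has been removed.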