Varying content encoding

    Varying content encoding  
Arto Erkinantti


07-28-04 11:09 PM

We have a problem: the content in our CMS 2002 SP1 web site(s) appears to
have ended up with varying encodings. This causes trouble for content with
special characters and/or multilingual content. One posting can be UTF-8
while another is ASCII.
We do not yet know exactly how this happened and will investigate further,
but the input is ordinary authoring via vanilla HtmlPlaceholder controls or
the Authoring Connector. Should it not all end up as UTF-8?

The problem went unnoticed because the content generally displays without
problems. However, the search engine we use does not recognize some encodings
of special characters (even though they are valid content in the given
encoding), and thus misses some of the search hits for a search phrase
containing special characters.
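
To illustrate the mismatch, here is a minimal sketch in Python (not MCMS
code). The sample word and the `byte_forms` helper are made up for
illustration, and it assumes the search engine compares raw bytes, which may
not be exactly how ours works:

```python
def byte_forms(text: str) -> dict[str, bytes]:
    """Return the bytes one string takes under three possible storage forms."""
    return {
        "utf-8": text.encode("utf-8"),
        "latin-1": text.encode("latin-1"),
        # ASCII-only storage, non-ASCII written as numeric character references
        "ascii+ncr": text.encode("ascii", errors="xmlcharrefreplace"),
    }


# Hypothetical sample word; \u00e4 is "a with diaeresis".
word = "Jyv\u00e4skyl\u00e4"
forms = byte_forms(word)

# A search phrase submitted as UTF-8 only matches the UTF-8 copy.
query = word.encode("utf-8")
hits = {name: query in stored for name, stored in forms.items()}
```

All three stored forms display the same word in a browser, but only the
UTF-8 copy contains the bytes of the UTF-8 query.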

I guess we need to make sure that all content we get via HtmlPlaceholder
controls and the Word Authoring Connector is encoded uniformly. It does not
seem sufficient just to carry the correct meta tags for each encoding, since
the search engine can't deal with them all. How can we do this? (We can't
quickly upgrade the search engine right now.)
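
As a rough idea of what "encoded uniformly" could mean in practice, here is
a sketch in Python of a normalization step. It is an assumption on my part,
not an MCMS feature: `normalize_to_utf8` is a hypothetical helper that takes
the raw bytes plus the encoding they are known to be in, expands character
references for non-ASCII characters, and re-encodes everything as UTF-8:

```python
import re
from html.entities import name2codepoint

# Matches &#228; / &#xE4; / &auml; style character references.
_CHAR_REF = re.compile(r"&(#[0-9]+|#[xX][0-9a-fA-F]+|[A-Za-z][A-Za-z0-9]*);")


def _expand(match: re.Match) -> str:
    body = match.group(1)
    if body[0] == "#":
        digits = body[1:]
        code = int(digits[1:], 16) if digits[0] in "xX" else int(digits)
    else:
        code = name2codepoint.get(body)
    # Leave unknown names, markup-significant ASCII (&amp;, &lt;, ...),
    # C1 control codes, surrogates and out-of-range values untouched.
    if code is None or code < 0xA0 or code > 0x10FFFF:
        return match.group(0)
    if 0xD800 <= code <= 0xDFFF:
        return match.group(0)
    return chr(code)


def normalize_to_utf8(raw: bytes, declared_encoding: str) -> bytes:
    """Decode with the known source encoding and return uniform UTF-8."""
    text = raw.decode(declared_encoding)
    return _CHAR_REF.sub(_expand, text).encode("utf-8")
```

With this, an ASCII posting containing `&auml;` and a Latin-1 posting
containing the raw byte for the same character would both come out as the
same UTF-8 bytes, while `&amp;` and `&lt;` stay escaped so the markup is not
broken.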

Any suggestions/better ideas ;)

-Arto

    Re: Varying content encoding  
Stefan [MSFT]


07-28-04 11:09 PM

Hi Arto,

Yes, you should ensure that all content is UTF-8 encoded when it is uploaded
to the CMS repository.

Cheers,
Stefan.

--
This posting is provided "AS IS" with no warranties, and confers no rights.

MCMS FAQ:
http://download.microsoft.com/downl...>
MCMS+2002+-+(complete)+FAQ.htm
MCMS Blog: http://blogs.msdn.com/stefan_gossner/category/4983.aspx
MCMS Sample Code:
http://www.gotdotnet.com/community/...nagement+Server
MCMS Whitepapers and other docs:
http://blogs.msdn.com/stefan_gossne...2/07/41859.aspx
--------------------------------

    Re: Varying content encoding  
Arto Erkinantti


07-28-04 11:09 PM

Thanks, Stefan. Could you please advise on the suggested/best practice here,
or give links to documentation on how we could ensure or enforce UTF-8
encoding?

-Arto

    Re: Varying content encoding  
Stefan [MSFT]


07-28-04 11:09 PM

Hi Arto,

Ensure that the encoding in the web.config files of all projects is UTF-8.

Cheers,
Stefan.


    Re: Varying content encoding  
Arto Erkinantti


07-28-04 11:09 PM

We have

<globalization requestEncoding="utf-8" responseEncoding="utf-8" />
in the web.config files of all projects. Should this be sufficient?
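
A quick way to double-check this across many projects might be a small
script like the sketch below. It is in Python and is not part of MCMS or
ASP.NET; the `globalization_is_utf8` helper is hypothetical, and it assumes
the web.config text has already been read and contains no XML namespace:

```python
import codecs
import xml.etree.ElementTree as ET


def globalization_is_utf8(web_config_xml: str) -> bool:
    """True if <globalization> sets both encodings to UTF-8 (any spelling)."""
    root = ET.fromstring(web_config_xml)
    element = next(root.iter("globalization"), None)
    if element is None:
        return False
    for attribute in ("requestEncoding", "responseEncoding"):
        value = element.get(attribute)
        if value is None:
            return False
        try:
            # codecs.lookup normalizes aliases such as "UTF8" or "utf_8".
            if codecs.lookup(value).name != "utf-8":
                return False
        except LookupError:
            return False
    return True


sample = (
    "<configuration><system.web>"
    '<globalization requestEncoding="utf-8" responseEncoding="utf-8" />'
    "</system.web></configuration>"
)
ok = globalization_is_utf8(sample)
```

This only checks the configuration; it says nothing about content that is
already stored in the repository.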

-Arto

    Re: Varying content encoding  
Stefan [MSFT]


07-28-04 11:09 PM

Hi Arto,

This is more of an ASP.NET question, but as far as I know this should be
sufficient.

Cheers,
Stefan.
