|
Home > Archive > Microsoft Content Management Server > July 2004 > Varying content encoding
You are viewing an archived Text-only version of the thread.
To view this thread in it's original format and/or if you want to reply to
this thread please [click here]
| Author |
Varying content encoding
|
|
| Arto Erkinantti 2004-07-28, 6:09 pm |
| We have a problem in that it appears that the content in our CMS 2002 sp1
web site(s) has ended up with varying encodings. This becomes a problem with
content with special characters and/or multilingual content. A posting can
have utf-8, but second posting might have ascii.
How come, not exactly known, will investigate further, but input is usual
authoring via vanilla htlmplaceholders or auth. connector, should it not all
end up as utf-8...?
This problem has developed as there are generally no problems when viewing
the content. However the search engine that has been used will not recognize
some encodings for special characters (while they should be valid as content
as such at a given encoding) and thus will miss part of the seach hits for a
given search phrase containing special chars.
I guess that we need to make sure that all content that we get via
HtmlPlaceholder controls and Word auth. conn. is encoded uniformly (it seems
not be sufficient to just to carry correct meta tags for each encoding since
the search engine can't deal with them all) - how to? (This while we can't
quickly upgrade the search engine right now.)
Any suggestions/better ideas ;)
-Arto
| |
| Stefan [MSFT] 2004-07-28, 6:09 pm |
| Hi Arto,
yes you should ensure that all content is UTF-8 encoded when uploaded to the
CMS repository.
Cheers,
Stefan.
--
This posting is provided "AS IS" with no warranties, and confers no rights.
MCMS FAQ:
http://download.microsoft.com/downl...6a/MCMS+2002+-+(complete)+FAQ.htm
MCMS Blog: http://blogs.msdn.com/stefan_gossner/category/4983.aspx
MCMS Sample Code:
http://www.gotdotnet.com/community/...t+S
erver
MCMS Whitepapers and other docs:
http://blogs.msdn.com/stefan_gossne...2/07/41859.aspx
--------------------------------
"Arto Erkinantti" <arto_e@spamoff.hotmail.com> wrote in message
news:OU9CSnwcEHA.1000@TK2MSFTNGP12.phx.gbl...
> We have a problem in that it appears that the content in our CMS 2002 sp1
> web site(s) has ended up with varying encodings. This becomes a problem
with
> content with special characters and/or multilingual content. A posting can
> have utf-8, but second posting might have ascii.
> How come, not exactly known, will investigate further, but input is usual
> authoring via vanilla htlmplaceholders or auth. connector, should it not
all
> end up as utf-8...?
>
> This problem has developed as there are generally no problems when viewing
> the content. However the search engine that has been used will not
recognize
> some encodings for special characters (while they should be valid as
content
> as such at a given encoding) and thus will miss part of the seach hits for
a
> given search phrase containing special chars.
>
> I guess that we need to make sure that all content that we get via
> HtmlPlaceholder controls and Word auth. conn. is encoded uniformly (it
seems
> not be sufficient to just to carry correct meta tags for each encoding
since
> the search engine can't deal with them all) - how to? (This while we can't
> quickly upgrade the search engine right now.)
>
> Any suggestions/better ideas ;)
>
> -Arto
>
>
| |
| Arto Erkinantti 2004-07-28, 6:09 pm |
| Thanks Stefan. Could you please advice on suggested/best practice here or
give links to documentation on how we could ensure or enforce UTF-8 encoding
?
-Arto
"Stefan [MSFT]" <stefang@online.microsoft.com> wrote in message
news:OcyE$1wcEHA.644@tk2msftngp13.phx.gbl...
> Hi Arto,
>
> yes you should ensure that all content is UTF-8 encoded when uploaded to
the
> CMS repository.
>
> Cheers,
> Stefan.
>
> --
> This posting is provided "AS IS" with no warranties, and confers no
rights.
>
> MCMS FAQ:
>
http://download.microsoft.com/downl...6a/MCMS+2002+-+(complete)+FAQ.htm
> MCMS Blog: http://blogs.msdn.com/stefan_gossner/category/4983.aspx
> MCMS Sample Code:
>
http://www.gotdotnet.com/community/...t+S
erver
> MCMS Whitepapers and other docs:
> http://blogs.msdn.com/stefan_gossne...2/07/41859.aspx
> --------------------------------
>
>
> "Arto Erkinantti" <arto_e@spamoff.hotmail.com> wrote in message
> news:OU9CSnwcEHA.1000@TK2MSFTNGP12.phx.gbl...
sp1[vbcol=seagreen]
> with
can[vbcol=seagreen]
usual[vbcol=seagreen]
> all
viewing[vbcol=seagreen]
> recognize
> content
for[vbcol=seagreen]
> a
> seems
> since
can't[vbcol=seagreen]
>
>
| |
|
|
|
|
|
|
|
|
|