FW: [Tikiwiki-users] Problem with character encoding (bug)
Marc Laporte 26 May 2003 08:16:56
Since this is dev-related, I am forwarding to the dev-list.
Thanks Gil for this detailed explanation.
M
-----Original Message-----
From: tikiwiki-users-admin-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org [mailto:tikiwiki-users-admin-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org] On Behalf Of Gil Dev
Sent: May 26, 2003 1:42 AM
To: Marc Laporte; 'Inge Solvoll'
Cc: tikiwiki-users-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org
Subject: RE: [Tikiwiki-users] Problem with character encoding (bug)
Hi Marc,
Let me summarize the charset issue once more.
In general, Tiki works with UTF-8 encoding, which enables it to process and display characters from multiple languages (not just Western European) within any site. UTF-8 is an encoding of the Unicode character set and is usually considered the preferred encoding for the web.
Charset problems usually arise for one of three reasons:
1. Wrong setting of HTTP headers, e.g. setting Apache's AddDefaultCharset option in httpd.conf to ISO-8859-1, which overrides Tiki's own UTF-8 declaration. The browser then displays Tiki's strings incorrectly (everything outside the ASCII range, i.e. everything but plain English) and sends data back to Tiki that is not UTF-8 encoded, thus polluting the DB.
2. Wrong charset of the language.php file. If a translation is saved not in UTF-8 but in some language-specific encoding, then even when everything else is correct (no wrong HTTP headers, etc.), the translation strings will be displayed incorrectly, while the rest of the site and input processing will work fine. I emphasize that the language.php file should be saved UTF-8 encoded; this usually needs to be done and verified by the translator, who also has the local charset installed on his or her machine. A good tool for this is the SciTE editor, for example.
3. Wrong charset of input data. This is a bit of a guess regarding Inge's report, since it relates to data entry from external sources. By now you can guess that such data needs to be UTF-8 encoded to be consistent with the rest of Tiki's data. I am not sure whether Tiki validates external input as UTF-8 and converts it otherwise (my guess is that it does not). This matters especially for the mail-in option, because many mail clients are not configured to encode using UTF-8.
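To make points 2 and 3 concrete, here is a minimal sketch in Python (not Tiki's own code, which is PHP) of the kind of check that could be applied to a translation file or to external input: accept the bytes if they are already valid UTF-8, otherwise convert them from an assumed source charset. The function name to_utf8 and the ISO-8859-1 fallback are assumptions made for illustration only. Note that a few non-UTF-8 byte sequences happen to be valid UTF-8 by coincidence, so a check like this is a heuristic, not a guarantee.

```python
def to_utf8(data: bytes, fallback: str = "iso-8859-1") -> bytes:
    """Return data as UTF-8 bytes.

    If data already decodes as UTF-8 it is returned unchanged;
    otherwise it is assumed to be in the fallback charset and converted.
    """
    try:
        data.decode("utf-8")  # validation only, result is discarded
        return data
    except UnicodeDecodeError:
        return data.decode(fallback).encode("utf-8")


# The same Norwegian word as it would arrive in each encoding.
word_latin1 = "blåbær".encode("iso-8859-1")
word_utf8 = "blåbær".encode("utf-8")

# Both inputs end up as identical UTF-8 bytes.
converted = to_utf8(word_latin1)
unchanged = to_utf8(word_utf8)
```

The same function could be run over the raw bytes of a language.php file before uploading it, to confirm the file really was saved as UTF-8.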
I still think UTF-8 is a good choice, as it provides a common environment for all users and developers. Using local charsets would potentially cause more support problems, as some problems would be even more difficult to track down. There may be some bug reports that need to be opened regarding charset processing in mail-in etc., but the most important thing to remember about charsets and Tiki for now is that as long as EVERYTHING is UTF-8 encoded, everything is fine. You just need to make sure you are being consistent.
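To show what inconsistency looks like in practice, the following small Python sketch reproduces the two typical symptoms: UTF-8 bytes read as ISO-8859-1 (what a browser does when the HTTP header overrides the page's charset), and ISO-8859-1 bytes read as UTF-8 (what happens when non-UTF-8 data lands in an otherwise UTF-8 site). The sample word is arbitrary.

```python
text = "français"

# Case 1: correct UTF-8 bytes, but the reader assumes ISO-8859-1.
utf8_bytes = text.encode("utf-8")
misread_as_latin1 = utf8_bytes.decode("iso-8859-1")

# Case 2: ISO-8859-1 bytes, but the reader assumes UTF-8.
latin1_bytes = text.encode("iso-8859-1")
try:
    latin1_bytes.decode("utf-8")
    valid_utf8 = True
except UnicodeDecodeError:
    valid_utf8 = False

# Case 3: everything consistent, the text round-trips unchanged.
round_trip = utf8_bytes.decode("utf-8")

print(misread_as_latin1)  # franÃ§ais
print(valid_utf8)         # False
print(round_trip)         # français
```

In the first case each accented character turns into two wrong characters; in the second the bytes are not valid UTF-8 at all, which is why such data is hard to repair once it is mixed into the DB.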
Gil
-----Original Message-----
From: Marc Laporte [mailto:marc-5VF8g4rcdUm22ymzAWvf8A@public.gmane.org]
Sent: Monday, May 26, 2003 2:56 AM
To: 'Inge Solvoll'; tikiwiki-users-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org
Cc: gilshwartz-Rn4VEauK+AKRv+LV9MX5uipxlwaOVQ5f@public.gmane.org
Subject: RE: [Tikiwiki-users] Problem with character encoding (bug)
Hi!
It's the same problem with French characters.
I've had success in the past by converting & uploading the language.php file in UTF-8 format. However, Al did this for the demo site and it doesn't work.
-----Original Message-----
From: tikiwiki-users-admin-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org [mailto:tikiwiki-users-admin-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org] On Behalf Of Inge Solvoll
Sent: May 25, 2003 4:25 PM
To: tikiwiki-users-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org
Subject: [Tikiwiki-users] Problem with character encoding (bug)
The character encoding for country-specific characters gets messed up sometimes in Tiki.
- When using the mailin-functionality, the Norwegian characters in the mail are viewed incorrectly when inserted into the Wiki page.
- This also happens when I use this script for importing a news box from a newspaper: <script src="http://www.vg.no/export/JS/jsalle.js"></script>
I insert the script tag in a module that I've made. All non-English characters are messed up.
Has anyone else discovered this problem? I don't think it is a problem with my server configuration, because the character encoding is correct when the text is entered directly by the user rather than imported.
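For the mail-in case, a hedged illustration of what correct handling might look like: the sketch below uses Python's standard email package (not Tiki's actual mail-in code) to read the charset a message declares in its Content-Type header and re-encode the body as UTF-8 before it would be stored. The sample message, addresses and the function name mail_body_as_utf8 are invented for the example, and only single-part text messages are handled.

```python
from email import message_from_bytes
from email.policy import default

# A hypothetical incoming mail whose body is ISO-8859-1, quoted-printable.
raw_mail = (
    b"From: sender@example.org\r\n"
    b"To: wiki@example.org\r\n"
    b"Subject: Test page\r\n"
    b"MIME-Version: 1.0\r\n"
    b"Content-Type: text/plain; charset=iso-8859-1\r\n"
    b"Content-Transfer-Encoding: quoted-printable\r\n"
    b"\r\n"
    b"bl=E5b=E6r og r=F8mme\r\n"
)


def mail_body_as_utf8(raw: bytes) -> bytes:
    """Decode a single-part text mail using its declared charset
    and return the body re-encoded as UTF-8."""
    msg = message_from_bytes(raw, policy=default)
    text = msg.get_content()  # honours the charset parameter of Content-Type
    return text.encode("utf-8")


body = mail_body_as_utf8(raw_mail)
```

If the import step skips this conversion and stores the raw bytes, the Norwegian characters would show up garbled in an otherwise UTF-8 page, which matches the symptom described above.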
Inge
------------------------------------------------------- This SF.net email is sponsored by: ObjectStore. If flattening out C++ or Java code to make your application fit in a relational database is painful, don't do it! Check out ObjectStore. Now part of Progress Software. http://www.objectstore.net/sourceforge _______________________________________________ Tikiwiki-users mailing list Tikiwiki-users-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org https://lists.sourceforge.net/lists/listinfo/tikiwiki-users