Ticket #116 (closed enhancement: invalid)

Opened 3 months ago

Last modified 2 months ago

No interpretation for character entities

Reported by: Dom's Assigned to: jf.hovinne
Priority: minor Milestone:
Component: Editor Version: trunk
Keywords: character entities Cc:

Description

Well, this is a special case.

I am currently working on a kung fu website mixing french and chinese langage. Main langage is french and chinese' is used to name the kung fu technics : Wing Tsun Kung Fu Toulouse

I join 3 screenshots to well described this feature request.

Picture 1

I used the HTML view to insert the chinese ideograms with chinese unicode character : 詠 and 春
For example purposes I then add two html character entities : ¨

Here is the sentence I then wrote :

<p>&#168;Wing Tsun Kung Fu&#168; : &#35424;&#26149;</p>

Picture 1 show both the HTML view and the result view : everything is fine

Picture 2

This show the field content transmitted by WYMeditor after submiting form. Notice the chinese unicode characters and HTML entities are both well transmitted.

Picture 3

The WYMeditor result view show us the chinese unicode character are misunterpreted : they are replaced by ? character. The wrong behaviour seems to come from database (MySQL in this case).

Feature request

One simple way to solve this problem is for WYMeditor to transmit the HTML code in place of some interpreted code. In this case :

<p>&#168;Wing Tsun Kung Fu&#168; : &#35424;&#26149;</p>

in place of

¨Wing Tsun Kung Fu¨ : 詠春

This could be done by a boolean.

This appear simple to me but I guess it could be not !

Attachments

PICTURE_1.jpg (46.9 kB) - added by Dom's on 08/31/08 13:42:47.
PICTURE_2.jpg (21.1 kB) - added by Dom's on 08/31/08 13:43:45.
PICTURE_3.jpg (40.7 kB) - added by Dom's on 08/31/08 13:44:07.

Change History

08/31/08 13:42:47 changed by Dom's

  • attachment PICTURE_1.jpg added.

08/31/08 13:43:45 changed by Dom's

  • attachment PICTURE_2.jpg added.

08/31/08 13:44:07 changed by Dom's

  • attachment PICTURE_3.jpg added.

09/10/08 22:21:31 changed by jf.hovinne

  • status changed from new to closed.
  • resolution set to invalid.

I think you should configure MySQL and PHP to use UTF-8 charset, and utf8_unicode_ci as the connection collation, as well as for the DB collation.

You'll then be able to store and retrieve UTF-8 characters without the need to convert them to numeric character references.

FYI, these references aren't interpreted by WYMeditor.

It's also possible (but useless if you use UTF-8) to convert the characters at the server-side. See: charset_decode_utf_8

I close the ticket as invalid but please feel free to reopen it.