path: root/mjit_worker.c
author Martin Dürst <duerst@it.aoyama.ac.jp> 2019-07-14 10:58:50 +0900
committer Martin Dürst <duerst@it.aoyama.ac.jp> 2019-07-14 10:58:50 +0900
commit 369ff79394765ce198ac7cee872a8c739d895aaa
tree d373a8e2a3b835f981a85dbb0e25e93730fba776 /mjit_worker.c
parent ac2866005b96baf986072f86ecd3dfd887f2bda3
add encoding conversion from/to CESU-8
Add encoding conversion (transcoding) from UTF-8 to CESU-8 and back.
CESU-8 is an encoding similar to UTF-8, but encodes codepoints above
U+FFFF as two surrogates, these surrogates again being encoded as if
they were UTF-8 codepoints. This preserves the same binary sorting
order as in UTF-16. It is also somewhat similar (although not exactly
identical) to an encoding used internally by Java.

This completes issue #15995.

enc/trans/cesu_8.trans: Add encoding conversion from/to CESU-8
test/ruby/test_transcode.rb: Add tests for above
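
As an illustration of the byte layout described in the message, here is a minimal Ruby sketch that encodes a UTF-8 string as CESU-8 by hand. It is not the code from enc/trans/cesu_8.trans; the method name `utf8_to_cesu8` is hypothetical and exists only for this example, and the input is assumed to be valid UTF-8.

```ruby
# Hypothetical helper for illustration only (not part of the commit).
# Returns the CESU-8 bytes of a valid UTF-8 string as a binary String.
def utf8_to_cesu8(str)
  out = String.new # empty string with ASCII-8BIT (binary) encoding
  str.each_codepoint do |cp|
    if cp > 0xFFFF
      # Split the supplementary codepoint into a UTF-16 surrogate pair.
      v = cp - 0x10000
      high = 0xD800 + (v >> 10)
      low  = 0xDC00 + (v & 0x3FF)
      # Encode each surrogate with the 3-byte UTF-8 bit pattern.
      [high, low].each do |s|
        bytes = [0xE0 | (s >> 12), 0x80 | ((s >> 6) & 0x3F), 0x80 | (s & 0x3F)]
        out << bytes.pack("C*")
      end
    else
      # Codepoints up to U+FFFF are encoded exactly as in UTF-8.
      out << [cp].pack("U").b
    end
  end
  out
end

# U+10000 becomes the surrogates D800 DC00, each stored in three bytes.
p utf8_to_cesu8("\u{10000}").bytes.map { |b| format("%02X", b) }
```

A supplementary character therefore takes six bytes in CESU-8 instead of the four it takes in UTF-8, while everything in the Basic Multilingual Plane is byte-for-byte identical.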
Diffstat (limited to 'mjit_worker.c')
0 files changed, 0 insertions, 0 deletions