Forum: MemoQ support
Topic: Error in coding of TM in TMX format from memoQ
Poster: esperantisto
Post title: Depends
[quote]Kevin Dias wrote:
The file size of UTF-16 files ends up being about double that of UTF-8. [/quote]
That is generally incorrect. It only happens for Western-European languages and for Eastern-European ones using Latin script. For Cyrillic or Greek, virtually no difference as the characters are two-byte in any case (or, well, some characters are three-byte for non-Slavic Cyrillic alphabets), and for the Far-Eastern scripts UTF-8 (with three bytes per character) means even bigger file sizes as compared to UTF-16 (two bytes).
But the file size is not very important, because the decoded data take the same memory anyway.
Topic: Error in coding of TM in TMX format from memoQ
Poster: esperantisto
Post title: Depends
[quote]Kevin Dias wrote:
The file size of UTF-16 files ends up being about double that of UTF-8. [/quote]
That is generally incorrect. It only happens for Western-European languages and for Eastern-European ones using Latin script. For Cyrillic or Greek, virtually no difference as the characters are two-byte in any case (or, well, some characters are three-byte for non-Slavic Cyrillic alphabets), and for the Far-Eastern scripts UTF-8 (with three bytes per character) means even bigger file sizes as compared to UTF-16 (two bytes).
But the file size is not very important, because the decoded data take the same memory anyway.