Quantcast
Channel: ProZ.com Translation Forums
Viewing all articles
Browse latest Browse all 7927

Error in coding of TM in TMX format from memoQ | Depends

$
0
0
Forum: MemoQ support
Topic: Error in coding of TM in TMX format from memoQ
Poster: esperantisto
Post title: Depends

[quote]Kevin Dias wrote:

The file size of UTF-16 files ends up being about double that of UTF-8. [/quote]

That is generally incorrect. It only happens for Western-European languages and for Eastern-European ones using Latin script. For Cyrillic or Greek, virtually no difference as the characters are two-byte in any case (or, well, some characters are three-byte for non-Slavic Cyrillic alphabets), and for the Far-Eastern scripts UTF-8 (with three bytes per character) means even bigger file sizes as compared to UTF-16 (two bytes).

But the file size is not very important, because the decoded data take the same memory anyway.

Viewing all articles
Browse latest Browse all 7927

Trending Articles