To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??m}??m{^ | 001111110011111101101101011111010011111100111111011011010111101101011110 | 3f3f6d7d3f3f6d7b5e |
SJIS-WIN | 面?m}面?m{^ | 1001011011001010001111110110110101111101100101101100101000111111011011010111101101011110 | 96ca3f6d7d96ca3f6d7b5e |
EUC-JP | 面?m}面?m{^ | 1100110011001100001111110110110101111101110011001100110000111111011011010111101101011110 | cccc3f6d7dcccc3f6d7b5e |
UTF-8 | 面젱m}面젱m{^ | 1110100110011101101000101110110010100000101100010110110101111101111010011001110110100010111011001010000010110001011011010111101101011110 | e99da2eca0b16d7de99da2eca0b16d7b5e |
UHC | 面젱m}面젱m{^ | 11011000111111001100000110101101011011010111110111011000111111001100000110101101011011010111101101011110 | d8fcc1ad6d7dd8fcc1ad6d7b5e |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)