Which bit string is a character (or string of characters) encoded to in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ¹ã\|[¹ã\|[^ | 1001111110111001111000110111110001011011100111111011100111100011011111000101101101011110 | 9fb9e37c5b9fb9e37c5b5e |
SJIS-WIN | ???\|[???\|[^ | 0011111100111111001111110111110001011011001111110011111100111111011111000101101101011110 | 3f3f3f7c5b3f3f3f7c5b5e |
EUC-JP | ??ã\|[??ã\|[^ | 001111110011111110001111101010111010101001111100010110110011111100111111100011111010101110101010011111000101101101011110 | 3f3f8fabaa7c5b3f3f8fabaa7c5b5e |
UTF-8 | ¹ã\|[¹ã\|[^ | 1100001010011111110000101011100111000011101000110111110001011011110000101001111111000010101110011100001110100011011111000101101101011110 | c29fc2b9c3a37c5bc29fc2b9c3a37c5b5e |
UHC | ?¹?\|[?¹?\|[^ | 00111111101010011111011000111111011111000101101100111111101010011111011000111111011111000101101101011110 | 3fa9f63f7c5b3fa9f63f7c5b5e |
SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
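
As a rough illustration of how a table like the one above can be produced, the following Python sketch encodes a string with each charset and prints the binary and hexadecimal forms. The Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC), the helper names `encode_row` and `build_table`, and the use of `errors="replace"` to substitute `?` (0x3f) for characters a charset cannot represent are all assumptions of this sketch. They are not necessarily how the converter on this page works, so individual bytes may differ from the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # Characters the codec cannot represent are replaced with "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def build_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    # U+009F (a control character), U+00B9 and U+00E3, then "|[", twice, then "^".
    sample = "\u009f\u00b9\u00e3|[" * 2 + "^"
    for label, (bits, hex_string) in build_table(sample).items():
        print(f"{label} | {bits} | {hex_string}")
```

The ASCII characters `|`, `[` and `^` come out as `7c`, `5b` and `5e` in every charset listed here, which is why the rows of the table share the same tail. Only the non-ASCII characters differ between charsets or fall back to `3f`.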