To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??{??s?? | 0011111100111111011110110011111100111111011100110011111100111111 | 3f3f7b3f3f733f3f |
SJIS-WIN | 霎ソ{蜿ゥs遶ェ | 1110100010111110101111110111101111100101100011111010100101110011111001111010101110101010 | e8bebf7be58fa973e7abaa |
EUC-JP | 霎ソ{蜿ゥs遶ェ | 1111000011000000100011101011111101111011111010011110111110001110101010010111001111101110101011011000111010101010 | f0c08ebf7be9ef8ea973eead8eaa |
UTF-8 | 霎ソ{蜿ゥs遶ェ | 1110100110011100100011101110111110111101101111110111101111101000100111001011111111101111101111011010100101110011111010011000000110110110111011111011110110101010 | e99c8eefbdbf7be89cbfefbda973e981b6efbdaa |
UHC | ??{??s?? | 0011111100111111011110110011111100111111011100110011111100111111 | 3f3f7b3f3f733f3f |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)