To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??????k | 00111111001111110011111100111111001111110011111101101011 | 3f3f3f3f3f3f6b |
SJIS-WIN | 褓耶ーュ矼キk | 11100101111011101001011011101011101100001010110111100001111000111011011101101011 | e5ee96ebb0ade1e3b76b |
EUC-JP | 褓耶ーュ矼キk | 11101010111100001100110011101101100011101011000010001110101011011110001011100101100011101011011101101011 | eaf0cced8eb08eade2e58eb76b |
UTF-8 | 褓耶ーュ矼キk | 11101000101001001001001111101000100000001011011011101111101111011011000011101111101111011010110111100111100111111011110011101111101111011011011101101011 | e8a493e880b6efbdb0efbdade79fbcefbdb76b |
UHC | 褓耶????k | 110111001100111011100101101011010011111100111111001111110011111101101011 | dccee5ad3f3f3f3f6b |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)