To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ?P????] | 00111111010100000011111100111111001111110011111101011101 | 3f503f3f3f3f5d |
SJIS-WIN | ?P泄?矮?] | 001111110101000010011111100101010011111111100001111000100011111101011101 | 3f509f953fe1e23f5d |
EUC-JP | ?P泄?矮?] | 001111110101000011011101111101010011111111100010111001000011111101011101 | 3f50ddf53fe2e43f5d |
UTF-8 | 숷P泄숷矮了] | 1110110010001000101101110101000011100110101100111000010011101100100010001011011111100111100111111010111011101111101001101011101001011101 | ec88b750e6b384ec88b7e79faeefa6ba5d |
UHC | 숷P泄숷矮了] | 100110100100110001010000111000001101110010011010010011001110100011100001111010001110011101011101 | 9a4c50e0dc9a4ce8e1e8e75d |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)