What bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鬱葛榧???葛榧?? 100111110101010010001010100010111001111011010000001111110011111100111111100010101000101110011110110100000011111100111111 9f548a8b9ed03f3f3f8a8b9ed03f3f
EUC-JP 鬱葛榧???葛榧?? 110111011011010110110011111010111101110011010010001111110011111100111111101100111110101111011100110100100011111100111111 ddb5b3ebdcd23f3f3fb3ebdcd23f3f
UTF-8 鬱葛榧炡며렎葛榧炡랜 111010011010110010110001111010001001000110011011111001101010011010100111111001111000001010100001111010111010100110110000111010111010000010001110111010001001000110011011111001101010011010100111111001111000001010100001111010111001111010011100 e9acb1e8919be6a6a7e782a1eba9b0eba08ee8919be6a6a7e782a1eb9e9c
UHC 鬱葛榧炡며렎葛榧炡랜 1110101010100110110010101110011111011101111011101110111111101000101110001110011110001110101001001100101011100111110111011110111011101111111010001011011110100011 eaa6cae7ddeeefe8b8e78ea4cae7ddeeefe8b7a3
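
The rows above can be approximated with a short script. This is a minimal sketch, not the page's actual implementation: the function name `encode_table` and the mapping of the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the table.

```python
# Hypothetical mapping from the page's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),   # assumption: SJIS-win corresponds to CP932
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),        # assumption: UHC corresponds to CP949
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, shown_text, binary, hex) for each charset."""
    rows = []
    for label, codec in charsets:
        # Unencodable characters become "?" instead of raising an error.
        raw = text.encode(codec, errors="replace")
        shown = raw.decode(codec)  # what survives the round trip
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, shown, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, binary, hexa in encode_table("鬱"):
        print(label, shown, binary, hexa)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.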

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).