What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 懊??怏??音??B 10011100111000110011111100111111100111001000100100111111001111111000100110111001001111110011111101000010 9ce33f3f9c893f3f89b93f3f42
EUC-JP 懊??怏??音??B 11011000111001010011111100111111110101111110100100111111001111111011001010111011001111110011111101000010 d8e53f3fd7e93f3fb2bb3f3f42
UTF-8 懊볦눥怏잒슀音쇔뒣B 11100110100001111000101011101011101100111010011011101011100010001010010111100110100000001000111111101100100111101001001011101100100010101000000011101001100111111011001111101100100001111001010011101011100100101010001101000010 e6878aebb3a6eb88a5e6808fec9e92ec8a80e99fb3ec8794eb92a342
UHC 懊볦눥怏잒슀音쇔뒣B 11100111111110001001001111101100100001111011110011100100111010001001111111101000100110101001001111101011111001011011110011100101100010101001111101000010 e7f893ec87bce4e89fe89a93ebe5bce58a9f42

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
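
The conversion shown in the table can be approximated with a short Python sketch. It is not the page's actual implementation: the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper names `to_bits` and `encode_report` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which mirrors the `3f` bytes in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(data: bytes) -> str:
    """Render each byte as eight binary digits and join them."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_report(text: str) -> dict:
    """Encode text in every charset; return {label: (bit string, hex string)}."""
    report = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        report[label] = (to_bits(raw), raw.hex())
    return report


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_report("懊B").items():
        print(label, bits, hexstr)
```

For example, `encode_report("懊B")["ISO-8859-1"]` yields the hex string `3f42`, because ISO-8859-1 has no code for 懊 and the character falls back to `?`.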