What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 弱??張??言⑧?B 1000111011100011001111110011111110010010101000110011111100111111100011001011111010000111010001110011111101000010 8ee33f3f92a33f3f8cbe87473f42
EUC-JP 弱??張??言??B 10111100111001010011111100111111110001001010010100111111001111111011100011000000001111110011111101000010 bce53f3fc4a53f3fb8c03f3f42
UTF-8 弱꾣뜥張멨컙言⑧눡B 11100101101111001011000111101010101111101010001111101011100111001010010111100101101111001011010111101011101010011010100011101100101110111001100111101000101010001000000011100010100100011010011111101011100010001010000101000010 e5bcb1eabea3eb9ca5e5bcb5eba9a8ecbb99e8a880e291a7eb88a142
UHC 弱꾣뜥張멨컙言⑧눡B 11100101101100001000010011100110100011011010100011101101111001011011100011100101101100001000010011100101111010111010100011101110100001111011100001000010 e5b084e68da8ede5b8e5b084e5eba8ee87b842

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
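
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter this page actually runs. The function name `encode_table` is hypothetical, and the mapping of the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes visible in the rows above.

```python
# Sketch: encode a string in several charsets and show the resulting bytes
# as a binary bit string and as hexadecimal.
# Assumed label-to-codec mapping: SJIS-WIN -> cp932, UHC -> cp949.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    """Return a list of (charset, bit_string, hex_string) tuples for text."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for unencodable characters
        raw = text.encode(name, errors="replace")
        # Each byte becomes exactly 8 binary digits, most significant bit first
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexed in encode_table("張B"):
        print(name, bits, hexed)
```

For the two-character input "張B", the CP932, EUC-JP and UTF-8 results contain the same byte sequences for 張 (92a3, c4a5, e5bcb5) that appear in the corresponding rows of the table, followed by 42 for "B".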