To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???預??鸚??外 00111111001111110011111110010111011000010011111100111111111010100101111100111111001111111000101001001111 3f3f3f97613f3fea5f3f3f8a4f
EUC-JP 邕??預??鸚??外 100011111110000111101101001111110011111111001101110000100011111100111111111100111100000000111111001111111011001110110000 8fe1ed3f3fcdc23f3ff3c03f3fb3b0
UTF-8 邕멨ㅁ預앶썚鸚김뒡外 111010011000001010010101111010111010100110101000111000111000010110000001111010011010000010010000111011001001010110110110111011001000110110011010111010011011100010011010111010101011100110000000111010111001001010100001111001011010010010010110 e98295eba9a8e38581e9a090ec95b6ec8d9ae9b89aeab980eb92a1e5a496
UHC 邕멨ㅁ預앶썚鸚김뒡外 1110100010111011101110001110010110100100101100011110011111101000100111011110100110011011100011011110010110100100101100011110100010001010100111011110100011100010 e8bbb8e5a4b1e7e89de99b8de5a4b1e88a9de8e2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)