What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嚥????????曄??恂??倚??隱??鉛 10011010100010110011111100111111001111110011111100111111001111110011111100111111100111100100000000111111001111111001110010010110001111110011111110011000110111110011111100111111111010001010101000111111001111111000100110010100 9a8b3f3f3f3f3f3f3f3f9e403f3f9c963f3f98df3f3fe8aa3f3f8994
EUC-JP 嚥????????曄??恂??倚??隱??鉛 11010011111010110011111100111111001111110011111100111111001111110011111100111111110110111010000100111111001111111101011111110110001111110011111111010000111000010011111100111111111100001010110000111111001111111011000111110100 d3eb3f3f3f3f3f3f3f3fdba13f3fd7f63f3fd0e13f3ff0ac3f3fb1f4
UTF-8 嚥잒찎溜긱끆琉뗨썔曄먬썿恂욇룂倚덂떀隱⑴럫鉛 111001011001101010100101111011001001111010010010111011001011000010001110111011111010011110001011111010101011100010110001111010111000000110000110111011111010011110001100111010111001011110101000111011001000110110010100111001101001101110000100111010111010100010101100111011001000110110111111111001101000000110000010111011001001101010000111111010111010001110000010111001011000000010011010111010111000110110000010111010111001011010000000111010011001101010110001111000101001000110110100111010111001111110101011111010011000100110011011 e59aa5ec9e92ecb08eefa78beab8b1eb8186efa78ceb97a8ec8d94e69b84eba8acec8dbfe68182ec9a87eba382e5809aeb8d82eb9680e99ab1e291b4eb9fabe9899b
UHC 嚥잒찎溜긱끆琉뗨썔曄먬썿恂욇룂倚덂떀隱⑴럫鉛 1110011010111111100111111110100010101001100100001110101011111110101100011110001110000101101110101110101110100100100010111110100010011011100001111110011110100101100100001110100110011011101010011110001011100001100111101110100110001111100000111110101111101111100010001110010110001011100101101110101111011111101010011110011110001110100011101110011011100111 e6bf9fe8a990eafeb1e385baeba48be89b87e7a590e99ba9e2e19ee98f83ebef88e58b96ebdfa9e78e8ee6e7

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
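
The kind of conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's `cp932`, `euc_jp`, and `cp949` codecs are close enough stand-ins for SJIS-Win, EUC-JP, and UHC (individual mappings may differ slightly), and that unencodable characters are replaced with "?" (0x3f), as in the ISO-8859-1 row above. The names `CHARSETS` and `encode_table` are made up for illustration.

```python
# Map the labels used in the table to Python codec names.
# Assumption: these codecs approximate the charsets the page uses.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_table("あ"):
        print(label, binary, hexadecimal)
```

Calling `encode_table` with any short string gives one row per charset, with the binary column built by writing each encoded byte as eight bits.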