What bit string is a character (or a sequence of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 筌??筍?筌??筍?B 111000101010001100111111001111111110001010100001001111111110001010100011001111110011111111100010101000010011111101000010 e2a33f3fe2a13fe2a33f3fe2a13f42
EUC-JP 筌™?筍?筌™?筍?B 11100100101001011000111110100010111011110011111111100100101000110011111111100100101001011000111110100010111011110011111111100100101000110011111101000010 e4a58fa2ef3fe4a33fe4a58fa2ef3fe4a33f42
UTF-8 筌™뫁筍뻟筌™뫁筍뻟B 11100111101011011000110011100010100001001010001011101011101010111000000111100111101011011000110111101011101110111001111111100111101011011000110011100010100001001010001011101011101010111000000111100111101011011000110111101011101110111001111101000010 e7ad8ce284a2ebab81e7ad8debbb9fe7ad8ce284a2ebab81e7ad8debbb9f42
UHC 筌™뫁筍뻟筌™뫁筍뻟B 111011111010011110100010111000101001000110100101111000101110110010010110011010011110111110100111101000101110001010010001101001011110001011101100100101100110100101000010 efa7a2e291a5e2ec9669efa7a2e291a5e2ec966942

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
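
The conversion performed by this page can be approximated in a few lines of Python. This is only a minimal sketch, not the code behind this tool: the helper names (to_bit_strings, convert) are hypothetical, and the mapping from the charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) is an assumption that may differ from this tool in edge cases. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("B").items():
        print(label, binary, hexadecimal)
```

For the ASCII letter "B", every charset listed here yields the same single byte (01000010, hexadecimal 42), which is why each row of the table ends in 42. The rows differ only in how they encode the non-ASCII characters.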