To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????????? | 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f3f3f |
| SJIS-WIN | 陷ゥ莠・迚定惓莠・迚耽 | 11101000100111001010100111100100101110101010010111100111100010011001001011101000100111001010100111100100101110101010010111100111100010011001001001011110 | e89ca9e4baa5e78992e89ca9e4baa5e789925e |
| EUC-JP | 陷ゥ莠・迚定惓莠・迚耽 | 11101111111111001000111010101001111010001011110010001110101001011110110111101001110001001110101011011000101010111110100010111100100011101010010111101101111010011100001110111111 | effc8ea9e8bc8ea5ede9c4ead8abe8bc8ea5ede9c3bf |
| UTF-8 | 陷ゥ莠・迚定惓莠・迚耽 | 111010011001100110110111111011111011110110101001111010001000111010100000111011111011110110100101111010001011111110011010111001011010111010011010111001101000001110010011111010001000111010100000111011111011110110100101111010001011111110011010111010001000000010111101 | e999b7efbda9e88ea0efbda5e8bf9ae5ae9ae68393e88ea0efbda5e8bf9ae880bd |
| UHC | 陷????定????耽 | 1111100111101000001111110011111100111111001111111110111111010010001111110011111100111111001111111111011110110000 | f9e83f3f3f3fefd23f3f3f3ff7b0 |

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
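
The conversion shown in the table can be approximated offline with a short script. The following is a minimal sketch in Python, not the code behind this page: the `CODECS` mapping and the helper names `to_bit_strings` and `convert` are assumptions made for illustration, and Python's codecs may not produce byte-for-byte the same output as this tool for every character. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Assumed mapping from the charset labels used in the table to Python codec
# names (SJIS-Win = CP932 as noted above; UHC is taken to be CP949).
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CODECS."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

Calling `convert` with any short string gives one binary and one hexadecimal bit string per charset, in the same column order as the table.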