Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN ?蕎悼?蕎悼^ 0011111110001011101111001001001110001001001111111000101110111100100100111000100101011110 3f8bbc93893f8bbc93895e
EUC-JP 邕蕎悼邕蕎悼^ 100011111110000111101101101101101011111011000101111010011000111111100001111011011011011010111110110001011110100101011110 8fe1edb6bec5e98fe1edb6bec5e95e
UTF-8 邕蕎悼邕蕎悼^ 11101001100000101001010111101000100101011000111011100110100000101011110011101001100000101001010111101000100101011000111011100110100000101011110001011110 e98295e8958ee682bce98295e8958ee682bc5e
UHC 邕蕎悼邕蕎悼^ 11101000101110111100111011110000110100111111101011101000101110111100111011110000110100111111101001011110 e8bbcef0d3fae8bbcef0d3fa5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
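
The table above can be approximated offline with a short Python sketch. This is not the converter's actual implementation. The names `CHARSETS`, `encode_row`, and `convert` are hypothetical, and mapping each charset label to a Python codec (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which mirrors the ISO-8859-1 row, but results for some characters may differ from the table because codec tables vary between implementations.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# These pairings are assumptions; the converter may use different tables.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in convert("邕蕎悼邕蕎悼^").items():
        print(name, bits, hex_string)
```

Each byte is formatted as eight binary digits so that leading zeros are kept, which is why "^" (0x5e) appears as 01011110 rather than 1011110.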