To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 竪奪其誰遜蔵竪族 10010010010001111001001001000100100100011011010010010010010011101001000110111011100100011010000010010010010001111001000110110000 9247924491b4924e91bb91a0924791b0
EUC-JP 竪奪其誰遜蔵竪族 11000011101010001100001110100101110000101011011011000011101011111100001010111101110000101010001011000011101010001100001010110010 c3a8c3a5c2b6c3afc2bdc2a2c3a8c2b2
UTF-8 竪奪其誰遜蔵竪族 111001111010101110101010111001011010010110101010111001011000010110110110111010001010101010110000111010011000000110011100111010001001010010110101111001111010101110101010111001101001011110001111 e7abaae5a5aae585b6e8aab0e9819ce894b5e7abaae6978f
UHC 竪奪其誰遜?竪族 111000101011010111110111101011001101000011101100111000101100000111100001111000010011111111100010101101011111000011101001 e2b5f7acd0ece2c1e1e13fe2b5f0e9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)