To what bit string is a character (or string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f |
| SJIS-WIN | 鱈奪其誰遜他竪達 | 10010010010011001001001001000100100100011011010010010010010011101001000110111011100100011011110010010010010001111001001001000010 | 924c924491b4924e91bb91bc92479242 |
| EUC-JP | 鱈奪其誰遜他竪達 | 11000011101011011100001110100101110000101011011011000011101011111100001010111101110000101011111011000011101010001100001110100011 | c3adc3a5c2b6c3afc2bdc2bec3a8c3a3 |
| UTF-8 | 鱈奪其誰遜他竪達 | 111010011011000110001000111001011010010110101010111001011000010110110110111010001010101010110000111010011000000110011100111001001011101110010110111001111010101110101010111010011000000110010100 | e9b188e5a5aae585b6e8aab0e9819ce4bb96e7abaae98194 |
| UHC | ?奪其誰遜他竪達 | 001111111111011110101100110100001110110011100010110000011110000111100001111101101110001011100010101101011101001110111001 | 3ff7acd0ece2c1e1e1f6e2e2b5d3b9 |

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
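
The conversion shown in the table can be reproduced roughly as follows. This is a minimal sketch in Python, not the code behind this page. The helper name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `?` entries in the table.

```python
def to_bit_strings(text, charset):
    """Return (binary, hexadecimal) bit strings for `text` encoded in `charset`."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset lacks.
    raw = text.encode(charset, errors="replace")
    # Render every byte as 8 binary digits, keeping leading zeros.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


# Assumed correspondence between the table's labels and Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("鱈", codec)
        print(label, binary, hexadecimal)
```

Codec tables can differ slightly between implementations, so the output for a given character may not match another converter byte for byte.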