What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 竪端束誰遜俗奪遜造竪 1001001001000111100100100101101110010001101010011001001001001110100100011011101110010001101011011001001001000100100100011011101110010001101000101001001001000111 9247925b91a9924e91bb91ad924491bb91a29247
EUC-JP 竪端束誰遜俗奪遜造竪 1100001110101000110000111011110011000010101010111100001110101111110000101011110111000010101011111100001110100101110000101011110111000010101001001100001110101000 c3a8c3bcc2abc3afc2bdc2afc3a5c2bdc2a4c3a8
UTF-8 竪端束誰遜俗奪遜造竪 111001111010101110101010111001111010101110101111111001101001110110011111111010001010101010110000111010011000000110011100111001001011111110010111111001011010010110101010111010011000000110011100111010011000000010100000111001111010101110101010 e7abaae7abafe69d9fe8aab0e9819ce4bf97e5a5aae9819ce980a0e7abaa
UHC 竪端束誰遜俗奪遜造竪 1110001010110101110100111010111011100001110101101110001011000001111000011110000111100001110101001111011110101100111000011110000111110000111000111110001010110101 e2b5d3aee1d6e2c1e1e1e1d4f7ace1e1f0e3e2b5

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
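
The rows in the table can be reproduced offline with a short script. The following is a minimal sketch in Python, not the code behind this page: the helper name encode_row and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is the behaviour the ISO-8859-1 row suggests.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become "?" instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Print one row per charset for a single sample character.
sample = "竪"
for label, codec in CHARSETS.items():
    binary, hexadecimal = encode_row(sample, codec)
    print(label, sample, binary, hexadecimal)
```

For the single character 竪, the UTF-8 row of this sketch yields the hexadecimal string e7abaa, which matches the first three bytes of the UTF-8 row in the table above.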