To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????h 00111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f68
SJIS-WIN 竪揃束誰遜賊h 10010010010001111001000110110101100100011010100110010010010011101001000110111011100100011010111101101000 924791b591a9924e91bb91af68
EUC-JP 竪揃束誰遜賊h 11000011101010001100001010110111110000101010101111000011101011111100001010111101110000101011000101101000 c3a8c2b7c2abc3afc2bdc2b168
UTF-8 竪揃束誰遜賊h 11100111101010111010101011100110100011111000001111100110100111011001111111101000101010101011000011101001100000011001110011101000101100111000101001101000 e7abaae68f83e69d9fe8aab0e9819ce8b38a68
UHC 竪?束誰遜賊h 111000101011010100111111111000011101011011100010110000011110000111100001111011101110010001101000 e2b53fe1d6e2c1e1e1eee468

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)