To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 阡篇崎・イ關ャ 1110100010010100100101011101000110001101111010001010010110110010111010001001000010101100 e89495d18de8a5b2e890ac
EUC-JP 阡篇崎・イ關ャ 1110111111110100110010101101001110111010111010101000111010100101100011101011001011101111111100001000111010101100 eff4cad3baea8ea58eb2eff08eac
UTF-8 阡篇崎・イ關ャ 111010011001100010100001111001111010111110000111111001011011010010001110111011111011110110100101111011111011110110110010111010011001011110011100111011111011110110101100 e998a1e7af87e5b48eefbda5efbdb2e9979cefbdac
UHC 阡篇崎??關? 1111010011000110111110001011100111010000111110000011111100111111110011101011110000111111 f4c6f8b9d0f83f3fcebc3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)