To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 治鴉゙闔イ室 111100011111100010001110101000011110100111101011110111101110100010001110101100101000111010111010 f1f88ea1e9ebdee88eb28eba
EUC-JP ?治鴉゙闔イ室 00111111101111001010001111110010111011011000111011011110111011111110111010001110101100101011110010111100 3fbca3f2ed8edeefee8eb2bcbc
UTF-8 治鴉゙闔イ室 111011101000010110110011111001101011001010111011111010011011010010001001111011111011111010011110111010011001011110010100111011111011110110110010111001011010111010100100 ee85b3e6b2bbe9b489efbe9ee99794efbdb2e5aea4
UHC ?治鴉?闔?室 0011111111110110101111011110010010111100001111111111100111101111001111111110001111111000 3ff6bde4bc3ff9ef3fe3f8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)