To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | ???已??幽 | 001111110011111100111111100110111101111100111111001111111001011101001000 | 3f3f3f9bdf3f3f9748
EUC-JP | 艅??已??幽 | 1000111111010110111111010011111100111111110101101110000100111111001111111100110110101001 | 8fd6fd3f3fd6e13f3fcda9
UTF-8 | 艅덈뛼已꿩찄幽 | 111010001000100110000101111010111000110110001000111010111001101110111100111001011011011110110010111010101011111110101001111011001011000010000100111001011011100110111101 | e88985eb8d88eb9bbce5b7b2eabfa9ecb084e5b9bd
UHC | 艅덈뛼已꿩찄幽 | 1110011010101001100010001110101110001101100000101110110010101011101100101110011010101001100010001110101011101011 | e6a988eb8d82ecabb2e6a988eaeb

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
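
A conversion of this kind can be approximated offline with a few lines of Python. The following is only a minimal sketch, not the code behind this page: the helper names `encode_row` and `convert` are hypothetical, and the mapping from the charset labels in the table to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Python's mapping tables may differ from the ones this page uses, so the output for rare characters may not match the table exactly. Characters that a charset cannot represent are replaced with "?" (0x3f).

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("あ").items():
        print(label, bits, hex_string)
```

For instance, `encode_row("あ", "utf-8")` returns `("111000111000000110000010", "e38182")`, while `encode_row("あ", "iso-8859-1")` returns `("00111111", "3f")` because ISO-8859-1 has no code for that character.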