To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)

| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f |
| SJIS-WIN | 陟「譽庭渇迴・ | 111010001010000010100010111001101010001110010010111010111000101010001001111001111000111110100101 | e8a0a2e6a392eb8a89e78fa5 |
| EUC-JP | 陟「譽庭渇迴・ | 1111000010100010100011101010001011101100101001011100010011101101101100111110100111101101111011111000111010100101 | f0a28ea2eca5c4edb3e9edef8ea5 |
| UTF-8 | 陟「譽庭渇迴・ | 111010011001100110011111111011111011110110100010111010001010110110111101111001011011101010101101111001101011100010000111111010001011111110110100111011111011110110100101 | e9999fefbda2e8adbde5baade6b887e8bfb4efbda5 |
| UHC | 陟?譽庭??? | 11110100101100110011111111100111111000101110111111010100001111110011111100111111 | f4b33fe7e2efd43f3f3f |

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-win = CP932) and on UNIX (EUC-JP) respectively.
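
A conversion like the one in the table can be approximated with a short Python sketch. This is not the tool's own implementation, which is not shown on this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-win, `cp949` for UHC, and so on) is an assumption, as are the helper names `encode_row` and `convert`. Characters that a charset cannot represent are replaced with `?` (0x3f), as in the ISO-8859-1 row. Python's codec tables may differ from the tool's for some characters, so the output is not guaranteed to match the table byte for byte.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?'.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("あ").items():
        print(label, bits, hex_string)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.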