To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 鳥戡??第賂??爭 1001001010111001100111010100000100111111001111111001000111100110100110000100011100111111001111111110000010100101 92b99d413f3f91e698473f3fe0a5
EUC-JP 鳥戡??第賂??爭 1100010010111011110110011010001000111111001111111100001011101000110011111010100000111111001111111110000010100111 c4bbd9a23f3fc2e8cfa83f3fe0a7
UTF-8 鳥戡렰렦第賂렰렠爭 111010011011001110100101111001101000100010100001111010111010000010110000111010111010000010100110111001111010110010101100111010001011001110000010111010111010000010110000111010111010000010100000111001111000100010101101 e9b3a5e688a1eba0b0eba0a6e7acace8b382eba0b0eba0a0e788ad
UHC 鳥戡렰렦第賂렰렠爭 111100001110100011001010111100011000111010111101100011101011010111110000101011111101011011110001100011101011110110001110101100011110111010110011 f0e8caf18ebd8eb5f0afd6f18ebd8eb1eeb3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)