To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 題絲?企?絲?臼 10010001111010001110001101001110001111111000101011101001001111111110001101001110001111111000100101010000 91e8e34e3f8ae93fe34e3f8950
EUC-JP 題絲?企?絲?臼 11000010111010101110010110101111001111111011010011101011001111111110010110101111001111111011000110110001 c2eae5af3fb4eb3fe5af3fb1b1
UTF-8 題絲렡企렩絲렡臼 111010011010000110001100111001111011010110110010111010111010000010100001111001001011110010000001111010111010000010101001111001111011010110110010111010111010000010100001111010001000011110111100 e9a18ce7b5b2eba0a1e4bc81eba0a9e7b5b2eba0a1e887bc
UHC 題絲렡企렩絲렡臼 11110000101110011101111011101010100011101011001011010000111010101000111010110111110111101110101010001110101100101100111110111111 f0b9deea8eb2d0ea8eb7deea8eb2cfbf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)