To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 綜???蹄???祭? 10010001100011100011111100111111001111111001001011111011001111110011111100111111100011011101010100111111 918e3f3f3f92fb3f3f3f8dd53f
EUC-JP 綜???蹄???祭? 11000001111011100011111100111111001111111100010011111101001111110011111100111111101110101101011100111111 c1ee3f3f3fc4fd3f3f3fbad73f
UTF-8 綜숄렰렲蹄ㆁ렰렗祭렢 111001111011011010011100111011001000100010000100111010111010000010110000111010111010000010110010111010001011100110000100111000111000011010000001111010111010000010110000111010111010000010010111111001111010010110101101111010111010000010100010 e7b69cec8884eba0b0eba0b2e8b984e38681eba0b0eba097e7a5adeba0a2
UHC 綜숄렰렲蹄ㆁ렰렗祭렢 1111000011111100101111001111000110001110101111011000111010111111111100001011010010100100111100011000111010111101100011101010110011110000101011101000111010110011 f0fcbcf18ebd8ebff0b4a4f18ebd8eacf0ae8eb3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)