To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 莊?雨?祭?? 11100100101101010011111110001001010010100011111110001101110101010011111100111111 e4b53f894a3f8dd53f3f
EUC-JP 莊?雨?祭泮? 111010001011011100111111101100011010101100111111101110101101011110001111110001111010100000111111 e8b73fb1ab3fbad78fc7a83f
UTF-8 莊렱雨렭祭泮짬 111010001000111010001010111010111010000010110001111010011001101110101000111010111010000010101101111001111010010110101101111001101011001110101110111011001010011110101100 e88e8aeba0b1e99ba8eba0ade7a5ade6b3aeeca7ac
UHC 莊렱雨렭祭泮짬 1110110111110110100011101011111011101001111010111000111010111010111100001010111011011010111010101100001010101011 edf68ebee9eb8ebaf0aedaeac2ab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)