What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 枝賂??頂頃?? 100011100111110110011000010001110011111100111111100100101011100010001101101000000011111100111111 8e7d98473f3f92b88da03f3f
EUC-JP 枝賂??頂頃?? 101110111101111011001111101010000011111100111111110001001011101010111010101000100011111100111111 bbdecfa83f3fc4babaa23f3f
UTF-8 枝賂렰렋頂頃렰렑 111001101001111010011101111010001011001110000010111010111010000010110000111010111010000010001011111010011010000010000010111010011010000010000011111010111010000010110000111010111010000010010001 e69e9de8b382eba0b0eba08be9a082e9a083eba0b0eba091
UHC 枝賂렰렋頂頃렰렑 11110010101010111101011011110001100011101011110110001110101000101111000010100010110011001111000110001110101111011000111010100110 f2abd6f18ebd8ea2f0a2ccf18ebd8ea6

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table above.
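
The conversion in the table can be approximated offline with a minimal Python sketch. It is not the code behind this page. The function name `encode_table` is hypothetical, and the mapping of the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC, `latin-1` for ISO-8859-1) is an assumption. Python's codecs may differ from the page's converter for some edge-case characters.

```python
# Sketch only: codec names are assumed equivalents of the page's charset labels.
CHARSETS = ("latin-1", "cp932", "euc_jp", "utf-8", "cp949")

def encode_table(text, charsets=CHARSETS):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" (0x3f) for unencodable characters,
        # which mirrors the "?" entries shown in the table.
        raw = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((name, bits, raw.hex()))
    return rows

if __name__ == "__main__":
    for name, bits, hex_string in encode_table("枝"):
        print(name, bits, hex_string)
```

Each byte is printed as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.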