To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 腫?而?雕??? 1000111011101110001111111000111010100111001111111110100010111000001111110011111100111111 8eee3f8ea73fe8b83f3f3f
EUC-JP 腫?而?雕??? 1011110011110000001111111011110010101001001111111111000010111010001111110011111100111111 bcf03fbca93ff0ba3f3f3f
UTF-8 腫렣而렲雕계렫렲 111010001000010110101011111010111010000010100011111010001000000010001100111010111010000010110010111010011001101110010101111010101011001110000100111010111010000010101011111010111010000010110010 e885abeba0a3e8808ceba0b2e99b95eab384eba0abeba0b2
UHC 腫렣而렲雕계렫렲 11110000111111101000111010110100111011001011101110001110101111111111000011100111101100001110100010001110101110011000111010111111 f0fe8eb4ecbb8ebff0e7b0e88eb98ebf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)