To what bit string is a character (or a short run of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???z???zB 001111110011111100111111011110100011111100111111001111110111101001000010 3f3f3f7a3f3f3f7a42
SJIS-WIN ??被z??被zB 0011111100111111100101001110110101111010001111110011111110010100111011010111101001000010 3f3f94ed7a3f3f94ed7a42
EUC-JP ??被z??被zB 0011111100111111110010001110111101111010001111110011111111001000111011110111101001000010 3f3fc8ef7a3f3fc8ef7a42
UTF-8 𲎿찴被z𲎿찴被zB 1111000010110010100011101011111111101100101100001011010011101000101000101010101101111010111100001011001010001110101111111110110010110000101101001110100010100010101010110111101001000010 f0b28ebfecb0b4e8a2ab7af0b28ebfecb0b4e8a2ab7a42
UHC ?찴被z?찴被zB 00111111101010100100101011111001101011000111101000111111101010100100101011111001101011000111101001000010 3faa4af9ac7a3faa4af9ac7a42

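SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text: SJIS-Win (= CP932) on Windows and EUC-JP on UNIX. A "?" (hexadecimal 3f) in the Character column marks a character that the charset cannot represent.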
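
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the names CHARSETS, to_bit_strings and conversion_table are hypothetical, and the mapping of the labels SJIS-WIN and UHC to Python's cp932 and cp949 codecs is an assumption. Characters a charset cannot represent are replaced with "?", which appears to match the 3f bytes in the table.

```python
# Hypothetical mapping from the charset labels used on this page to
# Python codec names. cp932 for SJIS-WIN follows the note above;
# cp949 for UHC is an assumption.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string).

    Unencodable characters are replaced with "?" (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    # Render each byte as 8 binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def conversion_table(text):
    """Return one (label, binary, hex) row per charset in CHARSETS."""
    rows = []
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(text, codec)
        rows.append((label, binary, hexadecimal))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in conversion_table("被z"):
        print(label, binary, hexadecimal)
```

Because replacement happens per character, a charset that lacks a character yields a single 3f byte for it, while the bytes of the remaining characters are unchanged.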