To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 陷溢。ソ寘懷スヲ 111010001001110010001000111011001010000110111111111110101010100110011100111001011011110110100110 e89c88eca1bffaa99ce5bda6
EUC-JP 陷溢。ソ寘懷スヲ 1110111111111100101100001110111010001110101000011000111010111111100011111011101011100001110110001110011110001110101111011000111010100110 effcb0ee8ea18ebf8fbae1d8e78ebd8ea6
UTF-8 陷溢。ソ寘懷スヲ 111010011001100110110111111001101011101010100010111011111011110110100001111011111011110110111111111001011010111110011000111001101000011110110111111011111011110110111101111011111011110110100110 e999b7e6baa2efbda1efbdbfe5af98e687b7efbdbdefbda6
UHC 陷溢???懷?? 1111100111101000111011001110111000111111001111110011111111111100111000110011111100111111 f9e8ecee3f3f3ffce33f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)