Which bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 霓????霓????B 11101000101111010011111100111111001111110011111111101000101111010011111100111111001111110011111101000010 e8bd3f3f3f3fe8bd3f3f3f3f42
EUC-JP 霓????霓????B 11110000101111110011111100111111001111110011111111110000101111110011111100111111001111110011111101000010 f0bf3f3f3f3ff0bf3f3f3f3f42
UTF-8 霓낅끆利퉕霓낅끆利퉕B 11101001100111001001001111101011100000101000010111101011100000011000011011101111101001111001110111101101100010011001010111101001100111001001001111101011100000101000010111101011100000011000011011101111101001111001110111101101100010011001010101000010 e99c93eb8285eb8186efa79ded8995e99c93eb8285eb8186efa79ded899542
UHC 霓낅끆利퉕霓낅끆利퉕B 111001111110011110000101111010111000010110111010111011001010011010111001011010011110011111100111100001011110101110000101101110101110110010100110101110010110100101000010 e7e785eb85baeca6b969e7e785eb85baeca6b96942

SJIS-WIN, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (hexadecimal 3f) in the table above.
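
The conversion shown in the table can be approximated with a short Python sketch. This is not the converter behind this page. The codec names in CODECS (for example cp932 for SJIS-WIN and cp949 for UHC) are assumed equivalents, the helper names encode_to_bits and convert are made up for illustration, and Python's mapping tables may differ from this page's in edge cases. Unencodable characters are replaced with "?", which appears to be what the table does.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_to_bits(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("B").items():
        print(label, binary, hexadecimal)
```

For the ASCII letter "B", every charset listed here yields the same single byte, 01000010 (hexadecimal 42), which matches the last byte of each row in the table.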