To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??畯脈??製??競? 001111110011111111111011011011111001011010101100001111110011111110010000101110110011111100111111100010111010001100111111 3f3ffb6f96ac3f3f90bb3f3f8ba33f
EUC-JP ??畯脈??製?饔競? 001111110011111110001111110011011011101111001100101011100011111100111111110000001011110100111111100011111110100011101111101101101010010100111111 3f3f8fcdbbccae3f3fc0bd3f8fe8efb6a53f
UTF-8 亐렕畯脈롛렣製렩饔競렲 111001001011101010010000111010111010000010010101111001111001010110101111111010001000010010001000111010111010000110011011111010111010000010100011111010001010001110111101111010111010000010101001111010011010010110010100111001111010101110110110111010111010000010110010 e4ba90eba095e795afe88488eba19beba0a3e8a3bdeba0a9e9a594e7abb6eba0b2
UHC 亐렕畯脈롛렣製렩饔競렲 11101010101001111000111010101010111100011110000111011000111001101000111011011111100011101011010011110000101100101000111010110111111010001011110111001100111001101000111010111111 eaa78eaaf1e1d8e68edf8eb4f0b28eb7e8bdcce68ebf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)