To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 咳??海?垓?楷 100010100101000000111111001111111000101001000011001111111001101010110100001111111001111010110010 8a503f3f8a433f9ab43f9eb2
EUC-JP 咳??海?垓?楷 101100111011000100111111001111111011001110100100001111111101010010110110001111111101110010110100 b3b13f3fb3a43fd4b63fdcb4
UTF-8 咳띌씌海렭垓렡楷 111001011001001010110011111010111001110110001100111011001001010010001100111001101011010110110111111010111010000010101101111001011001111010010011111010111010000010100001111001101010010110110111 e592b3eb9d8cec948ce6b5b7eba0ade59e93eba0a1e6a5b7
UHC 咳띌씌海렭垓렡楷 11111010101001101011011011101001101111101011101011111010101011011000111010111010111110101010011110001110101100101111101010101100 faa6b6e9bebafaad8ebafaa78eb2faac

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)