What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???靭??攸??域??苡??恂ш???? 001111110011111100111111100100000111100000111111001111111001110110111111001111110011111110001000111001100011111100111111111001001000111100111111001111111001110010010110100001001000101000111111001111110011111100111111 3f3f3f90783f3f9dbf3f3f88e63f3fe48f3f3f9c96848a3f3f3f3f
EUC-JP ???靭??攸??域??苡??恂ш?孼?? 0011111100111111001111111011111111011001001111110011111111011010110000010011111100111111101100001110100000111111001111111110011111101111001111110011111111010111111101101010011111101010001111111000111110111010110000110011111100111111 3f3f3fbfd93f3fdac13f3fb0e83f3fe7ef3f3fd7f6a7ea3f8fbac33f3f
UTF-8 麗몃쓷靭뚳㎘攸곷룆域뱀늽苡긷렚恂ш섶孼뽰겕 1110111110100110100010001110101110101010100000111110110010010011101101111110100110011101101011011110101110011010101100111110001110001110100110001110011010010100101110001110101010110011101101111110101110100011100001101110010110011111100111111110101110110001100000001110101110001010101111011110100010001011101000011110101010111000101101111110101110100000100110101110011010000001100000101101000110001000111011001000010010110110111001011010110110111100111010111011110110110000111010101011001010010101 efa688ebaa83ec93b7e99dadeb9ab3e38e98e694b8eab3b7eba386e59f9febb180eb8abde88ba1eab8b7eba09ae68182d188ec84b6e5adbcebbdb0eab295
UHC 麗몃쓷靭뚳㎘攸곷룆域뱀늽苡긷렚恂ш섶孼뽰겕 111001101011000010111000111010111001110110010100111011001110010110001100111011111010011110100101111010101111001010000001111010111000111110000101111001101011010010111001111011001000100010000110111011001011111010110001111001011000111010101101111000101110000110101100111010101011110010111011111001011110110110010110111011001000000110101100 e6b0b8eb9d94ece58cefa7a5eaf281eb8f85e6b4b9ec8886ecbeb1e58eade2e1aceabcbbe5ed96ec81ac

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f).
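
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's standard codecs `cp932`, `euc_jp`, and `cp949` correspond to SJIS-WIN, EUC-JP, and UHC, and that unencodable characters are replaced with "?" as in the table. The names `CHARSETS`, `encode_row`, and `convert` are made up for illustration.

```python
# Map the charset labels used in the table to Python codec names.
# Assumption: cp932 ~ SJIS-WIN, euc_jp ~ EUC-JP, cp949 ~ UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (binary, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("靭").items():
        print(label, binary, hexadecimal)
```

For the single character 靭, this should give the same hexadecimal values that appear for it in the table rows above: 9078 in SJIS-WIN, bfd9 in EUC-JP, e99dad in UTF-8, and 3f in ISO-8859-1, which cannot represent it.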