What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 耳?圓???缺??缺耳?圓???缺??缺B 1000111010101000001111111001101010100010001111110011111100111111111000111001111000111111001111111110001110011110100011101010100000111111100110101010001000111111001111110011111111100011100111100011111100111111111000111001111001000010 8ea83f9aa23f3f3fe39e3f3fe39e8ea83f9aa23f3f3fe39e3f3fe39e42
EUC-JP 耳?圓???缺??缺耳?圓???缺??缺B 1011110010101010001111111101010010100100001111110011111100111111111001011111111000111111001111111110010111111110101111001010101000111111110101001010010000111111001111110011111111100101111111100011111100111111111001011111111001000010 bcaa3fd4a43f3f3fe5fe3f3fe5febcaa3fd4a43f3f3fe5fe3f3fe5fe42
UTF-8 耳렲圓꿱렱곈缺裏곈缺耳렲圓꿱렱곈缺裏곈缺B 11101000100000001011001111101011101000001011001011100101100111001001001111101010101111111011000111101011101000001011000111101010101100111000100011100111101111001011101011101111101001111010011111101010101100111000100011100111101111001011101011101000100000001011001111101011101000001011001011100101100111001001001111101010101111111011000111101011101000001011000111101010101100111000100011100111101111001011101011101111101001111010011111101010101100111000100011100111101111001011101001000010 e880b3eba0b2e59c93eabfb1eba0b1eab388e7bcbaefa7a7eab388e7bcbae880b3eba0b2e59c93eabfb1eba0b1eab388e7bcbaefa7a7eab388e7bcba42
UHC 耳렲圓꿱렱곈缺裏곈缺耳렲圓꿱렱곈缺裏곈缺B 1110110010111100100011101011111111101010101011011011001011101000100011101011111010110000111010011100110011000000111011001100000010110000111010011100110011000000111011001011110010001110101111111110101010101101101100101110100010001110101111101011000011101001110011001100000011101100110000001011000011101001110011001100000001000010 ecbc8ebfeaadb2e88ebeb0e9ccc0ecc0b0e9ccc0ecbc8ebfeaadb2e88ebeb0e9ccc0ecc0b0e9ccc042

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (hexadecimal 3f) in the table.
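
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name encode_table and the mapping from the page's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) are assumptions made for illustration, and Python's codecs may differ from the page's converter for some characters. Unencodable characters are replaced with "?", which is consistent with the 3f bytes in the ISO-8859-1 row.

```python
# Assumed mapping from the charset labels used in the table
# to Python codec names (not taken from the page itself).
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("耳B"):
        print(label, bits, hex_string)
```

For the input 耳B, the UTF-8 row comes out as e880b342, which agrees with the first and last bytes of the UTF-8 row in the table above.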