To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 饒??節??齬??譯??埃??循??嚥?? 11101001011000000011111100111111100100001101111100111111001111111110101010010111001111110011111111100110101000010011111100111111100110101011101000111111001111111000111101111010001111110011111110011010100010110011111100111111 e9603f3f90df3f3fea973f3fe6a13f3f9aba3f3f8f7a3f3f9a8b3f3f
EUC-JP 饒??節??齬??譯??埃??循??嚥?? 11110001110000010011111100111111110000001110000100111111001111111111001111110111001111110011111111101100101000110011111100111111110101001011110000111111001111111011110111011011001111110011111111010011111010110011111100111111 f1c13f3fc0e13f3ff3f73f3feca33f3fd4bc3f3fbddb3f3fd3eb3f3f
UTF-8 饒묈뫛節얇굥齬뜻옅譯꾬슛埃삼슭循뗦꺐嚥뜻뱮 111010011010010110010010111010111010110010001000111010111010101110011011111001111010111110000000111011001001011010000111111010101011010110100101111010011011110110101100111010111001110010111011111011001001100010000101111010001010110110101111111010101011111010101100111011001000101010011011111001011001111110000011111011001000001010111100111011001000101010101101111001011011111010101010111010111001011110100110111010101011101010010000111001011001101010100101111010111001110010111011111010111011000110101110 e9a592ebac88ebab9be7af80ec9687eab5a5e9bdaceb9cbbec9885e8adafeabeacec8a9be59f83ec82bcec8aade5beaaeb97a6eaba90e59aa5eb9cbbebb1ae
UHC 饒묈뫛節얇굥齬뜻옅譯꾬슛埃삼슭循뗦꺐嚥뜻뱮 111010011010111010010001111001011001000110111011111011111011110110111110111000111000001010001011111001011110000110110110111001101011111110110110111001101011101110000100111011111011110110111000111001001110111110111011111011111011110110111110111000101110000010001011111001101000001110110110111001101011111110110110111001101001001110010100 e9ae91e591bbefbdbee3828be5e1b6e6bfb6e6bb84efbdb8e4efbbefbdbee2e08be683b6e6bfb6e69394

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)