To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 狎??柔??音??沃??而g?矣??? 1110000010111110001111110011111110001111010111110011111100111111100010011011100100111111001111111001011110000000001111110011111110001110101001111000001010000111001111111110000111100001001111110011111100111111 e0be3f3f8f5f3f3f89b93f3f97803f3f8ea782873fe1e13f3f3f
EUC-JP 狎??柔??音??沃??而g?矣??孼 11100000110000000011111100111111101111011100000000111111001111111011001010111011001111110011111111001101111000000011111100111111101111001010100110100011111001110011111111100010111000110011111100111111100011111011101011000011 e0c03f3fbdc03f3fb2bb3f3fcde03f3fbca9a3e73fe2e33f3f8fbac3
UTF-8 狎뜯뫀柔꾢첎音썬룋沃섅굦而g춯矣롫닗孼 111001111000101110001110111010111001110010101111111010111010101110000000111001101001111110010100111010101011111010100010111011001011001010001110111010011001111110110011111011001000110110101100111010111010001110001011111001101011001010000011111011001000010010000101111010101011010110100110111010001000000010001100111011111011110110000111111011001011011010101111111001111001111110100011111010111010000110101011111010111000101110010111111001011010110110111100 e78b8eeb9cafebab80e69f94eabea2ecb28ee99fb3ec8daceba38be6b283ec8485eab5a6e8808cefbd87ecb6afe79fa3eba1abeb8b97e5adbc
UHC 狎뜯뫀柔꾢첎音썬룋沃섅굦而g춯矣롫닗孼 1110010011100100101101101110001010010001101001001110101011110101100001001110010110101010100110111110101111100101101111011110001110001111100010101110100010101010100110001110001110000010100011001110110010111011101000111110011110101101100011001110101111111000100011101110101110001000100110111110010111101101 e4e4b6e291a4eaf584e5aa9bebe5bde38f8ae8aa98e3828cecbba3e7ad8cebf88eeb889be5ed

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)