To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 蒻??藥??章??曄??節e?午??藥?ⅷ^ 11100100111010000011111100111111111001010101101000111111001111111000111111001101001111110011111110011110010000000011111100111111100100001101111110000010100001010011111110001100110111110011111100111111111001010101101000111111111110100100011101011110 e4e83f3fe55a3f3f8fcd3f3f9e403f3f90df82853f8cdf3f3fe55a3ffa475e
EUC-JP 蒻??藥??章??曄??節e?午??藥??^ 111010001110101000111111001111111110100110111011001111110011111110111110110011110011111100111111110110111010000100111111001111111100000011100001101000111110010100111111101110001110000100111111001111111110100110111011001111110011111101011110 e8ea3f3fe9bb3f3fbecf3f3fdba13f3fc0e1a3e53fb8e13f3fe9bb3f3f5e
UTF-8 蒻몌숲藥먫룴章듿톾曄ⓨ떳節e톾午⑻뱤藥먪ⅷ^ 11101000100100101011101111101011101010101000110011101100100010001011001011101000100101111010010111101011101010001010101111101011101000111011010011100111101010111010000011101011100100111011111111101101100001101011111011100110100110111000010011100010100100111010100011101011100101101011001111100111101011111000000011101111101111011000010111101101100001101011111011100101100011011000100011100010100100011011101111101011101100011010010011101000100101111010010111101011101010001010101011100010100001011011011101011110 e892bbebaa8cec88b2e897a5eba8abeba3b4e7aba0eb93bfed86bee69b84e293a8eb96b3e7af80efbd85ed86bee58d88e291bbebb1a4e897a5eba8aae285b75e
UHC 蒻몌숲藥먫룴章듿톾曄ⓨ떳節e톾午⑻뱤藥먪ⅷ^ 11100101101101101011100011101111101111011010001111100101101101111001000011101000100011111010100111101101111100011000101011100101101101111001000011100111101001011010100011100101101101101011100011101111101111011010001111100101101101111001000011100111111011011010100111101110100100111000101011100101101101111001000011100111101001011010100001011110 e5b6b8efbda3e5b790e88fa9edf18ae5b790e7a5a8e5b6b8efbda3e5b790e7eda9ee938ae5b790e7a5a85e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)