What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 藥??藥??靄?????節??靄??藥??^ 11100101010110100011111100111111111001010101101000111111001111111110100011001001001111110011111100111111001111110011111110010000110111110011111100111111111010001100100100111111001111111110010101011010001111110011111101011110 e55a3f3fe55a3f3fe8c93f3f3f3f3f90df3f3fe8c93f3fe55a3f3f5e
EUC-JP 藥??藥??靄?????節??靄??藥??^ 11101001101110110011111100111111111010011011101100111111001111111111000011001011001111110011111100111111001111110011111111000000111000010011111100111111111100001100101100111111001111111110100110111011001111110011111101011110 e9bb3f3fe9bb3f3ff0cb3f3f3f3f3fc0e13f3ff0cb3f3fe9bb3f3f5e
UTF-8 藥먲쉑藥먨솳靄ㅵ톾嶺묈톾節㎩톾靄ⓨ솮藥먪븳^ 11101000100101111010010111101011101010001011001011101100100010011001000111101000100101111010010111101011101010001010100011101100100001101011001111101001100111011000010011100011100001011011010111101101100001101011111011101111101001101010101111101011101011001000100011101101100001101011111011100111101011111000000011100011100011101010100111101101100001101011111011101001100111011000010011100010100100111010100011101100100001101010111011101000100101111010010111101011101010001010101011101011101110001011001101011110 e897a5eba8b2ec8991e897a5eba8a8ec86b3e99d84e385b5ed86beefa6abebac88ed86bee7af80e38ea9ed86bee99d84e293a8ec86aee897a5eba8aaebb8b35e
UHC 藥먲쉑藥먨솳靄ㅵ톾嶺묈톾節㎩톾靄ⓨ솮藥먪븳^ 11100101101101111001000011101111101111011010011111100101101101111001000011100101100110011010100011100100111101111010010011100101101101111001000011100111101011011001000111100101101101111001000011101111101111011010011111100101101101111001000011100100111101111010100011100101100110011010010011100101101101111001000011100111100101011001110001011110 e5b790efbda7e5b790e599a8e4f7a4e5b790e7ad91e5b790efbda7e5b790e4f7a8e599a4e5b790e7959c5e

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
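
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's `cp932` codec stands in for SJIS-Win, that `cp949` stands in for UHC, and that characters a charset cannot represent are replaced with `?` (0x3f), as the rows above suggest. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical and chosen for illustration only.

```python
# Map the charset labels used in the table to Python codec names.
# Assumption: cp932 approximates SJIS-Win and cp949 approximates UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?'.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, input, binary, hexadecimal.
    sample = "あ^"
    for name, (bits, hex_string) in encode_table(sample).items():
        print(name, sample, bits, hex_string)
```

Called with a string that a charset cannot represent, `encode_row` yields runs of `3f` bytes, which is the same pattern visible in the ISO-8859-1 row of the table.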