What bit string is a character encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 藥??逸??癒ъ????侑??幽??癲???^ 111001010101101000111111001111111000100011101101001111110011111110010110111111001000010010001100001111110011111100111111001111111001100011010000001111110011111110010111010010000011111100111111111000011001111100111111001111110011111101011110 e55a3f3f88ed3f3f96fc848c3f3f3f3f98d03f3f97483f3fe19f3f3f3f5e
EUC-JP 藥??逸??癒ъ????侑??幽??癲???^ 111010011011101100111111001111111011000011101111001111110011111111001100111111101010011111101100001111110011111100111111001111111101000011010010001111110011111111001101101010010011111100111111111000101010000100111111001111110011111101011110 e9bb3f3fb0ef3f3fccfea7ec3f3f3f3fd0d23f3fcda93f3fe2a13f3f3f5e
UTF-8 藥띲끏逸썹독癒ъ넽蓮용끋侑좂독幽뚰뭻癲앗뉗뿨^ 111010001001011110100101111010111001110110110010111010111000000110001111111010011000000010111000111011001000110110111001111010111000111110000101111001111001100110010010110100011000101011101011100001001011110111101111101001101001100111101100100110101010100111101011100000011000101111100100101111101001000111101100101000101000001011101011100011111000010111100101101110011011110111101011100110101011000011101011101011011011101111100111100110011011001011101100100101011001011111101011100010011001011111101011101111111010100001011110 e897a5eb9db2eb818fe980b8ec8db9eb8f85e79992d18aeb84bdefa699ec9aa9eb818be4be91eca282eb8f85e5b9bdeb9ab0ebadbbe799b2ec9597eb8997ebbfa85e
UHC 藥띲끏逸썹독癒ъ넽蓮용끋侑좂독幽뚰뭻癲앗뉗뿨^ 111001011011011110001101111000111000010110111111111011001110111110111101111001111011010110110110111010111010100010101100111011001000011010110111111001101110010110111111111010111000010110111101111010101110001010100000111001111011010110110110111010101110101110001100111011011001001010001010111011111010011010111110110100011000011111101100100101111010100001011110 e5b78de385bfecefbde7b5b6eba8acec86b7e6e5bfeb85bdeae2a0e7b5b6eaeb8ced928aefa6bed187ec97a85e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
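
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the helper name encode_row are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the 3f bytes visible in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 stands in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC treated as Windows code page 949
}


def encode_row(text, codec):
    """Hypothetical helper: return (binary string, hex string) for text in codec.

    Unencodable characters become '?' (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "あ^"
    for label, codec in CHARSETS.items():
        bits, hexa = encode_row(sample, codec)
        print(label, sample, bits, hexa)
```

Each output line mirrors one table row: charset label, input text, binary bit string, hexadecimal bit string. Results for other codecs or platforms may differ slightly from the tool's output, since vendor variants of the same charset do not always agree on every character.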