To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 逋語サ喜発隱シ崧マ逋語サ喜発隱シ崧ボ^ 111001111001100110001100111010101011101110001010111011001001010010101101111010001010101010111100111110101010111110000011011111011110011110011001100011001110101010111011100010101110110010010100101011011110100010101010101111001111101010101111100000110111101101011110 e7998ceabb8aec94ade8aabcfaaf837de7998ceabb8aec94ade8aabcfaaf837b5e
EUC-JP 逋語サ喜発隱シ崧マ逋語サ喜発隱シ崧ボ^ 111011011111100110111000111011001000111010111011101101001110111011001000101011111111000010101100100011101011110010001111101110111100101010100101110111101110110111111001101110001110110010001110101110111011010011101110110010001010111111110000101011001000111010111100100011111011101111001010101001011101110001011110 edf9b8ec8ebbb4eec8aff0ac8ebc8fbbcaa5deedf9b8ec8ebbb4eec8aff0ac8ebc8fbbcaa5dc5e
UTF-8 逋語サ喜発隱シ崧マ逋語サ喜発隱シ崧ボ^ 11101001100000001000101111101000101010101001111011101111101111011011101111100101100101101001110011100111100110011011101011101001100110101011000111101111101111011011110011100101101101001010011111100011100000111001111011101001100000001000101111101000101010101001111011101111101111011011101111100101100101101001110011100111100110011011101011101001100110101011000111101111101111011011110011100101101101001010011111100011100000111001110001011110 e9808be8aa9eefbdbbe5969ce799bae99ab1efbdbce5b4a7e3839ee9808be8aa9eefbdbbe5969ce799bae99ab1efbdbce5b4a7e3839c5e
UHC 逋語?喜?隱?崧マ逋語?喜?隱?崧ボ^ 11111000111001111110010111011110001111111111110111101100001111111110101111011111001111111110001011111110101010111101111011111000111001111110010111011110001111111111110111101100001111111110101111011111001111111110001011111110101010111101110001011110 f8e7e5de3ffdec3febdf3fe2feabdef8e7e5de3ffdec3febdf3fe2feabdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)