To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????\}????????\{^ 001111110011111100111111001111110011111100111111001111110011111101011100011111010011111100111111001111110011111100111111001111110011111100111111010111000111101101011110 3f3f3f3f3f3f3f3f5c7d3f3f3f3f3f3f3f3f5c7b5e
SJIS-WIN 豺シ螟ゑスー驫ュ\}豺シ螟ゑスー驫ュ\{^ 1110011010110111101111001110010110100100100000101110111110111101101100001110100110001010101011010101110001111101111001101011011110111100111001011010010010000010111011111011110110110000111010011000101010101101010111000111101101011110 e6b7bce5a482efbdb0e98aad5c7de6b7bce5a482efbdb0e98aad5c7b5e
EUC-JP 豺シ螟ゑスー驫ュ\}豺シ螟ゑスー驫ュ\{^ 11101100101110011000111010111100111010101010011010100100111100011000111010111101100011101011000011110001111010101000111010101101010111000111110111101100101110011000111010111100111010101010011010100100111100011000111010111101100011101011000011110001111010101000111010101101010111000111101101011110 ecb98ebceaa6a4f18ebd8eb0f1ea8ead5c7decb98ebceaa6a4f18ebd8eb0f1ea8ead5c7b5e
UTF-8 豺シ螟ゑスー驫ュ\}豺シ螟ゑスー驫ュ\{^ 1110100010110001101110101110111110111101101111001110100010011110100111111110001110000010100100011110111110111101101111011110111110111101101100001110100110101001101010111110111110111101101011010101110001111101111010001011000110111010111011111011110110111100111010001001111010011111111000111000001010010001111011111011110110111101111011111011110110110000111010011010100110101011111011111011110110101101010111000111101101011110 e8b1baefbdbce89e9fe38291efbdbdefbdb0e9a9abefbdad5c7de8b1baefbdbce89e9fe38291efbdbdefbdb0e9a9abefbdad5c7b5e
UHC 豺?螟ゑ????\}豺?螟ゑ????\{^ 111000111100111100111111110110011010110110101010111100010011111100111111001111110011111101011100011111011110001111001111001111111101100110101101101010101111000100111111001111110011111100111111010111000111101101011110 e3cf3fd9adaaf13f3f3f3f5c7de3cf3fd9adaaf13f3f3f3f5c7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)