To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鯒タヒワクタホチ・ス鯒タヒワクタホチ・ス^ 111010011100010111000000110010111101110010111000110000001100111011000001101001011011110111110000010000111110100111000101110000001100101111011100101110001100000011001110110000011010010110111101111100000100010001011110 e9c5c0cbdcb8c0cec1a5bdf043e9c5c0cbdcb8c0cec1a5bdf0445e
EUC-JP 鯒タヒワクタホチ・ス?鯒タヒワクタホチ・ス?^ 11110010110001111000111011000000100011101100101110001110110111001000111010111000100011101100000010001110110011101000111011000001100011101010010110001110101111010011111111110010110001111000111011000000100011101100101110001110110111001000111010111000100011101100000010001110110011101000111011000001100011101010010110001110101111010011111101011110 f2c78ec08ecb8edc8eb88ec08ece8ec18ea58ebd3ff2c78ec08ecb8edc8eb88ec08ece8ec18ea58ebd3f5e
UTF-8 鯒タヒワクタホチ・ス鯒タヒワクタホチ・ス^ 11101001101011111001001011101111101111101000000011101111101111101000101111101111101111101001110011101111101111011011100011101111101111101000000011101111101111101000111011101111101111101000000111101111101111011010010111101111101111011011110111101110100000001000001111101001101011111001001011101111101111101000000011101111101111101000101111101111101111101001110011101111101111011011100011101111101111101000000011101111101111101000111011101111101111101000000111101111101111011010010111101111101111011011110111101110100000001000010001011110 e9af92efbe80efbe8befbe9cefbdb8efbe80efbe8eefbe81efbda5efbdbdee8083e9af92efbe80efbe8befbe9cefbdb8efbe80efbe8eefbe81efbda5efbdbdee80845e
UHC ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)