To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 陝抵ス」陜ィ莨懣酔陝抵ス」阨」莨懈凄^ 11101000100111111001001011101111101111011010001111101000100111011010100011100100101111001001110011101110100100001000110011101000100111111001001011101111101111011010001111101000100101011010001111100100101111001001110011100110100100001010011001011110 e89f92efbda3e89da8e4bc9cee908ce89f92efbda3e895a3e4bc9ce690a65e
EUC-JP 陝抵ス」陜ィ莨懣酔陝抵ス」阨」莨懈凄^ 11110000101000011100010011110001100011101011110110001110101000111110111111111101100011101010100011101000101111101101100011110000101111111110110011110000101000011100010011110001100011101011110110001110101000111110111111110101100011101010001111101000101111101101100011101000110000001010100001011110 f0a1c4f18ebd8ea3effd8ea8e8bed8f0bfecf0a1c4f18ebd8ea3eff58ea3e8bed8e8c0a85e
UTF-8 陝抵ス」陜ィ莨懣酔陝抵ス」阨」莨懈凄^ 11101001100110011001110111100110100010101011010111101111101111011011110111101111101111011010001111101001100110011001110011101111101111011010100011101000100011101010100011100110100001111010001111101001100001011001010011101001100110011001110111100110100010101011010111101111101111011011110111101111101111011010001111101001100110001010100011101111101111011010001111101000100011101010100011100110100001111000100011100101100001111000010001011110 e9999de68ab5efbdbdefbda3e9999cefbda8e88ea8e687a3e98594e9999de68ab5efbdbdefbda3e998a8efbda3e88ea8e68788e587845e
UHC 陝抵??陜????陝抵?????懈凄^ 1110000011101101111011101011110100111111001111111111100111110000001111110011111100111111001111111110000011101101111011101011110100111111001111110011111100111111001111111111101010101011111101001010001001011110 e0edeebd3f3ff9f03f3f3f3fe0edeebd3f3f3f3f3ffaabf4a25e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)