To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 誤?????蟻??癲???伍?…???矣??^ 1000110011101011001111110011111100111111001111110011111110001011011000010011111100111111111000011001111100111111001111110011111110001100110111100011111110000001011000110011111100111111001111111110000111100001001111110011111101011110 8ceb3f3f3f3f3f8b613f3fe19f3f3f3f8cde3f81633f3f3fe1e13f3f5e
EUC-JP 誤?????蟻??癲???伍?…???矣??^ 1011100011101101001111110011111100111111001111110011111110110101110000100011111100111111111000101010000100111111001111110011111110111000111000000011111110100001110001000011111100111111001111111110001011100011001111110011111101011110 b8ed3f3f3f3f3fb5c23f3fe2a13f3f3fb8e03fa1c43f3f3fe2e33f3f5e
UTF-8 誤곥굞泥곫콨蟻믪내癲뗪퉩첩伍밸…吏섊뛾矣꾨윴^ 11101000101010101010010011101010101100111010010111101010101101011001111011101111101001111010001111101010101100111010101111101100101111011010100011101000100111111011101111101011101011111010101011101011100000101011010011100111100110011011001011101011100101111010101011101101100010011010100111101100101100101010100111100100101111001000110111101011101100001011100011100010100000001010011011101111101001111001111011101100100001001000101011101011100110111011111011100111100111111010001111101010101111101010100011101100100111001011010001011110 e8aaa4eab3a5eab59eefa7a3eab3abecbda8e89fbbebafaaeb82b4e799b2eb97aaed89a9ecb2a9e4bc8debb0b8e280a6efa79eec848aeb9bbee79fa3eabea8ec9cb45e
UHC 誤곥굞泥곫콨蟻믪내癲뗪퉩첩伍밸…吏섊뛾矣꾨윴^ 111010001010011010000001111000111000001010000110111011001011001010000001111001101011000110011101111010111111110010010010111011001011001110111011111011111010011010001011111010101011100110000001110000111011100011100111111010101011100111101011101000011010011011101100101001111001100011100111100011011000010011101011111110001000010011101011100111111011000001011110 e8a681e38286ecb281e6b19debfc92ecb3bbefa68beab981c3b8e7eab9eba1a6eca798e78d84ebf884eb9fb05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)