To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 悟??巡??率??D悟??巡??率??D^ 100011001110010100111111001111111000111110000100001111110011111110010111101001100011111100111111010001001000110011100101001111110011111110001111100001000011111100111111100101111010011000111111001111110100010001011110 8ce53f3f8f843f3f97a63f3f448ce53f3f8f843f3f97a63f3f445e
EUC-JP 悟??巡??率??D悟??巡??率??D^ 101110001110011100111111001111111011110111100100001111110011111111001110101010000011111100111111010001001011100011100111001111110011111110111101111001000011111100111111110011101010100000111111001111110100010001011110 b8e73f3fbde43f3fcea83f3f44b8e73f3fbde43f3fcea83f3f445e
UTF-8 悟귘뀤巡뗨쁻率쎻뵮D悟귘뀤巡뗨쁻率쎻뵮D^ 111001101000001010011111111010101011011110011000111010111000000010100100111001011011011110100001111010111001011110101000111011001000000110111011111001111000111010000111111011001000111010111011111010111011010110101110010001001110011010000010100111111110101010110111100110001110101110000000101001001110010110110111101000011110101110010111101010001110110010000001101110111110011110001110100001111110110010001110101110111110101110110101101011100100010001011110 e6829feab798eb80a4e5b7a1eb97a8ec81bbe78e87ec8ebbebb5ae44e6829feab798eb80a4e5b7a1eb97a8ec81bbe78e87ec8ebbebb5ae445e
UHC 悟귘뀤巡뗨쁻率쎻뵮D悟귘뀤巡뗨쁻率쎻뵮D^ 111001111111011010000010111000101000010110011011111000101101111010001011111010001001100010000010111000011110001110011011111000101001010010101100010001001110011111110110100000101110001010000101100110111110001011011110100010111110100010011000100000101110000111100011100110111110001010010100101011000100010001011110 e7f682e2859be2de8be89882e1e39be294ac44e7f682e2859be2de8be89882e1e39be294ac445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)