What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????^?????????^B 001111110011111100111111001111110011111100111111001111110011111100111111010111100011111100111111001111110011111100111111001111110011111100111111001111110101111001000010 3f3f3f3f3f3f3f3f3f5e3f3f3f3f3f3f3f3f3f5e42
SJIS-WIN 悟?????擬??^悟?????擬??^B 10001100111001010011111100111111001111110011111100111111100010110101101100111111001111110101111010001100111001010011111100111111001111110011111100111111100010110101101100111111001111110101111001000010 8ce53f3f3f3f3f8b5b3f3f5e8ce53f3f3f3f3f8b5b3f3f5e42
EUC-JP 悟??瑗??擬??^悟??瑗??擬??^B 1011100011100111001111110011111110001111110011001100000000111111001111111011010110111100001111110011111101011110101110001110011100111111001111111000111111001100110000000011111100111111101101011011110000111111001111110101111001000010 b8e73f3f8fccc03f3fb5bc3f3f5eb8e73f3f8fccc03f3fb5bc3f3f5e42
UTF-8 悟귣뀛瑗뉛쬃擬뺤떼^悟귣뀛瑗뉛쬃擬뺤떼^B 111001101000001010011111111010101011011110100011111010111000000010011011111001111001000110010111111010111000100110011011111011001010110010000011111001101001001110101100111010111011101010100100111010111001011010111100010111101110011010000010100111111110101010110111101000111110101110000000100110111110011110010001100101111110101110001001100110111110110010101100100000111110011010010011101011001110101110111010101001001110101110010110101111000101111001000010 e6829feab7a3eb809be79197eb899becac83e693acebbaa4eb96bc5ee6829feab7a3eb809be79197eb899becac83e693acebbaa4eb96bc5e42
UHC 悟귣뀛瑗뉛쬃擬뺤떼^悟귣뀛瑗뉛쬃擬뺤떼^B 111001111111011010000010111010111000010110010100111010101011110010000111111011111010011010011010111010111111010010010101111011001011011010111100010111101110011111110110100000101110101110000101100101001110101010111100100001111110111110100110100110101110101111110100100101011110110010110110101111000101111001000010 e7f682eb8594eabc87efa69aebf495ecb6bc5ee7f682eb8594eabc87efa69aebf495ecb6bc5e42

SJIS-WIN, EUC-JP: Classic character sets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
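
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation, which is not shown here. The helper name `bit_strings` is hypothetical, and the Python codec names `cp932` and `cp949` are assumed to correspond to SJIS-WIN and UHC respectively. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is how the `3f` runs in the table would arise.

```python
# Charset labels from the table mapped to Python codec names.
# The cp932 and cp949 mappings are assumptions, not taken from this page.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string).

    Unencodable characters are replaced with '?' instead of raising.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(format(byte, "08b") for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "B"  # any single character or short string
    for label, codec in CHARSETS.items():
        binary, hexa = bit_strings(sample, codec)
        print(label, sample, binary, hexa)
```

For example, `bit_strings("B", "utf-8")` returns `("01000010", "42")`, matching the final byte of every row in the table.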