What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????z?????????zB 001111110011111100111111001111110011111100111111001111110011111100111111011110100011111100111111001111110011111100111111001111110011111100111111001111110111101001000010 3f3f3f3f3f3f3f3f3f7a3f3f3f3f3f3f3f3f3f7a42
SJIS-WIN 嗚??肄?????z嗚??肄?????zB 10011010011010100011111100111111111000111110010100111111001111110011111100111111001111110111101010011010011010100011111100111111111000111110010100111111001111110011111100111111001111110111101001000010 9a6a3f3fe3e53f3f3f3f3f7a9a6a3f3fe3e53f3f3f3f3f7a42
EUC-JP 嗚??肄?????z嗚??肄?????zB 11010011110010110011111100111111111001101110011100111111001111110011111100111111001111110111101011010011110010110011111100111111111001101110011100111111001111110011111100111111001111110111101001000010 d3cb3f3fe6e73f3f3f3f3f7ad3cb3f3fe6e73f3f3f3f3f7a42
UTF-8 嗚삳떧肄낁뮲硫⑹쐡z嗚삳떧肄낁뮲硫⑹쐡zB 111001011001011110011010111011001000001010110011111010111001011010100111111010001000001010000100111010111000001010000001111010111010111010110010111011111010011110001110111000101001000110111001111011001001000010100001011110101110010110010111100110101110110010000010101100111110101110010110101001111110100010000010100001001110101110000010100000011110101110101110101100101110111110100111100011101110001010010001101110011110110010010000101000010111101001000010 e5979aec82b3eb96a7e88284eb8281ebaeb2efa78ee291b9ec90a17ae5979aec82b3eb96a7e88284eb8281ebaeb2efa78ee291b9ec90a17a42
UHC 嗚삳떧肄낁뮲硫⑹쐡z嗚삳떧肄낁뮲硫⑹쐡zB 111001111111000010111011111010111000101110111010111011001011110110000101111010001001001010111011111010111010100110101001111011001001110010000111011110101110011111110000101110111110101110001011101110101110110010111101100001011110100010010010101110111110101110101001101010011110110010011100100001110111101001000010 e7f0bbeb8bbaecbd85e892bbeba9a9ec9c877ae7f0bbeb8bbaecbd85e892bbeba9a9ec9c877a42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
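
The conversion shown in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The function name `encode_table` is hypothetical, and the codec names are assumptions about the closest Python equivalents: `cp932` for SJIS-Win and `cp949` for UHC. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table does.

```python
# Sketch: encode a string in several charsets and show the resulting bytes
# as a binary bit string and as hexadecimal.
# Assumption: Python's "cp932" and "cp949" codecs stand in for SJIS-Win and UHC.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    """Return a list of (charset, bit string, hex string) tuples for text."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexstr in encode_table("zB"):
        print(name, bits, hexstr)
```

For ASCII-only input such as `zB`, every charset listed here produces the same bytes (`7a42`). The results only differ once the input contains characters outside ASCII.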