To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蘂??意??儒?????夷у?諛??沃?? 11100101010000010011111100111111100010001101001100111111001111111000111011110010001111110011111100111111001111110011111110001000110011101000010010000101001111111110011010000111001111110011111110010111100000000011111100111111 e5413f3f88d33f3f8ef23f3f3f3f3f88ce84853fe6873f3f97803f3f
EUC-JP 蘂??意??儒?????夷у?諛??沃?? 11101001101000100011111100111111101100001101010100111111001111111011110011110100001111110011111100111111001111110011111110110000110100001010011111100101001111111110101111100111001111110011111111001101111000000011111100111111 e9a23f3fb0d53f3fbcf43f3f3f3f3fb0d0a7e53febe73f3fcde03f3f
UTF-8 蘂띠눖意덄뙴儒삠걶嶺띲굥夷у넇諛멸턂沃쇱퓤 1110100010011000100000101110101110011101101000001110101110001000100101101110011010000100100011111110101110001101100001001110101110011001101101001110010110000100100100101110110010000010101000001110101010110001101101101110111110100110101010111110101110011101101100101110101010110101101001011110010110100100101101111101000110000011111010111000010010000111111010001010101110011011111010111010100110111000111011011000010010000010111001101011001010000011111011001000011110110001111011011001001110100100 e89882eb9da0eb8896e6848feb8d84eb99b4e58492ec82a0eab1b6efa6abeb9db2eab5a5e5a4b7d183eb8487e8ab9beba9b8ed8482e6b283ec87b1ed93a4
UHC 蘂띠눖意덄뙴儒삠걶嶺띲굥夷у넇諛멸턂沃쇱퓤 111001111101111010110110111011001000011110110000111010111111001010001000111001111000110010110111111010101110001110111011111000111000000110011100111001111010110110001101111000111000001010001011111011001010100010101100111001011000011010010111111010111011000010111000111010101011010110011110111010001010101010111100111011001011111110001101 e7deb6ec87b0ebf288e78cb7eae3bbe3819ce7ad8de3828beca8ace58697ebb0b8eab59ee8aabcecbf8d

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)