To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 伍?????獄?????伍?????獄??? 1000110011011110001111110011111100111111001111110011111110001101100101100011111100111111001111110011111100111111100011001101111000111111001111110011111100111111001111111000110110010110001111110011111100111111 8cde3f3f3f3f3f8d963f3f3f3f3f8cde3f3f3f3f3f8d963f3f3f
EUC-JP 伍?????獄?????伍?????獄??? 1011100011100000001111110011111100111111001111110011111110111001111101100011111100111111001111110011111100111111101110001110000000111111001111110011111100111111001111111011100111110110001111110011111100111111 b8e03f3f3f3f3fb9f63f3f3f3f3fb8e03f3f3f3f3fb9f63f3f3f
UTF-8 伍곸컮溜곕젙獄몄웾溜곕젻伍곸컮溜곕젙獄몄눊溜 111001001011110010001101111010101011001110111000111011001011101110101110111011111010011110001011111010101011001110010101111011001010000010011001111001111000110110000100111010111010101010000100111011001001101110111110111011111010011110001011111010101011001110010101111011001010000010111011111001001011110010001101111010101011001110111000111011001011101110101110111011111010011110001011111010101011001110010101111011001010000010011001111001111000110110000100111010111010101010000100111010111000100010001010111011111010011110001011 e4bc8deab3b8ecbbaeefa78beab395eca099e78d84ebaa84ec9bbeefa78beab395eca0bbe4bc8deab3b8ecbbaeefa78beab395eca099e78d84ebaa84eb888aefa78b
UHC 伍곸컮溜곕젙獄몄웾溜곕젻伍곸컮溜곕젙獄몄눊溜 1110011111101010100000011110110010110000100101001110101011111110101100001110101110100000100101011110100010101011101110001110110010011111100010011110101011111110101100001110101110100000101011101110011111101010100000011110110010110000100101001110101011111110101100001110101110100000100101011110100010101011101110001110110010000111101010001110101011111110 e7ea81ecb094eafeb0eba095e8abb8ec9f89eafeb0eba0aee7ea81ecb094eafeb0eba095e8abb8ec87a8eafe

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win, i.e. CP932) and on UNIX (EUC-JP).
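
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the names CHARSETS and encode_table. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the ISO-8859-1 row above shows.

```python
# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes b"?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_table("伍"):
        print(label, binary, hexadecimal)
```

For the single character 伍, the SJIS-WIN, EUC-JP and UTF-8 rows should begin with the same bytes as the corresponding rows of the table (8cde, b8e0 and e4bc8d).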