To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN セヒ辞宍邪セヒ辞宍邪[セヒ辞宍邪セヒ辞宍邪[^ 1011111011001011100011101010101110001110101100111000111011010111101111101100101110001110101010111000111010110011100011101101011101011011101111101100101110001110101010111000111010110011100011101101011110111110110010111000111010101011100011101011001110001110110101110101101101011110 becb8eab8eb38ed7becb8eab8eb38ed75bbecb8eab8eb38ed7becb8eab8eb38ed75b5e
EUC-JP セヒ辞宍邪セヒ辞宍邪[セヒ辞宍邪セヒ辞宍邪[^ 10001110101111101000111011001011101111001010110110111100101101011011110011011001100011101011111010001110110010111011110010101101101111001011010110111100110110010101101110001110101111101000111011001011101111001010110110111100101101011011110011011001100011101011111010001110110010111011110010101101101111001011010110111100110110010101101101011110 8ebe8ecbbcadbcb5bcd98ebe8ecbbcadbcb5bcd95b8ebe8ecbbcadbcb5bcd98ebe8ecbbcadbcb5bcd95b5e
UTF-8 セヒ辞宍邪セヒ辞宍邪[セヒ辞宍邪セヒ辞宍邪[^ 111011111011110110111110111011111011111010001011111010001011111010011110111001011010111010001101111010011000001010101010111011111011110110111110111011111011111010001011111010001011111010011110111001011010111010001101111010011000001010101010010110111110111110111101101111101110111110111110100010111110100010111110100111101110010110101110100011011110100110000010101010101110111110111101101111101110111110111110100010111110100010111110100111101110010110101110100011011110100110000010101010100101101101011110 efbdbeefbe8be8be9ee5ae8de982aaefbdbeefbe8be8be9ee5ae8de982aa5befbdbeefbe8be8be9ee5ae8de982aaefbdbeefbe8be8be9ee5ae8de982aa5b5e
UHC ????邪????邪[????邪????邪[^ 001111110011111100111111001111111101111011110111001111110011111100111111001111111101111011110111010110110011111100111111001111110011111111011110111101110011111100111111001111110011111111011110111101110101101101011110 3f3f3f3fdef73f3f3f3fdef75b3f3f3f3fdef73f3f3f3fdef75b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)