What bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 莊???√???蜚?咀蒡莊???√???蜚?咀蒡^ 1110010010110101001111110011111100111111100000011110001100111111001111110011111111100101100101000011111110011001111100001110010011101110111001001011010100111111001111110011111110000001111000110011111100111111001111111110010110010100001111111001100111110000111001001110111001011110 e4b53f3f3f81e33f3f3fe5943f99f0e4eee4b53f3f3f81e33f3f3fe5943f99f0e4ee5e
EUC-JP 莊???√???蜚?咀蒡莊???√???蜚?咀蒡^ 1110100010110111001111110011111100111111101000101110010100111111001111110011111111101001111101000011111111010010111100101110100011110000111010001011011100111111001111110011111110100010111001010011111100111111001111111110100111110100001111111101001011110010111010001111000001011110 e8b73f3f3fa2e53f3f3fe9f43fd2f2e8f0e8b73f3f3fa2e53f3f3fe9f43fd2f2e8f05e
UTF-8 莊렱뤰쨴√붤龍핊蜚렊咀蒡莊렱뤰쨴√붤龍핊蜚렊咀蒡^ 11101000100011101000101011101011101000001011000111101011101001001011000011101100101010001011010011100010100010001001101011101011101101101010010011101111101001111000010011101101100101011000101011101000100111001001101011101011101000001000101011100101100100101000000011101000100100101010000111101000100011101000101011101011101000001011000111101011101001001011000011101100101010001011010011100010100010001001101011101011101101101010010011101111101001111000010011101101100101011000101011101000100111001001101011101011101000001000101011100101100100101000000011101000100100101010000101011110 e88e8aeba0b1eba4b0eca8b4e2889aebb6a4efa784ed958ae89c9aeba08ae59280e892a1e88e8aeba0b1eba4b0eca8b4e2889aebb6a4efa784ed958ae89c9aeba08ae59280e892a15e
UHC 莊렱뤰쨴√붤龍핊蜚렊咀蒡莊렱뤰쨴√붤龍핊蜚렊咀蒡^ 11101101111101101000111010111110100011111101111010100100100011101010000111101110101110101101110011101001110011001100000010001111110111101010010010001110101000011110111010111010110110111011110011101101111101101000111010111110100011111101111010100100100011101010000111101110101110101101110011101001110011001100000010001111110111101010010010001110101000011110111010111010110110111011110001011110 edf68ebe8fdea48ea1eebadce9ccc08fdea48ea1eebadbbcedf68ebe8fdea48ea1eebadce9ccc08fdea48ea1eebadbbc5e

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
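
A conversion like the one in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the page's actual implementation: the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?`, which would explain the runs of `3f` bytes in the rows above; individual bytes may still differ from the page's output where the codec tables differ.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexa) in convert("あ^").items():
        print(name, bits, hexa)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.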