To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 癲?8宜??音??獄?????遺?よ?巡??^ 11100001100111110011111110000010010101111000101101011000001111110011111110001001101110010011111100111111100011011001011000111111001111110011111100111111001111111000100011100010001111111000001011100110001111111000111110000100001111110011111101011110 e19f3f82578b583f3f89b93f3f8d963f3f3f3f3f88e23f82e63f8f843f3f5e
EUC-JP 癲?8宜??音??獄?????遺?よ?巡??^ 11100010101000010011111110100011101110001011010110111001001111110011111110110010101110110011111100111111101110011111011000111111001111110011111100111111001111111011000011100100001111111010010011101000001111111011110111100100001111110011111101011110 e2a13fa3b8b5b93f3fb2bb3f3fb9f63f3f3f3f3fb0e43fa4e83fbde43f3f5e
UTF-8 癲쒕8宜룩눧音붾옒獄쏅끆柳닸뇻遺압よ뿥巡볦퍤^ 11100111100110011011001011101100100100101001010111101111101111001001100011100101101011101001110011101011101000111010100111101011100010001010011111101001100111111011001111101011101101101011111011101100100110001001001011100111100011011000010011101100100011111000010111101011100000011000011011101111101001111000100111101011100010111011100011101011100001111011101111101001100000011011101011101100100101011001010111100011100000101000100011101011101111111010010111100101101101111010000111101011101100111010011011101101100011011010010001011110 e799b2ec9295efbc98e5ae9ceba3a9eb88a7e99fb3ebb6beec9892e78d84ec8f85eb8186efa789eb8bb8eb87bbe981baec9595e38288ebbfa5e5b7a1ebb3a6ed8da45e
UHC 癲쒕8宜룩눧音붾옒獄쏅끆柳닸뇻遺압よ뿥巡볦퍤^ 111011111010011010011100111010111010001110111000111010111111000110110111111010001000011110111110111010111110010110010100111010111001111010011000111010001010101110011011111010111000010110111010111010101111011110110100111001101011010010100111111010111011011010111110110100001010101011101000100101111010010111100010110111101001001111101100101110111001101101011110 efa69ceba3b8ebf1b7e887beebe594eb9e98e8ab9beb85baeaf7b4e6b4a7ebb6bed0aae897a5e2de93ecbb9b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)