What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????喩??歪???渦??揖??喩?2瑤 001111110011111100111111001111110011111100111111100110100110011100111111001111111001100001100011001111110011111100111111100010010101000100111111001111111001011101001011001111110011111110011010011001110011111110000010010100011110101010100010 3f3f3f3f3f3f9a673f3f98633f3f3f89513f3f974b3f3f9a673f8251eaa2
EUC-JP ???佾??喩??歪???渦??揖??喩?2瑤 0011111100111111001111111000111110110000111110110011111100111111110100111100100000111111001111111100111111000100001111110011111100111111101100011011001000111111001111111100110110101100001111110011111111010011110010000011111110100011101100101111010010100100 3f3f3f8fb0fb3f3fd3c83f3fcfc43f3f3fb1b23f3fcdac3f3fd3c83fa3b2f4a4
UTF-8 麗몃쓷佾듿칰喩볥뼅歪뫮븐쓩渦기뫀揖묈칰喩붿2瑤 111011111010011010001000111010111010101010000011111011001001001110110111111001001011110110111110111010111001001110111111111011001011100110110000111001011001011010101001111010111011001110100101111010111011110010000101111001101010110110101010111010111010101110101110111010111011100010010000111011001001001110101001111001101011100010100110111010101011100010110000111010111010101110000000111001101000111110010110111010111010110010001000111011001011100110110000111001011001011010101001111010111011011010111111111011111011110010010010111001111001000110100100 efa688ebaa83ec93b7e4bdbeeb93bfecb9b0e596a9ebb3a5ebbc85e6adaaebabaeebb890ec93a9e6b8a6eab8b0ebab80e68f96ebac88ecb9b0e596a9ebb6bfefbc92e791a4
UHC 麗몃쓷佾듿칰喩볥뼅歪뫮븐쓩渦기뫀揖묈칰喩붿2瑤 11100110101100001011100011101011100111011001010011101100111010111000101011100101101011111000001111101010111001111001001111101011100101101000111111101000111000001001000111001110101110101110110010111110101100011110100010111110101100011110001010010001101001001110101111100111100100011110010110101111100000111110101011100111100101001110110010100011101100101110100011111101 e6b0b8eb9d94eceb8ae5af83eae793eb968fe8e091cebaecbeb1e8beb1e291a4ebe791e5af83eae794eca3b2e8fd

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
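
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the page's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption, and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table above.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become "?" (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    sample = "\u55a9"  # the character 喩, which appears in the table above
    for label, (bits, hex_string) in encode_table(sample).items():
        print(label, bits, hex_string)
```

Results for legacy charsets may differ slightly between implementations, because vendors extend Shift_JIS and EUC-JP in different ways.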