To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????p}?????????p{^ 0011111100111111001111110011111100111111001111110011111100111111001111110111000001111101001111110011111100111111001111110011111100111111001111110011111100111111011100000111101101011110 3f3f3f3f3f3f3f3f3f707d3f3f3f3f3f3f3f3f3f707b5e
SJIS-WIN ??堤??????p}??堤??????p{^ 00111111001111111001001011100111001111110011111100111111001111110011111100111111011100000111110100111111001111111001001011100111001111110011111100111111001111110011111100111111011100000111101101011110 3f3f92e73f3f3f3f3f3f707d3f3f92e73f3f3f3f3f3f707b5e
EUC-JP ??堤??????p}??堤??????p{^ 00111111001111111100010011101001001111110011111100111111001111110011111100111111011100000111110100111111001111111100010011101001001111110011111100111111001111110011111100111111011100000111101101011110 3f3fc4e93f3f3f3f3f3f707d3f3fc4e93f3f3f3f3f3f707b5e
UTF-8 셈섞堤렯렞렯렍렯렠p}셈섞堤렯렞렯렍렯렠p{^ 1110110010000101100010001110110010000100100111101110010110100000101001001110101110100000101011111110101110100000100111101110101110100000101011111110101110100000100011011110101110100000101011111110101110100000101000000111000001111101111011001000010110001000111011001000010010011110111001011010000010100100111010111010000010101111111010111010000010011110111010111010000010101111111010111010000010001101111010111010000010101111111010111010000010100000011100000111101101011110 ec8588ec849ee5a0a4eba0afeba09eeba0afeba08deba0afeba0a0707dec8588ec849ee5a0a4eba0afeba09eeba0afeba08deba0afeba0a0707b5e
UHC 셈섞堤렯렞렯렍렯렠p}셈섞堤렯렞렯렍렯렠p{^ 1011110011000000101111001010111111110000101001111000111010111100100011101010111110001110101111001000111010100011100011101011110010001110101100010111000001111101101111001100000010111100101011111111000010100111100011101011110010001110101011111000111010111100100011101010001110001110101111001000111010110001011100000111101101011110 bcc0bcaff0a78ebc8eaf8ebc8ea38ebc8eb1707dbcc0bcaff0a78ebc8eaf8ebc8ea38ebc8eb1707b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)