To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????喩??????ε?音?????逾 0011111100111111001111110011111100111111001111111001101001100111001111110011111100111111001111110011111100111111100000111100001100111111100010011011100100111111001111110011111100111111001111111110011110100101 3f3f3f3f3f3f9a673f3f3f3f3f3f83c33f89b93f3f3f3f3fe7a5
EUC-JP ???佾??喩??????ε?音?????逾 00111111001111110011111110001111101100001111101100111111001111111101001111001000001111110011111100111111001111110011111100111111101001101100010100111111101100101011101100111111001111110011111100111111001111111110111010100111 3f3f3f8fb0fb3f3fd3c83f3f3f3f3f3fa6c53fb2bb3f3f3f3f3feea7
UTF-8 麗몃쓷佾듿칰喩먮윹廬믪쉶杻ε첎音뷀뫛廬믩쑐逾 1110111110100110100010001110101110101010100000111110110010010011101101111110010010111101101111101110101110010011101111111110110010111001101100001110010110010110101010011110101110101000101011101110110010011100101110011110111110100110100000101110101110101111101010101110110010001001101101101110111110100111100010001100111010110101111011001011001010001110111010011001111110110011111010111011011110000000111010111010101110011011111011111010011010000010111010111010111110101001111011001001000110010000111010011000000010111110 efa688ebaa83ec93b7e4bdbeeb93bfecb9b0e596a9eba8aeec9cb9efa682ebafaaec89b6efa788ceb5ecb28ee99fb3ebb780ebab9befa682ebafa9ec9190e980be
UHC 麗몃쓷佾듿칰喩먮윹廬믪쉶杻ε첎音뷀뫛廬믩쑐逾 1110011010110000101110001110101110011101100101001110110011101011100010101110010110101111100000111110101011100111100100001110101110011111101100111110010111111110100100101110110010011010100011001110101011110100101001011110010110101010100110111110101111100101100101001110110110010001101110111110010111111110100100101110101110011100101011111110101110110101 e6b0b8eb9d94eceb8ae5af83eae790eb9fb3e5fe92ec9a8ceaf4a5e5aa9bebe594ed91bbe5fe92eb9cafebb5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)