Which bit string is a character (or short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????U 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101010101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f55
SJIS-WIN ???溢????溢????溢????溢?U 00111111001111110011111110001000111011000011111100111111001111110011111110001000111011000011111100111111001111110011111110001000111011000011111100111111001111110011111110001000111011000011111101010101 3f3f3f88ec3f3f3f3f88ec3f3f3f3f88ec3f3f3f3f88ec3f55
EUC-JP ???溢????溢????溢????溢?U 00111111001111110011111110110000111011100011111100111111001111110011111110110000111011100011111100111111001111110011111110110000111011100011111100111111001111110011111110110000111011100011111101010101 3f3f3fb0ee3f3f3f3fb0ee3f3f3f3fb0ee3f3f3f3fb0ee3f55
UTF-8 列룸벝溢큌列룸벝溢큈列룸벝溢큂列룸벝溢퀹U 11101111101001101001110011101011101000111011100011101011101100101001110111100110101110101010001011101101100000011000110011101111101001101001110011101011101000111011100011101011101100101001110111100110101110101010001011101101100000011000100011101111101001101001110011101011101000111011100011101011101100101001110111100110101110101010001011101101100000011000001011101111101001101001110011101011101000111011100011101011101100101001110111100110101110101010001011101101100000001011100101010101 efa69ceba3b8ebb29de6baa2ed818cefa69ceba3b8ebb29de6baa2ed8188efa69ceba3b8ebb29de6baa2ed8182efa69ceba3b8ebb29de6baa2ed80b955
UHC 列룸벝溢큌列룸벝溢큈列룸벝溢큂列룸벝溢퀹U 1110011011101010101101111110101110010011101110001110110011101110101101000101011111100110111010101011011111101011100100111011100011101100111011101011010001010100111001101110101010110111111010111001001110111000111011001110111010110100010100011110011011101010101101111110101110010011101110001110110011101110101101000100100101010101 e6eab7eb93b8eceeb457e6eab7eb93b8eceeb454e6eab7eb93b8eceeb451e6eab7eb93b8eceeb44955

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
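
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name to_bit_strings and the mapping from the table's charset labels to Python codec names (latin-1, cp932, euc_jp, utf-8, cp949) are assumptions made for illustration. It also assumes that characters a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the ISO-8859-1 row above.

```python
# Map the charset labels used in the table to Python codec names.
# This mapping is an assumption; the page does not state which codecs it uses.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("溢U", codec)
        print(label, binary, hexadecimal)
```

For the input 溢U, this yields e6baa255 in UTF-8, 88ec55 in SJIS-WIN, b0ee55 in EUC-JP, and 3f55 in ISO-8859-1, matching the byte values for 溢 and U in the rows above.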