To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???宥??揄k?孃 0011111100111111001111111001011101000111001111110011111110011101100010011000001010001011001111111001101101101111 3f3f3f97473f3f9d89828b3f9b6f
EUC-JP ???宥??揄k?孃 0011111100111111001111111100110110101000001111110011111111011001111010011010001111101011001111111101010111010000 3f3f3fcda83f3fd9e9a3eb3fd5d0
UTF-8 列룸쑐宥꿰뙴揄k늹孃 111011111010011010011100111010111010001110111000111011001001000110010000111001011010111010100101111010101011111110110000111010111001100110110100111001101000111110000100111011111011110110001011111010111000101010111001111001011010110110000011 efa69ceba3b8ec9190e5aea5eabfb0eb99b4e68f84efbd8beb8ab9e5ad83
UHC 列룸쑐宥꿰뙴揄k늹孃 1110011011101010101101111110101110011100101011111110101011101001101100101110011110001100101101111110101011110001101000111110101110001000100000101110010110111110 e6eab7eb9cafeae9b2e78cb7eaf1a3eb8882e5be

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)