What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣→?應??瑤??乳??魏??筌λ?竊 00111111001111110011111110001011100000111000000110101000001111111001110011100100001111110011111111101010101000100011111100111111100100111111101100111111001111111110100110110000001111110011111111100010101000111000001111001001001111111110001010000110 3f3f3f8b8381a83f9ce43f3feaa23f3f93fb3f3fe9b03f3fe2a383c93fe286
EUC-JP ???泣→?應??瑤??乳??魏??筌λ?竊 00111111001111110011111110110101111000111010001010101010001111111101100011100110001111110011111111110100101001000011111100111111110001101111110100111111001111111111001010110010001111110011111111100100101001011010011011001011001111111110001111100110 3f3f3fb5e3a2aa3fd8e63f3ff4a43f3fc6fd3f3ff2b23f3fe4a5a6cb3fe3e6
UTF-8 捻꿔끇泣→쨫應쇱쭍瑤뗭슦乳면쪛魏껎돪筌λ맕竊 1110111110100110101001001110101010111111100101001110101110000001100001111110011010110011101000111110001010000110100100101110110010101000101010111110011010000111100010011110110010000111101100011110110010101101100011011110011110010001101001001110101110010111101011011110110010001010101001101110010010111001101100111110101110101001101101001110110010101010100110111110100110101101100011111110101010111011100011101110101110001111101010101110011110101101100011001100111010111011111010111010011110010101111001111010101110001010 efa6a4eabf94eb8187e6b3a3e28692eca8abe68789ec87b1ecad8de791a4eb97adec8aa6e4b9b3eba9b4ecaa9be9ad8feabb8eeb8faae7ad8ccebbeba795e7ab8a
UHC 捻꿔끇泣→쨫應쇱쭍瑤뗭슦乳면쪛魏껎돪筌λ맕竊 1110011011110111101100101110001110000101101110111110101111101000101000011110011010100100100001011110101111101011101111001110110010100111100001101110100011111101100010111110110010011010101100001110101011100001101110001110100110100101100101001110101011100000100000111110110110001001101011011110111110100111101001011110101110010000101001111110111110111100 e6f7b2e385bbebe8a1e6a485ebebbceca786e8fd8bec9ab0eae1b8e9a594eae083ed89adefa7a5eb90a7efbc

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
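
The conversion shown in the table can be approximated offline. The following is only a minimal sketch in Python, not the code behind this page. It assumes that Python's "cp932" codec corresponds to SJIS-WIN (as the note above states), that "cp949" corresponds to UHC, and that characters a charset cannot represent are replaced with "?" (byte 3f), which is what the ISO-8859-1 row suggests. The names CHARSETS and encode_table are hypothetical.

```python
# Map the table's charset labels to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters
        # (assumed to match the tool's behaviour).
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexa in encode_table("泣→"):
        print(label, binary, hexa)
```

Running it on a short string such as "泣→" prints one row per charset, in the same order as the table: charset, bit string in binary, bit string in hexadecimal.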