To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣→?應??嚥▲?乳??魏???κ?釗 00111111001111110011111110001011100000111000000110101000001111111001110011100100001111110011111110011010100010111000000110100011001111111001001111111011001111110011111111101001101100000011111100111111001111111000001111001000001111111111101110111011 3f3f3f8b8381a83f9ce43f3f9a8b81a33f93fb3f3fe9b03f3f3f83c83ffbbb
EUC-JP ???泣→?應??嚥▲?乳??魏???κ?釗 0011111100111111001111111011010111100011101000101010101000111111110110001110011000111111001111111101001111101011101000101010010100111111110001101111110100111111001111111111001010110010001111110011111100111111101001101100101000111111100011111110001110100110 3f3f3fb5e3a2aa3fd8e63f3fd3eba2a53fc6fd3f3ff2b23f3f3fa6ca3f8fe3a6
UTF-8 捻꿔끇泣→쨫應뀀뎠嚥▲룂乳면쪛魏녿츎若κ랬釗 1110111110100110101001001110101010111111100101001110101110000001100001111110011010110011101000111110001010000110100100101110110010101000101010111110011010000111100010011110101110000000100000001110101110001110101000001110010110011010101001011110001010010110101100101110101110100011100000101110010010111001101100111110101110101001101101001110110010101010100110111110100110101101100011111110101110000101101111111110110010111000100011101110111110100101101101001100111010111010111010111001111010101100111010011000011110010111 efa6a4eabf94eb8187e6b3a3e28692eca8abe68789eb8080eb8ea0e59aa5e296b2eba382e4b9b3eba9b4ecaa9be9ad8feb85bfecb88eefa5b4cebaeb9eace98797
UHC 捻꿔끇泣→쨫應뀀뎠嚥▲룂乳면쪛魏녿츎若κ랬釗 1110011011110111101100101110001110000101101110111110101111101000101000011110011010100100100001011110101111101011101100101110101110110101101100011110011010111111101000011110001110001111100000111110101011100001101110001110100110100101100101001110101011100000100001101110101110101110100010011110010110101110101001011110101010110111101010001110000111110010 e6f7b2e385bbebe8a1e6a485ebebb2ebb5b1e6bfa1e38f83eae1b8e9a594eae086ebae89e5aea5eab7a8e1f2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)