To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣→?應??嚥♂??藥??倚ф??? 0011111100111111001111111000101110000011100000011010100000111111100111001110010000111111001111111001101010001011100000011000100100111111001111111110010101011010001111110011111110011000110111111000010010000110001111110011111100111111 3f3f3f8b8381a83f9ce43f3f9a8b81893f3fe55a3f3f98df84863f3f3f
EUC-JP ???泣→?應??嚥♂??藥??倚ф??? 0011111100111111001111111011010111100011101000101010101000111111110110001110011000111111001111111101001111101011101000011110100100111111001111111110100110111011001111110011111111010000111000011010011111100110001111110011111100111111 3f3f3fb5e3a2aa3fd8e63f3fd3eba1e93f3fe9bb3f3fd0e1a7e63f3f3f
UTF-8 捻꿔끇泣→쨫應뀀뎠嚥♂쇰뤊藥띲꺁倚ф뤃鱗껧 1110111110100110101001001110101010111111100101001110101110000001100001111110011010110011101000111110001010000110100100101110110010101000101010111110011010000111100010011110101110000000100000001110101110001110101000001110010110011010101001011110001010011001100000101110110010000111101100001110101110100100100010101110100010010111101001011110101110011101101100101110101010111010100000011110010110000000100110101101000110000100111010111010010010000011111011111010011110110010111010101011101110100111 efa6a4eabf94eb8187e6b3a3e28692eca8abe68789eb8080eb8ea0e59aa5e29982ec87b0eba48ae897a5eb9db2eaba81e5809ad184eba483efa7b2eabba7
UHC 捻꿔끇泣→쨫應뀀뎠嚥♂쇰뤊藥띲꺁倚ф뤃鱗껧 111001101111011110110010111000111000010110111011111010111110100010100001111001101010010010000101111010111110101110110010111010111011010110110001111001101011111110100001110011101011110011101011100011111011101011100101101101111000110111100011100000111010101011101011111011111010110011100110100011111011010011101100111001111000010001000010 e6f7b2e385bbebe8a1e6a485ebebb2ebb5b1e6bfa1cebceb8fbae5b78de383aaebeface68fb4ece78442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)