What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣?ぜ恂??齬??宥?????鶯??唯 0011111100111111001111111000101110000011001111111000001010111010100111001001011000111111001111111110101010010111001111110011111110010111010001110011111100111111001111110011111100111111111010011111001000111111001111111001011101000010 3f3f3f8b833f82ba9c963f3fea973f3f97473f3f3f3f3fe9f23f3f9742
EUC-JP ???泣?ぜ恂??齬??宥??洧??鶯??唯 00111111001111110011111110110101111000110011111110100100101111001101011111110110001111110011111111110011111101110011111100111111110011011010100000111111001111111000111111000111101101000011111100111111111100101111010000111111001111111100110110100011 3f3f3fb5e33fa4bcd7f63f3ff3f73f3fcda83f3f8fc7b43f3ff2f43f3fcda3
UTF-8 捻뀀씮泣쒑ぜ恂⑹뜗齬잕퀬宥꾤춯洧얜쎘鶯밸같唯 111011111010011010100100111010111000000010000000111011001001010010101110111001101011001110100011111011001001001010010001111000111000000110011100111001101000000110000010111000101001000110111001111010111001110010010111111010011011110110101100111011001001111010010101111011011000000010101100111001011010111010100101111010101011111010100100111011001011011010101111111001101011010010100111111011001001011010011100111011001000111010011000111010011011011010101111111010111011000010111000111010101011000010011001111001011001010010101111 efa6a4eb8080ec94aee6b3a3ec9291e3819ce68182e291b9eb9c97e9bdacec9e95ed80ace5aea5eabea4ecb6afe6b4a7ec969cec8e98e9b6afebb0b8eab099e594af
UHC 捻뀀씮泣쒑ぜ恂⑹뜗齬잕퀬宥꾤춯洧얜쎘鶯밸같唯 1110011011110111101100101110101110011101101111111110101111101000100111001110100010101010101111001110001011100001101010011110110010001101100110101110010111100001100111111110101010110011101000001110101011101001100001001110011110101101100011001110101011111011101111101110101110011011101111111110010110100011101110011110101110110000101100001110101011100110 e6f7b2eb9dbfebe89ce8aabce2e1a9ec8d9ae5e19feab3a0eae984e7ad8ceafbbeeb9bbfe5a3b9ebb0b0eae6

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
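
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) and the helper names `to_bit_strings` and `convert` are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is presumably where the 3f entries in the table come from.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Windows Shift_JIS variant
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Unified Hangul Code
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


# Example: one hiragana character from the table above.
for label, (binary, hexadecimal) in convert("ぜ").items():
    print(label, binary, hexadecimal)
```

For "ぜ" this gives e3819c in UTF-8, a4bc in EUC-JP, and 82ba in SJIS-WIN, which agrees with the byte sequences for that character in the table. ISO-8859-1 cannot represent it and yields 3f.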