To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 艤????吟???}艤????吟???{^ 11100100011111100011111100111111001111110011111110001011111000010011111100111111001111110111110111100100011111100011111100111111001111110011111110001011111000010011111100111111001111110111101101011110 e47e3f3f3f3f8be13f3f3f7de47e3f3f3f3f8be13f3f3f7b5e
EUC-JP 艤????吟???}艤????吟???{^ 11100111110111110011111100111111001111110011111110110110111000110011111100111111001111110111110111100111110111110011111100111111001111110011111110110110111000110011111100111111001111110111101101011110 e7df3f3f3f3fb6e33f3f3f7de7df3f3f3f3fb6e33f3f3f7b5e
UTF-8 艤쇱렮곡ㄽ吟ㅾ렠렋}艤쇱렮곡ㄽ吟ㅾ렠렋{^ 111010001000100110100100111011001000011110110001111010111010000010101110111010101011001110100001111000111000010010111101111001011001000010011111111000111000010110111110111010111010000010100000111010111010000010001011011111011110100010001001101001001110110010000111101100011110101110100000101011101110101010110011101000011110001110000100101111011110010110010000100111111110001110000101101111101110101110100000101000001110101110100000100010110111101101011110 e889a4ec87b1eba0aeeab3a1e384bde5909fe385beeba0a0eba08b7de889a4ec87b1eba0aeeab3a1e384bde5909fe385beeba0a0eba08b7b5e
UHC 艤쇱렮곡ㄽ吟ㅾ렠렋}艤쇱렮곡ㄽ吟ㅾ렠렋{^ 111010111111101010111100111011001000111010111011101100001110111010100100101011011110101111100001101001001110111010001110101100011000111010100010011111011110101111111010101111001110110010001110101110111011000011101110101001001010110111101011111000011010010011101110100011101011000110001110101000100111101101011110 ebfabcec8ebbb0eea4adebe1a4ee8eb18ea27debfabcec8ebbb0eea4adebe1a4ee8eb18ea27b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)