To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??????膺??娃??誼??柔ロ?沃??^ 00111111001111110011111100111111001111110011111111100100010111100011111100111111100010001010000100111111001111111000101101100010001111110011111110001111010111111000001110001101001111111001011110000000001111110011111101011110 3f3f3f3f3f3fe45e3f3f88a13f3f8b623f3f8f5f838d3f97803f3f5e
EUC-JP ???靷??膺??娃??誼??柔ロ?沃??^ 001111110011111100111111100011111110011110111101001111110011111111100111101111110011111100111111101100001010001100111111001111111011010111000011001111110011111110111101110000001010010111101101001111111100110111100000001111110011111101011110 3f3f3f8fe7bd3f3fe7bf3f3fb0a33f3fb5c33f3fbdc0a5ed3fcde03f3f5e
UTF-8 嶺뚮뿫靷숅걬膺우젘娃븐뼚誼당춯柔ロ닑沃쇱뎸^ 11101111101001101010101111101011100110101010111011101011101111111010101111101001100111011011011111101100100010001000010111101010101100011010110011101000100001101011101011101100100110101011000011101100101000001001100011100101101010001000001111101011101110001001000011101011101111001001101011101000101010101011110011101011100010111011100111101100101101101010111111100110100111111001010011100011100000111010110111101011100010111001000111100110101100101000001111101100100001111011000111101011100011101011100001011110 efa6abeb9aaeebbfabe99db7ec8885eab1ace886baec9ab0eca098e5a883ebb890ebbc9ae8aabceb8bb9ecb6afe69f94e383adeb8b91e6b283ec87b1eb8eb85e
UHC 嶺뚮뿫靷숅걬膺우젘娃븐뼚誼당춯柔ロ닑沃쇱뎸^ 11100111101011011000110011101011100101111010101111101100111001101001100111101001100000011001010111101011111011001011111111101100101000001001010011101000110111111011101011101100100101101010000011101011111111101011010011100111101011011000110011101010111101011010101111101101100010001001011011101000101010101011110011101100100010011000101101011110 e7ad8ceb97abece699e98195ebecbfeca094e8dfbaec96a0ebfeb4e7ad8ceaf5abed8896e8aabcec898b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)