To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???鍮?ぜ濡??筌??誼??矣??沃?? 00111111001111110011111111101000010010100011111110000010101110101001010001000111001111110011111111100010101000110011111100111111100010110110001000111111001111111110000111100001001111110011111110010111100000000011111100111111 3f3f3fe84a3f82ba94473f3fe2a33f3f8b623f3fe1e13f3f97803f3f
EUC-JP ???鍮?ぜ濡??筌??誼??矣??沃?? 00111111001111110011111111101111101010110011111110100100101111001100011110101000001111110011111111100100101001010011111100111111101101011100001100111111001111111110001011100011001111110011111111001101111000000011111100111111 3f3f3fefab3fa4bcc7a83f3fe4a53f3fb5c33f3fe2e33f3fcde03f3f
UTF-8 捻꿸낯鍮뽬ぜ濡⑸븸筌먲퐣誼욄쾬矣뚣뀖沃쇱칳 111011111010011010100100111010101011111110111000111010111000001010101111111010011000110110101110111010111011110110101100111000111000000110011100111001101011111110100001111000101001000110111000111010111011100010111000111001111010110110001100111010111010100010110010111011011001000010100011111010001010101010111100111011001001101010000100111011001011111010101100111001111001111110100011111010111001101010100011111010111000000010010110111001101011001010000011111011001000011110110001111011001011100110110011 efa6a4eabfb8eb82afe98daeebbdace3819ce6bfa1e291b8ebb8b8e7ad8ceba8b2ed90a3e8aabcec9a84ecbeace79fa3eb9aa3eb8096e6b283ec87b1ecb9b3
UHC 捻꿸낯鍮뽬ぜ濡⑸븸筌먲퐣誼욄쾬矣뚣뀖沃쇱칳 111001101111011110110010111010101011001110111000111010111011100110010110111010001010101010111100111010111010000110101001111010111001010110100001111011111010011110010000111011111011110110001100111010111111111010011110111001101011001010000011111010111111100010001100111000111000010110001111111010001010101010111100111011001010111110000110 e6f7b2eab3b8ebb996e8aabceba1a9eb95a1efa790efbd8cebfe9ee6b283ebf88ce3858fe8aabcecaf86

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)