Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 縡??絅?鎖??粟瓦??よ?悅?梯?趙陌?^ 1110001101110001001111110011111111100011010001000011111110001101101111010011111100111111100010001011111010001010101000100011111100111111100000101110011000111111111110101011110100111111100100101111001000111111111001101110001011101000100110010011111101011110 e3713f3fe3443f8dbd3f3f88be8aa23f3f82e63ffabd3f92f23fe6e2e8993f5e
EUC-JP 縡?饔絅?鎖??粟瓦??よ???梯?趙陌?^ 111001011101001000111111100011111110100011101111111001011010010100111111101110101011111100111111001111111011000011000000101101001010010000111111001111111010010011101000001111110011111100111111110001001111010000111111111011001110010011101111111110010011111101011110 e5d23f8fe8efe5a53fbabf3f3fb0c0b4a43f3fa4e83f3f3fc4f43fece4eff93f5e
UTF-8 縡렕饔絅뤈鎖쵌곧粟瓦狀춲よ싱悅멜梯렟趙陌욱^ 11100111101110001010000111101011101000001001010111101001101001011001010011100111101101011000010111101011101001001000100011101001100011101001011011101100101101011000110011101010101100111010011111100111101100101001111111100111100100111010011011101111101001111011101011101100101101101011001011100011100000101000100011101100100010111011000111100110100000101000010111101011101010011001110011100110101000101010111111101011101000001001111111101000101101101001100111101001100110011000110011101100100110101011000101011110 e7b8a1eba095e9a594e7b585eba488e98e96ecb58ceab3a7e7b29fe793a6efa7baecb6b2e38288ec8bb1e68285eba99ce6a2afeba09fe8b699e9998cec9ab15e
UHC 縡렕饔絅뤈鎖쵌곧粟瓦狀춲よ싱悅멜梯렟趙陌욱^ 11101110101011011000111010101010111010001011110111001100111001111000111110111000111000011111000010101100100011101011000011110000111000011101100011101000101111111110110111101110101011011000111010101010111010001011110111001100111001101110110110111000111000011111000010101100100011101011000011110000111000011101100011101000101111111110110101011110 eead8eaae8bdcce78fb8e1f0ac8eb0f0e1d8e8bfedeead8eaae8bdcce6edb8e1f0ac8eb0f0e1d8e8bfed5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
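
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function names `encode_row` and `encode_table` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches the 3f bytes in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {name: encode_row(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    # Print one line per charset: label, binary, hexadecimal.
    for name, (bits, hex_string) in encode_table("よ^").items():
        print(name, bits, hex_string)
```

For the input "よ^", the UTF-8, SJIS-WIN and EUC-JP results should contain the same byte sequences for "よ" that appear in the corresponding table rows (e38288, 82e6 and a4e8), each followed by 5e for "^".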