Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????®???????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111101011100011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3fae3f3f3f3f3f3f3f3f
SJIS-WIN 燿③?節??軟?????橈??軟??窈 1110000010100000100001110100001000111111100100001101111100111111001111111001001111101110001111110011111100111111001111110011111110011110111101000011111100111111100100111110111000111111001111111110001001110111 e0a087423f90df3f3f93ee3f3f3f3f3f9ef43f3f93ee3f3fe277
EUC-JP 燿??節??軟??璵®?橈??軟??窈 1110000010100010001111110011111111000000111000010011111100111111110001101111000000111111001111111000111111001100111001101000111110100010111011100011111111011100111101100011111100111111110001101111000000111111001111111110001111011000 e0a23f3fc0e13f3fc6f03f3f8fcce68fa2ee3fdcf63f3fc6f03f3fe3d8
UTF-8 燿③줁節루럷軟뚪뿆璵®뀒橈띸럷軟뚪뿆窈 1110011110000111101111111110001010010001101000101110110010100100100000011110011110101111100000001110101110100011101010001110101110011111101101111110100010111011100111111110101110011010101010101110101110111111100001101110011110010010101101011100001010101110111010111000000010010010111001101010100110001000111010111001110110111000111010111001111110110111111010001011101110011111111010111001101010101010111010111011111110000110111001111010101010001000 e787bfe291a2eca481e7af80eba3a8eb9fb7e8bb9feb9aaaebbf86e792b5c2aeeb8092e6a988eb9db8eb9fb7e8bb9feb9aaaebbf86e7aa88
UHC 燿③줁節루럷軟뚪뿆璵®뀒橈띸럷軟뚪뿆窈 1110100011111100101010001110100110100001100110001110111110111101101101111110011110001110100101101110011011100011100011001110100110010111100011011110011010100101101000101110011110000101100011001110100011111010100011011110011110001110100101101110011011100011100011001110100110010111100011011110100110100001 e8fca8e9a198efbdb7e78e96e6e38ce9978de6a5a2e7858ce8fa8de78e96e6e38ce9978de9a1

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
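
A conversion like the one in the table above can be sketched in Python as follows. This is a minimal sketch, not the code behind this page: the function names `encode_row` and `encode_table` are hypothetical, the codec names are Python's own (`cp932` for SJIS-WIN, `cp949` for UHC), and replacing unencodable characters with "?" (0x3f) is an assumption inferred from the 3f bytes in the table. Python's codecs may differ from the page's converter for some characters.

```python
# Map the charset labels used in the table to Python codec names (assumed mapping).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with the given codec and return (binary string, hex string).

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset for a short sample string.
    for label, (bits, hex_string) in encode_table("③®").items():
        print(label, bits, hex_string)
```

For example, `encode_row("®", "iso-8859-1")` yields the single byte ae, while `encode_row("③", "iso-8859-1")` yields 3f because ISO-8859-1 has no code point for that character.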